Zehuan Yuan
6 papers, 152 total citations
Papers (6)
General Object Foundation Model for Images and Videos at Scale
CVPR 2024
79
citations
Goku: Flow Based Video Generative Foundation Models
CVPR 2025 (arXiv)
53
citations
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI 2024 (arXiv)
20
citations
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
CVPR 2025
0
citations
Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
CVPR 2025
0
citations
Generative Region-Language Pretraining for Open-Ended Object Detection
CVPR 2024
0
citations