Yinfei Yang
6
Papers
1,427
Total Citations
Papers (6)
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
ICLR 2024
1,366
citations
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
ICLR 2025
41
citations
STIV: Scalable Text and Image Conditioned Video Generation
ICCV 2025
20
citations
Multimodal Autoregressive Pre-training of Large Vision Encoders
CVPR 2025
0
citations
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
ICCV 2025
0
citations
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
ICCV 2025
0
citations