Quan Sun
4
papers
441
total citations
papers (4)
Generative Multimodal Models are In-Context Learners
CVPR 2024arXiv
422
citations
Taming Teacher Forcing for Masked Autoregressive Video Generation
CVPR 2025arXiv
19
citations
CapsFusion: Rethinking Image-Text Data at Scale
CVPR 2024arXiv
0
citations
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
CVPR 2023arXiv
0
citations