Quan Sun

4

papers

441

total citations

papers (4)

Generative Multimodal Models are In-Context Learners

Taming Teacher Forcing for Masked Autoregressive Video Generation

CapsFusion: Rethinking Image-Text Data at Scale

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale