Yunze Man
6
Papers
130
Total Citations
Papers (6)
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
CVPR 2025, arXiv
61
citations
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
ICLR 2024
48
citations
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
CVPR 2025, arXiv
19
citations
AgMMU: A Comprehensive Agricultural Multimodal Understanding Benchmark
NeurIPS 2025
2
citations
Floating No More: Object-Ground Reconstruction from a Single Image
CVPR 2025
0
citations
Situational Awareness Matters in 3D Vision Language Reasoning
CVPR 2024
0
citations