Yongming Rao
7
Papers
496
Total Citations
Papers (7)
Generative Multimodal Models are In-Context Learners
CVPR 2024
422
citations
Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior
CVPR 2024
37
citations
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
ECCV 2024
25
citations
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
CVPR 2025arXiv
9
citations
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
ICCV 2025
3
citations
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
CVPR 2025
0
citations
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
CVPR 2024
0
citations