Xinlong Wang
7
Papers
654
Total Citations
Papers (7)
Generative Multimodal Models are In-Context Learners
CVPR 2024
422
citations
Uni3D: Exploring Unified 3D Representation at Scale
ICLR 2024
165
citations
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
CVPR 2025
49
citations
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
ICCV 2025
18
citations
CapsFusion: Rethinking Image-Text Data at Scale
CVPR 2024
0
citations
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
CVPR 2024
0
citations
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
ICML 2024
0
citations