Wenwei Zhang
11
Papers
522
Total Citations
Papers (11)
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
ICCV 2025
127
citations
OMG-Seg: Is One Model Good Enough For All Segmentation?
CVPR 2024
106
citations
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
ICLR 2024
104
citations
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
100
citations
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
ICCV 2025
33
citations
CLIM: Contrastive Language-Image Mosaic for Region Representation
AAAI 2024
24
citations
F-LMM: Grounding Frozen Large Multimodal Models
CVPR 2025
21
citations
Rethinking Verification for LLM Code Generation: From Generation to Testing
NeurIPS 2025
7
citations
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
0
citations
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data and Metric Perspectives
ICCV 2025
0
citations
Can AI Assistants Know What They Don't Know?
ICML 2024
0
citations