Xiao Chen
8 Papers
92 Total Citations
Papers (8)
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025, arXiv
44
citations
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
34
citations
From One to More: Contextual Part Latents for 3D Generation
ICCV 2025, arXiv
8
citations
FIRM: Flexible Interactive Reflection ReMoval
AAAI 2025
3
citations
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
CVPR 2025, arXiv
2
citations
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy
AAAI 2025
1
citation
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
0
citations
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
ICCV 2025
0
citations