Xiao Chen
12
Papers
90
Total Citations
Papers (12)
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44
citations
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
34
citations
From One to More: Contextual Part Latents for 3D Generation
ICCV 2025 (arXiv)
8
citations
FIRM: Flexible Interactive Reflection ReMoval
AAAI 2025
3
citations
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy
AAAI 2025
1
citations
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
CVPR 2025
0
citations
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
ICCV 2025
0
citations
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
0
citations
FOAL: Fast Online Adaptive Learning for Cardiac Motion Estimation
CVPR 2020 (arXiv)
0
citations
Robust Landmark-Based Stent Tracking in X-Ray Fluoroscopy
ECCV 2022
0
citations
DynaBERT: Dynamic BERT with Adaptive Width and Depth
NeurIPS 2020
0
citations
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
NeurIPS 2022
0
citations