Chuang Gan
19 Papers
197 Total Citations
Papers (19)
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
CVPR 2024
51
citations
Learning 4D Embodied World Models
ICCV 2025 (arXiv)
43
citations
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
ICLR 2025
33
citations
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
CVPR 2025
25
citations
Learning 3D Persistent Embodied World Models
NeurIPS 2025
17
citations
Scaling Autonomous Agents via Automatic Reward Modeling And Planning
ICLR 2025
13
citations
UniMuMo: Unified Text, Music, and Motion Generation
AAAI 2025
12
citations
RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text
ICCV 2025 (arXiv)
3
citations
RoboDreamer: Learning Compositional World Models for Robot Imagination
ICML 2024
0
citations
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
ICML 2024
0
citations
3D-VLA: A 3D Vision-Language-Action Generative World Model
ICML 2024
0
citations
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
ICML 2024
0
citations
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
CVPR 2025
0
citations
VCA: Video Curious Agent for Long Video Understanding
ICCV 2025
0
citations
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
CVPR 2024
0
citations
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
CVPR 2024
0
citations
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation
CVPR 2024
0
citations
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
ICML 2024
0
citations
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
ICML 2024
0
citations