Chuang Gan

19
Papers
197
Total Citations

Papers (19)

MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

CVPR 2024
51
citations

Learning 4D Embodied World Models

ICCV 2025
43
citations

COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

ICLR 2025
33
citations

LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences

CVPR 2025
25
citations

Learning 3D Persistent Embodied World Models

NeurIPS 2025
17
citations

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

ICLR 2025
13
citations

UniMuMo: Unified Text, Music, and Motion Generation

AAAI 2025
12
citations

RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text

ICCV 2025
3
citations

RoboDreamer: Learning Compositional World Models for Robot Imagination

ICML 2024
0
citations

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos

ICML 2024
0
citations

3D-VLA: A 3D Vision-Language-Action Generative World Model

ICML 2024
0
citations

RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation

ICML 2024
0
citations

3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning

CVPR 2025
0
citations

VCA: Video Curious Agent for Long Video Understanding

ICCV 2025
0
citations

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

CVPR 2024
0
citations

Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance

CVPR 2024
0
citations

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation

CVPR 2024
0
citations

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

ICML 2024
0
citations

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data

ICML 2024
0
citations