Yilun Du
48
Papers
748
Total Citations
1
Affiliations
Affiliations
MIT
Papers (48)
Learning Interactive Real-World Simulators
ICLR 2024
334
citations
Video Language Planning
ICLR 2024
144
citations
Large-scale Reinforcement Learning for Diffusion Models
ECCV 2024
69
citations
History-Guided Video Diffusion
ICML 2025
66
citations
Looped Transformers for Length Generalization
ICLR 2025
33
citations
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
ICLR 2025
33
citations
Generative Trajectory Stitching through Diffusion Composition
NeurIPS 2025
20
citations
Learning 3D Persistent Embodied World Models
NeurIPS 2025
17
citations
Compositional Generative Inverse Design
ICLR 2024
15
citations
Solving New Tasks by Adapting Internet Video Knowledge
ICLR 2025
12
citations
Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs
NeurIPS 2025
4
citations
Compositional Scene Understanding through Inverse Generative Modeling
ICML 2025
1
citations
3D Concept Learning and Reasoning From Multi-View Images
CVPR 2023arXiv
0
citations
Neural Radiance Flow for 4D View Synthesis and Video Processing
ICCV 2021arXiv
0
citations
3D Shape Generation and Completion Through Point-Voxel Diffusion
ICCV 2021arXiv
0
citations
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
ICCV 2021arXiv
0
citations
Curious Representation Learning for Embodied Intelligence
ICCV 2021arXiv
0
citations
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
ICCV 2023arXiv
0
citations
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
CVPR 2025
0
citations
Compositional Visual Generation with Composable Diffusion Models
ECCV 2022
0
citations
Learning 4D Embodied World Models
ICCV 2025
0
citations
Position: Compositional Generative Modeling: A Single Model is Not All You Need
ICML 2024
0
citations
Improving Factuality and Reasoning in Language Models through Multiagent Debate
ICML 2024
0
citations
Learning Iterative Reasoning through Energy Diffusion
ICML 2024
0
citations
Potential Based Diffusion Motion Planning
ICML 2024
0
citations
RoboDreamer: Learning Compositional World Models for Robot Imagination
ICML 2024
0
citations
3D-VLA: A 3D Vision-Language-Action Generative World Model
ICML 2024
0
citations
Compositional Image Decomposition with Diffusion Models
ICML 2024
0
citations
Position: Video as the New Language for Real-World Decision Making
ICML 2024
0
citations
Kubric: A Scalable Dataset Generator
CVPR 2022arXiv
0
citations
Learning To Render Novel Views From Wide-Baseline Stereo Pairs
CVPR 2023arXiv
0
citations
Learning to Exploit Stability for 3D Scene Parsing
NeurIPS 2018
0
citations
Implicit Generation and Modeling with Energy Based Models
NeurIPS 2019
0
citations
Compositional Visual Generation with Energy Based Models
NeurIPS 2020
0
citations
Learning Signal-Agnostic Manifolds of Neural Fields
NeurIPS 2021
0
citations
Unsupervised Learning of Compositional Energy Concepts
NeurIPS 2021
0
citations
Learning to Compose Visual Relations
NeurIPS 2021
0
citations
Learning Neural Acoustic Fields
NeurIPS 2022
0
citations
3D Concept Grounding on Neural Fields
NeurIPS 2022
0
citations
Pre-Trained Language Models for Interactive Decision-Making
NeurIPS 2022
0
citations
FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow
NeurIPS 2023
0
citations
Learning Universal Policies via Text-Guided Video Generation
NeurIPS 2023
0
citations
3D-LLM: Injecting the 3D World into Large Language Models
NeurIPS 2023
0
citations
Compositional Foundation Models for Hierarchical Planning
NeurIPS 2023
0
citations
Adaptive Online Replanning with Diffusion Models
NeurIPS 2023
0
citations
DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models
NeurIPS 2023
0
citations
Secure Out-of-Distribution Task Generalization with Energy-Based Models
NeurIPS 2023
0
citations
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
ICML 2019
0
citations