Baoxiong Jia
22
Papers
127
Total Citations
Papers (22)
Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
CVPR 2024
78
citations
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
ICCV 2025arXiv
24
citations
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
CVPR 2025
17
citations
SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent
NeurIPS 2025
8
citations
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
ICCV 2025
0
citations
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
CVPR 2024
0
citations
An Embodied Generalist Agent in 3D World
ICML 2024
0
citations
RAVEN: A Dataset for Relational and Analogical Visual REasoNing
CVPR 2019
0
citations
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
CVPR 2021arXiv
0
citations
Diffusion-Based Generation, Optimization, and Planning in 3D Scenes
CVPR 2023arXiv
0
citations
X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
ICCV 2023
0
citations
ARNOLD: A Benchmark for Language-Grounded Task Learning with Continuous States in Realistic 3D Scenes
ICCV 2023arXiv
0
citations
LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities
ECCV 2020
0
citations
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
ECCV 2022
0
citations
ACRE: Abstract Causal REasoning Beyond Covariation
CVPR 2021arXiv
0
citations
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
CVPR 2025
0
citations
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
CVPR 2025
0
citations
METASCENES: Towards Automated Replica Creation for Real-world 3D Scans
CVPR 2025
0
citations
Learning Perceptual Inference by Contrasting
NeurIPS 2019
0
citations
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
NeurIPS 2022
0
citations
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
NeurIPS 2023
0
citations
Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction
ICML 2018
0
citations