Jiangmiao Pang
15
Papers
359
Total Citations
Papers (15)
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
ICCV 2025
127
citations
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
100
citations
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
47
citations
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
34
citations
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
CVPR 2025arXiv
15
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
NeurIPS 2025
6
citations
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
CVPR 2025
5
citations
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
ICCV 2025
4
citations
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ICCV 2025arXiv
4
citations
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
CVPR 2025
3
citations
LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents
NeurIPS 2025arXiv
2
citations
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
ICCV 2025
2
citations
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
ICCV 2025
0
citations
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
0
citations