Jiangmiao Pang
30
Papers
342
Total Citations
Papers (30)
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
ICCV 2025
127
citations
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
100
citations
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
47
citations
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
34
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
NeurIPS 2025
6
citations
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
CVPR 2025
5
citations
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ICCV 2025
4
citations
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
ICCV 2025
4
citations
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
CVPR 2025
3
citations
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
ICCV 2025
2
citations
Dense Distinct Query for End-to-End Object Detection
CVPR 2023
0
citations
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
ICCV 2023
0
citations
Side-Aware Boundary Localization for More Precise Object Detection
ECCV 2020
0
citations
Monocular 3D Object Detection with Depth from Motion
ECCV 2022
0
citations
Dense Siamese Network for Dense Unsupervised Learning
ECCV 2022
0
citations
Libra R-CNN: Towards Balanced Learning for Object Detection
CVPR 2019
0
citations
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
CVPR 2025
0
citations
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes
ICCV 2025
0
citations
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
0
citations
Adapting Object Detectors via Selective Cross-Domain Alignment
CVPR 2019
0
citations
Hybrid Task Cascade for Instance Segmentation
CVPR 2019
0
citations
Quasi-Dense Similarity Learning for Multiple Object Tracking
CVPR 2021
0
citations
Seesaw Loss for Long-Tailed Instance Segmentation
CVPR 2021
0
citations
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
CVPR 2022
0
citations
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
CVPR 2023
0
citations
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
CVPR 2023
0
citations
FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction
NeurIPS 2018
0
citations
K-Net: Towards Unified Image Segmentation
NeurIPS 2021
0
citations
OV-PARTS: Towards Open-Vocabulary Part Segmentation
NeurIPS 2023
0
citations