🧬Vision Recognition

3D Object Detection

Detecting objects in 3D space

315 papers (showing top 100) · 5,081 total citations
Mar '24 to Feb '26: 266 papers
Also includes: 3d object detection, 3d detection, lidar detection, point cloud detection

Top Papers

#1

DETRs Beat YOLOs on Real-time Object Detection

Yian Zhao, Wenyu Lv, Shangliang Xu et al.

CVPR 2024 · arXiv:2304.08069
2,424 citations
#2

SplaTAM: Splat Track & Map 3D Gaussians for Dense RGB-D SLAM

Nikhil Keetha, Jay Karhade, Krishna Murthy Jatavallabhula et al.

CVPR 2024 · arXiv:2312.02126
477 citations
#3

FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Bowen Wen, Wei Yang, Jan Kautz et al.

CVPR 2024 · arXiv:2312.08344
412 citations
#4

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Lu Ling, Yichen Sheng, Zhi Tu et al.

CVPR 2024 · arXiv:2312.16256
266 citations
#5

Probing the 3D Awareness of Visual Foundation Models

Mohamed El Banani, Amit Raj, Kevis-Kokitsi Maninis et al.

CVPR 2024 · arXiv:2404.08636
130 citations
#6

IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection

Junbo Yin, Wenguan Wang, Runnan Chen et al.

CVPR 2024 · arXiv:2403.15241
81 citations
#7

MonoCD: Monocular 3D Object Detection with Complementary Depths

Longfei Yan, Pei Yan, Shengzhou Xiong et al.

CVPR 2024 · arXiv:2404.03181
64 citations
#8

Towards Scalable 3D Anomaly Detection and Localization: A Benchmark via 3D Anomaly Synthesis and A Self-Supervised Learning Network

Wenqiao Li, Xiaohao Xu, Yao Gu et al.

CVPR 2024 · arXiv:2311.14897
50 citations
#9

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

Lewei Yao, Renjie Pi, Jianhua Han et al.

CVPR 2024 · arXiv:2404.09216
45 citations
#10

UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes

David Rozenberszki, Or Litany, Angela Dai

CVPR 2024 · arXiv:2303.14541
40 citations
#11

EgoLifter: Open-world 3D Segmentation for Egocentric Perception

Qiao Gu, Zhaoyang Lv, Duncan Frost et al.

ECCV 2024 · arXiv:2403.18118
3d gaussian representation, egocentric perception, open-world segmentation, segment anything model (+3 more)
40 citations
#12

Scene Adaptive Sparse Transformer for Event-based Object Detection

Yansong Peng, Hebei Li, Yueyi Zhang et al.

CVPR 2024 · arXiv:2404.01882
40 citations
#13

R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection

Zheyuan Zhou, Le Wang, Naiyu Fang et al.

ECCV 2024 · arXiv:2407.10862
36 citations
#14

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation

Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult et al.

ICLR 2024 · arXiv:2306.00977
34 citations
#15

ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association

Shuxiao Ding, Lukas Schneider, Marius Cordts et al.

CVPR 2024 · arXiv:2405.08909
34 citations
#16

SAM-guided Graph Cut for 3D Instance Segmentation

Haoyu Guo, He Zhu, Sida Peng et al.

ECCV 2024 · arXiv:2312.08372
3d instance segmentation, multi-view image information, graph cut problem, superpoint graph (+4 more)
32 citations
#17

Towards Generalizable Multi-Object Tracking

Zheng Qin, Le Wang, Sanping Zhou et al.

CVPR 2024 · arXiv:2406.00429
32 citations
#18

CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli et al.

CVPR 2024 · arXiv:2403.19278
31 citations
#19

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Zehan Wang, Ziang Zhang, Tianyu Pang et al.

ICML 2025 · arXiv:2412.18605
28 citations
#20

Multi-Object Tracking in the Dark

Xinzhe Wang, Kang Ma, Qiankun Liu et al.

CVPR 2024 · arXiv:2405.06600
25 citations
#21

Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir et al.

CVPR 2024 · arXiv:2405.14497
24 citations
#22

MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models

Yasiru Ranasinghe, Deepti Hegde, Vishal M. Patel

CVPR 2024
24 citations
#23

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

Haimei Zhao, Qiming Zhang, Shanshan Zhao et al.

AAAI 2024 · arXiv:2303.16818
3d object detection, multi-view camera, lidar-camera fusion, bird's-eye-view space (+4 more)
24 citations
#24

LISO: Lidar-only Self-Supervised 3D Object Detection

Stefan Baur, Frank Moosmann, Andreas Geiger

ECCV 2024
24 citations
#25

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Hu Zhang, Jianhua Xu, Tao Tang et al.

ECCV 2024
24 citations
#26

Towards Robust 3D Object Detection with LiDAR and 4D Radar Fusion in Various Weather Conditions

Yujeong Chae, Hyeonseong Kim, Kuk-Jin Yoon

CVPR 2024
23 citations
#27

GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection

Xiaotian Li, Baojie Fan, Jiandong Tian et al.

CVPR 2024 · arXiv:2411.00340
22 citations
#28

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

Mohamed El Amine Boudjoghra, Angela Dai, Jean Lahoud et al.

ICLR 2025
21 citations
#29

SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE

Yongwei Chen, Yushi Lan, Shangchen Zhou et al.

CVPR 2025 · arXiv:2411.16856
3d object generation, autoregressive models, vector-quantized variational autoencoder, multi-scale representation (+3 more)
20 citations
#30

Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties

Wenqiao Li, Bozhong Zheng, Xiaohao Xu et al.

CVPR 2025 · arXiv:2412.14592
20 citations
#31

SEED: A Simple and Effective 3D DETR in Point Clouds

Zhe Liu, Jinghua Hou, Xiaoqing Ye et al.

ECCV 2024 · arXiv:2407.10749
3d object detection, detection transformers, point cloud processing, query selection mechanisms (+3 more)
19 citations
#32

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

Jiacheng Zhang, Jiaming Li, Xiangru Lin et al.

CVPR 2024 · arXiv:2403.17387
19 citations
#33

Cubify Anything: Scaling Indoor 3D Object Detection

Justin Lazarow, David Griffiths, Gefen Kohavi et al.

CVPR 2025 · arXiv:2412.04458
18 citations
#34

Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

Haoxuanye Ji, Pengpeng Liang, Erkang Cheng

CVPR 2024 · arXiv:2403.06093
17 citations
#35

HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud

Wencan Cheng, Hao Tang, Luc Van Gool et al.

CVPR 2024 · arXiv:2404.03159
17 citations
#36

Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Inhee Lee, Byungjun Kim, Hanbyul Joo

CVPR 2024 · arXiv:2404.14410
16 citations
#37

Commonsense Prototype for Outdoor Unsupervised 3D Object Detection

Hai Wu, Shijia Zhao, Xun Huang et al.

CVPR 2024 · arXiv:2404.16493
16 citations
#38

HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection

Zijian Gu, Jianwei Ma, Yan Huang et al.

AAAI 2025 · arXiv:2412.11489
14 citations
#39

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024 · arXiv:2403.13556
14 citations
#40

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024
13 citations
#41

Detect Anything 3D in the Wild

Hanxue Zhang, Haoran Jiang, Qingsong Yao et al.

ICCV 2025 · arXiv:2504.07958
3d object detection, zero-shot generalization, monocular inputs, foundation models (+3 more)
12 citations
#42

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024 · arXiv:2407.10753
12 citations
#43

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024 · arXiv:2407.05256
open-vocabulary 3d detection, vision-language models, hierarchical alignment, zero-shot discovery (+2 more)
12 citations
#44

H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

Minyoung Park, Mirae Do, Yeon Jae Shin et al.

ICLR 2024 · arXiv:2402.08138
12 citations
#45

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

CVPR 2024 · arXiv:2402.19144
12 citations
#46

GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection

Jinqing Zhang, Yanan Zhang, Yunlong Qi et al.

AAAI 2025 · arXiv:2409.01816
12 citations
#47

RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion

Xiaomeng Chu, Jiajun Deng, Guoliang You et al.

CVPR 2025
11 citations
#48

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024
11 citations
#49

Instance Tracking in 3D Scenes from Egocentric Videos

Yunhan Zhao, Haoyu Ma, Shu Kong et al.

CVPR 2024 · arXiv:2312.04117
11 citations
#50

Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

Yujie Lu, Long Wan, Nayu Ding et al.

CVPR 2024 · arXiv:2403.01414
10 citations
#51

Disentangled Pre-training for Human-Object Interaction Detection

Zhuolong Li, Xingao Li, Changxing Ding et al.

CVPR 2024 · arXiv:2404.01725
10 citations
#52

Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective

Kaifang Long, Guoyang Xie, Lianbo Ma et al.

AAAI 2025 · arXiv:2412.17297
10 citations
#53

OctOcc: High-Resolution 3D Occupancy Prediction with Octree

Wenzhe Ouyang, Xiaolin Song, Bailan Feng et al.

AAAI 2024
10 citations
#54

YolOOD: Utilizing Object Detection Concepts for Multi-Label Out-of-Distribution Detection

Alon Zolfi, Guy Amit, Amit Baras et al.

CVPR 2024 · arXiv:2212.02081
10 citations
#55

Geometry-Guided Domain Generalization for Monocular 3D Object Detection

Fan Yang, Hui Chen, Yuwei He et al.

AAAI 2024
10 citations
#56

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Yi Yu, Botao Ren, Peiyuan Zhang et al.

CVPR 2025 · arXiv:2502.04268
oriented object detection, weakly-supervised detection, point annotations, gaussian overlap loss (+4 more)
10 citations
#57

V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection

Xun Huang, Jinlong Wang, Qiming Xia et al.

CVPR 2025 · arXiv:2411.08402
10 citations
#58

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

AAAI 2024 · arXiv:2407.09787
semi-supervised learning, 3d object detection, pseudo label generation, partial scene detection (+3 more)
9 citations
#59

OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Zhongyu Xia, Jishuo Li, Zhiwei Lin et al.

NeurIPS 2025 · arXiv:2411.17761
8 citations
#60

In-Hand 3D Object Reconstruction from a Monocular RGB Video

Shijian Jiang, Qi Ye, Rengan Xie et al.

AAAI 2024 · arXiv:2312.16425
in-hand 3d reconstruction, monocular rgb video, implicit neural representations, occlusion elucidation (+4 more)
7 citations
#61

Weakly Supervised Few-Shot Object Detection with DETR

Chenbo Zhang, Yinglu Zhang, Lu Zhang et al.

AAAI 2024
7 citations
#62

CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images

Guanlin Shen, Jingwei Huang, Zhihua Hu et al.

CVPR 2024 · arXiv:2403.04198
7 citations
#63

Weak-to-Strong 3D Object Detection with X-Ray Distillation

Alexander Gambashidze, Aleksandr Dadukin, Maksim Golyadkin et al.

CVPR 2024 · arXiv:2404.00679
6 citations
#64

TetraSphere: A Neural Descriptor for O(3)-Invariant Point Cloud Analysis

Pavlo Melnyk, Andreas Robinson, Michael Felsberg et al.

CVPR 2024 · arXiv:2211.14456
6 citations
#65

Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection

Hongru Yan, Yu Zheng, Yueqi Duan

ICLR 2025
6 citations
#66

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

Jianhao Li, Tianyu Sun, Zhongdao Wang et al.

ECCV 2024 · arXiv:2407.11382
3d shape prediction, 2d to 3d lifting, automatic 3d labeling, instance segmentation (+3 more)
6 citations
#67

Functionality Understanding and Segmentation in 3D Scenes

Jaime Corsetti, Francesco Giuliari, Alice Fasoli et al.

CVPR 2025
6 citations
#68

SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images

Yu Sheng, Jiajun Deng, Xinran Zhang et al.

ICCV 2025 · arXiv:2505.23044
semantic 3d reconstruction, 3d gaussian primitives, feedforward 3d reconstruction, dual-field semantic representation (+4 more)
6 citations
#69

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Yansong Guo, Jie Hu, Yansong Qu et al.

ICCV 2025 · arXiv:2503.08407
3d object segmentation, multi-view alignment, feed-forward mechanism, interactive segmentation (+3 more)
6 citations
#70

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection

Ruiyang Zhang, Hu Zhang, Zhedong Zheng

ICCV 2025 · arXiv:2408.00619
6 citations
#71

Towards RAW Object Detection in Diverse Conditions

Zhong-Yu Li, Xin Jin, Bo-Yuan Sun et al.

CVPR 2025
5 citations
#72

Pos3R: 6D Pose Estimation for Unseen Objects Made Easy

Weijian Deng, Dylan Campbell, Chunyi Sun et al.

CVPR 2025
5 citations
#73

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

Zhili Chen, Shuangjie Xu, Maosheng Ye et al.

ECCV 2024 · arXiv:2407.15354
3d object detection, bird's-eye-view representation, multi-camera images, vector representation (+3 more)
5 citations
#74

Omnidirectional Multi-Object Tracking

Kai Luo, Hao Shi, Sheng Wu et al.

CVPR 2025
5 citations
#75

Open-World Objectness Modeling Unifies Novel Object Detection

Shan Zhang, Yao Ni, Jinhao Du et al.

CVPR 2025
5 citations
#76

Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection

Zhanwei Zhang, Minghao Chen, Shuai Xiao et al.

CVPR 2024 · arXiv:2404.19384
5 citations
#77

Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection

Yucheng Han, Na Zhao, Weiling Chen et al.

AAAI 2024 · arXiv:2401.05011
semi-supervised 3d object detection, pseudo-label generation, teacher-student models, 3d data annotation (+4 more)
5 citations
#78

iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

Dongmin Choi, Wonwoo Cho, Kangyeol Kim et al.

AAAI 2024 · arXiv:2312.15449
interactive object detection, lidar point clouds, 3d annotation pipelines, negative click simulation (+4 more)
4 citations
#79

Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space

Xiaoyan Yu, Yifan Wei, Shuaishuai Zhou et al.

AAAI 2025 · arXiv:2412.10712
4 citations
#80

Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space

Leonhard Sommer, Olaf Dünkel, Christian Theobalt et al.

CVPR 2025 · arXiv:2504.21749
4 citations
#81

GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds

Shengjun Zhang, Xin Fei, Yueqi Duan

CVPR 2024 · arXiv:2403.19220
4 citations
#82

Instantaneous Perception of Moving Objects in 3D

Di Liu, Bingbing Zhuang, Dimitris N. Metaxas et al.

CVPR 2024 · arXiv:2405.02781
3 citations
#83

WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion

Khiem Vuong, N. Dinesh Reddy, Robert Tamburo et al.

CVPR 2024 · arXiv:2403.19022
3 citations
#84

Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision

Maoji Zheng, Ziyu Xu, Qiming Xia et al.

AAAI 2025 · arXiv:2503.16811
3 citations
#85

SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection

Bonan Ding, Jin Xie, Jing Nie et al.

AAAI 2025
3 citations
#86

DALDet: Depth-Aware Learning Based Object Detection for Autonomous Driving

K. Hu, Tongbo Cao, Yuan Li et al.

AAAI 2024
3 citations
#87

LabelAny3D: Label Any Object 3D in the Wild

Jin Yao, Radowan Mahmud Redoy, Sebastian Elbaum et al.

NeurIPS 2025 · arXiv:2601.01676
monocular 3d detection, 3d bounding box annotation, analysis-by-synthesis framework, open-vocabulary detection (+4 more)
3 citations
#88

Details Matter for Indoor Open-vocabulary 3D Instance Segmentation

Sanghun Jung, Jingjing Zheng, Ke Zhang et al.

ICCV 2025
3 citations
#89

GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector

Zechuan Li, Hongshan Yu, Yihao Ding et al.

CVPR 2025 · arXiv:2503.15211
neural radiance fields, 3d object detection, multi-view feature fusion, voxel representation (+3 more)
3 citations
#90

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

Yung-Hsu Yang, Luigi Piccinelli, Mattia Segu et al.

ICCV 2025 · arXiv:2507.23567
3 citations
#91

SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Chen Chen, Zhirui Wang, Taowei Sheng et al.

ICCV 2025 · arXiv:2503.16399
3 citations
#92

Semantic Causality-Aware Vision-Based 3D Occupancy Prediction

Dubing Chen, Huan Zheng, Yucheng Zhou et al.

ICCV 2025 · arXiv:2509.08388
3d semantic occupancy prediction, vision-based 3d reconstruction, 2d-to-3d transformation, semantic causality (+4 more)
3 citations
#93

Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval

Mankeerat Sidhu, Hetarth Chopra, Ansel Blume et al.

CVPR 2025 · arXiv:2409.18733
2 citations
#94

SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts

Shijia Zhao, Qiming Xia, Xusheng Guo et al.

CVPR 2025
2 citations
#95

Mitigating Ambiguities in 3D Classification with Gaussian Splatting

Ruiqi Zhang, Hao Zhu, Jingyi Zhao et al.

CVPR 2025 · arXiv:2503.08352
2 citations
#96

EVT: Efficient View Transformation for Multi-Modal 3D Object Detection

Yongjin Lee, Hyeon-Mun Jeong, Yurim Jeon et al.

ICCV 2025 · arXiv:2411.10715
2 citations
#97

Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection

Ahyun Seo, Minsu Cho

CVPR 2025 · arXiv:2503.20235
2 citations
#98

Interactive 3D Object Detection with Prompts

Ruifei Zhang, Xiangru Lin, Wei Zhang et al.

ECCV 2024
2 citations
#99

Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection

Yifan Chang, Junjie Huang, Xiaofeng Wang et al.

CVPR 2025
2 citations
#100

ContextHOI: Spatial Context Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

AAAI 2025 · arXiv:2412.09050
2 citations