Vision Recognition

Object Detection

Detecting and localizing objects in images

100 papers, 4,977 total citations


Feb '24 to Jan '26: 676 papers

Top Conferences

CVPR: 49, AAAI: 26, ECCV: 16, ICLR: 5, ICCV: 3, ICML: 1

Top Papers

#1

DETRs Beat YOLOs on Real-time Object Detection

Yian Zhao, Wenyu Lv, Shangliang Xu et al.

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed

Yifan Wang, Xingyi He, Sida Peng et al.

Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target Detection

Jiangnan Yang, Shuangli Liu, Jingjun Wu et al.

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Sihan Liu, Yiwei Ma, Xiaoqing Zhang et al.

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

Feng Lu, Xiangyuan Lan, Lijun Zhang et al.

Frequency-Spatial Entanglement Learning for Camouflaged Object Detection

Yanguang Sun, Chunyan Xu, Jian Yang et al.

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection

Yi-Xin Huang, Hou-I Liu, Hong-Han Shuai et al.

ECCV 2024, arXiv:2404.03507

tiny object detection, detr-like methods, dynamic query selection, object query adjustment, +4

56 citations

#8

FBRT-YOLO: Faster and Better for Real-Time Aerial Image Detection

Yao Xiao, Tingfa Xu, Yu Xin et al.

PointOBB: Learning Oriented Object Detection via Single Point Supervision

Junwei Luo, Xue Yang, Yi Yu et al.

Few-Shot Object Detection with Foundation Models

Guangxing Han, Ser-Nam Lim

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector

Yuqian Fu, Yu Wang, Yixuan Pan et al.

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

Junsu Kim, Hoseong Cho, Jihyeon Kim et al.

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

Lewei Yao, Renjie Pi, Jianhua Han et al.

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

Zhipeng Du, Miaojing Shi, Jiankang Deng

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

Yi Yu, Xue Yang, Qingyun Li et al.

Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach

Guoqiang Liang, Kanghao Chen, Hangyu Li et al.

Scene Adaptive Sparse Transformer for Event-based Object Detection

Yansong Peng, Li Hebei, Yueyi Zhang et al.

A Diffusion-Based Framework for Multi-Class Anomaly Detection

Haoyang He, Jiangning Zhang, Hongxu Chen et al.

AAAI 2024, arXiv:2312.06607

diffusion models, anomaly detection, multi-class setting, semantic-guided reconstruction, +4

40 citations

#19

Watermark Anything With Localized Messages

Tom Sander, Pierre Fernandez, Alain Oliviero Durmus et al.

ICLR 2025, arXiv:2411.07231

localized image watermarking, watermark segmentation, multiple watermark embedding, imperceptibility constraints, +4

38 citations

#20

5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks

Dongshuo Yin, Leiyi Hu, Bin Li et al.

Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection

Yajing Liu, Shijun Zhou, Xiyao Liu et al.

ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation

Moayed Haji Ali, Guha Balakrishnan, Vicente Ordonez

DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment

Jiuming Liu, Dong Zhuo, Zhiheng Feng et al.

ECCV 2024, arXiv:2403.18274

visual-lidar fusion, odometry estimation, structure alignment, local-to-global fusion, +3

36 citations

#24

UnO: Unsupervised Occupancy Fields for Perception and Forecasting

Ben Agro, Quinlan Sykora, Sergio Casas et al.

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Yoad Tewel, Rinon Gal, Dvir Samuel et al.

ICLR 2025, arXiv:2411.07232

attention mechanism, diffusion models, semantic image editing, object insertion, +3

34 citations

#26

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Hengrui Kang, Siwei Wen, Zichen Wen et al.

CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli et al.

Localization Is All You Evaluate: Data Leakage in Online Mapping Datasets and How to Fix It

Adam Lilja, Junsheng Fu, Erik Stenborg et al.

LEOD: Label-Efficient Object Detection for Event Cameras

Ziyi Wu, Mathias Gehrig, Qing Lyu et al.

Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection

Wei Luo, Yunkang Cao, Haiming Yao et al.

RUN: Reversible Unfolding Network for Concealed Object Segmentation

Chunming He, Rihan Zhang, Fengyang Xiao et al.

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

Beomyoung Kim, Joonsang Yu, Sung Ju Hwang

Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning

Xinyi Wu, Wentao Ma, Dan Guo et al.

The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding

Lorenzo Bianchi, Fabio Carrara, Nicola Messina et al.

FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection

Chanho Lee, Jinsu Son, Hyounguk Shon et al.

AAAI 2024, arXiv:2401.06159

rotation-equivariance, oriented object detection, deformable convolution, aerial image analysis, +4

26 citations

#36

Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-identification

Jiaer Xia, Lei Tan, Pingyang Dai et al.

AAAI 2024, arXiv:2303.10976

occluded person re-identification, attention mechanism, transformer architecture, generalization enhancement, +3

24 citations

#37

MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models

Yasiru Ranasinghe, Deepti Hegde, Vishal M. Patel

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Hu Zhang, Xu Jianhua, Tao Tang et al.

Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir et al.

Supervised Anomaly Detection for Complex Industrial Images

Aimira Baitieva, David Hurych, Victor Besnier et al.

Benchmarking Object Detectors with COCO: A New Path Forward

Shweta Singh, Aayan Yadav, Jitesh Jain et al.

Simple Image-Level Classification Improves Open-Vocabulary Object Detection

Ruohuan Fang, Guansong Pang, Xiao Bai

AAAI 2024, arXiv:2312.10439

open-vocabulary object detection, vision-language models, contextual scene understanding, multi-label recognition, +3

22 citations

#43

Weakly-Supervised Temporal Action Localization by Inferring Salient Snippet-Feature

Wu Yun, Mengshi Qi, Chuanming Wang et al.

AAAI 2024, arXiv:2303.12332

weakly-supervised temporal action localization, salient snippet-feature inference, pseudo label generation, temporal structure exploitation, +3

21 citations

#44

Sketch and Refine: Towards Fast and Accurate Lane Detection

Chao Chen, Jie Liu, Chang Zhou et al.

AAAI 2024, arXiv:2401.14729

lane detection, proposal-based methods, keypoint-based methods, lane segment association, +2

20 citations

#45

360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries

Huajian Huang, Changkun Liu, Yipeng Zhu et al.

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Jiaqi Huang, Zunnan Xu, Ting Liu et al.

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

Jiacheng Zhang, Jiaming Li, Xiangru Lin et al.

Continuous Memory Representation for Anomaly Detection

Joo Chan Lee, Taejune Kim, Eunbyung Park et al.

ECCV 2024, arXiv:2402.18293

anomaly detection, unsupervised learning, memory-based methods, continuous representation, +3

19 citations

#49

Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset

Xiao Wang, Yu Jin, Wentao Wu et al.

Dense Projection for Anomaly Detection

Dazhi Fu, Zhao Zhang, Jicong Fan

OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking

Xuanyu Zhang, Zecheng Tang, Zhipei Xu et al.

CVPR 2025, arXiv:2412.01615

digital image watermarking, tamper localization, copyright protection, generative ai editing, +4

18 citations

#52

Visible and Clear: Finding Tiny Objects in Difference Map

Bing Cao, Haiyu Yao, Pengfei Zhu et al.

Zero-Shot Aerial Object Detection with Visual Description Regularization

Chenyu Lin, Zhengqing Zang, Chenwei Tang et al.

AAAI 2024, arXiv:2402.18233

zero-shot detection, aerial object detection, visual description regularization, semantic-visual correlation, +4

18 citations

#54

Crowd-SAM: SAM as a smart annotator for object detection in crowded scenes

Zhi Cai, Yingjie Gao, Yaoyan Zheng et al.

EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition

Issar Tzachor, Boaz Lerner, Matan Levy et al.

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Vidit Goel, Elia Peruzzo, Yifan Jiang et al.

Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

Haoxuanye Ji, Pengpeng Liang, Erkang Cheng

PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration

Runzhao Yao, Shaoyi Du, Wenting Cui et al.

ECCV 2024, arXiv:2407.10142

point cloud registration, rotation-equivariant networks, rotation-invariant features, position-aware convolution, +2

16 citations

#59

ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open Vocabulary Object Detection

Joonhyun Jeong, Geondo Park, Jayeon Yoo et al.

AAAI 2024, arXiv:2312.07266

open vocabulary object detection, proxy novel classes, clip embedding space, classwise mixup, +4

16 citations

#60

Weakly Supervised Open-Vocabulary Object Detection

Jianghang Lin, Yunhang Shen, Bingquan Wang et al.

AAAI 2024, arXiv:2312.12437

weakly supervised object detection, open-vocabulary object detection, vision-language alignment, dataset bias adaptation, +3

16 citations

#61

Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection

Shengjia Chen, Luping Ji, Weiwei Duan et al.

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Nicolas Dufour, Vicky Kalogeiton, David Picard et al.

CVPR 2025, arXiv:2412.06781

visual geolocation, generative geolocation, diffusion models, riemannian flow matching, +3

16 citations

#63

Commonsense Prototype for Outdoor Unsupervised 3D Object Detection

Hai Wu, Shijia Zhao, Xun Huang et al.

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

Chen Duan, Pei Fu, Shan Guo et al.

Semi-supervised Open-World Object Detection

Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer et al.

AAAI 2024, arXiv:2402.16013

open-world object detection, semi-supervised learning, object query representations, feature-alignment scheme, +4

15 citations

#66

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

Ke Li, Di Wang, Zhangyuan Hu et al.

Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms

Joren Brunekreef, Eric Marcus, Ray Sheombarsing et al.

What, How, and When Should Object Detectors Update in Continually Changing Test Domains?

Jayeon Yoo, Dongkwan Lee, Inseop Chung et al.

ILIAS: Instance-Level Image retrieval At Scale

Giorgos Kordopatis-Zilos, Vladan Stojnić, Anna Manko et al.

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

Botao Ren, Xue Yang, Yi Yu et al.

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Hang Zhou, Jiale Cai, Yuteng Ye et al.

MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects

Lei Fan, Dongdong Fan, Zhiguang Hu et al.

JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba

Xiaoyong Lu, Songlin Du

CVPR 2025, arXiv:2503.03437

local feature matching, mamba architecture, linear complexity, scan-merge strategy, +3

14 citations

#74

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

Wuyang Li, Xinyu Liu, Jiayi Ma et al.

ECCV 2024

open-vocabulary object detection, diffusion models, latent space alignment, distribution transfer, +4

14 citations

#75

ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Yin Zhang, Yongqiang Zhang, Zian Zhang et al.

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian Shao, Guangqian Guo et al.

SmartEraser: Remove Anything from Images using Masked-Region Guidance

Longtao Jiang, Zhendong Wang, Jianmin Bao et al.

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Changwei Wang, Shunpeng Chen, Yukun Song et al.

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, Hongdong Li et al.

ECCV 2024, arXiv:2406.00474

cross-view localization, weakly supervised learning, knowledge self-distillation, pseudo ground truth, +3

12 citations

#80

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

Rethinking Features-Fused-Pyramid-Neck for Object Detection

Hulin Li

BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

Zhaochen Liu, Zhixuan Li, Tingting Jiang

AAAI 2024, arXiv:2401.01642

amodal segmentation, box-level supervision, directed expansion, occluded objects, +3

11 citations

#83

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

Clément Chadebec, Onur Tasar, Sanjeev Sreetharan et al.

MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios

Jiacheng Ruan, Wenzhen Yuan, Zehao Lin et al.

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Yi Yu, Botao Ren, Peiyuan Zhang et al.

CVPR 2025, arXiv:2502.04268

oriented object detection, weakly-supervised detection, point annotations, gaussian overlap loss, +4

10 citations

#86

Disentangled Pre-training for Human-Object Interaction Detection

Zhuolong Li, Xingao Li, Changxing Ding et al.

YolOOD: Utilizing Object Detection Concepts for Multi-Label Out-of-Distribution Detection

Alon Zolfi, Guy Amit, Amit Baras et al.

Geometry-Guided Domain Generalization for Monocular 3D Object Detection

Fan Yang, Hui Chen, Yuwei He et al.

Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation

Ting Liu, Siyuan Li

CVPR 2025, arXiv:2504.00356

zero-shot referring image segmentation, hybrid global-local representation, spatial guidance augmentation, mask region representation, +4

10 citations

#90

GLASS: Guided Latent Slot Diffusion for Object-Centric Learning

Krishnakant Singh, Simone Schaub-Meyer, Stefan Roth

CVPR 2025, arXiv:2407.17929

object-centric learning, slot attention models, latent slot diffusion, object discovery, +3

9 citations

#91

Region-Based Optimization in Continual Learning for Audio Deepfake Detection

Yujie Chen, Jiangyan Yi, Cunhang Fan et al.

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

Wenlong Liu, Tianyu Yang, Yuhan Wang et al.

GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization

Yirui Chen, Xudong Huang, Quan Zhang et al.

Active Object Detection with Knowledge Aggregation and Distillation from Large Models

Dejie Yang, Yang Liu

Learning to Make Keypoints Sub-Pixel Accurate

Shinjeong Kim, Marc Pollefeys, Daniel Barath

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

Perspective-Invariant 3D Object Detection

Alan Liang, Lingdong Kong, Dongyue Lu et al.

ICCV 2025, arXiv:2507.17665

3d object detection, lidar-based perception, cross-platform adaptation, perspective-invariant detection, +4

9 citations

#98

MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context

Shuai Lyu, Rongchen Zhang, Zeqi Ma et al.

RAD: Region-Aware Diffusion Models for Image Inpainting

Sora Kim, Sungho Suh, Minsik Lee

Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation

Ziheng Zhang, Jianyang Gu, Arpita Chowdhury et al.

CVPR 2025

8 citations
