"object detection" Papers

45 papers found

COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts

Jiansheng Li, Xingxuan Zhang, Hao Zou et al.

CVPR 2025highlightarXiv:2504.10158
1
citations

End-to-End Low-Light Enhancement for Object Detection with Learned Metadata from RAWs

Xuelin Shen, Haifeng Jiao, Yitong Wang et al.

NeurIPS 2025poster

FRBNet: Revisiting Low-Light Vision through Frequency-Domain Radial Basis Network

Fangtong Sun, Congyu Li, Ke Yang et al.

NeurIPS 2025posterarXiv:2510.23444

Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection

Boyong He, Yuxiang Ji, Qianwen Ye et al.

CVPR 2025posterarXiv:2503.02101
5
citations

Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling

Fengxiang Wang, Hongzhen Wang, Di Wang et al.

ICCV 2025posterarXiv:2406.11933
10
citations

MobileODE: An Extra Lightweight Network

Le Yu, Jun Wu, Bo Gou et al.

NeurIPS 2025poster

Multi-Kernel Correlation-Attention Vision Transformer for Enhanced Contextual Understanding and Multi-Scale Integration

Hongkang Zhang, Shao-Lun Huang, Ercan KURUOGLU et al.

NeurIPS 2025poster

Multiple Object Tracking as ID Prediction

Ruopeng Gao, Ji Qi, Limin Wang

CVPR 2025posterarXiv:2403.16848
53
citations

Real-Time Scene-Adaptive Tone Mapping for High-Dynamic Range Object Detection

Gongzhe Li, Linwei Qiu, Peibei Cao et al.

NeurIPS 2025poster

SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning

Seokju Yun, Seunghye Chae, Dongheon Lee et al.

CVPR 2025highlightarXiv:2412.04077
8
citations

TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras

Mohammad Mohammadi, Ziyi Wu, Igor Gilitschenski

ICCV 2025posterarXiv:2508.00913

TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba

Xiaowen Ma, Zhen-Liang Ni, Xinghao Chen

ICCV 2025posterarXiv:2411.17473
17
citations

T-norm Selection for Object Detection in Autonomous Driving with Logical Constraints

Thomas Eiter, Katsumi Inoue, Nelson Higuera et al.

NeurIPS 2025poster

UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement

Xiao Zhang, Fei Wei, Yong Wang et al.

ICCV 2025posterarXiv:2507.00721

VSSD: Vision Mamba with Non-Causal State Space Duality

Yuheng Shi, Mingjia Li, Minjing Dong et al.

ICCV 2025posterarXiv:2407.18559
24
citations

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.

ECCV 2024posterarXiv:2403.07263
18
citations

Agglomerative Token Clustering

Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.

ECCV 2024posterarXiv:2409.11923
7
citations

Cached Transformers: Improving Transformers with Differentiable Memory Cached

Zhaoyang Zhang, Wenqi Shao, Yixiao Ge et al.

AAAI 2024paperarXiv:2312.12742
5
citations

COALA: A Practical and Vision-Centric Federated Learning Platform

Weiming Zhuang, Jian Xu, Chen Chen et al.

ICML 2024poster

Data-free Neural Representation Compression with Riemannian Neural Dynamics

Zhengqi Pei, Anran Zhang, Shuhui Wang et al.

ICML 2024poster

DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

Donghyun Kim, Byeongho Heo, Dongyoon Han

ECCV 2024posterarXiv:2403.19588
40
citations

DetKDS: Knowledge Distillation Search for Object Detectors

Lujun Li, Yufan Bao, Peijie Dong et al.

ICML 2024poster

DFD: Distilling the Feature Disparity Differently for Detectors

Kang Liu, Yingyi Zhang, Jingyun Zhang et al.

ICML 2024poster

Differentiable Model Scaling using Differentiable Topk

Kai Liu, Ruohui Wang, Jianfei Gao et al.

ICML 2024poster

Discrete Latent Perspective Learning for Segmentation and Detection

Deyi Ji, Feng Zhao, Lanyun Zhu et al.

ICML 2024spotlight

Distilling Knowledge from Large-Scale Image Models for Object Detection

Gang Li, Wenhai Wang, Xiang Li et al.

ECCV 2024poster
3
citations

EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks

Ziming Wang, Ziling Wang, Huaning Li et al.

ECCV 2024posterarXiv:2403.12574
24
citations

G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

Fan Wu, Jinling Gao, Lanqing Hong et al.

AAAI 2024paperarXiv:2402.04672
22
citations

Improving fine-grained understanding in image-text pre-training

Ioana Bica, Anastasija Ilic, Matthias Bauer et al.

ICML 2024poster

Make RepVGG Greater Again: A Quantization-Aware Approach

Xuesong Nie, Yunfeng Yan, Siyuan Li et al.

AAAI 2024paperarXiv:2212.01593
65
citations

Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework

Weixi Weng, Chun Yuan

AAAI 2024paperarXiv:2310.15646

Modality Translation for Object Detection Adaptation without forgetting prior knowledge

Heitor Rapela Medeiros, Masih Aminbeidokhti, Fidel A Guerrero Pena et al.

ECCV 2024posterarXiv:2404.01492
4
citations

Multi-scale Cross Distillation for Object Detection in Aerial Images

Kun Wang, Zi Wang, Zhang Li et al.

ECCV 2024poster
2
citations

One Step Learning, One Step Review

Huang Xiaolong, Qiankun Li, Xueran Li et al.

AAAI 2024paperarXiv:2401.10962
2
citations

Receptive Fields As Experts in Convolutional Neural Architectures

Dongze Lian, Weihao Yu, Xinchao Wang

ICML 2024poster

SCoRe: Submodular Combinatorial Representation Learning

Anay Majee, Suraj Kothawade, Krishnateja Killamsetty et al.

ICML 2024poster

Seeing Faces in Things: A Model and Dataset for Pareidolia

Mark T Hamilton, Simon Stent, Vasha G DuTell et al.

ECCV 2024posterarXiv:2409.16143
4
citations

Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning

Kaiyou Song, Shan Zhang, Tong Wang

AAAI 2024paperarXiv:2312.10457
2
citations

Semantic-Aware Transformation-Invariant RoI Align

Guo-Ye Yang, Kiyohiro Nakayama, Zi-Kai Xiao et al.

AAAI 2024paperarXiv:2312.09609

SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization

Jialong Guo, Xinghao Chen, Yehui Tang et al.

ICML 2024poster

SlowTrack: Increasing the Latency of Camera-Based Perception in Autonomous Driving Using Adversarial Examples

Chen Ma, Ningfei Wang, Qi Alfred Chen et al.

AAAI 2024paperarXiv:2312.09520
37
citations

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once

Zhangheng Li, Shiwei Liu, Tianlong Chen et al.

ICML 2024poster

Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired Approach

Yancheng Wang, Ping Li, Yingzhen Yang

ICML 2024poster

Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation

Prantik Howlader, Hieu Le, Dimitris Samaras

ECCV 2024posterarXiv:2407.12630
2
citations

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Chien-Yao Wang, I-Hau Yeh, Hong-Yuan Mark Liao

ECCV 2024posterarXiv:2402.13616
2952
citations