Most Cited 2024 "active vision" Papers

12,324 papers found • Page 52 of 62

#10201

InterpretARA: Enhancing Hybrid Automatic Readability Assessment with Linguistic Feature Interpreter and Contrastive Learning

Jinshan Zeng, Xianchao Tong, Xianglong Yu et al.

AAAI 2024paper

#10202

Learning Multi-Modal Cross-Scale Deformable Transformer Network for Unregistered Hyperspectral Image Super-resolution

Wenqian Dong, Yang Xu, Jiahui Qu et al.

AAAI 2024paper

#10203

ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding

Ziyang Lu, Yunqiang Pei, Guoqing Wang et al.

AAAI 2024paperarXiv:2303.13186

#10204

Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB

Shengheng Liu, Xingkang Li, Zihuan Mao et al.

AAAI 2024paperarXiv:2501.00009

#10205

Response Enhanced Semi-supervised Dialogue Query Generation

Jianheng Huang, Ante Wang, Linfeng Gao et al.

AAAI 2024paperarXiv:2312.12713

#10206

READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

Thong Nguyen, Xiaobao Wu, Xinshuai Dong et al.

AAAI 2024paper

#10207

Winnie: Task-Oriented Dialog System with Structure-Aware Contrastive Learning and Enhanced Policy Planning

Kaizhi Gao, Tianyu Wang, Zhongjing Ma et al.

AAAI 2024paper

#10208

Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI Detection

Jiayi Gao, Kongming Liang, Tao Wei et al.

AAAI 2024paper

#10209

Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation

Chenyang Wang, Junjun Jiang, Kui Jiang et al.

AAAI 2024paper

#10210

One Self-Configurable Model to Solve Many Abstract Visual Reasoning Problems

Mikołaj Małkiński, Jacek Mańdziuk

AAAI 2024paperarXiv:2312.09997

#10211

A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging

Liya Ji, ZheFan Rao, Sinno Jialin Pan et al.

AAAI 2024paper

#10212

Self-Supervised 3D Human Mesh Recovery from a Single Image with Uncertainty-Aware Learning

Guoli Yan, Zichun Zhong, Jing Hua

AAAI 2024paper

#10213

Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

Junghun Cha, Ali Haider, Seoyun Yang et al.

AAAI 2024paperarXiv:2402.05350

#10214

SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D Images

Weihao Cheng, Yan-Pei Cao, Ying Shan

AAAI 2024paper

#10215

Collaborative Tooth Motion Diffusion Model in Digital Orthodontics

Yeying Fan, Guangshun Wei, Chen Wang et al.

AAAI 2024paper

#10216

An Information-Flow Perspective on Algorithmic Fairness

Samuel Teuber, Bernhard Beckert

AAAI 2024paperarXiv:2312.10128

#10217

KeDuSR: Real-World Dual-Lens Super-resolution via Kernel-Free Matching

Huanjing Yue, Zifan Cui, Kun Li et al.

AAAI 2024paperarXiv:2312.17050

#10218

Robustly Train Normalizing Flows via KL Divergence Regularization

Kun Song, Ruben Solozabal Ochoa de Retana, Hao Li et al.

AAAI 2024paper

#10219

CoVR: Learning Composed Video Retrieval from Web Video Captions

Lucas Ventura, Antoine Yang, Cordelia Schmid et al.

AAAI 2024paper

#10220

Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Data

Heejo Kong, Suneung Kim, Ho-Joong Kim et al.

AAAI 2024paper

#10221

Learning Encodings for Constructive Neural Combinatorial Optimization Needs to Regret

Rui Sun, Zhi Zheng, Zhenkun Wang

AAAI 2024paper

#10222

Taming the Sigmoid Bottleneck: Provably Argmaxable Sparse Multi-Label Classification

Andreas Grivas, Antonio Vergari, Adam Lopez

AAAI 2024paperarXiv:2310.10443

#10223

DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)

Qiaoyue Tang, Frederick Shpilevskiy, Mathias Lécuyer

AAAI 2024paperarXiv:2312.14334

#10224

Fine-Tuning Graph Neural Networks by Preserving Graph Generative Patterns

Yifei Sun, Qi Zhu, Yang Yang et al.

AAAI 2024paperarXiv:2312.13583

#10225

MEPSI: An MDL-Based Ensemble Pruning Approach with Structural Information

Xiao-Dong Bi, Shao-Qun Zhang, Yuan Jiang

AAAI 2024paper

#10226

Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach

Yu Wang, Yuxuan Yin, Karthik Somayaji NS et al.

AAAI 2024paperarXiv:2310.13110

#10227

New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem

Koji Ichikawa, Shinji Ito, Daisuke Hatano et al.

AAAI 2024paperarXiv:2312.12400

#10228

Universal Weak Coreset

Ragesh Jaiswal, Amit Kumar

AAAI 2024paperarXiv:2305.16890

#10229

Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation

Minqin Zhu, Anpeng Wu, Haoxuan Li et al.

AAAI 2024paperarXiv:2403.14232

#10230

RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction

Yemin Yu, Luotian Yuan, Ying WEI et al.

AAAI 2024paperarXiv:2312.10900

#10231

MemoryBank: Enhancing Large Language Models with Long-Term Memory

Wanjun Zhong, Lianghong Guo, Qiqi Gao et al.

AAAI 2024paperarXiv:2305.10250

#10232

REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu, Zhilu Wang, Weichao Zhou et al.

AAAI 2024paper

#10233

CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification

Kaitao Chen, Shiliang Sun, Jing Zhao

AAAI 2024paper

#10234

Approximation Scheme for Weighted Metric Clustering via Sherali-Adams

Dmitrii Avdiukhin, Vaggos Chatziafratis, Konstantin Makarychev et al.

AAAI 2024paper

#10235

Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs

Seungjun Lee, TaeIL Oh

AAAI 2024paperarXiv:2312.10975

#10236

Contextual Pandora’s Box

Alexia Atsidakou, Constantine Caramanis, Evangelia Gergatsouli et al.

AAAI 2024paper

#10237

Robust Distributed Gradient Aggregation Using Projections onto Gradient Manifolds

Kwang In Kim

AAAI 2024paper

#10238

Generative Model Perception Rectification Algorithm for Trade-Off between Diversity and Quality

Guipeng Lan, Shuai Xiao, Jiachen Yang et al.

AAAI 2024paper

#10239

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.

AAAI 2024paperarXiv:2312.10648

#10240

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

Jianhao Ding, Zhaofei Yu, Tiejun Huang et al.

AAAI 2024paper

#10241

A Closer Look at Curriculum Adversarial Training: From an Online Perspective

Lianghe Shi, Weiwei Liu

AAAI 2024paper

#10242

Provably Convergent Federated Trilevel Learning

Yang Jiao, Kai YANG, Tiancheng Wu et al.

AAAI 2024paperarXiv:2312.11835

#10243

Dynamic Knowledge Injection for AIXI Agents

Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

AAAI 2024paperarXiv:2312.16184

#10244

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Tianyuan Jin, Hao-Lun Hsu, William Chang et al.

AAAI 2024paperarXiv:2312.15549

#10245

Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection

AAAI 2024paper

#10246

A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities

AAAI 2024paper

#10247

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

AAAI 2024paper

#10248

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Zhen Qin, Feiyi Chen, Chen Zhi et al.

AAAI 2024paperarXiv:2309.16456

#10249

Transportable Representations for Domain Generalization

Kasra Jalaldoust, Elias Bareinboim

AAAI 2024paper

#10250

Exponential Hardness of Optimization from the Locality in Quantum Neural Networks

Hao-Kai Zhang, Chengkai Zhu, Geng Liu et al.

AAAI 2024paper

#10251

MFOS: Model-Free & One-Shot Object Pose Estimation

JongMin Lee, Yohann Cabon, Romain Brégier et al.

AAAI 2024paper

#10252

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

Jiangmeng Li, Yifan Jin, Hang Gao et al.

AAAI 2024paperarXiv:2312.14222

#10253

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Yige Yuan, Bingbing Xu, Bo Lin et al.

AAAI 2024paperarXiv:2305.15835

#10254

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

AAAI 2024paperarXiv:2306.03364

#10255

Towards Real-World Test-Time Adaptation: Tri-net Self-Training with Balanced Normalization

Yongyi Su, Xun Xu, Kui Jia

AAAI 2024paperarXiv:2309.14949

#10256

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Longchao Da, Porter Jenkins, Trevor Schwantes et al.

AAAI 2024paperarXiv:2312.11551

#10257

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Ruiqian Nai, Zixin Wen, Ji Li et al.

AAAI 2024paperarXiv:2403.00352

#10258

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.

AAAI 2024paperarXiv:2303.11611

#10259

HAGO-Net: Hierarchical Geometric Message Passing for Molecular Representation Learning

Hongbin Pei, Taile Chen, Chen A et al.

AAAI 2024paper

#10260

Unsupervised Template-assisted Point Cloud Shape Correspondence Network

Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

CVPR 2024posterarXiv:2403.16412

#10261

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Shuofeng Sun, Yongming Rao, Jiwen Lu et al.

CVPR 2024posterarXiv:2404.15010

#10262

Efficient Model Stealing Defense with Noise Transition Matrix

Dong-Dong Wu, Chilin Fu, Weichang Wu et al.

CVPR 2024poster

#10263

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

bowen zhang, Xiaojie Jin, Weibo Gong et al.

CVPR 2024posterarXiv:2301.07868

#10264

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

Junyuan Zhang, Shuang Zeng, Miao Zhang et al.

CVPR 2024poster

#10265

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Jianqiang Wan, Sibo Song, Wenwen Yu et al.

CVPR 2024posterarXiv:2403.19128

#10266

CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning

Shiyu Tian, Hongxin Wei, Yiqun Wang et al.

CVPR 2024posterarXiv:2303.10365

#10267

PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

Kun Yuan, Hongbo Liu, Mading Li et al.

CVPR 2024posterarXiv:2405.17765

#10268

Improved Self-Training for Test-Time Adaptation

Jing Ma

CVPR 2024poster

#10269

Mudslide: A Universal Nuclear Instance Segmentation Method

Jun Wang

CVPR 2024highlight

#10270

Rewrite the Stars

Xu Ma, Xiyang Dai, Yue Bai et al.

CVPR 2024posterarXiv:2403.19967

#10271

Virtual Immunohistochemistry Staining for Histological Images Assisted by Weakly-supervised Learning

Jiahan Li, Jiuyang Dong, Shenjin Huang et al.

CVPR 2024poster

#10272

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

Chenfeng Xu, Huan Ling, Sanja Fidler et al.

CVPR 2024posterarXiv:2311.04391

#10273

Model Adaptation for Time Constrained Embodied Control

Jaehyun Song, Minjong Yoo, Honguk Woo

CVPR 2024posterarXiv:2406.11128

#10274

SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

Kejia Yin, Varshanth Rao, Ruowei Jiang et al.

CVPR 2024posterarXiv:2405.18322

#10275

Residual Denoising Diffusion Models

Jiawei Liu, Qiang Wang, Huijie Fan et al.

CVPR 2024posterarXiv:2308.13712

#10276

Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle

Hyeokjun Kweon, Jihun Kim, Kuk-Jin Yoon

CVPR 2024poster

#10277

Generating Content for HDR Deghosting from Frequency View

Tao Hu, Qingsen Yan, Yuankai Qi et al.

CVPR 2024posterarXiv:2404.00849

#10278

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers

Sheng Yang, Jiawang Bai, Kuofeng Gao et al.

CVPR 2024poster

#10279

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Sijin Chen, Xin Chen, Chi Zhang et al.

CVPR 2024poster

#10280

GenTron: Diffusion Transformers for Image and Video Generation

Shoufa Chen, Mengmeng Xu, Jiawei Ren et al.

CVPR 2024posterarXiv:2312.04557

#10281

Backpropagation-free Network for 3D Test-time Adaptation

YANSHUO WANG, Ali Cheraghian, Zeeshan Hayder et al.

CVPR 2024posterarXiv:2403.18442

#10282

TransNeXt: Robust Foveal Visual Perception for Vision Transformers

Dai Shi

CVPR 2024posterarXiv:2311.17132

#10283

Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation

Xiaoyu Liu, Miaomiao Cai, Yinda Chen et al.

CVPR 2024poster

#10284

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors

Zhipeng Hu, Minda Zhao, Chaoyi Zhao et al.

CVPR 2024posterarXiv:2308.13223

#10285

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

Lingteng Qiu, Guanying Chen, Xiaodong Gu et al.

CVPR 2024highlightarXiv:2311.16918

#10286

Robust Synthetic-to-Real Transfer for Stereo Matching

Jiawei Zhang, Jiahe Li, Lei Huang et al.

CVPR 2024posterarXiv:2403.07705

#10287

Understanding and Improving Source-free Domain Adaptation from a Theoretical Perspective

Yu Mitsuzumi, Akisato Kimura, Hisashi Kashima

CVPR 2024poster

#10288

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Hao Ouyang, Qiuyu Wang, Yuxi Xiao et al.

CVPR 2024highlightarXiv:2308.07926

#10289

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning

Ruyang Liu, Chen Li, Yixiao Ge et al.

CVPR 2024posterarXiv:2309.15785

#10290

Video Frame Interpolation via Direct Synthesis with the Event-based Reference

Yuhan Liu, Yongjian Deng, Hao Chen et al.

CVPR 2024poster

#10291

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

Bo-Yuan Sun, Yuqi Yang, Le Zhang et al.

CVPR 2024posterarXiv:2306.04300

#10292

Rethinking Boundary Discontinuity Problem for Oriented Object Detection

Hang Xu, Xinyuan Liu, Haonan Xu et al.

CVPR 2024posterarXiv:2305.10061

#10293

Dual Prior Unfolding for Snapshot Compressive Imaging

Jiancheng Zhang, Haijin Zeng, Jiezhang Cao et al.

CVPR 2024poster

#10294

MCNet: Rethinking the Core Ingredients for Accurate and Efficient Homography Estimation

Haokai Zhu, Si-Yuan Cao, Jianxin Hu et al.

CVPR 2024poster

#10295

Uncertainty-Guided Never-Ending Learning to Drive

Lei Lai, Eshed Ohn-Bar, Sanjay Arora et al.

CVPR 2024poster

#10296

Feedback-Guided Autonomous Driving

Jimuyang Zhang, Zanming Huang, Arijit Ray et al.

CVPR 2024highlight

#10297

Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

Oren Kraus, Kian Kenyon-Dean, Saber Saberian et al.

CVPR 2024highlightarXiv:2404.10242

#10298

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos

Jiakai Sun, Han Jiao, Guangyuan Li et al.

CVPR 2024highlightarXiv:2403.01444

#10299

TextCraftor: Your Text Encoder Can be Image Quality Controller

Yanyu Li, Xian Liu, Anil Kag et al.

CVPR 2024posterarXiv:2403.18978

#10300

QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

Ishak Ayad, Nicolas Larue, Mai K. Nguyen

CVPR 2024posterarXiv:2402.17951

#10301

Prompt3D: Random Prompt Assisted Weakly-Supervised 3D Object Detection

Xiaohong Zhang, Huisheng Ye, Jingwen Li et al.

CVPR 2024poster

#10302

Efficient Meshflow and Optical Flow Estimation from Event Cameras

Xinglong Luo, Ao Luo, Zhengning Wang et al.

CVPR 2024poster

#10303

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

Jiazuo Yu, Yunzhi Zhuge, Lu Zhang et al.

CVPR 2024posterarXiv:2403.11549

#10304

CORES: Convolutional Response-based Score for Out-of-distribution Detection

Keke Tang, Chao Hou, Weilong Peng et al.

CVPR 2024poster

#10305

Equivariant Multi-Modality Image Fusion

Zixiang Zhao, Haowen Bai, Jiangshe Zhang et al.

CVPR 2024posterarXiv:2305.11443

#10306

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

Zheng Li, Xiang Li, xinyi fu et al.

CVPR 2024posterarXiv:2403.02781

#10307

Domain Gap Embeddings for Generative Dataset Augmentation

Yinong Oliver Wang, Younjoon Chung, Chen Henry Wu et al.

CVPR 2024poster

#10308

Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

Jingyun Wang, Guoliang Kang

CVPR 2024posterarXiv:2408.06747

#10309

Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification

Sravanti Addepalli, Ashish Asokan, Lakshay Sharma et al.

CVPR 2024posterarXiv:2310.08255

#10310

Draw Step by Step: Reconstructing CAD Construction Sequences from Point Clouds via Multimodal Diffusion

Weijian Ma, Shuaiqi Chen, Yunzhong Lou et al.

CVPR 2024poster

#10311

Open-Vocabulary 3D Semantic Segmentation with Foundation Models

Li Jiang, Shaoshuai Shi, Bernt Schiele

CVPR 2024highlight

#10312

Class Tokens Infusion for Weakly Supervised Semantic Segmentation

Sung-Hoon Yoon, Hoyong Kwon, Hyeonseong Kim et al.

CVPR 2024poster

#10313

GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs

Gege Gao, Weiyang Liu, Anpei Chen et al.

CVPR 2024posterarXiv:2312.00093

#10314

SeD: Semantic-Aware Discriminator for Image Super-Resolution

Bingchen Li, Xin Li, Hanxin Zhu et al.

CVPR 2024posterarXiv:2402.19387

#10315

JoAPR: Cleaning the Lens of Prompt Learning for Vision-Language Models

YUNCHENG GUO, Xiaodong Gu

CVPR 2024poster

#10316

View From Above: Orthogonal-View aware Cross-view Localization

Shan Wang, Chuong Nguyen, Jiawei Liu et al.

CVPR 2024poster

#10317

WorDepth: Variational Language Prior for Monocular Depth Estimation

Ziyao Zeng, Hyoungseob Park, Fengyu Yang et al.

CVPR 2024posterarXiv:2404.03635

#10318

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Tai Wang, Xiaohan Mao, Chenming Zhu et al.

CVPR 2024posterarXiv:2312.16170

#10319

DIOD: Self-Distillation Meets Object Discovery

Sandra Kara, Hejer AMMAR, Julien Denize et al.

CVPR 2024poster

#10320

SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model

Zhengang Li, Yan Kang, Yuchen Liu et al.

CVPR 2024posterarXiv:2406.00195

#10321

Deep Generative Model based Rate-Distortion for Image Downscaling Assessment

yuanbang liang, Bhavesh Garg, Paul L. Rosin et al.

CVPR 2024posterarXiv:2403.15139

#10322

VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models

Xiang Li, Qianli Shen, Kenji Kawaguchi

CVPR 2024highlightarXiv:2312.00057

#10323

SNI-SLAM: Semantic Neural Implicit SLAM

Siting Zhu, Guangming Wang, Hermann Blum et al.

CVPR 2024posterarXiv:2311.11016

#10324

TextureDreamer: Image-Guided Texture Synthesis Through Geometry-Aware Diffusion

Yu-Ying Yeh, Jia-Bin Huang, Changil Kim et al.

CVPR 2024posterarXiv:2401.09416

#10325

Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

Bang-Dang Pham, Phong Tran, Anh Tran et al.

CVPR 2024posterarXiv:2403.16205

#10326

In-distribution Public Data Synthesis with Diffusion Models for Differentially Private Image Classification

Jinseong Park, Yujin Choi, Jaewook Lee

CVPR 2024poster

#10327

ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

Dar-Yen Chen, Hamish Tennent, Ching-Wen Hsu

CVPR 2024posterarXiv:2312.02109

#10328

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Bingxin Ke, Anton Obukhov, Shengyu Huang et al.

CVPR 2024posterarXiv:2312.02145

#10329

GS-IR: 3D Gaussian Splatting for Inverse Rendering

Zhihao Liang, Qi Zhang, Ying Feng et al.

CVPR 2024posterarXiv:2311.16473

#10330

SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis

Ziqiao Peng, Wentao Hu, Yue Shi et al.

CVPR 2024posterarXiv:2311.17590

#10331

D3still: Decoupled Differential Distillation for Asymmetric Image Retrieval

Yi Xie, Yihong Lin, Wenjie Cai et al.

CVPR 2024poster

#10332

MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning

Ahmed Agiza, Marina Neseem, Sherief Reda

CVPR 2024highlight

#10333

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

Yuanhui Huang, Wenzhao Zheng, Borui Zhang et al.

CVPR 2024posterarXiv:2311.12754

#10334

Analyzing and Improving the Training Dynamics of Diffusion Models

Tero Karras, Miika Aittala, Jaakko Lehtinen et al.

CVPR 2024posterarXiv:2312.02696

#10335

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors

Biwen Lei, Kai Yu, Mengyang Feng et al.

CVPR 2024posterarXiv:2312.16837

#10336

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Jongha Kim, Jihwan Park, Jinyoung Park et al.

CVPR 2024posterarXiv:2403.17709

#10337

SD2Event: Self-supervised Learning of Dynamic Detectors and Contextual Descriptors for Event Cameras

Yuan Gao, Yuqing Zhu, Xinjun Li et al.

CVPR 2024poster

#10338

DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF

Jie Long Lee, Chen Li, Gim Hee Lee

CVPR 2024posterarXiv:2404.00874

#10339

PaReNeRF: Toward Fast Large-scale Dynamic NeRF with Patch-based Reference

Xiao Tang, Min Yang, Penghui Sun et al.

CVPR 2024poster

#10340

Effective Video Mirror Detection with Inconsistent Motion Cues

Alex Warren, Ke Xu, Jiaying Lin et al.

CVPR 2024poster

#10341

Desigen: A Pipeline for Controllable Design Template Generation

Haohan Weng, Danqing Huang, YU QIAO et al.

CVPR 2024posterarXiv:2403.09093

#10342

Rich Human Feedback for Text-to-Image Generation

Youwei Liang, Junfeng He, Gang Li et al.

CVPR 2024posterarXiv:2312.10240

#10343

Dr. Bokeh: DiffeRentiable Occlusion-aware Bokeh Rendering

Yichen Sheng, Zixun Yu, Lu Ling et al.

CVPR 2024poster

#10344

Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition

Yuchen Zhou, Linkai Liu, Chao Gou

CVPR 2024poster

#10345

Super-Resolution Reconstruction from Bayer-Pattern Spike Streams

Yanchen Dong, Ruiqin Xiong, Jian Zhang et al.

CVPR 2024poster

#10346

Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D

Karran Pandey, Paul Guerrero, Matheus Gadelha et al.

CVPR 2024highlightarXiv:2312.02190

#10347

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

Hang Du, Sicheng Zhang, Binzhu Xie et al.

CVPR 2024posterarXiv:2405.00181

#10348

DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement

Jiuming Liu, Guangming Wang, Weicai Ye et al.

CVPR 2024poster

#10349

Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now

Ayush Sarkar, Hanlin Mai, Amitabh Mahapatra et al.

CVPR 2024posterarXiv:2311.17138

#10350

Aligning Logits Generatively for Principled Black-Box Knowledge Distillation

Jing Ma, Xiang Xiang, Ke Wang et al.

CVPR 2024posterarXiv:2205.10490

#10351

Permutation Equivariance of Transformers and Its Applications

Hengyuan Xu, Liyao Xiang, Hangyu Ye et al.

CVPR 2024posterarXiv:2304.07735

#10352

HomoFormer: Homogenized Transformer for Image Shadow Removal

Jie Xiao, Xueyang Fu, Yurui Zhu et al.

CVPR 2024poster

#10353

HardMo: A Large-Scale Hardcase Dataset for Motion Capture

Jiaqi Liao, Chuanchen Luo, Yinuo Du et al.

CVPR 2024poster

#10354

SLICE: Stabilized LIME for Consistent Explanations for Image Classification

Revoti Prasad Bora, Kiran Raja, Philipp Terhörst et al.

CVPR 2024highlight

#10355

EFHQ: Multi-purpose ExtremePose-Face-HQ dataset

Trung Dao, Duc H Vu, Cuong Pham et al.

CVPR 2024posterarXiv:2312.17205

#10356

Logarithmic Lenses: Exploring Log RGB Data for Image Classification

Bruce Maxwell, Sumegha Singhania, Avnish Patel et al.

CVPR 2024poster

#10357

TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Zirui Wang, Zhizhou Sha, Zheng Ding et al.

CVPR 2024posterarXiv:2312.03626

#10358

Seeing the World through Your Eyes

Hadi Alzayer, Kevin Zhang, Brandon Y. Feng et al.

CVPR 2024posterarXiv:2306.09348

#10359

Learning Vision from Models Rivals Learning Vision from Data

Yonglong Tian, Lijie Fan, Kaifeng Chen et al.

CVPR 2024posterarXiv:2312.17742

#10360

JointSQ: Joint Sparsification-Quantization for Distributed Learning

Weiying Xie, Haowei Li, Ma Jitao et al.

CVPR 2024poster

#10361

Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

Yiqun Mei, Yu Zeng, He Zhang et al.

CVPR 2024posterarXiv:2403.09632

#10362

Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

Axel Barroso-Laguna, Sowmya Munukutla, Victor Adrian Prisacariu et al.

CVPR 2024posterarXiv:2404.06337

#10363

MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

Hengyi Wang, Jingwen Wang, Lourdes Agapito

CVPR 2024posterarXiv:2312.00778

#10364

Capturing Closely Interacted Two-Person Motions with Reaction Priors

Qi Fang, Yinghui Fan, Yanjun Li et al.

CVPR 2024poster

#10365

DiVa-360: The Dynamic Visual Dataset for Immersive Neural Fields

Cheng-You Lu, Peisen Zhou, Angela Xing et al.

CVPR 2024highlightarXiv:2307.16897

#10366

Learning Visual Prompt for Gait Recognition

Kang Ma, Ying Fu, Chunshui Cao et al.

CVPR 2024poster

#10367

PolarRec: Improving Radio Interferometric Data Reconstruction Using Polar Coordinates

Ruoqi Wang, Zhuoyang Chen, Jiayi Zhu et al.

CVPR 2024poster

#10368

StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN

Jongwoo Choi, Kwanggyoon Seo, Amirsaman Ashtari et al.

CVPR 2024posterarXiv:2403.14186

#10369

Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation

Ming Xu, Stephen Gould

CVPR 2024posterarXiv:2404.01518

#10370

Learning for Transductive Threshold Calibration in Open-World Recognition

Qin ZHANG, DONGSHENG An, Tianjun Xiao et al.

CVPR 2024posterarXiv:2305.12039

#10371

SonicVisionLM: Playing Sound with Vision Language Models

Zhifeng Xie, Shengye Yu, Qile He et al.

CVPR 2024posterarXiv:2401.04394

#10372

Real-Time Exposure Correction via Collaborative Transformations and Adaptive Sampling

Ziwen Li, Feng Zhang, Meng Cao et al.

CVPR 2024poster

#10373

NeLF-Pro: Neural Light Field Probes for Multi-Scale Novel View Synthesis

Zinuo You, Andreas Geiger, Anpei Chen

CVPR 2024posterarXiv:2312.13328

#10374

OpenEQA: Embodied Question Answering in the Era of Foundation Models

Arjun Majumdar, Anurag Ajay, Xiaohan Zhang et al.

CVPR 2024poster

#10375

Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction

Guillaume Jaume, Anurag Vaidya, Richard J. Chen et al.

CVPR 2024posterarXiv:2304.06819

#10376

Practical Measurements of Translucent Materials with Inter-Pixel Translucency Prior

Zhenyu Chen, Jie Guo, Shuichang Lai et al.

CVPR 2024poster

#10377

View-Category Interactive Sharing Transformer for Incomplete Multi-View Multi-Label Learning

Shilong Ou, Zhe Xue, Yawen Li et al.

CVPR 2024highlight

#10378

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

Yabin Zhang, Wenjie Zhu, Hui Tang et al.

CVPR 2024posterarXiv:2403.17589

#10379

FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures

Lisa Mais, Peter Hirsch, Claire Managan et al.

CVPR 2024posterarXiv:2404.00130

#10380

RankMatch: Exploring the Better Consistency Regularization for Semi-supervised Semantic Segmentation

Huayu Mai, Rui Sun, Tianzhu Zhang et al.

CVPR 2024poster

#10381

CoDe: An Explicit Content Decoupling Framework for Image Restoration

Enxuan Gu, Hongwei Ge, Yong Guo

CVPR 2024poster

#10382

Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement

Jinyoung Jun, Jae-Han Lee, Chang-Su Kim

CVPR 2024posterarXiv:2404.19294

#10383

D^4: Dataset Distillation via Disentangled Diffusion Model

Duo Su, Junjie Hou, Weizhi Gao et al.

CVPR 2024poster

#10384

An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains

George Eskandar

CVPR 2024posterarXiv:2402.17562

#10385

ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification

Jiangbo Shi, Chen Li, Tieliang Gong et al.

CVPR 2024posterarXiv:2502.08391

#10386

CaDeT: a Causal Disentanglement Approach for Robust Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Junrui Zhang, Amir Rasouli

CVPR 2024poster

#10387

Boosting Neural Representations for Videos with a Conditional Decoder

XINJIE ZHANG, Ren Yang, Dailan He et al.

CVPR 2024highlightarXiv:2402.18152

#10388

Text-Guided 3D Face Synthesis - From Generation to Editing

Yunjie Wu, Yapeng Meng, Zhipeng Hu et al.

CVPR 2024posterarXiv:2312.00375

#10389

IReNe: Instant Recoloring of Neural Radiance Fields

Alessio Mazzucchelli, Adrian Garcia-Garcia, Elena Garces et al.

CVPR 2024posterarXiv:2405.19876

#10390

Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation

Feng Liu, Minchul Kim, Zhiyuan Ren et al.

CVPR 2024poster

#10391

CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification

Haoran Lai, Qingsong Yao, Zihang Jiang et al.

CVPR 2024posterarXiv:2402.17417

#10392

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant

Chenlu Zhan, Gaoang Wang, Yu LIN et al.

CVPR 2024posterarXiv:2403.04290

#10393

Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion

Sofia Casarin, Cynthia Ugwu, Sergio Escalera et al.

CVPR 2024posterarXiv:2403.15194

#10394

Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing

Hyelin Nam, Gihyun Kwon, Geon Yeong Park et al.

CVPR 2024posterarXiv:2311.18608

#10395

DiffLoc: Diffusion Model for Outdoor LiDAR Localization

Wen Li, Yuyang Yang, Shangshu Yu et al.

CVPR 2024poster

#10396

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data

Yu Deng, Duomin Wang, Xiaohang Ren et al.

CVPR 2024posterarXiv:2311.18729

#10397

Soften to Defend: Towards Adversarial Robustness via Self-Guided Label Refinement

Daiwei Yu, Zhuorong Li, Lina Wei et al.

CVPR 2024posterarXiv:2403.09101

#10398

Wired Perspectives: Multi-View Wire Art Embraces Generative AI

Zhiyu Qu, LAN YANG, Honggang Zhang et al.

CVPR 2024posterarXiv:2311.15421

#10399

Small Scale Data-Free Knowledge Distillation

He Liu, Yikai Wang, Huaping Liu et al.

CVPR 2024posterarXiv:2406.07876

#10400

Transfer CLIP for Generalizable Image Denoising

Jun Cheng, Dong Liang, Shan Tan

CVPR 2024posterarXiv:2403.15132


REGLO: Provable Neural Network Repair for Global Robustness Properties

CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification

Approximation Scheme for Weighted Metric Clustering via Sherali-Adams

Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs

Contextual Pandora’s Box

Robust Distributed Gradient Aggregation Using Projections onto Gradient Manifolds

Generative Model Perception Rectification Algorithm for Trade-Off between Diversity and Quality

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

A Closer Look at Curriculum Adversarial Training: From an Online Perspective

Provably Convergent Federated Trilevel Learning

Dynamic Knowledge Injection for AIXI Agents

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection

A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Transportable Representations for Domain Generalization

Exponential Hardness of Optimization from the Locality in Quantum Neural Networks

MFOS: Model-Free &#x26; One-Shot Object Pose Estimation

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Towards Real-World Test-Time Adaptation: Tri-net Self-Training with Balanced Normalization

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Unsupervised Template-assisted Point Cloud Shape Correspondence Network

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Efficient Model Stealing Defense with Noise Transition Matrix

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition

CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning

PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

Improved Self-Training for Test-Time Adaptation

Mudslide: A Universal Nuclear Instance Segmentation Method

Rewrite the Stars

Virtual Immunohistochemistry Staining for Histological Images Assisted by Weakly-supervised Learning

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

Model Adaptation for Time Constrained Embodied Control

SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

Residual Denoising Diffusion Models

Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle

Generating Content for HDR Deghosting from Frequency View

MFOS: Model-Free & One-Shot Object Pose Estimation