Most Cited 2024 "uncertainty-aware exploration" Papers

12,324 papers found • Page 12 of 62

#2201

Sequential Fusion Based Multi-Granularity Consistency for Space-Time Transformer Tracking

Kun Hu, Wenjing Yang, Wanrong Huang et al.

AAAI 2024 • paper
14 citations
#2202

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024 • poster • arXiv:2312.13663
14 citations
#2203

Hyperbolic Learning with Synthetic Captions for Open-World Detection

Fanjie Kong, Yanbei Chen, Jiarui Cai et al.

CVPR 2024 • poster • arXiv:2404.05016
14 citations
#2204

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Yunhao Ge, Yihe Tang, Jiashu Xu et al.

CVPR 2024 • highlight • arXiv:2405.09546
14 citations
#2205

On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving

Kaituo Feng, Changsheng Li, Dongchun Ren et al.

CVPR 2024 • poster • arXiv:2403.01238
14 citations
#2206

CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images

Olga Fourkioti, Matt De Vries, Chris Bakal

ICLR 2024 • spotlight • arXiv:2305.05314
14 citations
#2207

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024 • poster
14 citations
#2208

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024 • poster • arXiv:2407.01872
14 citations
#2209

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024 • poster • arXiv:2311.11325
14 citations
#2210

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu Liu

ECCV 2024 • poster • arXiv:2311.11533
14 citations
#2211

UniHuman: A Unified Model For Editing Human Images in the Wild

Nannan Li, Qing Liu, Krishna Kumar Singh et al.

CVPR 2024 • poster • arXiv:2312.14985
14 citations
#2212

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024 • poster • arXiv:2404.05729
14 citations
#2213

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

Alex Trevithick, Matthew Chan, Towaki Takikawa et al.

CVPR 2024 • poster • arXiv:2401.02411
14 citations
#2214

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

Jianfei Yang, Hanjie Qian, Yuecong Xu et al.

ICLR 2024 • poster • arXiv:2305.18712
14 citations
#2215

Towards Fair Graph Federated Learning via Incentive Mechanisms

Chenglu Pan, Jiarong Xu, Yue Yu et al.

AAAI 2024 • paper • arXiv:2312.13306
14 citations
#2216

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024 • poster
14 citations
#2217

A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation

Ayan Sengupta, Shantanu Dixit, Md Shad Akhtar et al.

ICLR 2024 • poster
14 citations
#2218

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024 • poster • arXiv:2403.09419
14 citations
#2219

AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack

Ruikui Wang, Yuanfang Guo, Yunhong Wang

AAAI 2024 • paper
14 citations
#2220

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

AAAI 2024 • paper
14 citations
#2221

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024 • poster
14 citations
#2222

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024 • poster • arXiv:2403.13556
14 citations
#2223

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024 • poster • arXiv:2409.06290
14 citations
#2224

SyFormer: Structure-Guided Synergism Transformer for Large-Portion Image Inpainting

Jie Wu, Yuchao Feng, Honghui Xu et al.

AAAI 2024 • paper
14 citations
#2225

AMD: Automatic Multi-step Distillation of Large-scale Vision Models

Cheng Han, Qifan Wang, Sohail A Dianat et al.

ECCV 2024 • poster • arXiv:2407.04208
14 citations
#2226

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024 • spotlight • arXiv:2403.09953
14 citations
#2227

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification

YuTeng Ye, Hang Zhou, Jiale Cai et al.

AAAI 2024 • paper • arXiv:2211.14742
14 citations
#2228

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024 • poster • arXiv:2409.06703
14 citations
#2229

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek et al.

CVPR 2024 • poster • arXiv:2402.08657
14 citations
#2230

Generative 3D Part Assembly via Part-Whole-Hierarchy Message Passing

Bi'an Du, Xiang Gao, Wei Hu et al.

CVPR 2024 • poster • arXiv:2402.17464
14 citations
#2231

DexFuncGrasp: A Robotic Dexterous Functional Grasp Dataset Constructed from a Cost-Effective Real-Simulation Annotation System

Jinglue Hang, Xiangbo Lin, Tianqiang Zhu et al.

AAAI 2024 • paper
14 citations
#2232

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024 • poster • arXiv:2212.09877
14 citations
#2233

HONGAT: Graph Attention Networks in the Presence of High-Order Neighbors

Heng-Kai Zhang, Yi-Ge Zhang, Zhi Zhou et al.

AAAI 2024 • paper
14 citations
#2234

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Xiaopei Wu, Yuenan Hou, Xiaoshui Huang et al.

CVPR 2024 • poster • arXiv:2407.09751
14 citations
#2235

Exploiting Auxiliary Caption for Video Grounding

Hongxiang Li, Meng Cao, Xuxin Cheng et al.

AAAI 2024 • paper • arXiv:2301.05997
14 citations
#2236

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024 • poster • arXiv:2409.03944
14 citations
#2237

Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization

Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan et al.

ICLR 2024 • poster • arXiv:2401.12205
14 citations
#2238

A Restoration Network as an Implicit Prior

Yuyang Hu, Mauricio Delbracio, Peyman Milanfar et al.

ICLR 2024 • poster • arXiv:2310.01391
14 citations
#2239

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Junwen Xiong, Peng Zhang, Tao You et al.

CVPR 2024 • poster • arXiv:2403.01226
14 citations
#2240

UniCode: Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024 • poster • arXiv:2403.09072
14 citations
#2241

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024 • poster • arXiv:2407.10831
14 citations
#2242

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

ICLR 2024 • spotlight • arXiv:2310.18737
14 citations
#2243

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

ICLR 2024 • poster • arXiv:2210.01603
14 citations
#2244

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024 • poster • arXiv:2409.11718
14 citations
#2245

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Rui Zhao, Bin Shi, Jianfei Ruan et al.

CVPR 2024 • poster • arXiv:2405.05714
14 citations
#2246

Learning to Learn Better Visual Prompts

Fengxiang Wang, Wanrong Huang, Shaowu Yang et al.

AAAI 2024 • paper
14 citations
#2247

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang et al.

AAAI 2024 • paper • arXiv:2312.15909
14 citations
#2248

Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation

Zhanfeng Liao, Yan Liu, Qian Zheng et al.

AAAI 2024 • paper • arXiv:2311.09077
14 citations
#2249

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024 • poster • arXiv:2407.15626
14 citations
#2250

Adversarial Backdoor Attack by Naturalistic Data Poisoning on Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Mohammad Sabokrou, Amir Rasouli

CVPR 2024 • poster • arXiv:2306.15755
14 citations
#2251

UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization

Shuaibo Li, Wei Ma, Jianwei Guo et al.

CVPR 2024 • poster
14 citations
#2252

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024 • poster • arXiv:2312.02362
14 citations
#2253

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024 • poster • arXiv:2407.15328
14 citations
#2254

Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks

Tong Wang, Yuan Yao, Feng Xu et al.

AAAI 2024 • paper
14 citations
#2255

Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks

Yankai Chen, Yixiang Fang, Qiongyan Wang et al.

AAAI 2024 • paper • arXiv:2402.12411
14 citations
#2256

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, Zaiwang Gu et al.

CVPR 2024 • poster • arXiv:2402.19270
14 citations
#2257

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

Hao Xu, Li Haipeng, Yinqiao Wang et al.

CVPR 2024 • poster • arXiv:2403.18575
14 citations
#2258

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

CVPR 2024 • poster
14 citations
#2259

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024 • poster • arXiv:2407.15617
14 citations
#2260

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024 • poster
14 citations
#2261

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

Zhenghao Pan, Haijin Zeng, Jiezhang Cao et al.

CVPR 2024 • poster • arXiv:2311.11417
14 citations
#2262

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024 • poster
14 citations
#2263

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024 • poster
14 citations
#2264

Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling

Hong Wang, Zhongkai Hao, Jie Wang et al.

ICLR 2024 • spotlight • arXiv:2401.09516
14 citations
#2265

FoSp: Focus and Separation Network for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

AAAI 2024 • paper • arXiv:2306.04474
14 citations
#2266

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024 • poster • arXiv:2407.02778
14 citations
#2267

Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks

Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.

AAAI 2024 • paper • arXiv:2402.15959
14 citations
#2268

Regroup Median Loss for Combating Label Noise

Fengpeng Li, Kemou Li, Jinyu Tian et al.

AAAI 2024 • paper • arXiv:2312.06273
14 citations
#2269

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

Qijie Mo, Yipeng Gao, Shenghao Fu et al.

ECCV 2024 • poster • arXiv:2407.11499
14 citations
#2270

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024 • poster • arXiv:2404.14715
14 citations
#2271

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion

Zhenjiang Du, Jiale Dou, Zhitao Liu et al.

AAAI 2024 • paper
14 citations
#2272

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024 • poster • arXiv:2408.00297
14 citations
#2273

Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

Zhiyuan Yu, Zheng Qin, Lintao Zheng et al.

CVPR 2024 • poster • arXiv:2404.04557
14 citations
#2274

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024 • poster • arXiv:2402.18695
13 citations
#2275

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024 • poster • arXiv:2407.16696
13 citations
#2276

Full Bayesian Significance Testing via Neural Networks

Zehua Liu, Zimeng Li, Jingyuan Wang et al.

AAAI 2024 • paper
13 citations
#2277

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024 • poster • arXiv:2409.09605
13 citations
#2278

Parameterized Approximation Algorithms for Sum of Radii Clustering and Variants

Xianrun Chen, Dachuan Xu, Yicheng Xu et al.

AAAI 2024 • paper
13 citations
#2279

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Yufan Chen, Jiaming Zhang, Kunyu Peng et al.

CVPR 2024 • poster • arXiv:2403.14442
13 citations
#2280

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Xing Cui, Zekun Li, Peipei Li et al.

ECCV 2024 • poster • arXiv:2311.15040
13 citations
#2281

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024 • poster • arXiv:2408.03284
13 citations
#2282

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

ICLR 2024 • poster • arXiv:2401.09323
13 citations
#2283

Federated Causality Learning with Explainable Adaptive Optimization

Dezhi Yang, Xintong He, Jun Wang et al.

AAAI 2024 • paper • arXiv:2312.05540
13 citations
#2284

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

Qi Jia, Yaqi Cai, Qi Jia et al.

CVPR 2024 • highlight • arXiv:2405.06283
13 citations
#2285

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024 • poster • arXiv:2403.10942
13 citations
#2286

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024 • poster
13 citations
#2287

MagicEraser: Erasing Any Objects via Semantics-Aware Control

Fan Li, Zixiao Zhang, Yi Huang et al.

ECCV 2024 • poster • arXiv:2410.10207
13 citations
#2288

Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation

Xiuding Cai, Yaoyao Zhu, Dong Miao et al.

AAAI 2024 • paper • arXiv:2211.10867
13 citations
#2289

FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval

Yanzhe Chen, Huasong Zhong, Xiangteng He et al.

AAAI 2024 • paper
13 citations
#2290

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024 • poster • arXiv:2403.07203
13 citations
#2291

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation

Xinliang Zhang, Lei Zhu, Hangzhou He et al.

AAAI 2024 • paper • arXiv:2402.17555
13 citations
#2292

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024 • poster • arXiv:2403.09805
13 citations
#2293

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024 • poster • arXiv:2311.16914
13 citations
#2294

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

Yan Gao, Haojun Xu, Jie Li et al.

AAAI 2024 • paper • arXiv:2312.08951
13 citations
#2295

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024 • poster • arXiv:2401.06397
13 citations
#2296

Learning Temporal Resolution in Spectrogram for Audio Classification

Haohe Liu, Xubo Liu, Qiuqiang Kong et al.

AAAI 2024 • paper • arXiv:2210.01719
13 citations
#2297

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

CVPR 2024 • poster • arXiv:2403.09344
13 citations
#2298

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024 • poster • arXiv:2408.03753
13 citations
#2299

MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes

Bor Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu

CVPR 2024 • poster • arXiv:2404.08968
13 citations
#2300

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

AAAI 2024 • paper • arXiv:2306.03364
13 citations
#2301

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024 • poster • arXiv:2401.00912
13 citations
#2302

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024 • poster • arXiv:2408.05205
13 citations
#2303

Differentiable Euler Characteristic Transforms for Shape Classification

Ernst Roell, Bastian Rieck

ICLR 2024 • poster • arXiv:2310.07630
13 citations
#2304

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

Qiangqiang Wu, Yan Xia, Jia Wan et al.

ECCV 2024 • poster
13 citations
#2305

Chronic Poisoning: Backdoor Attack against Split Learning

Fangchao Yu, Bo Zeng, Kai Zhao et al.

AAAI 2024 • paper
13 citations
#2306

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024 • poster • arXiv:2407.09083
13 citations
#2307

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024 • poster • arXiv:2403.05018
13 citations
#2308

Boosting Adversarial Training via Fisher-Rao Norm-based Regularization

Xiangyu Yin, Wenjie Ruan

CVPR 2024 • poster • arXiv:2403.17520
13 citations
#2309

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024 • poster • arXiv:2407.10528
13 citations
#2310

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Hao Tan, Jun Li, Yizhuang Zhou et al.

AAAI 2024 • paper • arXiv:2312.06401
13 citations
#2311

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

Nisarg Shah, Vibashan VS, Vishal M. Patel

CVPR 2024 • poster
13 citations
#2312

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024 • poster • arXiv:2407.05256
13 citations
#2313

Single-View Scene Point Cloud Human Grasp Generation

Yan-Kang Wang, Chengyi Xing, Yi-Lin Wei et al.

CVPR 2024 • poster • arXiv:2404.15815
13 citations
#2314

Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction

Zhixuan Chu, Mengxuan Hu, Qing Cui et al.

AAAI 2024 • paper • arXiv:2312.16113
13 citations
#2315

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, Peishan Cong et al.

ECCV 2024 • poster • arXiv:2304.05645
13 citations
#2316

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024 • spotlight • arXiv:2312.01127
13 citations
#2317

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024 • poster • arXiv:2407.15773
13 citations
#2318

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024 • poster
13 citations
#2319

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

Yanqi Ge, Qiang Nie, Ye Huang et al.

AAAI 2024 • paper • arXiv:2312.11872
13 citations
#2320

Effective Video Mirror Detection with Inconsistent Motion Cues

Alex Warren, Ke Xu, Jiaying Lin et al.

CVPR 2024 • poster
13 citations
#2321

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024 • poster • arXiv:2406.16072
13 citations
#2322

Trust Region Methods for Nonconvex Stochastic Optimization beyond Lipschitz Smoothness

Chenghan Xie, Chenxi Li, Chuwen Zhang et al.

AAAI 2024 • paper • arXiv:2310.17319
13 citations
#2323

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024 • poster • arXiv:2407.08256
13 citations
#2324

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024 • poster • arXiv:2311.16447
13 citations
#2325

Move Anything with Layered Scene Diffusion

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.

CVPR 2024 • poster • arXiv:2404.07178
13 citations
#2326

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024 • poster • arXiv:2403.14183
13 citations
#2327

SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation

Jiaben Chen, Huaizu Jiang

CVPR 2024 • poster • arXiv:2308.16876
13 citations
#2328

PH-Net: Semi-Supervised Breast Lesion Segmentation via Patch-wise Hardness

Siyao Jiang, Huisi Wu, Junyang Chen et al.

CVPR 2024 • poster
13 citations
#2329

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024 • poster • arXiv:2404.14565
13 citations
#2330

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024 • poster • arXiv:2404.09857
13 citations
#2331

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024 • poster • arXiv:2310.01174
13 citations
#2332

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024 • poster • arXiv:2212.02997
13 citations
#2333

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024 • poster • arXiv:2407.04458
13 citations
#2334

Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang et al.

ICLR 2024 • poster • arXiv:2402.11984
13 citations
#2335

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024 • poster • arXiv:2407.02286
13 citations
#2336

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024 • poster • arXiv:2311.12047
13 citations
#2337

GDA: Generalized Diffusion for Robust Test-time Adaptation

Yun-Yun Tsai, Fu-Chen Chen, Albert Chen et al.

CVPR 2024 • poster • arXiv:2404.00095
13 citations
#2338

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024 • poster • arXiv:2407.11555
13 citations
#2339

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024 • poster • arXiv:2403.06168
13 citations
#2340

F3Loc: Fusion and Filtering for Floorplan Localization

Changan Chen, Rui Wang, Christoph Vogel et al.

CVPR 2024 • highlight
13 citations
#2341

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024 • poster • arXiv:2401.02402
13 citations
#2342

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024 • poster • arXiv:2407.18550
13 citations
#2343

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024 • poster • arXiv:2311.18815
13 citations
#2344

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

Jinyi Liu, Zhi Wang, Yan Zheng et al.

AAAI 2024 • paper • arXiv:2312.12145
13 citations
#2345

Partial-to-Partial Shape Matching with Geometric Consistency

Viktoria Ehm, Maolin Gao, Paul Roetzer et al.

CVPR 2024 • poster • arXiv:2404.12209
13 citations
#2346

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

CVPR 2024 • poster • arXiv:2406.05271
13 citations
#2347

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024 • poster • arXiv:2406.04551
13 citations
#2348

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024 • poster • arXiv:2407.11213
13 citations
#2349

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024 • poster • arXiv:2404.09977
13 citations
#2350

SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field

Ru Li, Jia Liu, Guanghui Liu et al.

AAAI 2024 • paper • arXiv:2312.08692
13 citations
#2351

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024 • poster • arXiv:2407.11717
13 citations
#2352

Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity

Yiyue Chen, Haris Vikalo, Chianing Wang

AAAI 2024 • paper • arXiv:2312.13380
13 citations
#2353

Real-World Mobile Image Denoising Dataset with Efficient Baselines

Roman Flepp, Andrey Ignatov, Radu Timofte et al.

CVPR 2024 • poster
13 citations
#2354

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024 • poster • arXiv:2310.12964
13 citations
#2355

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Zikai Xiao, Zihan Chen, Liyinglan Liu et al.

ICLR 2024 • poster • arXiv:2401.08977
13 citations
#2356

Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning

Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang

AAAI 2024 • paper
13 citations
#2357

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024 • poster • arXiv:2404.16029
13 citations
#2358

CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception

Jiachen Sun, Haizhong Zheng, Qingzhao Zhang et al.

ICLR 2024 • poster • arXiv:2306.00349
13 citations
#2359

ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Yin Zhang, Yongqiang Zhang, Zian Zhang et al.

AAAI 2024 • paper
13 citations
#2360

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D

Mukund Varma T, Peihao Wang, Zhiwen Fan et al.

CVPR 2024 • poster • arXiv:2403.18922
13 citations
#2361

3D Neural Edge Reconstruction

Lei Li, Songyou Peng, Zehao Yu et al.

CVPR 2024 • poster • arXiv:2405.19295
13 citations
#2362

AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing

Zhiyuan Ma, Guoli Jia, Bowen Zhou

AAAI 2024 • paper • arXiv:2312.08019
13 citations
#2363

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu Li, Kaiyue Shi, Guo-Sen Xie et al.

AAAI 2024 • paper
13 citations
#2364

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Zhen Yang, Zhou Shao, Yuxiao Dong et al.

AAAI 2024 • paper • arXiv:2402.11855
13 citations
#2365

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024 • poster • arXiv:2407.04245
13 citations
#2366

Cell Graph Transformer for Nuclei Classification

Wei Lou, Guanbin Li, Xiang Wan et al.

AAAI 2024 • paper • arXiv:2402.12946
13 citations
#2367

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

Yingji Zhong, Lanqing Hong, Zhenguo Li et al.

CVPR 2024 • poster • arXiv:2403.16885
13 citations
#2368

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024 • poster • arXiv:2410.07579
13 citations
#2369

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024 • poster • arXiv:2407.14754
13 citations
#2370

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

Fulin Luo, Xi Chen, Xiuwen Gong et al.

AAAI 2024 • paper
13 citations
#2371

An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains

George Eskandar

CVPR 2024 • poster • arXiv:2402.17562
13 citations
#2372

3D Multi-frame Fusion for Video Stabilization

Zhan Peng, Xinyi Ye, Weiyue Zhao et al.

CVPR 2024 • poster • arXiv:2404.12887
13 citations
#2373

Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability Composability and Decomposability from Anatomy via Self Supervision

Mohammad Reza Hosseinzadeh Taher, Michael Gotway, Jianming Liang

CVPR 2024 • poster
13 citations
#2374

Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning

Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.

AAAI 2024 • paper • arXiv:2312.12722
13 citations
#2375

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian Shao, Guangqian Guo et al.

ECCV 2024 • poster • arXiv:2408.10777
13 citations
#2376

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Yunchen Li, Zhou Yu, Gaoqi He et al.

AAAI 2024 • paper • arXiv:2312.08200
13 citations
#2377

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024 • poster • arXiv:2310.05861
13 citations
#2378

Identifiability of Direct Effects from Summary Causal Graphs

Simon Ferreira, Charles Assaad

AAAI 2024 • paper • arXiv:2306.16958
13 citations
#2379

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, Qirui Chen, Tengda Han et al.

ECCV 2024 • poster • arXiv:2312.14055
12 citations
#2380

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2024 • poster • arXiv:2401.09786
12 citations
#2381

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

Yutong Xie, Qi Chen, Sinuo Wang et al.

CVPR 2024 • poster • arXiv:2404.04960
12 citations
#2382

Unsupervised Gaze Representation Learning from Multi-view Face Images

Yiwei Bao, Feng Lu

CVPR 2024 • poster
12 citations
#2383

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, Hongdong Li et al.

ECCV 2024 • poster • arXiv:2406.00474
12 citations
#2384

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024 • paper • arXiv:2401.07426
12 citations
#2385

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024 • poster • arXiv:2403.04492
12 citations
#2386

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024 • poster
12 citations
#2387

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024 • poster • arXiv:2409.08572
12 citations
#2388

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024 • oral • arXiv:2305.10424
12 citations
#2389

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024 • poster
12 citations
#2390

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024 • poster • arXiv:2407.04947
12 citations
#2391

CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning

Qingsong Yan, Qiang Wang, Kaiyong Zhao et al.

AAAI 2024 • paper • arXiv:2312.08760
12 citations
#2392

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024 • poster • arXiv:2312.06729
12 citations
#2393

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024 • paper • arXiv:2312.06117
12 citations
#2394

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024 • poster • arXiv:2312.06661
12 citations
#2395

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024 • poster • arXiv:2407.17331
12 citations
#2396

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Yixuan Weng, Minjun Zhu, Fei Xia et al.

ICLR 2024 • poster • arXiv:2304.01665
12 citations
#2397

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024 • poster
12 citations
#2398

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024 • poster • arXiv:2408.02157
12 citations
#2399

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024 • poster • arXiv:2407.11950
12 citations
#2400

S²MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering

Zhen Long, Qiyuan Wang, Yazhou Ren et al.

CVPR 2024 • poster
12 citations