Most Cited 2024 &quot;time-dependent attention&quot; Papers

#3202

Inverse Weight-Balancing for Deep Long-Tailed Learning

Wenqi Dang, Zhou Yang, Weisheng Dong et al.

ECCV 2024posterarXiv:2407.17058

#3203

DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting

Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.

CVPR 2024highlightarXiv:2401.01823

#3204

Detours for Navigating Instructional Videos

Kumar Ashutosh, Zihui Xue, Tushar Nagarajan et al.

ECCV 2024posterarXiv:2404.03836

#3205

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

Amrin Kareem, Jean Lahoud, Hisham Cholakkal

AAAI 2024paperarXiv:2312.13118

#3206

LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate

Tao Wu, Tie Luo, D. C. Wunsch

CVPR 2024posterarXiv:2404.07985

#3207

WaveMo: Learning Wavefront Modulations to See Through Scattering

Mingyang Xie, Haiyun Guo, Brandon Y. Feng et al.

ECCV 2024posterarXiv:2407.16260

#3208

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.

CVPR 2024posterarXiv:2311.17352

#3209

Efficient Stitchable Task Adaptation

Haoyu He, Zizheng Pan, Jing Liu et al.

CVPR 2024posterarXiv:2405.11481

#3210

Physics-Aware Hand-Object Interaction Denoising

Haowen Luo, Yunze Liu, Li Yi

AAAI 2024paperarXiv:2301.13821

#3211

Complete Neural Networks for Complete Euclidean Graphs

Snir Hordan, Tal Amir, Nadav Dym et al.

#3212

Improving Zero-Shot Generalization for CLIP with Variational Adapter

Ziqian Lu, Fengli Shen, Mushui Liu et al.

CVPR 2024posterarXiv:2403.04198

#3213

CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images

Guanlin Shen, Jingwei Huang, Zhihua Hu et al.

CVPR 2024posterarXiv:2411.15673

#3214

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Alvi Md Ishmam, Chris Thomas

ECCV 2024posterarXiv:2409.08077

#3215

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

Junsung Lee, Minsoo Kang, Bohyung Han

ECCV 2024posterarXiv:2407.16448

#3216

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

#3217

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

AAAI 2024paperarXiv:2312.10648

#3218

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.

AAAI 2024paperarXiv:2401.13193

#3219

Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN

Minsoo Kang, Minkoo Kang, Suhyun Kim

AAAI 2024paperarXiv:2312.09219

#3220

NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning

Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.

CVPR 2024posterarXiv:2403.02561

#3221

Semantic Human Mesh Reconstruction with Textures

xiaoyu zhan, Jianxin Yang, Yuanqi Li et al.

AAAI 2024paperarXiv:2403.06235

#3222

Probabilistic Neural Circuits

Pedro Zuidberg Dos Martires

#3223

DAG-Aware Variational Autoencoder for Social Propagation Graph Generation

Dongpeng Hou, Chao Gao, Xuelong Li et al.

AAAI 2024paperarXiv:2312.11934

#3224

Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants

Wei Chen, Zhiyi Huang, Ruichu Cai et al.

ECCV 2024posterarXiv:2311.12090

#3225

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

AAAI 2024paperarXiv:2312.05974

#3226

Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise

Augusto Santos, Diogo Rente, Rui Seabra et al.

AAAI 2024paperarXiv:2308.07272

#3227

Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning

Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.

ECCV 2024posterarXiv:2410.20451

#3228

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Yijin Li, Yichen Shen, Zhaoyang Huang et al.

#3229

Implicit Motion Function

Yue Gao, Jiahao Li, Lei Chu et al.

ECCV 2024posterarXiv:2310.08442

#3230

Unmasking Bias in Diffusion Model Training

Hu Yu, Li Shen, Jie Huang et al.

AAAI 2024paperarXiv:2312.07991

#3231

Accelerating the Global Aggregation of Local Explanations

Alon Mor, Yonatan Belinkov, Benny Kimelfeld

ECCV 2024posterarXiv:2403.14121

#3232

External Knowledge Enhanced 3D Scene Generation from Sketch

Zijie Wu, Mingtao Feng, Yaonan Wang et al.

AAAI 2024paperarXiv:2401.02606

#3233

Exploiting Polarized Material Cues for Robust Car Detection

Wen Dong, Haiyang Mei, Ziqi Wei et al.

CVPR 2024highlightarXiv:2403.04303

#3234

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

Jialin Li, Qiang Nie, Weifu Fu et al.

#3235

On the Robustness of Neural-Enhanced Video Streaming against Adversarial Attacks

Qihua Zhou, Jingcai Guo, Song Guo et al.

AAAI 2024paperarXiv:2105.10334

#3236

Fact-Driven Logical Reasoning for Machine Reading Comprehension

Siru Ouyang, Zhuosheng Zhang, Hai Zhao

CVPR 2024posterarXiv:2312.05889

#3237

SuperPrimitive: Scene Reconstruction at a Primitive Level

Kirill Mazur, Gwangbin Bae, Andrew J. Davison

#3238

Dependency Structure-Enhanced Graph Attention Networks for Event Detection

Qizhi Wan, Changxuan wan, Keli Xiao et al.

#3239

Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders

Lucas Stoffl, Andy Bonnetto, Stéphane D'Ascoli et al.

AAAI 2024paperarXiv:2402.12406

#3240

Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation

Hyunjune Shin, Dong-Wan Choi

#3241

UNR-Explainer: Counterfactual Explanations for Unsupervised Node Representation Learning Models

Hyunju Kang, Geonhee Han, Hogun Park

ICLR 2024poster

#3242

Color Event Enhanced Single-Exposure HDR Imaging

Mengyao Cui, Zhigang Wang, Dong Wang et al.

#3243

Weakly Supervised Few-Shot Object Detection with DETR

Chenbo Zhang, Yinglu Zhang, Lu Zhang et al.

#3244

Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting

Yu Liu, Fatimah binti Khalid, Lei Wang et al.

CVPR 2024posterarXiv:2404.02889

#3245

Steganographic Passport: An Owner and User Verifiable Credential for Deep Model IP Protection Without Retraining

Qi Cui, Ruohan Meng, Chaohui Xu et al.

CVPR 2024posterarXiv:2312.04552

#3246

Generating Illustrated Instructions

Sachit Menon, Ishan Misra, Rohit Girdhar

#3247

Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration

Huangxing Lin, Yuhang Dong, Xinghao Ding et al.

CVPR 2024posterarXiv:2404.00301

#3248

Monocular Identity-Conditioned Facial Reflectance Reconstruction

Xingyu Ren, Jiankang Deng, Yuhao Cheng et al.

ICLR 2024posterarXiv:2503.16799

#3249

Causally Aligned Curriculum Learning

Mingxuan Li, Junzhe Zhang, Elias Bareinboim

#3250

Your Career Path Matters in Person-Job Fit

Zhuocheng Gong, Yang Song, Tao Zhang et al.

#3251

Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning

Wenke Huang, Mang Ye, zekun shi et al.

#3252

Towards the Disappearing Truth: Fine-Grained Joint Causal Influences Learning with Hidden Variable-Driven Causal Hypergraphs

Kun Zhu, Chunhui Zhao

ICLR 2024posterarXiv:2303.03284

#3253

The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models

Raphael Avalos, Florent Delgrange, Ann Nowe et al.

#3254

HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs

Ziwei Yao, Ruiping Wang, Xilin CHEN

AAAI 2024paperarXiv:2402.12846

#3255

ConVQG: Contrastive Visual Question Generation with Multimodal Guidance

Li Mi, Syrielle Montariol, Javiera Castillo Navarro et al.

ICLR 2024oralarXiv:2306.16922

#3256

The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks.

Aaron Spieler, Nasim Rahaman, Georg Martius et al.

ECCV 2024posterarXiv:2407.16133

#3257

Open-Set Biometrics: Beyond Good Closed-Set Models

Yiyang Su, Minchul Kim, Feng Liu et al.

AAAI 2024paperarXiv:2312.12489

#3258

H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer

Yanru Wu, Jianning Wang, Weida Wang et al.

#3259

MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music

Shuai Yu

AAAI 2024paperarXiv:2303.16521

#3260

Hard Regularization to Prevent Deep Online Clustering Collapse without Data Augmentation

Louis Mahon, Thomas Lukasiewicz

AAAI 2024paperarXiv:2312.16425

#3261

In-Hand 3D Object Reconstruction from a Monocular RGB Video

Shijian Jiang, Qi Ye, Rengan Xie et al.

AAAI 2024paperarXiv:2306.06770

#3262

Improving Knowledge Extraction from LLMs for Task Learning through Agent Analysis

James Kirk, Robert Wray, Peter Lindes et al.

AAAI 2024paperarXiv:2505.15648

#3263

Learning Small Decision Trees with Few Outliers: A Parameterized Perspective

Harmender Gahlawat, Meirav Zehavi

AAAI 2024paperarXiv:2401.03540

#3264

SeTformer Is What You Need for Vision and Language

Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger et al.

#3265

Knowledge-Enhanced Historical Document Segmentation and Recognition

En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu et al.

ECCV 2024posterarXiv:2404.10700

#3266

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi et al.

ECCV 2024posterarXiv:2312.06719

#3267

SkyScenes: A Synthetic Dataset for Aerial Scene Understanding

Sahil Santosh Khose, Anisha Pal, Aayushi Agarwal et al.

AAAI 2024paperarXiv:2403.17742

#3268

Using Stratified Sampling to Improve LIME Image Explanations

Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari et al.

CVPR 2024posterarXiv:2403.01231

#3269

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

Zijin Yin, Kongming Liang, Bing Li et al.

#3270

Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model

Donggeun Yoon, Minseok Seo, Doyi Kim et al.

ECCV 2024posterarXiv:2404.09150

#3271

Learning Cross-hand Policies of High-DOF Reaching and Grasping

Qijin She, Shishun Zhang, Yunfan Ye et al.

#3272

Synergy of Sight and Semantics: Visual Intention Understanding with CLIP

Qu Yang, Mang Ye, Dacheng Tao

ECCV 2024posterarXiv:2407.07478

#3273

EA-VTR: Event-Aware Video-Text Retrieval

Zongyang Ma, Ziqi Zhang, Yuxin Chen et al.

ICLR 2024posterarXiv:2404.16779

#3274

DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks

Tongzhou Mu, Minghua Liu, Hao Su

ECCV 2024posterarXiv:2407.09838

#3275

Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation

Anqi Zhang, Guangyu Gao

ECCV 2024posterarXiv:2403.16020

#3276

PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference

Tanvir Mahmud, Burhaneddin Yaman, Chun-Hao Liu et al.

#3277

HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation

Gian Toan D., Tien Dac Lai, Thien Van Luong et al.

ECCV 2024posterarXiv:2409.16689

#3278

Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model

Shoma Iwai, Atsuki Osanai, Shunsuke Kitada et al.

CVPR 2024posterarXiv:2305.17368

#3279

Instance-based Max-margin for Practical Few-shot Recognition

Minghao Fu, Ke Zhu

AAAI 2024paperarXiv:2312.09812

#3280

Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception

Xiao Wang, Wentao Wu, Chenglong Li et al.

ECCV 2024posterarXiv:2312.08704

#3281

PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments

rixin zhou, Ding Xia, YI ZHANG et al.

ECCV 2024posterarXiv:2401.00403

#3282

Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection

Yunfeng Fan, Wenchao Xu, Haozhao Wang et al.

#3283

Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition

Zhongxi Chen, Shen Chen, Taiping Yao et al.

#3284

Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model

Guanren Qiao, Guiliang Liu, Guorui Quan et al.

CVPR 2024posterarXiv:2403.00939

#3285

G3DR: Generative 3D Reconstruction in ImageNet

Pradyumna Reddy, Ismail Elezi, Jiankang Deng

#3286

Multi-Modal Disordered Representation Learning Network for Description-Based Person Search

Fan Yang, Wei Li, Menglong Yang et al.

#3287

Click Prompt Learning with Optimal Transport for Interactive Segmentation

Jie Liu, haochen wang, Wenzhe Yin et al.

AAAI 2024paperarXiv:2304.12707

#3288

Lyapunov-Stable Deep Equilibrium Models

Haoyu Chu, Shikui Wei, Ting Liu et al.

AAAI 2024paperarXiv:2308.11071

#3289

Neural Amortized Inference for Nested Multi-Agent Reasoning

Kunal Jha, Tuan Anh Le, Chuanyang Jin et al.

ECCV 2024posterarXiv:2407.12387

#3290

HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation

Tianpei Zou, Sanqing Qu, Zhijun Li et al.

ECCV 2024posterarXiv:2410.19483

#3291

Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization

Weihang Liu, Xue Xian Zheng, Jingyi Yu et al.

ECCV 2024posterarXiv:2308.04553

#3292

From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition

Maan Qraitem, Kate Saenko, Bryan Plummer

CVPR 2024posterarXiv:2406.01843

#3293

L-MAGIC: Language Model Assisted Generation of Images with Coherence

zhipeng cai, Matthias Mueller, Reiner Birkl et al.

ECCV 2024posterarXiv:2407.14958

#3294

Temporal Residual Jacobians for Rig-free Motion Transfer

Sanjeev Muralikrishnan, Niladri Shekhar Dutt, Siddhartha Chaudhuri et al.

ECCV 2024posterarXiv:2402.17514

#3295

Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM

Jia Wan, qiangqiang wu, Wei Lin et al.

AAAI 2024paperarXiv:2312.13219

#3296

Interactive Visual Task Learning for Robots

Weiwei Gu, Anant Sah, N. Gopalan

ECCV 2024posterarXiv:2409.11923

#3297

Agglomerative Token Clustering

Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.

ECCV 2024posterarXiv:2409.18218

#3298

Learning to Drive via Asymmetric Self-Play

Chris Zhang, Sourav Biswas, Kelvin Wong et al.

ECCV 2024posterarXiv:2409.19429

#3299

Fast Encoding and Decoding for Implicit Video Representation

Hao Chen, Saining Xie, Ser-Nam Lim et al.

ICLR 2024posterarXiv:2305.17555

#3300

Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction

Thanh-Tung Le, Khai Nguyen, shanlin sun et al.

#3301

Unsupervised Multi-modal Medical Image Registration via Invertible Translation

Mengjie Guo

#3302

Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception

Congzhang Shao, Guiyang Luo, Quan Yuan et al.

ECCV 2024posterarXiv:2302.14696

#3303

Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection

Jian Shi, Pengyi Zhang, Ni Zhang et al.

ECCV 2024posterarXiv:2408.06798

#3304

Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning

Shibo Jie, Yehui Tang, Jianyuan Guo et al.

AAAI 2024paperarXiv:2402.13025

#3305

CFEVER: A Chinese Fact Extraction and VERification Dataset

Ying-Jia Lin, ChunYi Lin, Chia-Jen Yeh et al.

#3306

When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach

TAO MA, Bing Bai, Haozhe Lin et al.

ICLR 2024spotlightarXiv:2305.14585

#3307

Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

Andrew Engel, Zhichao Wang, Natalie Frank et al.

#3308

RCL: Reliable Continual Learning for Unified Failure Detection

Fei Zhu, Zhen Cheng, Xu-Yao Zhang et al.

ECCV 2024posterarXiv:2407.09686

#3309

SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images

josh myers-dean, Jarek T Reynolds, Brian Price et al.

CVPR 2024posterarXiv:2405.14136

#3310

Efficient Multitask Dense Predictor via Binarization

Yuzhang Shang, Dan Xu, Gaowen Liu et al.

#3311

Dual-Enhanced Coreset Selection with Class-wise Collaboration for Online Blurry Class Incremental Learning

Yutian Luo, Shiqi Zhao, Haoran Wu et al.

#3312

De-confounded Gaze Estimation

Ziyang Liang, Yiwei Bao, Feng Lu

ECCV 2024posterarXiv:2406.05849

#3313

MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps

Jianhao Zheng, Daniel Barath, Marc Pollefeys et al.

ECCV 2024posterarXiv:2404.10527

#3314

SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments

Niklas Gard, Anna Hilsmann, Peter Eisert

ICLR 2024oralarXiv:2406.06149

#3315

Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

Yujee Song, Donghyun LEE, Rui Meng et al.

ECCV 2024posterarXiv:2410.00289

#3316

Delving Deep into Engagement Prediction of Short Videos

dasong Li, Wenjie Li, Baili Lu et al.

ECCV 2024posterarXiv:2409.06129

#3317

DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement

Qimin Chen, Zhiqin Chen, Vladimir Kim et al.

ECCV 2024posterarXiv:2310.00161

#3318

Region-centric Image-Language Pretraining for Open-Vocabulary Detection

Dahun Kim, Anelia Angelova, Weicheng Kuo

#3319

Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models

MENGYU ZHENG, Yehui Tang, Zhiwei Hao et al.

ECCV 2024posterarXiv:2407.02309

#3320

Semantically Guided Representation Learning For Action Anticipation

Anxhelo Diko, Danilo Avola, Bardh Prenkaj et al.

ICLR 2024posterarXiv:2309.16883

#3321

The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing

Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.

CVPR 2024posterarXiv:2403.18469

#3322

Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds

Zhimin Yuan, Wankang Zeng, Yanfei Su et al.

#3323

STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning

Hao CHENG, SIYUAN YANG, Chong Wang et al.

ECCV 2024posterarXiv:2407.06838

#3324

Event Trojan: Asynchronous Event-based Backdoor Attacks

Ruofei Wang, Qing Guo, Haoliang Li et al.

ECCV 2024posterarXiv:2409.15264

#3325

UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework

Tarun Kalluri, Sreyas Ravichandran, Manmohan Chandraker

ECCV 2024posterarXiv:2403.10911

#3326

Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Yeongtak Oh, Jonghyun Lee, Jooyoung Choi et al.

ECCV 2024posterarXiv:2407.09303

#3327

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Sungmin Woo, Wonjoon Lee, Woo Jin Kim et al.

#3328

Unveiling the Unknown: Unleashing the Power of Unknown to Known in Open-Set Source-Free Domain Adaptation

Fuli Wan, Han Zhao, Xu Yang et al.

ECCV 2024posterarXiv:2401.08687

#3329

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

Kai Jiang, Jiaxing Huang, Weiying Xie et al.

CVPR 2024posterarXiv:2405.10575

#3330

Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory

Jonas Kälble, Sascha Wirges, Maxim Tatarchenko et al.

#3331

Better Regression Makes Better Test-time Adaptive 3D Object Detection

Jiakang Yuan, Bo Zhang, Kaixiong Gong et al.

ECCV 2024posterarXiv:2409.19293

#3332

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

Ahmad Khaliq, Ming Xu, Stephen Hausler et al.

ECCV 2024posterarXiv:2407.10439

#3333

PolyRoom: Room-aware Transformer for Floorplan Reconstruction

Yuzhou Liu, Lingjie Zhu, Xiaodong Ma et al.

ECCV 2024posterarXiv:2408.00160

#3334

Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution

Mridul Khurana, Arka Daw, M. Maruf et al.

CVPR 2024posterarXiv:2404.15263

#3335

Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization

Lahav Lipson, Jia Deng

CVPR 2024posterarXiv:2402.17372

#3336

Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching

Matteo Bastico, Etienne Decencière, Laurent Corté et al.

ECCV 2024posterarXiv:2403.04943

#3337

AFreeCA: Annotation-Free Counting for All

Adriano DAlessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh

CVPR 2024posterarXiv:2412.13081

#3338

Prompt Augmentation for Self-supervised Text-guided Image Manipulation

Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim

AAAI 2024paperarXiv:2312.16604

#3339

Twice Class Bias Correction for Imbalanced Semi

supervised Learning

CVPR 2024posterarXiv:2404.17528

#3340

Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

Tianqi Liu, Xinyi Ye, Min Shi et al.

ECCV 2024posterarXiv:2407.15540

#3341

Differentiable Product Quantization for Memory Efficient Camera Relocalization

Zakaria Laskar, Iaroslav Melekhov, Assia Benbihi et al.

#3342

Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation

Yan Wang, Chuan-Xian Ren, Yi-Ming Zhai et al.

#3343

Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving

Yixuan Fan, Ya-Li Li, Shengjin Wang

ECCV 2024posterarXiv:2407.04538

#3344

PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers

Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.

AAAI 2024paperarXiv:2401.11740

#3345

Multi-Level Cross-Modal Alignment for Image Clustering

Liping Qiu, Qin Zhang, Xiaojun Chen et al.

AAAI 2024paperarXiv:2401.02734

#3346

FedNS: A Fast Sketching Newton-Type Algorithm for Federated Learning

Jian Li, Yong Liu, Wei Wang et al.

CVPR 2024posterarXiv:2401.07114

#3347

Revisiting Sampson Approximations for Geometric Estimation Problems

Felix Rydell, Angelica Torres, Viktor Larsson

ECCV 2024posterarXiv:2407.11382

#3348

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

Jianhao Li, Tianyu Sun, Zhongdao Wang et al.

AAAI 2024paperarXiv:2405.03565

#3349

Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor

Han Liu, Siyang Zhao, Xiaotong Zhang et al.

CVPR 2024highlightarXiv:2312.04529

#3350

Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance

Yuto Enyo, Ko Nishino

CVPR 2024posterarXiv:2404.00680

#3351

Learning to Rank Patches for Unbiased Image Redundancy Reduction

Yang Luo, Zhineng Chen, Peng Zhou et al.

CVPR 2024posterarXiv:2403.19904

#3352

Fully Geometric Panoramic Localization

Junho Kim, Jiwon Jeong, Young Min Kim

ECCV 2024posterarXiv:2408.16235

#3353

LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement

Ye Yu, Fengxin Chen, Jun Yu et al.

ICLR 2024posterarXiv:2405.04342

#3354

The Curse of Diversity in Ensemble-Based Exploration

Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin et al.

ECCV 2024posterarXiv:2408.05364

#3355

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla et al.

CVPR 2024posterarXiv:2402.18786

#3356

OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition

Yuchen Pan, Junjun Jiang, Kui Jiang et al.

#3357

CMA: A Chromaticity Map Adapter for Robust Detection of Screen-Recapture Document Images

Changsheng Chen, Liangwei Lin, Yongqi Chen et al.

CVPR 2024posterarXiv:2309.05073

#3358

FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions

Jiong WANG, Fengyu Yang, Bingliang Li et al.

#3359

3D-Aware Face Editing via Warping-Guided Latent Direction Learning

Yuhao Cheng, Zhuo Chen, Xingyu Ren et al.

#3360

SENCR: A Span Enhanced Two-Stage Network with Counterfactual Rethinking for Chinese NER

Hang Zheng, Qingsong Li, Shen Chen et al.

ECCV 2024posterarXiv:2409.16145

#3361

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Yuxiao Chen, Kai Li, Wentao Bao et al.

ECCV 2024posterarXiv:2404.00380

#3362

DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation

Sanghyun Jo, Fei Pan, In-Jae Yu et al.

CVPR 2024posterarXiv:2403.16258

#3363

Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis

Atefeh Khoshkhahtinat, Ali Zafari, Piyush Mehta et al.

AAAI 2024paperarXiv:2312.15942

#3364

Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images

Zhan Lu, Qian Zheng, Boxin Shi et al.

#3365

Self-Prompt Mechanism for Few-Shot Image Recognition

Mingchen Song, Huiqiang Wang, Guoqiang Zhong

CVPR 2024posterarXiv:2403.01773

#3366

Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments

Yinhua Piao, Sangseon Lee, Yijingxiu Lu et al.

ECCV 2024posterarXiv:2407.18207

#3367

Geometry Fidelity for Spherical Images

Anders Christensen, Nooshin Mojab, Khushman Patel et al.

#3368

Flexible Depth Completion for Sparse and Varying Point Densities

Jinhyung Park, Yu-Jhe Li, Kris Kitani

ECCV 2024posterarXiv:2407.08947

#3369

Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Jeeyung Kim, Ze Wang, Qiang Qiu

ECCV 2024posterarXiv:2408.15660

#3370

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas

Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli et al.

CVPR 2024posterarXiv:2312.09925

#3371

CNC-Net: Self-Supervised Learning for CNC Machining Operations

Mohsen Yavartanoo, Sangmin Hong, Reyhaneh Neshatavar et al.

#3372

Adaptive Multi-task Learning for Few-shot Object Detection

Yan Ren, Yanling Li, Wai-Kin Adams Kong

AAAI 2024paperarXiv:2311.00109

#3373

FairWASP: Fast and Optimal Fair Wasserstein Pre-processing

Zikai Xiong, Niccolo Dalmasso, Alan Mishler et al.

#3374

Operational Open-Set Recognition and PostMax Refinement

Steve Cruz, Ryan Rabinowitz, Manuel Günther et al.

#3375

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning

Artemis Panagopoulou, Le Xue, Ning Yu et al.

#3376

CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering

Shaowei Wang, Lingling Zhang, Longji Zhu et al.

CVPR 2024posterarXiv:2404.10124

#3377

Epistemic Uncertainty Quantification For Pre-Trained Neural Networks

Hanjing Wang, Qiang Ji

ECCV 2024posterarXiv:2407.02350

#3378

Conceptual Codebook Learning for Vision-Language Models

Yi Zhang, Ke Yu, Siqi Wu et al.

CVPR 2024posterarXiv:2402.10636

#3379

PEGASUS: Personalized Generative 3D Avatars with Composable Attributes

Hyunsoo Cha, Byungjun Kim, Hanbyul Joo

#3380

Towards Making Learnware Specification and Market Evolvable

Jian-Dong Liu, Zhi-Hao Tan, Zhi-Hua Zhou

CVPR 2024posterarXiv:2311.09104

#3381

Cross-view and Cross-pose Completion for 3D Human Understanding

Matthieu Armando, Salma Galaaoui, Fabien Baradel et al.

#3382

LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang

Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.

ECCV 2024posterarXiv:2406.02776

#3383

MeshVPR: Citywide Visual Place Recognition Using 3D Meshes

Gabriele Berton, Lorenz Junglas, Riccardo Zaccone et al.

#3384

Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery

Zhengrui Chen, Liying Lu, Ziyang Yuan et al.

AAAI 2024paperarXiv:2312.11532

#3385

Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation

YoungJoon Yoo, Jongwon Choi

ECCV 2024posterarXiv:2407.08377

#3386

Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

Shengqi Xu, Run Sun, Yi Chang et al.

AAAI 2024paperarXiv:2312.08234

#3387

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation

Yujun Chen, Xin Tan, Zhizhong Zhang et al.

CVPR 2024highlightarXiv:2403.15789

#3388

In-Context Matting

He Guo, Zixuan Ye, Zhiguo Cao et al.

#3389

Mind Artist: Creating Artistic Snapshots with Human Thought

Jiaxuan Chen, Yu Qi, Yueming Wang et al.

ICLR 2024posterarXiv:2310.02611

#3390

Analyzing and Improving Optimal-Transport-based Adversarial Networks

Jaemoo Choi, Jaewoong Choi, Myungjoo Kang

AAAI 2024paperarXiv:2308.13772

#3391

Boosting Residual Networks with Group Knowledge

Shengji Tang, Peng Ye, Baopu Li et al.

#3392

Scores for Learning Discrete Causal Graphs with Unobserved Confounders

Alexis Bellot, Junzhe Zhang, Elias Bareinboim

#3393

Partial Label Learning with a Partner

Chongjie Si, Zekun Jiang, Xuehui Wang et al.

ECCV 2024posterarXiv:2312.12098

#3394

Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains

Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim et al.

CVPR 2024posterarXiv:2401.01482

#3395

Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

Kyle Buettner, Sina Malakouti, Xiang Li et al.

ICLR 2024posterarXiv:2306.02558

#3396

Multi-View Representation is What You Need for Point-Cloud Pre-Training

Siming Yan, Chen Song, Youkang Kong et al.

AAAI 2024paperarXiv:2302.08929

#3397

Spatial Voting with Incomplete Voter Information

Aviram Imber, Jonas Israel, Markus Brill et al.

AAAI 2024paperarXiv:2303.01213

#3398

DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?

Victor Quetu, Enzo Tartaglione

ICLR 2024posterarXiv:2310.04416

#3399

Alice Benchmarks: Connecting Real World Re-Identification with the Synthetic

Xiaoxiao Sun, Yue Yao, Shengjin Wang et al.

ECCV 2024posterarXiv:2411.01494

#3400

Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation

Seongsu Ha, Chaeyun Kim, Donghwa Kim et al.