Most Cited 2024 &quot;modal integration&quot; Papers

#3002

Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning

Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.

AAAI 2024paperarXiv:2312.09783

#3003

Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning

Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.

AAAI 2024paperarXiv:2312.11091

#3004

Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

Jakob Hollenstein, Georg Martius, Justus Piater

AAAI 2024paperarXiv:2305.08372

#3005

Hierarchical Aligned Multimodal Learning for NER on Tweet Posts

Peipei Liu, Hong Li, Yimo Ren et al.

ECCV 2024arXiv:2409.03755

#3006

DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

Wenliang Zhao, Haolin Wang, Jie Zhou et al.

AAAI 2024paperarXiv:2312.13066

#3007

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.

CVPR 2024arXiv:2403.12202

#3008

DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

Yunxiao Shi, Manish Singh, Hong Cai et al.

ECCV 2024arXiv:2409.15801

#3009

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

AAAI 2024paperarXiv:2403.00225

#3010

Robust Policy Learning via Offline Skill Diffusion

Woo Kyung Kim, Minjong Yoo, Honguk Woo

ECCV 2024arXiv:2305.18381

#3011

Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation

YUE XU, Yong-Lu Li, Kaitong Cui et al.

AAAI 2024paperarXiv:2412.03825

#3012

Residual Hyperbolic Graph Convolution Networks

Yangkai Xue, Jindou Dai, Zhipeng Lu et al.

ICLR 2024arXiv:2306.16688

#3013

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.

#3014

RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency

Ziming Sun, Yuan Liang, Zejun Ma et al.

AAAI 2024paperarXiv:2305.14024

#3015

Improved Metric Distortion via Threshold Approvals

Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.

ECCV 2024arXiv:2407.15396

#3016

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.

AAAI 2024paperarXiv:2312.15971

#3017

Graph Context Transformation Learning for Progressive Correspondence Pruning

Junwen Guo, Guobao Xiao, Shiping Wang et al.

AAAI 2024paperarXiv:2305.06741

#3018

IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.

#3019

Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution

Qing Cai, Mu Li, Dongwei Ren et al.

#3020

Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing

SI-QI LIU, Qirui Wang, Pong Chi Yuen

CVPR 2024arXiv:2302.04871

#3021

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

Yiran Xu, Zhixin Shu, Cameron Smith et al.

ECCV 2024arXiv:2311.17086

#3022

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

Jian Ma, Chen Chen, Qingsong Xie et al.

#3023

Relational Matching for Weakly Semi-Supervised Oriented Object Detection

Wenhao Wu, Hau San Wong, Si Wu et al.

CVPR 2024

#3024

An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought

Chunhao LU, Qiang Lu, Jake Luo

ICLR 2024arXiv:2401.10632

#3025

Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach

Aoqi Zuo, yiqing li, Susan Wei et al.

CVPR 2024arXiv:2404.11139

#3026

GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement

Linfang Zheng, Tze Ho Elden Tse, Chen Wang et al.

CVPR 2024arXiv:2403.19235

#3027

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation

Haonan Lin

AAAI 2024paperarXiv:2312.08504

#3028

1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations

Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.

#3029

Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations

Junpeng Fang, Gongduo Zhang, Qing Cui et al.

#3030

Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients

Xueyang Tang, Song Guo, Jie ZHANG et al.

ICLR 2024

#3031

Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification

Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.

#3032

Shape from Heat Conduction

Sriram Narayanan, Mani Ramanagopal, Mark Sheinin et al.

ECCV 2024arXiv:2501.02771

#3033

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024arXiv:2402.17986

#3034

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

Jason Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani et al.

CVPR 2024arXiv:2405.10286

#3035

FFF: Fixing Flawed Foundations in Contrastive Pre-Training Results in Very Strong Vision-Language Models

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

#3036

CatmullRom Splines-Based Regression for Image Forgery Localization

Li Zhang, Mingliang Xu, Dong Li et al.

CVPR 2024arXiv:2403.04245

#3037

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Yusheng Dai, HangChen, Jun Du et al.

#3038

TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts

Youssef Mansour, Xuyang Zhong, Serdar Caglar et al.

#3039

Text2City: One-Stage Text-Driven Urban Layout Regeneration

Yiming Qin, Nanxuan Zhao, Bin Sheng et al.

ECCV 2024arXiv:2305.03713

#3040

Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos

Ekta Prashnani, Koki Nagano, Shalini De Mello et al.

AAAI 2024paperarXiv:2402.12184

#3041

Colorizing Monochromatic Radiance Fields

Yean Cheng, Renjie Wan, Shuchen Weng et al.

ECCV 2024arXiv:2406.12452

#3042

Insect Identification in the Wild: The AMI Dataset

Aditya Jain, Fagner Cunha, Michael J Bunsen et al.

AAAI 2024paperarXiv:2312.10608

#3043

Robust 3D Tracking with Quality-Aware Shape Completion

Jingwen Zhang, Zikun Zhou, Guangming Lu et al.

ICLR 2024arXiv:2306.03857

#3044

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Guillaume Bono, Leonid Antsfeld, Assem Sadek et al.

ECCV 2024arXiv:2407.05578

#3045

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

Jiedong Zhuang, Jiaqi Hu, Lianrui Mu et al.

ECCV 2024arXiv:2409.18953

#3046

UniCal: Unified Neural Sensor Calibration

Ze Yang, George G Chen, Haowei Zhang et al.

#3047

Knowledge Enhanced Representation Learning for Drug Discovery

Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.

ECCV 2024arXiv:2311.18435

#3048

Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis

Zipeng Qi, Guoxi Huang, Chenyang Liu et al.

ECCV 2024arXiv:2408.02226

#3049

ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation

Jack Lu, Ryan Teehan, Mengye Ren

AAAI 2024paperarXiv:2308.08644

#3050

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.

AAAI 2024paperarXiv:2312.11260

#3051

Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning

YongJin Yang, Taehyeon Kim, Se-Young Yun

CVPR 2024arXiv:2404.05661

#3052

Automatic Controllable Colorization via Imagination

Xiaoyan Cong, Yue Wu, Qifeng Chen et al.

ECCV 2024arXiv:2312.00114

#3053

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024arXiv:2408.00361

#3054

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Shen Jianbing, Wencheng Han

ECCV 2024arXiv:2407.14972

#3055

ARoFace: Alignment Robustness to Improve Low-quality Face Recognition

Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.

#3056

Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need

Jiaxuan Chen, Yu Qi, Yueming Wang et al.

ECCV 2024arXiv:2410.09802

#3057

EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models

Lee Eungbean, Somi Jeong, Kwanghoon Sohn

AAAI 2024paperarXiv:2312.07241

#3058

Reachability of Fair Allocations via Sequential Exchanges

Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.

#3059

SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly

Liuwan Zhu, Rui Ning, Jiang Li et al.

ECCV 2024arXiv:2407.07564

#3060

Trainable Highly-expressive Activation Functions

Irit Chelly, Shahaf Finder, Shira Ifergane et al.

CVPR 2024arXiv:2404.00842

#3061

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

Ling Gao, Daniel Gehrig, Hang Su et al.

ECCV 2024arXiv:2409.08518

#3062

Anytime Continual Learning for Open Vocabulary Classification

Zhen Zhu, Yiming Gong, Derek Hoiem

ECCV 2024arXiv:2406.10740

#3063

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Zhikai Zhang, Yitang Li, Haofeng Huang et al.

ICLR 2024oralarXiv:2311.01450

#3064

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Vint Lee, Pieter Abbeel, Youngwoon Lee

CVPR 2024arXiv:2403.13548

#3065

Diversity-aware Channel Pruning for StyleGAN Compression

Jiwoo Chung, Sangeek Hyun, Sang-Heon Shim et al.

AAAI 2024paperarXiv:2312.11972

#3066

Expressive Forecasting of 3D Whole-Body Human Motions

Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.

AAAI 2024paperarXiv:2312.02684

#3067

DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors

Xiaze Zhang, Ziheng Ding, Qi Jing et al.

ICLR 2024arXiv:2310.17256

#3068

fairret: a Framework for Differentiable Fairness Regularization Terms

Maarten Buyl, MaryBeth Defrance, Tijl De Bie

ECCV 2024arXiv:2407.10632

#3069

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

Zhening Liu, XINJIE ZHANG, Jiawei Shao et al.

CVPR 2024arXiv:2403.19326

#3070

MedBN: Robust Test-Time Adaptation against Malicious Test Samples

Hyejin Park, Jeongyeon Hwang, Sunung Mun et al.

ECCV 2024arXiv:2312.11587

#3071

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

#3072

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting

Junwu Zhang, Zhenyu Tang, Yatian Pang et al.

#3073

Improving Knowledge Distillation via Regularizing Feature Direction and Norm

Yuzhu Wang, Lechao Cheng, Manni Duan et al.

ECCV 2024arXiv:2403.11107

#3074

Self-supervised co-salient object detection via feature correspondences at multiple scales

Souradeep Chakraborty, Dimitris Samaras

ECCV 2024arXiv:2407.13545

#3075

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024arXiv:2408.03282

#3076

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval

Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen et al.

AAAI 2024paperarXiv:2312.14590

#3077

SIG: Speaker Identification in Literature via Prompt-Based Generation

Zhenlin Su, Liyan Xu, Jin Xu et al.

ECCV 2024arXiv:2403.12800

#3078

Learning Neural Volumetric Pose Features for Camera Localization

Jingyu Lin, Jiaqi Gu, Bojian Wu et al.

ICLR 2024arXiv:2309.17189

#3079

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

Samuel Pegg, Kai Li, Xiaolin Hu

ECCV 2024arXiv:2312.05525

#3080

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

Sheng Jin, Shuhuai Li, Tong Li et al.

AAAI 2024paperarXiv:2312.10797

#3081

Large-Scale Multi-Robot Coverage Path Planning via Local Search

Jingtao Tang, Hang Ma

ICLR 2024arXiv:2311.08384

#3082

Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

Yifei Zhou, Ayush Sekhari, Yuda Song et al.

ECCV 2024arXiv:2404.03042

#3083

AWOL: Analysis WithOut synthesis using Language

Silvia Zuffi, Michael J. Black

ECCV 2024arXiv:2408.02788

#3084

GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths

Xianyu Chen, Ming Jiang, Qi Zhao

ECCV 2024arXiv:2405.05552

#3085

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Zichen Zhang, Hongchen Luo, Wei Zhai et al.

ECCV 2024arXiv:2409.11235

#3086

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

Siyuan Li, Lei Ke, Yung-Hsu Yang et al.

ICLR 2024arXiv:2404.16779

#3087

DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks

Tongzhou Mu, Minghua Liu, Hao Su

ECCV 2024arXiv:2407.13095

#3088

Audio-visual Generalized Zero-shot Learning the Easy Way

Shentong Mo, Pedro Morgado

ECCV 2024arXiv:2407.07003

#3089

Learning to Complement and to Defer to Multiple Users

Zheng Zhang, Wenjie Ai, Kevin Wells et al.

CVPR 2024arXiv:2402.18862

#3090

Towards Backward-Compatible Continual Learning of Image Compression

Zhihao Duan, Ming Lu, Justin Yang et al.

ICLR 2024spotlightarXiv:2305.14585

#3091

Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

Andrew Engel, Zhichao Wang, Natalie Frank et al.

#3092

Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids

Wontae Kim, Nam Ik Cho

#3093

Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation

Wenbo Qi, Jiafei Wu, S. C. Chan

ECCV 2024arXiv:2407.06315

#3094

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

CVPR 2024arXiv:2304.05440

#3095

PixelRNN: In-pixel Recurrent Neural Networks for End-to-end–optimized Perception with Neural Sensors

Haley So, Laurie Bose, Piotr Dudek et al.

CVPR 2024arXiv:2311.11995

#3096

BrainWash: A Poisoning Attack to Forget in Continual Learning

Ali Abbasi, Parsa Nooralinejad, Hamed Pirsiavash et al.

ECCV 2024arXiv:2311.08843

#3097

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

ECCV 2024arXiv:2403.14628

#3098

Zero-Shot Multi-Object Scene Completion

Shun Iwase, Katherine Liu, Vitor Guizilini et al.

ICLR 2024arXiv:2305.17555

#3099

Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction

Thanh-Tung Le, Khai Nguyen, shanlin sun et al.

ECCV 2024arXiv:2407.06984

#3100

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images

Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.

ICLR 2024arXiv:2307.02040

#3101

VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks

Zhaomin Wu, Junyi Hou, Bingsheng He

CVPR 2024arXiv:2403.08436

#3102

PFStorer: Personalized Face Restoration and Super-Resolution

Tuomas Varanka, Tapani Toivonen, Soumya Tripathy et al.

ECCV 2024arXiv:2408.02672

#3103

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.

ECCV 2024arXiv:2410.00905

#3104

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment

Mu Cai, Haotian Liu, Yuheng Li et al.

CVPR 2024arXiv:2405.19833

#3105

KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation

Fengyuan Yang, Kerui Gu, Angela Yao

ECCV 2024arXiv:2407.11356

#3106

The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation

Muyang Qiu, Jian Zhang, Lei Qi et al.

ECCV 2024arXiv:2404.02514

#3107

High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

YISHENG HE, Weihao Yuan, Siyu Zhu et al.

#3108

Querying as Prompt: Parameter-Efficient Learning for Multimodal Language Model

Tian Liang, Jing Huang, Ming Kong et al.

CVPR 2024

ECCV 2024arXiv:2409.06535

#3109

PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.

CVPR 2024arXiv:2406.01843

#3110

L-MAGIC: Language Model Assisted Generation of Images with Coherence

zhipeng cai, Matthias Mueller, Reiner Birkl et al.

ECCV 2024arXiv:2407.04461

#3111

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

Shang Liu, Chaohui Yu, Chenjie Cao et al.

ECCV 2024arXiv:2407.16125

#3112

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024arXiv:2408.02110

#3113

AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Feichi Lu, Zijian Dong, Jie Song et al.

#3114

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Qitai Wang, Jiawei He, Yuntao Chen et al.

ECCV 2024arXiv:2312.04885

#3115

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

CVPR 2024arXiv:2404.01591

#3116

Language Model Guided Interpretable Video Action Reasoning

Ning Wang, Guangming Zhu, Hongsheng Li et al.

CVPR 2024arXiv:2403.08262

#3117

BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image

Minje Kim, Tae-Kyun Kim

ECCV 2024arXiv:2409.07307

#3118

Data Augmentation via Latent Diffusion for Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.

#3119

Unsupervised Deep Unrolling Networks for Phase Unwrapping

Zhile Chen, Yuhui Quan, Hui Ji

CVPR 2024

CVPR 2024arXiv:2308.15692

#3120

Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models

Takami Sato, Justin Yue, Nanze Chen et al.

ECCV 2024arXiv:2404.06154

#3121

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling

Raphael Sulzer, Florent Lafarge

ECCV 2024arXiv:2408.00759

#3122

Text-Guided Video Masked Autoencoder

David Fan, Jue Wang, Shuai Liao et al.

ECCV 2024arXiv:2409.01021

#3123

CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.

Long Li, Nian Liu, Dingwen Zhang et al.

#3124

Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation

Xinru Cui, Qiming Liu, Zhe Liu et al.

ICLR 2024arXiv:2310.00545

#3125

Implicit Neural Representations and the Algebra of Complex Wavelets

T Mitchell Roddenberry, Vishwanath Saragadam, Maarten V de Hoop et al.

ECCV 2024arXiv:2309.07322

#3126

NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration

Lin Tian, Thomas H Greer, Raul San Jose Estepar et al.

ECCV 2024arXiv:2404.12867

#3127

FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving

Xingtai Gui, Tengteng Huang, Haonan Shao et al.

#3128

DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction

MOZHGAN POURKESHAVARZ, Arielle Zhang, Amir Rasouli

CVPR 2024arXiv:2507.14559

#3129

LEAD: Exploring Logit Space Evolution for Model Selection

Zixuan Hu, Xiaotong Li, SHIXIANG TANG et al.

CVPR 2024arXiv:2403.02041

#3130

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mathilde Caron, Ahmet Iscen, Alireza Fathi et al.

CVPR 2024arXiv:2404.01123

#3131

CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok et al.

ICLR 2024oralarXiv:2403.09669

#3132

STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models

Pum Jun Kim, Seojun Kim, Jaejun Yoo

ECCV 2024arXiv:2409.03424

#3133

Weight Conditioning for Smooth Optimization of Neural Networks

Hemanth Saratchandran, Thomas X Wang, Simon Lucey

#3134

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

ECCV 2024arXiv:2312.07231

#3135

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Shentong Mo, Enze Xie, Yue Wu et al.

ICLR 2024arXiv:2312.14567

#3136

Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

Rui Pan, Yuxing Liu, Xiaoyu Wang et al.

ECCV 2024arXiv:2311.17101

#3137

A high-quality robust diffusion framework for corrupted dataset

Quan Dao, Binh Ta, Tung Pham et al.

ECCV 2024arXiv:2312.06721

#3138

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.

ECCV 2024arXiv:2407.09367

#3139

Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.

ECCV 2024arXiv:2411.02149

#3140

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

ICLR 2024arXiv:2401.06604

#3141

Identifying Policy Gradient Subspaces

Jan Schneider, Pierre Schumacher, Simon Guist et al.

AAAI 2024paperarXiv:2404.00603

#3142

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning

Kun Ding, Haojian Zhang, Qiang Yu et al.

AAAI 2024paperarXiv:2401.15987

#3143

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling

Yuze Hao, Jianrong Zhang, Tao Zhuo et al.

AAAI 2024paperarXiv:2303.01141

#3144

DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction

Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel

AAAI 2024paperarXiv:2312.13596

#3145

Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

Zhixiang Su, Di Wang, Chunyan Miao et al.

CVPR 2024arXiv:2404.03183

#3146

BodyMAP - Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed

Abhishek Tandon, Anujraaj Goyal, Henry M. Clever et al.

ECCV 2024arXiv:2401.06826

#3147

Direct Distillation between Different Domains

Jialiang Tang, Shuo Chen, Gang Niu et al.

ECCV 2024arXiv:2403.14772

#3148

Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures

Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.

ECCV 2024arXiv:2409.14340

#3149

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

AAAI 2024paperarXiv:2402.17978

#3150

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

Zeyang Liu, Lipeng Wan, Xinrui Yang et al.

CVPR 2024arXiv:2312.17686

#3151

Multiscale Vision Transformers Meet Bipartite Matching for Efficient Single-stage Action Localization

Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos

AAAI 2024paperarXiv:2308.09065

#3152

Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression

Xuanlong Yu, Gianni Franchi, Jindong Gu et al.

ECCV 2024arXiv:2407.02038

#3153

Camera-LiDAR Cross-modality Gait Recognition

Wenxuan Guo, Yingping Liang, Zhiyu Pan et al.

AAAI 2024paperarXiv:2312.10887

#3154

On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles

Marcus Gozon, Jingjin Yu

CVPR 2024arXiv:2312.03420

#3155

Artist-Friendly Relightable and Animatable Neural Heads

Yingyan Xu, Prashanth Chandran, Sebastian Weiss et al.

ICLR 2024arXiv:2312.03248

#3156

Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning

Haowen Wang, Tao Sun, Congyun Jin et al.

CVPR 2024arXiv:2309.04437

#3157

Single View Refractive Index Tomography with Neural Fields

Brandon Zhao, Aviad Levis, Liam Connor et al.

CVPR 2024arXiv:2406.06813

#3158

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation

Dong Zhao, Shuang Wang, Qi Zang et al.

CVPR 2024arXiv:2401.04071

#3159

Fun with Flags: Robust Principal Directions via Flag Manifolds

Tolga Birdal, Nathan Mankovich

ECCV 2024arXiv:2407.13338

#3160

Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM

Baicheng Li, Zike Yan, Dong Wu et al.

AAAI 2024paperarXiv:2308.10779

#3161

Spear and Shield: Adversarial Attacks and Defense Methods for Model-Based Link Prediction on Continuous-Time Dynamic Graphs

Dongjin Lee, Juho Lee, Kijung Shin

ECCV 2024arXiv:2407.12267

#3162

Generating 3D House Wireframes with Semantics

Xueqi Ma, Yilin Liu, Wenjun Zhou et al.

ICLR 2024arXiv:2305.16242

#3163

Two-timescale Extragradient for Finding Local Minimax Points

Jiseok Chae, Kyuwon Kim, Donghwan Kim

ICLR 2024arXiv:2402.14430

#3164

Robust Training of Federated Models with Extremely Label Deficiency

Yonggang Zhang, Zhiqin Yang, Xinmei Tian et al.

ECCV 2024arXiv:2407.12292

#3165

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

Youheng Sun, Shengming Yuan, Xuanhan Wang et al.

CVPR 2024highlightarXiv:2401.01823

#3166

Detours for Navigating Instructional Videos

Kumar Ashutosh, Zihui Xue, Tushar Nagarajan et al.

CVPR 2024arXiv:2405.20729

#3167

Extreme Point Supervised Instance Segmentation

Hyeonjun Lee, Sehyun Hwang, Suha Kwak

AAAI 2024paperarXiv:2401.08086

#3168

Spatial-Semantic Collaborative Cropping for User Generated Content

Yukun Su, Yiwen Cao, Jingliang Deng et al.

ECCV 2024arXiv:2308.16349

#3169

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

KILICHBEK HAYDAROV, Xiaoqian Shen, Avinash Madasu et al.

AAAI 2024paperarXiv:2312.10380

#3170

PPIDSG: A Privacy-Preserving Image Distribution Sharing Scheme with GAN in Federated Learning

Yuting Ma, Yuanzhi Yao, Xiaohua Xu

CVPR 2024arXiv:2404.07985

#3171

WaveMo: Learning Wavefront Modulations to See Through Scattering

Mingyang Xie, Haiyun Guo, Brandon Y. Feng et al.

#3172

Inverse Weight-Balancing for Deep Long-Tailed Learning

Wenqi Dang, Zhou Yang, Weisheng Dong et al.

ECCV 2024arXiv:2407.17058

#3173

DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting

Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.

CVPR 2024arXiv:2405.11481

#3174

Physics-Aware Hand-Object Interaction Denoising

Haowen Luo, Yunze Liu, Li Yi

CVPR 2024arXiv:2311.17352

#3175

Efficient Stitchable Task Adaptation

Haoyu He, Zizheng Pan, Jing Liu et al.

ECCV 2024arXiv:2404.03836

#3176

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

Amrin Kareem, Jean Lahoud, Hisham Cholakkal

AAAI 2024paperarXiv:2301.13821

#3177

Complete Neural Networks for Complete Euclidean Graphs

Snir Hordan, Tal Amir, Nadav Dym et al.

AAAI 2024paperarXiv:2312.13118

#3178

LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate

Tao Wu, Tie Luo, D. C. Wunsch

AAAI 2024paperarXiv:2312.10648

#3179

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.

CVPR 2024arXiv:2404.01342

#3180

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

Lirui Zhao, Yue Yang, Kaipeng Zhang et al.

ECCV 2024arXiv:2407.16260

#3181

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.

AAAI 2024paperarXiv:2401.13193

#3182

Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN

Minsoo Kang, Minkoo Kang, Suhyun Kim

AAAI 2024paperarXiv:2312.09219

#3183

NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning

Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.

#3184

Improving Zero-Shot Generalization for CLIP with Variational Adapter

Ziqian Lu, Fengli Shen, Mushui Liu et al.

CVPR 2024highlightarXiv:2403.04303

#3185

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

Jialin Li, Qiang Nie, Weifu Fu et al.

#3186

DAG-Aware Variational Autoencoder for Social Propagation Graph Generation

Dongpeng Hou, Chao Gao, Xuelong Li et al.

ECCV 2024arXiv:2409.08077

#3187

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

Junsung Lee, Minsoo Kang, Bohyung Han

ECCV 2024arXiv:2407.16448

#3188

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

#3189

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

CVPR 2024arXiv:2411.15673

#3190

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Alvi Md Ishmam, Chris Thomas

CVPR 2024arXiv:2406.03461

#3191

Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts

Dominik Scheuble, Chenyang Lei, Mario Bijelic et al.

AAAI 2024paperarXiv:2403.06235

#3192

Probabilistic Neural Circuits

Pedro Zuidberg Dos Martires

CVPR 2024arXiv:2406.11129

#3193

Neural Lineage

Runpeng Yu, Xinchao Wang

AAAI 2024paperarXiv:2312.07991

#3194

Accelerating the Global Aggregation of Local Explanations

Alon Mor, Yonatan Belinkov, Benny Kimelfeld

AAAI 2024paperarXiv:2312.11934

#3195

Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants

Wei Chen, Zhiyi Huang, Ruichu Cai et al.

ECCV 2024arXiv:2311.12090

#3196

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

AAAI 2024paperarXiv:2401.02606

#3197

Exploiting Polarized Material Cues for Robust Car Detection

Wen Dong, Haiyang Mei, Ziqi Wei et al.

AAAI 2024paperarXiv:2308.07272

#3198

Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning

Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.

CVPR 2024arXiv:2403.02561

#3199

Semantic Human Mesh Reconstruction with Textures

xiaoyu zhan, Jianxin Yang, Yuanqi Li et al.

ECCV 2024arXiv:2410.20451

#3200

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Yijin Li, Yichen Shen, Zhaoyang Huang et al.