Most Cited 2024 "transformers" Papers

ICLR 2024 arXiv:2401.13558

#6202

Task structure and nonlinearity jointly determine learned representational geometry

Matteo Alleman, Jack Lindsey, Stefano Fusi

ICLR 2024 arXiv:2310.06756

#6203

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Yiting Chen, Zhanpeng Zhou, Junchi Yan

ICLR 2024 arXiv:2311.04465

#6204

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang, Madison Cooley, Da Long et al.

ECCV 2024 arXiv:2404.04860

#6205

ByteEdit: Boost, Comply and Accelerate Generative Image Editing

Yuxi Ren, Jie Wu, Yanzuo Lu et al.

ECCV 2024 arXiv:2407.07324

#6206

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan Lu et al.

ICLR 2024 arXiv:2403.02690

#6207

Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning

HeeSun Bae, Seungjae Shin, Byeonghu Na et al.

ICML 2024 arXiv:2405.18217

#6208

Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik

AAAI 2024 paper arXiv:2312.08009

#6209

Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

Kewei Wang, Yizheng Wu, Zhiyu Pan et al.

#6210

Cross-Modal Match for Language Conditioned 3D Object Grounding

Yachao Zhang, Runze Hu, Ronghui Li et al.

ECCV 2024 arXiv:2311.16254

#6211

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024 arXiv:2407.08931

#6212

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

CVPR 2024 arXiv:2212.05315

#6213

Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

Lior Talker, Aviad Cohen, Erez Yosef et al.

ICML 2024 oral arXiv:2406.12409

#6214

Translation Equivariant Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.

ICML 2024 arXiv:2404.00924

#6215

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.

ICML 2024 arXiv:2405.16417

#6216

CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection

Lin Zhu, Yifeng Yang, Qinying Gu et al.

AAAI 2024 paper arXiv:2402.03561

#6217

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.

CVPR 2024 arXiv:2404.01727

#6218

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

Haoxiang Ma, Modi Shi, Boyang Gao et al.

ICLR 2024 arXiv:2405.18861

#6219

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao et al.

#6220

PTMQ: Post-training Multi-Bit Quantization of Neural Networks

Ke Xu, Zhongcheng Li, Shanshan Wang et al.

ICML 2024 spotlight arXiv:2402.08530

#6221

A Distributional Analogue to the Successor Representation

Harley Wiltzer, Jesse Farebrother, Arthur Gretton et al.

ECCV 2024 arXiv:2407.10200

#6222

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo Feng, Wenguan Wang, Ruijie Quan et al.

#6223

Characteristics Matching Based Hash Codes Generation for Efficient Fine-grained Image Retrieval

Zhen-Duo Chen, Li-Jun Zhao, Zi-Chao Zhang et al.

CVPR 2024

ECCV 2024 arXiv:2407.01332

#6224

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Fadi Boutros, Vitomir Struc, Naser Damer

#6225

Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency

Fuhao Li, Ziyang Gong, Yupeng Deng et al.

#6226

Multi-View Dynamic Reflection Prior for Video Glass Surface Detection

Fang Liu, Yuhao Liu, Jiaying Lin et al.

CVPR 2024 arXiv:2403.01619

#6227

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Tianyu Luan, Zhong Li, Lele Chen et al.

ICLR 2024 arXiv:2403.11348

#6228

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICML 2024 arXiv:2407.16560

#6229

COALA: A Practical and Vision-Centric Federated Learning Platform

Weiming Zhuang, Jian Xu, Chen Chen et al.

ECCV 2024 arXiv:2308.11487

#6230

Free Lunch for Gait Recognition: A Novel Relation Descriptor

Jilong Wang, Saihui Hou, Yan Huang et al.

ECCV 2024 arXiv:2407.11532

#6231

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ICML 2024 spotlight arXiv:2405.03064

#6232

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

Zelei Cheng, Xian Wu, Jiahao Yu et al.

CVPR 2024 arXiv:2403.01781

#6233

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning

Tung Le, Khai Nguyen, Shanlin Sun et al.

ECCV 2024 arXiv:2407.13342

#6234

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024 arXiv:2312.08291

#6235

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ICLR 2024 arXiv:2310.02832

#6236

Out-of-Distribution Detection by Leveraging Between-Layer Transformation Smoothness

Fran Jelenić, Josip Jukić, Martin Tutek et al.

ICML 2024 arXiv:2406.00294

#6237

Creative Text-to-Audio Generation via Synthesizer Programming

Manuel Cherep, Nikhil Singh, Jessica Shand

ICML 2024 oral arXiv:2402.06187

#6238

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Ruijie Zheng, Yongyuan Liang, Xiyao Wang et al.

ICLR 2024 arXiv:2309.02214

#6239

Improving equilibrium propagation without weight symmetry through Jacobian homeostasis

Axel Laborieux, Friedemann Zenke

#6240

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching

Huatian Zhang, Lei Zhang, Kun Zhang et al.

ICLR 2024 arXiv:2310.00115

#6241

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks

Yanqiao Zhu, Jeehyun Hwang, Keir Adams et al.

CVPR 2024 arXiv:2404.00974

#6242

Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

Hyeongjun Kwon, Jinhyun Jang, Jin Kim et al.

AAAI 2024 paper arXiv:2403.05117

#6243

Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

Hang Du, Xuejun Yan, Jingjing Wang et al.

#6244

CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Sheng Yu, Dihua Zhai, Yuanqing Xia

AAAI 2024 paper arXiv:2312.11936

#6245

Exact ASP Counting with Compact Encodings

Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel

ICLR 2024 arXiv:2401.17992

#6246

Multilinear Operator Networks

Yixin Cheng, Grigorios Chrysos, Markos Georgopoulos et al.

AAAI 2024 paper arXiv:2210.05742

#6247

Curved Representation Space of Vision Transformers

Juyeop Kim, Junha Park, Songkuk Kim et al.

ICLR 2024 arXiv:2311.04640

#6248

Object-Centric Learning with Slot Mixture Module

Daniil Kirilenko, Vitaliy Vorobyov, Aleksey Kovalev et al.

CVPR 2024 arXiv:2403.18708

#6249

Dense Vision Transformer Compression with Few Samples

Hanxiao Zhang, Yifan Zhou, Guo-Hua Wang

AAAI 2024 paper arXiv:2407.09787

#6250

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

AAAI 2024 paper arXiv:2312.08057

#6251

Combinatorial Stochastic-Greedy Bandit

Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini et al.

ICLR 2024 arXiv:2403.09274

#6252

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

Mingyuan Sun, Donghao Zhang, Zongyuan Ge et al.

#6253

Self-Training Based Few-Shot Node Classification by Knowledge Distillation

Zongqian Wu, Yujie Mo, Peng Zhou et al.

ECCV 2024 arXiv:2405.02508

#6254

Rasterized Edge Gradients: Handling Discontinuities Differentially

Stanislav Pidhorskyi, Tomas Simon, Gabriel Schwartz et al.

#6255

De-biased Attention Supervision for Text Classification with Causality

Yiquan Wu, Yifei Liu, Ziyu Zhao et al.

ICML 2024 arXiv:2405.16646

#6256

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.

CVPR 2024 arXiv:2404.09819

#6257

3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow

Felix Taubner, Prashant Raina, Mathieu Tuli et al.

ICLR 2024 spotlight arXiv:2212.07699

#6258

Retrieval-based Disentangled Representation Learning with Natural Language Supervision

Jiawei Zhou, Xiaoguang Li, Lifeng Shang et al.

CVPR 2024 arXiv:2307.00761

#6259

Learning Degradation-Independent Representations for Camera ISP Pipelines

Yanhui Guo, Fangzhou Luo, Xiaolin Wu

CVPR 2024 arXiv:2403.11463

#6260

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

Chaolei Tan, Jianhuang Lai, Wei-Shi Zheng et al.

AAAI 2024 paper arXiv:2311.07745

#6261

Simplifying Complex Observation Models in Continuous POMDP Planning with Probabilistic Guarantees and Practice

Idan Lev-Yehudi, Moran Barenboim, Vadim Indelman

AAAI 2024 paper arXiv:2307.05892

#6262

SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views

Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.

CVPR 2024 arXiv:2312.00598

#6263

Learning from One Continuous Video Stream

Joao Carreira, Michael King, Viorica Patraucean et al.

CVPR 2024 arXiv:2403.12821

#6264

FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer

Dongyeong Hwang, Hyunju Kim, Sunwoo Kim et al.

ICML 2024 arXiv:2405.01468

#6265

Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models

Yifei Ming, Sharon Li

ECCV 2024 arXiv:2406.02461

#6266

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong Xu et al.

ICML 2024 arXiv:2405.08540

#6267

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

AAAI 2024 paper arXiv:2308.08644

#6268

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.

ECCV 2024 arXiv:2502.05641

#6269

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

Aayam Shrestha, Pan Liu, German Ros et al.

ECCV 2024 arXiv:2312.04763

#6270

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective

Fangzhou Song, Bin Zhu, Yanbin Hao et al.

ECCV 2024 arXiv:2403.17213

#6271

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

CVPR 2024 arXiv:2404.06244

#6272

Anchor-based Robust Finetuning of Vision-Language Models

Jinwei Han, Zhiwen Lin, Zhongyi Sun et al.

ICLR 2024 oral arXiv:2402.12598

#6273

Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations

Giovanni De Felice, Andrea Cini, Daniele Zambon et al.

ICLR 2024 spotlight arXiv:2401.17099

#6274

MT-Ranker: Reference-free machine translation evaluation by inter-system ranking

Ibraheem Muhammad Moosa, Rui Zhang, Wenpeng Yin

ICML 2024 arXiv:2401.10432

#6275

A2Q+: Improving Accumulator-Aware Weight Quantization

Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.

AAAI 2024 paper arXiv:2312.10802

#6276

GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation

Abhinav Jain, Vaibhav Unhelkar

ECCV 2024 arXiv:2305.14668

#6277

NOVUM: Neural Object Volumes for Robust Object Classification

Artur Jesslen, Guofeng Zhang, Angtian Wang et al.

AAAI 2024 paper arXiv:2312.16451

#6278

Domain Generalization with Vital Phase Augmentation

Ingyun Lee, WooJu Lee, Hyun Myung

ICLR 2024 arXiv:2403.15500

#6279

Gene Regulatory Network Inference in the Presence of Dropouts: a Causal View

Haoyue Dai, Ignavier Ng, Gongxu Luo et al.

ICML 2024 arXiv:2407.09905

#6280

Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Riccardo De Santi, Manish Prajapat, Andreas Krause

AAAI 2024 paper arXiv:2210.14404

#6281

Adversarial Purification with the Manifold Hypothesis

Zhaoyuan Yang, Zhiwei Xu, Jing Zhang et al.

ICML 2024 arXiv:2405.07919

#6282

Exploring the Low-Pass Filtering Behavior in Image Super-Resolution

Haoyu Deng, Zijing Xu, Yule Duan et al.

ICML 2024 arXiv:2405.02225

#6283

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang, Aaron Roth, Linjun Zhang

ECCV 2024 arXiv:2407.09352

#6284

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ICML 2024 arXiv:2405.10531

#6285

Nonparametric Teaching of Implicit Neural Representations

Chen Zhang, Steven T. S. Luo, Jason Chun Lok Li et al.

ICLR 2024 spotlight arXiv:2305.19190

#6286

Inverse Approximation Theory for Nonlinear Recurrent Neural Networks

Shida Wang, Zhong Li, Qianxiao Li

#6287

Efficient Privacy-Preserving Visual Localization Using 3D Ray Clouds

Heejoon Moon, Chunghwan Lee, Je Hyeong Hong

CVPR 2024

ECCV 2024 arXiv:2311.15908

#6288

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ICML 2024 arXiv:2405.18110

#6289

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan Liu, Shibo Chen et al.

AAAI 2024 paper arXiv:2303.09171

#6290

Empowering CAM-Based Methods with Capability to Generate Fine-Grained and High-Faithfulness Explanations

Changqing Qiu, Fusheng Jin, Yining Zhang

CVPR 2024 arXiv:2403.19473

#6291

Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM

Tongyan Hua, Addison Lin Wang

ICML 2024 arXiv:2311.17105

#6292

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ECCV 2024 arXiv:2409.15801

#6293

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ICML 2024 arXiv:2404.04575

#6294

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Zi-Hao Qiu, Siqi Guo, Mao Xu et al.

#6295

Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding

Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.

ICLR 2024 arXiv:2401.10848

#6296

Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation

Prakhar Kaushik, Aayush Mishra, Adam Kortylewski et al.

CVPR 2024 arXiv:2406.08960

#6297

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

Jamie Watson, Filippo Aleotti, Mohamed Sayed et al.

ICLR 2024 arXiv:2302.05049

#6298

Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting

Enyi Jiang, Yibo Jacky Zhang, Sanmi Koyejo

ICLR 2024 arXiv:2310.01973

#6299

Federated Wasserstein Distance

Alain Rakotomamonjy, Kimia Nadjahi, Liva Ralaivola

ICML 2024 spotlight arXiv:2405.07224

#6300

A Geometric Decomposition of Finite Games: Convergence vs. Recurrence under Exponential Weights

Davide Legacci, Panayotis Mertikopoulos, Bary Pradelski

AAAI 2024 paper arXiv:2312.09167

#6301

Maximizing Nash Social Welfare under Two-Sided Preferences

Pallavi Jain, Rohit Vaish

ECCV 2024 arXiv:2404.08330

#6302

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

AAAI 2024 paper arXiv:2403.05660

#6303

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

ICLR 2024 arXiv:2310.03957

#6304

Understanding prompt engineering may not require rethinking generalization

Victor Akinwande, Yiding Jiang, Dylan Sam et al.

CVPR 2024 arXiv:2404.02686

#6305

Design2Cloth: 3D Cloth Generation from 2D Masks

Jiali Zheng, Rolandos Alexandros Potamias, Stefanos Zafeiriou

CVPR 2024 arXiv:2309.16421

#6306

Distilling ODE Solvers of Diffusion Models into Smaller Steps

Sanghwan Kim, Hao Tang, Fisher Yu

CVPR 2024 highlight arXiv:2401.15261

#6307

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

ICML 2024 arXiv:2504.13429

#6308

Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs

Shenzhi Yang, Bin Liang, An Liu et al.

AAAI 2024 paper arXiv:2312.14646

#6309

Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference

Hongda Sun, Hongzhan Lin, Rui Yan

AAAI 2024 paper arXiv:2310.09603

#6310

B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation

Hao Wang, Qiang Song, Ruofeng Yin et al.

AAAI 2024 paper arXiv:2312.08187

#6311

Completing Priceable Committees: Utilitarian and Representation Guarantees for Proportional Multiwinner Voting

Markus Brill, Jannik Peters

ICML 2024 arXiv:2405.02936

#6312

On the Tractability of SHAP Explanations under Markovian Distributions

Reda Marzouk, Colin de la Higuera

ICML 2024 arXiv:2407.09690

#6313

Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses

Changyu Gao, Andrew Lowy, Xingyu Zhou et al.

ECCV 2024 arXiv:2407.12291

#6314

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024 arXiv:2310.05615

#6315

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

AAAI 2024 paper arXiv:2312.05784

#6316

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024 paper arXiv:2309.02923

#6317

Patched Line Segment Learning for Vector Road Mapping

Jiakun Xu, Bowen Xu, Gui-Song Xia et al.

ICML 2024 arXiv:2402.01516

#6318

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Trung Pham, Kang Zhang, Chang Yoo

ICLR 2024 arXiv:2402.11025

#6319

Training Bayesian Neural Networks with Sparse Subspace Variational Inference

Junbo Li, Zichen Miao, Qiang Qiu et al.

ICML 2024 arXiv:2406.01352

#6320

Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Martina G. Vilas, Federico Adolfi, David Poeppel et al.

#6321

Mixture of Weak and Strong Experts on Graphs

Hanqing Zeng, Hanjia Lyu, Diyi Hu et al.

ICLR 2024

ICML 2024 arXiv:2409.08754

#6322

Uncertainty Estimation by Density Aware Evidential Deep Learning

Taeseong Yoon, Heeyoung Kim

ICLR 2024 arXiv:2307.15196

#6323

The Marginal Value of Momentum for Small Learning Rate SGD

Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.

ICML 2024 arXiv:2402.08991

#6324

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Chenlu Ye, Jiafan He, Quanquan Gu et al.

#6325

Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024 arXiv:2403.04908

#6326

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ICLR 2024 oral arXiv:2310.01381

#6327

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Roi Benita, Michael Elad, Joseph Keshet

CVPR 2024 arXiv:2404.09001

#6328

Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households

Zhihao Cao, ZiDong Wang, Siwen Xie et al.

AAAI 2024 paper arXiv:2402.18821

#6329

Debiased Novel Category Discovering and Localization

Juexiao Feng, Yuhong Yang, Yanchun Xie et al.

AAAI 2024 paper arXiv:2401.14919

#6330

PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus

Florian Kluger, Bodo Rosenhahn

#6331

Symmetric Self-Paced Learning for Domain Generalization

Di Zhao, Yun Sing Koh, Gillian Dobbie et al.

ICML 2024 arXiv:2405.08779

#6332

Jacobian Regularizer-based Neural Granger Causality

Wanqi Zhou, Shuanghao Bai, Shujian Yu et al.

ICLR 2024 arXiv:2403.13134

#6333

Robust NAS under adversarial training: benchmark, theory, and beyond

Yongtao Wu, Fanghui Liu, Carl-Johann Simon-Gabriel et al.

ECCV 2024 arXiv:2312.07315

#6334

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ICLR 2024 arXiv:2302.05326

#6335

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks

Khurram Javed, Haseeb Shah, Richard Sutton et al.

ECCV 2024 arXiv:2409.13037

#6336

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

AAAI 2024 paper arXiv:2401.13621

#6337

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

ECCV 2024 arXiv:2407.06704

#6338

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024 arXiv:2407.04345

#6339

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ICLR 2024 arXiv:2310.13225

#6340

Scalable Neural Network Kernels

Arijit Sehanobish, Krzysztof Choromanski, Yunfan Zhao et al.

ECCV 2024 arXiv:2409.13803

#6341

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

ECCV 2024 arXiv:2403.09468

#6342

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

ECCV 2024 arXiv:2409.05688

#6343

LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow

Hongyu Wen, Erich Liang, Jia Deng

ECCV 2024 arXiv:2408.00677

#6344

Scaling Backwards: Minimal Synthetic Pre-training?

Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024 arXiv:2407.21465

#6345

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Kuo Wang, Lechao Cheng, Weikai Chen et al.

ECCV 2024 arXiv:2408.16426

#6346

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

Jiefeng Li, Ye Yuan, Davis Rempe et al.

ECCV 2024 arXiv:2406.18537

#6347

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

ECCV 2024 arXiv:2408.05749

#6348

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024 arXiv:2403.13524

#6349

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024 arXiv:2407.17596

#6350

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ECCV 2024 arXiv:2501.02771

#6351

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024 arXiv:2407.10704

#6352

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

#6353

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

#6354

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024 arXiv:2311.13777

#6355

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024 arXiv:2409.05867

#6356

Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering

Benjamin Attal, Dor Verbin, Ben Mildenhall et al.

ECCV 2024 arXiv:2407.12443

#6357

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024 arXiv:2305.03716

#6358

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024 arXiv:2407.07074

#6359

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

ECCV 2024 arXiv:2403.09638

#6360

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024 arXiv:2407.02665

#6361

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024 arXiv:2403.11586

#6362

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

#6363

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024 arXiv:2407.06514

#6364

Asymmetric Mask Scheme for Self-Supervised Real Image Denoising

Xiangyu Liao, Tianheng Zheng, Jiayu Zhong et al.

ECCV 2024 arXiv:2407.20228

#6365

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024 arXiv:2309.03244

#6366

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024 arXiv:2407.12939

#6367

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024 arXiv:2407.02047

#6368

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024 arXiv:2405.09883

#6369

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024 arXiv:2407.07402

#6370

ActionVOS: Actions as Prompts for Video Object Segmentation

Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.

ECCV 2024 arXiv:2407.20657

#6371

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024 arXiv:2407.19666

#6372

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

#6373

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024 arXiv:2407.05594

#6374

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

ECCV 2024 arXiv:2403.18820

#6375

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024 arXiv:2311.15562

#6376

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024 arXiv:2408.08671

#6377

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, Siyuan Yang et al.

ECCV 2024 arXiv:2408.10624

#6378

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024 arXiv:2407.05352

#6379

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024 arXiv:2407.16125

#6380

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024 arXiv:2305.15798

#6381

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ECCV 2024 arXiv:2407.10131

#6382

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

Xinjian Wu, Ruisong Zhang, Jie Qin et al.

ECCV 2024 arXiv:2403.13808

#6383

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024 arXiv:2408.00296

#6384

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.

ECCV 2024 arXiv:2403.19238

#6385

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

ECCV 2024 arXiv:2409.18783

#6386

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024 arXiv:2407.08418

#6387

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024 arXiv:2406.08392

#6388

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi Mu, Li Chen, Bohan Chen et al.

ECCV 2024 arXiv:2407.12489

#6389

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024 arXiv:2407.13545

#6390

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024 arXiv:2312.11587

#6391

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

ECCV 2024 arXiv:2411.06344

#6392

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Mubarak Shah

ECCV 2024 arXiv:2311.12090

#6393

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

ECCV 2024 arXiv:2408.02966

#6394

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024 arXiv:2403.12003

#6395

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ECCV 2024 arXiv:2407.15396

#6396

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.

ECCV 2024 arXiv:2407.03036

#6397

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

#6398

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

ECCV 2024 arXiv:2407.04086

#6399

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024 arXiv:2305.03036

#6400

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.