Most Cited 2024 "transformers" Papers

12,324 papers found • Page 32 of 62

#6201

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Fanyue Wei, Wei Zeng, Zhenyang Li et al.

ECCV 2024 • arXiv:2407.06642
10 citations
#6202

Task structure and nonlinearity jointly determine learned representational geometry

Matteo Alleman, Jack Lindsey, Stefano Fusi

ICLR 2024 • arXiv:2401.13558
10 citations
#6203

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

Yiting Chen, Zhanpeng Zhou, Junchi Yan

ICLR 2024 • arXiv:2310.06756
10 citations
#6204

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang, Madison Cooley, Da Long et al.

ICLR 2024 • arXiv:2311.04465
10 citations
#6205

ByteEdit: Boost, Comply and Accelerate Generative Image Editing

Yuxi Ren, Jie Wu, Yanzuo Lu et al.

ECCV 2024 • arXiv:2404.04860
10 citations
#6206

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan Lu et al.

ECCV 2024 • arXiv:2407.07324
10 citations
#6207

Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning

HeeSun Bae, Seungjae Shin, Byeonghu Na et al.

ICLR 2024 • arXiv:2403.02690
10 citations
#6208

Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik

ICML 2024 • arXiv:2405.18217
10 citations
#6209

Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

Kewei Wang, Yizheng Wu, Zhiyu Pan et al.

AAAI 2024 • paper • arXiv:2312.08009
10 citations
#6210

Cross-Modal Match for Language Conditioned 3D Object Grounding

Yachao Zhang, Runze Hu, Ronghui Li et al.

AAAI 2024 • paper
10 citations
#6211

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024 • arXiv:2311.16254
10 citations
#6212

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024 • arXiv:2407.08931
10 citations
#6213

Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

Lior Talker, Aviad Cohen, Erez Yosef et al.

CVPR 2024 • arXiv:2212.05315
10 citations
#6214

Translation Equivariant Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.

ICML 2024 • oral • arXiv:2406.12409
10 citations
#6215

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.

ICML 2024 • arXiv:2404.00924
10 citations
#6216

CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection

Lin Zhu, Yifeng Yang, Qinying Gu et al.

ICML 2024 • arXiv:2405.16417
10 citations
#6217

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.

AAAI 2024 • paper • arXiv:2402.03561
10 citations
#6218

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

Haoxiang Ma, Modi Shi, Boyang Gao et al.

CVPR 2024 • arXiv:2404.01727
10 citations
#6219

Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts

Ruipeng Zhang, Ziqing Fan, Jiangchao Yao et al.

ICLR 2024 • arXiv:2405.18861
10 citations
#6220

PTMQ: Post-training Multi-Bit Quantization of Neural Networks

Ke Xu, Zhongcheng Li, Shanshan Wang et al.

AAAI 2024 • paper
10 citations
#6221

A Distributional Analogue to the Successor Representation

Harley Wiltzer, Jesse Farebrother, Arthur Gretton et al.

ICML 2024 • spotlight • arXiv:2402.08530
10 citations
#6222

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo Feng, Wenguan Wang, Ruijie Quan et al.

ECCV 2024 • arXiv:2407.10200
10 citations
#6223

Characteristics Matching Based Hash Codes Generation for Efficient Fine-grained Image Retrieval

Zhen-Duo Chen, Li-Jun Zhao, Zi-Chao Zhang et al.

CVPR 2024
10 citations
#6224

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Fadi Boutros, Vitomir Struc, Naser Damer

ECCV 2024 • arXiv:2407.01332
10 citations
#6225

Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency

Fuhao Li, Ziyang Gong, Yupeng Deng et al.

AAAI 2024 • paper
10 citations
#6226

Multi-View Dynamic Reflection Prior for Video Glass Surface Detection

Fang Liu, Yuhao Liu, Jiaying Lin et al.

AAAI 2024 • paper
10 citations
#6227

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Tianyu Luan, Zhong Li, Lele Chen et al.

CVPR 2024 • arXiv:2403.01619
10 citations
#6228

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Mintong Kang, Nezihe Merve Gürel, Linyi Li et al.

ICLR 2024 • arXiv:2403.11348
10 citations
#6229

COALA: A Practical and Vision-Centric Federated Learning Platform

Weiming Zhuang, Jian Xu, Chen Chen et al.

ICML 2024 • arXiv:2407.16560
10 citations
#6230

Free Lunch for Gait Recognition: A Novel Relation Descriptor

Jilong Wang, Saihui Hou, Yan Huang et al.

ECCV 2024 • arXiv:2308.11487
10 citations
#6231

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ECCV 2024 • arXiv:2407.11532
10 citations
#6232

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

Zelei Cheng, Xian Wu, Jiahao Yu et al.

ICML 2024 • spotlight • arXiv:2405.03064
10 citations
#6233

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning

Tung Le, Khai Nguyen, Shanlin Sun et al.

CVPR 2024 • arXiv:2403.01781
10 citations
#6234

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024 • arXiv:2407.13342
10 citations
#6235

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024 • arXiv:2312.08291
10 citations
#6236

Out-of-Distribution Detection by Leveraging Between-Layer Transformation Smoothness

Fran Jelenić, Josip Jukić, Martin Tutek et al.

ICLR 2024 • arXiv:2310.02832
10 citations
#6237

Creative Text-to-Audio Generation via Synthesizer Programming

Manuel Cherep, Nikhil Singh, Jessica Shand

ICML 2024 • arXiv:2406.00294
10 citations
#6238

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Ruijie Zheng, Yongyuan Liang, Xiyao Wang et al.

ICML 2024 • oral • arXiv:2402.06187
10 citations
#6239

Improving equilibrium propagation without weight symmetry through Jacobian homeostasis

Axel Laborieux, Friedemann Zenke

ICLR 2024 • arXiv:2309.02214
10 citations
#6240

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching

Huatian Zhang, Lei Zhang, Kun Zhang et al.

AAAI 2024 • paper
10 citations
#6241

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks

Yanqiao Zhu, Jeehyun Hwang, Keir Adams et al.

ICLR 2024 • arXiv:2310.00115
10 citations
#6242

Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

Hyeongjun Kwon, Jinhyun Jang, Jin Kim et al.

CVPR 2024 • arXiv:2404.00974
10 citations
#6243

Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

Hang Du, Xuejun Yan, Jingjing Wang et al.

AAAI 2024 • paper • arXiv:2403.05117
10 citations
#6244

CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Sheng Yu, Dihua Zhai, Yuanqing Xia

AAAI 2024 • paper
10 citations
#6245

Exact ASP Counting with Compact Encodings

Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel

AAAI 2024 • paper • arXiv:2312.11936
10 citations
#6246

Multilinear Operator Networks

Yixin Cheng, Grigorios Chrysos, Markos Georgopoulos et al.

ICLR 2024 • arXiv:2401.17992
10 citations
#6247

Curved Representation Space of Vision Transformers

Juyeop Kim, Junha Park, Songkuk Kim et al.

AAAI 2024 • paper • arXiv:2210.05742
10 citations
#6248

Object-Centric Learning with Slot Mixture Module

Daniil Kirilenko, Vitaliy Vorobyov, Aleksey Kovalev et al.

ICLR 2024 • arXiv:2311.04640
10 citations
#6249

Dense Vision Transformer Compression with Few Samples

Hanxiao Zhang, Yifan Zhou, Guo-Hua Wang

CVPR 2024 • arXiv:2403.18708
10 citations
#6250

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

AAAI 2024 • paper • arXiv:2407.09787
10 citations
#6251

Combinatorial Stochastic-Greedy Bandit

Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini et al.

AAAI 2024 • paper • arXiv:2312.08057
10 citations
#6252

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

Mingyuan Sun, Donghao Zhang, Zongyuan Ge et al.

ICLR 2024 • arXiv:2403.09274
10 citations
#6253

Self-Training Based Few-Shot Node Classification by Knowledge Distillation

Zongqian Wu, Yujie Mo, Peng Zhou et al.

AAAI 2024 • paper
10 citations
#6254

Rasterized Edge Gradients: Handling Discontinuities Differentially

Stanislav Pidhorskyi, Tomas Simon, Gabriel Schwartz et al.

ECCV 2024 • arXiv:2405.02508
10 citations
#6255

De-biased Attention Supervision for Text Classification with Causality

Yiquan Wu, Yifei Liu, Ziyu Zhao et al.

AAAI 2024 • paper
10 citations
#6256

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.

ICML 2024 • arXiv:2405.16646
10 citations
#6257

3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow

Felix Taubner, Prashant Raina, Mathieu Tuli et al.

CVPR 2024 • arXiv:2404.09819
10 citations
#6258

Retrieval-based Disentangled Representation Learning with Natural Language Supervision

Jiawei Zhou, Xiaoguang Li, Lifeng Shang et al.

ICLR 2024 • spotlight • arXiv:2212.07699
10 citations
#6259

Learning Degradation-Independent Representations for Camera ISP Pipelines

Yanhui Guo, Fangzhou Luo, Xiaolin Wu

CVPR 2024 • arXiv:2307.00761
10 citations
#6260

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

Chaolei Tan, Jianhuang Lai, Wei-Shi Zheng et al.

CVPR 2024 • arXiv:2403.11463
10 citations
#6261

Simplifying Complex Observation Models in Continuous POMDP Planning with Probabilistic Guarantees and Practice

Idan Lev-Yehudi, Moran Barenboim, Vadim Indelman

AAAI 2024 • paper • arXiv:2311.07745
10 citations
#6262

SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views

Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.

AAAI 2024 • paper • arXiv:2307.05892
10 citations
#6263

Learning from One Continuous Video Stream

Joao Carreira, Michael King, Viorica Patraucean et al.

CVPR 2024 • arXiv:2312.00598
10 citations
#6264

FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer

Dongyeong Hwang, Hyunju Kim, Sunwoo Kim et al.

CVPR 2024 • arXiv:2403.12821
10 citations
#6265

Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models

Yifei Ming, Sharon Li

ICML 2024 • arXiv:2405.01468
10 citations
#6266

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong Xu et al.

ECCV 2024 • arXiv:2406.02461
10 citations
#6267

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

ICML 2024 • arXiv:2405.08540
10 citations
#6268

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.

AAAI 2024 • paper • arXiv:2308.08644
10 citations
#6269

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

Aayam Shrestha, Pan Liu, German Ros et al.

ECCV 2024 • arXiv:2502.05641
10 citations
#6270

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective

Fangzhou Song, Bin Zhu, Yanbin Hao et al.

ECCV 2024 • arXiv:2312.04763
10 citations
#6271

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024 • arXiv:2403.17213
10 citations
#6272

Anchor-based Robust Finetuning of Vision-Language Models

Jinwei Han, Zhiwen Lin, Zhongyisun Sun et al.

CVPR 2024 • arXiv:2404.06244
10 citations
#6273

Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations

Giovanni De Felice, Andrea Cini, Daniele Zambon et al.

ICLR 2024 • oral • arXiv:2402.12598
10 citations
#6274

MT-Ranker: Reference-free machine translation evaluation by inter-system ranking

Ibraheem Muhammad Moosa, Rui Zhang, Wenpeng Yin

ICLR 2024 • spotlight • arXiv:2401.17099
10 citations
#6275

A2Q+: Improving Accumulator-Aware Weight Quantization

Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.

ICML 2024 • arXiv:2401.10432
10 citations
#6276

GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation

Abhinav Jain, Vaibhav Unhelkar

AAAI 2024 • paper • arXiv:2312.10802
10 citations
#6277

NOVUM: Neural Object Volumes for Robust Object Classification

Artur Jesslen, Guofeng Zhang, Angtian Wang et al.

ECCV 2024 • arXiv:2305.14668
10 citations
#6278

Domain Generalization with Vital Phase Augmentation

Ingyun Lee, WooJu Lee, Hyun Myung

AAAI 2024 • paper • arXiv:2312.16451
10 citations
#6279

Gene Regulatory Network Inference in the Presence of Dropouts: a Causal View

Haoyue Dai, Ignavier Ng, Gongxu Luo et al.

ICLR 2024 • arXiv:2403.15500
10 citations
#6280

Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Riccardo De Santi, Manish Prajapat, Andreas Krause

ICML 2024 • arXiv:2407.09905
10 citations
#6281

Adversarial Purification with the Manifold Hypothesis

Zhaoyuan Yang, Zhiwei Xu, Jing Zhang et al.

AAAI 2024 • paper • arXiv:2210.14404
10 citations
#6282

Exploring the Low-Pass Filtering Behavior in Image Super-Resolution

Haoyu Deng, Zijing Xu, Yule Duan et al.

ICML 2024 • arXiv:2405.07919
10 citations
#6283

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang, Aaron Roth, Linjun Zhang

ICML 2024 • arXiv:2405.02225
10 citations
#6284

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ECCV 2024 • arXiv:2407.09352
10 citations
#6285

Nonparametric Teaching of Implicit Neural Representations

Chen Zhang, Steven T. S. Luo, Jason Chun Lok Li et al.

ICML 2024 • arXiv:2405.10531
10 citations
#6286

Inverse Approximation Theory for Nonlinear Recurrent Neural Networks

Shida Wang, Zhong Li, Qianxiao Li

ICLR 2024 • spotlight • arXiv:2305.19190
10 citations
#6287

Efficient Privacy-Preserving Visual Localization Using 3D Ray Clouds

Heejoon Moon, Chunghwan Lee, Je Hyeong Hong

CVPR 2024
10 citations
#6288

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ECCV 2024 • arXiv:2311.15908
10 citations
#6289

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan Liu, Shibo Chen et al.

ICML 2024 • arXiv:2405.18110
10 citations
#6290

Empowering CAM-Based Methods with Capability to Generate Fine-Grained and High-Faithfulness Explanations

Changqing Qiu, Fusheng Jin, Yining Zhang

AAAI 2024 • paper • arXiv:2303.09171
10 citations
#6291

Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM

Tongyan Hua, Addison Lin Wang

CVPR 2024 • arXiv:2403.19473
10 citations
#6292

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024 • arXiv:2311.17105
10 citations
#6293

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024 • arXiv:2409.15801
10 citations
#6294

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Zi-Hao Qiu, Siqi Guo, Mao Xu et al.

ICML 2024 • arXiv:2404.04575
10 citations
#6295

Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding

Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.

AAAI 2024 • paper
10 citations
#6296

Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation

Prakhar Kaushik, Aayush Mishra, Adam Kortylewski et al.

ICLR 2024 • arXiv:2401.10848
10 citations
#6297

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

Jamie Watson, Filippo Aleotti, Mohamed Sayed et al.

CVPR 2024 • arXiv:2406.08960
10 citations
#6298

Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting

Enyi Jiang, Yibo Jacky Zhang, Sanmi Koyejo

ICLR 2024 • arXiv:2302.05049
10 citations
#6299

Federated Wasserstein Distance

Alain Rakotomamonjy, Kimia Nadjahi, Liva Ralaivola

ICLR 2024 • arXiv:2310.01973
10 citations
#6300

A Geometric Decomposition of Finite Games: Convergence vs. Recurrence under Exponential Weights

Davide Legacci, Panayotis Mertikopoulos, Bary Pradelski

ICML 2024 • spotlight • arXiv:2405.07224
10 citations
#6301

Maximizing Nash Social Welfare under Two-Sided Preferences

Pallavi Jain, Rohit Vaish

AAAI 2024 • paper • arXiv:2312.09167
10 citations
#6302

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

ECCV 2024 • arXiv:2404.08330
10 citations
#6303

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

AAAI 2024 • paper • arXiv:2403.05660
10 citations
#6304

Understanding prompt engineering may not require rethinking generalization

Victor Akinwande, Yiding Jiang, Dylan Sam et al.

ICLR 2024 • arXiv:2310.03957
10 citations
#6305

Design2Cloth: 3D Cloth Generation from 2D Masks

Jiali Zheng, Rolandos Alexandros Potamias, Stefanos Zafeiriou

CVPR 2024 • arXiv:2404.02686
10 citations
#6306

Distilling ODE Solvers of Diffusion Models into Smaller Steps

Sanghwan Kim, Hao Tang, Fisher Yu

CVPR 2024 • arXiv:2309.16421
10 citations
#6307

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

CVPR 2024 • highlight • arXiv:2401.15261
10 citations
#6308

Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs

Shenzhi Yang, Bin Liang, An Liu et al.

ICML 2024 • arXiv:2504.13429
10 citations
#6309

Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference

Hongda Sun, Hongzhan Lin, Rui Yan

AAAI 2024 • paper • arXiv:2312.14646
10 citations
#6310

B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation

Hao Wang, Qiang Song, Ruofeng Yin et al.

AAAI 2024 • paper • arXiv:2310.09603
10 citations
#6311

Completing Priceable Committees: Utilitarian and Representation Guarantees for Proportional Multiwinner Voting

Markus Brill, Jannik Peters

AAAI 2024 • paper • arXiv:2312.08187
10 citations
#6312

On the Tractability of SHAP Explanations under Markovian Distributions

Reda Marzouk, Colin de la Higuera

ICML 2024 • arXiv:2405.02936
10 citations
#6313

Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses

Changyu Gao, Andrew Lowy, Xingyu Zhou et al.

ICML 2024 • arXiv:2407.09690
10 citations
#6314

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024 • arXiv:2407.12291
10 citations
#6315

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024 • arXiv:2310.05615
10 citations
#6316

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024 • paper • arXiv:2312.05784
10 citations
#6317

Patched Line Segment Learning for Vector Road Mapping

Jiakun Xu, Bowen Xu, Gui-Song Xia et al.

AAAI 2024 • paper • arXiv:2309.02923
10 citations
#6318

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Trung Pham, Kang Zhang, Chang Yoo

ICML 2024 • arXiv:2402.01516
10 citations
#6319

Training Bayesian Neural Networks with Sparse Subspace Variational Inference

Junbo Li, Zichen Miao, Qiang Qiu et al.

ICLR 2024 • arXiv:2402.11025
10 citations
#6320

Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Martina G. Vilas, Federico Adolfi, David Poeppel et al.

ICML 2024 • arXiv:2406.01352
10 citations
#6321

Mixture of Weak and Strong Experts on Graphs

Hanqing Zeng, Hanjia Lyu, Diyi Hu et al.

ICLR 2024
10 citations
#6322

Uncertainty Estimation by Density Aware Evidential Deep Learning

Taeseong Yoon, Heeyoung Kim

ICML 2024 • arXiv:2409.08754
10 citations
#6323

The Marginal Value of Momentum for Small Learning Rate SGD

Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.

ICLR 2024 • arXiv:2307.15196
10 citations
#6324

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Chenlu Ye, Jiafan He, Quanquan Gu et al.

ICML 2024 • arXiv:2402.08991
10 citations
#6325

Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024
10 citations
#6326

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024 • arXiv:2403.04908
10 citations
#6327

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Roi Benita, Michael Elad, Joseph Keshet

ICLR 2024 • oral • arXiv:2310.01381
10 citations
#6328

Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households

Zhihao Cao, ZiDong Wang, Siwen Xie et al.

CVPR 2024 • arXiv:2404.09001
10 citations
#6329

Debiased Novel Category Discovering and Localization

Juexiao Feng, Yuhong Yang, Yanchun Xie et al.

AAAI 2024 • paper • arXiv:2402.18821
10 citations
#6330

PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus

Florian Kluger, Bodo Rosenhahn

AAAI 2024 • paper • arXiv:2401.14919
10 citations
#6331

Symmetric Self-Paced Learning for Domain Generalization

Di Zhao, Yun Sing Koh, Gillian Dobbie et al.

AAAI 2024 • paper
10 citations
#6332

Jacobian Regularizer-based Neural Granger Causality

Wanqi Zhou, Shuanghao Bai, Shujian Yu et al.

ICML 2024 • arXiv:2405.08779
10 citations
#6333

Robust NAS under adversarial training: benchmark, theory, and beyond

Yongtao Wu, Fanghui Liu, Carl-Johann Simon-Gabriel et al.

ICLR 2024 • arXiv:2403.13134
10 citations
#6334

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024 • arXiv:2312.07315
10 citations
#6335

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks

Khurram Javed, Haseeb Shah, Richard Sutton et al.

ICLR 2024 • arXiv:2302.05326
10 citations
#6336

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

ECCV 2024 • arXiv:2409.13037
10 citations
#6337

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

AAAI 2024 • paper • arXiv:2401.13621
10 citations
#6338

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024 • arXiv:2407.06704
10 citations
#6339

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ECCV 2024 • arXiv:2407.04345
10 citations
#6340

Scalable Neural Network Kernels

Arijit Sehanobish, Krzysztof Choromanski, Yunfan Zhao et al.

ICLR 2024 • arXiv:2310.13225
9 citations
#6341

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

ECCV 2024 • arXiv:2409.13803
9 citations
#6342

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

ECCV 2024 • arXiv:2403.09468
9 citations
#6343

LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow

Hongyu Wen, Erich Liang, Jia Deng

ECCV 2024 • arXiv:2409.05688
9 citations
#6344

Scaling Backwards: Minimal Synthetic Pre-training?

Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024 • arXiv:2408.00677
9 citations
#6345

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Kuo Wang, Lechao Cheng, Weikai Chen et al.

ECCV 2024 • arXiv:2407.21465
9 citations
#6346

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

Jiefeng Li, Ye Yuan, Davis Rempe et al.

ECCV 2024 • arXiv:2408.16426
9 citations
#6347

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

ECCV 2024 • arXiv:2406.18537
9 citations
#6348

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024 • arXiv:2408.05749
9 citations
#6349

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024 • arXiv:2403.13524
9 citations
#6350

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ECCV 2024 • arXiv:2407.17596
9 citations
#6351

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024 • arXiv:2501.02771
9 citations
#6352

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

ECCV 2024 • arXiv:2407.10704
9 citations
#6353

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ECCV 2024
9 citations
#6354

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024
9 citations
#6355

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024 • arXiv:2311.13777
9 citations
#6356

Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering

Benjamin Attal, Dor Verbin, Ben Mildenhall et al.

ECCV 2024 • arXiv:2409.05867
9 citations
#6357

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024 • arXiv:2407.12443
9 citations
#6358

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024 • arXiv:2305.03716
9 citations
#6359

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

ECCV 2024 • arXiv:2407.07074
9 citations
#6360

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024 • arXiv:2403.09638
9 citations
#6361

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024 • arXiv:2407.02665
9 citations
#6362

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024 • arXiv:2403.11586
9 citations
#6363

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024
9 citations
#6364

Asymmetric Mask Scheme for Self-Supervised Real Image Denoising

Xiangyu Liao, Tianheng Zheng, Jiayu Zhong et al.

ECCV 2024 • arXiv:2407.06514
9 citations
#6365

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024 • arXiv:2407.20228
9 citations
#6366

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024 • arXiv:2309.03244
9 citations
#6367

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024 • arXiv:2407.12939
9 citations
#6368

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024 • arXiv:2407.02047
9 citations
#6369

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024 • arXiv:2405.09883
9 citations
#6370

ActionVOS: Actions as Prompts for Video Object Segmentation

Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.

ECCV 2024 • arXiv:2407.07402
9 citations
#6371

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024 • arXiv:2407.20657
9 citations
#6372

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

ECCV 2024 • arXiv:2407.19666
9 citations
#6373

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024
9 citations
#6374

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

ECCV 2024 • arXiv:2407.05594
9 citations
#6375

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024 • arXiv:2403.18820
9 citations
#6376

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024 • arXiv:2311.15562
9 citations
#6377

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, Siyuan Yang et al.

ECCV 2024 • arXiv:2408.08671
9 citations
#6378

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024 • arXiv:2408.10624
9 citations
#6379

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024 • arXiv:2407.05352
9 citations
#6380

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024 • arXiv:2407.16125
9 citations
#6381

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ECCV 2024 • arXiv:2305.15798
9 citations
#6382

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

Xinjian Wu, Ruisong Zhang, Jie Qin et al.

ECCV 2024 • arXiv:2407.10131
9 citations
#6383

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024 • arXiv:2403.13808
9 citations
#6384

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.

ECCV 2024 • arXiv:2408.00296
9 citations
#6385

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

ECCV 2024 • arXiv:2403.19238
9 citations
#6386

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024 • arXiv:2409.18783
9 citations
#6387

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024 • arXiv:2407.08418
9 citations
#6388

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi Mu, Li Chen, Bohan Chen et al.

ECCV 2024 • arXiv:2406.08392
9 citations
#6389

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024 • arXiv:2407.12489
9 citations
#6390

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024 • arXiv:2407.13545
9 citations
#6391

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

ECCV 2024 • arXiv:2312.11587
9 citations
#6392

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Mubarak Shah

ECCV 2024 • arXiv:2411.06344
9 citations
#6393

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

ECCV 2024 • arXiv:2311.12090
9 citations
#6394

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024 • arXiv:2408.02966
9 citations
#6395

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ECCV 2024 • arXiv:2403.12003
9 citations
#6396

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.

ECCV 2024 • arXiv:2407.15396
9 citations
#6397

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

ECCV 2024 • arXiv:2407.03036
9 citations
#6398

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

ECCV 2024
9 citations
#6399

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024 • arXiv:2407.04086
9 citations
#6400

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.

ECCV 2024 • arXiv:2305.03036
9 citations