Most Cited ICML "dense local features" Papers

5,975 papers found • Page 2 of 30

Filters:Most Cited ICML dense local features Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#201

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Xixi Wu, Yifei Shen, Fangzhou Ge et al.

ICML 2025posterarXiv:2502.00829

citations

#202

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Simon Park, Abhishek Panigrahi, Yun Cheng et al.

ICML 2025posterarXiv:2501.02669

citations

#203

Can Classic GNNs Be Strong Baselines for Graph-level Tasks? Simple Architectures Meet Excellence

Yuankai Luo, Lei Shi, Xiao-Ming Wu

ICML 2025posterarXiv:2502.09263

citations

#204

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Da Xiao, Qingye Meng, Shengping Li et al.

ICML 2025posterarXiv:2502.12170

citations

#205

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025posterarXiv:2402.05806

citations

#206

Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection

Matteo Zecchin, Sangwoo Park, Osvaldo Simeone

ICML 2025spotlightarXiv:2409.15844

citations

#207

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025posterarXiv:2504.05304

citations

#208

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Han Zhong, Yutong Yin, Shenao Zhang et al.

ICML 2025posterarXiv:2501.18858

citations

#209

GaussMark: A Practical Approach for Structural Watermarking of Language Models

Adam Block, Alexander Rakhlin, Ayush Sekhari

ICML 2025posterarXiv:2501.13941

citations

#210

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen et al.

ICML 2025posterarXiv:2406.08477

citations

#211

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen, Xing Hu, Dawei Yang et al.

ICML 2025posterarXiv:2505.03804

citations

#212

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Yun Qu, Cheems Wang, Yixiu Mao et al.

ICML 2025posterarXiv:2504.19139

citations

#213

A Closer Look at Multimodal Representation Collapse

Abhra Chaudhuri, Anjan Dutta, Tu Bui et al.

ICML 2025spotlightarXiv:2505.22483

citations

#214

Self-Discriminative Modeling for Anomalous Graph Detection

Jinyu Cai, Yunhe Zhang, Jicong Fan

ICML 2025posterarXiv:2310.06261

citations

#215

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Yuanhe Zhang, Fanghui Liu, Yudong Chen

ICML 2025oralarXiv:2502.01235

citations

#216

De-mark: Watermark Removal in Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo et al.

ICML 2025posterarXiv:2410.13808

citations

#217

Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks

Lukas Braun, Erin Grant, Andrew Saxe

ICML 2025spotlight

citations

#218

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

Qinglin Zhu, Runcong Zhao, Hanqi Yan et al.

ICML 2025spotlightarXiv:2505.24688

citations

#219

Effective and Efficient Masked Image Generation Models

Zebin You, Jingyang Ou, Xiaolu Zhang et al.

ICML 2025posterarXiv:2503.07197

citations

#220

AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses

Nicholas Carlini, Edoardo Debenedetti, Javier Rando et al.

ICML 2025oralarXiv:2503.01811

citations

#221

Secant Line Search for Frank-Wolfe Algorithms

Deborah Hendrych, Sebastian Pokutta, Mathieu Besançon et al.

ICML 2025posterarXiv:2501.18775

citations

#222

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

Seongho Son, William Bankes, Sayak Ray Chowdhury et al.

ICML 2025oralarXiv:2407.18676

citations

#223

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Yixin Liu, Lie Lu, Jihui Jin et al.

ICML 2025oralarXiv:2502.04230

citations

#224

Learning Adaptive Lighting via Channel-Aware Guidance

Qirui Yang, Peng-Tao Jiang, Hao Zhang et al.

ICML 2025posterarXiv:2412.01493

citations

#225

Position: The Most Expensive Part of an LLM should be its Training Data

Nikhil Kandpal, Colin Raffel

ICML 2025posterarXiv:2504.12427

citations

#226

Towards Robustness and Explainability of Automatic Algorithm Selection

Xingyu Wu, Jibin Wu, Yu Zhou et al.

ICML 2025spotlight

citations

#227

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025spotlightarXiv:2407.11784

citations

#228

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025spotlightarXiv:2505.24445

citations

#229

Vision-Language Models Create Cross-Modal Task Representations

Grace Luo, Trevor Darrell, Amir Bar

ICML 2025posterarXiv:2410.22330

citations

#230

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

David Heurtel-Depeiges, Anian Ruoss, Joel Veness et al.

ICML 2025posterarXiv:2410.05078

citations

#231

Impossible Videos

Zechen Bai, Hai Ci, Mike Zheng Shou

ICML 2025oralarXiv:2503.14378

citations

#232

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025oralarXiv:2504.19496

citations

#233

Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion

Tianyuan Zou, Yang Liu, Peng Li et al.

ICML 2025posterarXiv:2502.00245

citations

#234

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Minh-Tung Luu, Younghwan Lee, Donghoon Lee et al.

ICML 2025posterarXiv:2506.12822

citations

#235

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Abdulkadir Gokce, Martin Schrimpf

ICML 2025oralarXiv:2411.05712

citations

#236

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025posterarXiv:2501.18537

citations

#237

Privacy Attacks on Image AutoRegressive Models

Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch et al.

ICML 2025posterarXiv:2502.02514

citations

#238

Robust and Conjugate Spatio-Temporal Gaussian Processes

William Laplante, Matias Altamirano, Andrew Duncan et al.

ICML 2025oralarXiv:2502.02450

citations

#239

Perception in Reflection

Yana Wei, Liang Zhao, Kangheng Lin et al.

ICML 2025posterarXiv:2504.07165

citations

#240

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Tuomas Oikarinen, Ge Yan, Lily Weng

ICML 2025posterarXiv:2506.05774

citations

#241

Selective Prompt Anchoring for Code Generation

Yuan Tian, Tianyi Zhang

ICML 2025posterarXiv:2408.09121

citations

#242

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Chenbei Lu, Laixi Shi, Zaiwei Chen et al.

ICML 2025posterarXiv:2411.07591

citations

#243

Ultra-Resolution Adaptation with Ease

Ruonan Yu, Songhua Liu, Zhenxiong Tan et al.

ICML 2025posterarXiv:2503.16322

citations

#244

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Max Wilcoxson, Qiyang Li, Kevin Frans et al.

ICML 2025posterarXiv:2410.18076

citations

#245

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Artavazd Maranjyan, Alexander Tyurin, Peter Richtarik

ICML 2025posterarXiv:2501.16168

citations

#246

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

Kunlun Xu, Xu Zou, Gang Hua et al.

ICML 2025posterarXiv:2505.04575

citations

#247

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Alexander Capstick, Rahul G. Krishnan, Payam Barnaghi

ICML 2025posterarXiv:2411.17284

citations

#248

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025posterarXiv:2502.07587

citations

#249

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Erpai Luo, Xinran Wei, Lin Huang et al.

ICML 2025posterarXiv:2502.01171

citations

#250

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Matteo Sesia, vladimir svetnik

ICML 2025spotlightarXiv:2412.09729

citations

#251

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Jingyu Liu, Beidi Chen, Ce Zhang

ICML 2025posterarXiv:2502.02789

citations

#252

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior

Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner et al.

ICML 2025posterarXiv:2410.16665

citations

#253

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025spotlightarXiv:2507.07544

citations

#254

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025posterarXiv:2506.05615

citations

#255

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Hongwei Li, Yuheng Tang, Shiqi Wang et al.

ICML 2025posterarXiv:2502.02747

citations

#256

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025posterarXiv:2502.08991

citations

#257

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025posterarXiv:2503.04429

citations

#258

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Liang CHEN, Xueting Han, Li Shen et al.

ICML 2025posterarXiv:2506.03850

citations

#259

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Sen Xing, Muyan Zhong, Zeqiang Lai et al.

ICML 2025posterarXiv:2412.01271

citations

#260

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025posterarXiv:2502.04079

citations

#261

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025oralarXiv:2406.19593

citations

#262

Efficient Distributed Optimization under Heavy-Tailed Noise

Su Hyeong Lee, Manzil Zaheer, Tian Li

ICML 2025posterarXiv:2502.04164

citations

#263

PROXSPARSE: REGULARIZED LEARNING OF SEMI-STRUCTURED SPARSITY MASKS FOR PRETRAINED LLMS

Hongyi Liu, Rajarshi Saha, Zhen Jia et al.

ICML 2025posterarXiv:2502.00258

citations

#264

Prediction-Powered E-Values

Daniel Csillag, Claudio Struchiner, Guilherme Tegoni Goedert

ICML 2025posterarXiv:2502.04294

citations

#265

QT-DoG: Quantization-Aware Training for Domain Generalization

Saqib Javed, Hieu Le, Mathieu Salzmann

ICML 2025posterarXiv:2410.06020

citations

#266

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.

ICML 2025poster

citations

#267

Hierarchical Graph Tokenization for Molecule-Language Alignment

Yongqiang Chen, QUANMING YAO, Juzheng Zhang et al.

ICML 2025posterarXiv:2406.14021

citations

#268

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025posterarXiv:2405.06575

citations

#269

ROPO: Robust Preference Optimization for Large Language Models

Xize Liang, Chao Chen, Shuang Qiu et al.

ICML 2025posterarXiv:2404.04102

citations

#270

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng et al.

ICML 2025posterarXiv:2410.09016

citations

#271

LLMs can see and hear without any training

Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen et al.

ICML 2025posterarXiv:2501.18096

citations

#272

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Rui Yang, Lin Song, Yicheng Xiao et al.

ICML 2025posterarXiv:2503.14694

citations

#273

Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

Zhitong Xu, Da Long, Yiming Xu et al.

ICML 2025posterarXiv:2410.11165

citations

#274

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025posterarXiv:2405.17527

citations

#275

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025posterarXiv:2502.00874

citations

#276

Language Models over Canonical Byte-Pair Encodings

Tim Vieira, Tianyu Liu, Clemente Pasti et al.

ICML 2025posterarXiv:2506.07956

citations

#277

OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition

Zheng Lian, Haiyang Sun, Licai Sun et al.

ICML 2025posterarXiv:2410.01495

citations

#278

Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

Ke Zhu, Shu Yang, Xiaofei Wang

ICML 2025posterarXiv:2410.11713

citations

#279

Variational Control for Guidance in Diffusion Models

Kushagra Pandey, Farrin Marouf Sofian, Felix Draxler et al.

ICML 2025posterarXiv:2502.03686

citations

#280

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147

citations

#281

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Hantao Lou, Changye Li, Jiaming Ji et al.

ICML 2025posterarXiv:2502.17514

citations

#282

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025posterarXiv:2502.03032

citations

#283

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025oralarXiv:2410.05026

citations

#284

LLM-Augmented Chemical Synthesis and Design Decision Programs

Haorui Wang, Jeff Guo, Lingkai Kong et al.

ICML 2025posterarXiv:2505.07027

citations

#285

Hyperband-based Bayesian Optimization for Black-box Prompt Selection

Lennart Schneider, Martin Wistuba, Aaron Klein et al.

ICML 2025posterarXiv:2412.07820

citations

#286

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025posterarXiv:2110.06257

citations

#287

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025posterarXiv:2412.11044

citations

#288

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025posterarXiv:2411.02083

citations

#289

Volume Optimality in Conformal Prediction with Structured Prediction Sets

Chao Gao, Liren Shan, Vaidehi Srinivas et al.

ICML 2025posterarXiv:2502.16658

citations

#290

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025posterarXiv:2407.09297

citations

#291

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025oralarXiv:2506.17204

citations

#292

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation

Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch et al.

ICML 2025oralarXiv:2501.18771

citations

#293

Learning from Integral Losses in Physics Informed Neural Networks

Ehsan Saleh, Saba Ghaffari, Timothy Bretl et al.

ICML 2024posterarXiv:2305.17387

citations

#294

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

Zhanke Zhou, Xiao Feng, Zhaocheng Zhu et al.

ICML 2025posterarXiv:2506.08295

citations

#295

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025oralarXiv:2502.20260

citations

#296

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025posterarXiv:2411.18612

citations

#297

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Clément Bonet, Christophe Vauthier, Anna Korba

ICML 2025oralarXiv:2506.07534

citations

#298

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICML 2025posterarXiv:2411.00171

citations

#299

TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state

Xiaowen Ma, Zhen-Liang Ni, Shuai Xiao et al.

ICML 2025oralarXiv:2505.20774

citations

#300

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025spotlightarXiv:2507.01204

citations

#301

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025posterarXiv:2412.10827

citations

#302

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICML 2025posterarXiv:2506.15385

citations

#303

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025posterarXiv:2507.05502

citations

#304

Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities

Yifang Chen, Xiaoyu Li, Yingyu Liang et al.

ICML 2025poster

citations

#305

KV Shifting Attention Enhances Language Modeling

Mingyu Xu, Bingning Wang, Weipeng Chen

ICML 2025oralarXiv:2411.19574

citations

#306

In-Context Learning and Occam's Razor

Eric Elmoznino, Tom Marty, Tejas Kasetty et al.

ICML 2025posterarXiv:2410.14086

citations

#307

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model

Zixiang Ai, Zichen Liu, Yuanhang Lei et al.

ICML 2025posterarXiv:2505.04119

citations

#308

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025posterarXiv:2505.16441

citations

#309

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025posterarXiv:2501.12053

citations

#310

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025posterarXiv:2506.04870

citations

#311

Contextual Online Decision Making with Infinite-Dimensional Functional Regression

Haichen Hu, Rui Ai, Stephen Bates et al.

ICML 2025posterarXiv:2501.18359

citations

#312

On Volume Minimization in Conformal Regression

Batiste Le Bars, Pierre Humbert

ICML 2025posterarXiv:2502.09985

citations

#313

On the Robustness of Reward Models for Language Model Alignment

Jiwoo Hong, Noah Lee, Eunki Kim et al.

ICML 2025posterarXiv:2505.07271

citations

#314

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Michael Kirchhof, James Thornton, Louis Béthune et al.

ICML 2025posterarXiv:2410.06025

citations

#315

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025posterarXiv:2502.01954

citations

#316

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy

Kaixuan Xu, Jiajun Chai, Sicheng Li et al.

ICML 2025posterarXiv:2506.09655

citations

#317

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025posterarXiv:2505.02118

citations

#318

Continuous Visual Autoregressive Generation via Score Maximization

Chenze Shao, Fandong Meng, Jie Zhou

ICML 2025posterarXiv:2505.07812

citations

#319

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025posterarXiv:2503.11842

citations

#320

Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

xihong yang, Siwei Wang, Fangdi Wang et al.

ICML 2025spotlightarXiv:2505.21387

citations

#321

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong HUANG, Guangtao Zeng, Jianbo Dai et al.

ICML 2025posterarXiv:2410.10209

citations

#322

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning

Aaditya Naik, Jason Liu, Claire Wang et al.

ICML 2025posterarXiv:2410.03348

citations

#323

Scaling Laws for Upcycling Mixture-of-Experts Language Models

Seng Pei Liew, Takuya Kato, Sho Takase

ICML 2025posterarXiv:2502.03009

citations

#324

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025oralarXiv:2502.05164

citations

#325

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025posterarXiv:2502.10436

citations

#326

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025posterarXiv:2410.08194

citations

#327

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICML 2025posterarXiv:2506.19031

citations

#328

Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation

Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.

ICML 2025spotlightarXiv:2507.11789

citations

#329

Embedding Safety into RL: A New Take on Trust Region Methods

Nikola Milosevic, Johannes Müller, Nico Scherf

ICML 2025posterarXiv:2411.02957

citations

#330

On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains

Xun Xian, Ganghua Wang, Xuan Bi et al.

ICML 2025posterarXiv:2409.17275

citations

#331

Understanding Generalization in Quantum Machine Learning with Margins

TAK HUR, Daniel Kyungdeock Park

ICML 2025posterarXiv:2411.06919

citations

#332

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025posterarXiv:2505.01874

citations

#333

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025posterarXiv:2502.01342

citations

#334

Temporal Difference Flows

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni et al.

ICML 2025oralarXiv:2503.09817

citations

#335

LLMScan: Causal Scan for LLM Misbehavior Detection

Mengdi Zhang, Goh Kiat, Peixin Zhang et al.

ICML 2025posterarXiv:2410.16638

citations

#336

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

ICML 2025posterarXiv:2407.20444

citations

#337

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025posterarXiv:2502.04757

citations

#338

What makes an Ensemble (Un) Interpretable?

Shahaf Bassan, Guy Amir, Meirav Zehavi et al.

ICML 2025posterarXiv:2506.08216

citations

#339

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025posterarXiv:2411.09858

citations

#340

Tree-Sliced Wasserstein Distance with Nonlinear Projection

Thanh Tran, Viet Hoang Tran, Thanh Chu et al.

ICML 2025posterarXiv:2505.00968

citations

#341

TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer

Lusen Zhao, Zihan Huang, Ding Jianhao et al.

ICML 2025poster

citations

#342

Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo

Idan Achituve, Hai Victor Habi, Amir Rosenfeld et al.

ICML 2025posterarXiv:2502.05908

citations

#343

Position: Build Agent Advocates, Not Platform Agents

Sayash Kapoor, Noam Kolt, Seth Lazar

ICML 2025poster

citations

#344

(How) Can Transformers Predict Pseudo-Random Numbers?

Tao Tao, Darshil Doshi, Dayal Singh Kalra et al.

ICML 2025posterarXiv:2502.10390

citations

#345

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025posterarXiv:2503.15748

citations

#346

Scaling Laws for Floating–Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025poster

citations

#347

General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization

Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky

ICML 2025oralarXiv:2411.07061

citations

#348

Correlated Errors in Large Language Models

Elliot Myunghoon Kim, Avi Garg, Kenny Peng et al.

ICML 2025posterarXiv:2506.07962

citations

#349

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025posterarXiv:2410.20210

citations

#350

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025posterarXiv:2410.08067

citations

#351

Tensor Product Neural Networks for Functional ANOVA Model

Seokhun Park, Insung Kong, yongchan Choi et al.

ICML 2025posterarXiv:2502.15215

citations

#352

Enhancing Decision-Making of Large Language Models via Actor-Critic

Heng Dong, Kefei Duan, Chongjie Zhang

ICML 2025posterarXiv:2506.06376

citations

#353

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

Michael Sun, Weize Yuan, Gang Liu et al.

ICML 2025posterarXiv:2505.22948

citations

#354

Point-Level Topological Representation Learning on Point Clouds

Vincent P. Grande, Michael Schaub

ICML 2025posterarXiv:2406.02300

citations

#355

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization

Wenhao Shen, Wanqi Yin, Xiaofeng Yang et al.

ICML 2025posterarXiv:2505.10250

citations

#356

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Minghao Fu, Guo-Hua Wang, Liangfu Cao et al.

ICML 2025posterarXiv:2502.12579

citations

#357

Understanding Model Ensemble in Transferable Adversarial Attack

Wei Yao, Zeliang Zhang, Huayi Tang et al.

ICML 2025posterarXiv:2410.06851

citations

#358

Position: Lifetime tuning is incompatible with continual reinforcement learning

Golnaz Mesbahi, Parham Mohammad Panahi, Olya Mastikhina et al.

ICML 2025posterarXiv:2404.02113

citations

#359

X-Hacking: The Threat of Misguided AutoML

Rahul Sharma, Sumantrak Mukherjee, Andrea Šipka et al.

ICML 2025poster

citations

#360

Representations Shape Weak-to-Strong Generalization: Theoretical Insights and Empirical Predictions

Yihao Xue, Jiping Li, Baharan Mirzasoleiman

ICML 2025posterarXiv:2502.00620

citations

#361

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Loris Gaven, Thomas Carta, Clément Romac et al.

ICML 2025posterarXiv:2502.07709

citations

#362

TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation

Gwen Yidou-Weng, Benjie Wang, Guy Van den Broeck

ICML 2025poster

citations

#363

Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes

Dongjae Jeon, Dueun Kim, Albert No

ICML 2025spotlightarXiv:2412.04140

citations

#364

Collapse-Proof Non-Contrastive Self-Supervised Learning

EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars

ICML 2025posterarXiv:2410.04959

citations

#365

M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Embedding Predictive Architecture

Hongyang Lei, Xiaolong Cheng, Qi Qin et al.

ICML 2025posterarXiv:2409.05929

citations

#366

Prune 'n Predict: Optimizing LLM Decision-making with Conformal Prediction

Harit Vishwakarma, Alan Mishler, Thomas Cook et al.

ICML 2025posterarXiv:2501.00555

citations

#367

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

Jiawei Huang, Bingcong Li, Christoph Dann et al.

ICML 2025posterarXiv:2502.19255

citations

#368

Policy Design for Two-sided Platforms with Participation Dynamics

Haruka Kiyohara, Fan Yao, Sarah Dean

ICML 2025posterarXiv:2502.01792

citations

#369

CALM: Consensus-Aware Localized Merging for Multi-Task Learning

Kunda Yan, Min Zhang, Sen Cui et al.

ICML 2025posterarXiv:2506.13406

citations

#370

Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs

Faisal Hamman, Sachindra P Dissanayake, Saumitra Mishra et al.

ICML 2025posterarXiv:2407.04173

citations

#371

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

Shibo Jie, Yehui Tang, Kai Han et al.

ICML 2025posterarXiv:2503.16163

citations

#372

SlimLLM: Accurate Structured Pruning for Large Language Models

Jialong Guo, Xinghao Chen, Yehui Tang et al.

ICML 2025posterarXiv:2505.22689

citations

#373

Nested Expectations with Kernel Quadrature

Zonghao Chen, Masha Naslidnyk, Francois-Xavier Briol

ICML 2025posterarXiv:2502.18284

citations

#374

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Ali Behrouz, Ali Parviz, Mahdi Karami et al.

ICML 2025posterarXiv:2411.15671

citations

#375

Blink of an eye: a simple theory for feature localization in generative models

Marvin Li, Aayush Karan, Sitan Chen

ICML 2025oralarXiv:2502.00921

citations

#376

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Avery Ma, Yangchen Pan, Amir-massoud Farahmand

ICML 2025spotlightarXiv:2502.01925

citations

#377

Learning Dynamics in Continual Pre-Training for Large Language Models

Xingjin Wang, Howe Tissue, Lu Wang et al.

ICML 2025oralarXiv:2505.07796

citations

#378

Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF

Nuoya Xiong, Aarti Singh

ICML 2025posterarXiv:2502.15145

citations

#379

CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

Junbo Yin, Chao Zha, Wenjia He et al.

ICML 2025posterarXiv:2505.22869

citations

#380

LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification

Yiding Lu, Mouxing Yang, Dezhong Peng et al.

ICML 2025posterarXiv:2504.10174

citations

#381

Controlled Generation with Equivariant Variational Flow Matching

Floor Eijkelboom, Heiko Zimmermann, Sharvaree Vadgama et al.

ICML 2025posterarXiv:2506.18340

citations

#382

Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner

Chenyou Fan, Chenjia Bai, Zhao Shan et al.

ICML 2025posterarXiv:2409.19949

citations

#383

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICML 2025posterarXiv:2502.18699

citations

#384

Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs

Haoming Yang, Ke Ma, Xiaojun Jia et al.

ICML 2025posterarXiv:2505.02862

citations

#385

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set

Xinyu Liu, Zixuan Xie, Shangtong Zhang

ICML 2025posterarXiv:2501.19254

citations

#386

Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment

Chengting Yu, Xiaochen Zhao, Lei Liu et al.

ICML 2025oralarXiv:2501.15925

citations

#387

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Cansu Sancaktar, Christian Gumbsch, Andrii Zadaianchuk et al.

ICML 2025posterarXiv:2503.01584

citations

#388

AssistanceZero: Scalably Solving Assistance Games

Cassidy Laidlaw, Eli Bronstein, Timothy Guo et al.

ICML 2025posterarXiv:2504.07091

citations

#389

Transformative or Conservative? Conservation laws for ResNets and Transformers

Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré

ICML 2025oralarXiv:2506.06194

citations

#390

On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists

Dongyang Fan, Bettina Messmer, Nikita Doikov et al.

ICML 2025posterarXiv:2409.13931

citations

#391

DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis

Yinghao Li, Rithesh Kumar, Zeyu Jin

ICML 2025oralarXiv:2410.11097

citations

#392

Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

Laixi Shi, Jingchu Gai, Eric Mazumdar et al.

ICML 2025oralarXiv:2409.20067

citations

#393

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025posterarXiv:2410.22944

citations

#394

Generalized Venn and Venn-Abers Calibration with Applications in Conformal Prediction

Lars van der Laan, Ahmed Alaa

ICML 2025posterarXiv:2502.05676

citations

#395

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark

Bingchen Miao, Yang Wu, Minghe Gao et al.

ICML 2025posterarXiv:2503.18665

citations

#396

A Theoretical Framework For Overfitting In Energy-based Modeling

Giovanni Catania, Aurélien Decelle, Cyril Furtlehner et al.

ICML 2025posterarXiv:2501.19158

citations

#397

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

ICML 2025posterarXiv:2410.09795

citations

#398

Gradient Boosting Reinforcement Learning

Benjamin Fuhrer, Chen Tessler, Gal Dalal

ICML 2025posterarXiv:2407.08250

citations

#399

Improving Multimodal Learning Balance and Sufficiency through Data Remixing

Xiaoyu Ma, Hao Chen, Yongjian Deng

ICML 2025posterarXiv:2506.11550

citations

#400

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

Mingyang Sun, Pengxiang Ding, Weinan Zhang et al.

ICML 2025posterarXiv:2502.12631

citations

← Previous

1 2 3 4...30

Most Cited ICML "dense local features" Papers

Conference

Paper Type

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Can Classic GNNs Be Strong Baselines for Graph-level Tasks? Simple Architectures Meet Excellence

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection

Gaussian Mixture Flow Matching Models

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

GaussMark: A Practical Approach for Structural Watermarking of Language Models

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

A Closer Look at Multimodal Representation Collapse

Self-Discriminative Modeling for Anomalous Graph Detection

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

De-mark: Watermark Removal in Large Language Models

Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

Effective and Efficient Masked Image Generation Models

AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses

Secant Line Search for Frank-Wolfe Algorithms

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Learning Adaptive Lighting via Channel-Aware Guidance

Position: The Most Expensive Part of an LLM *should* be its Training Data

Towards Robustness and Explainability of Automatic Algorithm Selection

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Learning Safety Constraints for Large Language Models

Vision-Language Models Create Cross-Modal Task Representations

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

Impossible Videos

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Loss Functions and Operators Generated by f-Divergences

Privacy Attacks on Image AutoRegressive Models

Robust and Conjugate Spatio-Temporal Gaussian Processes

Perception in Reflection

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Selective Prompt Anchoring for Code Generation

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Ultra-Resolution Adaptation with Ease

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior

Position: We Need An Algorithmic Understanding of Generative AI

When Maximum Entropy Misleads Policy Optimization

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Activation Space Interventions Can Be Transferred Between Large Language Models

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

DEALing with Image Reconstruction: Deep Attentive Least Squares

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Efficient Distributed Optimization under Heavy-Tailed Noise

PROXSPARSE: REGULARIZED LEARNING OF SEMI-STRUCTURED SPARSITY MASKS FOR PRETRAINED LLMS

Prediction-Powered E-Values

QT-DoG: Quantization-Aware Training for Domain Generalization

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Hierarchical Graph Tokenization for Molecule-Language Alignment

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

ROPO: Robust Preference Optimization for Large Language Models

Parameter-Efficient Fine-Tuning of State Space Models

LLMs can see and hear without any training

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Language Models over Canonical Byte-Pair Encodings

OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition

Position: The Most Expensive Part of an LLM should be its Training Data