Most Cited ICML "dense local features" Papers

5,975 papers found • Page 2 of 30

#201

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Xixi Wu, Yifei Shen, Fangzhou Ge et al.

ICML 2025posterarXiv:2502.00829
9
citations
#202

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Simon Park, Abhishek Panigrahi, Yun Cheng et al.

ICML 2025posterarXiv:2501.02669
9
citations
#203

Can Classic GNNs Be Strong Baselines for Graph-level Tasks? Simple Architectures Meet Excellence

Yuankai Luo, Lei Shi, Xiao-Ming Wu

ICML 2025posterarXiv:2502.09263
9
citations
#204

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Da Xiao, Qingye Meng, Shengping Li et al.

ICML 2025posterarXiv:2502.12170
9
citations
#205

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025posterarXiv:2402.05806
9
citations
#206

Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection

Matteo Zecchin, Sangwoo Park, Osvaldo Simeone

ICML 2025spotlightarXiv:2409.15844
8
citations
#207

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025posterarXiv:2504.05304
8
citations
#208

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Han Zhong, Yutong Yin, Shenao Zhang et al.

ICML 2025posterarXiv:2501.18858
8
citations
#209

GaussMark: A Practical Approach for Structural Watermarking of Language Models

Adam Block, Alexander Rakhlin, Ayush Sekhari

ICML 2025posterarXiv:2501.13941
8
citations
#210

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen et al.

ICML 2025posterarXiv:2406.08477
8
citations
#211

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen, Xing Hu, Dawei Yang et al.

ICML 2025posterarXiv:2505.03804
8
citations
#212

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Yun Qu, Cheems Wang, Yixiu Mao et al.

ICML 2025posterarXiv:2504.19139
8
citations
#213

A Closer Look at Multimodal Representation Collapse

Abhra Chaudhuri, Anjan Dutta, Tu Bui et al.

ICML 2025spotlightarXiv:2505.22483
8
citations
#214

Self-Discriminative Modeling for Anomalous Graph Detection

Jinyu Cai, Yunhe Zhang, Jicong Fan

ICML 2025posterarXiv:2310.06261
8
citations
#215

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Yuanhe Zhang, Fanghui Liu, Yudong Chen

ICML 2025oralarXiv:2502.01235
8
citations
#216

De-mark: Watermark Removal in Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo et al.

ICML 2025posterarXiv:2410.13808
8
citations
#217

Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks

Lukas Braun, Erin Grant, Andrew Saxe

ICML 2025spotlight
8
citations
#218

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

Qinglin Zhu, Runcong Zhao, Hanqi Yan et al.

ICML 2025spotlightarXiv:2505.24688
8
citations
#219

Effective and Efficient Masked Image Generation Models

Zebin You, Jingyang Ou, Xiaolu Zhang et al.

ICML 2025posterarXiv:2503.07197
8
citations
#220

AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses

Nicholas Carlini, Edoardo Debenedetti, Javier Rando et al.

ICML 2025oralarXiv:2503.01811
8
citations
#221

Secant Line Search for Frank-Wolfe Algorithms

Deborah Hendrych, Sebastian Pokutta, Mathieu Besançon et al.

ICML 2025posterarXiv:2501.18775
8
citations
#222

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

Seongho Son, William Bankes, Sayak Ray Chowdhury et al.

ICML 2025oralarXiv:2407.18676
8
citations
#223

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Yixin Liu, Lie Lu, Jihui Jin et al.

ICML 2025oralarXiv:2502.04230
7
citations
#224

Learning Adaptive Lighting via Channel-Aware Guidance

Qirui Yang, Peng-Tao Jiang, Hao Zhang et al.

ICML 2025posterarXiv:2412.01493
7
citations
#225

Position: The Most Expensive Part of an LLM *should* be its Training Data

Nikhil Kandpal, Colin Raffel

ICML 2025posterarXiv:2504.12427
7
citations
#226

Towards Robustness and Explainability of Automatic Algorithm Selection

Xingyu Wu, Jibin Wu, Yu Zhou et al.

ICML 2025spotlight
7
citations
#227

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025spotlightarXiv:2407.11784
7
citations
#228

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025spotlightarXiv:2505.24445
7
citations
#229

Vision-Language Models Create Cross-Modal Task Representations

Grace Luo, Trevor Darrell, Amir Bar

ICML 2025posterarXiv:2410.22330
7
citations
#230

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

David Heurtel-Depeiges, Anian Ruoss, Joel Veness et al.

ICML 2025posterarXiv:2410.05078
7
citations
#231

Impossible Videos

Zechen Bai, Hai Ci, Mike Zheng Shou

ICML 2025oralarXiv:2503.14378
7
citations
#232

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025oralarXiv:2504.19496
7
citations
#233

Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion

Tianyuan Zou, Yang Liu, Peng Li et al.

ICML 2025posterarXiv:2502.00245
7
citations
#234

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Minh-Tung Luu, Younghwan Lee, Donghoon Lee et al.

ICML 2025posterarXiv:2506.12822
7
citations
#235

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Abdulkadir Gokce, Martin Schrimpf

ICML 2025oralarXiv:2411.05712
7
citations
#236

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025posterarXiv:2501.18537
7
citations
#237

Privacy Attacks on Image AutoRegressive Models

Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch et al.

ICML 2025posterarXiv:2502.02514
7
citations
#238

Robust and Conjugate Spatio-Temporal Gaussian Processes

William Laplante, Matias Altamirano, Andrew Duncan et al.

ICML 2025oralarXiv:2502.02450
7
citations
#239

Perception in Reflection

Yana Wei, Liang Zhao, Kangheng Lin et al.

ICML 2025posterarXiv:2504.07165
7
citations
#240

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Tuomas Oikarinen, Ge Yan, Lily Weng

ICML 2025posterarXiv:2506.05774
7
citations
#241

Selective Prompt Anchoring for Code Generation

Yuan Tian, Tianyi Zhang

ICML 2025posterarXiv:2408.09121
7
citations
#242

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Chenbei Lu, Laixi Shi, Zaiwei Chen et al.

ICML 2025posterarXiv:2411.07591
7
citations
#243

Ultra-Resolution Adaptation with Ease

Ruonan Yu, Songhua Liu, Zhenxiong Tan et al.

ICML 2025posterarXiv:2503.16322
7
citations
#244

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Max Wilcoxson, Qiyang Li, Kevin Frans et al.

ICML 2025posterarXiv:2410.18076
7
citations
#245

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Artavazd Maranjyan, Alexander Tyurin, Peter Richtarik

ICML 2025posterarXiv:2501.16168
7
citations
#246

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

Kunlun Xu, Xu Zou, Gang Hua et al.

ICML 2025posterarXiv:2505.04575
7
citations
#247

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Alexander Capstick, Rahul G. Krishnan, Payam Barnaghi

ICML 2025posterarXiv:2411.17284
7
citations
#248

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025posterarXiv:2502.07587
7
citations
#249

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Erpai Luo, Xinran Wei, Lin Huang et al.

ICML 2025posterarXiv:2502.01171
7
citations
#250

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Matteo Sesia, vladimir svetnik

ICML 2025spotlightarXiv:2412.09729
7
citations
#251

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Jingyu Liu, Beidi Chen, Ce Zhang

ICML 2025posterarXiv:2502.02789
7
citations
#252

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior

Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner et al.

ICML 2025posterarXiv:2410.16665
7
citations
#253

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025spotlightarXiv:2507.07544
7
citations
#254

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025posterarXiv:2506.05615
6
citations
#255

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Hongwei Li, Yuheng Tang, Shiqi Wang et al.

ICML 2025posterarXiv:2502.02747
6
citations
#256

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025posterarXiv:2502.08991
6
citations
#257

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025posterarXiv:2503.04429
6
citations
#258

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Liang CHEN, Xueting Han, Li Shen et al.

ICML 2025posterarXiv:2506.03850
6
citations
#259

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Sen Xing, Muyan Zhong, Zeqiang Lai et al.

ICML 2025posterarXiv:2412.01271
6
citations
#260

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025posterarXiv:2502.04079
6
citations
#261

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025oralarXiv:2406.19593
6
citations
#262

Efficient Distributed Optimization under Heavy-Tailed Noise

Su Hyeong Lee, Manzil Zaheer, Tian Li

ICML 2025posterarXiv:2502.04164
6
citations
#263

PROXSPARSE: REGULARIZED LEARNING OF SEMI-STRUCTURED SPARSITY MASKS FOR PRETRAINED LLMS

Hongyi Liu, Rajarshi Saha, Zhen Jia et al.

ICML 2025posterarXiv:2502.00258
6
citations
#264

Prediction-Powered E-Values

Daniel Csillag, Claudio Struchiner, Guilherme Tegoni Goedert

ICML 2025posterarXiv:2502.04294
6
citations
#265

QT-DoG: Quantization-Aware Training for Domain Generalization

Saqib Javed, Hieu Le, Mathieu Salzmann

ICML 2025posterarXiv:2410.06020
6
citations
#266

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.

ICML 2025poster
6
citations
#267

Hierarchical Graph Tokenization for Molecule-Language Alignment

Yongqiang Chen, QUANMING YAO, Juzheng Zhang et al.

ICML 2025posterarXiv:2406.14021
6
citations
#268

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025posterarXiv:2405.06575
6
citations
#269

ROPO: Robust Preference Optimization for Large Language Models

Xize Liang, Chao Chen, Shuang Qiu et al.

ICML 2025posterarXiv:2404.04102
6
citations
#270

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng et al.

ICML 2025posterarXiv:2410.09016
6
citations
#271

LLMs can see and hear without any training

Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen et al.

ICML 2025posterarXiv:2501.18096
6
citations
#272

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Rui Yang, Lin Song, Yicheng Xiao et al.

ICML 2025posterarXiv:2503.14694
6
citations
#273

Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

Zhitong Xu, Da Long, Yiming Xu et al.

ICML 2025posterarXiv:2410.11165
6
citations
#274

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025posterarXiv:2405.17527
6
citations
#275

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025posterarXiv:2502.00874
6
citations
#276

Language Models over Canonical Byte-Pair Encodings

Tim Vieira, Tianyu Liu, Clemente Pasti et al.

ICML 2025posterarXiv:2506.07956
6
citations
#277

OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition

Zheng Lian, Haiyang Sun, Licai Sun et al.

ICML 2025posterarXiv:2410.01495
6
citations
#278

Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

Ke Zhu, Shu Yang, Xiaofei Wang

ICML 2025posterarXiv:2410.11713
6
citations
#279

Variational Control for Guidance in Diffusion Models

Kushagra Pandey, Farrin Marouf Sofian, Felix Draxler et al.

ICML 2025posterarXiv:2502.03686
6
citations
#280

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147
6
citations
#281

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Hantao Lou, Changye Li, Jiaming Ji et al.

ICML 2025posterarXiv:2502.17514
6
citations
#282

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025posterarXiv:2502.03032
6
citations
#283

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025oralarXiv:2410.05026
6
citations
#284

LLM-Augmented Chemical Synthesis and Design Decision Programs

Haorui Wang, Jeff Guo, Lingkai Kong et al.

ICML 2025posterarXiv:2505.07027
6
citations
#285

Hyperband-based Bayesian Optimization for Black-box Prompt Selection

Lennart Schneider, Martin Wistuba, Aaron Klein et al.

ICML 2025posterarXiv:2412.07820
6
citations
#286

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025posterarXiv:2110.06257
6
citations
#287

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025posterarXiv:2412.11044
6
citations
#288

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025posterarXiv:2411.02083
6
citations
#289

Volume Optimality in Conformal Prediction with Structured Prediction Sets

Chao Gao, Liren Shan, Vaidehi Srinivas et al.

ICML 2025posterarXiv:2502.16658
6
citations
#290

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025posterarXiv:2407.09297
6
citations
#291

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025oralarXiv:2506.17204
6
citations
#292

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation

Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch et al.

ICML 2025oralarXiv:2501.18771
6
citations
#293

Learning from Integral Losses in Physics Informed Neural Networks

Ehsan Saleh, Saba Ghaffari, Timothy Bretl et al.

ICML 2024posterarXiv:2305.17387
6
citations
#294

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

Zhanke Zhou, Xiao Feng, Zhaocheng Zhu et al.

ICML 2025posterarXiv:2506.08295
6
citations
#295

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025oralarXiv:2502.20260
6
citations
#296

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025posterarXiv:2411.18612
6
citations
#297

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Clément Bonet, Christophe Vauthier, Anna Korba

ICML 2025oralarXiv:2506.07534
6
citations
#298

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICML 2025posterarXiv:2411.00171
6
citations
#299

TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state

Xiaowen Ma, Zhen-Liang Ni, Shuai Xiao et al.

ICML 2025oralarXiv:2505.20774
6
citations
#300

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025spotlightarXiv:2507.01204
6
citations
#301

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025posterarXiv:2412.10827
6
citations
#302

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICML 2025posterarXiv:2506.15385
5
citations
#303

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025posterarXiv:2507.05502
5
citations
#304

Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities

Yifang Chen, Xiaoyu Li, Yingyu Liang et al.

ICML 2025poster
5
citations
#305

KV Shifting Attention Enhances Language Modeling

Mingyu Xu, Bingning Wang, Weipeng Chen

ICML 2025oralarXiv:2411.19574
5
citations
#306

In-Context Learning and Occam's Razor

Eric Elmoznino, Tom Marty, Tejas Kasetty et al.

ICML 2025posterarXiv:2410.14086
5
citations
#307

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model

Zixiang Ai, Zichen Liu, Yuanhang Lei et al.

ICML 2025posterarXiv:2505.04119
5
citations
#308

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025posterarXiv:2505.16441
5
citations
#309

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025posterarXiv:2501.12053
5
citations
#310

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025posterarXiv:2506.04870
5
citations
#311

Contextual Online Decision Making with Infinite-Dimensional Functional Regression

Haichen Hu, Rui Ai, Stephen Bates et al.

ICML 2025posterarXiv:2501.18359
5
citations
#312

On Volume Minimization in Conformal Regression

Batiste Le Bars, Pierre Humbert

ICML 2025posterarXiv:2502.09985
5
citations
#313

On the Robustness of Reward Models for Language Model Alignment

Jiwoo Hong, Noah Lee, Eunki Kim et al.

ICML 2025posterarXiv:2505.07271
5
citations
#314

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Michael Kirchhof, James Thornton, Louis Béthune et al.

ICML 2025posterarXiv:2410.06025
5
citations
#315

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025posterarXiv:2502.01954
5
citations
#316

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy

Kaixuan Xu, Jiajun Chai, Sicheng Li et al.

ICML 2025posterarXiv:2506.09655
5
citations
#317

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025posterarXiv:2505.02118
5
citations
#318

Continuous Visual Autoregressive Generation via Score Maximization

Chenze Shao, Fandong Meng, Jie Zhou

ICML 2025posterarXiv:2505.07812
5
citations
#319

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025posterarXiv:2503.11842
5
citations
#320

Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

xihong yang, Siwei Wang, Fangdi Wang et al.

ICML 2025spotlightarXiv:2505.21387
5
citations
#321

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong HUANG, Guangtao Zeng, Jianbo Dai et al.

ICML 2025posterarXiv:2410.10209
5
citations
#322

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning

Aaditya Naik, Jason Liu, Claire Wang et al.

ICML 2025posterarXiv:2410.03348
5
citations
#323

Scaling Laws for Upcycling Mixture-of-Experts Language Models

Seng Pei Liew, Takuya Kato, Sho Takase

ICML 2025posterarXiv:2502.03009
5
citations
#324

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025oralarXiv:2502.05164
5
citations
#325

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025posterarXiv:2502.10436
5
citations
#326

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025posterarXiv:2410.08194
5
citations
#327

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICML 2025posterarXiv:2506.19031
5
citations
#328

Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation

Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.

ICML 2025spotlightarXiv:2507.11789
5
citations
#329

Embedding Safety into RL: A New Take on Trust Region Methods

Nikola Milosevic, Johannes Müller, Nico Scherf

ICML 2025posterarXiv:2411.02957
5
citations
#330

On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains

Xun Xian, Ganghua Wang, Xuan Bi et al.

ICML 2025posterarXiv:2409.17275
5
citations
#331

Understanding Generalization in Quantum Machine Learning with Margins

TAK HUR, Daniel Kyungdeock Park

ICML 2025posterarXiv:2411.06919
5
citations
#332

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025posterarXiv:2505.01874
5
citations
#333

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025posterarXiv:2502.01342
5
citations
#334

Temporal Difference Flows

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni et al.

ICML 2025oralarXiv:2503.09817
5
citations
#335

LLMScan: Causal Scan for LLM Misbehavior Detection

Mengdi Zhang, Goh Kiat, Peixin Zhang et al.

ICML 2025posterarXiv:2410.16638
5
citations
#336

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

ICML 2025posterarXiv:2407.20444
5
citations
#337

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025posterarXiv:2502.04757
5
citations
#338

What makes an Ensemble (Un) Interpretable?

Shahaf Bassan, Guy Amir, Meirav Zehavi et al.

ICML 2025posterarXiv:2506.08216
5
citations
#339

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025posterarXiv:2411.09858
5
citations
#340

Tree-Sliced Wasserstein Distance with Nonlinear Projection

Thanh Tran, Viet Hoang Tran, Thanh Chu et al.

ICML 2025posterarXiv:2505.00968
5
citations
#341

TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer

Lusen Zhao, Zihan Huang, Ding Jianhao et al.

ICML 2025poster
5
citations
#342

Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo

Idan Achituve, Hai Victor Habi, Amir Rosenfeld et al.

ICML 2025posterarXiv:2502.05908
5
citations
#343

Position: Build Agent Advocates, Not Platform Agents

Sayash Kapoor, Noam Kolt, Seth Lazar

ICML 2025poster
5
citations
#344

(How) Can Transformers Predict Pseudo-Random Numbers?

Tao Tao, Darshil Doshi, Dayal Singh Kalra et al.

ICML 2025posterarXiv:2502.10390
5
citations
#345

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025posterarXiv:2503.15748
5
citations
#346

Scaling Laws for Floating–Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025poster
5
citations
#347

General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization

Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky

ICML 2025oralarXiv:2411.07061
5
citations
#348

Correlated Errors in Large Language Models

Elliot Myunghoon Kim, Avi Garg, Kenny Peng et al.

ICML 2025posterarXiv:2506.07962
5
citations
#349

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025posterarXiv:2410.20210
5
citations
#350

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025posterarXiv:2410.08067
5
citations
#351

Tensor Product Neural Networks for Functional ANOVA Model

Seokhun Park, Insung Kong, yongchan Choi et al.

ICML 2025posterarXiv:2502.15215
4
citations
#352

Enhancing Decision-Making of Large Language Models via Actor-Critic

Heng Dong, Kefei Duan, Chongjie Zhang

ICML 2025posterarXiv:2506.06376
4
citations
#353

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

Michael Sun, Weize Yuan, Gang Liu et al.

ICML 2025posterarXiv:2505.22948
4
citations
#354

Point-Level Topological Representation Learning on Point Clouds

Vincent P. Grande, Michael Schaub

ICML 2025posterarXiv:2406.02300
4
citations
#355

ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization

Wenhao Shen, Wanqi Yin, Xiaofeng Yang et al.

ICML 2025posterarXiv:2505.10250
4
citations
#356

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Minghao Fu, Guo-Hua Wang, Liangfu Cao et al.

ICML 2025posterarXiv:2502.12579
4
citations
#357

Understanding Model Ensemble in Transferable Adversarial Attack

Wei Yao, Zeliang Zhang, Huayi Tang et al.

ICML 2025posterarXiv:2410.06851
4
citations
#358

Position: Lifetime tuning is incompatible with continual reinforcement learning

Golnaz Mesbahi, Parham Mohammad Panahi, Olya Mastikhina et al.

ICML 2025posterarXiv:2404.02113
4
citations
#359

X-Hacking: The Threat of Misguided AutoML

Rahul Sharma, Sumantrak Mukherjee, Andrea Šipka et al.

ICML 2025poster
4
citations
#360

Representations Shape Weak-to-Strong Generalization: Theoretical Insights and Empirical Predictions

Yihao Xue, Jiping Li, Baharan Mirzasoleiman

ICML 2025posterarXiv:2502.00620
4
citations
#361

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Loris Gaven, Thomas Carta, Clément Romac et al.

ICML 2025posterarXiv:2502.07709
4
citations
#362

TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation

Gwen Yidou-Weng, Benjie Wang, Guy Van den Broeck

ICML 2025poster
4
citations
#363

Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes

Dongjae Jeon, Dueun Kim, Albert No

ICML 2025spotlightarXiv:2412.04140
4
citations
#364

Collapse-Proof Non-Contrastive Self-Supervised Learning

EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars

ICML 2025posterarXiv:2410.04959
4
citations
#365

M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Embedding Predictive Architecture

Hongyang Lei, Xiaolong Cheng, Qi Qin et al.

ICML 2025posterarXiv:2409.05929
4
citations
#366

Prune 'n Predict: Optimizing LLM Decision-making with Conformal Prediction

Harit Vishwakarma, Alan Mishler, Thomas Cook et al.

ICML 2025posterarXiv:2501.00555
4
citations
#367

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

Jiawei Huang, Bingcong Li, Christoph Dann et al.

ICML 2025posterarXiv:2502.19255
4
citations
#368

Policy Design for Two-sided Platforms with Participation Dynamics

Haruka Kiyohara, Fan Yao, Sarah Dean

ICML 2025posterarXiv:2502.01792
4
citations
#369

CALM: Consensus-Aware Localized Merging for Multi-Task Learning

Kunda Yan, Min Zhang, Sen Cui et al.

ICML 2025posterarXiv:2506.13406
4
citations
#370

Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs

Faisal Hamman, Sachindra P Dissanayake, Saumitra Mishra et al.

ICML 2025posterarXiv:2407.04173
4
citations
#371

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

Shibo Jie, Yehui Tang, Kai Han et al.

ICML 2025posterarXiv:2503.16163
4
citations
#372

SlimLLM: Accurate Structured Pruning for Large Language Models

Jialong Guo, Xinghao Chen, Yehui Tang et al.

ICML 2025posterarXiv:2505.22689
4
citations
#373

Nested Expectations with Kernel Quadrature

Zonghao Chen, Masha Naslidnyk, Francois-Xavier Briol

ICML 2025posterarXiv:2502.18284
4
citations
#374

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Ali Behrouz, Ali Parviz, Mahdi Karami et al.

ICML 2025posterarXiv:2411.15671
4
citations
#375

Blink of an eye: a simple theory for feature localization in generative models

Marvin Li, Aayush Karan, Sitan Chen

ICML 2025oralarXiv:2502.00921
4
citations
#376

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Avery Ma, Yangchen Pan, Amir-massoud Farahmand

ICML 2025spotlightarXiv:2502.01925
4
citations
#377

Learning Dynamics in Continual Pre-Training for Large Language Models

Xingjin Wang, Howe Tissue, Lu Wang et al.

ICML 2025oralarXiv:2505.07796
4
citations
#378

Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF

Nuoya Xiong, Aarti Singh

ICML 2025posterarXiv:2502.15145
4
citations
#379

CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

Junbo Yin, Chao Zha, Wenjia He et al.

ICML 2025posterarXiv:2505.22869
4
citations
#380

LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification

Yiding Lu, Mouxing Yang, Dezhong Peng et al.

ICML 2025posterarXiv:2504.10174
4
citations
#381

Controlled Generation with Equivariant Variational Flow Matching

Floor Eijkelboom, Heiko Zimmermann, Sharvaree Vadgama et al.

ICML 2025posterarXiv:2506.18340
4
citations
#382

Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner

Chenyou Fan, Chenjia Bai, Zhao Shan et al.

ICML 2025posterarXiv:2409.19949
4
citations
#383

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICML 2025posterarXiv:2502.18699
4
citations
#384

Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs

Haoming Yang, Ke Ma, Xiaojun Jia et al.

ICML 2025posterarXiv:2505.02862
4
citations
#385

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set

Xinyu Liu, Zixuan Xie, Shangtong Zhang

ICML 2025posterarXiv:2501.19254
4
citations
#386

Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment

Chengting Yu, Xiaochen Zhao, Lei Liu et al.

ICML 2025oralarXiv:2501.15925
4
citations
#387

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Cansu Sancaktar, Christian Gumbsch, Andrii Zadaianchuk et al.

ICML 2025posterarXiv:2503.01584
4
citations
#388

AssistanceZero: Scalably Solving Assistance Games

Cassidy Laidlaw, Eli Bronstein, Timothy Guo et al.

ICML 2025posterarXiv:2504.07091
4
citations
#389

Transformative or Conservative? Conservation laws for ResNets and Transformers

Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré

ICML 2025oralarXiv:2506.06194
4
citations
#390

On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists

Dongyang Fan, Bettina Messmer, Nikita Doikov et al.

ICML 2025posterarXiv:2409.13931
4
citations
#391

DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis

Yinghao Li, Rithesh Kumar, Zeyu Jin

ICML 2025oralarXiv:2410.11097
4
citations
#392

Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

Laixi Shi, Jingchu Gai, Eric Mazumdar et al.

ICML 2025oralarXiv:2409.20067
4
citations
#393

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025posterarXiv:2410.22944
4
citations
#394

Generalized Venn and Venn-Abers Calibration with Applications in Conformal Prediction

Lars van der Laan, Ahmed Alaa

ICML 2025posterarXiv:2502.05676
4
citations
#395

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark

Bingchen Miao, Yang Wu, Minghe Gao et al.

ICML 2025posterarXiv:2503.18665
4
citations
#396

A Theoretical Framework For Overfitting In Energy-based Modeling

Giovanni Catania, Aurélien Decelle, Cyril Furtlehner et al.

ICML 2025posterarXiv:2501.19158
4
citations
#397

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

ICML 2025posterarXiv:2410.09795
4
citations
#398

Gradient Boosting Reinforcement Learning

Benjamin Fuhrer, Chen Tessler, Gal Dalal

ICML 2025posterarXiv:2407.08250
4
citations
#399

Improving Multimodal Learning Balance and Sufficiency through Data Remixing

Xiaoyu Ma, Hao Chen, Yongjian Deng

ICML 2025posterarXiv:2506.11550
4
citations
#400

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

Mingyang Sun, Pengxiang Ding, Weinan Zhang et al.

ICML 2025posterarXiv:2502.12631
4
citations