Most Cited ICML "latent refinement" Papers

5,975 papers found • Page 2 of 30

#201

Synthesizing Privacy-Preserving Text Data via Finetuning *without* Finetuning Billion-Scale LLMs

Bowen Tan, Zheng Xu, Eric Xing et al.

ICML 2025 • poster • arXiv:2503.12347 • 9 citations
#202

Scaling Laws for Differentially Private Language Models

Ryan McKenna, Yangsibo Huang, Amer Sinha et al.

ICML 2025 • poster • arXiv:2501.18914 • 9 citations
#203

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025 • poster • arXiv:2402.05806 • 9 citations
#204

Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

Filippo Rinaldi, Giacomo Capitani, Lorenzo Bonicelli et al.

ICML 2025 • poster • arXiv:2505.22697 • 9 citations
#205

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Aaditya Singh, Ted Moskovitz, Sara Dragutinović et al.

ICML 2025 • oral • arXiv:2503.05631 • 9 citations
#206

Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks

Lukas Braun, Erin Grant, Andrew Saxe

ICML 2025 • spotlight • 8 citations
#207

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

Seongho Son, William Bankes, Sayak Ray Chowdhury et al.

ICML 2025 • oral • arXiv:2407.18676 • 8 citations
#208

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

Qinglin Zhu, Runcong Zhao, Hanqi Yan et al.

ICML 2025 • spotlight • arXiv:2505.24688 • 8 citations
#209

Effective and Efficient Masked Image Generation Models

Zebin You, Jingyang Ou, Xiaolu Zhang et al.

ICML 2025 • poster • arXiv:2503.07197 • 8 citations
#210

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Yun Qu, Cheems Wang, Yixiu Mao et al.

ICML 2025 • poster • arXiv:2504.19139 • 8 citations
#211

GaussMark: A Practical Approach for Structural Watermarking of Language Models

Adam Block, Alexander Rakhlin, Ayush Sekhari

ICML 2025 • poster • arXiv:2501.13941 • 8 citations
#212

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Han Zhong, Yutong Yin, Shenao Zhang et al.

ICML 2025 • poster • arXiv:2501.18858 • 8 citations
#213

Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection

Matteo Zecchin, Sangwoo Park, Osvaldo Simeone

ICML 2025 • spotlight • arXiv:2409.15844 • 8 citations
#214

De-mark: Watermark Removal in Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo et al.

ICML 2025 • poster • arXiv:2410.13808 • 8 citations
#215

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen, Xing Hu, Dawei Yang et al.

ICML 2025 • poster • arXiv:2505.03804 • 8 citations
#216

A Closer Look at Multimodal Representation Collapse

Abhra Chaudhuri, Anjan Dutta, Tu Bui et al.

ICML 2025 • spotlight • arXiv:2505.22483 • 8 citations
#217

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025 • poster • arXiv:2504.05304 • 8 citations
#218

AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses

Nicholas Carlini, Edoardo Debenedetti, Javier Rando et al.

ICML 2025 • oral • arXiv:2503.01811 • 8 citations
#219

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Yuanhe Zhang, Fanghui Liu, Yudong Chen

ICML 2025 • oral • arXiv:2502.01235 • 8 citations
#220

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen et al.

ICML 2025 • poster • arXiv:2406.08477 • 8 citations
#221

Self-Discriminative Modeling for Anomalous Graph Detection

Jinyu Cai, Yunhe Zhang, Jicong Fan

ICML 2025 • poster • arXiv:2310.06261 • 8 citations
#222

Secant Line Search for Frank-Wolfe Algorithms

Deborah Hendrych, Sebastian Pokutta, Mathieu Besançon et al.

ICML 2025 • poster • arXiv:2501.18775 • 8 citations
#223

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Artavazd Maranjyan, Alexander Tyurin, Peter Richtarik

ICML 2025 • poster • arXiv:2501.16168 • 7 citations
#224

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Tuomas Oikarinen, Ge Yan, Lily Weng

ICML 2025 • poster • arXiv:2506.05774 • 7 citations
#225

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025 • oral • arXiv:2504.19496 • 7 citations
#226

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025 • spotlight • arXiv:2407.11784 • 7 citations
#227

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025 • spotlight • arXiv:2507.07544 • 7 citations
#228

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Alexander Capstick, Rahul G. Krishnan, Payam Barnaghi

ICML 2025 • poster • arXiv:2411.17284 • 7 citations
#229

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Max Wilcoxson, Qiyang Li, Kevin Frans et al.

ICML 2025 • poster • arXiv:2410.18076 • 7 citations
#230

Ultra-Resolution Adaptation with Ease

Ruonan Yu, Songhua Liu, Zhenxiong Tan et al.

ICML 2025 • poster • arXiv:2503.16322 • 7 citations
#231

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Chenbei Lu, Laixi Shi, Zaiwei Chen et al.

ICML 2025 • poster • arXiv:2411.07591 • 7 citations
#232

Perception in Reflection

Yana Wei, Liang Zhao, Kangheng Lin et al.

ICML 2025 • poster • arXiv:2504.07165 • 7 citations
#233

Robust and Conjugate Spatio-Temporal Gaussian Processes

William Laplante, Matias Altamirano, Andrew Duncan et al.

ICML 2025 • oral • arXiv:2502.02450 • 7 citations
#234

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Abdulkadir Gokce, Martin Schrimpf

ICML 2025 • oral • arXiv:2411.05712 • 7 citations
#235

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025 • poster • arXiv:2501.18537 • 7 citations
#236

Impossible Videos

Zechen Bai, Hai Ci, Mike Zheng Shou

ICML 2025 • oral • arXiv:2503.14378 • 7 citations
#237

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

David Heurtel-Depeiges, Anian Ruoss, Joel Veness et al.

ICML 2025 • poster • arXiv:2410.05078 • 7 citations
#238

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025 • spotlight • arXiv:2505.24445 • 7 citations
#239

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Matteo Sesia, Vladimir Svetnik

ICML 2025 • spotlight • arXiv:2412.09729 • 7 citations
#240

Privacy Attacks on Image AutoRegressive Models

Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch et al.

ICML 2025 • poster • arXiv:2502.02514 • 7 citations
#241

Towards Robustness and Explainability of Automatic Algorithm Selection

Xingyu Wu, Jibin Wu, Yu Zhou et al.

ICML 2025 • spotlight • 7 citations
#242

Selective Prompt Anchoring for Code Generation

Yuan Tian, Tianyi Zhang

ICML 2025 • poster • arXiv:2408.09121 • 7 citations
#243

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Minh-Tung Luu, Younghwan Lee, Donghoon Lee et al.

ICML 2025 • poster • arXiv:2506.12822 • 7 citations
#244

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Jingyu Liu, Beidi Chen, Ce Zhang

ICML 2025 • poster • arXiv:2502.02789 • 7 citations
#245

Vision-Language Models Create Cross-Modal Task Representations

Grace Luo, Trevor Darrell, Amir Bar

ICML 2025 • poster • arXiv:2410.22330 • 7 citations
#246

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Erpai Luo, Xinran Wei, Lin Huang et al.

ICML 2025 • poster • arXiv:2502.01171 • 7 citations
#247

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

Kunlun Xu, Xu Zou, Gang Hua et al.

ICML 2025 • poster • arXiv:2505.04575 • 7 citations
#248

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior

Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner et al.

ICML 2025 • poster • arXiv:2410.16665 • 7 citations
#249

Learning Adaptive Lighting via Channel-Aware Guidance

Qirui Yang, Peng-Tao Jiang, Hao Zhang et al.

ICML 2025 • poster • arXiv:2412.01493 • 7 citations
#250

Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion

Tianyuan Zou, Yang Liu, Peng Li et al.

ICML 2025 • poster • arXiv:2502.00245 • 7 citations
#251

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Yixin Liu, Lie Lu, Jihui Jin et al.

ICML 2025 • oral • arXiv:2502.04230 • 7 citations
#252

Position: The Most Expensive Part of an LLM *should* be its Training Data

Nikhil Kandpal, Colin Raffel

ICML 2025 • poster • arXiv:2504.12427 • 7 citations
#253

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025 • poster • arXiv:2502.07587 • 7 citations
#254

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025 • oral • arXiv:2502.20260 • 6 citations
#255

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025 • poster • arXiv:2502.04079 • 6 citations
#256

ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs

Hongyi Liu, Rajarshi Saha, Zhen Jia et al.

ICML 2025 • poster • arXiv:2502.00258 • 6 citations
#257

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025 • poster • arXiv:2502.08991 • 6 citations
#258

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Clément Bonet, Christophe Vauthier, Anna Korba

ICML 2025 • oral • arXiv:2506.07534 • 6 citations
#259

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025 • poster • arXiv:2412.11044 • 6 citations
#260

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025 • poster • arXiv:2405.17527 • 6 citations
#261

Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

Zhitong Xu, Da Long, Yiming Xu et al.

ICML 2025 • poster • arXiv:2410.11165 • 6 citations
#262

QT-DoG: Quantization-Aware Training for Domain Generalization

Saqib Javed, Hieu Le, Mathieu Salzmann

ICML 2025 • poster • arXiv:2410.06020 • 6 citations
#263

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICML 2025 • poster • arXiv:2411.00171 • 6 citations
#264

Volume Optimality in Conformal Prediction with Structured Prediction Sets

Chao Gao, Liren Shan, Vaidehi Srinivas et al.

ICML 2025 • poster • arXiv:2502.16658 • 6 citations
#265

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025 • oral • arXiv:2410.05026 • 6 citations
#266

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025 • poster • arXiv:2502.00874 • 6 citations
#267

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025 • poster • arXiv:2405.06575 • 6 citations
#268

TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state

Xiaowen Ma, Zhen-Liang Ni, Shuai Xiao et al.

ICML 2025 • oral • arXiv:2505.20774 • 6 citations
#269

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025 • oral • arXiv:2506.17204 • 6 citations
#270

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025 • poster • arXiv:2110.06257 • 6 citations
#271

Language Models over Canonical Byte-Pair Encodings

Tim Vieira, Tianyu Liu, Clemente Pasti et al.

ICML 2025 • poster • arXiv:2506.07956 • 6 citations
#272

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025 • poster • arXiv:2503.04429 • 6 citations
#273

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Liang Chen, Xueting Han, Li Shen et al.

ICML 2025 • poster • arXiv:2506.03850 • 6 citations
#274

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025 • spotlight • arXiv:2507.01204 • 6 citations
#275

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025 • poster • arXiv:2411.02083 • 6 citations
#276

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Rui Yang, Lin Song, Yicheng Xiao et al.

ICML 2025 • poster • arXiv:2503.14694 • 6 citations
#277

ROPO: Robust Preference Optimization for Large Language Models

Xize Liang, Chao Chen, Shuang Qiu et al.

ICML 2025 • poster • arXiv:2404.04102 • 6 citations
#278

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng et al.

ICML 2025 • poster • arXiv:2410.09016 • 6 citations
#279

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025 • oral • arXiv:2406.19593 • 6 citations
#280

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

Zhanke Zhou, Xiao Feng, Zhaocheng Zhu et al.

ICML 2025 • poster • arXiv:2506.08295 • 6 citations
#281

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025 • poster • arXiv:2411.18612 • 6 citations
#282

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation

Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch et al.

ICML 2025 • oral • arXiv:2501.18771 • 6 citations
#283

OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition

Zheng Lian, Haiyang Sun, Licai Sun et al.

ICML 2025 • poster • arXiv:2410.01495 • 6 citations
#284

Variational Control for Guidance in Diffusion Models

Kushagra Pandey, Farrin Marouf Sofian, Felix Draxler et al.

ICML 2025 • poster • arXiv:2502.03686 • 6 citations
#285

Efficient Distributed Optimization under Heavy-Tailed Noise

Su Hyeong Lee, Manzil Zaheer, Tian Li

ICML 2025 • poster • arXiv:2502.04164 • 6 citations
#286

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.

ICML 2025 • poster • 6 citations
#287

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Sen Xing, Muyan Zhong, Zeqiang Lai et al.

ICML 2025 • poster • arXiv:2412.01271 • 6 citations
#288

Hyperband-based Bayesian Optimization for Black-box Prompt Selection

Lennart Schneider, Martin Wistuba, Aaron Klein et al.

ICML 2025 • poster • arXiv:2412.07820 • 6 citations
#289

LLM-Augmented Chemical Synthesis and Design Decision Programs

Haorui Wang, Jeff Guo, Lingkai Kong et al.

ICML 2025 • poster • arXiv:2505.07027 • 6 citations
#290

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Hantao Lou, Changye Li, Jiaming Ji et al.

ICML 2025 • poster • arXiv:2502.17514 • 6 citations
#291

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025 • poster • arXiv:2502.03032 • 6 citations
#292

Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

Ke Zhu, Shu Yang, Xiaofei Wang

ICML 2025 • poster • arXiv:2410.11713 • 6 citations
#293

LLMs can see and hear without any training

Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen et al.

ICML 2025 • poster • arXiv:2501.18096 • 6 citations
#294

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025 • poster • arXiv:2412.10827 • 6 citations
#295

Prediction-Powered E-Values

Daniel Csillag, Claudio Struchiner, Guilherme Tegoni Goedert

ICML 2025 • poster • arXiv:2502.04294 • 6 citations
#296

Hierarchical Graph Tokenization for Molecule-Language Alignment

Yongqiang Chen, Quanming Yao, Juzheng Zhang et al.

ICML 2025 • poster • arXiv:2406.14021 • 6 citations
#297

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Hongwei Li, Yuheng Tang, Shiqi Wang et al.

ICML 2025 • poster • arXiv:2502.02747 • 6 citations
#298

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025 • poster • arXiv:2506.05615 • 6 citations
#299

Learning from Integral Losses in Physics Informed Neural Networks

Ehsan Saleh, Saba Ghaffari, Timothy Bretl et al.

ICML 2024 • poster • arXiv:2305.17387 • 6 citations
#300

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025 • poster • arXiv:2407.09297 • 6 citations
#301

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025 • spotlight • arXiv:2502.18147 • 6 citations
#302

Tree-Sliced Wasserstein Distance with Nonlinear Projection

Thanh Tran, Viet Hoang Tran, Thanh Chu et al.

ICML 2025 • poster • arXiv:2505.00968 • 5 citations
#303

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025 • oral • arXiv:2502.05164 • 5 citations
#304

Correlated Errors in Large Language Models

Elliot Myunghoon Kim, Avi Garg, Kenny Peng et al.

ICML 2025 • poster • arXiv:2506.07962 • 5 citations
#305

Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo

Idan Achituve, Hai Victor Habi, Amir Rosenfeld et al.

ICML 2025 • poster • arXiv:2502.05908 • 5 citations
#306

Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation

Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.

ICML 2025 • spotlight • arXiv:2507.11789 • 5 citations
#307

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025 • poster • arXiv:2502.10436 • 5 citations
#308

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning

Aaditya Naik, Jason Liu, Claire Wang et al.

ICML 2025 • poster • arXiv:2410.03348 • 5 citations
#309

General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization

Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky

ICML 2025 • oral • arXiv:2411.07061 • 5 citations
#310

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025 • poster • arXiv:2502.01342 • 5 citations
#311

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025 • poster • arXiv:2410.08194 • 5 citations
#312

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025 • poster • arXiv:2411.09858 • 5 citations
#313

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025 • poster • arXiv:2410.08067 • 5 citations
#314

TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer

Lusen Zhao, Zihan Huang, Ding Jianhao et al.

ICML 2025 • poster • 5 citations
#315

What makes an Ensemble (Un) Interpretable?

Shahaf Bassan, Guy Amir, Meirav Zehavi et al.

ICML 2025 • poster • arXiv:2506.08216 • 5 citations
#316

Temporal Difference Flows

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni et al.

ICML 2025 • oral • arXiv:2503.09817 • 5 citations
#317

Understanding Generalization in Quantum Machine Learning with Margins

Tak Hur, Daniel Kyungdeock Park

ICML 2025 • poster • arXiv:2411.06919 • 5 citations
#318

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong Huang, Guangtao Zeng, Jianbo Dai et al.

ICML 2025 • poster • arXiv:2410.10209 • 5 citations
#319

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025 • poster • arXiv:2505.02118 • 5 citations
#320

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Michael Kirchhof, James Thornton, Louis Béthune et al.

ICML 2025 • poster • arXiv:2410.06025 • 5 citations
#321

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025 • poster • arXiv:2506.04870 • 5 citations
#322

On the Robustness of Reward Models for Language Model Alignment

Jiwoo Hong, Noah Lee, Eunki Kim et al.

ICML 2025 • poster • arXiv:2505.07271 • 5 citations
#323

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025 • poster • arXiv:2501.12053 • 5 citations
#324

KV Shifting Attention Enhances Language Modeling

Mingyu Xu, Bingning Wang, Weipeng Chen

ICML 2025 • oral • arXiv:2411.19574 • 5 citations
#325

Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities

Yifang Chen, Xiaoyu Li, Yingyu Liang et al.

ICML 2025 • poster • 5 citations
#326

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025 • poster • arXiv:2507.05502 • 5 citations
#327

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICML 2025 • poster • arXiv:2506.15385 • 5 citations
#328

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025 • poster • arXiv:2503.15748 • 5 citations
#329

Scaling Laws for Floating-Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025 • poster • 5 citations
#330

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

ICML 2025 • poster • arXiv:2407.20444 • 5 citations
#331

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025 • poster • arXiv:2502.04757 • 5 citations
#332

Embedding Safety into RL: A New Take on Trust Region Methods

Nikola Milosevic, Johannes Müller, Nico Scherf

ICML 2025 • poster • arXiv:2411.02957 • 5 citations
#333

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025 • poster • arXiv:2410.20210 • 5 citations
#334

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025 • poster • arXiv:2505.01874 • 5 citations
#335

On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains

Xun Xian, Ganghua Wang, Xuan Bi et al.

ICML 2025 • poster • arXiv:2409.17275 • 5 citations
#336

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICML 2025 • poster • arXiv:2506.19031 • 5 citations
#337

Scaling Laws for Upcycling Mixture-of-Experts Language Models

Seng Pei Liew, Takuya Kato, Sho Takase

ICML 2025 • poster • arXiv:2502.03009 • 5 citations
#338

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025 • poster • arXiv:2503.11842 • 5 citations
#339

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model

Zixiang Ai, Zichen Liu, Yuanhang Lei et al.

ICML 2025 • poster • arXiv:2505.04119 • 5 citations
#340

LLMScan: Causal Scan for LLM Misbehavior Detection

Mengdi Zhang, Goh Kiat, Peixin Zhang et al.

ICML 2025 • poster • arXiv:2410.16638 • 5 citations
#341

Continuous Visual Autoregressive Generation via Score Maximization

Chenze Shao, Fandong Meng, Jie Zhou

ICML 2025 • poster • arXiv:2505.07812 • 5 citations
#342

On Volume Minimization in Conformal Regression

Batiste Le Bars, Pierre Humbert

ICML 2025 • poster • arXiv:2502.09985 • 5 citations
#343

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025 • poster • arXiv:2502.01954 • 5 citations
#344

Contextual Online Decision Making with Infinite-Dimensional Functional Regression

Haichen Hu, Rui Ai, Stephen Bates et al.

ICML 2025 • poster • arXiv:2501.18359 • 5 citations
#345

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy

Kaixuan Xu, Jiajun Chai, Sicheng Li et al.

ICML 2025 • poster • arXiv:2506.09655 • 5 citations
#346

Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

Xihong Yang, Siwei Wang, Fangdi Wang et al.

ICML 2025 • spotlight • arXiv:2505.21387 • 5 citations
#347

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025 • poster • arXiv:2505.16441 • 5 citations
#348

In-Context Learning and Occam's Razor

Eric Elmoznino, Tom Marty, Tejas Kasetty et al.

ICML 2025 • poster • arXiv:2410.14086 • 5 citations
#349

(How) Can Transformers Predict Pseudo-Random Numbers?

Tao Tao, Darshil Doshi, Dayal Singh Kalra et al.

ICML 2025 • poster • arXiv:2502.10390 • 5 citations
#350

Position: Build Agent Advocates, Not Platform Agents

Sayash Kapoor, Noam Kolt, Seth Lazar

ICML 2025 • poster • 5 citations
#351

MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

Zhicheng Zhang, Wuyou Xia, Chenxi Zhao et al.

ICML 2025 • spotlight • arXiv:2507.04635 • 4 citations
#352

On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists

Dongyang Fan, Bettina Messmer, Nikita Doikov et al.

ICML 2025 • poster • arXiv:2409.13931 • 4 citations
#353

Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs

Haoming Yang, Ke Ma, Xiaojun Jia et al.

ICML 2025 • poster • arXiv:2505.02862 • 4 citations
#354

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Ali Behrouz, Ali Parviz, Mahdi Karami et al.

ICML 2025 • poster • arXiv:2411.15671 • 4 citations
#355

FACTER: Fairness-Aware Conformal Thresholding and Prompt Engineering for Enabling Fair LLM-Based Recommender Systems

Arya Fayyazi, Mehdi Kamal, Massoud Pedram

ICML 2025 • poster • arXiv:2502.02966 • 4 citations
#356

DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis

Yinghao Li, Rithesh Kumar, Zeyu Jin

ICML 2025 • oral • arXiv:2410.11097 • 4 citations
#357

Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner

Chenyou Fan, Chenjia Bai, Zhao Shan et al.

ICML 2025 • poster • arXiv:2409.19949 • 4 citations
#358

Collapse-Proof Non-Contrastive Self-Supervised Learning

Emanuele Sansone, Tim Lebailly, Tinne Tuytelaars

ICML 2025 • poster • arXiv:2410.04959 • 4 citations
#359

Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes

Dongjae Jeon, Dueun Kim, Albert No

ICML 2025 • spotlight • arXiv:2412.04140 • 4 citations
#360

TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation

Gwen Yidou-Weng, Benjie Wang, Guy Van den Broeck

ICML 2025 • poster • 4 citations
#361

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Loris Gaven, Thomas Carta, Clément Romac et al.

ICML 2025 • poster • arXiv:2502.07709 • 4 citations
#362

Representations Shape Weak-to-Strong Generalization: Theoretical Insights and Empirical Predictions

Yihao Xue, Jiping Li, Baharan Mirzasoleiman

ICML 2025 • poster • arXiv:2502.00620 • 4 citations
#363

Position: Lifetime tuning is incompatible with continual reinforcement learning

Golnaz Mesbahi, Parham Mohammad Panahi, Olya Mastikhina et al.

ICML 2025 • poster • arXiv:2404.02113 • 4 citations
#364

Point-Level Topological Representation Learning on Point Clouds

Vincent P. Grande, Michael Schaub

ICML 2025 • poster • arXiv:2406.02300 • 4 citations
#365

SPEX: Scaling Feature Interaction Explanations for LLMs

Justin S. Kang, Landon Butler, Abhineet Agarwal et al.

ICML 2025 • poster • arXiv:2502.13870 • 4 citations
#366

Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning

Armin Behnamnia, Gholamali Aminian, Alireza Aghaei et al.

ICML 2025 • spotlight • arXiv:2506.06873 • 4 citations
#367

Relating Misfit to Gain in Weak-to-Strong Generalization Beyond the Squared Loss

Abhijeet Mulgund, Chirag Pabbaraju

ICML 2025 • poster • arXiv:2501.19105 • 4 citations
#368

Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF

Nuoya Xiong, Aarti Singh

ICML 2025 • poster • arXiv:2502.15145 • 4 citations
#369

Learning Dynamics in Continual Pre-Training for Large Language Models

Xingjin Wang, Howe Tissue, Lu Wang et al.

ICML 2025 • oral • arXiv:2505.07796 • 4 citations
#370

Lightspeed Geometric Dataset Distance via Sliced Optimal Transport

Khai Nguyen, Hai Nguyen, Tuan Pham et al.

ICML 2025 • poster • arXiv:2501.18901 • 4 citations
#371

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Yilun Kong, Guozheng Ma, Qi Zhao et al.

ICML 2025 • poster • arXiv:2505.24378 • 4 citations
#372

Geometric Hyena Networks for Large-scale Equivariant Learning

Artem Moskalev, Mangal Prakash, Junjie Xu et al.

ICML 2025 • spotlight • arXiv:2505.22560 • 4 citations
#373

Improving Multimodal Learning Balance and Sufficiency through Data Remixing

Xiaoyu Ma, Hao Chen, Yongjian Deng

ICML 2025 • poster • arXiv:2506.11550 • 4 citations
#374

Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

Laixi Shi, Jingchu Gai, Eric Mazumdar et al.

ICML 2025 • oral • arXiv:2409.20067 • 4 citations
#375

Controlled Generation with Equivariant Variational Flow Matching

Floor Eijkelboom, Heiko Zimmermann, Sharvaree Vadgama et al.

ICML 2025 • poster • arXiv:2506.18340 • 4 citations
#376

Blink of an eye: a simple theory for feature localization in generative models

Marvin Li, Aayush Karan, Sitan Chen

ICML 2025 • oral • arXiv:2502.00921 • 4 citations
#377

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

Shibo Jie, Yehui Tang, Kai Han et al.

ICML 2025 • poster • arXiv:2503.16163 • 4 citations
#378

CALM: Consensus-Aware Localized Merging for Multi-Task Learning

Kunda Yan, Min Zhang, Sen Cui et al.

ICML 2025 • poster • arXiv:2506.13406 • 4 citations
#379

Nested Expectations with Kernel Quadrature

Zonghao Chen, Masha Naslidnyk, Francois-Xavier Briol

ICML 2025 • poster • arXiv:2502.18284 • 4 citations
#380

Tensor Product Neural Networks for Functional ANOVA Model

Seokhun Park, Insung Kong, Yongchan Choi et al.

ICML 2025 • poster • arXiv:2502.15215 • 4 citations
#381

Sable: a Performant, Efficient and Scalable Sequence Model for MARL

Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock et al.

ICML 2025 • oral • arXiv:2410.01706 • 4 citations
#382

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

ICML 2025 • poster • arXiv:2410.09795 • 4 citations
#383

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

Mingyang Sun, Pengxiang Ding, Weinan Zhang et al.

ICML 2025 • poster • arXiv:2502.12631 • 4 citations
#384

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set

Xinyu Liu, Zixuan Xie, Shangtong Zhang

ICML 2025 • poster • arXiv:2501.19254 • 4 citations
#385

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025 • poster • arXiv:2410.22944 • 4 citations
#386

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICML 2025 • poster • arXiv:2502.18699 • 4 citations
#387

Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs

Faisal Hamman, Sachindra P Dissanayake, Saumitra Mishra et al.

ICML 2025 • poster • arXiv:2407.04173 • 4 citations
#388

Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages

Michael Sun, Weize Yuan, Gang Liu et al.

ICML 2025 • poster • arXiv:2505.22948 • 4 citations
#389

Enhancing Decision-Making of Large Language Models via Actor-Critic

Heng Dong, Kefei Duan, Chongjie Zhang

ICML 2025 • poster • arXiv:2506.06376 • 4 citations
#390

Unified Breakdown Analysis for Byzantine Robust Gossip

Renaud Gaucher, Aymeric Dieuleveut, Hadrien Hendrikx

ICML 2025 • poster • arXiv:2410.10418 • 4 citations
#391

A Theoretical Framework For Overfitting In Energy-based Modeling

Giovanni Catania, Aurélien Decelle, Cyril Furtlehner et al.

ICML 2025 • poster • arXiv:2501.19158 • 4 citations
#392

Gradient Boosting Reinforcement Learning

Benjamin Fuhrer, Chen Tessler, Gal Dalal

ICML 2025 • poster • arXiv:2407.08250 • 4 citations
#393

BoA: Attention-aware Post-training Quantization without Backpropagation

Junhan Kim, Ho-young Kim, Eulrang Cho et al.

ICML 2025 • poster • arXiv:2406.13474 • 4 citations
#394

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Cansu Sancaktar, Christian Gumbsch, Andrii Zadaianchuk et al.

ICML 2025 • poster • arXiv:2503.01584 • 4 citations
#395

Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment

Chengting Yu, Xiaochen Zhao, Lei Liu et al.

ICML 2025 • oral • arXiv:2501.15925 • 4 citations
#396

AssistanceZero: Scalably Solving Assistance Games

Cassidy Laidlaw, Eli Bronstein, Timothy Guo et al.

ICML 2025 • poster • arXiv:2504.07091 • 4 citations
#397

CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

Junbo Yin, Chao Zha, Wenjia He et al.

ICML 2025 • poster • arXiv:2505.22869 • 4 citations
#398

SlimLLM: Accurate Structured Pruning for Large Language Models

Jialong Guo, Xinghao Chen, Yehui Tang et al.

ICML 2025 • poster • arXiv:2505.22689 • 4 citations
#399

Prune 'n Predict: Optimizing LLM Decision-making with Conformal Prediction

Harit Vishwakarma, Alan Mishler, Thomas Cook et al.

ICML 2025 • poster • arXiv:2501.00555 • 4 citations
#400

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

Jiawei Huang, Bingcong Li, Christoph Dann et al.

ICML 2025 • poster • arXiv:2502.19255 • 4 citations