ICML Papers

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025 • poster • arXiv:2410.20210 • 5 citations

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Xin Zou, Yizhou Wang, Yibo Yan et al.

ICML 2025 • poster • arXiv:2410.03577

LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Yicheng Xiao, Lin Song, Rui Yang et al.

ICML 2025 • poster • arXiv:2506.11638

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Yuanhe Zhang, Fanghui Liu, Yudong Chen

ICML 2025 • oral • arXiv:2502.01235 • 8 citations

LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)

Junsu Kim, Jaeyeon Kim, Ernest Ryu

ICML 2025 • oral • arXiv:2502.09376

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025 • poster • arXiv:2501.18537 • 7 citations

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025 • spotlight • arXiv:2507.01204 • 6 citations

Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization

Yang Chen, Long Yang, Yitao Liang et al.

ICML 2025 • poster • arXiv:2410.08898

Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space

Max van Spengler, Pascal Mettes

ICML 2025 • poster • arXiv:2502.17130

Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers

Alireza Amiribavandpour, Xinting Huang, Mark Rofin et al.

ICML 2025 • poster

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits

Zikai Zhou, Qizheng Zhang, Hermann Kumbong et al.

ICML 2025 • poster • arXiv:2502.08141

Low-Rank Adapting Models for Sparse Autoencoders

Matthew Chen, Josh Engels, Max Tegmark

ICML 2025 • poster • arXiv:2501.19406

Low-Rank Tensor Transitions (LoRT) for Transferable Tensor Regression

Andong Wang, Yuning Qiu, Zhong Jin et al.

ICML 2025 • oral

Low-Rank Thinning

Annabelle Carrell, Albert Gong, Abhishek Shetty et al.

ICML 2025 • poster • arXiv:2502.12063

LRA-QViT: Integrating Low-Rank Approximation and Quantization for Robust and Efficient Vision Transformers

Beom Jin Kang, NamJoon Kim, Hyun Kim

ICML 2025 • poster • 2 citations

LSCD: Lomb-Scargle Conditioned Diffusion for Time series Imputation

Elizabeth M Fons Etcheverry, Alejandro Sztrajman, Yousef El-Laham et al.

ICML 2025 • poster

LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models

Tzu-Tao (Tommy) Chang, Shivaram Venkataraman

ICML 2025 • poster • arXiv:2502.02406 • 1 citation

M2PDE: Compositional Generative Multiphysics and Multi-component PDE Simulation

Tao Zhang, Zhenhai Liu, Feipeng Qi et al.

ICML 2025 • poster • arXiv:2412.04134

M³HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

Ziyan Wang, Zhicheng Zhang, Fei Fang et al.

ICML 2025 • poster

M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Embedding Predictive Architecture

Hongyang Lei, Xiaolong Cheng, Qi Qin et al.

ICML 2025 • poster • arXiv:2409.05929 • 4 citations

Machine Learning meets Algebraic Combinatorics: A Suite of Datasets Capturing Research-level Conjecturing Ability in Pure Mathematics

Herman Chau, Helen Jenne, Davis Brown et al.

ICML 2025 • oral • arXiv:2503.06366

Machines and Mathematical Mutations: Using GNNs to Characterize Quiver Mutation Classes

Jesse He, Helen Jenne, Herman Chau et al.

ICML 2025 • poster • arXiv:2411.07467

MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces

Loris Gaven, Thomas Carta, Clément Romac et al.

ICML 2025 • poster • arXiv:2502.07709 • 4 citations

Mahalanobis++: Improving OOD Detection via Feature Normalization

Maximilian Müller, Matthias Hein

ICML 2025 • poster

Maintaining Proportional Committees with Dynamic Candidate Sets

Chris Dong, Jannik Peters

ICML 2025 • poster

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Chenghao Fan, Zhenyi Lu, Sichen Liu et al.

ICML 2025 • poster • arXiv:2502.16894

Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic

Eshika Saxena, Alberto Alfarano, Emily Wenger et al.

ICML 2025 • poster • arXiv:2410.03569

MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Ruida Wang, Rui Pan, Yuxin Li et al.

ICML 2025 • poster • arXiv:2503.03205

MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models

Mahir Labib Dihan, Tanvir Hassan, Md Tanvir Parvez et al.

ICML 2025 • spotlight • arXiv:2501.00316

MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning

Zihan Chen, Song Wang, Zhen Tan et al.

ICML 2025 • poster • arXiv:2505.16225

MARGE: Improving Math Reasoning with Guided Exploration

Jingyue Gao, Runji Lin, Keming Lu et al.

ICML 2025 • poster

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.

ICML 2025 • poster • arXiv:2411.10438

MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems

Rui Ye, Shuo Tang, Rui Ge et al.

ICML 2025 • poster • arXiv:2503.03686

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Hao Chen, Yujin Han, Fangyi Chen et al.

ICML 2025 • spotlight • arXiv:2502.03444

Masked Generative Nested Transformers with Decode Time Scaling

Sahil Goyal, Debapriya Tula, Gagan Jain et al.

ICML 2025 • poster • arXiv:2502.00382

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Xialie Zhuang, Zhikai Jia, Jianjin Li et al.

ICML 2025 • poster • arXiv:2502.07490

MaskTwins: Dual-form Complementary Masking for Domain-Adaptive Image Segmentation

Jiawen Wang, Yinda Chen, Xiaoyu Liu et al.

ICML 2025 • poster

Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding

Mingyu Jin, Kai Mei, Wujiang Xu et al.

ICML 2025 • poster • arXiv:2502.01563

MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models

Jiazheng Li, Lu Yu, Qing Cui et al.

ICML 2025 • poster • arXiv:2503.14917

Mastering Board Games by External and Internal Planning with Language Models

John Schultz, Jakub Adamek, Matej Jusup et al.

ICML 2025 • spotlight • arXiv:2412.12119 • 21 citations

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Yilun Kong, Guozheng Ma, Qi Zhao et al.

ICML 2025 • poster • arXiv:2505.24378 • 4 citations

Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2025 • poster • arXiv:2506.20650

MathConstruct: Challenging LLM Reasoning with Constructive Proofs

Mislav Balunovic, Jasper Dekoninck, Nikola Jovanović et al.

ICML 2025 • poster

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Kaixuan Huang, Jiacheng Guo, Zihao Li et al.

ICML 2025 • poster • arXiv:2502.06453

Matrix Completion with Incomplete Side Information via Orthogonal Complement Projection

Gengshuo Chang, Wei Zhang, Lehan Zhang

ICML 2025 • poster

Matryoshka Quantization

Pranav Nair, Puranjay Datta, Jeff Dean et al.

ICML 2025 • poster • arXiv:2502.06786

MATS: An Audio Language Model under Text-only Supervision

Wen Wang, Ruibing Hou, Hong Chang et al.

ICML 2025 • poster • arXiv:2502.13433

Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization

Deyuan Liu, Zecheng Wang, Bingning Wang et al.

ICML 2025 • poster

Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures

Alina Ene, Alessandro Epasto, Vahab Mirrokni et al.

ICML 2025 • poster • arXiv:2504.18394

Maximum Entropy Reinforcement Learning with Diffusion Policy

Xiaoyi Dong, Jian Cheng, Xi Zhang

ICML 2025 • poster • arXiv:2502.11612