Most Cited ICML "compute-optimal training" Papers

5,975 papers found • Page 9 of 30

#1601

Minimalist Concept Erasure in Generative Models

Yang Zhang, Er Jin, Yanfei Dong et al.

ICML 2025arXiv:2507.13386
#1602

Quantifying Treatment Effects: Estimating Risk Ratios via Observational Studies

Ahmed Boughdiri, julie Josse, Erwan Scornet

ICML 2025
#1603

Communicating Activations Between Language Model Agents

Vignav Ramesh, Kenneth Li

ICML 2025arXiv:2501.14082
#1604

MedRAX: Medical Reasoning Agent for Chest X-ray

Adibvafa Fallahpour, Jun Ma, Alif Munim et al.

ICML 2025arXiv:2502.02673
#1605

On the Power of Learning-Augmented Search Trees

Jingbang Chen, Xinyuan Cao, Alicia Stepin et al.

ICML 2025arXiv:2211.09251
#1606

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.

ICML 2025arXiv:2505.17552
#1607

Latent Variable Causal Discovery under Selection Bias

Haoyue Dai, Yiwen Qiu, Ignavier Ng et al.

ICML 2025arXiv:2512.11219
#1608

WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models

Tan Songbai, Xuerui Qiu, Yao Shu et al.

ICML 2025
#1609

LEVIS: Large Exact Verifiable Input Spaces for Neural Networks

Mohamad Chehade, Wenting Li, Brian Bell et al.

ICML 2025arXiv:2408.08824
#1610

Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity

Zhenglin Wan, Xingrui Yu, David Bossens et al.

ICML 2025oral
#1611

Overcoming Non-monotonicity in Transducer-based Streaming Generation

Zhengrui Ma, Yang Feng, Min zhang

ICML 2025arXiv:2411.17170
#1612

Efficient Bisection Projection to Ensure Neural-Network Solution Feasibility for Optimization over General Set

Enming Liang, Minghua Chen

ICML 2025
#1613

Time-Aware World Model for Adaptive Prediction and Control

Anh Nhu, Sanghyun Son, Ming Lin

ICML 2025oralarXiv:2506.08441
#1614

A Mixture-Based Framework for Guiding Diffusion Models

Yazid Janati, Badr MOUFAD, Mehdi Qassime et al.

ICML 2025arXiv:2502.03332
#1615

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

Yuxuan Zhu, Antony Kellermann, Dylan Bowman et al.

ICML 2025spotlightarXiv:2503.17332
#1616

Mixture of Hidden-Dimensions: Not All Hidden-States’ Dimensions are Needed in Transformer

Yilong Chen, Junyuan Shang, Zhenyu Zhang et al.

ICML 2025
#1617

Conformal Tail Risk Control for Large Language Model Alignment

Catherine Chen, Jingyan Shen, Xinyu Yang et al.

ICML 2025arXiv:2502.20285
#1618

Aligning LLMs by Predicting Preferences from User Writing Samples

Stéphane Aroca-Ouellette, Natalie Mackraz, Barry-John Theobald et al.

ICML 2025arXiv:2505.23815
#1619

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

Arsalan Sharifnassab, Saber Salehkaleybar, Rich Sutton

ICML 2025arXiv:2402.02342
#1620

In-Context Deep Learning via Transformer Models

Weimin Wu, Maojiang Su, Jerry Yao-Chieh Hu et al.

ICML 2025arXiv:2411.16549
#1621

Learning dynamics in linear recurrent neural networks

Alexandra Proca, Clémentine Dominé, Murray Shanahan et al.

ICML 2025oral
#1622

Benign Overfitting in Token Selection of Attention Mechanism

Keitaro Sakamoto, Issei Sato

ICML 2025arXiv:2409.17625
#1623

On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding

Kevin Xu, Issei Sato

ICML 2025arXiv:2410.01405
#1624

Structured Preconditioners in Adaptive Optimization: A Unified Analysis

Shuo Xie, Tianhao Wang, Sashank J. Reddi et al.

ICML 2025arXiv:2503.10537
#1625

KoNODE: Koopman-Driven Neural Ordinary Differential Equations with Evolving Parameters for Time Series Analysis

Hanru Bai, Weiyang Ding

ICML 2025
#1626

Structure-informed Risk Minimization for Robust Ensemble Learning

Fengchun Qiao, Yanlin Chen, Xi Peng

ICML 2025
#1627

Feasible Action Search for Bandit Linear Programs via Thompson Sampling

Aditya Gangrade, Aldo Pacchiano, Clay Scott et al.

ICML 2025
#1628

Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning

Zhenghai Xue, Lang Feng, Jiacheng Xu et al.

ICML 2025spotlightarXiv:2503.06893
#1629

AuPair: Golden Example Pairs for Code Repair

Aditi Mavalankar, Hassan Mansoor, Zita Marinho et al.

ICML 2025arXiv:2502.18487
#1630

Deep Streaming View Clustering

Honglin Yuan, Xingfeng Li, Jian Dai et al.

ICML 2025
#1631

Janus: Dual-Server Multi-Round Secure Aggregation with Verifiability for Federated Learning

Lang Pu, Jingjing Gu, Chao Lin et al.

ICML 2025
#1632

Lego Sketch: A Scalable Memory-augmented Neural Network for Sketching Data Streams

Yuan Feng, Yukun Cao, Hairu Wang et al.

ICML 2025arXiv:2505.19561
#1633

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Zhuofan Zong, Dongzhi Jiang, Bingqi Ma et al.

ICML 2025arXiv:2412.09618
#1634

Risk and cross validation in ridge regression with correlated samples

Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan

ICML 2025arXiv:2408.04607
#1635

Identification of Latent Confounders via Investigating the Tensor Ranks of the Nonlinear Observations

Zhengming Chen, Yewei Xia, Feng Xie et al.

ICML 2025
#1636

Online Learning in the Random-Order Model

Martino Bernasconi, Andrea Celli, Riccardo Colini Baldeschi et al.

ICML 2025
#1637

On the Similarities of Embeddings in Contrastive Learning

Chungpa Lee, Sehee Lim, Kibok Lee et al.

ICML 2025arXiv:2506.09781
#1638

VerbalTS: Generating Time Series from Texts

Shuqi Gu, Chuyue Li, Baoyu Jing et al.

ICML 2025oral
#1639

Private Model Personalization Revisited

Conor Snedeker, Xinyu Zhou, Raef Bassily

ICML 2025arXiv:2506.19220
#1640

Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle

Hui Dai, Ryan Teehan, Mengye Ren

ICML 2025oralarXiv:2411.08324
#1641

Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing

Zhuoying Li, Zhu Xu, Yuxin Peng et al.

ICML 2025arXiv:2506.13827
#1642

Diss-l-ECT: Dissecting Graph Data with Local Euler Characteristic Transforms

Julius Von Rohrscheidt, Bastian Rieck

ICML 2025arXiv:2410.02622
#1643

Voronoi-grid-based Pareto Front Learning and Its Application to Collaborative Federated Learning

Mengmeng Chen, Xiaohu Wu, QIQI LIU et al.

ICML 2025arXiv:2505.20648
#1644

Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts

Li gengluo, Huawen Shen, Yu ZHOU

ICML 2025arXiv:2506.04999
#1645

Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres

Muskan Dosi, Chiranjeev Chiranjeev, Kartik Thakral et al.

ICML 2025arXiv:2506.10576
#1646

Towards Rationale-Answer Alignment of LVLMs via Self-Rationale Calibration

Yuanchen Wu, Ke Yan, Shouhong Ding et al.

ICML 2025arXiv:2509.13919
#1647

Unveiling Markov heads in Pretrained Language Models for Offline Reinforcement Learning

Wenhao Zhao, Qiushui Xu, Linjie Xu et al.

ICML 2025arXiv:2409.06985
#1648

Stability and Generalization Capability of Subgraph Reasoning Models for Inductive Knowledge Graph Completion

Minsung Hwang, Jaejun Lee, Joyce Whang

ICML 2025
#1649

Improving Zero-Shot Adversarial Robustness in Vision-Language Models by Closed-form Alignment of Adversarial Path Simplices

Junhao Dong, Piotr Koniusz, Yifei Zhang et al.

ICML 2025spotlight
#1650

Inverse problems with experiment-guided AlphaFold

Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.

ICML 2025arXiv:2502.09372
#1651

Cross-regularization: Adaptive Model Complexity through Validation Gradients

Carlos Stein Naves de Brito

ICML 2025arXiv:2506.19755
#1652

CTBench: A Library and Benchmark for Certified Training

Yuhao Mao, Stefan Balauca, Martin Vechev

ICML 2025arXiv:2406.04848
#1653

Average Certified Radius is a Poor Metric for Randomized Smoothing

Chenhao Sun, Yuhao Mao, Mark Müller et al.

ICML 2025arXiv:2410.06895
#1654

Generalization Principles for Inference over Text-Attributed Graphs with Large Language Models

Haoyu Wang, Shikun Liu, Rongzhe Wei et al.

ICML 2025
#1655

Identifying and Understanding Cross-Class Features in Adversarial Training

Zeming Wei, Yiwen Guo, Yisen Wang

ICML 2025arXiv:2506.05032
#1656

Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups

Weiqiu You, Helen Qu, Marco Gatti et al.

ICML 2025arXiv:2310.16316
#1657

Ad-Hoc Human-AI Coordination Challenge

Tin Dizdarevic, Ravi Hammond, Tobias Gessler et al.

ICML 2025spotlight
#1658

Revisiting Unbiased Implicit Variational Inference

Tobias Pielok, Bernd Bischl, David Rügamer

ICML 2025arXiv:2506.03839
#1659

MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition

ning wang, Zekun Li, Tongxin Bai et al.

ICML 2025
#1660

Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin et al.

ICML 2025arXiv:2507.00518
#1661

Stochastic Encodings for Active Feature Acquisition

Alexander Norcliffe, Changhee Lee, Fergus Imrie et al.

ICML 2025arXiv:2508.01957
#1662

Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves

Mykhailo Uss, Ruslan Yermolenko, Oleksii Shashko et al.

ICML 2025arXiv:2405.14024
#1663

D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples

Zijing Hu, Fengda Zhang, Kun Kuang

ICML 2025arXiv:2505.22002
#1664

Approximating Latent Manifolds in Neural Networks via Vanishing Ideals

Nico Pelleriti, Max Zimmer, Elias Wirth et al.

ICML 2025arXiv:2502.15051
#1665

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025arXiv:2506.03470
#1666

Discovering Latent Causal Graphs from Spatiotemporal Data

Kun Wang, Sumanth Varambally, Duncan Watson-Parris et al.

ICML 2025oralarXiv:2411.05331
#1667

Adjusting Model Size in Continual Gaussian Processes: How Big is Big Enough?

Guiomar Pescador-Barrios, Sarah Filippi, Mark van der Wilk

ICML 2025spotlightarXiv:2408.07588
#1668

SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion

Junwei Su, shan Wu

ICML 2025arXiv:2508.14352
#1669

BOPO: Neural Combinatorial Optimization via Best-anchored and Objective-guided Preference Optimization

Zijun Liao, Jinbiao Chen, Debing Wang et al.

ICML 2025arXiv:2503.07580
#1670

Identifying Metric Structures of Deep Latent Variable Models

Stas Syrota, Yevgen Zainchkovskyy, Johnny Xi et al.

ICML 2025arXiv:2502.13757
#1671

CoDy: Counterfactual Explainers for Dynamic Graphs

Zhan Qu, Daniel Gomm, Michael Färber

ICML 2025oralarXiv:2403.16846
#1672

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Yining Pan, Qiongjie Cui, Xulei Yang et al.

ICML 2025arXiv:2505.18956
#1673

Eigen Analysis of Conjugate Kernel and Neural Tangent Kernel

Xiangchao Li, Xiao Han, Qing Yang

ICML 2025
#1674

Causal Invariance-aware Augmentation for Brain Graph Contrastive Learning

Minqi Yu, Jinduo Liu, Junzhong Ji

ICML 2025
#1675

Online Clustering of Dueling Bandits

Zhiyong Wang, Jiahang Sun, Mingze Kong et al.

ICML 2025arXiv:2502.02079
#1676

Learnable Spatial-Temporal Positional Encoding for Link Prediction

Katherine Tieu, Dongqi Fu, Zihao Li et al.

ICML 2025oralarXiv:2506.08309
#1677

Elucidating the Design Space of Multimodal Protein Language Models

Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.

ICML 2025spotlightarXiv:2504.11454
#1678

Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing

Xiang Zhang, Jiaqi Wei, Zijie Qiu et al.

ICML 2025oralarXiv:2506.13485
#1679

Understanding Model Reprogramming for CLIP via Decoupling Visual Prompts

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICML 2025arXiv:2506.01000
#1680

Automatically Interpreting Millions of Features in Large Language Models

Gonçalo Paulo, Alex Mallen, Caden Juang et al.

ICML 2025arXiv:2410.13928
#1681

Phase and Amplitude-aware Prompting for Enhancing Adversarial Robustness

Yibo Xu, Dawei Zhou, Decheng Liu et al.

ICML 2025
#1682

On Differential Privacy for Adaptively Solving Search Problems via Sketching

Shiyuan Feng, Ying Feng, George Li et al.

ICML 2025oralarXiv:2506.05503
#1683

Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos

Tianyi Zhang, Yu Cao, Dianbo Liu

ICML 2025arXiv:2402.18888
#1684

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Xiangxin Zhou, Mingyu Li, xiao yi et al.

ICML 2025arXiv:2505.21452
#1685

Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?

Yujin Han, Andi Han, Wei Huang et al.

ICML 2025arXiv:2502.04725
#1686

Towards Escaping from Class Dependency Modeling for Multi-Dimensional Classification

Teng Huang, Bin-Bin Jia, Min-Ling Zhang

ICML 2025
#1687

Skip the Equations: Learning Behavior of Personalized Dynamical Systems Directly From Data

Krzysztof Kacprzyk, Julianna Piskorz, Mihaela van der Schaar

ICML 2025oral
#1688

Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead

Won-Jun Jang, Hyeon-Seo Park, Si-Hyeon Lee

ICML 2025arXiv:2502.06349
#1689

Prediction-Powered Adaptive Shrinkage Estimation

Sida Li, Nikolaos Ignatiadis

ICML 2025arXiv:2502.14166
#1690

Understanding Mode Connectivity via Parameter Space Symmetry

Bo Zhao, Nima Dehmamy, Robin Walters et al.

ICML 2025arXiv:2505.23681
#1691

Introducing 3D Representation for Dense Volume-to-Volume Translation via Score Fusion

Xiyue Zhu, Dou Kwark, Ruike Zhu et al.

ICML 2025oral
#1692

The Logical Implication Steering Method for Conditional Interventions on Transformer Generation

Damjan Kalajdzievski

ICML 2025arXiv:2502.03618
#1693

ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

Tianci Bu, Le Zhou, Wenchuan Yang et al.

ICML 2025oralarXiv:2505.23048
#1694

Flopping for FLOPs: Leveraging Equivariance for Computational Efficiency

Georg Bökman, David Nordström, Fredrik Kahl

ICML 2025spotlightarXiv:2502.05169
#1695

Deterministic Sparse Fourier Transform for Continuous Signals with Frequency Gap

Xiaoyu Li, Zhao Song, Shenghao Xie

ICML 2025
#1696

Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions

Eray Erturk, Fahad Kamran, Salar Abbaspourazad et al.

ICML 2025oralarXiv:2507.00191
#1697

Learning Initial Basis Selection for Linear Programming via Duality-Inspired Tripartite Graph Representation and Comprehensive Supervision

Anqi Lu, Junchi Yan

ICML 2025
#1698

Unified Screening for Multiple Diseases

Yiğit Narter, Alihan Hüyük, Mihaela van der Schaar et al.

ICML 2025
#1699

Generalization and Robustness of the Tilted Empirical Risk

Gholamali Aminian, Amir R. Asadi, Tian Li et al.

ICML 2025arXiv:2409.19431
#1700

TIMING: Temporality-Aware Integrated Gradients for Time Series Explanation

Hyeongwon Jang, Changhun Kim, Eunho Yang

ICML 2025oralarXiv:2506.05035
#1701

Diving into Self-Evolving Training for Multimodal Reasoning

Wei Liu, Junlong Li, Xiwen Zhang et al.

ICML 2025arXiv:2412.17451
#1702

Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation

Cheng Jin, Zhenyu Xiao, Chutao Liu et al.

ICML 2025arXiv:2506.11039
#1703

Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation

Zhihua Liu, Amrutha Saseendran, Lei Tong et al.

ICML 2025arXiv:2505.17994
#1704

Multilayer Matrix Factorization via Dimension-Reducing Diffusion Variational Inference

Junbin Liu, Farzan Farnia, Wing-Kin Ma

ICML 2025
#1705

Attention-Level Speculation

Jack Cai, Ammar Vora, Randolph Zhang et al.

ICML 2025
#1706

GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation

Jiashu HE, Mingyu Ma, Jinxuan Fan et al.

ICML 2025arXiv:2410.08475
#1707

any4: Learned 4-bit Numeric Representation for LLMs

Mostafa Elhoushi, Jeff Johnson

ICML 2025arXiv:2507.04610
#1708

Feature Shift Localization Network

Míriam Barrabés, Daniel Mas Montserrat, Kapal Dev et al.

ICML 2025arXiv:2506.09101
#1709

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.

ICML 2025arXiv:2502.19002
#1710

Contract Design Under Approximate Best Responses

Francesco Bacchiocchi, Jiarui Gan, Matteo Castiglioni et al.

ICML 2025arXiv:2502.15523
#1711

A Closer Look at Backdoor Attacks on CLIP

Shuo He, Zhifang Zhang, Feng Liu et al.

ICML 2025
#1712

Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models

Yinhong Liu, Zhijiang Guo, Tianya Liang et al.

ICML 2025spotlightarXiv:2410.02205
#1713

Toward Data-centric Directed Graph Learning: An Entropy-driven Approach

Xunkai Li, Zhengyu Wu, Kaichi Yu et al.

ICML 2025arXiv:2505.00983
#1714

PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation

Albert Gong, Kamilė Stankevičiūtė, Chao Wan et al.

ICML 2025arXiv:2502.20377
#1715

Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval

Guofeng Ding, Yiding Lu, Peng Hu et al.

ICML 2025
#1716

Geometric Feature Embedding for Effective 3D Few-Shot Class Incremental Learning

Xiangqi Li, Libo Huang, Zhulin An et al.

ICML 2025
#1717

Not All Tokens Matter All The Time: Dynamic Token Aggregation Towards Efficient Detection Transformers

Jiacheng Cheng, Xiwen Yao, Xiang Yuan et al.

ICML 2025
#1718

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Yike Yuan, Ziyu Wang, Zihao Huang et al.

ICML 2025arXiv:2503.16057
#1719

Conformal Anomaly Detection in Event Sequences

Shuai Zhang, Chuan Zhou, Yang Liu et al.

ICML 2025
#1720

When to retrain a machine learning model

Florence Regol, Leo Schwinn, Kyle Sprague et al.

ICML 2025arXiv:2505.14903
#1721

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

Dejia Xu, Yifan Jiang, Chen Huang et al.

ICML 2025oralarXiv:2410.10774
#1722

EARTH: Epidemiology-Aware Neural ODE with Continuous Disease Transmission Graph

Guancheng Wan, Zewen Liu, Xiaojun Shan et al.

ICML 2025
#1723

Pareto-frontier Entropy Search with Variational Lower Bound Maximization

Masanori Ishikura, Masayuki Karasuyama

ICML 2025arXiv:2501.19073
#1724

SAN: Hypothesizing Long-Term Synaptic Development and Neural Engram Mechanism in Scalable Model's Parameter-Efficient Fine-Tuning

Gaole Dai, Chun-Kai Fan, Yiming Tang et al.

ICML 2025arXiv:2409.06706
#1725

Learning Invariant Causal Mechanism from Vision-Language Models

Zeen Song, Siyu Zhao, Xingyu Zhang et al.

ICML 2025arXiv:2405.15289
#1726

Emergent Response Planning in LLMs

Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.

ICML 2025arXiv:2502.06258
#1727

Non-Asymptotic and Non-Lipschitzian Bounds on Optimal Values in Stochastic Optimization Under Heavy Tails

Jindong Tong, Hongcheng Liu, Johannes Royset

ICML 2025
#1728

Finding Wasserstein Ball Center: Efficient Algorithm and The Applications in Fairness

Yuntao Wang, Yuxuan Li, Qingyuan Yang et al.

ICML 2025
#1729

Self-Organizing Visual Prototypes for Non-Parametric Representation Learning

Thalles Silva, Helio Pedrini, Adín Ramírez Rivera

ICML 2025arXiv:2505.21533
#1730

CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention

Han Li, Fei Liu, Zhi Zheng et al.

ICML 2025arXiv:2412.00346
#1731

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

Chenyi yang, Wenjie Nie, Yuxin Zhang et al.

ICML 2025
#1732

VCT: Training Consistency Models with Variational Noise Coupling

Gianluigi Silvestri, Luca Ambrogioni, Chieh-Hsin Lai et al.

ICML 2025arXiv:2502.18197
#1733

Learning from True-False Labels via Multi-modal Prompt Retrieving

Zhongnian Li, Jinghao Xu, Peng Ying et al.

ICML 2025arXiv:2405.15228
#1734

Empower Structure-Based Molecule Optimization with Gradient Guided Bayesian Flow Networks

Keyue Qiu, Yuxuan Song, Jie Yu et al.

ICML 2025arXiv:2411.13280
#1735

Automated Benchmark Generation for Repository-Level Coding Tasks

Konstantinos Vergopoulos, Mark Müller, Martin Vechev

ICML 2025arXiv:2503.07701
#1736

RAGGED: Towards Informed Design of Scalable and Stable RAG Systems

Jennifer Hsia, Afreen Shaikh, Zhiruo Wang et al.

ICML 2025arXiv:2403.09040
#1737

A General Representation-Based Approach to Multi-Source Domain Adaptation

Ignavier Ng, Yan Li, Zijian Li et al.

ICML 2025
#1738

Fast Min-$\epsilon$ Segmented Regression using Constant-Time Segment Merging

Ansgar Lößer, Max Schlecht, Florian Schintke et al.

ICML 2025
#1739

ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

Jing Xiong, Jianghan Shen, Chuanyang Zheng et al.

ICML 2025arXiv:2502.14317
#1740

Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization

Zijian Liu, Zhengyuan Zhou

ICML 2025arXiv:2505.23056
#1741

Local Pan-privacy for Federated Analytics

Vitaly Feldman, Audra McMillan, Guy Rothblum et al.

ICML 2025arXiv:2503.11850
#1742

Topological Signatures of Adversaries in Multimodal Alignments

Minh Vu, Geigh Zollicoffer, Huy Mai et al.

ICML 2025arXiv:2501.18006
#1743

DeepLayout: Learning Neural Representations of Circuit Placement Layout

Yuxiang Zhao, zhuomin chai, Xun Jiang et al.

ICML 2025
#1744

Accelerated Diffusion Models via Speculative Sampling

Valentin De Bortoli, Alexandre Galashov, Arthur Gretton et al.

ICML 2025arXiv:2501.05370
#1745

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Zhenni Bi, Kai Han, Chuanjian Liu et al.

ICML 2025arXiv:2412.09078
#1746

A Selective Learning Method for Temporal Graph Continual Learning

Hanmo Liu, Shimin Di, Haoyang LI et al.

ICML 2025oralarXiv:2503.01580
#1747

Predictive Performance of Deep Quantum Data Re-uploading Models

Xin Wang, Hanxiao Tao, Re-Bing Wu

ICML 2025arXiv:2505.20337
#1748

Scalable First-order Method for Certifying Optimal k-Sparse GLMs

Jiachang Liu, Soroosh Shafiee, Andrea Lodi

ICML 2025arXiv:2502.09502
#1749

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

Ziyao Wang, Muneeza Azmat, Ang Li et al.

ICML 2025arXiv:2502.08020
#1750

Visual and Domain Knowledge for Professional-level Graph-of-Thought Medical Reasoning

Rina Bao, Shilong Dong, Zhenfang Chen et al.

ICML 2025spotlight
#1751

Reinforcement Learning for Quantum Control under Physical Constraints

Jan Ole Ernst, Aniket Chatterjee, Tim Franzmeyer et al.

ICML 2025arXiv:2501.14372
#1752

Simplicity Bias and Optimization Threshold in Two-Layer ReLU Networks

Etienne Boursier, Nicolas Flammarion

ICML 2025arXiv:2410.02348
#1753

Knowledge Swapping via Learning and Unlearning

Mingyu Xing, Lechao Cheng, Shengeng Tang et al.

ICML 2025arXiv:2502.08075
#1754

Optimization over Sparse Support-Preserving Sets: Two-Step Projection with Global Optimality Guarantees

William de Vazelhes, Xiaotong Yuan, Bin Gu

ICML 2025arXiv:2506.08558
#1755

Teaching Physical Awareness to LLMs through Sounds

Weiguo Wang, Andy Nie, Wenrui Zhou et al.

ICML 2025arXiv:2506.08524
#1756

Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal

Deheng Yuan, Tao Guo, Zhongyi Huang

ICML 2025arXiv:2501.07879
#1757

Explicit Exploration for High-Welfare Equilibria in Game-Theoretic Multiagent Reinforcement Learning

Austin Nguyen, Anri Gu, Michael Wellman

ICML 2025
#1758

Generalization of noisy SGD in unbounded non-convex settings

Leello Dadi, Volkan Cevher

ICML 2025
#1759

Compute or Load KV Cache? Why Not Both?

Shuowei Jin, Xueshen Liu, Qingzhao Zhang et al.

ICML 2025arXiv:2410.03065
#1760

MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition

Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.

ICML 2025oralarXiv:2506.23283
#1761

An All-Atom Generative Model for Designing Protein Complexes

Ruizhe Chen, Dongyu Xue, Xiangxin Zhou et al.

ICML 2025arXiv:2504.13075
#1762

3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors

Yujun Huang, Bin Chen, Niu Lian et al.

ICML 2025
#1763

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Baohao Liao, Yuhui Xu, Hanze Dong et al.

ICML 2025arXiv:2501.19324
#1764

Online Differentially Private Conformal Prediction for Uncertainty Quantification

ICML 2025
#1765

Gradient Aligned Regression via Pairwise Losses

Dixian Zhu, Tianbao Yang, Livnat Jerby

ICML 2025arXiv:2402.06104
#1766

A Bregman Proximal Viewpoint on Neural Operators

Abdel-Rahim Mezidi, Jordan Patracone, Saverio Salzo et al.

ICML 2025
#1767

Fine-Grained Captioning of Long Videos through Scene Graph Consolidation

Sanghyeok Chu, Seonguk Seo, Bohyung Han

ICML 2025oralarXiv:2502.16427
#1768

Stacey: Promoting Stochastic Steepest Descent via Accelerated $\ell_p$-Smooth Nonconvex Optimization

Xinyu Luo, Cedar Site Bai, Bolian Li et al.

ICML 2025arXiv:2506.06606
#1769

FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models

Xinting Liao, Weiming Liu, Jiaming Qian et al.

ICML 2025arXiv:2506.16218
#1770

Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity

Alessandro Montenegro, Marco Mussi, Matteo Papini et al.

ICML 2025
#1771

ADIOS: Antibody Development via Opponent Shaping

Sebastian Towers, Aleksandra Kalisz, Philippe Robert et al.

ICML 2025arXiv:2409.10588
#1772

WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs

Lukas Thede, Karsten Roth, Matthias Bethge et al.

ICML 2025arXiv:2503.05683
#1773

Autoencoder-Based Hybrid Replay for Class-Incremental Learning

Milad Khademi Nori, Il-Min Kim, Guanghui Wang

ICML 2025arXiv:2505.05926
#1774

Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network

Jijia Liu, Feng Gao, Qingmin Liao et al.

ICML 2025arXiv:2502.00288
#1775

Habitizing Diffusion Planning for Efficient and Effective Decision Making

Haofei Lu, Yifei Shen, Dongsheng Li et al.

ICML 2025arXiv:2502.06401
#1776

Fast Tensor Completion via Approximate Richardson Iteration

Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook et al.

ICML 2025arXiv:2502.09534
#1777

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

Jing Huang, Junyi Tao, Thomas Icard et al.

ICML 2025arXiv:2505.11770
#1778

Improving Memory Efficiency for Training KANs via Meta Learning

Zhangchi Zhao, Jun Shu, Deyu Meng et al.

ICML 2025arXiv:2506.07549
#1779

No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets

Corinna Coupette, Jeremy Wayland, Emily Simons et al.

ICML 2025arXiv:2502.02379
#1780

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.

ICML 2025arXiv:2411.10438
#1781

BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing

Dongliang Guo, Mengxuan Hu, Zihan Guan et al.

ICML 2025arXiv:2505.01343
#1782

SERENA: A Unified Stochastic Recursive Variance Reduced Gradient Framework for Riemannian Non-Convex Optimization

Yan Liu, Mingjie Chen, Chaojie Ji et al.

ICML 2025
#1783

Quadratic Upper Bound for Boosting Robustness

Euijin You, Hyang-Won Lee

ICML 2025arXiv:2601.13645
#1784

C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning

Zifan LIU, Xinran Li, Jun Zhang

ICML 2025
#1785

Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization

Phillip Guo, Aaquib Syed, Abhay Sheshadri et al.

ICML 2025spotlightarXiv:2410.12949
#1786

Flow-based Domain Randomization for Learning and Sequencing Robotic Skills

Aidan Curtis, Eric Li, Michael S Noseworthy et al.

ICML 2025arXiv:2502.01800
#1787

Quadruple Attention in Many-body Systems for Accurate Molecular Property Predictions

Jiahua Rao, Dahao Xu, Wentao Wei et al.

ICML 2025
#1788

All-atom inverse protein folding through discrete flow matching

Kai Yi, Kiarash Jamali, Sjors Scheres

ICML 2025arXiv:2507.14156
#1789

Progressively Label Enhancement for Large Language Model Alignment

Biao Liu, Ning Xu, Xin Geng

ICML 2025arXiv:2408.02599
#1790

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

Charles O'Neill, Alim Gumran, David Klindt

ICML 2025arXiv:2411.13117
#1791

Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization

Cameron Jakub, Mihai Nica

ICML 2025arXiv:2302.09712
#1792

Topology-aware Neural Flux Prediction Guided by Physics

Haoyang Jiang, Jindong Wang, Xingquan Zhu et al.

ICML 2025arXiv:2506.05676
#1793

KernelBench: Can LLMs Write Efficient GPU Kernels?

Anne Ouyang, Simon Guo, Simran Arora et al.

ICML 2025arXiv:2502.10517
#1794

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

André Duarte, Xuandong Zhao, Arlindo Oliveira et al.

ICML 2025arXiv:2502.17358
#1795

Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games

Yi Feng, Kaito Fujii, EFSTRATIOS PANTELEIMON SKOULAKIS et al.

ICML 2025arXiv:2505.19537
#1796

DPO Meets PPO: Reinforced Token Optimization for RLHF

Han Zhong, Zikang Shan, Guhao Feng et al.

ICML 2025spotlightarXiv:2404.18922
#1797

Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?

Ren-Biao Liu, Anqi Li, ChaodingYang et al.

ICML 2025
#1798

Byzantine-Resilient Federated Alternating Gradient Descent and Minimization for Partly-Decoupled Low Rank Matrix Learning

Ankit Pratap Singh, Ahmed Abbasi, Namrata Vaswani

ICML 2025
#1799

Safe-EF: Error Feedback for Non-smooth Constrained Optimization

Rustem Islamov, Yarden As, Ilyas Fatkhullin

ICML 2025
#1800

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2410.22316