Most Cited ICML "compute-optimal training" Papers

5,975 papers found • Page 9 of 30

Filters:Most Cited ICML compute-optimal training Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#1601

Minimalist Concept Erasure in Generative Models

Yang Zhang, Er Jin, Yanfei Dong et al.

ICML 2025arXiv:2507.13386

#1602

Quantifying Treatment Effects: Estimating Risk Ratios via Observational Studies

Ahmed Boughdiri, julie Josse, Erwan Scornet

ICML 2025

#1603

Communicating Activations Between Language Model Agents

Vignav Ramesh, Kenneth Li

ICML 2025arXiv:2501.14082

#1604

MedRAX: Medical Reasoning Agent for Chest X-ray

Adibvafa Fallahpour, Jun Ma, Alif Munim et al.

ICML 2025arXiv:2502.02673

#1605

On the Power of Learning-Augmented Search Trees

Jingbang Chen, Xinyuan Cao, Alicia Stepin et al.

ICML 2025arXiv:2211.09251

#1606

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.

ICML 2025arXiv:2505.17552

#1607

Latent Variable Causal Discovery under Selection Bias

Haoyue Dai, Yiwen Qiu, Ignavier Ng et al.

ICML 2025arXiv:2512.11219

#1608

WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models

Tan Songbai, Xuerui Qiu, Yao Shu et al.

ICML 2025

#1609

LEVIS: Large Exact Verifiable Input Spaces for Neural Networks

Mohamad Chehade, Wenting Li, Brian Bell et al.

ICML 2025arXiv:2408.08824

#1610

Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity

Zhenglin Wan, Xingrui Yu, David Bossens et al.

ICML 2025oral

#1611

Overcoming Non-monotonicity in Transducer-based Streaming Generation

Zhengrui Ma, Yang Feng, Min zhang

ICML 2025arXiv:2411.17170

#1612

Efficient Bisection Projection to Ensure Neural-Network Solution Feasibility for Optimization over General Set

Enming Liang, Minghua Chen

ICML 2025

#1613

Time-Aware World Model for Adaptive Prediction and Control

Anh Nhu, Sanghyun Son, Ming Lin

ICML 2025oralarXiv:2506.08441

#1614

A Mixture-Based Framework for Guiding Diffusion Models

Yazid Janati, Badr MOUFAD, Mehdi Qassime et al.

ICML 2025arXiv:2502.03332

#1615

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

Yuxuan Zhu, Antony Kellermann, Dylan Bowman et al.

ICML 2025spotlightarXiv:2503.17332

#1616

Mixture of Hidden-Dimensions: Not All Hidden-States’ Dimensions are Needed in Transformer

Yilong Chen, Junyuan Shang, Zhenyu Zhang et al.

ICML 2025

#1617

Conformal Tail Risk Control for Large Language Model Alignment

Catherine Chen, Jingyan Shen, Xinyu Yang et al.

ICML 2025arXiv:2502.20285

#1618

Aligning LLMs by Predicting Preferences from User Writing Samples

Stéphane Aroca-Ouellette, Natalie Mackraz, Barry-John Theobald et al.

ICML 2025arXiv:2505.23815

#1619

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

Arsalan Sharifnassab, Saber Salehkaleybar, Rich Sutton

ICML 2025arXiv:2402.02342

#1620

In-Context Deep Learning via Transformer Models

Weimin Wu, Maojiang Su, Jerry Yao-Chieh Hu et al.

ICML 2025arXiv:2411.16549

#1621

Learning dynamics in linear recurrent neural networks

Alexandra Proca, Clémentine Dominé, Murray Shanahan et al.

ICML 2025oral

#1622

Benign Overfitting in Token Selection of Attention Mechanism

Keitaro Sakamoto, Issei Sato

ICML 2025arXiv:2409.17625

#1623

On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding

Kevin Xu, Issei Sato

ICML 2025arXiv:2410.01405

#1624

Structured Preconditioners in Adaptive Optimization: A Unified Analysis

Shuo Xie, Tianhao Wang, Sashank J. Reddi et al.

ICML 2025arXiv:2503.10537

#1625

KoNODE: Koopman-Driven Neural Ordinary Differential Equations with Evolving Parameters for Time Series Analysis

Hanru Bai, Weiyang Ding

ICML 2025

#1626

Structure-informed Risk Minimization for Robust Ensemble Learning

Fengchun Qiao, Yanlin Chen, Xi Peng

ICML 2025

#1627

Feasible Action Search for Bandit Linear Programs via Thompson Sampling

Aditya Gangrade, Aldo Pacchiano, Clay Scott et al.

ICML 2025

#1628

Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning

Zhenghai Xue, Lang Feng, Jiacheng Xu et al.

ICML 2025spotlightarXiv:2503.06893

#1629

AuPair: Golden Example Pairs for Code Repair

Aditi Mavalankar, Hassan Mansoor, Zita Marinho et al.

ICML 2025arXiv:2502.18487

#1630

Deep Streaming View Clustering

Honglin Yuan, Xingfeng Li, Jian Dai et al.

ICML 2025

#1631

Janus: Dual-Server Multi-Round Secure Aggregation with Verifiability for Federated Learning

Lang Pu, Jingjing Gu, Chao Lin et al.

ICML 2025

#1632

Lego Sketch: A Scalable Memory-augmented Neural Network for Sketching Data Streams

Yuan Feng, Yukun Cao, Hairu Wang et al.

ICML 2025arXiv:2505.19561

#1633

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Zhuofan Zong, Dongzhi Jiang, Bingqi Ma et al.

ICML 2025arXiv:2412.09618

#1634

Risk and cross validation in ridge regression with correlated samples

Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan

ICML 2025arXiv:2408.04607

#1635

Identification of Latent Confounders via Investigating the Tensor Ranks of the Nonlinear Observations

Zhengming Chen, Yewei Xia, Feng Xie et al.

ICML 2025

#1636

Online Learning in the Random-Order Model

Martino Bernasconi, Andrea Celli, Riccardo Colini Baldeschi et al.

ICML 2025

#1637

On the Similarities of Embeddings in Contrastive Learning

Chungpa Lee, Sehee Lim, Kibok Lee et al.

ICML 2025arXiv:2506.09781

#1638

VerbalTS: Generating Time Series from Texts

Shuqi Gu, Chuyue Li, Baoyu Jing et al.

ICML 2025oral

#1639

Private Model Personalization Revisited

Conor Snedeker, Xinyu Zhou, Raef Bassily

ICML 2025arXiv:2506.19220

#1640

Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle

Hui Dai, Ryan Teehan, Mengye Ren

ICML 2025oralarXiv:2411.08324

#1641

Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing

Zhuoying Li, Zhu Xu, Yuxin Peng et al.

ICML 2025arXiv:2506.13827

#1642

Diss-l-ECT: Dissecting Graph Data with Local Euler Characteristic Transforms

Julius Von Rohrscheidt, Bastian Rieck

ICML 2025arXiv:2410.02622

#1643

Voronoi-grid-based Pareto Front Learning and Its Application to Collaborative Federated Learning

Mengmeng Chen, Xiaohu Wu, QIQI LIU et al.

ICML 2025arXiv:2505.20648

#1644

Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts

Li gengluo, Huawen Shen, Yu ZHOU

ICML 2025arXiv:2506.04999

#1645

Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres

Muskan Dosi, Chiranjeev Chiranjeev, Kartik Thakral et al.

ICML 2025arXiv:2506.10576

#1646

Towards Rationale-Answer Alignment of LVLMs via Self-Rationale Calibration

Yuanchen Wu, Ke Yan, Shouhong Ding et al.

ICML 2025arXiv:2509.13919

#1647

Unveiling Markov heads in Pretrained Language Models for Offline Reinforcement Learning

Wenhao Zhao, Qiushui Xu, Linjie Xu et al.

ICML 2025arXiv:2409.06985

#1648

Stability and Generalization Capability of Subgraph Reasoning Models for Inductive Knowledge Graph Completion

Minsung Hwang, Jaejun Lee, Joyce Whang

ICML 2025

#1649

Improving Zero-Shot Adversarial Robustness in Vision-Language Models by Closed-form Alignment of Adversarial Path Simplices

Junhao Dong, Piotr Koniusz, Yifei Zhang et al.

ICML 2025spotlight

#1650

Inverse problems with experiment-guided AlphaFold

Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.

ICML 2025arXiv:2502.09372

#1651

Cross-regularization: Adaptive Model Complexity through Validation Gradients

Carlos Stein Naves de Brito

ICML 2025arXiv:2506.19755

#1652

CTBench: A Library and Benchmark for Certified Training

Yuhao Mao, Stefan Balauca, Martin Vechev

ICML 2025arXiv:2406.04848

#1653

Average Certified Radius is a Poor Metric for Randomized Smoothing

Chenhao Sun, Yuhao Mao, Mark Müller et al.

ICML 2025arXiv:2410.06895

#1654

Generalization Principles for Inference over Text-Attributed Graphs with Large Language Models

Haoyu Wang, Shikun Liu, Rongzhe Wei et al.

ICML 2025

#1655

Identifying and Understanding Cross-Class Features in Adversarial Training

Zeming Wei, Yiwen Guo, Yisen Wang

ICML 2025arXiv:2506.05032

#1656

Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups

Weiqiu You, Helen Qu, Marco Gatti et al.

ICML 2025arXiv:2310.16316

#1657

Ad-Hoc Human-AI Coordination Challenge

Tin Dizdarevic, Ravi Hammond, Tobias Gessler et al.

ICML 2025spotlight

#1658

Revisiting Unbiased Implicit Variational Inference

Tobias Pielok, Bernd Bischl, David Rügamer

ICML 2025arXiv:2506.03839

#1659

MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition

ning wang, Zekun Li, Tongxin Bai et al.

ICML 2025

#1660

Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin et al.

ICML 2025arXiv:2507.00518

#1661

Stochastic Encodings for Active Feature Acquisition

Alexander Norcliffe, Changhee Lee, Fergus Imrie et al.

ICML 2025arXiv:2508.01957

#1662

Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves

Mykhailo Uss, Ruslan Yermolenko, Oleksii Shashko et al.

ICML 2025arXiv:2405.14024

#1663

D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples

Zijing Hu, Fengda Zhang, Kun Kuang

ICML 2025arXiv:2505.22002

#1664

Approximating Latent Manifolds in Neural Networks via Vanishing Ideals

Nico Pelleriti, Max Zimmer, Elias Wirth et al.

ICML 2025arXiv:2502.15051

#1665

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025arXiv:2506.03470

#1666

Discovering Latent Causal Graphs from Spatiotemporal Data

Kun Wang, Sumanth Varambally, Duncan Watson-Parris et al.

ICML 2025oralarXiv:2411.05331

#1667

Adjusting Model Size in Continual Gaussian Processes: How Big is Big Enough?

Guiomar Pescador-Barrios, Sarah Filippi, Mark van der Wilk

ICML 2025spotlightarXiv:2408.07588

#1668

SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion

Junwei Su, shan Wu

ICML 2025arXiv:2508.14352

#1669

BOPO: Neural Combinatorial Optimization via Best-anchored and Objective-guided Preference Optimization

Zijun Liao, Jinbiao Chen, Debing Wang et al.

ICML 2025arXiv:2503.07580

#1670

Identifying Metric Structures of Deep Latent Variable Models

Stas Syrota, Yevgen Zainchkovskyy, Johnny Xi et al.

ICML 2025arXiv:2502.13757

#1671

CoDy: Counterfactual Explainers for Dynamic Graphs

Zhan Qu, Daniel Gomm, Michael Färber

ICML 2025oralarXiv:2403.16846

#1672

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Yining Pan, Qiongjie Cui, Xulei Yang et al.

ICML 2025arXiv:2505.18956

#1673

Eigen Analysis of Conjugate Kernel and Neural Tangent Kernel

Xiangchao Li, Xiao Han, Qing Yang

ICML 2025

#1674

Causal Invariance-aware Augmentation for Brain Graph Contrastive Learning

Minqi Yu, Jinduo Liu, Junzhong Ji

ICML 2025

#1675

Online Clustering of Dueling Bandits

Zhiyong Wang, Jiahang Sun, Mingze Kong et al.

ICML 2025arXiv:2502.02079

#1676

Learnable Spatial-Temporal Positional Encoding for Link Prediction

Katherine Tieu, Dongqi Fu, Zihao Li et al.

ICML 2025oralarXiv:2506.08309

#1677

Elucidating the Design Space of Multimodal Protein Language Models

Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.

ICML 2025spotlightarXiv:2504.11454

#1678

Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing

Xiang Zhang, Jiaqi Wei, Zijie Qiu et al.

ICML 2025oralarXiv:2506.13485

#1679

Understanding Model Reprogramming for CLIP via Decoupling Visual Prompts

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICML 2025arXiv:2506.01000

#1680

Automatically Interpreting Millions of Features in Large Language Models

Gonçalo Paulo, Alex Mallen, Caden Juang et al.

ICML 2025arXiv:2410.13928

#1681

Phase and Amplitude-aware Prompting for Enhancing Adversarial Robustness

Yibo Xu, Dawei Zhou, Decheng Liu et al.

ICML 2025

#1682

On Differential Privacy for Adaptively Solving Search Problems via Sketching

Shiyuan Feng, Ying Feng, George Li et al.

ICML 2025oralarXiv:2506.05503

#1683

Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos

Tianyi Zhang, Yu Cao, Dianbo Liu

ICML 2025arXiv:2402.18888

#1684

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Xiangxin Zhou, Mingyu Li, xiao yi et al.

ICML 2025arXiv:2505.21452

#1685

Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?

Yujin Han, Andi Han, Wei Huang et al.

ICML 2025arXiv:2502.04725

#1686

Towards Escaping from Class Dependency Modeling for Multi-Dimensional Classification

Teng Huang, Bin-Bin Jia, Min-Ling Zhang

ICML 2025

#1687

Skip the Equations: Learning Behavior of Personalized Dynamical Systems Directly From Data

Krzysztof Kacprzyk, Julianna Piskorz, Mihaela van der Schaar

ICML 2025oral

#1688

Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead

Won-Jun Jang, Hyeon-Seo Park, Si-Hyeon Lee

ICML 2025arXiv:2502.06349

#1689

Prediction-Powered Adaptive Shrinkage Estimation

Sida Li, Nikolaos Ignatiadis

ICML 2025arXiv:2502.14166

#1690

Understanding Mode Connectivity via Parameter Space Symmetry

Bo Zhao, Nima Dehmamy, Robin Walters et al.

ICML 2025arXiv:2505.23681

#1691

Introducing 3D Representation for Dense Volume-to-Volume Translation via Score Fusion

Xiyue Zhu, Dou Kwark, Ruike Zhu et al.

ICML 2025oral

#1692

The Logical Implication Steering Method for Conditional Interventions on Transformer Generation

Damjan Kalajdzievski

ICML 2025arXiv:2502.03618

#1693

ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

Tianci Bu, Le Zhou, Wenchuan Yang et al.

ICML 2025oralarXiv:2505.23048

#1694

Flopping for FLOPs: Leveraging Equivariance for Computational Efficiency

Georg Bökman, David Nordström, Fredrik Kahl

ICML 2025spotlightarXiv:2502.05169

#1695

Deterministic Sparse Fourier Transform for Continuous Signals with Frequency Gap

Xiaoyu Li, Zhao Song, Shenghao Xie

ICML 2025

#1696

Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions

Eray Erturk, Fahad Kamran, Salar Abbaspourazad et al.

ICML 2025oralarXiv:2507.00191

#1697

Learning Initial Basis Selection for Linear Programming via Duality-Inspired Tripartite Graph Representation and Comprehensive Supervision

Anqi Lu, Junchi Yan

ICML 2025

#1698

Unified Screening for Multiple Diseases

Yiğit Narter, Alihan Hüyük, Mihaela van der Schaar et al.

ICML 2025

#1699

Generalization and Robustness of the Tilted Empirical Risk

Gholamali Aminian, Amir R. Asadi, Tian Li et al.

ICML 2025arXiv:2409.19431

#1700

TIMING: Temporality-Aware Integrated Gradients for Time Series Explanation

Hyeongwon Jang, Changhun Kim, Eunho Yang

ICML 2025oralarXiv:2506.05035

#1701

Diving into Self-Evolving Training for Multimodal Reasoning

Wei Liu, Junlong Li, Xiwen Zhang et al.

ICML 2025arXiv:2412.17451

#1702

Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation

Cheng Jin, Zhenyu Xiao, Chutao Liu et al.

ICML 2025arXiv:2506.11039

#1703

Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation

Zhihua Liu, Amrutha Saseendran, Lei Tong et al.

ICML 2025arXiv:2505.17994

#1704

Multilayer Matrix Factorization via Dimension-Reducing Diffusion Variational Inference

Junbin Liu, Farzan Farnia, Wing-Kin Ma

ICML 2025

#1705

Attention-Level Speculation

Jack Cai, Ammar Vora, Randolph Zhang et al.

ICML 2025

#1706

GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation

Jiashu HE, Mingyu Ma, Jinxuan Fan et al.

ICML 2025arXiv:2410.08475

#1707

any4: Learned 4-bit Numeric Representation for LLMs

Mostafa Elhoushi, Jeff Johnson

ICML 2025arXiv:2507.04610

#1708

Feature Shift Localization Network

Míriam Barrabés, Daniel Mas Montserrat, Kapal Dev et al.

ICML 2025arXiv:2506.09101

#1709

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.

ICML 2025arXiv:2502.19002

#1710

Contract Design Under Approximate Best Responses

Francesco Bacchiocchi, Jiarui Gan, Matteo Castiglioni et al.

ICML 2025arXiv:2502.15523

#1711

A Closer Look at Backdoor Attacks on CLIP

Shuo He, Zhifang Zhang, Feng Liu et al.

ICML 2025

#1712

Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models

Yinhong Liu, Zhijiang Guo, Tianya Liang et al.

ICML 2025spotlightarXiv:2410.02205

#1713

Toward Data-centric Directed Graph Learning: An Entropy-driven Approach

Xunkai Li, Zhengyu Wu, Kaichi Yu et al.

ICML 2025arXiv:2505.00983

#1714

PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation

Albert Gong, Kamilė Stankevičiūtė, Chao Wan et al.

ICML 2025arXiv:2502.20377

#1715

Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval

Guofeng Ding, Yiding Lu, Peng Hu et al.

ICML 2025

#1716

Geometric Feature Embedding for Effective 3D Few-Shot Class Incremental Learning

Xiangqi Li, Libo Huang, Zhulin An et al.

ICML 2025

#1717

Not All Tokens Matter All The Time: Dynamic Token Aggregation Towards Efficient Detection Transformers

Jiacheng Cheng, Xiwen Yao, Xiang Yuan et al.

ICML 2025

#1718

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Yike Yuan, Ziyu Wang, Zihao Huang et al.

ICML 2025arXiv:2503.16057

#1719

Conformal Anomaly Detection in Event Sequences

Shuai Zhang, Chuan Zhou, Yang Liu et al.

ICML 2025

#1720

When to retrain a machine learning model

Florence Regol, Leo Schwinn, Kyle Sprague et al.

ICML 2025arXiv:2505.14903

#1721

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

Dejia Xu, Yifan Jiang, Chen Huang et al.

ICML 2025oralarXiv:2410.10774

#1722

EARTH: Epidemiology-Aware Neural ODE with Continuous Disease Transmission Graph

Guancheng Wan, Zewen Liu, Xiaojun Shan et al.

ICML 2025

#1723

Pareto-frontier Entropy Search with Variational Lower Bound Maximization

Masanori Ishikura, Masayuki Karasuyama

ICML 2025arXiv:2501.19073

#1724

SAN: Hypothesizing Long-Term Synaptic Development and Neural Engram Mechanism in Scalable Model's Parameter-Efficient Fine-Tuning

Gaole Dai, Chun-Kai Fan, Yiming Tang et al.

ICML 2025arXiv:2409.06706

#1725

Learning Invariant Causal Mechanism from Vision-Language Models

Zeen Song, Siyu Zhao, Xingyu Zhang et al.

ICML 2025arXiv:2405.15289

#1726

Emergent Response Planning in LLMs

Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.

ICML 2025arXiv:2502.06258

#1727

Non-Asymptotic and Non-Lipschitzian Bounds on Optimal Values in Stochastic Optimization Under Heavy Tails

Jindong Tong, Hongcheng Liu, Johannes Royset

ICML 2025

#1728

Finding Wasserstein Ball Center: Efficient Algorithm and The Applications in Fairness

Yuntao Wang, Yuxuan Li, Qingyuan Yang et al.

ICML 2025

#1729

Self-Organizing Visual Prototypes for Non-Parametric Representation Learning

Thalles Silva, Helio Pedrini, Adín Ramírez Rivera

ICML 2025arXiv:2505.21533

#1730

CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention

Han Li, Fei Liu, Zhi Zheng et al.

ICML 2025arXiv:2412.00346

#1731

BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training

Chenyi yang, Wenjie Nie, Yuxin Zhang et al.

ICML 2025

#1732

VCT: Training Consistency Models with Variational Noise Coupling

Gianluigi Silvestri, Luca Ambrogioni, Chieh-Hsin Lai et al.

ICML 2025arXiv:2502.18197

#1733

Learning from True-False Labels via Multi-modal Prompt Retrieving

Zhongnian Li, Jinghao Xu, Peng Ying et al.

ICML 2025arXiv:2405.15228

#1734

Empower Structure-Based Molecule Optimization with Gradient Guided Bayesian Flow Networks

Keyue Qiu, Yuxuan Song, Jie Yu et al.

ICML 2025arXiv:2411.13280

#1735

Automated Benchmark Generation for Repository-Level Coding Tasks

Konstantinos Vergopoulos, Mark Müller, Martin Vechev

ICML 2025arXiv:2503.07701

#1736

RAGGED: Towards Informed Design of Scalable and Stable RAG Systems

Jennifer Hsia, Afreen Shaikh, Zhiruo Wang et al.

ICML 2025arXiv:2403.09040

#1737

A General Representation-Based Approach to Multi-Source Domain Adaptation

Ignavier Ng, Yan Li, Zijian Li et al.

ICML 2025

#1738

Fast Min-$\epsilon$ Segmented Regression using Constant-Time Segment Merging

Ansgar Lößer, Max Schlecht, Florian Schintke et al.

ICML 2025

#1739

ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

Jing Xiong, Jianghan Shen, Chuanyang Zheng et al.

ICML 2025arXiv:2502.14317

#1740

Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization

Zijian Liu, Zhengyuan Zhou

ICML 2025arXiv:2505.23056

#1741

Local Pan-privacy for Federated Analytics

Vitaly Feldman, Audra McMillan, Guy Rothblum et al.

ICML 2025arXiv:2503.11850

#1742

Topological Signatures of Adversaries in Multimodal Alignments

Minh Vu, Geigh Zollicoffer, Huy Mai et al.

ICML 2025arXiv:2501.18006

#1743

DeepLayout: Learning Neural Representations of Circuit Placement Layout

Yuxiang Zhao, zhuomin chai, Xun Jiang et al.

ICML 2025

#1744

Accelerated Diffusion Models via Speculative Sampling

Valentin De Bortoli, Alexandre Galashov, Arthur Gretton et al.

ICML 2025arXiv:2501.05370

#1745

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Zhenni Bi, Kai Han, Chuanjian Liu et al.

ICML 2025arXiv:2412.09078

#1746

A Selective Learning Method for Temporal Graph Continual Learning

Hanmo Liu, Shimin Di, Haoyang LI et al.

ICML 2025oralarXiv:2503.01580

#1747

Predictive Performance of Deep Quantum Data Re-uploading Models

Xin Wang, Hanxiao Tao, Re-Bing Wu

ICML 2025arXiv:2505.20337

#1748

Scalable First-order Method for Certifying Optimal k-Sparse GLMs

Jiachang Liu, Soroosh Shafiee, Andrea Lodi

ICML 2025arXiv:2502.09502

#1749

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

Ziyao Wang, Muneeza Azmat, Ang Li et al.

ICML 2025arXiv:2502.08020

#1750

Visual and Domain Knowledge for Professional-level Graph-of-Thought Medical Reasoning

Rina Bao, Shilong Dong, Zhenfang Chen et al.

ICML 2025spotlight

#1751

Reinforcement Learning for Quantum Control under Physical Constraints

Jan Ole Ernst, Aniket Chatterjee, Tim Franzmeyer et al.

ICML 2025arXiv:2501.14372

#1752

Simplicity Bias and Optimization Threshold in Two-Layer ReLU Networks

Etienne Boursier, Nicolas Flammarion

ICML 2025arXiv:2410.02348

#1753

Knowledge Swapping via Learning and Unlearning

Mingyu Xing, Lechao Cheng, Shengeng Tang et al.

ICML 2025arXiv:2502.08075

#1754

Optimization over Sparse Support-Preserving Sets: Two-Step Projection with Global Optimality Guarantees

William de Vazelhes, Xiaotong Yuan, Bin Gu

ICML 2025arXiv:2506.08558

#1755

Teaching Physical Awareness to LLMs through Sounds

Weiguo Wang, Andy Nie, Wenrui Zhou et al.

ICML 2025arXiv:2506.08524

#1756

Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal

Deheng Yuan, Tao Guo, Zhongyi Huang

ICML 2025arXiv:2501.07879

#1757

Explicit Exploration for High-Welfare Equilibria in Game-Theoretic Multiagent Reinforcement Learning

Austin Nguyen, Anri Gu, Michael Wellman

ICML 2025

#1758

Generalization of noisy SGD in unbounded non-convex settings

Leello Dadi, Volkan Cevher

ICML 2025

#1759

Compute or Load KV Cache? Why Not Both?

Shuowei Jin, Xueshen Liu, Qingzhao Zhang et al.

ICML 2025arXiv:2410.03065

#1760

MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition

Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.

ICML 2025oralarXiv:2506.23283

#1761

An All-Atom Generative Model for Designing Protein Complexes

Ruizhe Chen, Dongyu Xue, Xiangxin Zhou et al.

ICML 2025arXiv:2504.13075

#1762

3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors

Yujun Huang, Bin Chen, Niu Lian et al.

ICML 2025

#1763

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Baohao Liao, Yuhui Xu, Hanze Dong et al.

ICML 2025arXiv:2501.19324

#1764

Online Differentially Private Conformal Prediction for Uncertainty Quantification

ICML 2025

#1765

Gradient Aligned Regression via Pairwise Losses

Dixian Zhu, Tianbao Yang, Livnat Jerby

ICML 2025arXiv:2402.06104

#1766

A Bregman Proximal Viewpoint on Neural Operators

Abdel-Rahim Mezidi, Jordan Patracone, Saverio Salzo et al.

ICML 2025

#1767

Fine-Grained Captioning of Long Videos through Scene Graph Consolidation

Sanghyeok Chu, Seonguk Seo, Bohyung Han

ICML 2025oralarXiv:2502.16427

#1768

Stacey: Promoting Stochastic Steepest Descent via Accelerated $\ell_p$-Smooth Nonconvex Optimization

Xinyu Luo, Cedar Site Bai, Bolian Li et al.

ICML 2025arXiv:2506.06606

#1769

FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models

Xinting Liao, Weiming Liu, Jiaming Qian et al.

ICML 2025arXiv:2506.16218

#1770

Convergence Analysis of Policy Gradient Methods with Dynamic Stochasticity

Alessandro Montenegro, Marco Mussi, Matteo Papini et al.

ICML 2025

#1771

ADIOS: Antibody Development via Opponent Shaping

Sebastian Towers, Aleksandra Kalisz, Philippe Robert et al.

ICML 2025arXiv:2409.10588

#1772

WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs

Lukas Thede, Karsten Roth, Matthias Bethge et al.

ICML 2025arXiv:2503.05683

#1773

Autoencoder-Based Hybrid Replay for Class-Incremental Learning

Milad Khademi Nori, Il-Min Kim, Guanghui Wang

ICML 2025arXiv:2505.05926

#1774

Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network

Jijia Liu, Feng Gao, Qingmin Liao et al.

ICML 2025arXiv:2502.00288

#1775

Habitizing Diffusion Planning for Efficient and Effective Decision Making

Haofei Lu, Yifei Shen, Dongsheng Li et al.

ICML 2025arXiv:2502.06401

#1776

Fast Tensor Completion via Approximate Richardson Iteration

Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook et al.

ICML 2025arXiv:2502.09534

#1777

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

Jing Huang, Junyi Tao, Thomas Icard et al.

ICML 2025arXiv:2505.11770

#1778

Improving Memory Efficiency for Training KANs via Meta Learning

Zhangchi Zhao, Jun Shu, Deyu Meng et al.

ICML 2025arXiv:2506.07549

#1779

No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets

Corinna Coupette, Jeremy Wayland, Emily Simons et al.

ICML 2025arXiv:2502.02379

#1780

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.

ICML 2025arXiv:2411.10438

#1781

BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing

Dongliang Guo, Mengxuan Hu, Zihan Guan et al.

ICML 2025arXiv:2505.01343

#1782

SERENA: A Unified Stochastic Recursive Variance Reduced Gradient Framework for Riemannian Non-Convex Optimization

Yan Liu, Mingjie Chen, Chaojie Ji et al.

ICML 2025

#1783

Quadratic Upper Bound for Boosting Robustness

Euijin You, Hyang-Won Lee

ICML 2025arXiv:2601.13645

#1784

C2IQL: Constraint-Conditioned Implicit Q-learning for Safe Offline Reinforcement Learning

Zifan LIU, Xinran Li, Jun Zhang

ICML 2025

#1785

Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization

Phillip Guo, Aaquib Syed, Abhay Sheshadri et al.

ICML 2025spotlightarXiv:2410.12949

#1786

Flow-based Domain Randomization for Learning and Sequencing Robotic Skills

Aidan Curtis, Eric Li, Michael S Noseworthy et al.

ICML 2025arXiv:2502.01800

#1787

Quadruple Attention in Many-body Systems for Accurate Molecular Property Predictions

Jiahua Rao, Dahao Xu, Wentao Wei et al.

ICML 2025

#1788

All-atom inverse protein folding through discrete flow matching

Kai Yi, Kiarash Jamali, Sjors Scheres

ICML 2025arXiv:2507.14156

#1789

Progressively Label Enhancement for Large Language Model Alignment

Biao Liu, Ning Xu, Xin Geng

ICML 2025arXiv:2408.02599

#1790

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

Charles O'Neill, Alim Gumran, David Klindt

ICML 2025arXiv:2411.13117

#1791

Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization

Cameron Jakub, Mihai Nica

ICML 2025arXiv:2302.09712

#1792

Topology-aware Neural Flux Prediction Guided by Physics

Haoyang Jiang, Jindong Wang, Xingquan Zhu et al.

ICML 2025arXiv:2506.05676

#1793

KernelBench: Can LLMs Write Efficient GPU Kernels?

Anne Ouyang, Simon Guo, Simran Arora et al.

ICML 2025arXiv:2502.10517

#1794

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

André Duarte, Xuandong Zhao, Arlindo Oliveira et al.

ICML 2025arXiv:2502.17358

#1795

Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games

Yi Feng, Kaito Fujii, EFSTRATIOS PANTELEIMON SKOULAKIS et al.

ICML 2025arXiv:2505.19537

#1796

DPO Meets PPO: Reinforced Token Optimization for RLHF

Han Zhong, Zikang Shan, Guhao Feng et al.

ICML 2025spotlightarXiv:2404.18922

#1797

Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?

Ren-Biao Liu, Anqi Li, ChaodingYang et al.

ICML 2025

#1798

Byzantine-Resilient Federated Alternating Gradient Descent and Minimization for Partly-Decoupled Low Rank Matrix Learning

Ankit Pratap Singh, Ahmed Abbasi, Namrata Vaswani

ICML 2025

#1799

Safe-EF: Error Feedback for Non-smooth Constrained Optimization

Rustem Islamov, Yarden As, Ilyas Fatkhullin

ICML 2025

#1800

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2410.22316

← Previous

1...7 8 9 10 11...30