Most Cited 2025 "learning rate warmup" Papers

21,856 papers found • Page 109 of 110

#21601

A Generic Family of Graphical Models: Diversity, Efficiency, and Heterogeneity

Yufei Huang, Changhu Wang, Junjie Tang et al.

ICML 2025poster
#21602

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Yuchen Zeng, Tuan Dinh, Wonjun Kang et al.

ICML 2025spotlightarXiv:2506.05584
#21603

Efficiently Vectorized MCMC on Modern Accelerators

Hugh Dance, Pierre Glaser, Peter Orbanz et al.

ICML 2025spotlightarXiv:2503.17405
#21604

Understanding Complexity in VideoQA via Visual Program Generation

Cristobal Eyzaguirre, Igor Vasiljevic, Achal Dave et al.

ICML 2025posterarXiv:2505.13429
#21605

Adversarial Inputs for Linear Algebra Backends

Jonas Möller, Lukas Pirch, Felix Weissberg et al.

ICML 2025poster
#21606

SKOLR: Structured Koopman Operator Linear RNN for Time-Series Forecasting

Yitian Zhang, Liheng Ma, Antonios Valkanas et al.

ICML 2025posterarXiv:2506.14113
#21607

Beyond Topological Self-Explainable GNNs: A Formal Explainability Perspective

Steve Azzolin, SAGAR MALHOTRA, Andrea Passerini et al.

ICML 2025posterarXiv:2502.02719
#21608

ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy

Kian Kenyon-Dean, Zitong Jerry Wang, John Urbanik et al.

ICML 2025posterarXiv:2411.02572
#21609

SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models

Jiawei Zhang, Xuan Yang, Taiqi Wang et al.

ICML 2025posterarXiv:2503.00211
#21610

Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity

Atefeh Sohrabizadeh, Jialin Song, Mingjie Liu et al.

ICML 2025poster
#21611

Whitened CLIP as a Likelihood Surrogate of Images and Captions

Roy Betser, Meir Yossef Levi, Guy Gilboa

ICML 2025posterarXiv:2505.06934
#21612

Time Series Representations with Hard-Coded Invariances

Thibaut Germain, Chrysoula Kosma, Laurent Oudre

ICML 2025poster
#21613

EncryptedLLM: Privacy-Preserving Large Language Model Inference via GPU-Accelerated Fully Homomorphic Encryption

Leo de Castro, Daniel Escudero, Adya Agrawal et al.

ICML 2025poster
#21614

Universal Neural Optimal Transport

Jonathan Geuter, Gregor Kornhardt, Ingimar Tomasson et al.

ICML 2025posterarXiv:2212.00133
#21615

Measuring In-Context Computation Complexity via Hidden State Prediction

Vincent Herrmann, Róbert Csordás, Jürgen Schmidhuber

ICML 2025posterarXiv:2503.13431
#21616

Teaching Transformers Causal Reasoning through Axiomatic Training

Aniket Vashishtha, Abhinav Kumar, Atharva Pandey et al.

ICML 2025posterarXiv:2407.07612
#21617

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

Wenju Sun, Qingyong Li, Yangliao Geng et al.

ICML 2025posterarXiv:2505.06977
#21618

Efficient Curvature-Aware Hypergradient Approximation for Bilevel Optimization

Youran Dong, Junfeng Yang, Wei Yao et al.

ICML 2025posterarXiv:2505.02101
#21619

CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition

Zebin Wang, Menghan Lin, Bolin Shen et al.

ICML 2025posterarXiv:2506.17709
#21620

Simplicity Bias and Optimization Threshold in Two-Layer ReLU Networks

Etienne Boursier, Nicolas Flammarion

ICML 2025posterarXiv:2410.02348
#21621

Provably Near-Optimal Federated Ensemble Distillation with Negligible Overhead

Won-Jun Jang, Hyeon-Seo Park, Si-Hyeon Lee

ICML 2025posterarXiv:2502.06349
#21622

Voronoi-grid-based Pareto Front Learning and Its Application to Collaborative Federated Learning

Mengmeng Chen, Xiaohu Wu, QIQI LIU et al.

ICML 2025posterarXiv:2505.20648
#21623

Improved Coresets for Vertical Federated Learning: Regularized Linear and Logistic Regressions

Supratim Shit, Gurmehak chadha, Surendra kumar et al.

ICML 2025poster
#21624

Improved Algorithm for Deep Active Learning under Imbalance via Optimal Separation

Shyam Nuggehalli, Jifan Zhang, Lalit Jain et al.

ICML 2025posterarXiv:2312.09196
#21625

Strong and Weak Identifiability of Optimization-based Causal Discovery in Non-linear Additive Noise Models

Mingjia Li, Hong Qian, Tian-Zuo Wang et al.

ICML 2025poster
#21626

Nearly Optimal Sample Complexity for Learning with Label Proportions

Robert Busa-Fekete, Travis Dick, Claudio Gentile et al.

ICML 2025posterarXiv:2505.05355
#21627

Scalable Private Partition Selection via Adaptive Weighting

Justin Chen, Vincent Cohen-Addad, Alessandro Epasto et al.

ICML 2025posterarXiv:2502.08878
#21628

GSM-$\infty$: How Do your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length?

Yang Zhou, Hongyi Liu, Zhuoming Chen et al.

ICML 2025poster
#21629

PertEval-scFM: Benchmarking Single-Cell Foundation Models for Perturbation Effect Prediction

Aaron Wenteler, Martina Occhetta, Nikhil Branson et al.

ICML 2025poster
#21630

Perceptual-GS: Scene-adaptive Perceptual Densification for Gaussian Splatting

Hongbi ZHOU, Zhangkai NI

ICML 2025posterarXiv:2506.12400
#21631

Scaling Trends in Language Model Robustness

Nikolaus Howe, Ian McKenzie, Oskar Hollinsworth et al.

ICML 2025spotlightarXiv:2407.18213
#21632

Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances

Jie Wang, March Boedihardjo, Yao Xie

ICML 2025posterarXiv:2405.15441
#21633

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Zhenxing Mi, Kuan-Chieh Wang, Guocheng Qian et al.

ICML 2025posterarXiv:2502.10458
#21634

Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance

Lisha Chen, Quan Xiao, Ellen Fukuda et al.

ICML 2025spotlightarXiv:2504.02854
#21635

LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation

Piyush Lalitkumar Tiwary, Kinjawl Bhattacharyya, Prathosh AP

ICML 2025posterarXiv:2505.19659
#21636

Feedforward Few-shot Species Range Estimation

Christian Lange, Max Hamilton, Elijah Cole et al.

ICML 2025posterarXiv:2502.14977
#21637

BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modeling

Hao Li, Yu-Hao Huang, Chang Xu et al.

ICML 2025oralarXiv:2503.02445
#21638

A Reasoning-Based Approach to Cryptic Crossword Clue Solving

Martin Andrews, Sam Witteveen

ICML 2025posterarXiv:2506.04824
#21639

Fully Dynamic Euclidean Bi-Chromatic Matching in Sublinear Update Time

Gramoz Goranci, Peter Kiss, Neel Patel et al.

ICML 2025oralarXiv:2505.09010
#21640

Global-Local Dirichlet Processes for Clustering Grouped Data in the Presence of Group-Specific Idiosyncratic Variables

Arhit Chakrabarti, Yang Ni, Debdeep Pati et al.

ICML 2025poster
#21641

TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation

Daoyu Wang, Mingyue Cheng, Zhiding Liu et al.

ICML 2025posterarXiv:2410.05711
#21642

DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space

Mang Ning, Mingxiao Li, Jianlin Su et al.

ICML 2025posterarXiv:2412.15032
#21643

FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks

Laines Schmalwasser, Niklas Penzel, Joachim Denzler et al.

ICML 2025posterarXiv:2505.17883
#21644

Neural Encoding and Decoding at Scale

Yizi Zhang, Yanchen Wang, Mehdi Azabou et al.

ICML 2025oralarXiv:2504.08201
#21645

Wyckoff Transformer: Generation of Symmetric Crystals

Nikita Kazeev, Wei Nong, Ignat Romanov et al.

ICML 2025posterarXiv:2503.02407
#21646

Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings

Minh Hieu Nong, Antoine Ledent

ICML 2025posterarXiv:2505.04937
#21647

Sampling from Binary Quadratic Distributions via Stochastic Localization

Chenguang Wang, Kaiyuan Cui, Weichen Zhao et al.

ICML 2025posterarXiv:2505.19438
#21648

Online Detection of LLM-Generated Texts via Sequential Hypothesis Testing by Betting

Can Chen, Jun-Kun Wang

ICML 2025posterarXiv:2410.22318
#21649

Putnam-AXIOM: A Functional & Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

Aryan Gulati, Brando Miranda, Eric Chen et al.

ICML 2025poster
#21650

Avoiding spurious sharpness minimization broadens applicability of SAM

Sidak Pal Singh, Hossein Mobahi, Atish Agarwala et al.

ICML 2025posterarXiv:2502.02407
#21651

Recommendations with Sparse Comparison Data: Provably Fast Convergence for Nonconvex Matrix Factorization

Suryanarayana Sankagiri, Jalal Etesami, Matthias Grossglauser

ICML 2025posterarXiv:2502.20033
#21652

Canonical Rank Adaptation: An Efficient Fine-Tuning Strategy for Vision Transformers

Lokesh Veeramacheneni, Moritz Wolter, Hilde Kuehne et al.

ICML 2025poster
#21653

Mahalanobis++: Improving OOD Detection via Feature Normalization

Maximilian Müller, Matthias Hein

ICML 2025poster
#21654

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Sreyan Ghosh, Zhifeng Kong, Sonal Kumar et al.

ICML 2025posterarXiv:2503.03983
#21655

Rethinking Confidence Scores and Thresholds in Pseudolabeling-based SSL

Harit Vishwakarma, Yi Chen, Satya Sai Srinath Namburi GNVV et al.

ICML 2025poster
#21656

Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

Lucy Xiaoyang Shi, brian ichter, Michael Equi et al.

ICML 2025posterarXiv:2502.19417
#21657

Clipped SGD Algorithms for Performative Prediction: Tight Bounds for Stochastic Bias and Remedies

Qiang Li, Michal Yemini, Hoi To Wai

ICML 2025poster
#21658

Gradient Descent Converges Arbitrarily Fast for Logistic Regression via Large and Adaptive Stepsizes

Ruiqi Zhang, Jingfeng Wu, Peter Bartlett

ICML 2025poster
#21659

OrcaLoca: An LLM Agent Framework for Software Issue Localization

Zhongming Yu, Hejia Zhang, Yujie Zhao et al.

ICML 2025posterarXiv:2502.00350
#21660

An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN Architectures

Thibaut Boissin, Franck Mamalet, Thomas Fel et al.

ICML 2025posterarXiv:2501.07930
#21661

Scalable Model Merging with Progressive Layer-wise Distillation

Jing Xu, Jiazheng Li, Jingzhao Zhang

ICML 2025posterarXiv:2502.12706
#21662

AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models

Zheng Lian, Haoyu Chen, Lan Chen et al.

ICML 2025oralarXiv:2501.16566
#21663

Covered Forest: Fine-grained generalization analysis of graph neural networks

Antonis Vasileiou, Ben Finkelshtein, Floris Geerts et al.

ICML 2025spotlightarXiv:2412.07106
#21664

Breaking Barriers: Combinatorial Algorithms for Non-Monotone Submodular Maximization with Sublinear Adaptivity and $1/e$ Approximation

Yixin Chen, Wenjing Chen, Alan Kuhnle

ICML 2025posterarXiv:2502.07062
#21665

NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits

Tushar Aggarwal, Swayam Singh, Abhijeet Awasthi et al.

ICML 2025poster
#21666

MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention

Yucheng Li, Huiqiang Jiang, Chengruidong Zhang et al.

ICML 2025oral
#21667

A General Graph Spectral Wavelet Convolution via Chebyshev Order Decomposition

Nian Liu, Xiaoxin He, Thomas Laurent et al.

ICML 2025posterarXiv:2405.13806
#21668

Improving the Statistical Efficiency of Cross-Conformal Prediction

ICML 2025posterarXiv:2503.01495
#21669

The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes

Pedro Santos, Alberto Sardinha, Francisco S. Melo

ICML 2025spotlightarXiv:2409.15128
#21670

Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance

Shogo Iwazaki, Shion Takeno

ICML 2025oralarXiv:2502.06363
#21671

Policy Gradient with Tree Expansion

Gal Dalal, Assaf Hallak, Gugan Chandrashekhar Mallika Thoppe et al.

ICML 2025posterarXiv:2301.13236
#21672

SGD Jittering: A Training Strategy for Robust and Accurate Model-Based Architectures

Peimeng Guan, Mark Davenport

ICML 2025posterarXiv:2410.14667
#21673

Adaptive Elicitation of Latent Information Using Natural Language

Jimmy Wang, Tom Zollo, Richard Zemel et al.

ICML 2025posterarXiv:2504.04204
#21674

Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization

Michael S Yao, James Gee, Osbert Bastani

ICML 2025posterarXiv:2501.18768
#21675

Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow

Zhonglin Cao, Mario Geiger, Allan Costa et al.

ICML 2025posterarXiv:2507.09785
#21676

Survival Analysis via Density Estimation

Hiroki Yanagisawa, Shunta Akiyama

ICML 2025poster
#21677

Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning

Hongyao Chen, Tianyang Xu, Xiaojun Wu et al.

ICML 2025posterarXiv:2505.21877
#21678

Tilted Sharpness-Aware Minimization

Tian Li, Tianyi Zhou, Jeff Bilmes

ICML 2025posterarXiv:2410.22656
#21679

DynaMind: Reasoning over Abstract Video Dynamics for Embodied Decision-Making

Ziru Wang, Mengmeng Wang, Jade Dai et al.

ICML 2025oral
#21680

Solving Probabilistic Verification Problems of Neural Networks using Branch and Bound

David Boetius, Stefan Leue, Tobias Sutter

ICML 2025posterarXiv:2405.17556
#21681

Position: AI Agents Need Authenticated Delegation

Tobin South, Samuele Marro, Thomas Hardjono et al.

ICML 2025oral
#21682

Position: Certified Robustness Does Not (Yet) Imply Model Security

Andrew C. Cullen, Paul MONTAGUE, Sarah Erfani et al.

ICML 2025oralarXiv:2506.13024
#21683

Position: Political Neutrality in AI Is Impossible — But Here Is How to Approximate It

Jillian Fisher, Ruth Elisabeth Appel, Chan Young Park et al.

ICML 2025oral
#21684

Position: AI Safety Must Embrace an Antifragile Perspective

Ming Jin, Hyunin Lee

ICML 2025posterarXiv:2509.13339
#21685

Machines and Mathematical Mutations: Using GNNs to Characterize Quiver Mutation Classes

Jesse He, Helen Jenne, Herman Chau et al.

ICML 2025posterarXiv:2411.07467
#21686

Trajectory World Models for Heterogeneous Environments

Shaofeng Yin, Jialong Wu, Siqiao Huang et al.

ICML 2025posterarXiv:2502.01366
#21687

Position: Beyond Assistance – Reimagining LLMs as Ethical and Adaptive Co-Creators in Mental Health Care

Abeer Badawi, Md Tahmid Rahman Laskar, Jimmy Huang et al.

ICML 2025posterarXiv:2503.16456
#21688

Reliable Algorithm Selection for Machine Learning-Guided Design

Clara Fannjiang, Ji Won Park

ICML 2025posterarXiv:2503.20767
#21689

On Fine-Grained Distinct Element Estimation

Ilias Diakonikolas, Daniel Kane, Jasper Lee et al.

ICML 2025posterarXiv:2506.22608
#21690

Accelerating PDE-Constrained Optimization by the Derivative of Neural Operators

Ze Cheng, Zhuoyu Li, Wang Xiaoqiang et al.

ICML 2025posterarXiv:2506.13120
#21691

FSTLLM: Spatio-Temporal LLM for Few Shot Time Series Forecasting

Yue Jiang, Yile Chen, Xiucheng Li et al.

ICML 2025oral
#21692

Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning

Zhiyao Zhang, Myeung Suk Oh, Hairi et al.

ICML 2025posterarXiv:2505.18433
#21693

The Jailbreak Tax: How Useful are Your Jailbreak Outputs?

Kristina Nikolić, Luze Sun, Jie Zhang et al.

ICML 2025spotlightarXiv:2504.10694
#21694

Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds

Aya Kayal, Sattar Vakili, Laura Toni et al.

ICML 2025posterarXiv:2505.23673
#21695

High-Dimensional Prediction for Sequential Decision Making

Georgy Noarov, Ramya Ramalingam, Aaron Roth et al.

ICML 2025oralarXiv:2310.17651
#21696

David and Goliath: Small One-step Model Beats Large Diffusion with Score Post-training

Weijian Luo, colin zhang, Debing Zhang et al.

ICML 2025posterarXiv:2410.20898
#21697

A Reduction Framework for Distributionally Robust Reinforcement Learning under Average Reward

Zachary Roch, George Atia, Yue Wang

ICML 2025poster
#21698

Distributionally Robust Active Learning for Gaussian Process Regression

Shion Takeno, Yoshito Okura, Yu Inatsu et al.

ICML 2025posterarXiv:2502.16870
#21699

Representative Language Generation

Charlotte Peale, Vinod Raman, Omer Reingold

ICML 2025posterarXiv:2505.21819
#21700

Detecting Strategic Deception with Linear Probes

Nicholas Goldowsky-Dill, Bilal Chughtai, Stefan Heimersheim et al.

ICML 2025posterarXiv:2502.03407
#21701

LOB-Bench: Benchmarking Generative AI for Finance - an Application to Limit Order Book Data

Peer Nagy, Sascha Frey, Kang Li et al.

ICML 2025posterarXiv:2502.09172
#21702

Goal-Oriented Skill Abstraction for Offline Multi-Task Reinforcement Learning

Jinmin He, Kai Li, Yifan Zang et al.

ICML 2025posterarXiv:2507.06628
#21703

Hypo3D: Exploring Hypothetical Reasoning in 3D

Ye Mao, Weixun Luo, Junpeng Jing et al.

ICML 2025posterarXiv:2502.00954
#21704

Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction

Yiting He, Zhishuai Liu, Weixin Wang et al.

ICML 2025posterarXiv:2511.05396
#21705

Unifews: You Need Fewer Operations for Efficient Graph Neural Networks

Ningyi Liao, Zihao Yu, Ruixiao Zeng et al.

ICML 2025posterarXiv:2403.13268
#21706

Zero-Shot Offline Imitation Learning via Optimal Transport

Thomas Rupf, Marco Bagatella, Nico Gürtler et al.

ICML 2025posterarXiv:2410.08751
#21707

Anytime-Constrained Equilibria in Polynomial Time

Jeremy McMahan

ICML 2025posterarXiv:2410.23637
#21708

EPIC: Efficient Position-Independent Caching for Serving Large Language Models

JUNHAO HU, Wenrui Huang, Weidong Wang et al.

ICML 2025posterarXiv:2410.15332
#21709

Multivariate Conformal Selection

Tian Bai, Yue Zhao, Xiang Yu et al.

ICML 2025posterarXiv:2505.00917
#21710

PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative APIs

Jianqing Zhang, Yang Liu, Jie Fu et al.

ICML 2025spotlightarXiv:2506.05407
#21711

Direct Motion Models for Assessing Generated Videos

Kelsey Allen, Carl Doersch, Guangyao Zhou et al.

ICML 2025oralarXiv:2505.00209
#21712

KernelBench: Can LLMs Write Efficient GPU Kernels?

Anne Ouyang, Simon Guo, Simran Arora et al.

ICML 2025posterarXiv:2502.10517
#21713

ADIOS: Antibody Development via Opponent Shaping

Sebastian Towers, Aleksandra Kalisz, Philippe Robert et al.

ICML 2025posterarXiv:2409.10588
#21714

Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models

Yinhong Liu, Zhijiang Guo, Tianya Liang et al.

ICML 2025spotlightarXiv:2410.02205
#21715

Attention-Level Speculation

Jack Cai, Ammar Vora, Randolph Zhang et al.

ICML 2025poster
#21716

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025posterarXiv:2506.03470
#21717

Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing

Zhuoying Li, Zhu Xu, Yuxin Peng et al.

ICML 2025posterarXiv:2506.13827
#21718

On the Similarities of Embeddings in Contrastive Learning

Chungpa Lee, Sehee Lim, Kibok Lee et al.

ICML 2025posterarXiv:2506.09781
#21719

Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity

Zhenglin Wan, Xingrui Yu, David Bossens et al.

ICML 2025oral
#21720

High-Fidelity Simultaneous Speech-To-Speech Translation

Tom Labiausse, Laurent Mazaré, Edouard Grave et al.

ICML 2025posterarXiv:2502.03382
#21721

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning

Angel Villar-Corrales, Sven Behnke

ICML 2025posterarXiv:2502.07600
#21722

World Model Implanting for Test-time Adaptation of Embodied Agents

Minjong Yoo, Jinwoo Jang, Sihyung Yoon et al.

ICML 2025posterarXiv:2509.03956
#21723

M³HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

Ziyan Wang, Zhicheng Zhang, Fei Fang et al.

ICML 2025poster
#21724

Position: General Intelligence Requires Reward-based Pretraining

Seungwook Han, Jyothish Pari, Samuel Gershman et al.

ICML 2025spotlight
#21725

Position: Spectral GNNs Rely Less on Graph Fourier Basis than Conceived

Yuhe Guo, Huayi Tang, Jiahong Ma et al.

ICML 2025poster
#21726

Position: Constants are Critical in Regret Bounds for Reinforcement Learning

Simone Drago, Marco Mussi, Alberto Maria Metelli

ICML 2025poster
#21727

Continuous Bayesian Model Selection for Multivariate Causal Discovery

Anish Dhir, Ruby Sedgwick, Avinash Kori et al.

ICML 2025posterarXiv:2411.10154
#21728

Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov et al.

ICML 2025posterarXiv:2503.06337
#21729

Visual Attention Never Fades: Selective Progressive Attention ReCalibration for Detailed Image Captioning in Multimodal Large Language Models

Mingi Jung, Saehyung Lee, Eunji Kim et al.

ICML 2025posterarXiv:2502.01419
#21730

Text-to-LoRA: Instant Transformer Adaption

Rujikorn Charakorn, Edoardo Cetin, Yujin Tang et al.

ICML 2025posterarXiv:2506.06105
#21731

On the Tension between Byzantine Robustness and No-Attack Accuracy in Distributed Learning

Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li

ICML 2025spotlight
#21732

Can Large Language Models Understand Intermediate Representations in Compilers?

Hailong Jiang, Jianfeng Zhu, Yao Wan et al.

ICML 2025posterarXiv:2502.06854
#21733

Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation

Mingyu Kang, Yong Suk Choi

ICML 2025oralarXiv:2509.25776
#21734

Learnware Specification via Dual Alignment

Wei Chen, Jun-Xiang Mao, Xiaozheng Wang et al.

ICML 2025poster
#21735

On the Importance of Gaussianizing Representations

Daniel Eftekhari, Vardan Papyan

ICML 2025posterarXiv:2505.00685
#21736

CodeSync: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Chenlong Wang, Zhaoyang Chu, Zhengxiang Cheng et al.

ICML 2025posterarXiv:2502.16645
#21737

Adversarial Inception Backdoor Attacks against Reinforcement Learning

Ethan Rathbun, Alina Oprea, Christopher Amato

ICML 2025posterarXiv:2410.13995
#21738

Gradient Flow Provably Learns Robust Classifiers for Orthonormal GMMs

Hancheng Min, Rene Vidal

ICML 2025poster
#21739

The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training

Matteo Saponati, Pascal J. Sager, Pau Vilimelis Aceituno et al.

ICML 2025posterarXiv:2502.10927
#21740

Novelty Detection in Reinforcement Learning with World Models

Geigh Zollicoffer, Kenneth Eaton, Jonathan Balloch et al.

ICML 2025spotlightarXiv:2310.08731
#21741

Relational Invariant Learning for Robust Solvation Free Energy Prediction

Yeyun Chen

ICML 2025spotlight
#21742

Unconstrained Robust Online Convex Optimization

Jiujia Zhang, Ashok Cutkosky

ICML 2025posterarXiv:2506.12781
#21743

The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions

Wenbo Pan, Zhichao Liu, Qiguang Chen et al.

ICML 2025posterarXiv:2502.09674
#21744

Outlier-Aware Post-Training Quantization for Discrete Graph Diffusion Models

Zheng Gong, Ying Sun

ICML 2025poster
#21745

ReverB-SNN: Reversing Bit of the Weight and Activation for Spiking Neural Networks

Yufei Guo, Yuhan Zhang, Zhou Jie et al.

ICML 2025posterarXiv:2506.07720
#21746

Scaling Laws for Forgetting during Finetuning with Pretraining Data Injection

Louis Béthune, David Grangier, Dan Busbridge et al.

ICML 2025poster
#21747

Relative Error Fair Clustering in the Weak-Strong Oracle Model

Vladimir Braverman, Prathamesh Dharangutte, Shaofeng Jiang et al.

ICML 2025posterarXiv:2506.12287
#21748

Doubly Protected Estimation for Survival Outcomes Utilizing External Controls for Randomized Clinical Trials

Chenyin Gao, Shu Yang, Mingyang Shan et al.

ICML 2025posterarXiv:2410.18409
#21749

Outlier Gradient Analysis: Efficiently Identifying Detrimental Training Samples for Deep Learning Models

Anshuman Chhabra, Bo Li, Jian Chen et al.

ICML 2025oralarXiv:2405.03869
#21750

BaWA: Automatic Optimizing Pruning Metric for Large Language Models with Balanced Weight and Activation

Lian Liu, Xiandong Zhao, Guanchen Li et al.

ICML 2025poster
#21751

Geometric Contact Flows: Contactomorphisms for Dynamics and Control

Andrea Testa, Søren Hauberg, Tamim Asfour et al.

ICML 2025posterarXiv:2506.17868
#21752

Neural Genetic Search in Discrete Spaces

Hyeonah Kim, Sanghyeok Choi, Jiwoo Son et al.

ICML 2025posterarXiv:2502.10433
#21753

Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Haocheng Xi, Shuo Yang, Yilong Zhao et al.

ICML 2025oral
#21754

PieClam: A Universal Graph Autoencoder Based on Overlapping Inclusive and Exclusive Communities

Daniel Zilberg, Ron Levie

ICML 2025posterarXiv:2409.11618
#21755

Efficient Source-free Unlearning via Energy-Guided Data Synthesis and Discrimination-Aware Multitask Optimization

Xiuyuan Wang, Chaochao Chen, Weiming Liu et al.

ICML 2025spotlight
#21756

Ad Hoc Teamwork via Offline Goal-Based Decision Transformers

Xinzhi Zhang, Hoehi Chan, Deheng Ye et al.

ICML 2025poster
#21757

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

Pranjal Aggarwal, Bryan Parno, Sean Welleck

ICML 2025posterarXiv:2412.06176
#21758

The Ripple Effect: On Unforeseen Complications of Backdoor Attacks

Rui Zhang, Yun Shen, Hongwei Li et al.

ICML 2025posterarXiv:2505.11586
#21759

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

Swetha Ganesh, Washim Mondal, Vaneet Aggarwal

ICML 2025posterarXiv:2407.18878
#21760

Contrastive Localized Language-Image Pre-Training

Hong-You Chen, Zhengfeng Lai, Haotian Zhang et al.

ICML 2025posterarXiv:2410.02746
#21761

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICML 2025posterarXiv:2502.16075
#21762

Improved and Oracle-Efficient Online $\ell_1$-Multicalibration

Rohan Ghuge, Vidya Muthukumar, Sahil Singla

ICML 2025posterarXiv:2505.17365
#21763

One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation

Jianze Li, Jiezhang Cao, Yong Guo et al.

ICML 2025posterarXiv:2502.01993
#21764

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025oralarXiv:2501.19328
#21765

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift

chao ying, Jun Jin, Yi Guo et al.

ICML 2025posterarXiv:2505.22632
#21766

A Sample Efficient Conditional Independence Test in the Presence of Discretization

Boyang Sun, Yu Yao, Xinshuai Dong et al.

ICML 2025posterarXiv:2506.08747
#21767

"Why Is There a Tumor?": Tell Me the Reason, Show Me the Evidence

Mengmeng Ma, Tang Li, Yunxiang Peng et al.

ICML 2025poster
#21768

Diverging Preferences: When do Annotators Disagree and do Models Know?

Michael Zhang, Zhilin Wang, Jena Hwang et al.

ICML 2025posterarXiv:2410.14632
#21769

Self-cross Feature based Spiking Neural Networks for Efficient Few-shot Learning

Qi Xu, Junyang Zhu, Dongdong Zhou et al.

ICML 2025oralarXiv:2505.07921
#21770

Active Treatment Effect Estimation via Limited Samples

Zhiheng Zhang, Haoxiang Wang, Haoxuan Li et al.

ICML 2025poster
#21771

Random Policy Evaluation Uncovers Policies of Generative Flow Networks

Haoran He, Emmanuel Bengio, Qingpeng Cai et al.

ICML 2025posterarXiv:2406.02213
#21772

Generalized Category Discovery via Reciprocal Learning and Class-Wise Distribution Regularization

Duo Liu, Zhiquan Tan, Linglan Zhao et al.

ICML 2025posterarXiv:2506.02334
#21773

Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations

Kexuan Shi, Hai Chen, Leheng Zhang et al.

ICML 2025posterarXiv:2410.13271
#21774

Learning Efficient Robotic Garment Manipulation with Standardization

zhou changshi, Feng Luan, hujiarui et al.

ICML 2025posterarXiv:2506.22769
#21775

Efficient Heterogeneity-Aware Federated Active Data Selection

Yingpeng Tang, Chao Ren, Xiaoli Tang et al.

ICML 2025poster
#21776

Preconditioned Riemannian Gradient Descent Algorithm for Low-Multilinear-Rank Tensor Completion

Yuanwei Zhang, Fengmiao Bian, Xiaoqun Zhang et al.

ICML 2025poster
#21777

Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs

William English, Dominic Simon, Sumit Jha et al.

ICML 2025oralarXiv:2512.16814
#21778

Equivariant Neural Tangent Kernels

Philipp Misof, Pan Kessel, Jan Gerken

ICML 2025posterarXiv:2406.06504
#21779

Empowering World Models with Reflection for Embodied Video Prediction

Xiaowei Chi, Chun-Kai Fan, Hengyuan Zhang et al.

ICML 2025poster
#21780

LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Yicheng Xiao, Lin Song, Rui Yang et al.

ICML 2025posterarXiv:2506.11638
#21781

Protein Structure Tokenization: Benchmarking and New Recipe

Xinyu Yuan, Zichen Wang, Marcus Collins et al.

ICML 2025posterarXiv:2503.00089
#21782

How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects

Wonkwang Lee, Jongwon Jeong, Taehong Moon et al.

ICML 2025posterarXiv:2503.04257
#21783

Tensorized Multi-View Multi-Label Classification via Laplace Tensor Rank

Qiyu Zhong, Yi Shan, Haobo Wang et al.

ICML 2025poster
#21784

Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding

Tian Jin, Ellie Cheng, Zachary Ankner et al.

ICML 2025posterarXiv:2502.11517
#21785

Learning Multi-Level Features with Matryoshka Sparse Autoencoders

Bart Bussmann, Noa Nabeshima, Adam Karvonen et al.

ICML 2025posterarXiv:2503.17547
#21786

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Roman Abramov, Felix Steinbauer, Gjergji Kasneci

ICML 2025posterarXiv:2504.20752
#21787

Code-Generated Graph Representations Using Multiple LLM Agents for Material Properties Prediction

Jiao Huang, Qianli Xing, Jinglong Ji et al.

ICML 2025poster
#21788

FeatSharp: Your Vision Model Features, Sharper

Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.

ICML 2025posterarXiv:2502.16025
#21789

Private Lossless Multiple Release

Joel Daniel Andersson, Lukas Retschmeier, Boel Nelson et al.

ICML 2025posterarXiv:2505.22449
#21790

Disentangling and Integrating Relational and Sensory Information in Transformer Architectures

Awni Altabaa, John Lafferty

ICML 2025posterarXiv:2405.16727
#21791

Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning

Zeyu Gan, Yun Liao, Yong Liu

ICML 2025posterarXiv:2501.15602
#21792

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Samira Abnar, Harshay Shah, Dan Busbridge et al.

ICML 2025posterarXiv:2501.12370
#21793

A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents

Kaiwen Wang, Dawen Liang, Nathan Kallus et al.

ICML 2025posterarXiv:2403.06323
#21794

LIMEFLDL: A Local Interpretable Model-Agnostic Explanations Approach for Label Distribution Learning

Xiuyi Jia, Jinchi Li, Yunan Lu et al.

ICML 2025poster
#21795

Hardware and Software Platform Inference

Cheng Zhang, Hanna Foerster, Robert Mullins et al.

ICML 2025posterarXiv:2411.05197
#21796

Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness

Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.

ICML 2025oralarXiv:2502.08532
#21797

What Limits Bidirectional Model's Generative Capabilities? A Uni-Bi-Directional Mixture-of-Expert Method For Bidirectional Fine-tuning

Zuchao Li, Yonghua Hei, Qiwei Li et al.

ICML 2025poster
#21798

How to Evaluate and Mitigate IP Infringement in Visual Generative AI?

Zhenting Wang, Chen Chen, Vikash Sehwag et al.

ICML 2025poster
#21799

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Zhuoran Zhang, Yongxiang Li, Zijian Kan et al.

ICML 2025posterarXiv:2410.06331
#21800

On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization

Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.

ICML 2025posterarXiv:2506.05945