Most Cited ICML "reinforcement learning variance" Papers

5,975 papers found • Page 9 of 30

#1601

Interpreting the Repeated Token Phenomenon in Large Language Models

Itay Yona, Ilia Shumailov, Jamie Hayes et al.

ICML 2025 • arXiv:2503.08908
10 citations
#1602

Self-Composing Policies for Scalable Continual Reinforcement Learning

Mikel Malagón, Josu Ceberio, Jose A Lozano

ICML 2024 • arXiv:2506.14811
10 citations
#1603

Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments

Jonas Schweisthal, Dennis Frauen, M van der Schaar et al.

ICML 2024 • arXiv:2406.02464
10 citations
#1604

On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding

Kevin Xu, Issei Sato

ICML 2025 • arXiv:2410.01405
10 citations
#1605

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024 • arXiv:2311.17105
10 citations
#1606

A Closer Look at Multimodal Representation Collapse

Abhra Chaudhuri, Anjan Dutta, Tu Bui et al.

ICML 2025 (spotlight) • arXiv:2505.22483
10 citations
#1607

A Mixture-Based Framework for Guiding Diffusion Models

Yazid Janati, Badr Moufad, Mehdi Qassime et al.

ICML 2025 • arXiv:2502.03332
10 citations
#1608

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

Xiaoyu Wen, Chenjia Bai, Kang Xu et al.

ICML 2024 • arXiv:2405.06192
10 citations
#1609

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan Liu, Shibo Chen et al.

ICML 2024 • arXiv:2405.18110
10 citations
#1610

SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels

Malte Mosbach, Jan Ewertz, Angel Villar-Corrales et al.

ICML 2025 • arXiv:2410.08822
10 citations
#1611

R.I.P.: Better Models by Survival of the Fittest Prompts

Ping Yu, Weizhe Yuan, Olga Golovneva et al.

ICML 2025 • arXiv:2501.18578
10 citations
#1612

PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics Modeling

Phong Nguyen, Xinlun Cheng, Shahab Azarfar et al.

ICML 2024 (oral) • arXiv:2402.12503
10 citations
#1613

RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression

Payman Behnam, Yaosheng Fu, Ritchie Zhao et al.

ICML 2025 • arXiv:2502.14051
10 citations
#1614

Libra: Building Decoupled Vision System on Large Language Models

Yifan Xu, Xiaoshan Yang, Yaguang Song et al.

ICML 2024 • arXiv:2405.10140
10 citations
#1615

Synthesizing Software Engineering Data in a Test-Driven Manner

Lei Zhang, Jiaxi Yang, Min Yang et al.

ICML 2025 • arXiv:2506.09003
10 citations
#1616

Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation

Zhuohao Yu, Weizheng Gu, Yidong Wang et al.

ICML 2025 • arXiv:2412.15118
10 citations
#1617

Sample as you Infer: Predictive Coding with Langevin Dynamics

Umais Zahid, Qinghai Guo, Zafeirios Fountas

ICML 2024 • arXiv:2311.13664
10 citations
#1618

Oscillation-Reduced MXFP4 Training for Vision Transformers

Yuxiang Chen, Haocheng Xi, Jun Zhu et al.

ICML 2025 • arXiv:2502.20853
10 citations
#1619

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Xixi Wu, Yifei Shen, Fangzhou Ge et al.

ICML 2025 • arXiv:2502.00829
10 citations
#1620

Fundamental limits of learning in sequence multi-index models and deep attention networks: high-dimensional asymptotics and sharp thresholds

Emanuele Troiani, Hugo Cui, Yatin Dandi et al.

ICML 2025 • arXiv:2502.00901
10 citations
#1621

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

ICML 2024 • arXiv:2405.08540
10 citations
#1622

Constrain Alignment with Sparse Autoencoders

Qingyu Yin, Chak Tou Leong, Hongbo Zhang et al.

ICML 2025 • arXiv:2411.07618
10 citations
#1623

Temperature-Annealed Boltzmann Generators

Henrik Schopmans, Pascal Friederich

ICML 2025 • arXiv:2501.19077
10 citations
#1624

Solving Poisson Equations using Neural Walk-on-Spheres

Hong Chul Nam, Julius Berner, Anima Anandkumar

ICML 2024 • arXiv:2406.03494
10 citations
#1625

How Much Can Transfer? BRIDGE: Bounded Multi-Domain Graph Foundation Model with Generalization Guarantees

Haonan Yuan, Qingyun Sun, Junhua Shi et al.

ICML 2025
10 citations
#1626

Discovering Mixtures of Structural Causal Models from Time Series Data

Sumanth Varambally, Yian Ma, Rose Yu

ICML 2024 • arXiv:2310.06312
10 citations
#1627

GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs

Yue Wang, Qizhou Wang, Feng Liu et al.

ICML 2025 • arXiv:2503.09117
10 citations
#1628

CuTS: Customizable Tabular Synthetic Data Generation

Mark Vero, Mislav Balunovic, Martin Vechev

ICML 2024 • arXiv:2307.03577
10 citations
#1629

SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning

Jinpeng Chen, Runmin Cong, Yuzhi Zhao et al.

ICML 2025 • arXiv:2505.02486
10 citations
#1630

Stereographic Spherical Sliced Wasserstein Distances

Huy Tran, Yikun Bai, Abihith Kothapalli et al.

ICML 2024 (spotlight) • arXiv:2402.02345
10 citations
#1631

Hybrid$^2$ Neural ODE Causal Modeling and an Application to Glycemic Response

Junyi Zou, Matthew Levine, Dessi Zaharieva et al.

ICML 2024 • arXiv:2402.17233
10 citations
#1632

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences

Zicheng Liu, Siyuan Li, Li Wang et al.

ICML 2024 • arXiv:2406.08128
10 citations
#1633

A Distributional Analogue to the Successor Representation

Harley Wiltzer, Jesse Farebrother, Arthur Gretton et al.

ICML 2024 (spotlight) • arXiv:2402.08530
10 citations
#1634

Improving Transformer World Models for Data-Efficient RL

Antoine Dedieu, Joseph Ortiz, Xinghua Lou et al.

ICML 2025 • arXiv:2502.01591
10 citations
#1635

Unifying 2D and 3D Vision-Language Understanding

Ayush Jain, Alexander Swerdlow, Yuzhou Wang et al.

ICML 2025 • arXiv:2503.10745
10 citations
#1636

Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik

ICML 2024 • arXiv:2405.18217
10 citations
#1637

Fair Off-Policy Learning from Observational Data

Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel

ICML 2024 (oral) • arXiv:2303.08516
10 citations
#1638

OTMatch: Improving Semi-Supervised Learning with Optimal Transport

Zhiquan Tan, Kaipeng Zheng, Weiran Huang

ICML 2024 • arXiv:2310.17455
10 citations
#1639

More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms

Hossein Zakerinia, Amin Behjati, Christoph Lampert

ICML 2024 • arXiv:2402.04054
10 citations
#1640

Simple Policy Optimization

Zhengpeng Xie, Qiang Zhang, Fan Yang et al.

ICML 2025 • arXiv:2401.16025
10 citations
#1641

Quantum Positional Encodings for Graph Neural Networks

Slimane Thabet, Mehdi Djellabi, Igor Sokolov et al.

ICML 2024 • arXiv:2406.06547
10 citations
#1642

Exploring the Low-Pass Filtering Behavior in Image Super-Resolution

Haoyu Deng, Zijing Xu, Yule Duan et al.

ICML 2024 • arXiv:2405.07919
10 citations
#1643

CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection

Lin Zhu, Yifeng Yang, Qinying Gu et al.

ICML 2024 • arXiv:2405.16417
10 citations
#1644

Translation Equivariant Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.

ICML 2024 (oral) • arXiv:2406.12409
10 citations
#1645

MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking

Sebastian Farquhar, Vikrant Varma, David Lindner et al.

ICML 2025 • arXiv:2501.13011
10 citations
#1646

Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence

Yinbin Han, Meisam Razaviyayn, Renyuan Xu

ICML 2025 • arXiv:2412.18164
10 citations
#1647

Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Riccardo De Santi, Manish Prajapat, Andreas Krause

ICML 2024 • arXiv:2407.09905
10 citations
#1648

Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment

Yifan Zhang, Ge Zhang, Yue Wu et al.

ICML 2025 • arXiv:2410.02197
10 citations
#1649

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning

Angel Villar-Corrales, Sven Behnke

ICML 2025 • arXiv:2502.07600
10 citations
#1650

Multi-Session Budget Optimization for Forward Auction-based Federated Learning

Xiaoli Tang, Han Yu, Zengxiang Li et al.

ICML 2025 • arXiv:2311.12548
10 citations
#1651

How Transformers Learn Structured Data: Insights From Hierarchical Filtering

Jerome Garnier-Brun, Marc Mezard, Emanuele Moscato et al.

ICML 2025 • arXiv:2408.15138
10 citations
#1652

DAMA: Data- and Model-aware Alignment of Multi-modal LLMs

Jinda Lu, Junkang Wu, Jinghan Li et al.

ICML 2025 • arXiv:2502.01943
10 citations
#1653

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025 • arXiv:2405.17527
10 citations
#1654

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

Zhenqiao Song, Yunlong Zhao, Wenxian Shi et al.

ICML 2024 • arXiv:2405.08205
10 citations
#1655

A2Q+: Improving Accumulator-Aware Weight Quantization

Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.

ICML 2024 • arXiv:2401.10432
10 citations
#1656

Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions

Jingtan Wang, Xiaoqiang Lin, Rui Qiao et al.

ICML 2024 • arXiv:2406.04606
10 citations
#1657

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks

Khurram Javed, Haseeb Shah, Richard Sutton et al.

ICML 2024 • arXiv:2302.05326
10 citations
#1658

HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration

Yushi Huang, Zining Wang, Ruihao Gong et al.

ICML 2025 • arXiv:2410.01723
10 citations
#1659

Tool Unlearning for Tool-Augmented LLMs

Jiali Cheng, Hadi Amiri

ICML 2025 • arXiv:2502.01083
10 citations
#1660

Creative Text-to-Audio Generation via Synthesizer Programming

Manuel Cherep, Nikhil Singh, Jessica Shand

ICML 2024 • arXiv:2406.00294
10 citations
#1661

Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces

Anjiang Wei, Allen Nie, Thiago Teixeira et al.

ICML 2025 • arXiv:2410.15625
10 citations
#1662

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

Zelei Cheng, Xian Wu, Jiahao Yu et al.

ICML 2024 (spotlight) • arXiv:2405.03064
10 citations
#1663

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis

Xu Wang, Yan Hu, Wenyu Du et al.

ICML 2025 • arXiv:2502.11812
10 citations
#1664

On Measuring Long-Range Interactions in Graph Neural Networks

Jacob Bamberger, Benjamin Gutteridge, Scott le Roux et al.

ICML 2025 • arXiv:2506.05971
10 citations
#1665

Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs

Shenzhi Yang, Bin Liang, An Liu et al.

ICML 2024 • arXiv:2504.13429
10 citations
#1666

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.

ICML 2024 • arXiv:2404.00924
10 citations
#1667

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

Wenzhe Li, Zihan Ding, Seth Karten et al.

ICML 2024 • arXiv:2406.02081
10 citations
#1668

Reducing sequential change detection to sequential estimation

Shubhanshu Shekhar, Aaditya Ramdas

ICML 2024 • arXiv:2309.09111
10 citations
#1669

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Honghao Chen, Zhang Yurong, Xiaokun Feng et al.

ICML 2024 • arXiv:2407.08972
10 citations
#1670

Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Martina G. Vilas, Federico Adolfi, David Poeppel et al.

ICML 2024 • arXiv:2406.01352
10 citations
#1671

ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation

Angxiao Yue, Zichong Wang, Hongteng Xu

ICML 2025 • arXiv:2502.14637
10 citations
#1672

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Ruijie Zheng, Yongyuan Liang, Xiyao Wang et al.

ICML 2024 (oral) • arXiv:2402.06187
10 citations
#1673

Objective drives the consistency of representational similarity across datasets

Laure Ciernik, Lorenz Linhardt, Marco Morik et al.

ICML 2025 • arXiv:2411.05561
10 citations
#1674

Tandem Transformers for Inference Efficient LLMs

Aishwarya P S, Pranav Nair, Yashas Samaga et al.

ICML 2024 • arXiv:2402.08644
10 citations
#1675

Improved Off-policy Reinforcement Learning in Biological Sequence Design

Hyeonah Kim, Minsu Kim, Taeyoung Yun et al.

ICML 2025 • arXiv:2410.04461
10 citations
#1676

Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often!

Milad Sefidgaran, Romain Chor, Abdellatif Zaidi et al.

ICML 2024 • arXiv:2306.05862
10 citations
#1677

Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective

Soo Yong Lee, Sunwoo Kim, Fanchen Bu et al.

ICML 2024 • arXiv:2402.04621
10 citations
#1678

Efficient Pareto Manifold Learning with Low-Rank Structure

Weiyu Chen, James Kwok

ICML 2024 (spotlight) • arXiv:2407.20734
10 citations
#1679

Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference

Marvin Schmitt, Desi Ivanova, Daniel Habermann et al.

ICML 2024 • arXiv:2310.04395
10 citations
#1680

Compressed Image Generation with Denoising Diffusion Codebook Models

Guy Ohayon, Hila Manor, Tomer Michaeli et al.

ICML 2025 • arXiv:2502.01189
10 citations
#1681

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025 • arXiv:2504.05304
10 citations
#1682

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang, Aaron Roth, Linjun Zhang

ICML 2024 • arXiv:2405.02225
10 citations
#1683

Safety Reasoning with Guidelines

Haoyu Wang, Zeyu Qin, Li Shen et al.

ICML 2025 • arXiv:2502.04040
10 citations
#1684

One-Shot Strategic Classification Under Unknown Costs

Elan Rosenfeld, Nir Rosenfeld

ICML 2024 • arXiv:2311.02761
10 citations
#1685

Mean Estimation in the Add-Remove Model of Differential Privacy

Alex Kulesza, Ananda Suresh, Yuyan Wang

ICML 2024 • arXiv:2312.06658
10 citations
#1686

Estimating Barycenters of Distributions with Neural Optimal Transport

Alexander Kolesov, Petr Mokrov, Igor Udovichenko et al.

ICML 2024 • arXiv:2402.03828
10 citations
#1687

AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion

Adeesh Kolluru, John Kitchin

ICML 2024 • arXiv:2405.03962
10 citations
#1688

Nonparametric Teaching of Implicit Neural Representations

Chen Zhang, Steven T. S. Luo, Jason Chun Lok Li et al.

ICML 2024 • arXiv:2405.10531
10 citations
#1689

Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems

Ta Duy Nguyen, Alina Ene

ICML 2024 • arXiv:2405.18809
10 citations
#1690

MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Ruida Wang, Rui Pan, Yuxin Li et al.

ICML 2025 • arXiv:2503.03205
10 citations
#1691

Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses

Changyu Gao, Andrew Lowy, Xingyu Zhou et al.

ICML 2024 • arXiv:2407.09690
10 citations
#1692

How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers

Gon Buzaglo, Itamar Harel, Mor Shpigel Nacson et al.

ICML 2024 (spotlight) • arXiv:2402.06323
10 citations
#1693

Weisfeiler Leman for Euclidean Equivariant Machine Learning

Snir Hordan, Tal Amir, Nadav Dym

ICML 2024 • arXiv:2402.02484
10 citations
#1694

Zebra: In-Context Generative Pretraining for Solving Parametric PDEs

Louis Serrano, Armand Kassaï Koupaï, Thomas Wang et al.

ICML 2025 • arXiv:2410.03437
10 citations
#1695

The Role of Learning Algorithms in Collective Action

Omri Ben-Dov, Jake Fawkes, Samira Samadi et al.

ICML 2024 • arXiv:2405.06582
10 citations
#1696

Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning

Lirong Wu, Yijun Tian, Haitao Lin et al.

ICML 2024 • arXiv:2405.10348
10 citations
#1697

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Zi-Hao Qiu, Siqi Guo, Mao Xu et al.

ICML 2024 • arXiv:2404.04575
10 citations
#1698

Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

Yudan Wang, Yue Wang, Yi Zhou et al.

ICML 2024 (oral) • arXiv:2406.01762
10 citations
#1699

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Trung Pham, Kang Zhang, Chang Yoo

ICML 2024 • arXiv:2402.01516
10 citations
#1700

Latent Logic Tree Extraction for Event Sequence Explanation from LLMs

Zitao Song, Chao Yang, Chaojie Wang et al.

ICML 2024 (oral) • arXiv:2406.01124
10 citations
#1701

Beyond the Calibration Point: Mechanism Comparison in Differential Privacy

Georgios Kaissis, Stefan Kolek, Borja de Balle Pigem et al.

ICML 2024 • arXiv:2406.08918
10 citations
#1702

Towards Understanding Inductive Bias in Transformers: A View From Infinity

Itay Lavie, Guy Gur-Ari, Zohar Ringel

ICML 2024 • arXiv:2402.05173
10 citations
#1703

Scaling Large Motion Models with Million-Level Human Motions

Ye Wang, Sipeng Zheng, Bin Cao et al.

ICML 2025 • arXiv:2410.03311
10 citations
#1704

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.

ICML 2024 • arXiv:2405.16646
10 citations
#1705

QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache

Rishabh Tiwari, Haocheng Xi, Aditya Tomar et al.

ICML 2025 • arXiv:2502.10424
10 citations
#1706

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Hengyi Wang, Shiwei Tan, Hao Wang

ICML 2024 • arXiv:2406.12649
9 citations
#1707

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025 • arXiv:2402.05806
9 citations
#1708

Scalable Online Exploration via Coverability

Philip Amortila, Dylan Foster, Akshay Krishnamurthy

ICML 2024 • arXiv:2403.06571
9 citations
#1709

An Analysis for Reasoning Bias of Language Models with Small Initialization

Junjie Yao, Zhongwang Zhang, Zhi-Qin John Xu

ICML 2025 (spotlight) • arXiv:2502.04375
9 citations
#1710

Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems

Taejin Park, Ivan Medennikov, Kunal Dhawan et al.

ICML 2025 • arXiv:2409.06656
9 citations
#1711

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Yuchen Zeng, Tuan Dinh, Wonjun Kang et al.

ICML 2025 (spotlight) • arXiv:2506.05584
9 citations
#1712

Position: Stop Making Unscientific AGI Performance Claims

Patrick Altmeyer, Andrew Demetriou, Antony Bartlett et al.

ICML 2024 • arXiv:2402.03962
9 citations
#1713

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Hao Li, Qi Lv, Rui Shao et al.

ICML 2025 (spotlight) • arXiv:2506.03863
9 citations
#1714

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

Zhuoling Li, Xiaogang Xu, Zhenhua Xu et al.

ICML 2025 • arXiv:2405.17424
9 citations
#1715

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025 • arXiv:2502.07587
9 citations
#1716

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Haoran You, Yichao Fu, Zheng Wang et al.

ICML 2024 • arXiv:2406.07368
9 citations
#1717

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025 (spotlight) • arXiv:2407.11784
9 citations
#1718

The Elicitation Game: Evaluating Capability Elicitation Techniques

Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.

ICML 2025 • arXiv:2502.02180
9 citations
#1719

Adaptively Perturbed Mirror Descent for Learning in Games

Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto et al.

ICML 2024 • arXiv:2305.16610
9 citations
#1720

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations

Kailas Vodrahalli, James Zou

ICML 2024 • arXiv:2306.08141
9 citations
#1721

AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

Yucen Wang, Shenghua Wan, Le Gan et al.

ICML 2024 • arXiv:2403.09976
9 citations
#1722

Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training

Jinxia Yang, Bing Su, Xin Zhao et al.

ICML 2024 (oral) • arXiv:2405.19654
9 citations
#1723

An Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

Emre Sahinoglu, Shahin Shahrampour

ICML 2024 • arXiv:2406.01484
9 citations
#1724

Synthesizing Privacy-Preserving Text Data via Finetuning *without* Finetuning Billion-Scale LLMs

Bowen Tan, Zheng Xu, Eric Xing et al.

ICML 2025 • arXiv:2503.12347
9 citations
#1725

Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty

Yeseul Cho, Baekrok Shin, Changmin Kang et al.

ICML 2025 • arXiv:2502.06905
9 citations
#1726

LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)

Junsu Kim, Jaeyeon Kim, Ernest Ryu

ICML 2025 (oral) • arXiv:2502.09376
9 citations
#1727

Leveraging VLM-Based Pipelines to Annotate 3D Objects

Rishabh Kabra, Loic Matthey, Alexander Lerchner et al.

ICML 2024 • arXiv:2311.17851
9 citations
#1728

Textual Unlearning Gives a False Sense of Unlearning

Jiacheng Du, Zhibo Wang, Jie Zhang et al.

ICML 2025 • arXiv:2406.13348
9 citations
#1729

A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions

Sharath Raghvendra, Pouyan Shirzadian, Kaiyi Zhang

ICML 2024 • arXiv:2405.03664
9 citations
#1730

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025 • arXiv:2502.08991
9 citations
#1731

Effective and Efficient Masked Image Generation Models

Zebin You, Jingyang Ou, Xiaolu Zhang et al.

ICML 2025 • arXiv:2503.07197
9 citations
#1732

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation

Runze Liu, Yali Du, Fengshuo Bai et al.

ICML 2024 • arXiv:2306.03615
9 citations
#1733

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Damien Teney, Jindong Wang, Ehsan Abbasnejad

ICML 2024 • arXiv:2305.16817
9 citations
#1734

Compositional Risk Minimization

Divyat Mahajan, Mohammad Pezeshki, Charles Arnal et al.

ICML 2025 • arXiv:2410.06303
9 citations
#1735

Position: Tensor Networks are a Valuable Asset for Green AI

Eva Memmel, Clara Menzen, Jetze Schuurmans et al.

ICML 2024 • arXiv:2205.12961
9 citations
#1736

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen et al.

ICML 2025 • arXiv:2406.08477
9 citations
#1737

Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution Shifts

Ha Manh Bui, Anqi Liu

ICML 2024 • arXiv:2302.06495
9 citations
#1738

Prototypical Transformer As Unified Motion Learners

Cheng Han, Yawen Lu, Guohao Sun et al.

ICML 2024 • arXiv:2406.01559
9 citations
#1739

Privacy Attacks in Decentralized Learning

Abdellah El Mrini, Edwige Cyffers, Aurélien Bellet

ICML 2024 • arXiv:2402.10001
9 citations
#1740

Toward a Unified Theory of Gradient Descent under Generalized Smoothness

Alexander Tyurin

ICML 2025 • arXiv:2412.11773
9 citations
#1741

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025 (spotlight) • arXiv:2505.24445
9 citations
#1742

Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel

Uri Gadot, Kaixin Wang, Navdeep Kumar et al.

ICML 2024 • arXiv:2306.05859
9 citations
#1743

GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks

Shivanshu Gupta, Clemens Rosenbaum, Ethan R. Elenberg

ICML 2024 • arXiv:2311.09606
9 citations
#1744

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen et al.

ICML 2024 • arXiv:2405.09800
9 citations
#1745

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies

Nadav Timor, Jonathan Mamou, Daniel Korat et al.

ICML 2025 (oral) • arXiv:2502.05202
9 citations
#1746

On the Diminishing Returns of Width for Continual Learning

Etash Guha, Vihan Lakshman

ICML 2024 • arXiv:2403.06398
9 citations
#1747

Autaptic Synaptic Circuit Enhances Spatio-temporal Predictive Learning of Spiking Neural Networks

Lihao Wang, Zhaofei Yu

ICML 2024 (oral) • arXiv:2406.00405
9 citations
#1748

BoA: Attention-aware Post-training Quantization without Backpropagation

Junhan Kim, Ho-young Kim, Eulrang Cho et al.

ICML 2025 • arXiv:2406.13474
9 citations
#1749

Sequential Disentanglement by Extracting Static Information From A Single Sequence Element

Nimrod Berman, Ilan Naiman, Idan Arbiv et al.

ICML 2024 • arXiv:2406.18131
9 citations
#1750

In-Context Deep Learning via Transformer Models

Weimin Wu, Maojiang Su, Jerry Yao-Chieh Hu et al.

ICML 2025 • arXiv:2411.16549
9 citations
#1751

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

Swetha Ganesh, Washim Mondal, Vaneet Aggarwal

ICML 2025 • arXiv:2407.18878
9 citations
#1752

Operator SVD with Neural Networks via Nested Low-Rank Approximation

Jongha (Jon) Ryu, Xiangxiang Xu, Hasan Sabri Melihcan Erol et al.

ICML 2024 • arXiv:2402.03655
9 citations
#1753

Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference

Md Musfiqur Rahman, Murat Kocaoglu

ICML 2024 • arXiv:2401.01426
9 citations
#1754

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks

Stefano Mannelli, Yaraslau Ivashynka, Andrew Saxe et al.

ICML 2024 • arXiv:2406.01589
9 citations
#1755

Unsupervised Concept Discovery Mitigates Spurious Correlations

Md Rifat Arefin, Yan Zhang, Aristide Baratin et al.

ICML 2024 • arXiv:2402.13368
9 citations
#1756

Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution

Xihaier Luo, Xiaoning Qian, Byung-Jun Yoon

ICML 2024 • arXiv:2405.12202
9 citations
#1757

Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

Bowen Gao, Minsi Ren, Yuyan Ni et al.

ICML 2024 • arXiv:2403.12987
9 citations
#1758

From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium

Yi Xie, Zhanke Zhou, Chentao Cao et al.

ICML 2025 • arXiv:2506.08292
9 citations
#1759

FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized Scaler

Hongyi Peng, Han Yu, Xiaoli Tang et al.

ICML 2024 • arXiv:2405.15458
9 citations
#1760

Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning

Yuxiao Wen, Arthur Jacot

ICML 2024 • arXiv:2402.08010
9 citations
#1761

GeoMFormer: A General Architecture for Geometric Molecular Representation Learning

Tianlang Chen, Shengjie Luo, Di He et al.

ICML 2024 • arXiv:2406.16853
9 citations
#1762

A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer

Zhangyang Gao, Daize Dong, Cheng Tan et al.

ICML 2024 • arXiv:2402.02464
9 citations
#1763

BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

Hoyong Choi, Nohyun Ki, Hye Won Chung

ICML 2024 • arXiv:2406.03057
9 citations
#1764

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

David Heurtel-Depeiges, Anian Ruoss, Joel Veness et al.

ICML 2025 • arXiv:2410.05078
9 citations
#1765

Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates

Rémi Leluc, Aymeric Dieuleveut, François Portier et al.

ICML 2024 • arXiv:2402.01493
9 citations
#1766

VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception

Zhaoliang Wan, Yonggen Ling, Senlin Yi et al.

ICML 2024 • arXiv:2501.00510
9 citations
#1767

Differentially Private Decentralized Learning with Random Walks

Edwige Cyffers, Aurélien Bellet, Jalaj Upadhyay

ICML 2024 • arXiv:2402.07471
9 citations
#1768

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective

Simon Geisler, Tom Wollschläger, M. Hesham Abdalla et al.

ICML 2025 • arXiv:2502.17254
9 citations
#1769

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Han Zhong, Yutong Yin, Shenao Zhang et al.

ICML 2025 • arXiv:2501.18858
9 citations
#1770

Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

Ming-Kun Xie, Jia-Hao Xiao, Pei Peng et al.

ICML 2024 • arXiv:2404.06287
9 citations
#1771

Disguised Copyright Infringement of Latent Diffusion Models

Yiwei Lu, Matthew Yang, Zuoqiu Liu et al.

ICML 2024 • arXiv:2404.06737
9 citations
#1772

Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

Chuanhao Sun, Zhihang Yuan, Kai Xu et al.

ICML 2024 • arXiv:2407.09370
9 citations
#1773

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Simon Park, Abhishek Panigrahi, Yun Cheng et al.

ICML 2025 • arXiv:2501.02669
9 citations
#1774

Neural SPH: Improved Neural Modeling of Lagrangian Fluid Dynamics

Artur Toshev, Jonas Erbesdobler, Nikolaus Adams et al.

ICML 2024 • arXiv:2402.06275
9 citations
#1775

Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers

Weilun Feng, Chuanguang Yang, Haotong Qin et al.

ICML 2025 (oral) • arXiv:2505.22167
9 citations
#1776

Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning

Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang et al.

ICML 2024 • arXiv:2407.12448
9 citations
#1777

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

Jiin Woo, Laixi Shi, Gauri Joshi et al.

ICML 2024 • arXiv:2402.05876
9 citations
#1778

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

Ziquan Liu, Yufei Cui, Yan Yan et al.

ICML 2024 • arXiv:2405.08886
9 citations
#1779

Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces

Tyler Ingebrand, Adam Thorpe, Ufuk Topcu

ICML 2025 • arXiv:2501.18373
9 citations
#1780

Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction

Yi He, Yiming Yang, Xiaoyuan Cheng et al.

ICML 2025 • arXiv:2504.20858
9 citations
#1781

Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer

Blake Bordelon, Cengiz Pehlevan

ICML 2025 • arXiv:2502.02531
9 citations
#1782

Visual Representation Learning with Stochastic Frame Prediction

Huiwon Jang, Dongyoung Kim, Junsu Kim et al.

ICML 2024 (oral) • arXiv:2406.07398
9 citations
#1783

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Ravi Ghadia, Avinash Kumar, Gaurav Jain et al.

ICML 2025 • arXiv:2503.00979
9 citations
#1784

Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training Data

Kang Lin, Reinhard Heckel

ICML 2024 • arXiv:2312.10271
9 citations
#1785

Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds

Shion Takeno, Yu Inatsu, Masayuki Karasuyama et al.

ICML 2024 • arXiv:2311.03760
9 citations
#1786

Stochastic Q-learning for Large Discrete Action Spaces

Fares Fourati, Vaneet Aggarwal, Mohamed-Slim Alouini

ICML 2024 • arXiv:2405.10310
9 citations
#1787

Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Haoye Lu, Qifan Wu, Yaoliang Yu

ICML 2025 • arXiv:2502.05446
9 citations
#1788

KernelWarehouse: Rethinking the Design of Dynamic Convolution

Chao Li, Anbang Yao

ICML 2024 • arXiv:2406.07879
9 citations
#1789

Near-Optimal Sample Complexity for MDPs via Anchoring

Jongmin Lee, Mario Bravo, Roberto Cominetti

ICML 2025 • arXiv:2502.04477
9 citations
#1790

Highly Compressed Tokenizer Can Generate Without Training

Lukas Lao Beyer, Tianhong Li, Xinlei Chen et al.

ICML 2025 • arXiv:2506.08257
9 citations
#1791

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025 • arXiv:2501.18537
9 citations
#1792

Boosting Offline Optimizers with Surrogate Sensitivity

Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong et al.

ICML 2024 • arXiv:2503.04181
9 citations
#1793

PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model

Baijiong Lin, Weisen Jiang, Yuancheng Xu et al.

ICML 2025 • arXiv:2505.06274
9 citations
#1794

Graph External Attention Enhanced Transformer

Jianqing Liang, Min Chen, Jiye Liang

ICML 2024 • arXiv:2405.21061
9 citations
#1795

Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

Jiale Zhao, Wanru Zhuang, Jia Song et al.

ICML 2024 • arXiv:2402.01481
9 citations
#1796

Rethinking Transformers in Solving POMDPs

Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.

ICML 2024 • arXiv:2405.17358
9 citations
#1797

ProofAug: Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis

Haoxiong Liu, Jiacheng Sun, Zhenguo Li et al.

ICML 2025 • arXiv:2501.18310
9 citations
#1798

Causal Discovery with Fewer Conditional Independence Tests

Kirankumar Shiragur, Jiaqi Zhang, Caroline Uhler

ICML 2024 • arXiv:2406.01823
9 citations
#1799

MindAligner: Explicit Brain Functional Alignment for Cross-Subject Visual Decoding from Limited fMRI Data

Yuqin Dai, Zhouheng Yao, Chunfeng Song et al.

ICML 2025 • arXiv:2502.05034
9 citations
#1800

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong Huang, Guangtao Zeng, Jianbo Dai et al.

ICML 2025 • arXiv:2410.10209
9 citations