Most Cited ICML "reinforcement learning variance" Papers

5,975 papers found • Page 9 of 30

Filters:Most Cited ICML reinforcement learning variance Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#1601

Interpreting the Repeated Token Phenomenon in Large Language Models

Itay Yona, Ilia Shumailov, Jamie Hayes et al.

ICML 2025arXiv:2503.08908

citations

#1602

Self-Composing Policies for Scalable Continual Reinforcement Learning

Mikel Malagón, Josu Ceberio, Jose A Lozano

ICML 2024arXiv:2506.14811

citations

#1603

Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments

Jonas Schweisthal, Dennis Frauen, M van der Schaar et al.

ICML 2024arXiv:2406.02464

citations

#1604

On Expressive Power of Looped Transformers: Theoretical Analysis and Enhancement via Timestep Encoding

Kevin Xu, Issei Sato

ICML 2025arXiv:2410.01405

citations

#1605

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024arXiv:2311.17105

citations

#1606

A Closer Look at Multimodal Representation Collapse

Abhra Chaudhuri, Anjan Dutta, Tu Bui et al.

ICML 2025spotlightarXiv:2505.22483

citations

#1607

A Mixture-Based Framework for Guiding Diffusion Models

Yazid Janati, Badr MOUFAD, Mehdi Qassime et al.

ICML 2025arXiv:2502.03332

citations

#1608

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

Xiaoyu Wen, Chenjia Bai, Kang Xu et al.

ICML 2024arXiv:2405.06192

citations

#1609

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan LIU, Shibo Chen et al.

ICML 2024arXiv:2405.18110

citations

#1610

SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels

Malte Mosbach, Jan Ewertz, Angel Villar-Corrales et al.

ICML 2025arXiv:2410.08822

citations

#1611

R.I.P.: Better Models by Survival of the Fittest Prompts

Ping Yu, Weizhe Yuan, Olga Golovneva et al.

ICML 2025arXiv:2501.18578

citations

#1612

PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics Modeling

Phong Nguyen, Xinlun Cheng, Shahab Azarfar et al.

ICML 2024oralarXiv:2402.12503

citations

#1613

RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression

Payman Behnam, Yaosheng Fu, Ritchie Zhao et al.

ICML 2025arXiv:2502.14051

citations

#1614

Libra: Building Decoupled Vision System on Large Language Models

Yifan Xu, Xiaoshan Yang, Yaguang Song et al.

ICML 2024arXiv:2405.10140

citations

#1615

Synthesizing Software Engineering Data in a Test-Driven Manner

Lei Zhang, Jiaxi Yang, Min Yang et al.

ICML 2025arXiv:2506.09003

citations

#1616

Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation

Zhuohao Yu, Weizheng Gu, Yidong Wang et al.

ICML 2025arXiv:2412.15118

citations

#1617

Sample as you Infer: Predictive Coding with Langevin Dynamics

Umais Zahid, Qinghai Guo, Zafeirios Fountas

ICML 2024arXiv:2311.13664

citations

#1618

Oscillation-Reduced MXFP4 Training for Vision Transformers

Yuxiang Chen, Haocheng Xi, Jun Zhu et al.

ICML 2025arXiv:2502.20853

citations

#1619

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Xixi Wu, Yifei Shen, Fangzhou Ge et al.

ICML 2025arXiv:2502.00829

citations

#1620

Fundamental limits of learning in sequence multi-index models and deep attention networks: high-dimensional asymptotics and sharp thresholds

Emanuele Troiani, Hugo Cui, Yatin Dandi et al.

ICML 2025arXiv:2502.00901

citations

#1621

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

ICML 2024arXiv:2405.08540

citations

#1622

Constrain Alignment with Sparse Autoencoders

Qingyu Yin, Chak Tou Leong, Hongbo Zhang et al.

ICML 2025arXiv:2411.07618

citations

#1623

Temperature-Annealed Boltzmann Generators

Henrik Schopmans, Pascal Friederich

ICML 2025arXiv:2501.19077

citations

#1624

Solving Poisson Equations using Neural Walk-on-Spheres

Hong Chul Nam, Julius Berner, Anima Anandkumar

ICML 2024arXiv:2406.03494

citations

#1625

How Much Can Transfer? BRIDGE: Bounded Multi-Domain Graph Foundation Model with Generalization Guarantees

Haonan Yuan, Qingyun Sun, Junhua Shi et al.

ICML 2025

citations

#1626

Discovering Mixtures of Structural Causal Models from Time Series Data

Sumanth Varambally, Yian Ma, Rose Yu

ICML 2024arXiv:2310.06312

citations

#1627

GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs

Yue Wang, Qizhou Wang, Feng Liu et al.

ICML 2025arXiv:2503.09117

citations

#1628

CuTS: Customizable Tabular Synthetic Data Generation

Mark Vero, Mislav Balunovic, Martin Vechev

ICML 2024arXiv:2307.03577

citations

#1629

SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning

Jinpeng Chen, Runmin Cong, Yuzhi Zhao et al.

ICML 2025arXiv:2505.02486

citations

#1630

Stereographic Spherical Sliced Wasserstein Distances

Huy Tran, Yikun Bai, Abihith Kothapalli et al.

ICML 2024spotlightarXiv:2402.02345

citations

#1631

Hybrid$^2$ Neural ODE Causal Modeling and an Application to Glycemic Response

Junyi Zou, Matthew Levine, Dessi Zaharieva et al.

ICML 2024arXiv:2402.17233

citations

#1632

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences

Zicheng Liu, Siyuan Li, Li Wang et al.

ICML 2024arXiv:2406.08128

citations

#1633

A Distributional Analogue to the Successor Representation

Harley Wiltzer, Jesse Farebrother, Arthur Gretton et al.

ICML 2024spotlightarXiv:2402.08530

citations

#1634

Improving Transformer World Models for Data-Efficient RL

Antoine Dedieu, Joseph Ortiz, Xinghua Lou et al.

ICML 2025arXiv:2502.01591

citations

#1635

Unifying 2D and 3D Vision-Language Understanding

Ayush Jain, Alexander Swerdlow, Yuzhou Wang et al.

ICML 2025arXiv:2503.10745

citations

#1636

Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik

ICML 2024arXiv:2405.18217

citations

#1637

Fair Off-Policy Learning from Observational Data

Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel

ICML 2024oralarXiv:2303.08516

citations

#1638

OTMatch: Improving Semi-Supervised Learning with Optimal Transport

Zhiquan Tan, Kaipeng Zheng, Weiran Huang

ICML 2024arXiv:2310.17455

citations

#1639

More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms

Hossein Zakerinia, Amin Behjati, Christoph Lampert

ICML 2024arXiv:2402.04054

citations

#1640

Simple Policy Optimization

Zhengpeng Xie, Qiang Zhang, Fan Yang et al.

ICML 2025arXiv:2401.16025

citations

#1641

Quantum Positional Encodings for Graph Neural Networks

Slimane Thabet, Mehdi Djellabi, Igor Sokolov et al.

ICML 2024arXiv:2406.06547

citations

#1642

Exploring the Low-Pass Filtering Behavior in Image Super-Resolution

Haoyu Deng, Zijing Xu, Yule Duan et al.

ICML 2024arXiv:2405.07919

citations

#1643

CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection

Lin Zhu, Yifeng Yang, Qinying Gu et al.

ICML 2024arXiv:2405.16417

citations

#1644

Translation Equivariant Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.

ICML 2024oralarXiv:2406.12409

citations

#1645

MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking

Sebastian Farquhar, Vikrant Varma, David Lindner et al.

ICML 2025arXiv:2501.13011

citations

#1646

Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence

Yinbin Han, Meisam Razaviyayn, Renyuan Xu

ICML 2025arXiv:2412.18164

citations

#1647

Global Reinforcement Learning : Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Riccardo De Santi, Manish Prajapat, Andreas Krause

ICML 2024arXiv:2407.09905

citations

#1648

Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment

Yifan Zhang, Ge Zhang, Yue Wu et al.

ICML 2025arXiv:2410.02197

citations

#1649

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning

Angel Villar-Corrales, Sven Behnke

ICML 2025arXiv:2502.07600

citations

#1650

Multi-Session Budget Optimization for Forward Auction-based Federated Learning

Xiaoli Tang, Han Yu, Zengxiang Li et al.

ICML 2025arXiv:2311.12548

citations

#1651

How Transformers Learn Structured Data: Insights From Hierarchical Filtering

Jerome Garnier-Brun, Marc Mezard, Emanuele Moscato et al.

ICML 2025arXiv:2408.15138

citations

#1652

DAMA: Data- and Model-aware Alignment of Multi-modal LLMs

Jinda Lu, Junkang Wu, Jinghan Li et al.

ICML 2025arXiv:2502.01943

citations

#1653

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025arXiv:2405.17527

citations

#1654

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

Zhenqiao Song, Yunlong Zhao, Wenxian Shi et al.

ICML 2024arXiv:2405.08205

citations

#1655

A2Q+: Improving Accumulator-Aware Weight Quantization

Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.

ICML 2024arXiv:2401.10432

citations

#1656

Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions

Jingtan Wang, Xiaoqiang Lin, Rui Qiao et al.

ICML 2024arXiv:2406.04606

citations

#1657

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks

Khurram Javed, Haseeb Shah, Richard Sutton et al.

ICML 2024arXiv:2302.05326

citations

#1658

HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration

Yushi Huang, Zining Wang, Ruihao Gong et al.

ICML 2025arXiv:2410.01723

citations

#1659

Tool Unlearning for Tool-Augmented LLMs

Jiali Cheng, Hadi Amiri

ICML 2025arXiv:2502.01083

citations

#1660

Creative Text-to-Audio Generation via Synthesizer Programming

Manuel Cherep, Nikhil Singh, Jessica Shand

ICML 2024arXiv:2406.00294

citations

#1661

Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces

Anjiang Wei, Allen Nie, Thiago Teixeira et al.

ICML 2025arXiv:2410.15625

citations

#1662

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

Zelei Cheng, Xian Wu, Jiahao Yu et al.

ICML 2024spotlightarXiv:2405.03064

citations

#1663

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis

Xu Wang, Yan Hu, Wenyu Du et al.

ICML 2025arXiv:2502.11812

citations

#1664

On Measuring Long-Range Interactions in Graph Neural Networks

Jacob Bamberger, Benjamin Gutteridge, Scott le Roux et al.

ICML 2025arXiv:2506.05971

citations

#1665

Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs

Shenzhi Yang, Bin Liang, An Liu et al.

ICML 2024arXiv:2504.13429

citations

#1666

BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks

Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo et al.

ICML 2024arXiv:2404.00924

citations

#1667

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

Wenzhe Li, Zihan Ding, Seth Karten et al.

ICML 2024arXiv:2406.02081

citations

#1668

Reducing sequential change detection to sequential estimation

Shubhanshu Shekhar, Aaditya Ramdas

ICML 2024arXiv:2309.09111

citations

#1669

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Honghao Chen, Zhang Yurong, xiaokun Feng et al.

ICML 2024arXiv:2407.08972

citations

#1670

Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Martina G. Vilas, Federico Adolfi, David Poeppel et al.

ICML 2024arXiv:2406.01352

citations

#1671

ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation

Angxiao Yue, Zichong Wang, Hongteng Xu

ICML 2025arXiv:2502.14637

citations

#1672

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Ruijie Zheng, Yongyuan Liang, xiyao wang et al.

ICML 2024oralarXiv:2402.06187

citations

#1673

Objective drives the consistency of representational similarity across datasets

Laure Ciernik, Lorenz Linhardt, Marco Morik et al.

ICML 2025arXiv:2411.05561

citations

#1674

Tandem Transformers for Inference Efficient LLMs

Aishwarya P S, Pranav Nair, Yashas Samaga et al.

ICML 2024arXiv:2402.08644

citations

#1675

Improved Off-policy Reinforcement Learning in Biological Sequence Design

Hyeonah Kim, Minsu Kim, Taeyoung Yun et al.

ICML 2025arXiv:2410.04461

citations

#1676

Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often!

Milad Sefidgaran, Romain Chor, Abdellatif Zaidi et al.

ICML 2024arXiv:2306.05862

citations

#1677

Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective

Soo Yong Lee, Sunwoo Kim, Fanchen Bu et al.

ICML 2024arXiv:2402.04621

citations

#1678

Efficient Pareto Manifold Learning with Low-Rank Structure

Weiyu CHEN, James Kwok

ICML 2024spotlightarXiv:2407.20734

citations

#1679

Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference

Marvin Schmitt, Desi Ivanova, Daniel Habermann et al.

ICML 2024arXiv:2310.04395

citations

#1680

Compressed Image Generation with Denoising Diffusion Codebook Models

Guy Ohayon, Hila Manor, Tomer Michaeli et al.

ICML 2025arXiv:2502.01189

citations

#1681

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025arXiv:2504.05304

citations

#1682

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang, Aaron Roth, Linjun Zhang

ICML 2024arXiv:2405.02225

citations

#1683

Safety Reasoning with Guidelines

Haoyu Wang, Zeyu Qin, Li Shen et al.

ICML 2025arXiv:2502.04040

citations

#1684

One-Shot Strategic Classification Under Unknown Costs

Elan Rosenfeld, Nir Rosenfeld

ICML 2024arXiv:2311.02761

citations

#1685

Mean Estimation in the Add-Remove Model of Differential Privacy

Alex Kulesza, Ananda Suresh, Yuyan Wang

ICML 2024arXiv:2312.06658

citations

#1686

Estimating Barycenters of Distributions with Neural Optimal Transport

Alexander Kolesov, Petr Mokrov, Igor Udovichenko et al.

ICML 2024arXiv:2402.03828

citations

#1687

AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion

Adeesh Kolluru, John Kitchin

ICML 2024arXiv:2405.03962

citations

#1688

Nonparametric Teaching of Implicit Neural Representations

Chen Zhang, Steven T. S. Luo, Jason Chun Lok Li et al.

ICML 2024arXiv:2405.10531

citations

#1689

Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems

Ta Duy Nguyen, Alina Ene

ICML 2024arXiv:2405.18809

citations

#1690

MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Ruida Wang, Rui Pan, Yuxin Li et al.

ICML 2025arXiv:2503.03205

citations

#1691

Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses

Changyu Gao, Andrew Lowy, Xingyu Zhou et al.

ICML 2024arXiv:2407.09690

citations

#1692

How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers

Gon Buzaglo, Itamar Harel, Mor Shpigel Nacson et al.

ICML 2024spotlightarXiv:2402.06323

citations

#1693

Weisfeiler Leman for Euclidean Equivariant Machine Learning

Snir Hordan, Tal Amir, Nadav Dym

ICML 2024arXiv:2402.02484

citations

#1694

Zebra: In-Context Generative Pretraining for Solving Parametric PDEs

Louis Serrano, Armand Kassaï Koupaï, Thomas Wang et al.

ICML 2025arXiv:2410.03437

citations

#1695

The Role of Learning Algorithms in Collective Action

Omri Ben-Dov, Jake Fawkes, Samira Samadi et al.

ICML 2024arXiv:2405.06582

citations

#1696

Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning

Lirong Wu, Yijun Tian, Haitao Lin et al.

ICML 2024arXiv:2405.10348

citations

#1697

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Zi-Hao Qiu, Siqi Guo, Mao Xu et al.

ICML 2024arXiv:2404.04575

citations

#1698

Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

Yudan Wang, Yue Wang, Yi Zhou et al.

ICML 2024oralarXiv:2406.01762

citations

#1699

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Trung Pham, Kang Zhang, Chang Yoo

ICML 2024arXiv:2402.01516

citations

#1700

Latent Logic Tree Extraction for Event Sequence Explanation from LLMs

Zitao Song, Chao Yang, Chaojie Wang et al.

ICML 2024oralarXiv:2406.01124

citations

#1701

Beyond the Calibration Point: Mechanism Comparison in Differential Privacy

Georgios Kaissis, Stefan Kolek, Borja de Balle Pigem et al.

ICML 2024arXiv:2406.08918

citations

#1702

Towards Understanding Inductive Bias in Transformers: A View From Infinity

Itay Lavie, Guy Gur-Ari, Zohar Ringel

ICML 2024arXiv:2402.05173

citations

#1703

Scaling Large Motion Models with Million-Level Human Motions

Ye Wang, Sipeng Zheng, Bin Cao et al.

ICML 2025arXiv:2410.03311

citations

#1704

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.

ICML 2024arXiv:2405.16646

citations

#1705

QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache

Rishabh Tiwari, Haocheng Xi, Aditya Tomar et al.

ICML 2025arXiv:2502.10424

citations

#1706

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Hengyi Wang, Shiwei Tan, Hao Wang

ICML 2024arXiv:2406.12649

citations

#1707

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025arXiv:2402.05806

citations

#1708

Scalable Online Exploration via Coverability

Philip Amortila, Dylan Foster, Akshay Krishnamurthy

ICML 2024arXiv:2403.06571

citations

#1709

An Analysis for Reasoning Bias of Language Models with Small Initialization

Junjie Yao, zhongwang zhang, Zhi-Qin John Xu

ICML 2025spotlightarXiv:2502.04375

citations

#1710

Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems

Taejin Park, Ivan Medennikov, Kunal Dhawan et al.

ICML 2025arXiv:2409.06656

citations

#1711

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Yuchen Zeng, Tuan Dinh, Wonjun Kang et al.

ICML 2025spotlightarXiv:2506.05584

citations

#1712

Position: Stop Making Unscientific AGI Performance Claims

Patrick Altmeyer, Andrew Demetriou, Antony Bartlett et al.

ICML 2024arXiv:2402.03962

citations

#1713

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Hao Li, Qi Lv, Rui Shao et al.

ICML 2025spotlightarXiv:2506.03863

citations

#1714

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

Zhuoling Li, Xiaogang Xu, Zhenhua Xu et al.

ICML 2025arXiv:2405.17424

citations

#1715

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025arXiv:2502.07587

citations

#1716

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Haoran You, Yichao Fu, Zheng Wang et al.

ICML 2024arXiv:2406.07368

citations

#1717

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025spotlightarXiv:2407.11784

citations

#1718

The Elicitation Game: Evaluating Capability Elicitation Techniques

Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.

ICML 2025arXiv:2502.02180

citations

#1719

Adaptively Perturbed Mirror Descent for Learning in Games

Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto et al.

ICML 2024arXiv:2305.16610

citations

#1720

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations

Kailas Vodrahalli, James Zou

ICML 2024arXiv:2306.08141

citations

#1721

AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

Yucen Wang, Shenghua Wan, Le Gan et al.

ICML 2024arXiv:2403.09976

citations

#1722

Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training

Jinxia Yang, Bing Su, Xin Zhao et al.

ICML 2024oralarXiv:2405.19654

citations

#1723

An Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

Emre Sahinoglu, Shahin Shahrampour

ICML 2024arXiv:2406.01484

citations

#1724

Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs

Bowen Tan, Zheng Xu, Eric Xing et al.

ICML 2025arXiv:2503.12347

citations

#1725

Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty

Yeseul Cho, Baekrok Shin, Changmin Kang et al.

ICML 2025arXiv:2502.06905

citations

#1726

LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)

Junsu Kim, Jaeyeon Kim, Ernest Ryu

ICML 2025oralarXiv:2502.09376

citations

#1727

Leveraging VLM-Based Pipelines to Annotate 3D Objects

Rishabh Kabra, Loic Matthey, Alexander Lerchner et al.

ICML 2024arXiv:2311.17851

citations

#1728

Textual Unlearning Gives a False Sense of Unlearning

Jiacheng Du, Zhibo Wang, Jie Zhang et al.

ICML 2025arXiv:2406.13348

citations

#1729

A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions

Sharath Raghvendra, Pouyan Shirzadian, Kaiyi Zhang

ICML 2024arXiv:2405.03664

citations

#1730

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025arXiv:2502.08991

citations

#1731

Effective and Efficient Masked Image Generation Models

Zebin You, Jingyang Ou, Xiaolu Zhang et al.

ICML 2025arXiv:2503.07197

citations

#1732

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation

Runze Liu, Yali Du, Fengshuo Bai et al.

ICML 2024arXiv:2306.03615

citations

#1733

Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Damien Teney, Jindong Wang, Ehsan Abbasnejad

ICML 2024arXiv:2305.16817

citations

#1734

Compositional Risk Minimization

Divyat Mahajan, Mohammad Pezeshki, Charles Arnal et al.

ICML 2025arXiv:2410.06303

citations

#1735

Position: Tensor Networks are a Valuable Asset for Green AI

Eva Memmel, Clara Menzen, Jetze Schuurmans et al.

ICML 2024arXiv:2205.12961

citations

#1736

Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens

Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen et al.

ICML 2025arXiv:2406.08477

citations

#1737

Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution Shifts

Ha Manh Bui, Anqi Liu

ICML 2024arXiv:2302.06495

citations

#1738

Prototypical Transformer As Unified Motion Learners

Cheng Han, Yawen Lu, Guohao Sun et al.

ICML 2024arXiv:2406.01559

citations

#1739

Privacy Attacks in Decentralized Learning

Abdellah El Mrini, Edwige Cyffers, Aurélien Bellet

ICML 2024arXiv:2402.10001

citations

#1740

Toward a Unified Theory of Gradient Descent under Generalized Smoothness

Alexander Tyurin

ICML 2025arXiv:2412.11773

citations

#1741

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025spotlightarXiv:2505.24445

citations

#1742

Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel

Uri Gadot, Kaixin Wang, Navdeep Kumar et al.

ICML 2024arXiv:2306.05859

citations

#1743

GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks

Shivanshu Gupta, Clemens Rosenbaum, Ethan R. Elenberg

ICML 2024arXiv:2311.09606

citations

#1744

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen et al.

ICML 2024arXiv:2405.09800

citations

#1745

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies

Nadav Timor, Jonathan Mamou, Daniel Korat et al.

ICML 2025oralarXiv:2502.05202

citations

#1746

On the Diminishing Returns of Width for Continual Learning

Etash Guha, Vihan Lakshman

ICML 2024arXiv:2403.06398

citations

#1747

Autaptic Synaptic Circuit Enhances Spatio-temporal Predictive Learning of Spiking Neural Networks

Lihao Wang, Zhaofei Yu

ICML 2024oralarXiv:2406.00405

citations

#1748

BoA: Attention-aware Post-training Quantization without Backpropagation

Junhan Kim, Ho-young Kim, Eulrang Cho et al.

ICML 2025arXiv:2406.13474

citations

#1749

Sequential Disentanglement by Extracting Static Information From A Single Sequence Element

Nimrod Berman, Ilan Naiman, Idan Arbiv et al.

ICML 2024arXiv:2406.18131

citations

#1750

In-Context Deep Learning via Transformer Models

Weimin Wu, Maojiang Su, Jerry Yao-Chieh Hu et al.

ICML 2025arXiv:2411.16549

citations

#1751

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

Swetha Ganesh, Washim Mondal, Vaneet Aggarwal

ICML 2025arXiv:2407.18878

citations

#1752

Operator SVD with Neural Networks via Nested Low-Rank Approximation

Jongha (Jon) Ryu, Xiangxiang Xu, Hasan Sabri Melihcan Erol et al.

ICML 2024arXiv:2402.03655

citations

#1753

Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference

Md Musfiqur Rahman, Murat Kocaoglu

ICML 2024arXiv:2401.01426

citations

#1754

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks

Stefano Mannelli, Yaraslau Ivashynka, Andrew Saxe et al.

ICML 2024arXiv:2406.01589

citations

#1755

Unsupervised Concept Discovery Mitigates Spurious Correlations

Md Rifat Arefin, Yan Zhang, Aristide Baratin et al.

ICML 2024arXiv:2402.13368

citations

#1756

Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution

Xihaier Luo, Xiaoning Qian, Byung-Jun Yoon

ICML 2024arXiv:2405.12202

citations

#1757

Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

Bowen Gao, Minsi Ren, Yuyan Ni et al.

ICML 2024arXiv:2403.12987

citations

#1758

From Debate to Equilibrium: Belief‑Driven Multi‑Agent LLM Reasoning via Bayesian Nash Equilibrium

Yi Xie, Zhanke Zhou, Chentao Cao et al.

ICML 2025arXiv:2506.08292

citations

#1759

FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized Scaler

Hongyi Peng, Han Yu, Xiaoli Tang et al.

ICML 2024arXiv:2405.15458

citations

#1760

Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning

Yuxiao Wen, Arthur Jacot

ICML 2024arXiv:2402.08010

citations

#1761

GeoMFormer: A General Architecture for Geometric Molecular Representation Learning

Tianlang Chen, Shengjie Luo, Di He et al.

ICML 2024arXiv:2406.16853

citations

#1762

A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer

Zhangyang Gao, Daize Dong, Cheng Tan et al.

ICML 2024arXiv:2402.02464

citations

#1763

BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

Hoyong Choi, Nohyun Ki, Hye Won Chung

ICML 2024arXiv:2406.03057

citations

#1764

Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data

David Heurtel-Depeiges, Anian Ruoss, Joel Veness et al.

ICML 2025arXiv:2410.05078

citations

#1765

Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates

Rémi Leluc, Aymeric Dieuleveut, François Portier et al.

ICML 2024arXiv:2402.01493

citations

#1766

VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception

Zhaoliang Wan, Yonggen Ling, Senlin Yi et al.

ICML 2024arXiv:2501.00510

citations

#1767

Differentially Private Decentralized Learning with Random Walks

Edwige Cyffers, Aurélien Bellet, Jalaj Upadhyay

ICML 2024arXiv:2402.07471

citations

#1768

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective

Simon Geisler, Tom Wollschläger, M. Hesham Abdalla et al.

ICML 2025arXiv:2502.17254

citations

#1769

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Han Zhong, Yutong Yin, Shenao Zhang et al.

ICML 2025arXiv:2501.18858

citations

#1770

Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

Ming-Kun Xie, Jia-Hao Xiao, Pei Peng et al.

ICML 2024arXiv:2404.06287

citations

#1771

Disguised Copyright Infringement of Latent Diffusion Models

Yiwei Lu, Matthew Yang, Zuoqiu Liu et al.

ICML 2024arXiv:2404.06737

citations

#1772

Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

Chuanhao Sun, Zhihang Yuan, Kai Xu et al.

ICML 2024arXiv:2407.09370

citations

#1773

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Simon Park, Abhishek Panigrahi, Yun Cheng et al.

ICML 2025arXiv:2501.02669

citations

#1774

Neural SPH: Improved Neural Modeling of Lagrangian Fluid Dynamics

Artur Toshev, Jonas Erbesdobler, Nikolaus Adams et al.

ICML 2024arXiv:2402.06275

citations

#1775

Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers

Weilun Feng, Chuanguang Yang, Haotong Qin et al.

ICML 2025oralarXiv:2505.22167

citations

#1776

Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning

Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang et al.

ICML 2024arXiv:2407.12448

citations

#1777

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

Jiin Woo, Laixi Shi, Gauri Joshi et al.

ICML 2024arXiv:2402.05876

citations

#1778

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

Ziquan Liu, Yufei Cui, Yan Yan et al.

ICML 2024arXiv:2405.08886

citations

#1779

Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces

Tyler Ingebrand, Adam Thorpe, Ufuk Topcu

ICML 2025arXiv:2501.18373

citations

#1780

Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction

Yi He, Yiming Yang, Xiaoyuan Cheng et al.

ICML 2025arXiv:2504.20858

citations

#1781

Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer

Blake Bordelon, Cengiz Pehlevan

ICML 2025arXiv:2502.02531

citations

#1782

Visual Representation Learning with Stochastic Frame Prediction

Huiwon Jang, Dongyoung Kim, Junsu Kim et al.

ICML 2024oralarXiv:2406.07398

citations

#1783

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Ravi Ghadia, Avinash Kumar, Gaurav Jain et al.

ICML 2025arXiv:2503.00979

citations

#1784

Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training Data

Kang Lin, Reinhard Heckel

ICML 2024arXiv:2312.10271

citations

#1785

Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds

Shion Takeno, Yu Inatsu, Masayuki Karasuyama et al.

ICML 2024arXiv:2311.03760

citations

#1786

Stochastic Q-learning for Large Discrete Action Spaces

Fares Fourati, Vaneet Aggarwal, Mohamed-Slim Alouini

ICML 2024arXiv:2405.10310

citations

#1787

Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Haoye Lu, Qifan Wu, Yaoliang Yu

ICML 2025arXiv:2502.05446

citations

#1788

KernelWarehouse: Rethinking the Design of Dynamic Convolution

Chao Li, Anbang Yao

ICML 2024arXiv:2406.07879

citations

#1789

Near-Optimal Sample Complexity for MDPs via Anchoring

Jongmin Lee, Mario Bravo, Roberto Cominetti

ICML 2025arXiv:2502.04477

citations

#1790

Highly Compressed Tokenizer Can Generate Without Training

Lukas Lao Beyer, Tianhong Li, Xinlei Chen et al.

ICML 2025arXiv:2506.08257

citations

#1791

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025arXiv:2501.18537

citations

#1792

Boosting Offline Optimizers with Surrogate Sensitivity

Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong et al.

ICML 2024arXiv:2503.04181

citations

#1793

PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model

Baijiong Lin, Weisen Jiang, Yuancheng Xu et al.

ICML 2025arXiv:2505.06274

citations

#1794

Graph External Attention Enhanced Transformer

Jianqing Liang, Min Chen, Jiye Liang

ICML 2024arXiv:2405.21061

citations

#1795

Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

Jiale Zhao, Wanru Zhuang, Jia Song et al.

ICML 2024arXiv:2402.01481

citations

#1796

Rethinking Transformers in Solving POMDPs

Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.

ICML 2024arXiv:2405.17358

citations

#1797

ProofAug: Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis

Haoxiong Liu, Jiacheng Sun, Zhenguo Li et al.

ICML 2025arXiv:2501.18310

citations

#1798

Causal Discovery with Fewer Conditional Independence Tests

Kirankumar Shiragur, Jiaqi Zhang, Caroline Uhler

ICML 2024arXiv:2406.01823

citations

#1799

MindAligner: Explicit Brain Functional Alignment for Cross-Subject Visual Decoding from Limited fMRI Data

Yuqin Dai, Zhouheng Yao, Chunfeng Song et al.

ICML 2025arXiv:2502.05034

citations

#1800

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong HUANG, Guangtao Zeng, Jianbo Dai et al.

ICML 2025arXiv:2410.10209

citations

← Previous

1...7 8 9 10 11...30