Most Cited ICML "conditional outcome invariance" Papers

5,975 papers found • Page 12 of 30

#2201

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.

ICML 2025oralarXiv:2506.00592
7
citations
#2202

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025spotlightarXiv:2507.07544
7
citations
#2203

Do Transformer World Models Give Better Policy Gradients?

Michel Ma, Tianwei Ni, Clement Gehring et al.

ICML 2024arXiv:2402.05290
7
citations
#2204

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

Haoyang Zheng, Hengrong Du, Qi Feng et al.

ICML 2024arXiv:2405.07839
7
citations
#2205

Unraveling the Interplay between Carryover Effects and Reward Autocorrelations in Switchback Experiments

Qianglin Wen, Chengchun Shi, Ying Yang et al.

ICML 2025arXiv:2403.17285
7
citations
#2206

Infinite-Horizon Distributionally Robust Regret-Optimal Control

Taylan Kargin, Joudi Hajar, Vikrant Malik et al.

ICML 2024arXiv:2406.07248
7
citations
#2207

EvoluNet: Advancing Dynamic Non-IID Transfer Learning on Graphs

Haohui Wang, Yuzhen Mao, Yujun Yan et al.

ICML 2024oralarXiv:2305.00664
7
citations
#2208

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Erpai Luo, Xinran Wei, Lin Huang et al.

ICML 2025arXiv:2502.01171
7
citations
#2209

Scaling Laws for Upcycling Mixture-of-Experts Language Models

Seng Pei Liew, Takuya Kato, Sho Takase

ICML 2025arXiv:2502.03009
7
citations
#2210

EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time

Shengyao Lu, Bang Liu, Keith Mills et al.

ICML 2024arXiv:2405.01762
7
citations
#2211

Interacting Diffusion Processes for Event Sequence Forecasting

Mai Zeng, Florence Regol, Mark Coates

ICML 2024oralarXiv:2310.17800
7
citations
#2212

Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models

Ludwig Winkler, Lorenz Richter, Manfred Opper

ICML 2024arXiv:2405.03549
6
citations
#2213

Understanding Generalization in Quantum Machine Learning with Margins

TAK HUR, Daniel Kyungdeock Park

ICML 2025arXiv:2411.06919
6
citations
#2214

Position: The Causal Revolution Needs Scientific Pragmatism

Joshua Loftus

ICML 2024arXiv:2406.02275
6
citations
#2215

UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs

Xi Han, Fei Hou, Hong Qin

ICML 2024arXiv:2408.04846
6
citations
#2216

Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling

Xiang Hu, Zhihao Teng, Jun Zhao et al.

ICML 2025arXiv:2410.01651
6
citations
#2217

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning

Inwoo Hwang, Yunhyeok Kwak, Suhyung Choi et al.

ICML 2024arXiv:2406.03234
6
citations
#2218

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng et al.

ICML 2025arXiv:2410.09016
6
citations
#2219

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Rui Yang, Lin Song, Yicheng Xiao et al.

ICML 2025arXiv:2503.14694
6
citations
#2220

$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts

Guanjie Chen, Xinyu Zhao, Tianlong Chen et al.

ICML 2024arXiv:2406.11353
6
citations
#2221

What can large language models do for sustainable food?

Anna Thomas, Adam Yee, Andrew Mayne et al.

ICML 2025arXiv:2503.04734
6
citations
#2222

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025arXiv:2110.06257
6
citations
#2223

REG: Rectified Gradient Guidance for Conditional Diffusion Models

Zhengqi Gao, Kaiwen Zha, Tianyuan Zhang et al.

ICML 2025arXiv:2501.18865
6
citations
#2224

Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks

Akshay Kumar Jagadish, Julian Coda-Forno, Mirko Thalmann et al.

ICML 2024arXiv:2402.01821
6
citations
#2225

Minimizing $f$-Divergences by Interpolating Velocity Fields

Song Liu, Jiahao Yu, Jack Simons et al.

ICML 2024arXiv:2305.15577
6
citations
#2226

Extracting Training Data From Document-Based VQA Models

Francesco Pinto, Nathalie Rauschmayr, Florian Tramer et al.

ICML 2024arXiv:2407.08707
6
citations
#2227

On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization

Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.

ICML 2025arXiv:2506.05945
6
citations
#2228

TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision

Zhuo Chen, Jacob McCarran, Esteban Vizcaino et al.

ICML 2024arXiv:2404.10771
6
citations
#2229

COPAL: Continual Pruning in Large Language Generative Models

Srikanth Malla, Joon Hee Choi, Chiho Choi

ICML 2024arXiv:2405.02347
6
citations
#2230

Revisiting Character-level Adversarial Attacks for Language Models

Elias Abad Rocamora, Yongtao Wu, Fanghui Liu et al.

ICML 2024arXiv:2405.04346
6
citations
#2231

SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity

Tianshu Chu, Dachuan Xu, Wei Yao et al.

ICML 2024arXiv:2405.18777
6
citations
#2232

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025arXiv:2411.02083
6
citations
#2233

SPD: Sync-Point Drop for Efficient Tensor Parallelism of Large Language Models

Han-Byul Kim, Duc Hoang, Arnav Kundu et al.

ICML 2025arXiv:2502.20727
6
citations
#2234

Unmasking Vulnerabilities: Cardinality Sketches under Adaptive Inputs

Sara Ahmadian, Edith Cohen

ICML 2024arXiv:2405.17780
6
citations
#2235

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025arXiv:2410.20210
6
citations
#2236

Goal-Space Planning with Subgoal Models

Chunlok Lo, Kevin Roice, Parham Mohammad Panahi et al.

ICML 2025oralarXiv:2206.02902
6
citations
#2237

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints

Yunsheng Tian, Ane Zuniga, Xinwei Zhang et al.

ICML 2024arXiv:2402.07692
6
citations
#2238

Cooperation of Experts: Fusing Heterogeneous Information with Large Margin

Shuo Wang, Shunyang Huang, Jinghui Yuan et al.

ICML 2025arXiv:2505.20853
6
citations
#2239

MADA: Meta-Adaptive Optimizers Through Hyper-Gradient Descent

Kaan Ozkara, Can Karakus, Parameswaran Raman et al.

ICML 2024arXiv:2401.08893
6
citations
#2240

InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization

Zhengyang Hu, Song Kang, Qunsong Zeng et al.

ICML 2024arXiv:2402.10158
6
citations
#2241

Equilibrium of Data Markets with Externality

Safwan Hossain, Yiling Chen

ICML 2024arXiv:2302.08012
6
citations
#2242

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Wenhao SUN, Rong-Cheng Tu, Jingyi Liao et al.

ICML 2025arXiv:2412.11706
6
citations
#2243

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

Yifu Yuan, Zhenrui Zheng, Zibin Dong et al.

ICML 2025arXiv:2408.15501
6
citations
#2244

Elucidating the design space of language models for image generation

Xuantong Liu, Shaozhe Hao, Xianbiao Qi et al.

ICML 2025arXiv:2410.16257
6
citations
#2245

Scaling Probabilistic Circuits via Monarch Matrices

Honghua Zhang, Meihua Dang, Benjie Wang et al.

ICML 2025arXiv:2506.12383
6
citations
#2246

Efficient Multi-modal Long Context Learning for Training-free Adaptation

Zehong Ma, Shiliang Zhang, Longhui Wei et al.

ICML 2025arXiv:2505.19812
6
citations
#2247

EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations

Haotian Zhai, Connor Lawless, Ellen Vitercik et al.

ICML 2025arXiv:2502.14760
6
citations
#2248

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025oralarXiv:2502.20260
6
citations
#2249

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025arXiv:2506.05615
6
citations
#2250

Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic Graphs

MoonJeong Park, Jaeseung Heo, Dongwoo Kim

ICML 2024arXiv:2403.10543
6
citations
#2251

RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding

Guanzheng Chen, Qilong Feng, Jinjie Ni et al.

ICML 2025spotlightarXiv:2502.20330
6
citations
#2252

Geometric Representation Condition Improves Equivariant Molecule Generation

Zian Li, Cai Zhou, Xiyuan Wang et al.

ICML 2025spotlightarXiv:2410.03655
6
citations
#2253

Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition

Michael Valancius, Maxwell Lennon, Junier Oliva

ICML 2024arXiv:2302.13960
6
citations
#2254

Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

Neta Shaul, Uriel Singer, Ricky T. Q. Chen et al.

ICML 2024arXiv:2403.01329
6
citations
#2255

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

Charles O'Neill, Alim Gumran, David Klindt

ICML 2025arXiv:2411.13117
6
citations
#2256

EditLord: Learning Code Transformation Rules for Code Editing

Weichen Li, Albert Jan, Baishakhi Ray et al.

ICML 2025arXiv:2504.15284
6
citations
#2257

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

Brian Chen, Tianyang Hu, Hui Jin et al.

ICML 2024arXiv:2406.02847
6
citations
#2258

Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation

Randall Balestriero, Romain Cosentino, Sarath Shekkizhar

ICML 2024arXiv:2312.01648
6
citations
#2259

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Alessandro Montenegro, Marco Mussi, Alberto Maria Metelli et al.

ICML 2024spotlightarXiv:2405.02235
6
citations
#2260

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025spotlightarXiv:2507.01204
6
citations
#2261

Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming

Hany Hamed, Subin Kim, Dongyeong Kim et al.

ICML 2024arXiv:2402.18866
6
citations
#2262

An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN Architectures

Thibaut Boissin, Franck Mamalet, Thomas Fel et al.

ICML 2025arXiv:2501.07930
6
citations
#2263

AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios

Zhongzhan Huang, Mingfu Liang, Shanshan Zhong et al.

ICML 2024arXiv:2302.10184
6
citations
#2264

Collapse-Proof Non-Contrastive Self-Supervised Learning

EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars

ICML 2025arXiv:2410.04959
6
citations
#2265

Mixture of Lookup Experts

Shibo Jie, Yehui Tang, Kai Han et al.

ICML 2025oralarXiv:2503.15798
6
citations
#2266

Towards Learning to Complete Anything in Lidar

Ayça Takmaz, Cristiano Saltori, Neehar Peri et al.

ICML 2025oralarXiv:2504.12264
6
citations
#2267

Error Feedback Can Accurately Compress Preconditioners

Ionut-Vlad Modoranu, Aleksei Kalinov, Eldar Kurtic et al.

ICML 2024arXiv:2306.06098
6
citations
#2268

Optimal Eye Surgeon: Finding image priors through sparse generators at initialization

Avrajit Ghosh, Xitong Zhang, Kenneth Sun et al.

ICML 2024arXiv:2406.05288
6
citations
#2269

Mitigating Heterogeneous Token Overfitting in LLM Knowledge Editing

Tianci Liu, Ruirui Li, Zihan Dong et al.

ICML 2025arXiv:2502.00602
6
citations
#2270

Exploring the Benefit of Activation Sparsity in Pre-training

Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin et al.

ICML 2024arXiv:2410.03440
6
citations
#2271

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Yilun Kong, Guozheng Ma, Qi Zhao et al.

ICML 2025arXiv:2505.24378
6
citations
#2272

Memorization Sinks: Isolating Memorization during LLM Training

Gaurav Ghosal, Pratyush Maini, Aditi Raghunathan

ICML 2025arXiv:2507.09937
6
citations
#2273

Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping

Muru Zhang, Mayank Mishra, Zhongzhu Zhou et al.

ICML 2025arXiv:2501.06589
6
citations
#2274

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025arXiv:2507.05502
6
citations
#2275

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025arXiv:2503.04429
6
citations
#2276

Stochastic Weakly Convex Optimization beyond Lipschitz Continuity

Wenzhi Gao, Qi Deng

ICML 2024arXiv:2401.13971
6
citations
#2277

Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

Chen Zhang, Qiang HE, Yuan Zhou et al.

ICML 2024arXiv:2406.01103
6
citations
#2278

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025oralarXiv:2501.19328
6
citations
#2279

Persistent Topological Features in Large Language Models

Yuri Gardinazzi, Karthik Viswanathan, Giada Panerai et al.

ICML 2025arXiv:2410.11042
6
citations
#2280

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICML 2025arXiv:2502.16075
6
citations
#2281

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025arXiv:2410.08067
6
citations
#2282

Robust Conformal Outlier Detection under Contaminated Reference Data

Meshi Bashari, Matteo Sesia, Yaniv Romano

ICML 2025arXiv:2502.04807
6
citations
#2283

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

Qi Zhang, Tianqi Du, Haotian Huang et al.

ICML 2024arXiv:2407.00935
6
citations
#2284

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.

ICML 2025arXiv:2505.17552
6
citations
#2285

Unifying Specialized Visual Encoders for Video Language Models

Jihoon Chung, Tyler Zhu, Max Gonzalez Saez-Diez et al.

ICML 2025oralarXiv:2501.01426
6
citations
#2286

Keep the Momentum: Conservation Laws beyond Euclidean Gradient Flows

Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré

ICML 2024oralarXiv:2405.12888
6
citations
#2287

SADA: Stability-guided Adaptive Diffusion Acceleration

Ting Jiang, Yixiao Wang, Hancheng Ye et al.

ICML 2025arXiv:2507.17135
6
citations
#2288

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025arXiv:2411.09858
6
citations
#2289

Hessian Geometry of Latent Space in Generative Models

Alexander Lobashev, Dmitry Guskov, Maria Larchenko et al.

ICML 2025arXiv:2506.10632
6
citations
#2290

Neighboring Perturbations of Knowledge Editing on Large Language Models

Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang et al.

ICML 2024arXiv:2401.17623
6
citations
#2291

B-score: Detecting biases in large language models using response history

An Vo, Mohammad Reza Taesiri, Daeyoung Kim et al.

ICML 2025arXiv:2505.18545
6
citations
#2292

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025arXiv:2505.16441
6
citations
#2293

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025arXiv:2412.10827
6
citations
#2294

Generalization Bounds for Heavy-Tailed SDEs through the Fractional Fokker-Planck Equation

Benjamin Dupuis, Umut Simsekli

ICML 2024arXiv:2402.07723
6
citations
#2295

Neuro-Symbolic Temporal Point Processes

Yang Yang, Chao Yang, Boyang Li et al.

ICML 2024oralarXiv:2406.03914
6
citations
#2296

On the Benefits of Active Data Collection in Operator Learning

Unique Subedi, Ambuj Tewari

ICML 2025spotlightarXiv:2410.19725
6
citations
#2297

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Chung-En Sun, Sicun Gao, Lily Weng

ICML 2024arXiv:2406.18062
6
citations
#2298

To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models

Anna Hedström, Salim I. Amoukou, Tom Bewley et al.

ICML 2025arXiv:2510.13290
6
citations
#2299

Spectral Phase Transition and Optimal PCA in Block-Structured Spiked Models

Pierre Mergny, Justin Ko, FLORENT KRZAKALA

ICML 2024arXiv:2403.03695
6
citations
#2300

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025arXiv:2405.06575
6
citations
#2301

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025arXiv:2502.01954
6
citations
#2302

Supercharging Graph Transformers with Advective Diffusion

Qitian Wu, Chenxiao Yang, Kaipeng Zeng et al.

ICML 2025arXiv:2310.06417
6
citations
#2303

Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts

Lan Li, Da-Wei Zhou, Han-Jia Ye et al.

ICML 2025arXiv:2507.07100
6
citations
#2304

Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance

Lisha Chen, Quan Xiao, Ellen Fukuda et al.

ICML 2025spotlightarXiv:2504.02854
6
citations
#2305

Tuning LLM Judge Design Decisions for 1/1000 of the Cost

David Salinas, Omar Swelam, Frank Hutter

ICML 2025arXiv:2501.17178
6
citations
#2306

Impact of Decentralized Learning on Player Utilities in Stackelberg Games

Kate Donahue, Nicole Immorlica, Meena Jagadeesan et al.

ICML 2024arXiv:2403.00188
6
citations
#2307

Smoothing Proximal Gradient Methods for Nonsmooth Sparsity Constrained Optimization: Optimality Conditions and Global Convergence

Ganzhao Yuan

ICML 2024arXiv:2104.13782
6
citations
#2308

Exploring the LLM Journey from Cognition to Expression with Linear Representations

Yuzi Yan, Jialian Li, YipinZhang et al.

ICML 2024arXiv:2405.16964
6
citations
#2309

LLMs can see and hear without any training

Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen et al.

ICML 2025arXiv:2501.18096
6
citations
#2310

Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

Tinglin Huang, Tianyu Liu, Mehrtash Babadi et al.

ICML 2025spotlightarXiv:2506.05361
6
citations
#2311

Differential Coding for Training-Free ANN-to-SNN Conversion

Zihan Huang, Wei Fang, Tong Bu et al.

ICML 2025arXiv:2503.00301
6
citations
#2312

On the sample complexity of conditional independence testing with Von Mises estimator with application to causal discovery

Fateme Jamshidi, Luca Ganassali, Negar Kiyavash

ICML 2024arXiv:2310.13553
6
citations
#2313

Causal Effect Identification in LiNGAM Models with Latent Confounders

Daniele Tramontano, Yaroslav Kivva, Saber Salehkaleybar et al.

ICML 2024arXiv:2406.02049
6
citations
#2314

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

D. Sculley, William Cukierski, Phil Culliton et al.

ICML 2025oralarXiv:2505.00612
6
citations
#2315

A New Computationally Efficient Algorithm to solve Feature Selection for Functional Data Classification in High-dimensional Spaces

Tobia Boschi, FRANCESCA BONIN, Rodrigo Ordonez-Hurtado et al.

ICML 2024arXiv:2401.05765
6
citations
#2316

First-Order Manifold Data Augmentation for Regression Learning

Ilya Kaufman, Omri Azencot

ICML 2024arXiv:2406.10914
6
citations
#2317

A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models

Mengyang Sun, Yihao Wang, Tao Feng et al.

ICML 2025arXiv:2502.15828
6
citations
#2318

Local Causal Structure Learning in the Presence of Latent Variables

Feng Xie, Zheng Li, Peng Wu et al.

ICML 2024arXiv:2405.16225
6
citations
#2319

Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation

Zixuan Hu, Yichun Hu, Xiaotong Li et al.

ICML 2025arXiv:2505.20704
6
citations
#2320

Flexible Tails for Normalizing Flows

Tennessee Hickling, Dennis Prangle

ICML 2025arXiv:2406.16971
6
citations
#2321

Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments

Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou et al.

ICML 2024arXiv:2401.05946
6
citations
#2322

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025arXiv:2501.12053
6
citations
#2323

How to Explore with Belief: State Entropy Maximization in POMDPs

Riccardo Zamboni, Duilio Cirino, Marcello Restelli et al.

ICML 2024arXiv:2406.02295
6
citations
#2324

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Dachuan Shi, Yonggan Fu, Xiangchi Yuan et al.

ICML 2025arXiv:2507.14204
6
citations
#2325

Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov et al.

ICML 2025arXiv:2503.06337
6
citations
#2326

KIND: Knowledge Integration and Diversion for Training Decomposable Models

Yucheng Xie, Fu Feng, Ruixiao Shi et al.

ICML 2025arXiv:2408.07337
6
citations
#2327

Selective Response Strategies for GenAI

Boaz Taitler, Omer Ben-Porat

ICML 2025arXiv:2502.00729
6
citations
#2328

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025arXiv:2502.03032
6
citations
#2329

When Bad Data Leads to Good Models

Kenneth Li, Yida Chen, Fernanda Viégas et al.

ICML 2025arXiv:2505.04741
6
citations
#2330

Dynamic Correlation Clustering in Sublinear Update Time

Vincent Cohen-Addad, Silvio Lattanzi, Andreas Maggiori et al.

ICML 2024spotlightarXiv:2406.09137
6
citations
#2331

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147
6
citations
#2332

Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

Runqi Lin, Chaojian Yu, Bo Han et al.

ICML 2024arXiv:2405.16262
6
citations
#2333

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025arXiv:2506.04870
6
citations
#2334

Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

Ke Zhu, Shu Yang, Xiaofei Wang

ICML 2025arXiv:2410.11713
6
citations
#2335

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025arXiv:2502.00874
6
citations
#2336

Improving Rationality in the Reasoning Process of Language Models through Self-playing Game

Pinzheng Wang, Juntao Li, Zecheng Tang et al.

ICML 2025arXiv:2506.22920
6
citations
#2337

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.

ICML 2025arXiv:2502.01330
6
citations
#2338

Position: AI/ML Influencers Have a Place in the Academic Process

Iain Xie Weissburg, Mehir Arora, Xinyi Wang et al.

ICML 2024arXiv:2401.13782
6
citations
#2339

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark

Bingchen Miao, Yang Wu, Minghe Gao et al.

ICML 2025arXiv:2503.18665
6
citations
#2340

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025arXiv:2410.08194
6
citations
#2341

LLM-Augmented Chemical Synthesis and Design Decision Programs

Haorui Wang, Jeff Guo, Lingkai Kong et al.

ICML 2025arXiv:2505.07027
6
citations
#2342

LLMScan: Causal Scan for LLM Misbehavior Detection

Mengdi Zhang, Goh Kiat, Peixin Zhang et al.

ICML 2025arXiv:2410.16638
6
citations
#2343

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025arXiv:2506.03470
6
citations
#2344

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025arXiv:2502.04079
6
citations
#2345

Provably Better Explanations with Optimized Aggregation of Feature Attributions

Thomas Decker, Ananta Bhattarai, Jindong Gu et al.

ICML 2024arXiv:2406.05090
6
citations
#2346

CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features

xiaokun Feng, Dailing Zhang, Shiyu Hu et al.

ICML 2025oralarXiv:2505.19434
6
citations
#2347

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2410.22316
6
citations
#2348

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025arXiv:2505.02118
6
citations
#2349

Don't be so Negative! Score-based Generative Modeling with Oracle-assisted Guidance

Saeid Naderiparizi, Xiaoxuan Liang, Setareh Cohan et al.

ICML 2024arXiv:2307.16463
6
citations
#2350

Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization

Simone Bombari, Marco Mondelli

ICML 2025arXiv:2502.01347
6
citations
#2351

Probabilistic Subgoal Representations for Hierarchical Reinforcement Learning

Vivienne Wang, Tinghuai Wang, wenyan yang et al.

ICML 2024arXiv:2406.16707
6
citations
#2352

Zero-shot Meta-learning for Tabular Prediction Tasks with Adversarially Pre-trained Transformer

Yulun Wu, Doron Bergman

ICML 2025arXiv:2502.04573
6
citations
#2353

Optimization without Retraction on the Random Generalized Stiefel Manifold

Simon Vary, Pierre Ablin, Bin Gao et al.

ICML 2024arXiv:2405.01702
6
citations
#2354

Representing Molecules as Random Walks Over Interpretable Grammars

Michael Sun, Minghao Guo, Weize Yuan et al.

ICML 2024spotlightarXiv:2403.08147
6
citations
#2355

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025oralarXiv:2502.05164
6
citations
#2356

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025oralarXiv:2410.05026
6
citations
#2357

Exploiting Curvature in Online Convex Optimization with Delayed Feedback

Hao Qiu, Emmanuel Esposito, Mengxiao Zhang

ICML 2025arXiv:2506.07595
6
citations
#2358

Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization

Taeyoung Yun, Kiyoung Om, Jaewoo Lee et al.

ICML 2025arXiv:2502.16824
6
citations
#2359

Differentially Private Worst-group Risk Minimization

Xinyu Zhou, Raef Bassily

ICML 2024arXiv:2402.19437
6
citations
#2360

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Sen Xing, Muyan Zhong, Zeqiang Lai et al.

ICML 2025arXiv:2412.01271
6
citations
#2361

Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold

Tingting Dan, Ziquan Wei, Won Hwa Kim et al.

ICML 2024arXiv:2405.16357
6
citations
#2362

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025arXiv:2502.01342
6
citations
#2363

Stochastic positional embeddings improve masked image modeling

Amir Bar, Florian Bordes, Assaf Shocher et al.

ICML 2024arXiv:2308.00566
6
citations
#2364

Data-Efficient Molecular Generation with Hierarchical Textual Inversion

Seojin Kim, Jaehyun Nam, Sihyun Yu et al.

ICML 2024arXiv:2405.02845
6
citations
#2365

BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference

Wonsuk Jang, Thierry Tambe

ICML 2025arXiv:2501.01144
6
citations
#2366

Optimal Transport for Structure Learning Under Missing Data

Vy Vo, He Zhao, Trung Le et al.

ICML 2024arXiv:2402.15255
6
citations
#2367

A New Branch-and-Bound Pruning Framework for $\ell_0$-Regularized Problems

Guyard Theo, Cédric Herzet, Clément Elvira et al.

ICML 2024arXiv:2406.03504
6
citations
#2368

Faster Maximum Inner Product Search in High Dimensions

Mo Tiwari, Ryan Kang, Jaeyong Lee et al.

ICML 2024arXiv:2212.07551
6
citations
#2369

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.

ICML 2025
6
citations
#2370

ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans

Ashkan Shahbazi, Elaheh Akbari, Darian Salehi et al.

ICML 2025arXiv:2502.07962
6
citations
#2371

A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs

Lars Veefkind, Gabriele Cesa

ICML 2024arXiv:2406.03946
6
citations
#2372

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

Haotian Sun, Yuchen Zhuang, Wei Wei et al.

ICML 2024spotlightarXiv:2402.08219
6
citations
#2373

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models

Susan Liang, Dejan Markovic, Israel D. Gebru et al.

ICML 2025arXiv:2505.22865
6
citations
#2374

Autonomy-of-Experts Models

Ang Lv, Ruobing Xie, Yining Qian et al.

ICML 2025arXiv:2501.13074
6
citations
#2375

Synthetic Text Generation for Training Large Language Models via Gradient Matching

Dang Nguyen, Zeman Li, MohammadHossein Bateni et al.

ICML 2025arXiv:2502.17607
6
citations
#2376

Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random Features

Simone Bombari, Marco Mondelli

ICML 2024arXiv:2402.02969
6
citations
#2377

Winner-takes-all learners are geometry-aware conditional density estimators

Victor Letzelter, David Perera, Cédric Rommel et al.

ICML 2024arXiv:2406.04706
6
citations
#2378

TruthFlow: Truthful LLM Generation via Representation Flow Correction

Hanyu Wang, Bochuan Cao, Yuanpu Cao et al.

ICML 2025arXiv:2502.04556
6
citations
#2379

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Lukas Fluri, Leon Lang, Alessandro Abate et al.

ICML 2025arXiv:2406.15753
6
citations
#2380

Dynamic Survival Analysis with Controlled Latent States

Linus Bleistein, Van NGUYEN, Adeline Fermanian et al.

ICML 2024arXiv:2401.17077
6
citations
#2381

ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data

Carmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein et al.

ICML 2024oralarXiv:2402.01393
6
citations
#2382

Residual Matrix Transformers: Scaling the Size of the Residual Stream

Brian Mak, Jeffrey Flanigan

ICML 2025arXiv:2506.22696
6
citations
#2383

ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance

Liwen Sun, Abhineet Agarwal, Aaron Kornblith et al.

ICML 2024arXiv:2402.13448
6
citations
#2384

SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression

Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi

ICML 2025arXiv:2410.09615
6
citations
#2385

FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain

Rohan Deb, Kiran Thekumparampil, Kousha Kalantari et al.

ICML 2025arXiv:2505.14826
6
citations
#2386

Revisiting the Predictability of Performative, Social Events

Juan Perdomo

ICML 2025arXiv:2503.11713
6
citations
#2387

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025arXiv:2505.01874
6
citations
#2388

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025arXiv:2412.11044
6
citations
#2389

Algorithms with Calibrated Machine Learning Predictions

Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

ICML 2025spotlightarXiv:2502.02861
6
citations
#2390

Spherical Rotation Dimension Reduction with Geometric Loss Functions

Hengrui Luo, Jeremy E. Purvis, Didong Li

ICML 2025arXiv:2204.10975
6
citations
#2391

IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics

Ekaterina Shumitskaya, Anastasia Antsiferova, Dmitriy Vatolin

ICML 2024oralarXiv:2403.05955
6
citations
#2392

Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks

Yixin Cheng, Hongcheng Guo, Yangming Li et al.

ICML 2025arXiv:2505.05190
6
citations
#2393

A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM)

Dehao Yuan, Cornelia Fermuller, Tahseen Rabbani et al.

ICML 2024arXiv:2404.01568
6
citations
#2394

Exploring the Complexity of Deep Neural Networks through Functional Equivalence

Guohao Shen

ICML 2024arXiv:2305.11417
6
citations
#2395

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025arXiv:2411.18612
6
citations
#2396

On Convergence of Incremental Gradient for Non-convex Smooth Functions

Anastasiia Koloskova, Nikita Doikov, Sebastian Stich et al.

ICML 2024arXiv:2305.19259
6
citations
#2397

On the Power of Context-Enhanced Learning in LLMs

Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora

ICML 2025spotlightarXiv:2503.01821
6
citations
#2398

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning

Chen-Xiao Gao, Chenyang Wu, Mingjun Cao et al.

ICML 2025arXiv:2502.04778
6
citations
#2399

Provably Scalable Black-Box Variational Inference with Structured Variational Families

Joohwan Ko, Kyurae Kim, Woo Chang Kim et al.

ICML 2024arXiv:2401.10989
6
citations
#2400

RepLoRA: Reparameterizing Low-rank Adaptation via the Perspective of Mixture of Experts

Tuan Truong, Chau Nguyen, Huy Nguyen et al.

ICML 2025arXiv:2502.03044
6
citations