Most Cited ICML "conditional outcome invariance" Papers

5,975 papers found • Page 12 of 30

Filters:Most Cited ICML conditional outcome invariance Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2201

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.

ICML 2025oralarXiv:2506.00592

citations

#2202

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025spotlightarXiv:2507.07544

citations

#2203

Do Transformer World Models Give Better Policy Gradients?

Michel Ma, Tianwei Ni, Clement Gehring et al.

ICML 2024arXiv:2402.05290

citations

#2204

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

Haoyang Zheng, Hengrong Du, Qi Feng et al.

ICML 2024arXiv:2405.07839

citations

#2205

Unraveling the Interplay between Carryover Effects and Reward Autocorrelations in Switchback Experiments

Qianglin Wen, Chengchun Shi, Ying Yang et al.

ICML 2025arXiv:2403.17285

citations

#2206

Infinite-Horizon Distributionally Robust Regret-Optimal Control

Taylan Kargin, Joudi Hajar, Vikrant Malik et al.

ICML 2024arXiv:2406.07248

citations

#2207

EvoluNet: Advancing Dynamic Non-IID Transfer Learning on Graphs

Haohui Wang, Yuzhen Mao, Yujun Yan et al.

ICML 2024oralarXiv:2305.00664

citations

#2208

Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive Sparsity

Erpai Luo, Xinran Wei, Lin Huang et al.

ICML 2025arXiv:2502.01171

citations

#2209

Scaling Laws for Upcycling Mixture-of-Experts Language Models

Seng Pei Liew, Takuya Kato, Sho Takase

ICML 2025arXiv:2502.03009

citations

#2210

EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time

Shengyao Lu, Bang Liu, Keith Mills et al.

ICML 2024arXiv:2405.01762

citations

#2211

Interacting Diffusion Processes for Event Sequence Forecasting

Mai Zeng, Florence Regol, Mark Coates

ICML 2024oralarXiv:2310.17800

citations

#2212

Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models

Ludwig Winkler, Lorenz Richter, Manfred Opper

ICML 2024arXiv:2405.03549

citations

#2213

Understanding Generalization in Quantum Machine Learning with Margins

TAK HUR, Daniel Kyungdeock Park

ICML 2025arXiv:2411.06919

citations

#2214

Position: The Causal Revolution Needs Scientific Pragmatism

Joshua Loftus

ICML 2024arXiv:2406.02275

citations

#2215

UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs

Xi Han, Fei Hou, Hong Qin

ICML 2024arXiv:2408.04846

citations

#2216

Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling

Xiang Hu, Zhihao Teng, Jun Zhao et al.

ICML 2025arXiv:2410.01651

citations

#2217

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning

Inwoo Hwang, Yunhyeok Kwak, Suhyung Choi et al.

ICML 2024arXiv:2406.03234

citations

#2218

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng et al.

ICML 2025arXiv:2410.09016

citations

#2219

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Rui Yang, Lin Song, Yicheng Xiao et al.

ICML 2025arXiv:2503.14694

citations

#2220

$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts

Guanjie Chen, Xinyu Zhao, Tianlong Chen et al.

ICML 2024arXiv:2406.11353

citations

#2221

What can large language models do for sustainable food?

Anna Thomas, Adam Yee, Andrew Mayne et al.

ICML 2025arXiv:2503.04734

citations

#2222

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025arXiv:2110.06257

citations

#2223

REG: Rectified Gradient Guidance for Conditional Diffusion Models

Zhengqi Gao, Kaiwen Zha, Tianyuan Zhang et al.

ICML 2025arXiv:2501.18865

citations

#2224

Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks

Akshay Kumar Jagadish, Julian Coda-Forno, Mirko Thalmann et al.

ICML 2024arXiv:2402.01821

citations

#2225

Minimizing $f$-Divergences by Interpolating Velocity Fields

Song Liu, Jiahao Yu, Jack Simons et al.

ICML 2024arXiv:2305.15577

citations

#2226

Extracting Training Data From Document-Based VQA Models

Francesco Pinto, Nathalie Rauschmayr, Florian Tramer et al.

ICML 2024arXiv:2407.08707

citations

#2227

On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization

Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.

ICML 2025arXiv:2506.05945

citations

#2228

TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision

Zhuo Chen, Jacob McCarran, Esteban Vizcaino et al.

ICML 2024arXiv:2404.10771

citations

#2229

COPAL: Continual Pruning in Large Language Generative Models

Srikanth Malla, Joon Hee Choi, Chiho Choi

ICML 2024arXiv:2405.02347

citations

#2230

Revisiting Character-level Adversarial Attacks for Language Models

Elias Abad Rocamora, Yongtao Wu, Fanghui Liu et al.

ICML 2024arXiv:2405.04346

citations

#2231

SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity

Tianshu Chu, Dachuan Xu, Wei Yao et al.

ICML 2024arXiv:2405.18777

citations

#2232

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025arXiv:2411.02083

citations

#2233

SPD: Sync-Point Drop for Efficient Tensor Parallelism of Large Language Models

Han-Byul Kim, Duc Hoang, Arnav Kundu et al.

ICML 2025arXiv:2502.20727

citations

#2234

Unmasking Vulnerabilities: Cardinality Sketches under Adaptive Inputs

Sara Ahmadian, Edith Cohen

ICML 2024arXiv:2405.17780

citations

#2235

Looking Beyond the Top-1: Transformers Determine Top Tokens in Order

Daria Lioubashevski, Tomer Schlank, Gabriel Stanovsky et al.

ICML 2025arXiv:2410.20210

citations

#2236

Goal-Space Planning with Subgoal Models

Chunlok Lo, Kevin Roice, Parham Mohammad Panahi et al.

ICML 2025oralarXiv:2206.02902

citations

#2237

Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints

Yunsheng Tian, Ane Zuniga, Xinwei Zhang et al.

ICML 2024arXiv:2402.07692

citations

#2238

Cooperation of Experts: Fusing Heterogeneous Information with Large Margin

Shuo Wang, Shunyang Huang, Jinghui Yuan et al.

ICML 2025arXiv:2505.20853

citations

#2239

MADA: Meta-Adaptive Optimizers Through Hyper-Gradient Descent

Kaan Ozkara, Can Karakus, Parameswaran Raman et al.

ICML 2024arXiv:2401.08893

citations

#2240

InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization

Zhengyang Hu, Song Kang, Qunsong Zeng et al.

ICML 2024arXiv:2402.10158

citations

#2241

Equilibrium of Data Markets with Externality

Safwan Hossain, Yiling Chen

ICML 2024arXiv:2302.08012

citations

#2242

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Wenhao SUN, Rong-Cheng Tu, Jingyi Liao et al.

ICML 2025arXiv:2412.11706

citations

#2243

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

Yifu Yuan, Zhenrui Zheng, Zibin Dong et al.

ICML 2025arXiv:2408.15501

citations

#2244

Elucidating the design space of language models for image generation

Xuantong Liu, Shaozhe Hao, Xianbiao Qi et al.

ICML 2025arXiv:2410.16257

citations

#2245

Scaling Probabilistic Circuits via Monarch Matrices

Honghua Zhang, Meihua Dang, Benjie Wang et al.

ICML 2025arXiv:2506.12383

citations

#2246

Efficient Multi-modal Long Context Learning for Training-free Adaptation

Zehong Ma, Shiliang Zhang, Longhui Wei et al.

ICML 2025arXiv:2505.19812

citations

#2247

EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations

Haotian Zhai, Connor Lawless, Ellen Vitercik et al.

ICML 2025arXiv:2502.14760

citations

#2248

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025oralarXiv:2502.20260

citations

#2249

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025arXiv:2506.05615

citations

#2250

Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic Graphs

MoonJeong Park, Jaeseung Heo, Dongwoo Kim

ICML 2024arXiv:2403.10543

citations

#2251

RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding

Guanzheng Chen, Qilong Feng, Jinjie Ni et al.

ICML 2025spotlightarXiv:2502.20330

citations

#2252

Geometric Representation Condition Improves Equivariant Molecule Generation

Zian Li, Cai Zhou, Xiyuan Wang et al.

ICML 2025spotlightarXiv:2410.03655

citations

#2253

Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition

Michael Valancius, Maxwell Lennon, Junier Oliva

ICML 2024arXiv:2302.13960

citations

#2254

Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

Neta Shaul, Uriel Singer, Ricky T. Q. Chen et al.

ICML 2024arXiv:2403.01329

citations

#2255

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

Charles O'Neill, Alim Gumran, David Klindt

ICML 2025arXiv:2411.13117

citations

#2256

EditLord: Learning Code Transformation Rules for Code Editing

Weichen Li, Albert Jan, Baishakhi Ray et al.

ICML 2025arXiv:2504.15284

citations

#2257

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

Brian Chen, Tianyang Hu, Hui Jin et al.

ICML 2024arXiv:2406.02847

citations

#2258

Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation

Randall Balestriero, Romain Cosentino, Sarath Shekkizhar

ICML 2024arXiv:2312.01648

citations

#2259

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Alessandro Montenegro, Marco Mussi, Alberto Maria Metelli et al.

ICML 2024spotlightarXiv:2405.02235

citations

#2260

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025spotlightarXiv:2507.01204

citations

#2261

Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming

Hany Hamed, Subin Kim, Dongyeong Kim et al.

ICML 2024arXiv:2402.18866

citations

#2262

An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN Architectures

Thibaut Boissin, Franck Mamalet, Thomas Fel et al.

ICML 2025arXiv:2501.07930

citations

#2263

AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios

Zhongzhan Huang, Mingfu Liang, Shanshan Zhong et al.

ICML 2024arXiv:2302.10184

citations

#2264

Collapse-Proof Non-Contrastive Self-Supervised Learning

EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars

ICML 2025arXiv:2410.04959

citations

#2265

Mixture of Lookup Experts

Shibo Jie, Yehui Tang, Kai Han et al.

ICML 2025oralarXiv:2503.15798

citations

#2266

Towards Learning to Complete Anything in Lidar

Ayça Takmaz, Cristiano Saltori, Neehar Peri et al.

ICML 2025oralarXiv:2504.12264

citations

#2267

Error Feedback Can Accurately Compress Preconditioners

Ionut-Vlad Modoranu, Aleksei Kalinov, Eldar Kurtic et al.

ICML 2024arXiv:2306.06098

citations

#2268

Optimal Eye Surgeon: Finding image priors through sparse generators at initialization

Avrajit Ghosh, Xitong Zhang, Kenneth Sun et al.

ICML 2024arXiv:2406.05288

citations

#2269

Mitigating Heterogeneous Token Overfitting in LLM Knowledge Editing

Tianci Liu, Ruirui Li, Zihan Dong et al.

ICML 2025arXiv:2502.00602

citations

#2270

Exploring the Benefit of Activation Sparsity in Pre-training

Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin et al.

ICML 2024arXiv:2410.03440

citations

#2271

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Yilun Kong, Guozheng Ma, Qi Zhao et al.

ICML 2025arXiv:2505.24378

citations

#2272

Memorization Sinks: Isolating Memorization during LLM Training

Gaurav Ghosal, Pratyush Maini, Aditi Raghunathan

ICML 2025arXiv:2507.09937

citations

#2273

Ladder-Residual: Parallelism-Aware Architecture for Accelerating Large Model Inference with Communication Overlapping

Muru Zhang, Mayank Mishra, Zhongzhu Zhou et al.

ICML 2025arXiv:2501.06589

citations

#2274

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025arXiv:2507.05502

citations

#2275

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025arXiv:2503.04429

citations

#2276

Stochastic Weakly Convex Optimization beyond Lipschitz Continuity

Wenzhi Gao, Qi Deng

ICML 2024arXiv:2401.13971

citations

#2277

Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

Chen Zhang, Qiang HE, Yuan Zhou et al.

ICML 2024arXiv:2406.01103

citations

#2278

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025oralarXiv:2501.19328

citations

#2279

Persistent Topological Features in Large Language Models

Yuri Gardinazzi, Karthik Viswanathan, Giada Panerai et al.

ICML 2025arXiv:2410.11042

citations

#2280

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICML 2025arXiv:2502.16075

citations

#2281

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025arXiv:2410.08067

citations

#2282

Robust Conformal Outlier Detection under Contaminated Reference Data

Meshi Bashari, Matteo Sesia, Yaniv Romano

ICML 2025arXiv:2502.04807

citations

#2283

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

Qi Zhang, Tianqi Du, Haotian Huang et al.

ICML 2024arXiv:2407.00935

citations

#2284

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.

ICML 2025arXiv:2505.17552

citations

#2285

Unifying Specialized Visual Encoders for Video Language Models

Jihoon Chung, Tyler Zhu, Max Gonzalez Saez-Diez et al.

ICML 2025oralarXiv:2501.01426

citations

#2286

Keep the Momentum: Conservation Laws beyond Euclidean Gradient Flows

Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré

ICML 2024oralarXiv:2405.12888

citations

#2287

SADA: Stability-guided Adaptive Diffusion Acceleration

Ting Jiang, Yixiao Wang, Hancheng Ye et al.

ICML 2025arXiv:2507.17135

citations

#2288

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025arXiv:2411.09858

citations

#2289

Hessian Geometry of Latent Space in Generative Models

Alexander Lobashev, Dmitry Guskov, Maria Larchenko et al.

ICML 2025arXiv:2506.10632

citations

#2290

Neighboring Perturbations of Knowledge Editing on Large Language Models

Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang et al.

ICML 2024arXiv:2401.17623

citations

#2291

B-score: Detecting biases in large language models using response history

An Vo, Mohammad Reza Taesiri, Daeyoung Kim et al.

ICML 2025arXiv:2505.18545

citations

#2292

Ranked Entropy Minimization for Continual Test-Time Adaptation

Jisu Han, Jaemin Na, Wonjun Hwang

ICML 2025arXiv:2505.16441

citations

#2293

Rethinking Chain-of-Thought from the Perspective of Self-Training

Zongqian Wu, Baoduo Xu, Ruochen Cui et al.

ICML 2025arXiv:2412.10827

citations

#2294

Generalization Bounds for Heavy-Tailed SDEs through the Fractional Fokker-Planck Equation

Benjamin Dupuis, Umut Simsekli

ICML 2024arXiv:2402.07723

citations

#2295

Neuro-Symbolic Temporal Point Processes

Yang Yang, Chao Yang, Boyang Li et al.

ICML 2024oralarXiv:2406.03914

citations

#2296

On the Benefits of Active Data Collection in Operator Learning

Unique Subedi, Ambuj Tewari

ICML 2025spotlightarXiv:2410.19725

citations

#2297

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Chung-En Sun, Sicun Gao, Lily Weng

ICML 2024arXiv:2406.18062

citations

#2298

To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models

Anna Hedström, Salim I. Amoukou, Tom Bewley et al.

ICML 2025arXiv:2510.13290

citations

#2299

Spectral Phase Transition and Optimal PCA in Block-Structured Spiked Models

Pierre Mergny, Justin Ko, FLORENT KRZAKALA

ICML 2024arXiv:2403.03695

citations

#2300

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025arXiv:2405.06575

citations

#2301

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025arXiv:2502.01954

citations

#2302

Supercharging Graph Transformers with Advective Diffusion

Qitian Wu, Chenxiao Yang, Kaipeng Zeng et al.

ICML 2025arXiv:2310.06417

citations

#2303

Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts

Lan Li, Da-Wei Zhou, Han-Jia Ye et al.

ICML 2025arXiv:2507.07100

citations

#2304

Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance

Lisha Chen, Quan Xiao, Ellen Fukuda et al.

ICML 2025spotlightarXiv:2504.02854

citations

#2305

Tuning LLM Judge Design Decisions for 1/1000 of the Cost

David Salinas, Omar Swelam, Frank Hutter

ICML 2025arXiv:2501.17178

citations

#2306

Impact of Decentralized Learning on Player Utilities in Stackelberg Games

Kate Donahue, Nicole Immorlica, Meena Jagadeesan et al.

ICML 2024arXiv:2403.00188

citations

#2307

Smoothing Proximal Gradient Methods for Nonsmooth Sparsity Constrained Optimization: Optimality Conditions and Global Convergence

Ganzhao Yuan

ICML 2024arXiv:2104.13782

citations

#2308

Exploring the LLM Journey from Cognition to Expression with Linear Representations

Yuzi Yan, Jialian Li, YipinZhang et al.

ICML 2024arXiv:2405.16964

citations

#2309

LLMs can see and hear without any training

Kumar Ashutosh, Yossi Gandelsman, Xinlei Chen et al.

ICML 2025arXiv:2501.18096

citations

#2310

Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

Tinglin Huang, Tianyu Liu, Mehrtash Babadi et al.

ICML 2025spotlightarXiv:2506.05361

citations

#2311

Differential Coding for Training-Free ANN-to-SNN Conversion

Zihan Huang, Wei Fang, Tong Bu et al.

ICML 2025arXiv:2503.00301

citations

#2312

On the sample complexity of conditional independence testing with Von Mises estimator with application to causal discovery

Fateme Jamshidi, Luca Ganassali, Negar Kiyavash

ICML 2024arXiv:2310.13553

citations

#2313

Causal Effect Identification in LiNGAM Models with Latent Confounders

Daniele Tramontano, Yaroslav Kivva, Saber Salehkaleybar et al.

ICML 2024arXiv:2406.02049

citations

#2314

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

D. Sculley, William Cukierski, Phil Culliton et al.

ICML 2025oralarXiv:2505.00612

citations

#2315

A New Computationally Efficient Algorithm to solve Feature Selection for Functional Data Classification in High-dimensional Spaces

Tobia Boschi, FRANCESCA BONIN, Rodrigo Ordonez-Hurtado et al.

ICML 2024arXiv:2401.05765

citations

#2316

First-Order Manifold Data Augmentation for Regression Learning

Ilya Kaufman, Omri Azencot

ICML 2024arXiv:2406.10914

citations

#2317

A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models

Mengyang Sun, Yihao Wang, Tao Feng et al.

ICML 2025arXiv:2502.15828

citations

#2318

Local Causal Structure Learning in the Presence of Latent Variables

Feng Xie, Zheng Li, Peng Wu et al.

ICML 2024arXiv:2405.16225

citations

#2319

Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation

Zixuan Hu, Yichun Hu, Xiaotong Li et al.

ICML 2025arXiv:2505.20704

citations

#2320

Flexible Tails for Normalizing Flows

Tennessee Hickling, Dennis Prangle

ICML 2025arXiv:2406.16971

citations

#2321

Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments

Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou et al.

ICML 2024arXiv:2401.05946

citations

#2322

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025arXiv:2501.12053

citations

#2323

How to Explore with Belief: State Entropy Maximization in POMDPs

Riccardo Zamboni, Duilio Cirino, Marcello Restelli et al.

ICML 2024arXiv:2406.02295

citations

#2324

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Dachuan Shi, Yonggan Fu, Xiangchi Yuan et al.

ICML 2025arXiv:2507.14204

citations

#2325

Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov et al.

ICML 2025arXiv:2503.06337

citations

#2326

KIND: Knowledge Integration and Diversion for Training Decomposable Models

Yucheng Xie, Fu Feng, Ruixiao Shi et al.

ICML 2025arXiv:2408.07337

citations

#2327

Selective Response Strategies for GenAI

Boaz Taitler, Omer Ben-Porat

ICML 2025arXiv:2502.00729

citations

#2328

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025arXiv:2502.03032

citations

#2329

When Bad Data Leads to Good Models

Kenneth Li, Yida Chen, Fernanda Viégas et al.

ICML 2025arXiv:2505.04741

citations

#2330

Dynamic Correlation Clustering in Sublinear Update Time

Vincent Cohen-Addad, Silvio Lattanzi, Andreas Maggiori et al.

ICML 2024spotlightarXiv:2406.09137

citations

#2331

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147

citations

#2332

Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

Runqi Lin, Chaojian Yu, Bo Han et al.

ICML 2024arXiv:2405.16262

citations

#2333

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025arXiv:2506.04870

citations

#2334

Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing

Ke Zhu, Shu Yang, Xiaofei Wang

ICML 2025arXiv:2410.11713

citations

#2335

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025arXiv:2502.00874

citations

#2336

Improving Rationality in the Reasoning Process of Language Models through Self-playing Game

Pinzheng Wang, Juntao Li, Zecheng Tang et al.

ICML 2025arXiv:2506.22920

citations

#2337

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.

ICML 2025arXiv:2502.01330

citations

#2338

Position: AI/ML Influencers Have a Place in the Academic Process

Iain Xie Weissburg, Mehir Arora, Xinyi Wang et al.

ICML 2024arXiv:2401.13782

citations

#2339

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark

Bingchen Miao, Yang Wu, Minghe Gao et al.

ICML 2025arXiv:2503.18665

citations

#2340

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025arXiv:2410.08194

citations

#2341

LLM-Augmented Chemical Synthesis and Design Decision Programs

Haorui Wang, Jeff Guo, Lingkai Kong et al.

ICML 2025arXiv:2505.07027

citations

#2342

LLMScan: Causal Scan for LLM Misbehavior Detection

Mengdi Zhang, Goh Kiat, Peixin Zhang et al.

ICML 2025arXiv:2410.16638

citations

#2343

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025arXiv:2506.03470

citations

#2344

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025arXiv:2502.04079

citations

#2345

Provably Better Explanations with Optimized Aggregation of Feature Attributions

Thomas Decker, Ananta Bhattarai, Jindong Gu et al.

ICML 2024arXiv:2406.05090

citations

#2346

CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features

xiaokun Feng, Dailing Zhang, Shiyu Hu et al.

ICML 2025oralarXiv:2505.19434

citations

#2347

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2410.22316

citations

#2348

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025arXiv:2505.02118

citations

#2349

Don't be so Negative! Score-based Generative Modeling with Oracle-assisted Guidance

Saeid Naderiparizi, Xiaoxuan Liang, Setareh Cohan et al.

ICML 2024arXiv:2307.16463

citations

#2350

Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization

Simone Bombari, Marco Mondelli

ICML 2025arXiv:2502.01347

citations

#2351

Probabilistic Subgoal Representations for Hierarchical Reinforcement Learning

Vivienne Wang, Tinghuai Wang, wenyan yang et al.

ICML 2024arXiv:2406.16707

citations

#2352

Zero-shot Meta-learning for Tabular Prediction Tasks with Adversarially Pre-trained Transformer

Yulun Wu, Doron Bergman

ICML 2025arXiv:2502.04573

citations

#2353

Optimization without Retraction on the Random Generalized Stiefel Manifold

Simon Vary, Pierre Ablin, Bin Gao et al.

ICML 2024arXiv:2405.01702

citations

#2354

Representing Molecules as Random Walks Over Interpretable Grammars

Michael Sun, Minghao Guo, Weize Yuan et al.

ICML 2024spotlightarXiv:2403.08147

citations

#2355

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025oralarXiv:2502.05164

citations

#2356

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025oralarXiv:2410.05026

citations

#2357

Exploiting Curvature in Online Convex Optimization with Delayed Feedback

Hao Qiu, Emmanuel Esposito, Mengxiao Zhang

ICML 2025arXiv:2506.07595

citations

#2358

Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization

Taeyoung Yun, Kiyoung Om, Jaewoo Lee et al.

ICML 2025arXiv:2502.16824

citations

#2359

Differentially Private Worst-group Risk Minimization

Xinyu Zhou, Raef Bassily

ICML 2024arXiv:2402.19437

citations

#2360

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Sen Xing, Muyan Zhong, Zeqiang Lai et al.

ICML 2025arXiv:2412.01271

citations

#2361

Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold

Tingting Dan, Ziquan Wei, Won Hwa Kim et al.

ICML 2024arXiv:2405.16357

citations

#2362

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025arXiv:2502.01342

citations

#2363

Stochastic positional embeddings improve masked image modeling

Amir Bar, Florian Bordes, Assaf Shocher et al.

ICML 2024arXiv:2308.00566

citations

#2364

Data-Efficient Molecular Generation with Hierarchical Textual Inversion

Seojin Kim, Jaehyun Nam, Sihyun Yu et al.

ICML 2024arXiv:2405.02845

citations

#2365

BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference

Wonsuk Jang, Thierry Tambe

ICML 2025arXiv:2501.01144

citations

#2366

Optimal Transport for Structure Learning Under Missing Data

Vy Vo, He Zhao, Trung Le et al.

ICML 2024arXiv:2402.15255

citations

#2367

A New Branch-and-Bound Pruning Framework for $\ell_0$-Regularized Problems

Guyard Theo, Cédric Herzet, Clément Elvira et al.

ICML 2024arXiv:2406.03504

citations

#2368

Faster Maximum Inner Product Search in High Dimensions

Mo Tiwari, Ryan Kang, Jaeyong Lee et al.

ICML 2024arXiv:2212.07551

citations

#2369

RZ-NAS: Enhancing LLM-guided Neural Architecture Search via Reflective Zero-Cost Strategy

Zipeng Ji, Guanghui Zhu, Chunfeng Yuan et al.

ICML 2025

citations

#2370

ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans

Ashkan Shahbazi, Elaheh Akbari, Darian Salehi et al.

ICML 2025arXiv:2502.07962

citations

#2371

A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs

Lars Veefkind, Gabriele Cesa

ICML 2024arXiv:2406.03946

citations

#2372

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

Haotian Sun, Yuchen Zhuang, Wei Wei et al.

ICML 2024spotlightarXiv:2402.08219

citations

#2373

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models

Susan Liang, Dejan Markovic, Israel D. Gebru et al.

ICML 2025arXiv:2505.22865

citations

#2374

Autonomy-of-Experts Models

Ang Lv, Ruobing Xie, Yining Qian et al.

ICML 2025arXiv:2501.13074

citations

#2375

Synthetic Text Generation for Training Large Language Models via Gradient Matching

Dang Nguyen, Zeman Li, MohammadHossein Bateni et al.

ICML 2025arXiv:2502.17607

citations

#2376

Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random Features

Simone Bombari, Marco Mondelli

ICML 2024arXiv:2402.02969

citations

#2377

Winner-takes-all learners are geometry-aware conditional density estimators

ICML 2024arXiv:2406.04706

citations

#2378

TruthFlow: Truthful LLM Generation via Representation Flow Correction

Hanyu Wang, Bochuan Cao, Yuanpu Cao et al.

ICML 2025arXiv:2502.04556

citations

#2379

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Lukas Fluri, Leon Lang, Alessandro Abate et al.

ICML 2025arXiv:2406.15753

citations

#2380

Dynamic Survival Analysis with Controlled Latent States

Linus Bleistein, Van NGUYEN, Adeline Fermanian et al.

ICML 2024arXiv:2401.17077

citations

#2381

ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data

Carmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein et al.

ICML 2024oralarXiv:2402.01393

citations

#2382

Residual Matrix Transformers: Scaling the Size of the Residual Stream

Brian Mak, Jeffrey Flanigan

ICML 2025arXiv:2506.22696

citations

#2383

ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance

Liwen Sun, Abhineet Agarwal, Aaron Kornblith et al.

ICML 2024arXiv:2402.13448

citations

#2384

SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression

Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi

ICML 2025arXiv:2410.09615

citations

#2385

FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain

Rohan Deb, Kiran Thekumparampil, Kousha Kalantari et al.

ICML 2025arXiv:2505.14826

citations

#2386

Revisiting the Predictability of Performative, Social Events

Juan Perdomo

ICML 2025arXiv:2503.11713

citations

#2387

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025arXiv:2505.01874

citations

#2388

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025arXiv:2412.11044

citations

#2389

Algorithms with Calibrated Machine Learning Predictions

Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

ICML 2025spotlightarXiv:2502.02861

citations

#2390

Spherical Rotation Dimension Reduction with Geometric Loss Functions

Hengrui Luo, Jeremy E. Purvis, Didong Li

ICML 2025arXiv:2204.10975

citations

#2391

IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics

Ekaterina Shumitskaya, Anastasia Antsiferova, Dmitriy Vatolin

ICML 2024oralarXiv:2403.05955

citations

#2392

Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks

Yixin Cheng, Hongcheng Guo, Yangming Li et al.

ICML 2025arXiv:2505.05190

citations

#2393

A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM)

Dehao Yuan, Cornelia Fermuller, Tahseen Rabbani et al.

ICML 2024arXiv:2404.01568

citations

#2394

Exploring the Complexity of Deep Neural Networks through Functional Equivalence

Guohao Shen

ICML 2024arXiv:2305.11417

citations

#2395

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025arXiv:2411.18612

citations

#2396

On Convergence of Incremental Gradient for Non-convex Smooth Functions

Anastasiia Koloskova, Nikita Doikov, Sebastian Stich et al.

ICML 2024arXiv:2305.19259

citations

#2397

On the Power of Context-Enhanced Learning in LLMs

Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora

ICML 2025spotlightarXiv:2503.01821

citations

#2398

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning

Chen-Xiao Gao, Chenyang Wu, Mingjun Cao et al.

ICML 2025arXiv:2502.04778

citations

#2399

Provably Scalable Black-Box Variational Inference with Structured Variational Families

Joohwan Ko, Kyurae Kim, Woo Chang Kim et al.

ICML 2024arXiv:2401.10989

citations

#2400

RepLoRA: Reparameterizing Low-rank Adaptation via the Perspective of Mixture of Experts

Tuan Truong, Chau Nguyen, Huy Nguyen et al.

ICML 2025arXiv:2502.03044

citations

← Previous

1...10 11 12 13 14...30