Most Cited ICML "global weight shrinking" Papers

5,975 papers found • Page 8 of 30

#1401

NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction

Qichao Wang, Ziqiao Meng, Wenqian Cui et al.

ICML 2025 • arXiv:2506.00975
12
citations
#1402

DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection

Zhi Zhou, Ming Yang, Jiang-Xin Shi et al.

ICML 2024 • arXiv:2406.00345
12
citations
#1403

Taming Knowledge Conflicts in Language Models

Gaotang Li, Yuzhong Chen, Hanghang Tong

ICML 2025 • spotlight • arXiv:2503.10996
12
citations
#1404

Unlocking the Capabilities of Large Vision-Language Models for Generalizable and Explainable Deepfake Detection

Peipeng Yu, Jianwei Fei, Hui Gao et al.

ICML 2025 • arXiv:2503.14853
12
citations
#1405

Conformalized Adaptive Forecasting of Heterogeneous Trajectories

Yanfei Zhou, Lars Lindemann, Matteo Sesia

ICML 2024 • arXiv:2402.09623
12
citations
#1406

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

KaShun SHUM, Yuzhen Huang, Hongjian Zou et al.

ICML 2025 • arXiv:2503.00808
12
citations
#1407

Conditioning Diffusions Using Malliavin Calculus

Jakiw Pidstrigach, Elizabeth Baker, Carles Domingo i Enrich et al.

ICML 2025 • arXiv:2504.03461
12
citations
#1408

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen, Xing Hu, Dawei Yang et al.

ICML 2025 • arXiv:2505.03804
12
citations
#1409

A Universal Class of Sharpness-Aware Minimization Algorithms

Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri et al.

ICML 2024 • arXiv:2406.03682
12
citations
#1410

Text-to-LoRA: Instant Transformer Adaption

Rujikorn Charakorn, Edoardo Cetin, Yujin Tang et al.

ICML 2025 • arXiv:2506.06105
12
citations
#1411

PANDA: Expanded Width-Aware Message Passing Beyond Rewiring

Jeongwhan Choi, Sumin Park, Hyowon Wi et al.

ICML 2024 • arXiv:2406.03671
12
citations
#1412

Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum

Tin Sum Cheng, Aurelien Lucchi, Anastasis Kratsios et al.

ICML 2024 • arXiv:2402.01297
12
citations
#1413

Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation

Juno Kim, Denny Wu, Jason Lee et al.

ICML 2025 • arXiv:2502.01694
12
citations
#1414

Minimum-Norm Interpolation Under Covariate Shift

Neil Mallinar, Austin Zane, Spencer Frei et al.

ICML 2024 • arXiv:2404.00522
12
citations
#1415

CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models

David Dai, Peilin Chen, Malinda Lu et al.

ICML 2025 • oral • arXiv:2503.07667
12
citations
#1416

Sparse is Enough in Fine-tuning Pre-trained Large Language Models

Weixi Song, Zuchao Li, Lefei Zhang et al.

ICML 2024 • spotlight • arXiv:2312.11875
12
citations
#1417

CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing

Yu Yuan, Shizhao Sun, Qi Liu et al.

ICML 2025 • arXiv:2502.03997
12
citations
#1418

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky et al.

ICML 2024 • arXiv:2407.05385
12
citations
#1419

Mixture of Experts Made Intrinsically Interpretable

Xingyi Yang, Constantin Venhoff, Ashkan Khakzar et al.

ICML 2025 • arXiv:2503.07639
12
citations
#1420

LightningDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos

Yujun Shi, Jun Hao Liew, Hanshu Yan et al.

ICML 2025 • arXiv:2405.13722
12
citations
#1421

WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs

Lukas Thede, Karsten Roth, Matthias Bethge et al.

ICML 2025 • arXiv:2503.05683
12
citations
#1422

No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets

Corinna Coupette, Jeremy Wayland, Emily Simons et al.

ICML 2025 • arXiv:2502.02379
12
citations
#1423

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024 • arXiv:2402.18361
12
citations
#1424

S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video

Hao Zhang, Fang Li, Samyak Rawlekar et al.

ICML 2024 • arXiv:2405.12607
12
citations
#1425

Dissecting Submission Limit in Desk-Rejections: A Mathematical Analysis of Fairness in AI Conference Policies

Yuefan Cao, Xiaoyu Li, Yingyu Liang et al.

ICML 2025 • arXiv:2502.00690
12
citations
#1426

Discounted Adaptive Online Learning: Towards Better Regularization

Zhiyu Zhang, David Bombara, Heng Yang

ICML 2024 • arXiv:2402.02720
12
citations
#1427

Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad?

Antonia Wüst, Tim Woydt, Lukas Helff et al.

ICML 2025 • arXiv:2410.19546
12
citations
#1428

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Chenghao Fan, Zhenyi Lu, Sichen Liu et al.

ICML 2025 • arXiv:2502.16894
12
citations
#1429

Robustly Learning Single-Index Models via Alignment Sharpness

Nikos Zarifis, Puqian Wang, Ilias Diakonikolas et al.

ICML 2024 • arXiv:2402.17756
12
citations
#1430

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Kevin Rojas, Yuchen Zhu, Sichen Zhu et al.

ICML 2025 • arXiv:2506.07903
12
citations
#1431

Exploiting Code Symmetries for Learning Program Semantics

Kexin Pei, Weichen Li, Qirui Jin et al.

ICML 2024 • spotlight • arXiv:2308.03312
12
citations
#1432

Momentum Particle Maximum Likelihood

Jen Ning Lim, Juan Kuntz, Samuel Power et al.

ICML 2024 • arXiv:2312.07335
12
citations
#1433

Online Algorithms with Uncertainty-Quantified Predictions

Bo Sun, Jerry Huang, Nicolas Christianson et al.

ICML 2024 • arXiv:2310.11558
12
citations
#1434

Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach

Changdae Oh, Zhen Fang, Shawn Im et al.

ICML 2025 • arXiv:2502.00577
12
citations
#1435

Sparse and Structured Hopfield Networks

Saúl Santos, Vlad Niculae, Daniel McNamee et al.

ICML 2024 • spotlight • arXiv:2402.13725
12
citations
#1436

InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

Jacob Si, Wendy Yusi Cheng, Michael Cooper et al.

ICML 2024 • spotlight • arXiv:2406.00426
12
citations
#1437

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.

ICML 2025 • arXiv:2502.19002
12
citations
#1438

Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models

Ruiyu Wang, Yu Yuan, Shizhao Sun et al.

ICML 2025 • arXiv:2501.19054
12
citations
#1439

OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift

Lin Li, Yifei Wang, Chawin Sitawarin et al.

ICML 2024 • arXiv:2310.12793
12
citations
#1440

FedBAT: Communication-Efficient Federated Learning via Learnable Binarization

Shiwei Li, Wenchao Xu, Haozhao Wang et al.

ICML 2024 • arXiv:2408.03215
12
citations
#1441

Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency

Sudeep Salgia, Sattar Vakili, Qing Zhao

ICML 2024 • arXiv:2310.15351
12
citations
#1442

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

Yik Siu Chan, Narutatsu Ri, Yuxin Xiao et al.

ICML 2025 • arXiv:2502.04322
12
citations
#1443

Compositional Image Decomposition with Diffusion Models

Jocelin Su, Nan Liu, Yanbo Wang et al.

ICML 2024 • arXiv:2406.19298
12
citations
#1444

Certified Unlearning for Neural Networks

Anastasiia Koloskova, Youssef Allouah, Animesh Jha et al.

ICML 2025 • arXiv:2506.06985
12
citations
#1445

Neural Collapse in Multi-label Learning with Pick-all-label Loss

Pengyu Li, Xiao Li, Yutong Wang et al.

ICML 2024 • arXiv:2310.15903
12
citations
#1446

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Aaditya Singh, Ted Moskovitz, Sara Dragutinović et al.

ICML 2025 • oral • arXiv:2503.05631
12
citations
#1447

Improving Token-Based World Models with Parallel Observation Prediction

Lior Cohen, Kaixin Wang, Bingyi Kang et al.

ICML 2024 • arXiv:2402.05643
12
citations
#1448

SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models

Jiawei Zhang, Xuan Yang, Taiqi Wang et al.

ICML 2025 • arXiv:2503.00211
12
citations
#1449

Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2025 • arXiv:2506.20650
12
citations
#1450

Emergent Equivariance in Deep Ensembles

Jan Gerken, Pan Kessel

ICML 2024 • arXiv:2403.03103
12
citations
#1451

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

Yangsibo Huang, Milad Nasr, Anastasios Angelopoulos et al.

ICML 2025 • oral • arXiv:2501.07493
12
citations
#1452

Online Linear Regression in Dynamic Environments via Discounting

Andrew Jacobsen, Ashok Cutkosky

ICML 2024 • arXiv:2405.19175
11
citations
#1453

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification

Hongwei Li, Yuheng Tang, Shiqi Wang et al.

ICML 2025 • arXiv:2502.02747
11
citations
#1454

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

Jianliang He, Siyu Chen, Fengzhuo Zhang et al.

ICML 2024 • arXiv:2405.19883
11
citations
#1455

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Akhiad Bercovich, Tomer Ronen, Talor Abramovich et al.

ICML 2025 • arXiv:2411.19146
11
citations
#1456

Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis

Juyeon Ko, Inho Kong, Dogyun Park et al.

ICML 2024 • arXiv:2402.16506
11
citations
#1457

CASE-Bench: Context-Aware SafEty Benchmark for Large Language Models

Guangzhi Sun, Xiao Zhan, Shutong Feng et al.

ICML 2025 • arXiv:2501.14940
11
citations
#1458

DiJiang: Efficient Large Language Models through Compact Kernelization

Hanting Chen, Zhicheng Liu, Xutao Wang et al.

ICML 2024 • arXiv:2403.19928
11
citations
#1459

Predictive Linear Online Tracking for Unknown Targets

Anastasios Tsiamis, Aren Karapetyan, Yueshan Li et al.

ICML 2024 • spotlight • arXiv:2402.10036
11
citations
#1460

Reflected Flow Matching

Tianyu Xie, Yu Zhu, Longlin Yu et al.

ICML 2024 • arXiv:2405.16577
11
citations
#1461

Smoothness Adaptive Hypothesis Transfer Learning

Haotian Lin, Matthew Reimherr

ICML 2024 • arXiv:2402.14966
11
citations
#1462

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Yongchao Chen, Yilun Hao, Yueying Liu et al.

ICML 2025 • arXiv:2502.04350
11
citations
#1463

LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces

Rashid Mushkani, Perampalli Shravan Nayak, Hugo Berard et al.

ICML 2025 • arXiv:2503.01894
11
citations
#1464

Hierarchical Equivariant Policy via Frame Transfer

Haibo Zhao, Dian Wang, Yizhe Zhu et al.

ICML 2025 • arXiv:2502.05728
11
citations
#1465

MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning

Zihan Chen, Song Wang, Zhen Tan et al.

ICML 2025 • arXiv:2505.16225
11
citations
#1466

xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference

Maximilian Beck, Korbinian Pöppel, Phillip Lippe et al.

ICML 2025 • arXiv:2503.13427
11
citations
#1467

Embodied CoT Distillation From LLM To Off-the-shelf Agents

Wonje Choi, Woo Kyung Kim, Minjong Yoo et al.

ICML 2024 • arXiv:2412.11499
11
citations
#1468

Private Gradient Descent for Linear Regression: Tighter Error Bounds and Instance-Specific Uncertainty Estimation

Gavin Brown, Krishnamurthy Dvijotham, Georgina Evans et al.

ICML 2024 • arXiv:2402.13531
11
citations
#1469

Shifted Interpolation for Differential Privacy

Jinho Bok, Weijie Su, Jason Altschuler

ICML 2024 • arXiv:2403.00278
11
citations
#1470

Low-Rank Similarity Mining for Multimodal Dataset Distillation

Yue Xu, Zhilin Lin, Yusong Qiu et al.

ICML 2024 • arXiv:2406.03793
11
citations
#1471

Self-attention Networks Localize When QK-eigenspectrum Concentrates

Han Bao, Ryuichiro Hataya, Ryo Karakida

ICML 2024 • arXiv:2402.02098
11
citations
#1472

Provable Contrastive Continual Learning

Yichen Wen, Zhiquan Tan, Kaipeng Zheng et al.

ICML 2024 • arXiv:2405.18756
11
citations
#1473

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

Thomas Schmied, Thomas Adler, Vihang Patil et al.

ICML 2025 • arXiv:2410.22391
11
citations
#1474

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Da Xiao, Qingye Meng, Shengping Li et al.

ICML 2025 • arXiv:2502.12170
11
citations
#1475

Learning to Reach Goals via Diffusion

Vineet Jain, Siamak Ravanbakhsh

ICML 2024 • arXiv:2310.02505
11
citations
#1476

Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning

Jiaqi Wang, Chenxu Zhao, Lingjuan Lyu et al.

ICML 2024 • arXiv:2407.03247
11
citations
#1477

MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design

Haojie Duanmu, Xiuhong Li, Zhihang Yuan et al.

ICML 2025 • arXiv:2505.05799
11
citations
#1478

Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models

Akhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia et al.

ICML 2024 • arXiv:2403.09635
11
citations
#1479

Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion

Kulin Shah, Alkis Kalavasis, Adam Klivans et al.

ICML 2025 • arXiv:2502.21278
11
citations
#1480

Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems

David T. Hoffmann, Simon Schrodi, Jelena Bratulić et al.

ICML 2024 • arXiv:2310.12956
11
citations
#1481

Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions

Yongqiang Cai

ICML 2024 • spotlight • arXiv:2305.12205
11
citations
#1482

Generation from Noisy Examples

Ananth Raman, Vinod Raman

ICML 2025 • arXiv:2501.04179
11
citations
#1483

Causal Action Influence Aware Counterfactual Data Augmentation

Núria Armengol Urpí, Marco Bagatella, Marin Vlastelica et al.

ICML 2024 • arXiv:2405.18917
11
citations
#1484

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

Rui Min, Tianyu Pang, Chao Du et al.

ICML 2025 • arXiv:2501.17858
11
citations
#1485

Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

Wei Wang, Takashi Ishida, Yu-Jie Zhang et al.

ICML 2024 • arXiv:2311.15502
11
citations
#1486

Neural operators meet conjugate gradients: The FCG-NO method for efficient PDE solving

Alexander Rudikov, Vladimir Fanaskov, Ekaterina Muravleva et al.

ICML 2024 • arXiv:2402.05598
11
citations
#1487

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

Xudong Li, Timin Gao, Runze Hu et al.

ICML 2024 • arXiv:2312.06158
11
citations
#1488

How Free is Parameter-Free Stochastic Optimization?

Amit Attia, Tomer Koren

ICML 2024 • spotlight • arXiv:2402.03126
11
citations
#1489

Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning

Changsheng Wang, Yihua Zhang, Jinghan Jia et al.

ICML 2025 • arXiv:2506.01339
11
citations
#1490

Delving into Differentially Private Transformer

Youlong Ding, Xueyang Wu, Yining Meng et al.

ICML 2024 • arXiv:2405.18194
11
citations
#1491

Understanding Diffusion Models by Feynman's Path Integral

Yuji Hirono, Akinori Tanaka, Kenji Fukushima

ICML 2024 • arXiv:2403.11262
11
citations
#1492

CellFlux: Simulating Cellular Morphology Changes via Flow Matching

Yuhui Zhang, Yuchang Su, Chenyu Wang et al.

ICML 2025 • arXiv:2502.09775
11
citations
#1493

AlphaPO: Reward Shape Matters for LLM Alignment

Aman Gupta, Shao Tang, Qingquan Song et al.

ICML 2025 • arXiv:2501.03884
11
citations
#1494

Enhancing Foundation Models for Time Series Forecasting via Wavelet-based Tokenization

Luca Masserano, Abdul Fatir Ansari, Boran Han et al.

ICML 2025 • oral • arXiv:2412.05244
11
citations
#1495

Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformer

Doron Haviv, Russell Kunes, Thomas Dougherty et al.

ICML 2024 • arXiv:2404.09411
11
citations
#1496

How Expressive are Knowledge Graph Foundation Models?

Xingyue Huang, Pablo Barcelo, Michael Bronstein et al.

ICML 2025 • arXiv:2502.13339
11
citations
#1497

Adversarial Robustness in Two-Stage Learning-to-Defer: Algorithms and Guarantees

Yannis Montreuil, Axel Carlier, Lai Xing Ng et al.

ICML 2025 • arXiv:2502.01027
11
citations
#1498

Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models

Som Sagar, Aditya Taparia, Ransalu Senanayake

ICML 2024 • spotlight • arXiv:2406.07145
11
citations
#1499

Improving the Scaling Laws of Synthetic Data with Deliberate Practice

Reyhane Askari Hemmat, Mohammad Pezeshki, Elvis Dohmatob et al.

ICML 2025 • oral • arXiv:2502.15588
11
citations
#1500

To the Max: Reinventing Reward in Reinforcement Learning

Grigorii Veviurko, Wendelin Boehmer, Mathijs de Weerdt

ICML 2024 • arXiv:2402.01361
11
citations
#1501

Sample Complexity Bounds for Estimating Probability Divergences under Invariances

Behrooz Tahmasebi, Stefanie Jegelka

ICML 2024 • arXiv:2311.02868
11
citations
#1502

A Global Geometric Analysis of Maximal Coding Rate Reduction

Peng Wang, Huikang Liu, Druv Pai et al.

ICML 2024 • arXiv:2406.01909
11
citations
#1503

SafeMap: Robust HD Map Construction from Incomplete Observations

Xiaoshuai Hao, Lingdong Kong, Rong Yin et al.

ICML 2025 • arXiv:2507.00861
11
citations
#1504

GraphGPT: Generative Pre-trained Graph Eulerian Transformer

Qifang Zhao, Weidong Ren, Tianyu Li et al.

ICML 2025 • arXiv:2401.00529
11
citations
#1505

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Zhenxing Mi, Kuan-Chieh Wang, Guocheng Qian et al.

ICML 2025 • arXiv:2502.10458
11
citations
#1506

Position: The AI Conference Peer Review Crisis Demands Author Feedback and Reviewer Rewards

Jaeho Kim, Yunseok Lee, Seulki Lee

ICML 2025 • oral • arXiv:2505.04966
11
citations
#1507

Sliding Down the Stairs: How Correlated Latent Variables Accelerate Learning with Neural Networks

Lorenzo Bardone, Sebastian Goldt

ICML 2024 • arXiv:2404.08602
11
citations
#1508

Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments

Allen Tran, Aurelien Bibaut, Nathan Kallus

ICML 2024 • arXiv:2311.08527
11
citations
#1509

Enabling Uncertainty Estimation in Iterative Neural Networks

Nikita Durasov, Doruk Oner, Jonathan Donier et al.

ICML 2024 • arXiv:2403.16732
11
citations
#1510

Simplicity Bias of Two-Layer Networks beyond Linearly Separable Data

Nikita Tsoy, Nikola Konstantinov

ICML 2024 • arXiv:2405.17299
11
citations
#1511

Adaptive Self-improvement LLM Agentic System for ML Library Development

Genghan Zhang, Weixin Liang, Olivia Hsu et al.

ICML 2025 • arXiv:2502.02534
11
citations
#1512

Beyond Individual Input for Deep Anomaly Detection on Tabular Data

Hugo Thimonier, Fabrice Popineau, Arpad Rimmel et al.

ICML 2024 • arXiv:2305.15121
11
citations
#1513

Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order Optimization

Ruizhong Qiu, Hanghang Tong

ICML 2024 • arXiv:2405.16805
11
citations
#1514

NExtLong: Toward Effective Long-Context Training without Long Documents

Chaochen Gao, Xing W, Zijia Lin et al.

ICML 2025 • arXiv:2501.12766
11
citations
#1515

Scaling Trends in Language Model Robustness

Nikolaus Howe, Ian McKenzie, Oskar Hollinsworth et al.

ICML 2025 • spotlight • arXiv:2407.18213
11
citations
#1516

How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization

Andrew Lowy, Jonathan Ullman, Stephen Wright

ICML 2024 • arXiv:2402.11173
11
citations
#1517

Constrained Reinforcement Learning Under Model Mismatch

Zhongchang Sun, Sihong He, Fei Miao et al.

ICML 2024 • arXiv:2405.01327
11
citations
#1518

Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior

Shuyu Cheng, Yibo Miao, Yinpeng Dong et al.

ICML 2024 • arXiv:2405.19098
11
citations
#1519

Language-guided Skill Learning with Temporal Variational Inference

Haotian Fu, Pratyusha Sharma, Elias Stengel-Eskin et al.

ICML 2024 • oral • arXiv:2402.16354
11
citations
#1520

Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret

Han Zhong, Jiachen Hu, Yecheng Xue et al.

ICML 2024 • arXiv:2302.10796
11
citations
#1521

Federated Representation Learning in the Under-Parameterized Regime

Renpu Liu, Cong Shen, Jing Yang

ICML 2024 • arXiv:2406.04596
11
citations
#1522

Multimodal Medical Code Tokenizer

Xiaorui Su, Shvat Messica, Yepeng Huang et al.

ICML 2025 • arXiv:2502.04397
11
citations
#1523

D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples

Zijing Hu, Fengda Zhang, Kun Kuang

ICML 2025 • arXiv:2505.22002
11
citations
#1524

From Generalization Analysis to Optimization Designs for State Space Models

Fusheng Liu, Qianxiao Li

ICML 2024 • oral • arXiv:2405.02670
11
citations
#1525

HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder

Qi Yang, Le Yang, Geert Van der Auwera et al.

ICML 2025 • arXiv:2505.01938
11
citations
#1526

Pedestrian Attribute Recognition as Label-balanced Multi-label Learning

Yibo Zhou, Hai-Miao Hu, Yirong Xiang et al.

ICML 2024 • arXiv:2405.04858
11
citations
#1527

Exploring Correlations of Self-Supervised Tasks for Graphs

Taoran Fang, Wei Chow, Yifei Sun et al.

ICML 2024 • arXiv:2405.04245
11
citations
#1528

Scaling Laws for the Value of Individual Data Points in Machine Learning

Ian Covert, Wenlong Ji, Tatsunori Hashimoto et al.

ICML 2024 • arXiv:2405.20456
11
citations
#1529

Bringing Motion Taxonomies to Continuous Domains via GPLVM on Hyperbolic manifolds

Noémie Jaquier, Leonel Rozo, Miguel González-Duque et al.

ICML 2024 • arXiv:2210.01672
11
citations
#1530

Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics

Shiwei Li, Xiandi Luo, Xing Tang et al.

ICML 2025 • arXiv:2505.23194
11
citations
#1531

Expressivity and Generalization: Fragment-Biases for Molecular GNNs

Tom Wollschläger, Niklas Kemper, Leon Hetzel et al.

ICML 2024 • arXiv:2406.08210
11
citations
#1532

Position: Insights from Survey Methodology can Improve Training Data

Stephanie Eckman, Barbara Plank, Frauke Kreuter

ICML 2024 • arXiv:2403.01208
11
citations
#1533

When and How Does In-Distribution Label Help Out-of-Distribution Detection?

Xuefeng Du, Yiyou Sun, Sharon Li

ICML 2024 • arXiv:2405.18635
11
citations
#1534

Diversified Batch Selection for Training Acceleration

Feng Hong, Yueming LYU, Jiangchao Yao et al.

ICML 2024 • arXiv:2406.04872
11
citations
#1535

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Xuanlei Zhao, Shenggan Cheng, Chang Chen et al.

ICML 2025 • arXiv:2403.10266
11
citations
#1536

How Much Can We Forget about Data Contamination?

Sebastian Bordt, Suraj Srinivas, Valentyn Boreiko et al.

ICML 2025 • arXiv:2410.03249
11
citations
#1537

Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products

Guy Bar Shalom, Beatrice Bevilacqua, Haggai Maron

ICML 2024 • arXiv:2402.08450
11
citations
#1538

Asymptotics of Learning with Deep Structured (Random) Features

Dominik Schröder, Daniil Dmitriev, Hugo Cui et al.

ICML 2024 • arXiv:2402.13999
11
citations
#1539

Allocation Requires Prediction Only if Inequality Is Low

Ali Shirali, Rediet Abebe, Moritz Hardt

ICML 2024 • spotlight • arXiv:2406.13882
11
citations
#1540

Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams

Brian Cho, Kyra Gan, Nathan Kallus

ICML 2024 • arXiv:2402.06122
11
citations
#1541

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

Zelin Zang, Hao Luo, Kai Wang et al.

ICML 2024 • arXiv:2309.07909
11
citations
#1542

Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization

Li Ding, Jenny Zhang, Jeff Clune et al.

ICML 2024 • arXiv:2310.12103
11
citations
#1543

Can Transformers Reason Logically? A Study in SAT Solving

Leyan Pan, Vijay Ganesh, Jacob Abernethy et al.

ICML 2025 • arXiv:2410.07432
11
citations
#1544

Do Multiple Instance Learning Models Transfer?

Daniel Shao, Richard Chen, Andrew Song et al.

ICML 2025 • spotlight • arXiv:2506.09022
11
citations
#1545

Trained Random Forests Completely Reveal your Dataset

Julien Ferry, Ricardo Fukasawa, Timothée Pascal et al.

ICML 2024 • arXiv:2402.19232
11
citations
#1546

Generalization Bound and New Algorithm for Clean-Label Backdoor Attack

Lijia Yu, Shuang Liu, Yibo Miao et al.

ICML 2024 • arXiv:2406.00588
11
citations
#1547

FRAG: Frequency Adapting Group for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Geonwoo Kim et al.

ICML 2024 • arXiv:2406.06044
11
citations
#1548

Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG

Wenbin Wang, Yongcheng Jing, Liang Ding et al.

ICML 2025 • oral • arXiv:2503.01222
11
citations
#1549

DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design

Samuel Garcin, James Doran, Shangmin Guo et al.

ICML 2024 • arXiv:2402.03479
11
citations
#1550

Understanding Mode Connectivity via Parameter Space Symmetry

Bo Zhao, Nima Dehmamy, Robin Walters et al.

ICML 2025 • arXiv:2505.23681
11
citations
#1551

Flat-LoRA: Low-Rank Adaptation over a Flat Loss Landscape

Tao Li, Zhengbao He, Yujun Li et al.

ICML 2025 • arXiv:2409.14396
11
citations
#1552

TopoTune: A Framework for Generalized Combinatorial Complex Neural Networks

Mathilde Papillon, Guillermo Bernardez, Claudio Battiloro et al.

ICML 2025 • arXiv:2410.06530
11
citations
#1553

Diffusion Rejection Sampling

Byeonghu Na, Yeongmin Kim, Minsang Park et al.

ICML 2024 • arXiv:2405.17880
11
citations
#1554

Using Left and Right Brains Together: Towards Vision and Language Planning

Jun CEN, Chenfei Wu, Xiao Liu et al.

ICML 2024 • arXiv:2402.10534
11
citations
#1555

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Zongyu Lin, Yao Tang, Xingcheng Yao et al.

ICML 2025 • arXiv:2502.02584
11
citations
#1556

Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks

Shikai Qiu, Lechao Xiao, Andrew Wilson et al.

ICML 2025 • oral • arXiv:2507.02119
11
citations
#1557

Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization

Peiyan Zhang, Haibo Jin, Leyang Hu et al.

ICML 2025 • arXiv:2412.03092
11
citations
#1558

Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm

Batiste Le Bars, Aurélien Bellet, Marc Tommasi et al.

ICML 2024 • arXiv:2306.02939
11
citations
#1559

KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation

Minsik Cho, Mohammad Rastegari, Devang Naik

ICML 2024 • arXiv:2405.05329
11
citations
#1560

Listwise Reward Estimation for Offline Preference-based Reinforcement Learning

Heewoong Choi, Sangwon Jung, Hongjoon Ahn et al.

ICML 2024 • arXiv:2408.04190
11
citations
#1561

CTBench: A Library and Benchmark for Certified Training

Yuhao Mao, Stefan Balauca, Martin Vechev

ICML 2025 • arXiv:2406.04848
11
citations
#1562

Fundamental Benefit of Alternating Updates in Minimax Optimization

Jaewook Lee, Hanseul Cho, Chulhee Yun

ICML 2024 • spotlight • arXiv:2402.10475
11
citations
#1563

CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models

Qinsi Wang, Hancheng Ye, Ming-Yu Chung et al.

ICML 2025 • arXiv:2505.19235
11
citations
#1564

Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree

Lang Feng, Pengjie Gu, Bo An et al.

ICML 2024 • spotlight • arXiv:2405.17879
11
citations
#1565

Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer et al.

ICML 2024 • arXiv:2405.02678
11
citations
#1566

Differentiability and Optimization of Multiparameter Persistent Homology

Luis Scoccola, Siddharth Setlur, David Loiseaux et al.

ICML 2024 • arXiv:2406.07224
11
citations
#1567

Reliable and Efficient Amortized Model-based Evaluation

Sang Truong, Yuheng Tu, Percy Liang et al.

ICML 2025 • arXiv:2503.13335
11
citations
#1568

Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment

Cheryl Li, Tianyuan Xu, Yiwen Guo

ICML 2025 • arXiv:2502.07803
11
citations
#1569

Mean Estimation in the Add-Remove Model of Differential Privacy

Alex Kulesza, Ananda Suresh, Yuyan Wang

ICML 2024 • arXiv:2312.06658
10
citations
#1570

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Trung Pham, Kang Zhang, Chang Yoo

ICML 2024 • arXiv:2402.01516
10
citations
#1571

PENCIL: Long Thoughts with Short Memory

Chenxiao Yang, Nati Srebro, David McAllester et al.

ICML 2025 • arXiv:2503.14337
10
citations
#1572

RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression

Payman Behnam, Yaosheng Fu, Ritchie Zhao et al.

ICML 2025 • arXiv:2502.14051
10
citations
#1573

Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments

Jonas Schweisthal, Dennis Frauen, M van der Schaar et al.

ICML 2024 • arXiv:2406.02464
10
citations
#1574

AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion

Adeesh Kolluru, John Kitchin

ICML 2024 • arXiv:2405.03962
10
citations
#1575

Jacobian Regularizer-based Neural Granger Causality

Wanqi Zhou, Shuanghao Bai, Shujian Yu et al.

ICML 2024 • arXiv:2405.08779
10
citations
#1576

Libra: Building Decoupled Vision System on Large Language Models

Yifan Xu, Xiaoshan Yang, Yaguang Song et al.

ICML 2024 • arXiv:2405.10140
10
citations
#1577

CuTS: Customizable Tabular Synthetic Data Generation

Mark Vero, Mislav Balunovic, Martin Vechev

ICML 2024 • arXiv:2307.03577
10
citations
#1578

Estimating Barycenters of Distributions with Neural Optimal Transport

Alexander Kolesov, Petr Mokrov, Igor Udovichenko et al.

ICML 2024 • arXiv:2402.03828
10
citations
#1579

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Honghao Chen, Yurong Zhang, Xiaokun Feng et al.

ICML 2024 • arXiv:2407.08972
10
citations
#1580

Towards Understanding Inductive Bias in Transformers: A View From Infinity

Itay Lavie, Guy Gur-Ari, Zohar Ringel

ICML 2024 • arXiv:2402.05173
10
citations
#1581

Efficient Pareto Manifold Learning with Low-Rank Structure

Weiyu CHEN, James Kwok

ICML 2024 • spotlight • arXiv:2407.20734
10
citations
#1582

Non-clairvoyant Scheduling with Partial Predictions

Ziyad Benomar, Vianney Perchet

ICML 2024 • arXiv:2405.01013
10
citations
#1583

Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI

Julien Pourcel, Cédric Colas, Pierre-Yves Oudeyer

ICML 2025 • arXiv:2507.14172
10
citations
#1584

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan LIU, Shibo Chen et al.

ICML 2024 • arXiv:2405.18110
10
citations
#1585

Solving Poisson Equations using Neural Walk-on-Spheres

Hong Chul Nam, Julius Berner, Anima Anandkumar

ICML 2024 • arXiv:2406.03494
10
citations
#1586

When Do LLMs Help With Node Classification? A Comprehensive Analysis

Xixi Wu, Yifei Shen, Fangzhou Ge et al.

ICML 2025 • arXiv:2502.00829
10
citations
#1587

Fundamental limits of learning in sequence multi-index models and deep attention networks: high-dimensional asymptotics and sharp thresholds

Emanuele Troiani, Hugo Cui, Yatin Dandi et al.

ICML 2025 • arXiv:2502.00901
10
citations
#1588

Differentiable Mapper for Topological Optimization of Data Representation

Ziyad Oulhaj, Mathieu Carrière, Bertrand Michel

ICML 2024 • arXiv:2402.12854
10
citations
#1589

How Much Can Transfer? BRIDGE: Bounded Multi-Domain Graph Foundation Model with Generalization Guarantees

Haonan Yuan, Qingyun Sun, Junhua Shi et al.

ICML 2025
10
citations
#1590

Beyond the Calibration Point: Mechanism Comparison in Differential Privacy

Georgios Kaissis, Stefan Kolek, Borja de Balle Pigem et al.

ICML 2024 • arXiv:2406.08918
10
citations
#1591

Synthesizing Software Engineering Data in a Test-Driven Manner

Lei Zhang, Jiaxi Yang, Min Yang et al.

ICML 2025 • arXiv:2506.09003
10
citations
#1592

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024 • arXiv:2311.17105
10
citations
#1593

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

ICML 2024 • arXiv:2405.08540
10
citations
#1594

Quantum Positional Encodings for Graph Neural Networks

Slimane Thabet, Mehdi Djellabi, Igor Sokolov et al.

ICML 2024 • arXiv:2406.06547
10
citations
#1595

Safety Reasoning with Guidelines

Haoyu Wang, Zeyu Qin, Li Shen et al.

ICML 2025 • arXiv:2502.04040
10
citations
#1596

Learning Efficient Robotic Garment Manipulation with Standardization

Changshi Zhou, Feng Luan, Jiarui Hu et al.

ICML 2025 • arXiv:2506.22769
10
citations
#1597

Gaussian Mixture Flow Matching Models

Hansheng Chen, Kai Zhang, Hao Tan et al.

ICML 2025 • arXiv:2504.05304
10
citations
#1598

On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures

Wei Shen, Ruida Zhou, Jing Yang et al.

ICML 2025 • arXiv:2410.11778
10
citations
#1599

ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Andrew Jesson, Christopher Lu, Gunshi Gupta et al.

ICML 2024 • arXiv:2306.01460
10
citations
#1600

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

Xiaoyu Wen, Chenjia Bai, Kang Xu et al.

ICML 2024 • arXiv:2405.06192
10
citations