Most Cited ICML "subject-verb-object" Papers

5,975 papers found • Page 5 of 30

#801

Position: Why We Must Rethink Empirical Research in Machine Learning

Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger et al.

ICML 2024arXiv:2405.02200
24
citations
#802

Equivariant Diffusion for Crystal Structure Prediction

Peijia Lin, Pin Chen, Rui Jiao et al.

ICML 2024arXiv:2512.07289
24
citations
#803

Position: Key Claims in LLM Research Have a Long Tail of Footnotes

Anna Rogers, Sasha Luccioni

ICML 2024arXiv:2308.07120
24
citations
#804

Comparing Graph Transformers via Positional Encodings

Mitchell Black, Zhengchao Wan, Gal Mishne et al.

ICML 2024arXiv:2402.14202
24
citations
#805

Efficient and Effective Time-Series Forecasting with Spiking Neural Networks

Changze Lv, Yansen Wang, Dongqi Han et al.

ICML 2024oralarXiv:2402.01533
24
citations
#806

On Mechanistic Knowledge Localization in Text-to-Image Generative Models

Samyadeep Basu, Keivan Rezaei, Priyatham Kattakinda et al.

ICML 2024arXiv:2405.01008
24
citations
#807

Sampling in Unit Time with Kernel Fisher-Rao Flow

Aimee Maurais, Youssef Marzouk

ICML 2024arXiv:2401.03892
24
citations
#808

Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge

Hanna Wallach, Meera Desai, A. Feder Cooper et al.

ICML 2025arXiv:2502.00561
23
citations
#809

Compute Better Spent: Replacing Dense Layers with Structured Matrices

Shikai Qiu, Andres Potapczynski, Marc Finzi et al.

ICML 2024arXiv:2406.06248
23
citations
#810

PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels

Praneeth Kacham, Vahab Mirrokni, Peilin Zhong

ICML 2024arXiv:2310.01655
23
citations
#811

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

Jiachen Wang, Tianji Yang, James Zou et al.

ICML 2024arXiv:2405.03875
23
citations
#812

DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems

Yair Schiff, Zhong Yi Wan, Jeffrey Parker et al.

ICML 2024arXiv:2402.04467
23
citations
#813

Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning

Puning Yang, Qizhou Wang, Zhuo Huang et al.

ICML 2025arXiv:2505.11953
23
citations
#814

From Language Models over Tokens to Language Models over Characters

Tim Vieira, Benjamin LeBrun, Mario Giulianelli et al.

ICML 2025spotlightarXiv:2412.03719
23
citations
#815

On Least Square Estimation in Softmax Gating Mixture of Experts

Huy Nguyen, Nhat Ho, Alessandro Rinaldo

ICML 2024arXiv:2402.02952
23
citations
#816

Optimizing Language Models for Inference Time Objectives using Reinforcement Learning

Yunhao Tang, Kunhao Zheng, Gabriel Synnaeve et al.

ICML 2025arXiv:2503.19595
23
citations
#817

Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

Zhuo Huang, Chang Liu, Yinpeng Dong et al.

ICML 2024arXiv:2312.02546
23
citations
#818

Mastering Board Games by External and Internal Planning with Language Models

John Schultz, Jakub Adamek, Matej Jusup et al.

ICML 2025spotlightarXiv:2412.12119
23
citations
#819

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Zhenyu He, Guhao Feng, Shengjie Luo et al.

ICML 2024arXiv:2401.16421
23
citations
#820

The Good, The Bad, and Why: Unveiling Emotions in Generative AI

CHENG LI, Jindong Wang, Yixuan Zhang et al.

ICML 2024arXiv:2312.11111
23
citations
#821

Structure-based drug design by denoising voxel grids

Pedro O. Pinheiro, Arian Jamasb, Omar Mahmood et al.

ICML 2024arXiv:2405.03961
23
citations
#822

A Simple Model of Inference Scaling Laws

Noam Levi

ICML 2025arXiv:2410.16377
23
citations
#823

TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems

Si-Yang Liu, Han-Jia Ye

ICML 2025arXiv:2502.02527
23
citations
#824

Hyperbolic Geometric Latent Diffusion Model for Graph Generation

Xingcheng Fu, Yisen Gao, Yuecen Wei et al.

ICML 2024arXiv:2405.03188
23
citations
#825

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Jon Saad-Falcon, Daniel Y Fu, Simran Arora et al.

ICML 2024arXiv:2402.07440
23
citations
#826

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Zihan Liu, Shuangrui Ding, Zhixiong Zhang et al.

ICML 2025arXiv:2502.13128
23
citations
#827

M+: Extending MemoryLLM with Scalable Long-Term Memory

Yu Wang, Dmitry Krotov, Yuanzhe Hu et al.

ICML 2025arXiv:2502.00592
23
citations
#828

Automated Hypothesis Validation with Agentic Sequential Falsifications

Kexin Huang, Ying Jin, Ryan Li et al.

ICML 2025arXiv:2502.09858
23
citations
#829

TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge

Young Kwon, Rui Li, Stylianos Venieris et al.

ICML 2024arXiv:2307.09988
22
citations
#830

Projecting Molecules into Synthesizable Chemical Spaces

Shitong Luo, Wenhao Gao, Zuofan Wu et al.

ICML 2024arXiv:2406.04628
22
citations
#831

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.

ICML 2024arXiv:2402.01567
22
citations
#832

Reinforced Lifelong Editing for Language Models

Zherui Li, Houcheng Jiang, Hao Chen et al.

ICML 2025arXiv:2502.05759
22
citations
#833

Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

Jiahan Zhang, Qi Wei, Feng Liu et al.

ICML 2024arXiv:2406.10502
22
citations
#834

STEER: Assessing the Economic Rationality of Large Language Models

Narun Raman, Taylor Lundy, Samuel Joseph Amouyal et al.

ICML 2024arXiv:2402.09552
22
citations
#835

Attribution-based Explanations that Provide Recourse Cannot be Robust

Hidde Fokkema, Rianne de Heide, Tim van Erven

ICML 2024arXiv:2205.15834
22
citations
#836

How Private are DP-SGD Implementations?

Lynn Chua, Badih Ghazi, Pritish Kamath et al.

ICML 2024arXiv:2403.17673
22
citations
#837

Nonparametric Modern Hopfield Models

Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu et al.

ICML 2025arXiv:2404.03900
22
citations
#838

Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs

Youhe Jiang, Fangcheng Fu, Xiaozhe Yao et al.

ICML 2025arXiv:2502.00722
22
citations
#839

Latent Space Symmetry Discovery

Jianke Yang, Nima Dehmamy, Robin Walters et al.

ICML 2024arXiv:2310.00105
22
citations
#840

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

Yuwei Zeng, Yao Mu, Lin Shao

ICML 2024arXiv:2405.07162
22
citations
#841

RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching

Divya Nori, Wengong Jin

ICML 2024arXiv:2405.18768
22
citations
#842

DPZero: Private Fine-Tuning of Language Models without Backpropagation

Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.

ICML 2024arXiv:2310.09639
22
citations
#843

Optimal Ridge Regularization for Out-of-Distribution Prediction

Pratik Patil, Jin-Hong Du, Ryan Tibshirani

ICML 2024spotlightarXiv:2404.01233
22
citations
#844

Is Noise Conditioning Necessary for Denoising Generative Models?

Qiao Sun, Zhicheng Jiang, Hanhong Zhao et al.

ICML 2025arXiv:2502.13129
22
citations
#845

OneForecast: A Universal Framework for Global and Regional Weather Forecasting

Yuan Gao, Hao Wu, Ruiqi Shu et al.

ICML 2025arXiv:2502.00338
22
citations
#846

Causal Representation Learning Made Identifiable by Grouping of Observational Variables

Hiroshi Morioka, Aapo Hyvarinen

ICML 2024oralarXiv:2310.15709
22
citations
#847

Towards a General Time Series Forecasting Model with Unified Representation and Adaptive Transfer

Yihang Wang, Yuying Qiu, Peng Chen et al.

ICML 2025arXiv:2405.17478
22
citations
#848

Rethinking Momentum Knowledge Distillation in Online Continual Learning

Nicolas MICHEL, Maorong Wang, Ling Xiao et al.

ICML 2024arXiv:2309.02870
22
citations
#849

Estimating Canopy Height at Scale

Jan Pauls, Max Zimmer, Una Kelly et al.

ICML 2024arXiv:2406.01076
22
citations
#850

Position: Understanding LLMs Requires More Than Statistical Generalization

Patrik Reizinger, Szilvia Ujváry, Anna Mészáros et al.

ICML 2024spotlightarXiv:2405.01964
22
citations
#851

Position: AI Evaluation Should Learn from How We Test Humans

Yan Zhuang, Qi Liu, Zachary Pardos et al.

ICML 2025arXiv:2306.10512
22
citations
#852

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

Yufei Huang, Odin Zhang, Lirong Wu et al.

ICML 2024spotlightarXiv:2402.11459
22
citations
#853

Compute or Load KV Cache? Why Not Both?

Shuowei Jin, Xueshen Liu, Qingzhao Zhang et al.

ICML 2025arXiv:2410.03065
22
citations
#854

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems

Yifan Xia, Xianliang Yang, Zichuan Liu et al.

ICML 2024arXiv:2406.03503
22
citations
#855

Towards Theoretical Understandings of Self-Consuming Generative Models

Shi Fu, Sen Zhang, Yingjie Wang et al.

ICML 2024arXiv:2402.11778
22
citations
#856

Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning

Kai Gan, Tong Wei

ICML 2024arXiv:2405.11756
22
citations
#857

A Sparsity Principle for Partially Observable Causal Representation Learning

Danru Xu, Dingling Yao, Sébastien Lachapelle et al.

ICML 2024arXiv:2403.08335
22
citations
#858

Fixing the Double Penalty in Data-Driven Weather Forecasting Through a Modified Spherical Harmonic Loss Function

Christopher Subich, Syed Husain, Leo Separovic et al.

ICML 2025arXiv:2501.19374
22
citations
#859

Generalization in Kernel Regression Under Realistic Assumptions

Daniel Barzilai, Ohad Shamir

ICML 2024spotlightarXiv:2312.15995
22
citations
#860

Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Ermo Hua, Che Jiang, Xingtai Lv et al.

ICML 2025arXiv:2412.17739
22
citations
#861

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Nayoung Lee, Jack Cai, Avi Schwarzschild et al.

ICML 2025arXiv:2502.01612
22
citations
#862

Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic Encryption

Itamar Zimerman, Moran Baruch, Nir Drucker et al.

ICML 2024arXiv:2311.08610
22
citations
#863

Approximate Nearest Neighbor Search with Window Filters

Josh Engels, Ben Landrum, Shangdi Yu et al.

ICML 2024arXiv:2402.00943
21
citations
#864

Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

Zixuan Wang, Stanley Wei, Daniel Hsu et al.

ICML 2024arXiv:2406.06893
21
citations
#865

Discovering Environments with XRM

Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim et al.

ICML 2024arXiv:2309.16748
21
citations
#866

Delta Decompression for MoE-based LLMs Compression

Hao Gu, Wei Li, Lujun Li et al.

ICML 2025arXiv:2502.17298
21
citations
#867

Membership Inference Attacks on Diffusion Models via Quantile Regression

Shuai Tang, Steven Wu, Sergul Aydore et al.

ICML 2024arXiv:2312.05140
21
citations
#868

Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning

Zeyu Gan, Yun Liao, Yong Liu

ICML 2025arXiv:2501.15602
21
citations
#869

TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

Jiaxiang Dong, Haixu Wu, Yuxuan Wang et al.

ICML 2024oralarXiv:2402.02475
21
citations
#870

Monte Carlo Tree Diffusion for System 2 Planning

Jaesik Yoon, Hyeonseo Cho, Doojin Baek et al.

ICML 2025spotlightarXiv:2502.07202
21
citations
#871

Reducing Tool Hallucination via Reliability Alignment

Hongshen Xu, Zichen Zhu, Lei Pan et al.

ICML 2025arXiv:2412.04141
21
citations
#872

Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-Loop and Hessian-Free Solution Strategy

Risheng Liu, Zhu Liu, Wei Yao et al.

ICML 2024arXiv:2405.09927
21
citations
#873

Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding

Tian Jin, Ellie Cheng, Zachary Ankner et al.

ICML 2025arXiv:2502.11517
21
citations
#874

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

Tianying Ji, Yu Luo, Fuchun Sun et al.

ICML 2024arXiv:2306.02865
21
citations
#875

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Miltiadis Allamanis, Sheena Panthaplackel, Pengcheng Yin

ICML 2024arXiv:2402.08699
21
citations
#876

Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents

Shayan Kiyani, George Pappas, Aaron Roth et al.

ICML 2025spotlightarXiv:2502.02561
21
citations
#877

Residual Quantization with Implicit Neural Codebooks

Iris Huijben, Matthijs Douze, Matthew Muckley et al.

ICML 2024arXiv:2401.14732
21
citations
#878

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead

Rickard Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj et al.

ICML 2025arXiv:2407.00066
21
citations
#879

Memorization Through the Lens of Curvature of Loss Function Around Samples

Isha Garg, Deepak Ravikumar, Kaushik Roy

ICML 2024spotlightarXiv:2307.05831
21
citations
#880

InfAlign: Inference-aware language model alignment

Ananth Balashankar, Ziteng Sun, Jonathan Berant et al.

ICML 2025arXiv:2412.19792
21
citations
#881

Improving LLM Safety Alignment with Dual-Objective Optimization

Xuandong Zhao, Will Cai, Tianneng Shi et al.

ICML 2025arXiv:2503.03710
21
citations
#882

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

Yi Yu, Yufei Wang, Song Xia et al.

ICML 2024arXiv:2405.01460
21
citations
#883

TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting

Peiyuan Liu, Beiliang Wu, Yifan Hu et al.

ICML 2025arXiv:2410.04442
21
citations
#884

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

Yilun Xu, Gabriele Corso, Tommi Jaakkola et al.

ICML 2024arXiv:2407.03300
21
citations
#885

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

CHUANQI CHENG, Jian Guan, Wei Wu et al.

ICML 2025oralarXiv:2504.02438
21
citations
#886

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation

Kai Li, Runxuan Yang, Fuchun Sun et al.

ICML 2024oralarXiv:2308.08143
21
citations
#887

Compositional Few-Shot Class-Incremental Learning

Yixiong Zou, Shanghang Zhang, haichen zhou et al.

ICML 2024arXiv:2405.17022
21
citations
#888

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

Mo Yu, Qiujing Wang, Shunchi Zhang et al.

ICML 2024arXiv:2211.04684
21
citations
#889

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

songyang gao, Qiming Ge, Wei Shen et al.

ICML 2024arXiv:2401.11458
21
citations
#890

Understanding the Effects of Iterative Prompting on Truthfulness

Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

ICML 2024arXiv:2402.06625
21
citations
#891

A sampling theory perspective on activations for implicit neural representations

Hemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko et al.

ICML 2024arXiv:2402.05427
21
citations
#892

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

Kaijie Zhu, Xianjun Yang, Jindong Wang et al.

ICML 2025arXiv:2502.05174
21
citations
#893

ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification

Hyunseok Lee, Seunghyuk Oh, Jaehyung Kim et al.

ICML 2025arXiv:2502.14565
21
citations
#894

BaxBench: Can LLMs Generate Correct and Secure Backends?

Mark Vero, Niels Mündler, Viktor Chibotaru et al.

ICML 2025spotlightarXiv:2502.11844
21
citations
#895

How to Trace Latent Generative Model Generated Images without Artificial Watermark?

Zhenting Wang, Vikash Sehwag, Chen Chen et al.

ICML 2024arXiv:2405.13360
20
citations
#896

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation

Daeun Lee, Jaehong Yoon, Sung Ju Hwang

ICML 2024arXiv:2402.08712
20
citations
#897

Graph-based Time Series Clustering for End-to-End Hierarchical Forecasting

Andrea Cini, Danilo Mandic, Cesare Alippi

ICML 2024arXiv:2305.19183
20
citations
#898

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.

ICML 2024arXiv:2403.01857
20
citations
#899

Homomorphism Counts for Graph Neural Networks: All About That Basis

Emily Jin, Michael Bronstein, Ismail Ceylan et al.

ICML 2024arXiv:2402.08595
20
citations
#900

Gaussian Processes on Cellular Complexes

Mathieu Alain, So Takao, Brooks Paige et al.

ICML 2024arXiv:2311.01198
20
citations
#901

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Feng Wang, Yaodong Yu, Wei Shao et al.

ICML 2025arXiv:2502.03738
20
citations
#902

Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Shibo Jie, Yehui Tang, Ning Ding et al.

ICML 2024arXiv:2405.05615
20
citations
#903

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

Rui Wang, Elyssa Hofgard, Han Gao et al.

ICML 2024arXiv:2310.02299
20
citations
#904

Symmetry Induces Structure and Constraint of Learning

Liu Ziyin

ICML 2024arXiv:2309.16932
20
citations
#905

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Zhuofan Zong, Dongzhi Jiang, Bingqi Ma et al.

ICML 2025arXiv:2412.09618
20
citations
#906

BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute

Dujian Ding, Ankur Mallick, Shaokun Zhang et al.

ICML 2025arXiv:2506.22716
20
citations
#907

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Jongwoo Ko, Tianyi Chen, Sungnyun Kim et al.

ICML 2025oralarXiv:2503.07067
20
citations
#908

Wyckoff Transformer: Generation of Symmetric Crystals

Nikita Kazeev, Wei Nong, Ignat Romanov et al.

ICML 2025arXiv:2503.02407
20
citations
#909

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

Pranjal Aggarwal, Bryan Parno, Sean Welleck

ICML 2025arXiv:2412.06176
20
citations
#910

What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks

Xingwu Chen, Difan Zou

ICML 2024arXiv:2404.01601
20
citations
#911

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Thomas Zeng, Shuibai Zhang, Shutong Wu et al.

ICML 2025oralarXiv:2502.06737
20
citations
#912

LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering

Li Sun, Zhenhao Huang, Hao Peng et al.

ICML 2024arXiv:2405.11801
20
citations
#913

CRANE: Reasoning with constrained LLM generation

Debangshu Banerjee, Tarun Suresh, Shubham Ugare et al.

ICML 2025arXiv:2502.09061
20
citations
#914

SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN

kang you, Zekai Xu, Chen Nie et al.

ICML 2024arXiv:2406.03470
20
citations
#915

StableMask: Refining Causal Masking in Decoder-only Transformer

Qingyu Yin, Xuzheng He, Xiang Zhuang et al.

ICML 2024arXiv:2402.04779
20
citations
#916

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal

ICML 2024arXiv:2401.16467
20
citations
#917

LEAPS: A discrete neural sampler via locally equivariant networks

Peter Holderrieth, Michael Albergo, Tommi Jaakkola

ICML 2025arXiv:2502.10843
20
citations
#918

Investigating Non-Transitivity in LLM-as-a-Judge

Yi Xu, Laura Ruis, Tim Rocktäschel et al.

ICML 2025spotlightarXiv:2502.14074
20
citations
#919

P(all-atom) Is Unlocking New Path For Protein Design

Wei Qu, Jiawei Guan, Rui Ma et al.

ICML 2025spotlight
20
citations
#920

Great Models Think Alike and this Undermines AI Oversight

Shashwat Goel, Joschka Strüber, Ilze Amanda Auzina et al.

ICML 2025spotlightarXiv:2502.04313
20
citations
#921

Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?

Huy Nguyen, Pedram Akbarian, Nhat Ho

ICML 2024arXiv:2401.13875
20
citations
#922

KAN-AD: Time Series Anomaly Detection with Kolmogorov–Arnold Networks

Quan Zhou, Changhua Pei, Fei Sun et al.

ICML 2025arXiv:2411.00278
20
citations
#923

Position: Uncertainty Quantification Needs Reassessment for Large Language Model Agents

Michael Kirchhof, Gjergji Kasneci, Enkelejda Kasneci

ICML 2025arXiv:2505.22655
20
citations
#924

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, Zhong-Yu Li, Quan-Sheng Zeng et al.

ICML 2024arXiv:2406.00670
20
citations
#925

CLIPZyme: Reaction-Conditioned Virtual Screening of Enzymes

Peter Mikhael, Itamar Chinn, Regina Barzilay

ICML 2024arXiv:2402.06748
20
citations
#926

Graph-based Forecasting with Missing Data through Spatiotemporal Downsampling

Ivan Marisca, Cesare Alippi, Filippo Maria Bianchi

ICML 2024oralarXiv:2402.10634
20
citations
#927

OptMATH: A Scalable Bidirectional Data Synthesis Framework for Optimization Modeling

Hongliang Lu, Zhonglin Xie, Yaoyu Wu et al.

ICML 2025arXiv:2502.11102
20
citations
#928

Rethinking Aleatoric and Epistemic Uncertainty

Freddie Bickford Smith, Jannik Kossen, Eleanor Trollope et al.

ICML 2025arXiv:2412.20892
20
citations
#929

On the Guidance of Flow Matching

Ruiqi Feng, Chenglei Yu, Wenhao Deng et al.

ICML 2025spotlightarXiv:2502.02150
20
citations
#930

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs – No Silver Bullet for LC or RAG Routing

Kuan Li, Liwen Zhang, Yong Jiang et al.

ICML 2025arXiv:2502.09977
20
citations
#931

MoMo: Momentum Models for Adaptive Learning Rates

Fabian Schaipp, Ruben Ohana, Michael Eickenberg et al.

ICML 2024arXiv:2305.07583
20
citations
#932

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

Ajay Jaiswal, Yifan Wang, Lu Yin et al.

ICML 2025arXiv:2407.11239
20
citations
#933

Interpretability Illusions in the Generalization of Simplified Models

Dan Friedman, Andrew Lampinen, Lucas Dixon et al.

ICML 2024arXiv:2312.03656
20
citations
#934

Parrot: Multilingual Visual Instruction Tuning

Hai-Long Sun, Da-Wei Zhou, Yang Li et al.

ICML 2025arXiv:2406.02539
20
citations
#935

Neural Networks Learn Statistics of Increasing Complexity

Nora Belrose, Quintin Pope, Lucia Quirke et al.

ICML 2024arXiv:2402.04362
20
citations
#936

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang et al.

ICML 2024arXiv:2401.13311
20
citations
#937

Local vs. Global Interpretability: A Computational Complexity Perspective

Shahaf Bassan, Guy Amir, Guy Katz

ICML 2024spotlightarXiv:2406.02981
20
citations
#938

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Zhenglin Zhou, Xiaobo Xia, Fan Ma et al.

ICML 2025arXiv:2502.04370
20
citations
#939

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings

Kevin Frans, Seohong Park, Pieter Abbeel et al.

ICML 2024spotlightarXiv:2402.17135
20
citations
#940

Inference-Time Alignment of Diffusion Models with Direct Noise Optimization

Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang et al.

ICML 2025arXiv:2405.18881
20
citations
#941

Auditing $f$-differential privacy in one run

Saeed Mahloujifar, Luca Melis, Kamalika Chaudhuri

ICML 2025oralarXiv:2410.22235
19
citations
#942

Emoji Attack: Enhancing Jailbreak Attacks Against Judge LLM Detection

Zhipeng Wei, Yuqi Liu, N. Benjamin Erichson

ICML 2025arXiv:2411.01077
19
citations
#943

Adaptive Message Passing: A General Framework to Mitigate Oversmoothing, Oversquashing, and Underreaching

Federico Errica, Henrik Christiansen, Viktor Zaverkin et al.

ICML 2025arXiv:2312.16560
19
citations
#944

Prompting a Pretrained Transformer Can Be a Universal Approximator

Aleksandar Petrov, Phil Torr, Adel Bibi

ICML 2024arXiv:2402.14753
19
citations
#945

Universal Length Generalization with Turing Programs

Kaiying Hou, David Brandfonbrener, Sham Kakade et al.

ICML 2025arXiv:2407.03310
19
citations
#946

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Yukang Yang, Declan Campbell, Kaixuan Huang et al.

ICML 2025arXiv:2502.20332
19
citations
#947

Flexible and Efficient Grammar-Constrained Decoding

Kanghee Park, Timothy Zhou, Loris D'Antoni

ICML 2025arXiv:2502.05111
19
citations
#948

Conformal Prediction with Learned Features

Shayan Kiyani, George J. Pappas, Hamed Hassani

ICML 2024arXiv:2404.17487
19
citations
#949

Trustless Audits without Revealing Data or Models

Suppakit Waiwitlikhit, Ion Stoica, Yi Sun et al.

ICML 2024arXiv:2404.04500
19
citations
#950

Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

Yuchao Lin, Jacob Helwig, Shurui Gui et al.

ICML 2024spotlightarXiv:2406.07598
19
citations
#951

Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors

Chun-Yin Huang, Kartik Srinivas, Xin Zhang et al.

ICML 2024arXiv:2405.11525
19
citations
#952

Sparse Autoencoders for Hypothesis Generation

Rajiv Movva, Kenny Peng, Nikhil Garg et al.

ICML 2025arXiv:2502.04382
19
citations
#953

HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

Andrey Bryutkin, Jiahao Huang, Zhongying Deng et al.

ICML 2024arXiv:2402.03541
19
citations
#954

ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset

Yilin Wang, Peixuan Lei, Jie Song et al.

ICML 2025oralarXiv:2506.20093
19
citations
#955

Softmax is not Enough (for Sharp Size Generalisation)

Petar Veličković, Christos Perivolaropoulos, Federico Barbero et al.

ICML 2025arXiv:2410.01104
19
citations
#956

KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search

Haoran Luo, Haihong E, Yikai Guo et al.

ICML 2025arXiv:2501.18922
19
citations
#957

Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition

Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.

ICML 2024arXiv:2407.12332
19
citations
#958

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

Hongkang Li, Meng Wang, Tengfei Ma et al.

ICML 2024arXiv:2406.01977
19
citations
#959

Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

Drew Prinster, Samuel Stanton, Anqi Liu et al.

ICML 2024arXiv:2405.06627
19
citations
#960

Interpreting CLIP with Hierarchical Sparse Autoencoders

Vladimir Zaigrajew, Hubert Baniecki, Przemysław Biecek

ICML 2025arXiv:2502.20578
19
citations
#961

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

sili huang, Jifeng Hu, Hechang Chen et al.

ICML 2024arXiv:2405.20692
19
citations
#962

Liouville Flow Importance Sampler

Yifeng Tian, Nishant Panda, Yen Ting Lin

ICML 2024arXiv:2405.06672
19
citations
#963

Pre-training Auto-regressive Robotic Models with 4D Representations

Dantong Niu, Yuvan Sharma, Haoru Xue et al.

ICML 2025arXiv:2502.13142
19
citations
#964

Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains

Levi Lingsch, Mike Yan Michelis, Emmanuel de Bézenac et al.

ICML 2024arXiv:2305.19663
19
citations
#965

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Wu Lin, Felix Dangel, Runa Eschenhagen et al.

ICML 2024arXiv:2402.03496
19
citations
#966

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Bingheng Li, Linxin Yang, Yupeng Chen et al.

ICML 2024arXiv:2406.01908
19
citations
#967

Position: Leverage Foundational Models for Black-Box Optimization

Xingyou Song, Yingtao Tian, Robert Lange et al.

ICML 2024arXiv:2405.03547
19
citations
#968

CHEMREASONER: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback

Henry W. Sprueill, Carl Edwards, Khushbu Agarwal et al.

ICML 2024arXiv:2402.10980
19
citations
#969

Empowering Graph Invariance Learning with Deep Spurious Infomax

Tianjun Yao, Yongqiang Chen, Zhenhao Chen et al.

ICML 2024arXiv:2407.11083
19
citations
#970

Reinforce LLM Reasoning through Multi-Agent Reflection

Yurun Yuan, Tengyang Xie

ICML 2025arXiv:2506.08379
19
citations
#971

Adversaries Can Misuse Combinations of Safe Models

Erik Jones, Anca Dragan, Jacob Steinhardt

ICML 2025arXiv:2406.14595
19
citations
#972

Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment

Shuo Wang, Bokui Wang, Zhixiang Shen et al.

ICML 2025arXiv:2502.02017
19
citations
#973

Temporal Query Network for Efficient Multivariate Time Series Forecasting

Shengsheng Lin, Haojun Chen, Haijie Wu et al.

ICML 2025oralarXiv:2505.12917
19
citations
#974

FairProof : Confidential and Certifiable Fairness for Neural Networks

Chhavi Yadav, Amrita Roy Chowdhury, Dan Boneh et al.

ICML 2024arXiv:2402.12572
19
citations
#975

Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning

Wenke Huang, Jian Liang, Zekun Shi et al.

ICML 2025arXiv:2411.10928
19
citations
#976

Improved Generalization of Weight Space Networks via Augmentations

Aviv Shamsian, Aviv Navon, David Zhang et al.

ICML 2024arXiv:2402.04081
19
citations
#977

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models

Jaehoon Hahm, Junho Lee, Sunghyun Kim et al.

ICML 2024arXiv:2407.11451
19
citations
#978

Automated Benchmark Generation for Repository-Level Coding Tasks

Konstantinos Vergopoulos, Mark Müller, Martin Vechev

ICML 2025arXiv:2503.07701
19
citations
#979

Subobject-level Image Tokenization

Delong Chen, Samuel Cahyawijaya, Jianfeng Liu et al.

ICML 2025arXiv:2402.14327
19
citations
#980

Drug Discovery with Dynamic Goal-aware Fragments

Seul Lee, Seanie Lee, Kenji Kawaguchi et al.

ICML 2024arXiv:2310.00841
19
citations
#981

Memory Layers at Scale

Vincent-Pierre Berges, Barlas Oğuz, Daniel HAZIZA et al.

ICML 2025arXiv:2412.09764
19
citations
#982

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

Shangbin Feng, Zifeng Wang, Yike Wang et al.

ICML 2025arXiv:2410.11163
19
citations
#983

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Zehan Wang, Ziang Zhang, xize cheng et al.

ICML 2024arXiv:2405.04883
19
citations
#984

Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion

Ruixiang Zhang, Shuangfei Zhai, Yizhe Zhang et al.

ICML 2025arXiv:2504.16431
19
citations
#985

Explorations of Self-Repair in Language Models

Cody Rushing, Neel Nanda

ICML 2024arXiv:2402.15390
19
citations
#986

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Fabian Schaipp, Alexander Hägele, Adrien Taylor et al.

ICML 2025arXiv:2501.18965
19
citations
#987

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman et al.

ICML 2024oralarXiv:2402.10211
19
citations
#988

Position: Optimization in SciML Should Employ the Function Space Geometry

Johannes Müller, Marius Zeinhofer

ICML 2024arXiv:2402.07318
19
citations
#989

Distillation of Discrete Diffusion through Dimensional Correlations

Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi et al.

ICML 2025arXiv:2410.08709
18
citations
#990

Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks

Guanhua Zhang, Moritz Hardt

ICML 2024oralarXiv:2405.01719
18
citations
#991

KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference

Xing Li, Zeyu Xing, Yiming Li et al.

ICML 2025arXiv:2502.04420
18
citations
#992

The Privacy Power of Correlated Noise in Decentralized Learning

Youssef Allouah, Anastasiia Koloskova, Aymane Firdoussi et al.

ICML 2024arXiv:2405.01031
18
citations
#993

Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

Jordan Dotzel, Yuzong Chen, Bahaa Kotb et al.

ICML 2024arXiv:2405.03103
18
citations
#994

On the Expressive Power of Spectral Invariant Graph Neural Networks

Bohang Zhang, Lingxiao Zhao, Haggai Maron

ICML 2024arXiv:2406.04336
18
citations
#995

Multi-Source Conformal Inference Under Distribution Shift

Yi Liu, Alexander Levis, Sharon-Lise Normand et al.

ICML 2024arXiv:2405.09331
18
citations
#996

Scaling Tractable Probabilistic Circuits: A Systems Perspective

Anji Liu, Kareem Ahmed, Guy Van den Broeck

ICML 2024arXiv:2406.00766
18
citations
#997

MCU: An Evaluation Framework for Open-Ended Game Agents

Xinyue Zheng, Haowei Lin, Kaichen He et al.

ICML 2025spotlightarXiv:2310.08367
18
citations
#998

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang et al.

ICML 2024arXiv:2106.08414
18
citations
#999

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Saurabh Jha, Rohan Arora, Yuji Watanabe et al.

ICML 2025oralarXiv:2502.05352
18
citations
#1000

Understanding the Learning Dynamics of Alignment with Human Feedback

Shawn Im, Sharon Li

ICML 2024arXiv:2403.18742
18
citations