Most Cited ICML "subject-verb-object" Papers

5,975 papers found • Page 5 of 30

#801

Position: Why We Must Rethink Empirical Research in Machine Learning

Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger et al.

ICML 2024 • arXiv:2405.02200

#802

Equivariant Diffusion for Crystal Structure Prediction

Peijia Lin, Pin Chen, Rui Jiao et al.

ICML 2024 • arXiv:2512.07289

#803

Position: Key Claims in LLM Research Have a Long Tail of Footnotes

Anna Rogers, Sasha Luccioni

ICML 2024 • arXiv:2308.07120

#804

Comparing Graph Transformers via Positional Encodings

Mitchell Black, Zhengchao Wan, Gal Mishne et al.

ICML 2024 • arXiv:2402.14202

#805

Efficient and Effective Time-Series Forecasting with Spiking Neural Networks

Changze Lv, Yansen Wang, Dongqi Han et al.

ICML 2024 • oral • arXiv:2402.01533

#806

On Mechanistic Knowledge Localization in Text-to-Image Generative Models

Samyadeep Basu, Keivan Rezaei, Priyatham Kattakinda et al.

ICML 2024 • arXiv:2405.01008

#807

Sampling in Unit Time with Kernel Fisher-Rao Flow

Aimee Maurais, Youssef Marzouk

ICML 2024 • arXiv:2401.03892

#808

Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge

Hanna Wallach, Meera Desai, A. Feder Cooper et al.

ICML 2025 • arXiv:2502.00561

#809

Compute Better Spent: Replacing Dense Layers with Structured Matrices

Shikai Qiu, Andres Potapczynski, Marc Finzi et al.

ICML 2024 • arXiv:2406.06248

#810

PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels

Praneeth Kacham, Vahab Mirrokni, Peilin Zhong

ICML 2024 • arXiv:2310.01655

#811

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

Jiachen Wang, Tianji Yang, James Zou et al.

ICML 2024 • arXiv:2405.03875

#812

DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems

Yair Schiff, Zhong Yi Wan, Jeffrey Parker et al.

ICML 2024 • arXiv:2402.04467

#813

Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning

Puning Yang, Qizhou Wang, Zhuo Huang et al.

ICML 2025 • arXiv:2505.11953

#814

From Language Models over Tokens to Language Models over Characters

Tim Vieira, Benjamin LeBrun, Mario Giulianelli et al.

ICML 2025 • spotlight • arXiv:2412.03719

#815

On Least Square Estimation in Softmax Gating Mixture of Experts

Huy Nguyen, Nhat Ho, Alessandro Rinaldo

ICML 2024 • arXiv:2402.02952

#816

Optimizing Language Models for Inference Time Objectives using Reinforcement Learning

Yunhao Tang, Kunhao Zheng, Gabriel Synnaeve et al.

ICML 2025 • arXiv:2503.19595

#817

Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

Zhuo Huang, Chang Liu, Yinpeng Dong et al.

ICML 2024 • arXiv:2312.02546

#818

Mastering Board Games by External and Internal Planning with Language Models

John Schultz, Jakub Adamek, Matej Jusup et al.

ICML 2025 • spotlight • arXiv:2412.12119

#819

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Zhenyu He, Guhao Feng, Shengjie Luo et al.

ICML 2024 • arXiv:2401.16421

#820

The Good, The Bad, and Why: Unveiling Emotions in Generative AI

Cheng Li, Jindong Wang, Yixuan Zhang et al.

ICML 2024 • arXiv:2312.11111

#821

Structure-based drug design by denoising voxel grids

Pedro O. Pinheiro, Arian Jamasb, Omar Mahmood et al.

ICML 2024 • arXiv:2405.03961

#822

A Simple Model of Inference Scaling Laws

Noam Levi

ICML 2025 • arXiv:2410.16377

#823

TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems

Si-Yang Liu, Han-Jia Ye

ICML 2025 • arXiv:2502.02527

#824

Hyperbolic Geometric Latent Diffusion Model for Graph Generation

Xingcheng Fu, Yisen Gao, Yuecen Wei et al.

ICML 2024 • arXiv:2405.03188

#825

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Jon Saad-Falcon, Daniel Y Fu, Simran Arora et al.

ICML 2024 • arXiv:2402.07440

#826

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Zihan Liu, Shuangrui Ding, Zhixiong Zhang et al.

ICML 2025 • arXiv:2502.13128

#827

M+: Extending MemoryLLM with Scalable Long-Term Memory

Yu Wang, Dmitry Krotov, Yuanzhe Hu et al.

ICML 2025 • arXiv:2502.00592

#828

Automated Hypothesis Validation with Agentic Sequential Falsifications

Kexin Huang, Ying Jin, Ryan Li et al.

ICML 2025 • arXiv:2502.09858

#829

TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge

Young Kwon, Rui Li, Stylianos Venieris et al.

ICML 2024 • arXiv:2307.09988

#830

Projecting Molecules into Synthesizable Chemical Spaces

Shitong Luo, Wenhao Gao, Zuofan Wu et al.

ICML 2024 • arXiv:2406.04628

#831

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.

ICML 2024 • arXiv:2402.01567

#832

Reinforced Lifelong Editing for Language Models

Zherui Li, Houcheng Jiang, Hao Chen et al.

ICML 2025 • arXiv:2502.05759

#833

Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

Jiahan Zhang, Qi Wei, Feng Liu et al.

ICML 2024 • arXiv:2406.10502

#834

STEER: Assessing the Economic Rationality of Large Language Models

Narun Raman, Taylor Lundy, Samuel Joseph Amouyal et al.

ICML 2024 • arXiv:2402.09552

#835

Attribution-based Explanations that Provide Recourse Cannot be Robust

Hidde Fokkema, Rianne de Heide, Tim van Erven

ICML 2024 • arXiv:2205.15834

#836

How Private are DP-SGD Implementations?

Lynn Chua, Badih Ghazi, Pritish Kamath et al.

ICML 2024 • arXiv:2403.17673

#837

Nonparametric Modern Hopfield Models

Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu et al.

ICML 2025 • arXiv:2404.03900

#838

Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs

Youhe Jiang, Fangcheng Fu, Xiaozhe Yao et al.

ICML 2025 • arXiv:2502.00722

#839

Latent Space Symmetry Discovery

Jianke Yang, Nima Dehmamy, Robin Walters et al.

ICML 2024 • arXiv:2310.00105

#840

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

Yuwei Zeng, Yao Mu, Lin Shao

ICML 2024 • arXiv:2405.07162

#841

RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching

Divya Nori, Wengong Jin

ICML 2024 • arXiv:2405.18768

#842

DPZero: Private Fine-Tuning of Language Models without Backpropagation

Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.

ICML 2024 • arXiv:2310.09639

#843

Optimal Ridge Regularization for Out-of-Distribution Prediction

Pratik Patil, Jin-Hong Du, Ryan Tibshirani

ICML 2024 • spotlight • arXiv:2404.01233

#844

Is Noise Conditioning Necessary for Denoising Generative Models?

Qiao Sun, Zhicheng Jiang, Hanhong Zhao et al.

ICML 2025 • arXiv:2502.13129

#845

OneForecast: A Universal Framework for Global and Regional Weather Forecasting

Yuan Gao, Hao Wu, Ruiqi Shu et al.

ICML 2025 • arXiv:2502.00338

#846

Causal Representation Learning Made Identifiable by Grouping of Observational Variables

Hiroshi Morioka, Aapo Hyvarinen

ICML 2024 • oral • arXiv:2310.15709

#847

Towards a General Time Series Forecasting Model with Unified Representation and Adaptive Transfer

Yihang Wang, Yuying Qiu, Peng Chen et al.

ICML 2025 • arXiv:2405.17478

#848

Rethinking Momentum Knowledge Distillation in Online Continual Learning

Nicolas Michel, Maorong Wang, Ling Xiao et al.

ICML 2024 • arXiv:2309.02870

#849

Estimating Canopy Height at Scale

Jan Pauls, Max Zimmer, Una Kelly et al.

ICML 2024 • arXiv:2406.01076

#850

Position: Understanding LLMs Requires More Than Statistical Generalization

Patrik Reizinger, Szilvia Ujváry, Anna Mészáros et al.

ICML 2024 • spotlight • arXiv:2405.01964

#851

Position: AI Evaluation Should Learn from How We Test Humans

Yan Zhuang, Qi Liu, Zachary Pardos et al.

ICML 2025 • arXiv:2306.10512

#852

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

Yufei Huang, Odin Zhang, Lirong Wu et al.

ICML 2024 • spotlight • arXiv:2402.11459

#853

Compute or Load KV Cache? Why Not Both?

Shuowei Jin, Xueshen Liu, Qingzhao Zhang et al.

ICML 2025 • arXiv:2410.03065

#854

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems

Yifan Xia, Xianliang Yang, Zichuan Liu et al.

ICML 2024 • arXiv:2406.03503

#855

Towards Theoretical Understandings of Self-Consuming Generative Models

Shi Fu, Sen Zhang, Yingjie Wang et al.

ICML 2024 • arXiv:2402.11778

#856

Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning

Kai Gan, Tong Wei

ICML 2024 • arXiv:2405.11756

#857

A Sparsity Principle for Partially Observable Causal Representation Learning

Danru Xu, Dingling Yao, Sébastien Lachapelle et al.

ICML 2024 • arXiv:2403.08335

#858

Fixing the Double Penalty in Data-Driven Weather Forecasting Through a Modified Spherical Harmonic Loss Function

Christopher Subich, Syed Husain, Leo Separovic et al.

ICML 2025 • arXiv:2501.19374

#859

Generalization in Kernel Regression Under Realistic Assumptions

Daniel Barzilai, Ohad Shamir

ICML 2024 • spotlight • arXiv:2312.15995

#860

Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Ermo Hua, Che Jiang, Xingtai Lv et al.

ICML 2025 • arXiv:2412.17739

#861

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Nayoung Lee, Jack Cai, Avi Schwarzschild et al.

ICML 2025 • arXiv:2502.01612

#862

Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic Encryption

Itamar Zimerman, Moran Baruch, Nir Drucker et al.

ICML 2024 • arXiv:2311.08610

#863

Approximate Nearest Neighbor Search with Window Filters

Josh Engels, Ben Landrum, Shangdi Yu et al.

ICML 2024 • arXiv:2402.00943

#864

Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

Zixuan Wang, Stanley Wei, Daniel Hsu et al.

ICML 2024 • arXiv:2406.06893

#865

Discovering Environments with XRM

Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim et al.

ICML 2024 • arXiv:2309.16748

#866

Delta Decompression for MoE-based LLMs Compression

Hao Gu, Wei Li, Lujun Li et al.

ICML 2025 • arXiv:2502.17298

#867

Membership Inference Attacks on Diffusion Models via Quantile Regression

Shuai Tang, Steven Wu, Sergul Aydore et al.

ICML 2024 • arXiv:2312.05140

#868

Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning

Zeyu Gan, Yun Liao, Yong Liu

ICML 2025 • arXiv:2501.15602

#869

TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

Jiaxiang Dong, Haixu Wu, Yuxuan Wang et al.

ICML 2024 • oral • arXiv:2402.02475

#870

Monte Carlo Tree Diffusion for System 2 Planning

Jaesik Yoon, Hyeonseo Cho, Doojin Baek et al.

ICML 2025 • spotlight • arXiv:2502.07202

#871

Reducing Tool Hallucination via Reliability Alignment

Hongshen Xu, Zichen Zhu, Lei Pan et al.

ICML 2025 • arXiv:2412.04141

#872

Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-Loop and Hessian-Free Solution Strategy

Risheng Liu, Zhu Liu, Wei Yao et al.

ICML 2024 • arXiv:2405.09927

#873

Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding

Tian Jin, Ellie Cheng, Zachary Ankner et al.

ICML 2025 • arXiv:2502.11517

#874

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

Tianying Ji, Yu Luo, Fuchun Sun et al.

ICML 2024 • arXiv:2306.02865

#875

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Miltiadis Allamanis, Sheena Panthaplackel, Pengcheng Yin

ICML 2024 • arXiv:2402.08699

#876

Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents

Shayan Kiyani, George Pappas, Aaron Roth et al.

ICML 2025 • spotlight • arXiv:2502.02561

#877

Residual Quantization with Implicit Neural Codebooks

Iris Huijben, Matthijs Douze, Matthew Muckley et al.

ICML 2024 • arXiv:2401.14732

#878

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead

Rickard Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj et al.

ICML 2025 • arXiv:2407.00066

#879

Memorization Through the Lens of Curvature of Loss Function Around Samples

Isha Garg, Deepak Ravikumar, Kaushik Roy

ICML 2024 • spotlight • arXiv:2307.05831

#880

InfAlign: Inference-aware language model alignment

Ananth Balashankar, Ziteng Sun, Jonathan Berant et al.

ICML 2025 • arXiv:2412.19792

#881

Improving LLM Safety Alignment with Dual-Objective Optimization

Xuandong Zhao, Will Cai, Tianneng Shi et al.

ICML 2025 • arXiv:2503.03710

#882

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

Yi Yu, Yufei Wang, Song Xia et al.

ICML 2024 • arXiv:2405.01460

#883

TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting

Peiyuan Liu, Beiliang Wu, Yifan Hu et al.

ICML 2025 • arXiv:2410.04442

#884

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

Yilun Xu, Gabriele Corso, Tommi Jaakkola et al.

ICML 2024 • arXiv:2407.03300

#885

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

Chuanqi Cheng, Jian Guan, Wei Wu et al.

ICML 2025 • oral • arXiv:2504.02438

#886

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation

Kai Li, Runxuan Yang, Fuchun Sun et al.

ICML 2024 • oral • arXiv:2308.08143

#887

Compositional Few-Shot Class-Incremental Learning

Yixiong Zou, Shanghang Zhang, Haichen Zhou et al.

ICML 2024 • arXiv:2405.17022

#888

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

Mo Yu, Qiujing Wang, Shunchi Zhang et al.

ICML 2024 • arXiv:2211.04684

#889

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

Songyang Gao, Qiming Ge, Wei Shen et al.

ICML 2024 • arXiv:2401.11458

#890

Understanding the Effects of Iterative Prompting on Truthfulness

Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

ICML 2024 • arXiv:2402.06625

#891

A sampling theory perspective on activations for implicit neural representations

Hemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko et al.

ICML 2024 • arXiv:2402.05427

#892

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

Kaijie Zhu, Xianjun Yang, Jindong Wang et al.

ICML 2025 • arXiv:2502.05174

#893

ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification

Hyunseok Lee, Seunghyuk Oh, Jaehyung Kim et al.

ICML 2025 • arXiv:2502.14565

#894

BaxBench: Can LLMs Generate Correct and Secure Backends?

Mark Vero, Niels Mündler, Viktor Chibotaru et al.

ICML 2025 • spotlight • arXiv:2502.11844

#895

How to Trace Latent Generative Model Generated Images without Artificial Watermark?

Zhenting Wang, Vikash Sehwag, Chen Chen et al.

ICML 2024 • arXiv:2405.13360

#896

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation

Daeun Lee, Jaehong Yoon, Sung Ju Hwang

ICML 2024 • arXiv:2402.08712

#897

Graph-based Time Series Clustering for End-to-End Hierarchical Forecasting

Andrea Cini, Danilo Mandic, Cesare Alippi

ICML 2024 • arXiv:2305.19183

#898

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.

ICML 2024 • arXiv:2403.01857

#899

Homomorphism Counts for Graph Neural Networks: All About That Basis

Emily Jin, Michael Bronstein, Ismail Ceylan et al.

ICML 2024 • arXiv:2402.08595

#900

Gaussian Processes on Cellular Complexes

Mathieu Alain, So Takao, Brooks Paige et al.

ICML 2024 • arXiv:2311.01198

#901

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Feng Wang, Yaodong Yu, Wei Shao et al.

ICML 2025 • arXiv:2502.03738

#902

Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Shibo Jie, Yehui Tang, Ning Ding et al.

ICML 2024 • arXiv:2405.05615

#903

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

Rui Wang, Elyssa Hofgard, Han Gao et al.

ICML 2024 • arXiv:2310.02299

#904

Symmetry Induces Structure and Constraint of Learning

Liu Ziyin

ICML 2024 • arXiv:2309.16932

#905

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Zhuofan Zong, Dongzhi Jiang, Bingqi Ma et al.

ICML 2025 • arXiv:2412.09618

#906

BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute

Dujian Ding, Ankur Mallick, Shaokun Zhang et al.

ICML 2025 • arXiv:2506.22716

#907

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Jongwoo Ko, Tianyi Chen, Sungnyun Kim et al.

ICML 2025 • oral • arXiv:2503.07067

#908

Wyckoff Transformer: Generation of Symmetric Crystals

Nikita Kazeev, Wei Nong, Ignat Romanov et al.

ICML 2025 • arXiv:2503.02407

#909

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

Pranjal Aggarwal, Bryan Parno, Sean Welleck

ICML 2025 • arXiv:2412.06176

#910

What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks

Xingwu Chen, Difan Zou

ICML 2024 • arXiv:2404.01601

#911

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Thomas Zeng, Shuibai Zhang, Shutong Wu et al.

ICML 2025 • oral • arXiv:2502.06737

#912

LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering

Li Sun, Zhenhao Huang, Hao Peng et al.

ICML 2024 • arXiv:2405.11801

#913

CRANE: Reasoning with constrained LLM generation

Debangshu Banerjee, Tarun Suresh, Shubham Ugare et al.

ICML 2025 • arXiv:2502.09061

#914

SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN

Kang You, Zekai Xu, Chen Nie et al.

ICML 2024 • arXiv:2406.03470

#915

StableMask: Refining Causal Masking in Decoder-only Transformer

Qingyu Yin, Xuzheng He, Xiang Zhuang et al.

ICML 2024 • arXiv:2402.04779

#916

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal

ICML 2024 • arXiv:2401.16467

#917

LEAPS: A discrete neural sampler via locally equivariant networks

Peter Holderrieth, Michael Albergo, Tommi Jaakkola

ICML 2025 • arXiv:2502.10843

#918

Investigating Non-Transitivity in LLM-as-a-Judge

Yi Xu, Laura Ruis, Tim Rocktäschel et al.

ICML 2025 • spotlight • arXiv:2502.14074

#919

P(all-atom) Is Unlocking New Path For Protein Design

Wei Qu, Jiawei Guan, Rui Ma et al.

ICML 2025 • spotlight

#920

Great Models Think Alike and this Undermines AI Oversight

Shashwat Goel, Joschka Strüber, Ilze Amanda Auzina et al.

ICML 2025 • spotlight • arXiv:2502.04313

#921

Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?

Huy Nguyen, Pedram Akbarian, Nhat Ho

ICML 2024 • arXiv:2401.13875

#922

KAN-AD: Time Series Anomaly Detection with Kolmogorov–Arnold Networks

Quan Zhou, Changhua Pei, Fei Sun et al.

ICML 2025 • arXiv:2411.00278

#923

Position: Uncertainty Quantification Needs Reassessment for Large Language Model Agents

Michael Kirchhof, Gjergji Kasneci, Enkelejda Kasneci

ICML 2025 • arXiv:2505.22655

#924

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, Zhong-Yu Li, Quan-Sheng Zeng et al.

ICML 2024 • arXiv:2406.00670

#925

CLIPZyme: Reaction-Conditioned Virtual Screening of Enzymes

Peter Mikhael, Itamar Chinn, Regina Barzilay

ICML 2024 • arXiv:2402.06748

#926

Graph-based Forecasting with Missing Data through Spatiotemporal Downsampling

Ivan Marisca, Cesare Alippi, Filippo Maria Bianchi

ICML 2024 • oral • arXiv:2402.10634

#927

OptMATH: A Scalable Bidirectional Data Synthesis Framework for Optimization Modeling

Hongliang Lu, Zhonglin Xie, Yaoyu Wu et al.

ICML 2025 • arXiv:2502.11102

#928

Rethinking Aleatoric and Epistemic Uncertainty

Freddie Bickford Smith, Jannik Kossen, Eleanor Trollope et al.

ICML 2025 • arXiv:2412.20892

#929

On the Guidance of Flow Matching

Ruiqi Feng, Chenglei Yu, Wenhao Deng et al.

ICML 2025 • spotlight • arXiv:2502.02150

#930

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs – No Silver Bullet for LC or RAG Routing

Kuan Li, Liwen Zhang, Yong Jiang et al.

ICML 2025 • arXiv:2502.09977

#931

MoMo: Momentum Models for Adaptive Learning Rates

Fabian Schaipp, Ruben Ohana, Michael Eickenberg et al.

ICML 2024 • arXiv:2305.07583

#932

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

Ajay Jaiswal, Yifan Wang, Lu Yin et al.

ICML 2025 • arXiv:2407.11239

#933

Interpretability Illusions in the Generalization of Simplified Models

Dan Friedman, Andrew Lampinen, Lucas Dixon et al.

ICML 2024 • arXiv:2312.03656

#934

Parrot: Multilingual Visual Instruction Tuning

Hai-Long Sun, Da-Wei Zhou, Yang Li et al.

ICML 2025 • arXiv:2406.02539

#935

Neural Networks Learn Statistics of Increasing Complexity

Nora Belrose, Quintin Pope, Lucia Quirke et al.

ICML 2024 • arXiv:2402.04362

#936

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang et al.

ICML 2024 • arXiv:2401.13311

#937

Local vs. Global Interpretability: A Computational Complexity Perspective

Shahaf Bassan, Guy Amir, Guy Katz

ICML 2024 • spotlight • arXiv:2406.02981

#938

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Zhenglin Zhou, Xiaobo Xia, Fan Ma et al.

ICML 2025 • arXiv:2502.04370

#939

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings

Kevin Frans, Seohong Park, Pieter Abbeel et al.

ICML 2024 • spotlight • arXiv:2402.17135

#940

Inference-Time Alignment of Diffusion Models with Direct Noise Optimization

Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang et al.

ICML 2025 • arXiv:2405.18881

#941

Auditing $f$-differential privacy in one run

Saeed Mahloujifar, Luca Melis, Kamalika Chaudhuri

ICML 2025 • oral • arXiv:2410.22235

#942

Emoji Attack: Enhancing Jailbreak Attacks Against Judge LLM Detection

Zhipeng Wei, Yuqi Liu, N. Benjamin Erichson

ICML 2025 • arXiv:2411.01077

#943

Adaptive Message Passing: A General Framework to Mitigate Oversmoothing, Oversquashing, and Underreaching

Federico Errica, Henrik Christiansen, Viktor Zaverkin et al.

ICML 2025 • arXiv:2312.16560

#944

Prompting a Pretrained Transformer Can Be a Universal Approximator

Aleksandar Petrov, Phil Torr, Adel Bibi

ICML 2024 • arXiv:2402.14753

#945

Universal Length Generalization with Turing Programs

Kaiying Hou, David Brandfonbrener, Sham Kakade et al.

ICML 2025 • arXiv:2407.03310

#946

Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models

Yukang Yang, Declan Campbell, Kaixuan Huang et al.

ICML 2025 • arXiv:2502.20332

#947

Flexible and Efficient Grammar-Constrained Decoding

Kanghee Park, Timothy Zhou, Loris D'Antoni

ICML 2025 • arXiv:2502.05111

#948

Conformal Prediction with Learned Features

Shayan Kiyani, George J. Pappas, Hamed Hassani

ICML 2024 • arXiv:2404.17487

#949

Trustless Audits without Revealing Data or Models

Suppakit Waiwitlikhit, Ion Stoica, Yi Sun et al.

ICML 2024 • arXiv:2404.04500

#950

Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

Yuchao Lin, Jacob Helwig, Shurui Gui et al.

ICML 2024 • spotlight • arXiv:2406.07598

#951

Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors

Chun-Yin Huang, Kartik Srinivas, Xin Zhang et al.

ICML 2024 • arXiv:2405.11525

#952

Sparse Autoencoders for Hypothesis Generation

Rajiv Movva, Kenny Peng, Nikhil Garg et al.

ICML 2025 • arXiv:2502.04382

#953

HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

Andrey Bryutkin, Jiahao Huang, Zhongying Deng et al.

ICML 2024 • arXiv:2402.03541

#954

ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset

Yilin Wang, Peixuan Lei, Jie Song et al.

ICML 2025 • oral • arXiv:2506.20093

#955

Softmax is not Enough (for Sharp Size Generalisation)

Petar Veličković, Christos Perivolaropoulos, Federico Barbero et al.

ICML 2025 • arXiv:2410.01104

#956

KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search

Haoran Luo, Haihong E, Yikai Guo et al.

ICML 2025 • arXiv:2501.18922

#957

Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition

Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.

ICML 2024 • arXiv:2407.12332

#958

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

Hongkang Li, Meng Wang, Tengfei Ma et al.

ICML 2024 • arXiv:2406.01977

#959

Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

Drew Prinster, Samuel Stanton, Anqi Liu et al.

ICML 2024 • arXiv:2405.06627

#960

Interpreting CLIP with Hierarchical Sparse Autoencoders

Vladimir Zaigrajew, Hubert Baniecki, Przemysław Biecek

ICML 2025 • arXiv:2502.20578

#961

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

Sili Huang, Jifeng Hu, Hechang Chen et al.

ICML 2024 • arXiv:2405.20692

#962

Liouville Flow Importance Sampler

Yifeng Tian, Nishant Panda, Yen Ting Lin

ICML 2024 • arXiv:2405.06672

#963

Pre-training Auto-regressive Robotic Models with 4D Representations

Dantong Niu, Yuvan Sharma, Haoru Xue et al.

ICML 2025 • arXiv:2502.13142

#964

Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains

Levi Lingsch, Mike Yan Michelis, Emmanuel de Bézenac et al.

ICML 2024 • arXiv:2305.19663

#965

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Wu Lin, Felix Dangel, Runa Eschenhagen et al.

ICML 2024 • arXiv:2402.03496

#966

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Bingheng Li, Linxin Yang, Yupeng Chen et al.

ICML 2024 • arXiv:2406.01908

#967

Position: Leverage Foundational Models for Black-Box Optimization

Xingyou Song, Yingtao Tian, Robert Lange et al.

ICML 2024 • arXiv:2405.03547

#968

CHEMREASONER: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback

Henry W. Sprueill, Carl Edwards, Khushbu Agarwal et al.

ICML 2024 • arXiv:2402.10980

#969

Empowering Graph Invariance Learning with Deep Spurious Infomax

Tianjun Yao, Yongqiang Chen, Zhenhao Chen et al.

ICML 2024 • arXiv:2407.11083

#970

Reinforce LLM Reasoning through Multi-Agent Reflection

Yurun Yuan, Tengyang Xie

ICML 2025 • arXiv:2506.08379

#971

Adversaries Can Misuse Combinations of Safe Models

Erik Jones, Anca Dragan, Jacob Steinhardt

ICML 2025 • arXiv:2406.14595

#972

Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment

Shuo Wang, Bokui Wang, Zhixiang Shen et al.

ICML 2025 • arXiv:2502.02017

#973

Temporal Query Network for Efficient Multivariate Time Series Forecasting

Shengsheng Lin, Haojun Chen, Haijie Wu et al.

ICML 2025 • oral • arXiv:2505.12917

#974

FairProof: Confidential and Certifiable Fairness for Neural Networks

Chhavi Yadav, Amrita Roy Chowdhury, Dan Boneh et al.

ICML 2024 • arXiv:2402.12572

#975

Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning

Wenke Huang, Jian Liang, Zekun Shi et al.

ICML 2025 • arXiv:2411.10928

#976

Improved Generalization of Weight Space Networks via Augmentations

Aviv Shamsian, Aviv Navon, David Zhang et al.

ICML 2024 • arXiv:2402.04081

#977

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models

Jaehoon Hahm, Junho Lee, Sunghyun Kim et al.

ICML 2024 • arXiv:2407.11451

#978

Automated Benchmark Generation for Repository-Level Coding Tasks

Konstantinos Vergopoulos, Mark Müller, Martin Vechev

ICML 2025 • arXiv:2503.07701

#979

Subobject-level Image Tokenization

Delong Chen, Samuel Cahyawijaya, Jianfeng Liu et al.

ICML 2025 • arXiv:2402.14327

#980

Drug Discovery with Dynamic Goal-aware Fragments

Seul Lee, Seanie Lee, Kenji Kawaguchi et al.

ICML 2024 • arXiv:2310.00841

#981

Memory Layers at Scale

Vincent-Pierre Berges, Barlas Oğuz, Daniel Haziza et al.

ICML 2025 • arXiv:2412.09764

#982

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

Shangbin Feng, Zifeng Wang, Yike Wang et al.

ICML 2025 • arXiv:2410.11163

#983

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Zehan Wang, Ziang Zhang, Xize Cheng et al.

ICML 2024 • arXiv:2405.04883

#984

Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion

Ruixiang Zhang, Shuangfei Zhai, Yizhe Zhang et al.

ICML 2025 • arXiv:2504.16431

#985

Explorations of Self-Repair in Language Models

Cody Rushing, Neel Nanda

ICML 2024 • arXiv:2402.15390

#986

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Fabian Schaipp, Alexander Hägele, Adrien Taylor et al.

ICML 2025 • arXiv:2501.18965

#987

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman et al.

ICML 2024 • oral • arXiv:2402.10211

#988

Position: Optimization in SciML Should Employ the Function Space Geometry

Johannes Müller, Marius Zeinhofer

ICML 2024 • arXiv:2402.07318

#989

Distillation of Discrete Diffusion through Dimensional Correlations

Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi et al.

ICML 2025 • arXiv:2410.08709

#990

Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks

Guanhua Zhang, Moritz Hardt

ICML 2024 • oral • arXiv:2405.01719

#991

KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference

Xing Li, Zeyu Xing, Yiming Li et al.

ICML 2025 • arXiv:2502.04420

#992

The Privacy Power of Correlated Noise in Decentralized Learning

Youssef Allouah, Anastasiia Koloskova, Aymane Firdoussi et al.

ICML 2024 • arXiv:2405.01031

#993

Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

Jordan Dotzel, Yuzong Chen, Bahaa Kotb et al.

ICML 2024 • arXiv:2405.03103

#994

On the Expressive Power of Spectral Invariant Graph Neural Networks

Bohang Zhang, Lingxiao Zhao, Haggai Maron

ICML 2024 • arXiv:2406.04336

#995

Multi-Source Conformal Inference Under Distribution Shift

Yi Liu, Alexander Levis, Sharon-Lise Normand et al.

ICML 2024 • arXiv:2405.09331

#996

Scaling Tractable Probabilistic Circuits: A Systems Perspective

Anji Liu, Kareem Ahmed, Guy Van den Broeck

ICML 2024 • arXiv:2406.00766

#997

MCU: An Evaluation Framework for Open-Ended Game Agents

Xinyue Zheng, Haowei Lin, Kaichen He et al.

ICML 2025 • spotlight • arXiv:2310.08367

#998

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang et al.

ICML 2024 • arXiv:2106.08414

#999

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Saurabh Jha, Rohan Arora, Yuji Watanabe et al.

ICML 2025 • oral • arXiv:2502.05352

#1000

Understanding the Learning Dynamics of Alignment with Human Feedback

Shawn Im, Sharon Li

ICML 2024 • arXiv:2403.18742
