Most Cited ICML "strategic agent behavior" Papers

5,975 papers found • Page 4 of 30

#601

Position: Measure Dataset Diversity, Don't Just Claim It

Dora Zhao, Jerone Andrews, Orestis Papakyriakopoulos et al.

ICML 2024 • arXiv:2407.08188

#602

A Unified Recipe for Deriving (Time-Uniform) PAC-Bayes Bounds

Ben Chugg, Hongjian Wang, Aaditya Ramdas

ICML 2024 • arXiv:2302.03421

#603

Automated Red Teaming with GOAT: the Generative Offensive Agent Tester

Maya Pavlova, Erik Brinkman, Krithika Iyer et al.

ICML 2025 • arXiv:2410.01606

#604

Light and Optimal Schrödinger Bridge Matching

Nikita Gushchin, Sergei Kholkin, Evgeny Burnaev et al.

ICML 2024 • arXiv:2402.03207

#605

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Thomas Fel, Ekdeep Singh Lubana, Jacob Prince et al.

ICML 2025 • arXiv:2502.12892

#606

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer

Han Fang, Zhihao Song, Paul Weng et al.

ICML 2024 • arXiv:2402.02317

#607

Towards Efficient Exact Optimization of Language Model Alignment

Haozhe Ji, Cheng Lu, Yilin Niu et al.

ICML 2024 • arXiv:2402.00856

#608

Case-Based or Rule-Based: How Do Transformers Do the Math?

Yi Hu, Xiaojuan Tang, Haotong Yang et al.

ICML 2024 • arXiv:2402.17709

#609

GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing

Akashah Shabbir, Ilmuz Zaman Mohammed Zumri, Mohammed Bennamoun et al.

ICML 2025 • arXiv:2501.13925

#610

Understanding Chain-of-Thought in LLMs through Information Theory

Jean-Francois Ton, Muhammad Faaiz Taufiq, Yang Liu

ICML 2025 • arXiv:2411.11984

#611

Q-value Regularized Transformer for Offline Reinforcement Learning

Shengchao Hu, Ziqing Fan, Chaoqin Huang et al.

ICML 2024 • arXiv:2405.17098

#612

Conformal Prediction Sets Improve Human Decision Making

Jesse Cresswell, Yi Sui, Bhargava Kumar et al.

ICML 2024 • arXiv:2401.13744

#613

Active Statistical Inference

Tijana Zrnic, Emmanuel J Candes

ICML 2024 • arXiv:2403.03208

#614

FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining

Dong Li, Yidi Liu, Xueyang Fu et al.

ICML 2025 • oral • arXiv:2405.19450

#615

Class-Imbalanced Graph Learning without Class Rebalancing

Zhining Liu, Ruizhong Qiu, Zhichen Zeng et al.

ICML 2024 • arXiv:2308.14181

#616

Representation Surgery: Theory and Practice of Affine Steering

Shashwat Singh, Shauli Ravfogel, Jonathan Herzig et al.

ICML 2024 • arXiv:2402.09631

#617

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Eric Zhao, Pranjal Awasthi, Sreenivas Gollapudi

ICML 2025 • arXiv:2502.01839

#618

Unifying Image Processing as Visual Prompting Question Answering

Yihao Liu, Xiangyu Chen, Xianzheng Ma et al.

ICML 2024 • arXiv:2310.10513

#619

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models

Mintong Kang, Nezihe Merve Gürel, Ning Yu et al.

ICML 2024 • arXiv:2402.03181

#620

Overtrained Language Models Are Harder to Fine-Tune

Jacob Mitchell Springer, Sachin Goyal, Kaiyue Wen et al.

ICML 2025 • arXiv:2503.19206

#621

Disentangled 3D Scene Generation with Layout Learning

Dave Epstein, Ben Poole, Ben Mildenhall et al.

ICML 2024 • arXiv:2402.16936

#622

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

Fangru Lin, Emanuele La Malfa, Valentin Hofmann et al.

ICML 2024 • arXiv:2402.02805

#623

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Guoxuan Chen, Han Shi, Jiawei Li et al.

ICML 2025 • arXiv:2412.12094

#624

Steer LLM Latents for Hallucination Detection

Seongheon Park, Xuefeng Du, Min-Hsuan Yeh et al.

ICML 2025 • arXiv:2503.01917

#625

Efficient Online Reinforcement Learning for Diffusion Policy

Haitong Ma, Tianyi Chen, Kai Wang et al.

ICML 2025 • arXiv:2502.00361

#626

Emergent Representations of Program Semantics in Language Models Trained on Programs

Charles Jin, Martin Rinard

ICML 2024 • arXiv:2305.11169

#627

UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models

Xin Xu, Qiyun Xu, Tong Xiao et al.

ICML 2025 • arXiv:2502.00334

#628

A Statistical Theory of Regularization-Based Continual Learning

Xuyang Zhao, Huiyuan Wang, Weiran Huang et al.

ICML 2024 • arXiv:2406.06213

#629

EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

Shengjie Wang, Shaohuai Liu, Weirui Ye et al.

ICML 2024 • spotlight • arXiv:2403.00564

#630

Distillation Scaling Laws

Dan Busbridge, Amitis Shidani, Floris Weers et al.

ICML 2025 • arXiv:2502.08606

#631

How to set AdamW's weight decay as you scale model and dataset size

Xi Wang, Laurence Aitchison

ICML 2025 • arXiv:2405.13698

#632

CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks

Shashank Agnihotri, Steffen Jung, Margret Keuper

ICML 2024 • arXiv:2302.02213

#633

Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Mingjia Huo, Sai Ashish Somayajula, Youwei Liang et al.

ICML 2024 • arXiv:2402.18059

#634

See More Details: Efficient Image Super-Resolution by Experts Mining

Eduard Zamfir, Zongwei Wu, Nancy Mehta et al.

ICML 2024 • arXiv:2402.03412

#635

TimeX++: Learning Time-Series Explanations with Information Bottleneck

Zichuan Liu, Tianchun Wang, Jimeng Shi et al.

ICML 2024 • arXiv:2405.09308

#636

Automated Statistical Model Discovery with Language Models

Michael Li, Emily Fox, Noah Goodman

ICML 2024 • arXiv:2402.17879

#637

Language Models as Semantic Indexers

Bowen Jin, Hansi Zeng, Guoyin Wang et al.

ICML 2024 • arXiv:2310.07815

#638

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Zhuoran Zhang, Yongxiang Li, Zijian Kan et al.

ICML 2025 • arXiv:2410.06331

#639

LLark: A Multimodal Instruction-Following Language Model for Music

Josh Gardner, Simon Durand, Daniel Stoller et al.

ICML 2024 • arXiv:2310.07160

#640

KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems

Jusheng Zhang, Zimeng Huang, Yijia Fan et al.

ICML 2025 • arXiv:2502.07350

#641

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Hongzhi Huang, Defa Zhu, Banggu Wu et al.

ICML 2025 • arXiv:2501.16975

#642

Equivariant Deep Weight Space Alignment

Aviv Navon, Aviv Shamsian, Ethan Fetaya et al.

ICML 2024 • arXiv:2310.13397

#643

Privacy Backdoors: Stealing Data with Corrupted Pretrained Models

Shanglun Feng, Florian Tramer

ICML 2024 • arXiv:2404.00473

#644

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Han Shen, Zhuoran Yang, Tianyi Chen

ICML 2024 • arXiv:2402.06886

#645

Revisiting the Power of Prompt for Visual Tuning

Yuzhu Wang, Lechao Cheng, Chaowei Fang et al.

ICML 2024 • spotlight • arXiv:2402.02382

#646

GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model

Ling Li, Yu Ye, Bingchuan Jiang et al.

ICML 2024 • arXiv:2406.18572

#647

Graph Positional and Structural Encoder

Semih Cantürk, Renming Liu, Olivier Lapointe-Gagné et al.

ICML 2024 • arXiv:2307.07107

#648

How Smooth Is Attention?

Valérie Castin, Pierre Ablin, Gabriel Peyré

ICML 2024 • arXiv:2312.14820

#649

Hybrid Inverse Reinforcement Learning

Juntao Ren, Gokul Swamy, Steven Wu et al.

ICML 2024 • oral • arXiv:2402.08848

#650

Do Efficient Transformers Really Save Computation?

Kai Yang, Jan Ackermann, Zhenyu He et al.

ICML 2024 • arXiv:2402.13934

#651

Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks

Hojoon Lee, Hyeonseo Cho, Hyunseung Kim et al.

ICML 2024 • arXiv:2406.02596

#652

AdvAgent: Controllable Blackbox Red-teaming on Web Agents

Chejian Xu, Mintong Kang, Jiawei Zhang et al.

ICML 2025 • arXiv:2410.17401

#653

Towards Scalable and Versatile Weight Space Learning

Konstantin Schürholt, Michael Mahoney, Damian Borth

ICML 2024 • arXiv:2406.09997

#654

Diving into Self-Evolving Training for Multimodal Reasoning

Wei Liu, Junlong Li, Xiwen Zhang et al.

ICML 2025 • arXiv:2412.17451

#655

BAGEL: Bootstrapping Agents by Guiding Exploration with Language

Shikhar Murty, Christopher Manning, Peter Shaw et al.

ICML 2024 • arXiv:2403.08140

#656

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Samira Abnar, Harshay Shah, Dan Busbridge et al.

ICML 2025 • arXiv:2501.12370

#657

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

Haowei Lin, Baizhou Huang, Haotian Ye et al.

ICML 2024 • arXiv:2402.02314

#658

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Zehan Wang, Ziang Zhang, Tianyu Pang et al.

ICML 2025 • arXiv:2412.18605

#659

Training Dynamics of In-Context Learning in Linear Attention

Yedi Zhang, Aaditya Singh, Peter Latham et al.

ICML 2025 • spotlight • arXiv:2501.16265

#660

Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Fan Zhou, Zengzhi Wang, Qian Liu et al.

ICML 2025 • arXiv:2409.17115

#661

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.

ICML 2025 • arXiv:2411.10438

#662

TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors

Yichuan Mo, Hui Huang, Mingjie Li et al.

ICML 2024 • arXiv:2409.05294

#663

RUN: Reversible Unfolding Network for Concealed Object Segmentation

Chunming He, Rihan Zhang, Fengyang Xiao et al.

ICML 2025 • arXiv:2501.18783

#664

LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations

Anian Ruoss, Fabio Pardo, Harris Chan et al.

ICML 2025 • arXiv:2412.01441

#665

Learning to Scale Logits for Temperature-Conditional GFlowNets

Minsu Kim, Joohwan Ko, Taeyoung Yun et al.

ICML 2024 • arXiv:2310.02823

#666

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Yunxin Li, Baotian Hu, Haoyuan Shi et al.

ICML 2024 • arXiv:2405.04950

#667

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models

Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.

ICML 2024 • arXiv:2407.12824

#668

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads

Siqi Kou, Jiachun Jin, Zhihong Liu et al.

ICML 2025 • arXiv:2412.00127

#669

Learning to Intervene on Concept Bottlenecks

David Steinmann, Wolfgang Stammer, Felix Friedrich et al.

ICML 2024 • arXiv:2308.13453

#670

Harmonizing Generalization and Personalization in Federated Prompt Learning

Tianyu Cui, Hongxia Li, Jingya Wang et al.

ICML 2024 • arXiv:2405.09771

#671

Scalable Equilibrium Sampling with Sequential Boltzmann Generators

Charlie Tan, Joey Bose, Chen Lin et al.

ICML 2025 • arXiv:2502.18462

#672

LangCell: Language-Cell Pre-training for Cell Identity Understanding

Suyuan Zhao, Jiahuan Zhang, Yushuai Wu et al.

ICML 2024 • arXiv:2405.06708

#673

SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

Xingrun Xing, Zheng Zhang, Ziyi Ni et al.

ICML 2024 • arXiv:2406.03287

#674

AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N

Tianyu Zhang, Andrew Williams, Phillip Wozny et al.

ICML 2025 • arXiv:2208.07004

#675

How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

Ryan Liu, Theodore R Sumers, Ishita Dasgupta et al.

ICML 2024 • arXiv:2402.07282

#676

Subspace Optimization for Large Language Models with Convergence Guarantees

Yutong He, Pengrui Li, Yipeng Hu et al.

ICML 2025 • arXiv:2410.11289

#677

Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks

Linyuan Gong, Sida Wang, Mostafa Elhoushi et al.

ICML 2024 • arXiv:2403.04814

#678

Long Range Propagation on Continuous-Time Dynamic Graphs

Alessio Gravina, Giulio Lovisotto, Claudio Gallicchio et al.

ICML 2024 • oral • arXiv:2406.02740

#679

MedRAX: Medical Reasoning Agent for Chest X-ray

Adibvafa Fallahpour, Jun Ma, Alif Munim et al.

ICML 2025 • arXiv:2502.02673

#680

From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models

Etowah Adams, Liam Bai, Minji Lee et al.

ICML 2025 • spotlight

#681

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models

Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin et al.

ICML 2024 • arXiv:2402.01620

#682

Make-A-Shape: a Ten-Million-scale 3D Shape Model

Ka-Hei Hui, Aditya Sanghi, Arianna Rampini et al.

ICML 2024 • arXiv:2401.11067

#683

APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

Bowen Zhao, Hannaneh Hajishirzi, Qingqing Cao

ICML 2024 • arXiv:2401.12200

#684

LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Parshin Shojaee, Ngoc Hieu Nguyen, Kazem Meidani et al.

ICML 2025 • oral • arXiv:2504.10415

#685

Generalized Interpolating Discrete Diffusion

Dimitri von Rütte, Janis Fluri, Yuhui Ding et al.

ICML 2025 • arXiv:2503.04482

#686

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Chentao Cao, Zhun Zhong, Zhanke Zhou et al.

ICML 2024 • arXiv:2406.00806

#687

Star Attention: Efficient LLM Inference over Long Sequences

Shantanu Acharya, Fei Jia, Boris Ginsburg

ICML 2025 • arXiv:2411.17116

#688

Understanding Finetuning for Factual Knowledge Extraction

Gaurav Ghosal, Tatsunori Hashimoto, Aditi Raghunathan

ICML 2024 • arXiv:2406.14785

#689

Simulation of Graph Algorithms with Looped Transformers

Artur Back de Luca, Kimon Fountoulakis

ICML 2024 • arXiv:2402.01107

#690

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Natasha Butt, Blazej Manczak, Auke Wiggers et al.

ICML 2024 • arXiv:2402.04858

#691

Fool Your (Vision and) Language Model with Embarrassingly Simple Permutations

Yongshuo Zong, Tingyang Yu, Ruchika Chavhan et al.

ICML 2024 • arXiv:2310.01651

#692

Structured Chemistry Reasoning with Large Language Models

Siru Ouyang, Zhuosheng Zhang, Bing Yan et al.

ICML 2024 • arXiv:2311.09656

#693

Graph Neural Networks Use Graphs When They Shouldn't

Maya Bechler-Speicher, Ido Amos, Ran Gilad-Bachrach et al.

ICML 2024 • arXiv:2309.04332

#694

Scaling Down Deep Learning with MNIST-1D

Sam Greydanus, Dmitry Kobak

ICML 2024 • arXiv:2011.14439

#695

TimeFilter: Patch-Specific Spatial-Temporal Graph Filtration for Time Series Forecasting

Yifan Hu, Guibin Zhang, Peiyuan Liu et al.

ICML 2025 • oral • arXiv:2501.13041

#696

Autoformulation of Mathematical Optimization Models Using LLMs

Nicolás Astorga, Tennison Liu, Yuanzhang Xiao et al.

ICML 2025 • arXiv:2411.01679

#697

Position: Explain to Question not to Justify

Przemyslaw Biecek, Wojciech Samek

ICML 2024 • arXiv:2402.13914

#698

Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models

Mingrui Wu, Jiayi Ji, Oucheng Huang et al.

ICML 2024 • arXiv:2406.16449

#699

PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning

Hyeong Kyu Choi, Sharon Li

ICML 2024 • oral • arXiv:2405.02501

#700

In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization

Herilalaina Rakotoarison, Steven Adriaensen, Neeratyoy Mallik et al.

ICML 2024 • arXiv:2404.16795

#701

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

Junjie Zhang, Chenjia Bai, Haoran He et al.

ICML 2024 • arXiv:2405.19586

#702

LQER: Low-Rank Quantization Error Reconstruction for LLMs

Cheng Zhang, Jianyi Cheng, George Constantinides et al.

ICML 2024 • arXiv:2402.02446

#703

Differentiable Weightless Neural Networks

Alan Bacellar, Zachary Susskind, Mauricio Breternitz Jr et al.

ICML 2024 • arXiv:2410.11112

#704

Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment

Harrish Thasarathan, Julian Forsyth, Thomas Fel et al.

ICML 2025 • arXiv:2502.03714

#705

EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration

Allen Nie, Yi Su, Bo Chang et al.

ICML 2025 • arXiv:2410.06238

#706

Predictive Dynamic Fusion

Bing Cao, Yinan Xia, Yi Ding et al.

ICML 2024 • arXiv:2406.04802

#707

Contrastive Localized Language-Image Pre-Training

Hong-You Chen, Zhengfeng Lai, Haotian Zhang et al.

ICML 2025 • arXiv:2410.02746

#708

Code as Reward: Empowering Reinforcement Learning with VLMs

David Venuto, Mohammad Sami Nur Islam, Martin Klissarov et al.

ICML 2024 • spotlight • arXiv:2402.04764

#709

FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning

Yuwei Fu, Haichao Zhang, Di Wu et al.

ICML 2024 • arXiv:2406.00645

#710

MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems

Rui Ye, Shuo Tang, Rui Ge et al.

ICML 2025 • arXiv:2503.03686

#711

PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

Charlie Hou, Akshat Shrivastava, Hongyuan Zhan et al.

ICML 2024 • arXiv:2406.02958

#712

Image Clustering with External Guidance

Yunfan Li, Peng Hu, Dezhong Peng et al.

ICML 2024 • arXiv:2310.11989

#713

Thermometer: Towards Universal Calibration for Large Language Models

Maohao Shen, Subhro Das, Kristjan Greenewald et al.

ICML 2024 • arXiv:2403.08819

#714

GenMol: A Drug Discovery Generalist with Discrete Diffusion

Seul Lee, Karsten Kreis, Srimukh Veccham et al.

ICML 2025 • arXiv:2501.06158

#715

BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model

Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu et al.

ICML 2024 • arXiv:2404.03830

#716

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski et al.

ICML 2024 • spotlight • arXiv:2402.02868

#717

Asymptotics of feature learning in two-layer networks after one gradient-step

Hugo Cui, Luca Pesce, Yatin Dandi et al.

ICML 2024 • spotlight • arXiv:2402.04980

#718

Generalist Equivariant Transformer Towards 3D Molecular Interaction Learning

Xiangzhe Kong, Wenbing Huang, Yang Liu

ICML 2024 • arXiv:2306.01474

#719

Decomposed Linear Dynamical Systems (dLDS) for learning the latent components of neural dynamics

Noga Mudrik, Yenho Chen, Eva Yezerets et al.

ICML 2024 • arXiv:2206.02972

#720

Out-of-Domain Generalization in Dynamical Systems Reconstruction

Niclas Göring, Florian Hess, Manuel Brenner et al.

ICML 2024 • arXiv:2402.18377

#721

Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models

Jinhao Li, Haopeng Li, Sarah Erfani et al.

ICML 2024 • arXiv:2406.02915

#722

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent

Yongxian Wei, Anke Tang, Li Shen et al.

ICML 2025 • arXiv:2501.01230

#723

T-Cal: An Optimal Test for the Calibration of Predictive Models

Donghwan Lee, Xinmeng Huang, Hamed Hassani et al.

ICML 2024 • arXiv:2203.01850

#724

PPFLOW: Target-Aware Peptide Design with Torsional Flow Matching

Haitao Lin, Odin Zhang, Huifeng Zhao et al.

ICML 2024 • arXiv:2405.06642

#725

A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts

Huy Nguyen, Pedram Akbarian, TrungTin Nguyen et al.

ICML 2024 • arXiv:2310.14188

#726

Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning

Haoxin Liu, Harshavardhan Kamarthi, Lingkai Kong et al.

ICML 2024 • oral • arXiv:2406.09130

#727

Transforming and Combining Rewards for Aligning Large Language Models

Zihao Wang, Chirag Nagpal, Jonathan Berant et al.

ICML 2024 • arXiv:2402.00742

#728

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai et al.

ICML 2024 • arXiv:2310.02905

#729

EnIGMA: Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities

Talor Abramovich, Meet Udeshi, Minghao Shao et al.

ICML 2025 • arXiv:2409.16165

#730

ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy

Kirill Vishniakov, Zhiqiang Shen, Zhuang Liu

ICML 2024 • arXiv:2311.09215

#731

How Do Large Language Monkeys Get Their Power (Laws)?

Rylan Schaeffer, Joshua Kazdan, John Hughes et al.

ICML 2025 • oral • arXiv:2502.17578

#732

Regression with Multi-Expert Deferral

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2024 • spotlight • arXiv:2403.19494

#733

Emergence of In-Context Reinforcement Learning from Noise Distillation

Ilya Zisman, Vladislav Kurenkov, Alexander Nikulin et al.

ICML 2024 • arXiv:2312.12275

#734

Outlier-robust Kalman Filtering through Generalised Bayes

Gerardo Duran-Martin, Matias Altamirano, Alex Shestopaloff et al.

ICML 2024 • arXiv:2405.05646

#735

Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning

Jinlong Pang, Na Di, Zhaowei Zhu et al.

ICML 2025 • arXiv:2502.01968

#736

ResearchTown: Simulator of Human Research Community

Haofei Yu, Zhaochen Hong, Zirui Cheng et al.

ICML 2025 • arXiv:2412.17767

#737

Learning Universal Predictors

Jordi Grau-Moya, Tim Genewein, Marcus Hutter et al.

ICML 2024 • arXiv:2401.14953

#738

Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators

Yilun Zhou, Austin Xu, PeiFeng Wang et al.

ICML 2025 • arXiv:2504.15253

#739

Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

Xinyi Wang, Alfonso Amayuelas, Kexun Zhang et al.

ICML 2024 • arXiv:2402.03268

#740

Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Jiafei Lyu, Chenjia Bai, Jing-Wen Yang et al.

ICML 2024 • arXiv:2405.15369

#741

Robust Multi-Task Learning with Excess Risks

Yifei He, Shiji Zhou, Guojun Zhang et al.

ICML 2024 • arXiv:2402.02009

#742

Critical windows: non-asymptotic theory for feature emergence in diffusion models

Marvin Li, Sitan Chen

ICML 2024 • arXiv:2403.01633

#743

Fewer Truncations Improve Language Modeling

Hantian Ding, Zijian Wang, Giovanni Paolini et al.

ICML 2024 • arXiv:2404.10830

#744

Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?

Fan Yao, Chuanhao Li, Denis Nekipelov et al.

ICML 2024 • arXiv:2402.15467

#745

Self-Consistency Preference Optimization

Archiki Prasad, Weizhe Yuan, Richard Yuanzhe Pang et al.

ICML 2025 • arXiv:2411.04109

#746

OrcaLoca: An LLM Agent Framework for Software Issue Localization

Zhongming Yu, Hejia Zhang, Yujie Zhao et al.

ICML 2025 • arXiv:2502.00350

#747

Fast Exact Unlearning for In-Context Learning Data for LLMs

Andrei Muresanu, Anvith Thudi, Michael Zhang et al.

ICML 2025 • arXiv:2402.00751

#748

Decomposing and Editing Predictions by Modeling Model Computation

Harshay Shah, Andrew Ilyas, Aleksander Madry

ICML 2024 • arXiv:2404.11534

#749

Accelerating Parallel Sampling of Diffusion Models

Zhiwei Tang, Jiasheng Tang, Hao Luo et al.

ICML 2024 • arXiv:2402.09970

#750

FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering

Yongxin Guo, Xiaoying Tang, Tao Lin

ICML 2024 • arXiv:2301.12379

#751

In-Context Reinforcement Learning for Variable Action Spaces

Viacheslav Sinii, Alexander Nikulin, Vladislav Kurenkov et al.

ICML 2024 • arXiv:2312.13327

#752

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICML 2024 • arXiv:2306.04815

#753

A Unified Approach to Routing and Cascading for LLMs

Jasper Dekoninck, Maximilian Baader, Martin Vechev

ICML 2025 • arXiv:2410.10347

#754

Addressing Misspecification in Simulation-based Inference through Data-driven Calibration

Antoine Wehenkel, Juan L. Gamella, Ozan Sener et al.

ICML 2025 • oral • arXiv:2405.08719

#755

Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation

Can Yaras, Peng Wang, Laura Balzano et al.

ICML 2024 • arXiv:2406.04112

#756

Towards Certified Unlearning for Deep Neural Networks

Binchi Zhang, Yushun Dong, Tianhao Wang et al.

ICML 2024 • arXiv:2408.00920

#757

Stochastic Localization via Iterative Posterior Sampling

Louis Grenioux, Maxence Noble, Marylou Gabrié et al.

ICML 2024 • spotlight • arXiv:2402.10758

#758

High-Probability Convergence for Composite and Distributed Stochastic Minimization and Variational Inequalities with Heavy-Tailed Noise

Eduard Gorbunov, Abdurakhmon Sadiev, Marina Danilova et al.

ICML 2024 • arXiv:2310.01860

#759

On the Duality Between Sharpness-Aware Minimization and Adversarial Training

Yihao Zhang, Hangzhou He, Jingyu Zhu et al.

ICML 2024 • arXiv:2402.15152

#760

Learning and Forgetting Unsafe Examples in Large Language Models

Jiachen Zhao, Zhun Deng, David Madras et al.

ICML 2024 • oral • arXiv:2312.12736

#761

Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations

Kaiwen Xue, Yuhao Zhou, Shen Nie et al.

ICML 2024 • arXiv:2404.15766

#762

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering

Zhuowei Li, Haizhou Shi, Yunhe Gao et al.

ICML 2025 • arXiv:2502.03628

#763

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Zishun Yu, Tengyu Xu, Di Jin et al.

ICML 2025 • arXiv:2501.17974

#764

StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization

Shida Wang, Qianxiao Li

ICML 2024 • arXiv:2311.14495

#765

An Empirical Study of Realized GNN Expressiveness

Yanbo Wang, Muhan Zhang

ICML 2024 • arXiv:2304.07702

#766

The Entropy Enigma: Success and Failure of Entropy Minimization

Ori Press, Ravid Shwartz-Ziv, Yann LeCun et al.

ICML 2024 • arXiv:2405.05012

#767

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Alec Helbling, Tuna Han Salih Meral, Benjamin Hoover et al.

ICML 2025 • oral • arXiv:2502.04320

#768

Accelerated Algorithms for Constrained Nonconvex-Nonconcave Min-Max Optimization and Comonotone Inclusion

Yang Cai, Argyris Oikonomou, Weiqiang Zheng

ICML 2024 • arXiv:2206.05248

#769

Learning Iterative Reasoning through Energy Diffusion

Yilun Du, Jiayuan Mao, Josh Tenenbaum

ICML 2024 • arXiv:2406.11179

#770

Reinformer: Max-Return Sequence Modeling for Offline RL

Zifeng Zhuang, Dengyun Peng, Jinxin Liu et al.

ICML 2024 • arXiv:2405.08740

#771

Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

ICML 2024 • arXiv:2406.01382

#772

Pairwise Alignment Improves Graph Domain Adaptation

Shikun Liu, Deyu Zou, Han Zhao et al.

ICML 2024 • spotlight • arXiv:2403.01092

#773

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

Chenyu Li, Oscar Michel, Xichen Pan et al.

ICML 2025 • arXiv:2503.09595

#774

Core Knowledge Deficits in Multi-Modal Language Models

Yijiang Li, Qingying Gao, Tianwei Zhao et al.

ICML 2025 • arXiv:2410.10855

#775

Towards a Mechanistic Explanation of Diffusion Model Generalization

Matthew Niedoba, Berend Zwartsenberg, Kevin Murphy et al.

ICML 2025 • spotlight • arXiv:2411.19339

#776

Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples

Chengqian Gao, Haonan Li, Liu Liu et al.

ICML 2025 • arXiv:2502.09650

#777

On Discrete Prompt Optimization for Diffusion Models

Ruochen Wang, Ting Liu, Cho-Jui Hsieh et al.

ICML 2024 • arXiv:2407.01606

#778

Efficient and Effective Time-Series Forecasting with Spiking Neural Networks

Changze Lv, Yansen Wang, Dongqi Han et al.

ICML 2024 • oral • arXiv:2402.01533

#779

On Mechanistic Knowledge Localization in Text-to-Image Generative Models

Samyadeep Basu, Keivan Rezaei, Priyatham Kattakinda et al.

ICML 2024 • arXiv:2405.01008

#780

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Shiqi Chen, Jinghan Zhang, Tongyao Zhu et al.

ICML 2025 • arXiv:2505.05464

#781

Position: Why We Must Rethink Empirical Research in Machine Learning

Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger et al.

ICML 2024 • arXiv:2405.02200

#782

Designing Decision Support Systems using Counterfactual Prediction Sets

Eleni Straitouri, Manuel Gomez-Rodriguez

ICML 2024 • spotlight • arXiv:2306.03928

#783

InferCept: Efficient Intercept Support for Augmented Large Language Model Inference

Reyna Abhyankar, Zijian He, Vikranth Srivatsa et al.

ICML 2024 • arXiv:2402.01869

#784

Matrix Information Theory for Self-Supervised Learning

Yifan Zhang, Zhiquan Tan, Jingqin Yang et al.

ICML 2024 • arXiv:2305.17326

#785

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

Andreas Opedal, Alessandro Stolfo, Haruki Shirakami et al.

ICML 2024 • arXiv:2401.18070

#786

Self-Correcting Self-Consuming Loops for Generative Model Training

Nate Gillman, Michael Freeman, Daksh Aggarwal et al.

ICML 2024 • arXiv:2402.07087

#787

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

Ziyi Zhang, Sen Zhang, Yibing Zhan et al.

ICML 2024 • oral • arXiv:2402.08552

#788

Comparing Graph Transformers via Positional Encodings

Mitchell Black, Zhengchao Wan, Gal Mishne et al.

ICML 2024 • arXiv:2402.14202

#789

Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search

Boyan Li, Jiayi Zhang, Ju Fan et al.

ICML 2025 • arXiv:2502.17248

#790

Clifford-Steerable Convolutional Neural Networks

Maksim Zhdanov, David Ruhe, Maurice Weiler et al.

ICML 2024 • arXiv:2402.14730

#791

Chain-of-Thought Predictive Control

Zhiwei Jia, Vineet Thumuluri, Fangchen Liu et al.

ICML 2024 • arXiv:2304.00776

#792

Position: Key Claims in LLM Research Have a Long Tail of Footnotes

Anna Rogers, Sasha Luccioni

ICML 2024 • arXiv:2308.07120

#793

Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation

Ignat Georgiev, Krishnan Srinivasan, Jie Xu et al.

ICML 2024 • arXiv:2405.17784

#794

Stable Differentiable Causal Discovery

Achille Nazaret, Justin Hong, Elham Azizi et al.

ICML 2024 • arXiv:2311.10263

#795

LLM-Empowered State Representation for Reinforcement Learning

Boyuan Wang, Yun Qu, Yuhang Jiang et al.

ICML 2024 • arXiv:2407.13237

#796

Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention

Zhen Qin, Weigao Sun, Dong Li et al.

ICML 2024 • arXiv:2405.17381

#797

Discrepancy Minimization in Input-Sparsity Time

Yichuan Deng, Xiaoyu Li, Zhao Song et al.

ICML 2025 • spotlight • arXiv:2210.12468

#798

Teaching Language Models to Critique via Reinforcement Learning

Zhihui Xie, Jie Chen, Liyu Chen et al.

ICML 2025 • arXiv:2502.03492

#799

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning

Hongyin Zhang, Zifeng Zhuang, Han Zhao et al.

ICML 2025 • arXiv:2505.07395

#800

On the Implicit Bias of Adam

Matias Cattaneo, Jason Klusowski, Boris Shigida

ICML 2024 • arXiv:2309.00079
