Most Cited ICLR &quot;training-free augmentation&quot; Papers

ICLR 2024arXiv:2307.07697

#202

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

Jiashuo Sun, Chengjin Xu, Lumingyuan Tang et al.

198

ICLR 2024arXiv:2308.07921

#203

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Aojun Zhou, Ke Wang, Zimu Lu et al.

198

ICLR 2025arXiv:2503.09573

#204

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Marianne Arriola, Aaron Gokaslan, Justin Chiu et al.

197

ICLR 2024arXiv:2310.15213

#205

Function Vectors in Large Language Models

Eric Todd, Millicent Li, Arnab Sen Sharma et al.

197

ICLR 2025arXiv:2404.08471

#206

Revisiting Feature Prediction for Learning Visual Representations from Video

Quentin Garrido, Yann LeCun, Michael Rabbat et al.

196

ICLR 2024arXiv:2310.02575

#207

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Enneng Yang, Zhenyi Wang, Li Shen et al.

196

ICLR 2025arXiv:2410.12557

#208

One Step Diffusion via Shortcut Models

Kevin Frans, Danijar Hafner, Sergey Levine et al.

195

ICLR 2024arXiv:2308.13418

#209

Nougat: Neural Optical Understanding for Academic Documents

Lukas Blecher, Guillem Cucurull Preixens, Thomas Scialom et al.

ICLR 2025arXiv:2409.16040

#210

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Xiaoming Shi, Shiyu Wang, Yuqi Nie et al.

ICLR 2024spotlightarXiv:2308.07124

#211

OctoPack: Instruction Tuning Code Large Language Models

Niklas Muennighoff, Qian Liu, Armel Zebaze et al.

ICLR 2024arXiv:2310.08580

#212

OmniControl: Control Any Joint at Any Time for Human Motion Generation

Yiming Xie, Varun Jampani, Lei Zhong et al.

ICLR 2024spotlightarXiv:2307.08123

#213

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Bowen Song, Soo Min Kwon, Zecheng Zhang et al.

193

ICLR 2024arXiv:2306.00814

#214

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Hubert Siuzdak

192

ICLR 2024arXiv:2309.07915

#215

MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning

Haozhe Zhao, Zefan Cai, Shuzheng Si et al.

191

ICLR 2024arXiv:2310.01218

#216

Making LLaMA SEE and Draw with SEED Tokenizer

Yuying Ge, Sijie Zhao, Ziyun Zeng et al.

190

ICLR 2024arXiv:2310.06117

#217

Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen et al.

190

ICLR 2024arXiv:2310.02743

#218

Reward Model Ensembles Help Mitigate Overoptimization

Thomas Coste, Usman Anwar, Robert Kirk et al.

188

ICLR 2024spotlightarXiv:2308.03686

#219

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.

188

ICLR 2024arXiv:2307.05695

#220

ReLoRA: High-Rank Training Through Low-Rank Updates

Vladislav Lialin, Sherin Muckatira, Namrata Shivagunde et al.

187

ICLR 2025arXiv:2306.09479

#221

Inverse Scaling: When Bigger Isn't Better

Joe Cavanagh, Andrew Gritsevskiy, Najoung Kim et al.

186

ICLR 2024arXiv:2309.16042

#222

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Fred Zhang, Neel Nanda

185

ICLR 2024arXiv:2306.04634

#223

On the Reliability of Watermarks for Large Language Models

John Kirchenbauer, Jonas Geiping, Yuxin Wen et al.

185

ICLR 2024oralarXiv:2311.04892

#224

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande et al.

184

ICLR 2025arXiv:2404.02078

#225

Advancing LLM Reasoning Generalists with Preference Trees

Lifan Yuan, Ganqu Cui, Hanbin Wang et al.

183

ICLR 2025arXiv:2410.10819

#226

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.

179

ICLR 2024arXiv:2308.00436

#227

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Ning Miao, Yee Whye Teh, Tom Rainforth

179

ICLR 2025arXiv:2412.06464

#228

Gated Delta Networks: Improving Mamba2 with Delta Rule

Songlin Yang, Jan Kautz, Ali Hatamizadeh

177

ICLR 2025arXiv:2410.19168

#229

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Sakshi, Utkarsh Tyagi, Sonal Kumar et al.

176

ICLR 2024spotlightarXiv:2310.06773

#230

Uni3D: Exploring Unified 3D Representation at Scale

Junsheng Zhou, Jinsheng Wang, Baorui Ma et al.

175

ICLR 2024spotlightarXiv:2310.07298

#231

Beyond Memorization: Violating Privacy via Inference with Large Language Models

Robin Staab, Mark Vero, Mislav Balunovic et al.

175

ICLR 2024arXiv:2310.04560

#232

Talk like a Graph: Encoding Graphs for Large Language Models

Bahare Fatemi, Jonathan Halcrow, Bryan Perozzi

174

ICLR 2025arXiv:2312.11370

#233

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Jiahui Gao, Renjie Pi, Jipeng Zhang et al.

174

ICLR 2024arXiv:2310.10012

#234

Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models?

Yu-Lin Tsai, Chia-Yi Hsu, Chulin Xie et al.

173

ICLR 2025arXiv:2408.14837

#235

Diffusion Models Are Real-Time Game Engines

Dani Valevski, Yaniv Leviathan, Moab Arar et al.

172

ICLR 2025arXiv:2403.17887

#236

The Unreasonable Ineffectiveness of the Deeper Layers

Andrey Gromov, Kushal Tirumala, Hassan Shapourian et al.

172

ICLR 2024arXiv:2310.16818

#237

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Jingxiang Sun, Bo Zhang, Ruizhi Shao et al.

172

ICLR 2024arXiv:2306.05836

#238

Can Large Language Models Infer Causation from Correlation?

Zhijing Jin, Jiarui Liu, Zhiheng LYU et al.

171

ICLR 2024arXiv:2310.16028

#239

What Algorithms can Transformers Learn? A Study in Length Generalization

Hattie Zhou, Arwen Bradley, Etai Littwin et al.

170

ICLR 2024arXiv:2303.11435

#240

Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration

Peyman Milanfar, Mauricio Delbracio

ICLR 2024arXiv:2310.03731

#241

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Ke Wang, Houxing Ren, Aojun Zhou et al.

ICLR 2025arXiv:2409.02060

#242

OLMoE: Open Mixture-of-Experts Language Models

Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld et al.

ICLR 2025arXiv:2410.08146

#243

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Amrith Setlur, Chirag Nagpal, Adam Fisch et al.

ICLR 2025arXiv:2410.11758

#244

Latent Action Pretraining from Videos

Seonghyeon Ye, Joel Jang, Byeongguk Jeon et al.

ICLR 2025arXiv:2403.07378

#245

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Xin Wang, Yu Zheng, Zhongwei Wan et al.

ICLR 2024arXiv:2306.09896

#246

Is Self-Repair a Silver Bullet for Code Generation?

Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang et al.

ICLR 2025arXiv:2407.06460

#247

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Weijia Shi, Jaechan Lee, Yangsibo Huang et al.

ICLR 2024arXiv:2305.03053

#248

ZipIt! Merging Models from Different Tasks without Training

George Stoica, Daniel Bolya, Jakob Bjorner et al.

167

ICLR 2024arXiv:2308.06259

#249

Self-Alignment with Instruction Backtranslation

Xian Li, Ping Yu, Chunting Zhou et al.

167

ICLR 2024spotlightarXiv:2310.17884

#250

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou et al.

166

ICLR 2024arXiv:2310.00785

#251

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Yapei Chang, Kyle Lo, Tanya Goyal et al.

164

ICLR 2024oralarXiv:2403.01742

#252

Diffusion-TS: Interpretable Diffusion for General Time Series Generation

Xinyu Yuan, Yan Qiao

164

ICLR 2025arXiv:2410.12784

#253

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

Sijun Tan, Siyuan Zhuang, Kyle Montgomery et al.

163

ICLR 2024arXiv:2305.15852

#254

Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation

Niels Mündler, Jingxuan He, Slobodan Jenko et al.

161

ICLR 2025arXiv:2409.00750

#255

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Yuancheng Wang, Haoyue Zhan, Liwei Liu et al.

161

ICLR 2024arXiv:2309.07124

#256

RAIN: Your Language Models Can Align Themselves without Finetuning

Yuhui Li, Fangyun Wei, Jinjing Zhao et al.

161

ICLR 2024spotlightarXiv:2310.08576

#257

Learning to Act from Actionless Videos through Dense Correspondences

Po-Chen Ko, Jiayuan Mao, Yilun Du et al.

160

ICLR 2024arXiv:2302.03660

#258

Flow Matching on General Geometries

Ricky T. Q. Chen, Yaron Lipman

159

ICLR 2024arXiv:2302.02676

#259

Chain of Hindsight aligns Language Models with Feedback

Hao Liu, Carmelo Sferrazza, Pieter Abbeel

158

ICLR 2024arXiv:2307.05222

#260

Emu: Generative Pretraining in Multimodality

Quan Sun, Qiying Yu, Yufeng Cui et al.

158

ICLR 2025arXiv:2410.17891

#261

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Shansan Gong, Shivam Agarwal, Yizhe Zhang et al.

157

ICLR 2024arXiv:2310.03128

#262

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Yue Huang, Jiawen Shi, Yuan Li et al.

157

ICLR 2024spotlightarXiv:2309.16240

#263

Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints

Chaoqi Wang, Yibo Jiang, Chenghao Yang et al.

157

ICLR 2024arXiv:2309.14717

#264

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Yuhui Xu, Lingxi Xie, Xiaotao Gu et al.

156

ICLR 2025arXiv:2410.11081

#265

Simplifying, Stabilizing and Scaling Continuous-time Consistency Models

Cheng Lu, Yang Song

ICLR 2024spotlightarXiv:2311.12024

#266

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Peng Wang, Hao Tan, Sai Bi et al.

ICLR 2024arXiv:2310.05470

#267

Generative Judge for Evaluating Alignment

Junlong Li, Shichao Sun, Weizhe Yuan et al.

ICLR 2024arXiv:2303.07937

#268

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

Junyoung Seo, Wooseok Jang, Min-Seop Kwak et al.

ICLR 2024oralarXiv:2402.05956

#269

Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting

Peng Chen, Yingying ZHANG, Yunyao Cheng et al.

ICLR 2024spotlightarXiv:2309.17410

#270

Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks

Vaidehi Ramesh Patil, Peter Hase, Mohit Bansal

ICLR 2024arXiv:2309.05660

#271

Hypothesis Search: Inductive Reasoning with Language Models

Ruocheng Wang, Eric Zelikman, Gabriel Poesia et al.

ICLR 2025arXiv:2410.02355

#272

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Junfeng Fang, Houcheng Jiang, Kun Wang et al.

ICLR 2024arXiv:2310.05916

#273

Interpreting CLIP's Image Representation via Text-Based Decomposition

Yossi Gandelsman, Alexei Efros, Jacob Steinhardt

ICLR 2024arXiv:2310.02596

#274

SweetDreamer: Aligning Geometric Priors in 2D diffusion for Consistent Text-to-3D

Weiyu LI, Rui Chen, Xuelin Chen et al.

153

ICLR 2025arXiv:2410.10762

#275

AFlow: Automating Agentic Workflow Generation

Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu et al.

153

ICLR 2025oralarXiv:2412.10345

#276

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

Ruijie Zheng, Yongyuan Liang, Shuaiyi Huang et al.

153

ICLR 2024oralarXiv:2310.15169

#277

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

Haonan Qiu, Menghan Xia, Yong Zhang et al.

152

ICLR 2024spotlightarXiv:2308.08493

#278

Time Travel in LLMs: Tracing Data Contamination in Large Language Models

Shahriar Golchin, Mihai Surdeanu

ICLR 2025arXiv:2406.04770

#279

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Bill Yuchen Lin, Yuntian Deng, Khyathi Chandu et al.

ICLR 2024arXiv:2305.13269

#280

Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

Xingxuan Li, Ruochen Zhao, Yew Ken Chia et al.

ICLR 2025arXiv:2406.14598

#281

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal

Tinghao Xie, Xiangyu Qi, Yi Zeng et al.

ICLR 2025arXiv:2404.15574

#282

Retrieval Head Mechanistically Explains Long-Context Factuality

Wenhao Wu, Yizhong Wang, Guangxuan Xiao et al.

150

ICLR 2024arXiv:2307.03576

#283

One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

Arvind Mahankali, Tatsunori Hashimoto, Tengyu Ma

150

ICLR 2024spotlightarXiv:2307.10928

#284

FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Seonghyeon Ye, Doyoung Kim, Sungdong Kim et al.

150

ICLR 2024spotlightarXiv:2309.17102

#285

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Tsu-Jui Fu, Wenze Hu, Xianzhi Du et al.

ICLR 2025arXiv:2410.07985

#286

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

Bofei Gao, Feifan Song, Zhe Yang et al.

ICLR 2024arXiv:2305.17359

#287

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

Xianjun Yang, Wei Cheng, Yue Wu et al.

ICLR 2025oralarXiv:2402.08268

#288

World Model on Million-Length Video And Language With Blockwise RingAttention

Hao Liu, Wilson Yan, Matei Zaharia et al.

ICLR 2024oralarXiv:2310.05922

#289

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Yuren Cong, Mengmeng Xu, Christian Simon et al.

ICLR 2024spotlightarXiv:2310.02391

#290

SE(3)-Stochastic Flow Matching for Protein Backbone Generation

Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet et al.

ICLR 2025arXiv:2408.13257

#291

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

YiFan Zhang, Huanyu Zhang, Haochen Tian et al.

ICLR 2024arXiv:2310.02557

#292

Generalization in diffusion models arises from geometry-adaptive harmonic representations

Zahra Kadkhodaie, Florentin Guth, Eero Simoncelli et al.

ICLR 2024arXiv:2310.10625

#293

Video Language Planning

Yilun Du, Sherry Yang, Pete Florence et al.

ICLR 2025arXiv:2409.00588

#294

Diffusion Policy Policy Optimization

Allen Ren, Justin Lidard, Lars Ankile et al.

146

ICLR 2025arXiv:2406.07155

#295

Scaling Large Language Model-based Multi-Agent Collaboration

Chen Qian, Zihao Xie, YiFei Wang et al.

146

ICLR 2025arXiv:2404.09990

#296

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving

Yangzhen Wu, Zhiqing Sun, Shanda Li et al.

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

MUDE HUI, Siwei Yang, Bingchen Zhao et al.

146

ICLR 2024arXiv:2310.03668

#298

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri et al.

145

ICLR 2024spotlightarXiv:2310.16049

#299

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Zayne Sprague, Xi Ye, Kaj Bostrom et al.

144

ICLR 2024arXiv:2309.14322

#300

Small-scale proxies for large-scale Transformer training instabilities

Mitchell Wortsman, Peter Liu, Lechao Xiao et al.

144

ICLR 2024oralarXiv:2305.11854

#301

Multimodal Web Navigation with Instruction-Finetuned Foundation Models

Hiroki Furuta, Kuang-Huei Lee, Ofir Nachum et al.

144

ICLR 2025arXiv:2309.14402

#302

Physics of Language Models: Part 3.2, Knowledge Manipulation

Zeyuan Allen-Zhu, Yuanzhi Li

ICLR 2024arXiv:2309.05444

#303

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Ted Zadouri, Ahmet Üstün, Arash Ahmadian et al.

ICLR 2025arXiv:2410.09024

#304

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko, Alexandra Souly, Mateusz Dziemian et al.

ICLR 2024spotlightarXiv:2308.09124

#305

Linearity of Relation Decoding in Transformer Language Models

Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.

ICLR 2025arXiv:2410.07095

#306

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Jun Shern Chan, Neil Chowdhury, Oliver Jaffe et al.

142

ICLR 2025arXiv:2408.08152

#307

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Huajian Xin, Z.Z. Ren, Junxiao Song et al.

142

ICLR 2024arXiv:2309.05196

#308

Does Writing with Language Models Reduce Content Diversity?

Vishakh Padmakumar, He He

142

ICLR 2025arXiv:2411.07763

#309

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Fangyu Lei, Jixuan Chen, Yuxiao Ye et al.

141

ICLR 2024arXiv:2403.12313

#310

Improving LoRA in Privacy-preserving Federated Learning

Youbang Sun, Zitao Li, Yaliang Li et al.

141

ICLR 2025arXiv:2410.01560

#311

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Shubham Toshniwal, Wei Du, Ivan Moshkov et al.

141

ICLR 2025arXiv:2410.10733

#312

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Junyu Chen, Han Cai, Junsong Chen et al.

138

ICLR 2025oralarXiv:2409.12961

#313

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Zuyan Liu, Yuhao Dong, Ziwei Liu et al.

138

ICLR 2024arXiv:2310.11230

#314

Zipformer: A faster and better encoder for automatic speech recognition

Zengwei Yao, Liyong Guo, Xiaoyu Yang et al.

138

ICLR 2024arXiv:2310.12921

#315

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

Juan Rocamonde, Victoriano Montesinos, Elvis Nava et al.

137

ICLR 2024arXiv:2310.01714

#316

Large Language Models as Analogical Reasoners

Michihiro Yasunaga, Xinyun Chen, Yujia Li et al.

137

ICLR 2024oralarXiv:2312.03606

#317

DiffusionSat: A Generative Foundation Model for Satellite Imagery

Samar Khanna, Patrick Liu, Linqi Zhou et al.

136

ICLR 2024arXiv:2404.13628

#318

Mixture of LoRA Experts

xun wu, Shaohan Huang, Furu Wei

136

ICLR 2025arXiv:2409.06666

#319

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Qingkai Fang, Shoutao Guo, Yan Zhou et al.

135

ICLR 2024arXiv:2309.16948

#320

Denoising Diffusion Bridge Models

Linqi Zhou, Aaron Lou, Samar Khanna et al.

135

ICLR 2025arXiv:2405.17238

#321

IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities

Ziyang Li, Saikat Dutta, Mayur Naik

135

ICLR 2024arXiv:2305.19523

#322

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning

Xiaoxin He, Xavier Bresson, Thomas Laurent et al.

133

ICLR 2025arXiv:2408.08435

#323

Automated Design of Agentic Systems

Shengran Hu, Cong Lu, Jeff Clune

133

ICLR 2024arXiv:2307.06945

#324

In-context Autoencoder for Context Compression in a Large Language Model

Tao Ge, Hu Jing, Lei Wang et al.

132

ICLR 2025oralarXiv:2408.16532

#325

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Shengpeng Ji, Ziyue Jiang, Wen Wang et al.

132

ICLR 2024arXiv:2310.13345

#326

An LLM can Fool Itself: A Prompt-Based Adversarial Attack

Xilie Xu, Keyi Kong, Ning Liu et al.

ICLR 2025arXiv:2409.08861

#327

Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control

Carles Domingo i Enrich, Michal Drozdzal, Brian Karrer et al.

ICLR 2024arXiv:2305.13310

#328

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Yang Liu, Muzhi Zhu, Hengtao Li et al.

ICLR 2024arXiv:2211.03295

#329

MogaNet: Multi-order Gated Aggregation Network

Siyuan Li, Zedong Wang, Zicheng Liu et al.

ICLR 2024arXiv:2310.06786

#330

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

Keiran Paster, Marco Dos Santos, Zhangir Azerbayev et al.

ICLR 2024spotlightarXiv:2302.07867

#331

Learning Performance-Improving Code Edits

Alexander Shypula, Aman Madaan, Yimeng Zeng et al.

ICLR 2024arXiv:2404.03663

#332

Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips

Man Yao, Jiakui Hu, Tianxiang Hu et al.

ICLR 2024arXiv:2310.08461

#333

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat et al.

ICLR 2024arXiv:2310.18235

#334

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

Jaemin Cho, Yushi Hu, Jason Baldridge et al.

ICLR 2024arXiv:2311.16424

#335

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024arXiv:2310.03094

#336

Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning

Murong Yue, Jie Zhao, Min Zhang et al.

ICLR 2025arXiv:2410.02707

#337

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Hadas Orgad, Michael Toker, Zorik Gekhman et al.

ICLR 2025arXiv:2408.06195

#338

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver

Zhenting Qi, Mingyuan MA, Jiahang Xu et al.

ICLR 2025oralarXiv:2410.10813

#339

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Di Wu, Hongwei Wang, Wenhao Yu et al.

128

ICLR 2025arXiv:2410.10594

#340

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Shi Yu, Chaoyue Tang, Bokai Xu et al.

127

ICLR 2025arXiv:2409.18124

#341

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Jing He, Haodong Li, Wei Yin et al.

127

ICLR 2024spotlightarXiv:2310.20707

#342

Adapting Large Language Models via Reading Comprehension

Daixuan Cheng, Shaohan Huang, Furu Wei

What's In My Big Data?

Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.

126

ICLR 2024arXiv:2402.03921

#344

Large Language Models to Enhance Bayesian Optimization

Tennison Liu, Nicolás Astorga, Nabeel Seedat et al.

125

ICLR 2024spotlightarXiv:2311.03054

#345

AnyText: Multilingual Visual Text Generation and Editing

Yuxiang Tuo, Wangmeng Xiang, Jun-Yan He et al.

125

ICLR 2025arXiv:1901.03559

#346

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Thomas Bush, Stephen Chung, Usman Anwar et al.

125

ICLR 2025oralarXiv:2410.18514

#347

Scaling up Masked Diffusion Models on Text

Shen Nie, Fengqi Zhu, Chao Du et al.

124

ICLR 2025arXiv:2409.00920

#348

ToolACE: Winning the Points of LLM Function Calling

Weiwen Liu, Xu Huang, Xingshan Zeng et al.

124

ICLR 2024arXiv:2306.08018

#349

Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

Yin Fang, Xiaozhuan Liang, Ningyu Zhang et al.

123

ICLR 2024arXiv:2312.04927

#350

Zoology: Measuring and Improving Recall in Efficient Language Models

Simran Arora, Sabri Eyuboglu, Aman Timalsina et al.

123

ICLR 2025arXiv:2410.18647

#351

Data Scaling Laws in Imitation Learning for Robotic Manipulation

Fanqi Lin, Yingdong Hu, Pingyue Sheng et al.

123

ICLR 2025arXiv:2406.07522

#352

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Liliang Ren, Yang Liu, Yadong Lu et al.

122

ICLR 2025arXiv:2410.01943

#353

CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

Mohammadreza Pourreza, Hailong Li, Ruoxi Sun et al.

122

ICLR 2024arXiv:2307.03381

#354

Teaching Arithmetic to Small Transformers

Nayoung Lee, Kartik Sreenivasan, Jason Lee et al.

ICLR 2025arXiv:2411.02337

#355

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Zehan Qi, Xiao Liu, Iat Long Iong et al.

ICLR 2024spotlightarXiv:2310.01361

#356

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Lirui Wang, Yiyang Ling, Zhecheng Yuan et al.

ICLR 2024arXiv:2303.05754

#357

Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

Hyungjin Chung, Suhyeon Lee, Jong Chul YE

ICLR 2024arXiv:2310.19415

#358

Text-to-3D with Classifier Score Distillation

Xin Yu, Yuan-Chen Guo, Yangguang Li et al.

ICLR 2024arXiv:2310.05209

#359

Scaling Laws of RoPE-based Extrapolation

Xiaoran Liu, Hang Yan, Chenxin An et al.

ICLR 2025oralarXiv:2406.09411

#360

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Fei Wang, XINGYU FU, James Y. Huang et al.

ICLR 2024arXiv:2309.14859

#361

Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation

Shih-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao et al.

ICLR 2025arXiv:2302.13939

#362

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Rui-Jie Zhu, Qihang Zhao, Jason Eshraghian et al.

ICLR 2025arXiv:2407.12883

#363

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Hongjin SU, Howard Yen, Mengzhou Xia et al.

ICLR 2025arXiv:2410.24207

#364

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Botao Ye, Sifei Liu, Haofei Xu et al.

ICLR 2024spotlightarXiv:2311.01977

#365

RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches

Jiayuan Gu, Sean Kirmani, Paul Wohlhart et al.

ICLR 2024arXiv:2206.09557

#366

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Gunho Park, baeseong park, Minsub Kim et al.

ICLR 2025arXiv:2410.13863

#367

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Lijie Fan, Tianhong Li, Siyang Qin et al.

ICLR 2024arXiv:2308.01907

#368

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

Weiyun Wang, Min Shi, Qingyun Li et al.

ICLR 2025oralarXiv:2407.12781

#369

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

Sherwin Bahmani, Ivan Skorokhodov, Aliaksandr Siarohin et al.

ICLR 2024arXiv:2310.06313

#370

Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models

Fei Shen, Hu Ye, Jun Zhang et al.

ICLR 2025arXiv:2403.16952

#371

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Jiasheng Ye, Peiju Liu, Tianxiang Sun et al.

ICLR 2024arXiv:2312.13558

#372

The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Pratyusha Sharma, Jordan Ash, Dipendra Kumar Misra

ICLR 2025arXiv:2501.03895

#373

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Shaolei Zhang, Qingkai Fang, Yang et al.

117

ICLR 2025arXiv:2410.05160

#374

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Ziyan Jiang, Rui Meng, Xinyi Yang et al.

117

ICLR 2024arXiv:2402.14817

#375

Cameras as Rays: Pose Estimation via Ray Diffusion

Jason Zhang, Amy Lin, Moneish Kumar et al.

117

ICLR 2025arXiv:2409.02908

#376

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling

Kaiwen Zheng, Yongxin Chen, Hanzi Mao et al.

ICLR 2024arXiv:2310.03025

#377

Retrieval meets Long Context Large Language Models

Peng Xu, Wei Ping, Xianchao Wu et al.

ICLR 2025arXiv:2410.02644

#378

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

Hanrong Zhang, Jingyuan Huang, Kai Mei et al.

ICLR 2025arXiv:2408.15998

#379

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Min Shi, Fuxiao Liu, Shihao Wang et al.

ICLR 2025arXiv:2404.05405

#380

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Zeyuan Allen-Zhu, Yuanzhi Li

ICLR 2025arXiv:2410.05295

#381

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Xiaogeng Liu, Peiran Li, G. Edward Suh et al.

115

ICLR 2024arXiv:2309.14393

#382

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang et al.

115

ICLR 2024arXiv:2310.05773

#383

Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching

Ziyao Guo, Kai Wang, George Cazenavette et al.

114

ICLR 2024arXiv:2311.14455

#384

Universal Jailbreak Backdoors from Poisoned Human Feedback

Javier Rando, Florian Tramer

114

ICLR 2024arXiv:2309.10105

#385

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Suhas Kotha, Jacob Springer, Aditi Raghunathan

114

ICLR 2024arXiv:2310.05914

#386

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Neel Jain, Ping-yeh Chiang, Yuxin Wen et al.

113

ICLR 2025arXiv:2408.00761

#387

Tamper-Resistant Safeguards for Open-Weight LLMs

Rishub Tamirisa, Bhrugu Bharathi, Long Phan et al.

113

ICLR 2025arXiv:2410.01257

#388

HelpSteer2-Preference: Complementing Ratings with Preferences

Zhilin Wang, Alexander Bukharin, Olivier Delalleau et al.

ICLR 2024arXiv:2306.06189

#389

FasterViT: Fast Vision Transformers with Hierarchical Attention

Ali Hatamizadeh, Greg Heinrich, Hongxu Yin et al.

ICLR 2025arXiv:2406.19435

#390

A Sanity Check for AI-generated Image Detection

Shilin Yan, Ouxiang Li, Jiayin Cai et al.

ICLR 2024arXiv:2310.00656

#391

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Haiming Wang, Huajian Xin, Chuanyang Zheng et al.

ICLR 2024arXiv:2310.01557

#392

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Yue Wu, Xuan Tang, Tom Mitchell et al.

ICLR 2024spotlightarXiv:2309.11489

#393

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2025arXiv:2409.11295

#394

EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE

Zeyi Liao, Lingbo Mo, Chejian Xu et al.

ICLR 2024spotlightarXiv:2310.07702

#395

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Yingqing He, Shaoshu Yang, Haoxin Chen et al.

ICLR 2025arXiv:2403.02308

#396

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Yuchen Duan, Weiyun Wang, Zhe Chen et al.

ICLR 2024arXiv:2308.03166

#397

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

Chunming He, Kai Li, Yachao Zhang et al.

ICLR 2024arXiv:2306.07863

#398

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

Longtao Zheng, Rundong Wang, Xinrun Wang et al.

ICLR 2025oralarXiv:2412.14169

#399

Autoregressive Video Generation without Vector Quantization

Haoge Deng, Ting Pan, Haiwen Diao et al.

ICLR 2024arXiv:2310.08559

#400

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

Linlu Qiu, Liwei Jiang, Ximing Lu et al.