Most Cited ICLR "fairness in representations" Papers

6,124 papers found • Page 2 of 31

#201

Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space

Hengrui Zhang, Jiani Zhang, Zhengyuan Shen et al.

ICLR 2024arXiv:2310.09656
199
citations
#202

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

Jiashuo Sun, Chengjin Xu, Lumingyuan Tang et al.

ICLR 2024arXiv:2307.07697
198
citations
#203

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Aojun Zhou, Ke Wang, Zimu Lu et al.

ICLR 2024arXiv:2308.07921
198
citations
#204

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Marianne Arriola, Aaron Gokaslan, Justin Chiu et al.

ICLR 2025arXiv:2503.09573
197
citations
#205

Function Vectors in Large Language Models

Eric Todd, Millicent Li, Arnab Sen Sharma et al.

ICLR 2024arXiv:2310.15213
197
citations
#206

Revisiting Feature Prediction for Learning Visual Representations from Video

Quentin Garrido, Yann LeCun, Michael Rabbat et al.

ICLR 2025arXiv:2404.08471
196
citations
#207

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Enneng Yang, Zhenyi Wang, Li Shen et al.

ICLR 2024arXiv:2310.02575
196
citations
#208

One Step Diffusion via Shortcut Models

Kevin Frans, Danijar Hafner, Sergey Levine et al.

ICLR 2025arXiv:2410.12557
195
citations
#209

Nougat: Neural Optical Understanding for Academic Documents

Lukas Blecher, Guillem Cucurull Preixens, Thomas Scialom et al.

ICLR 2024arXiv:2308.13418
194
citations
#210

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Xiaoming Shi, Shiyu Wang, Yuqi Nie et al.

ICLR 2025arXiv:2409.16040
194
citations
#211

OctoPack: Instruction Tuning Code Large Language Models

Niklas Muennighoff, Qian Liu, Armel Zebaze et al.

ICLR 2024spotlightarXiv:2308.07124
194
citations
#212

OmniControl: Control Any Joint at Any Time for Human Motion Generation

Yiming Xie, Varun Jampani, Lei Zhong et al.

ICLR 2024arXiv:2310.08580
194
citations
#213

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Bowen Song, Soo Min Kwon, Zecheng Zhang et al.

ICLR 2024spotlightarXiv:2307.08123
193
citations
#214

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Hubert Siuzdak

ICLR 2024arXiv:2306.00814
192
citations
#215

MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning

Haozhe Zhao, Zefan Cai, Shuzheng Si et al.

ICLR 2024arXiv:2309.07915
191
citations
#216

Making LLaMA SEE and Draw with SEED Tokenizer

Yuying Ge, Sijie Zhao, Ziyun Zeng et al.

ICLR 2024arXiv:2310.01218
190
citations
#217

Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen et al.

ICLR 2024arXiv:2310.06117
190
citations
#218

Reward Model Ensembles Help Mitigate Overoptimization

Thomas Coste, Usman Anwar, Robert Kirk et al.

ICLR 2024arXiv:2310.02743
188
citations
#219

Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.

ICLR 2024spotlightarXiv:2308.03686
188
citations
#220

ReLoRA: High-Rank Training Through Low-Rank Updates

Vladislav Lialin, Sherin Muckatira, Namrata Shivagunde et al.

ICLR 2024arXiv:2307.05695
187
citations
#221

Inverse Scaling: When Bigger Isn't Better

Joe Cavanagh, Andrew Gritsevskiy, Najoung Kim et al.

ICLR 2025arXiv:2306.09479
186
citations
#222

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Fred Zhang, Neel Nanda

ICLR 2024arXiv:2309.16042
185
citations
#223

On the Reliability of Watermarks for Large Language Models

John Kirchenbauer, Jonas Geiping, Yuxin Wen et al.

ICLR 2024arXiv:2306.04634
185
citations
#224

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande et al.

ICLR 2024oralarXiv:2311.04892
184
citations
#225

Advancing LLM Reasoning Generalists with Preference Trees

Lifan Yuan, Ganqu Cui, Hanbin Wang et al.

ICLR 2025arXiv:2404.02078
183
citations
#226

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.

ICLR 2025arXiv:2410.10819
179
citations
#227

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Ning Miao, Yee Whye Teh, Tom Rainforth

ICLR 2024arXiv:2308.00436
179
citations
#228

Gated Delta Networks: Improving Mamba2 with Delta Rule

Songlin Yang, Jan Kautz, Ali Hatamizadeh

ICLR 2025arXiv:2412.06464
177
citations
#229

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Sakshi, Utkarsh Tyagi, Sonal Kumar et al.

ICLR 2025arXiv:2410.19168
176
citations
#230

Uni3D: Exploring Unified 3D Representation at Scale

Junsheng Zhou, Jinsheng Wang, Baorui Ma et al.

ICLR 2024spotlightarXiv:2310.06773
175
citations
#231

Beyond Memorization: Violating Privacy via Inference with Large Language Models

Robin Staab, Mark Vero, Mislav Balunovic et al.

ICLR 2024spotlightarXiv:2310.07298
175
citations
#232

Talk like a Graph: Encoding Graphs for Large Language Models

Bahare Fatemi, Jonathan Halcrow, Bryan Perozzi

ICLR 2024arXiv:2310.04560
174
citations
#233

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Jiahui Gao, Renjie Pi, Jipeng Zhang et al.

ICLR 2025arXiv:2312.11370
174
citations
#234

Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models?

Yu-Lin Tsai, Chia-Yi Hsu, Chulin Xie et al.

ICLR 2024arXiv:2310.10012
173
citations
#235

Diffusion Models Are Real-Time Game Engines

Dani Valevski, Yaniv Leviathan, Moab Arar et al.

ICLR 2025arXiv:2408.14837
172
citations
#236

The Unreasonable Ineffectiveness of the Deeper Layers

Andrey Gromov, Kushal Tirumala, Hassan Shapourian et al.

ICLR 2025arXiv:2403.17887
172
citations
#237

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Jingxiang Sun, Bo Zhang, Ruizhi Shao et al.

ICLR 2024arXiv:2310.16818
172
citations
#238

Can Large Language Models Infer Causation from Correlation?

Zhijing Jin, Jiarui Liu, Zhiheng LYU et al.

ICLR 2024arXiv:2306.05836
171
citations
#239

What Algorithms can Transformers Learn? A Study in Length Generalization

Hattie Zhou, Arwen Bradley, Etai Littwin et al.

ICLR 2024arXiv:2310.16028
170
citations
#240

Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration

Peyman Milanfar, Mauricio Delbracio

ICLR 2024arXiv:2303.11435
169
citations
#241

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Ke Wang, Houxing Ren, Aojun Zhou et al.

ICLR 2024arXiv:2310.03731
169
citations
#242

OLMoE: Open Mixture-of-Experts Language Models

Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld et al.

ICLR 2025arXiv:2409.02060
169
citations
#243

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Amrith Setlur, Chirag Nagpal, Adam Fisch et al.

ICLR 2025arXiv:2410.08146
169
citations
#244

Latent Action Pretraining from Videos

Seonghyeon Ye, Joel Jang, Byeongguk Jeon et al.

ICLR 2025arXiv:2410.11758
168
citations
#245

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Xin Wang, Yu Zheng, Zhongwei Wan et al.

ICLR 2025arXiv:2403.07378
168
citations
#246

Is Self-Repair a Silver Bullet for Code Generation?

Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang et al.

ICLR 2024arXiv:2306.09896
168
citations
#247

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Weijia Shi, Jaechan Lee, Yangsibo Huang et al.

ICLR 2025arXiv:2407.06460
168
citations
#248

ZipIt! Merging Models from Different Tasks without Training

George Stoica, Daniel Bolya, Jakob Bjorner et al.

ICLR 2024arXiv:2305.03053
167
citations
#249

Self-Alignment with Instruction Backtranslation

Xian Li, Ping Yu, Chunting Zhou et al.

ICLR 2024arXiv:2308.06259
167
citations
#250

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou et al.

ICLR 2024spotlightarXiv:2310.17884
166
citations
#251

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Yapei Chang, Kyle Lo, Tanya Goyal et al.

ICLR 2024arXiv:2310.00785
164
citations
#252

Diffusion-TS: Interpretable Diffusion for General Time Series Generation

Xinyu Yuan, Yan Qiao

ICLR 2024oralarXiv:2403.01742
164
citations
#253

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

Sijun Tan, Siyuan Zhuang, Kyle Montgomery et al.

ICLR 2025arXiv:2410.12784
163
citations
#254

Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation

Niels Mündler, Jingxuan He, Slobodan Jenko et al.

ICLR 2024arXiv:2305.15852
161
citations
#255

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Yuancheng Wang, Haoyue Zhan, Liwei Liu et al.

ICLR 2025arXiv:2409.00750
161
citations
#256

RAIN: Your Language Models Can Align Themselves without Finetuning

Yuhui Li, Fangyun Wei, Jinjing Zhao et al.

ICLR 2024arXiv:2309.07124
161
citations
#257

Learning to Act from Actionless Videos through Dense Correspondences

Po-Chen Ko, Jiayuan Mao, Yilun Du et al.

ICLR 2024spotlightarXiv:2310.08576
160
citations
#258

Flow Matching on General Geometries

Ricky T. Q. Chen, Yaron Lipman

ICLR 2024arXiv:2302.03660
159
citations
#259

Chain of Hindsight aligns Language Models with Feedback

Hao Liu, Carmelo Sferrazza, Pieter Abbeel

ICLR 2024arXiv:2302.02676
158
citations
#260

Emu: Generative Pretraining in Multimodality

Quan Sun, Qiying Yu, Yufeng Cui et al.

ICLR 2024arXiv:2307.05222
158
citations
#261

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Shansan Gong, Shivam Agarwal, Yizhe Zhang et al.

ICLR 2025arXiv:2410.17891
157
citations
#262

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Yue Huang, Jiawen Shi, Yuan Li et al.

ICLR 2024arXiv:2310.03128
157
citations
#263

Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints

Chaoqi Wang, Yibo Jiang, Chenghao Yang et al.

ICLR 2024spotlightarXiv:2309.16240
157
citations
#264

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Yuhui Xu, Lingxi Xie, Xiaotao Gu et al.

ICLR 2024arXiv:2309.14717
156
citations
#265

Simplifying, Stabilizing and Scaling Continuous-time Consistency Models

Cheng Lu, Yang Song

ICLR 2025arXiv:2410.11081
155
citations
#266

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Peng Wang, Hao Tan, Sai Bi et al.

ICLR 2024spotlightarXiv:2311.12024
155
citations
#267

Generative Judge for Evaluating Alignment

Junlong Li, Shichao Sun, Weizhe Yuan et al.

ICLR 2024arXiv:2310.05470
155
citations
#268

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

Junyoung Seo, Wooseok Jang, Min-Seop Kwak et al.

ICLR 2024arXiv:2303.07937
155
citations
#269

Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting

Peng Chen, Yingying ZHANG, Yunyao Cheng et al.

ICLR 2024oralarXiv:2402.05956
155
citations
#270

Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks

Vaidehi Ramesh Patil, Peter Hase, Mohit Bansal

ICLR 2024spotlightarXiv:2309.17410
154
citations
#271

Hypothesis Search: Inductive Reasoning with Language Models

Ruocheng Wang, Eric Zelikman, Gabriel Poesia et al.

ICLR 2024arXiv:2309.05660
154
citations
#272

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models

Junfeng Fang, Houcheng Jiang, Kun Wang et al.

ICLR 2025arXiv:2410.02355
154
citations
#273

Interpreting CLIP's Image Representation via Text-Based Decomposition

Yossi Gandelsman, Alexei Efros, Jacob Steinhardt

ICLR 2024arXiv:2310.05916
154
citations
#274

SweetDreamer: Aligning Geometric Priors in 2D diffusion for Consistent Text-to-3D

Weiyu LI, Rui Chen, Xuelin Chen et al.

ICLR 2024arXiv:2310.02596
153
citations
#275

AFlow: Automating Agentic Workflow Generation

Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu et al.

ICLR 2025arXiv:2410.10762
153
citations
#276

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

Ruijie Zheng, Yongyuan Liang, Shuaiyi Huang et al.

ICLR 2025oralarXiv:2412.10345
153
citations
#277

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

Haonan Qiu, Menghan Xia, Yong Zhang et al.

ICLR 2024oralarXiv:2310.15169
152
citations
#278

Time Travel in LLMs: Tracing Data Contamination in Large Language Models

Shahriar Golchin, Mihai Surdeanu

ICLR 2024spotlightarXiv:2308.08493
151
citations
#279

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Bill Yuchen Lin, Yuntian Deng, Khyathi Chandu et al.

ICLR 2025arXiv:2406.04770
151
citations
#280

Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

Xingxuan Li, Ruochen Zhao, Yew Ken Chia et al.

ICLR 2024arXiv:2305.13269
151
citations
#281

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal

Tinghao Xie, Xiangyu Qi, Yi Zeng et al.

ICLR 2025arXiv:2406.14598
151
citations
#282

Retrieval Head Mechanistically Explains Long-Context Factuality

Wenhao Wu, Yizhong Wang, Guangxuan Xiao et al.

ICLR 2025arXiv:2404.15574
150
citations
#283

One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

Arvind Mahankali, Tatsunori Hashimoto, Tengyu Ma

ICLR 2024arXiv:2307.03576
150
citations
#284

FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Seonghyeon Ye, Doyoung Kim, Sungdong Kim et al.

ICLR 2024spotlightarXiv:2307.10928
150
citations
#285

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Tsu-Jui Fu, Wenze Hu, Xianzhi Du et al.

ICLR 2024spotlightarXiv:2309.17102
149
citations
#286

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

Bofei Gao, Feifan Song, Zhe Yang et al.

ICLR 2025arXiv:2410.07985
149
citations
#287

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

Xianjun Yang, Wei Cheng, Yue Wu et al.

ICLR 2024arXiv:2305.17359
149
citations
#288

World Model on Million-Length Video And Language With Blockwise RingAttention

Hao Liu, Wilson Yan, Matei Zaharia et al.

ICLR 2025oralarXiv:2402.08268
149
citations
#289

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Yuren Cong, Mengmeng Xu, Christian Simon et al.

ICLR 2024oralarXiv:2310.05922
147
citations
#290

SE(3)-Stochastic Flow Matching for Protein Backbone Generation

Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet et al.

ICLR 2024spotlightarXiv:2310.02391
147
citations
#291

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

YiFan Zhang, Huanyu Zhang, Haochen Tian et al.

ICLR 2025arXiv:2408.13257
147
citations
#292

Generalization in diffusion models arises from geometry-adaptive harmonic representations

Zahra Kadkhodaie, Florentin Guth, Eero Simoncelli et al.

ICLR 2024arXiv:2310.02557
147
citations
#293

Video Language Planning

Yilun Du, Sherry Yang, Pete Florence et al.

ICLR 2024arXiv:2310.10625
147
citations
#294

Diffusion Policy Policy Optimization

Allen Ren, Justin Lidard, Lars Ankile et al.

ICLR 2025arXiv:2409.00588
146
citations
#295

Scaling Large Language Model-based Multi-Agent Collaboration

Chen Qian, Zihao Xie, YiFei Wang et al.

ICLR 2025arXiv:2406.07155
146
citations
#296

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving

Yangzhen Wu, Zhiqing Sun, Shanda Li et al.

ICLR 2025
146
citations
#297

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

MUDE HUI, Siwei Yang, Bingchen Zhao et al.

ICLR 2025arXiv:2404.09990
146
citations
#298

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri et al.

ICLR 2024arXiv:2310.03668
145
citations
#299

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Zayne Sprague, Xi Ye, Kaj Bostrom et al.

ICLR 2024spotlightarXiv:2310.16049
144
citations
#300

Small-scale proxies for large-scale Transformer training instabilities

Mitchell Wortsman, Peter Liu, Lechao Xiao et al.

ICLR 2024arXiv:2309.14322
144
citations
#301

Multimodal Web Navigation with Instruction-Finetuned Foundation Models

Hiroki Furuta, Kuang-Huei Lee, Ofir Nachum et al.

ICLR 2024oralarXiv:2305.11854
144
citations
#302

Physics of Language Models: Part 3.2, Knowledge Manipulation

Zeyuan Allen-Zhu, Yuanzhi Li

ICLR 2025arXiv:2309.14402
143
citations
#303

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Ted Zadouri, Ahmet Üstün, Arash Ahmadian et al.

ICLR 2024arXiv:2309.05444
143
citations
#304

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko, Alexandra Souly, Mateusz Dziemian et al.

ICLR 2025arXiv:2410.09024
143
citations
#305

Linearity of Relation Decoding in Transformer Language Models

Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.

ICLR 2024spotlightarXiv:2308.09124
143
citations
#306

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Jun Shern Chan, Neil Chowdhury, Oliver Jaffe et al.

ICLR 2025arXiv:2410.07095
142
citations
#307

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Huajian Xin, Z.Z. Ren, Junxiao Song et al.

ICLR 2025arXiv:2408.08152
142
citations
#308

Does Writing with Language Models Reduce Content Diversity?

Vishakh Padmakumar, He He

ICLR 2024arXiv:2309.05196
142
citations
#309

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Fangyu Lei, Jixuan Chen, Yuxiao Ye et al.

ICLR 2025arXiv:2411.07763
141
citations
#310

Improving LoRA in Privacy-preserving Federated Learning

Youbang Sun, Zitao Li, Yaliang Li et al.

ICLR 2024arXiv:2403.12313
141
citations
#311

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Shubham Toshniwal, Wei Du, Ivan Moshkov et al.

ICLR 2025arXiv:2410.01560
141
citations
#312

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Junyu Chen, Han Cai, Junsong Chen et al.

ICLR 2025arXiv:2410.10733
138
citations
#313

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Zuyan Liu, Yuhao Dong, Ziwei Liu et al.

ICLR 2025oralarXiv:2409.12961
138
citations
#314

Zipformer: A faster and better encoder for automatic speech recognition

Zengwei Yao, Liyong Guo, Xiaoyu Yang et al.

ICLR 2024arXiv:2310.11230
138
citations
#315

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

Juan Rocamonde, Victoriano Montesinos, Elvis Nava et al.

ICLR 2024arXiv:2310.12921
137
citations
#316

Large Language Models as Analogical Reasoners

Michihiro Yasunaga, Xinyun Chen, Yujia Li et al.

ICLR 2024arXiv:2310.01714
137
citations
#317

DiffusionSat: A Generative Foundation Model for Satellite Imagery

Samar Khanna, Patrick Liu, Linqi Zhou et al.

ICLR 2024oralarXiv:2312.03606
136
citations
#318

Mixture of LoRA Experts

xun wu, Shaohan Huang, Furu Wei

ICLR 2024arXiv:2404.13628
136
citations
#319

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Qingkai Fang, Shoutao Guo, Yan Zhou et al.

ICLR 2025arXiv:2409.06666
135
citations
#320

Denoising Diffusion Bridge Models

Linqi Zhou, Aaron Lou, Samar Khanna et al.

ICLR 2024arXiv:2309.16948
135
citations
#321

IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities

Ziyang Li, Saikat Dutta, Mayur Naik

ICLR 2025arXiv:2405.17238
135
citations
#322

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning

Xiaoxin He, Xavier Bresson, Thomas Laurent et al.

ICLR 2024arXiv:2305.19523
133
citations
#323

Automated Design of Agentic Systems

Shengran Hu, Cong Lu, Jeff Clune

ICLR 2025arXiv:2408.08435
133
citations
#324

In-context Autoencoder for Context Compression in a Large Language Model

Tao Ge, Hu Jing, Lei Wang et al.

ICLR 2024arXiv:2307.06945
132
citations
#325

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Shengpeng Ji, Ziyue Jiang, Wen Wang et al.

ICLR 2025oralarXiv:2408.16532
132
citations
#326

An LLM can Fool Itself: A Prompt-Based Adversarial Attack

Xilie Xu, Keyi Kong, Ning Liu et al.

ICLR 2024arXiv:2310.13345
131
citations
#327

Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control

Carles Domingo i Enrich, Michal Drozdzal, Brian Karrer et al.

ICLR 2025arXiv:2409.08861
131
citations
#328

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Yang Liu, Muzhi Zhu, Hengtao Li et al.

ICLR 2024arXiv:2305.13310
131
citations
#329

MogaNet: Multi-order Gated Aggregation Network

Siyuan Li, Zedong Wang, Zicheng Liu et al.

ICLR 2024arXiv:2211.03295
131
citations
#330

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

Keiran Paster, Marco Dos Santos, Zhangir Azerbayev et al.

ICLR 2024arXiv:2310.06786
130
citations
#331

Learning Performance-Improving Code Edits

Alexander Shypula, Aman Madaan, Yimeng Zeng et al.

ICLR 2024spotlightarXiv:2302.07867
130
citations
#332

Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips

Man Yao, Jiakui Hu, Tianxiang Hu et al.

ICLR 2024arXiv:2404.03663
130
citations
#333

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat et al.

ICLR 2024arXiv:2310.08461
130
citations
#334

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

Jaemin Cho, Yushi Hu, Jason Baldridge et al.

ICLR 2024arXiv:2310.18235
130
citations
#335

Manifold Preserving Guided Diffusion

Yutong He, Naoki Murata, Chieh-Hsin Lai et al.

ICLR 2024arXiv:2311.16424
129
citations
#336

Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning

Murong Yue, Jie Zhao, Min Zhang et al.

ICLR 2024arXiv:2310.03094
129
citations
#337

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Hadas Orgad, Michael Toker, Zorik Gekhman et al.

ICLR 2025arXiv:2410.02707
129
citations
#338

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver

Zhenting Qi, Mingyuan MA, Jiahang Xu et al.

ICLR 2025arXiv:2408.06195
129
citations
#339

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Di Wu, Hongwei Wang, Wenhao Yu et al.

ICLR 2025oralarXiv:2410.10813
128
citations
#340

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Shi Yu, Chaoyue Tang, Bokai Xu et al.

ICLR 2025arXiv:2410.10594
127
citations
#341

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Jing He, Haodong Li, Wei Yin et al.

ICLR 2025arXiv:2409.18124
127
citations
#342

Adapting Large Language Models via Reading Comprehension

Daixuan Cheng, Shaohan Huang, Furu Wei

ICLR 2024
126
citations
#343

What's In My Big Data?

Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.

ICLR 2024spotlightarXiv:2310.20707
126
citations
#344

Large Language Models to Enhance Bayesian Optimization

Tennison Liu, Nicolás Astorga, Nabeel Seedat et al.

ICLR 2024arXiv:2402.03921
125
citations
#345

AnyText: Multilingual Visual Text Generation and Editing

Yuxiang Tuo, Wangmeng Xiang, Jun-Yan He et al.

ICLR 2024spotlightarXiv:2311.03054
125
citations
#346

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Thomas Bush, Stephen Chung, Usman Anwar et al.

ICLR 2025arXiv:1901.03559
125
citations
#347

Scaling up Masked Diffusion Models on Text

Shen Nie, Fengqi Zhu, Chao Du et al.

ICLR 2025oralarXiv:2410.18514
124
citations
#348

ToolACE: Winning the Points of LLM Function Calling

Weiwen Liu, Xu Huang, Xingshan Zeng et al.

ICLR 2025arXiv:2409.00920
124
citations
#349

Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

Yin Fang, Xiaozhuan Liang, Ningyu Zhang et al.

ICLR 2024arXiv:2306.08018
123
citations
#350

Zoology: Measuring and Improving Recall in Efficient Language Models

Simran Arora, Sabri Eyuboglu, Aman Timalsina et al.

ICLR 2024arXiv:2312.04927
123
citations
#351

Data Scaling Laws in Imitation Learning for Robotic Manipulation

Fanqi Lin, Yingdong Hu, Pingyue Sheng et al.

ICLR 2025arXiv:2410.18647
123
citations
#352

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Liliang Ren, Yang Liu, Yadong Lu et al.

ICLR 2025arXiv:2406.07522
122
citations
#353

CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

Mohammadreza Pourreza, Hailong Li, Ruoxi Sun et al.

ICLR 2025arXiv:2410.01943
122
citations
#354

Teaching Arithmetic to Small Transformers

Nayoung Lee, Kartik Sreenivasan, Jason Lee et al.

ICLR 2024arXiv:2307.03381
121
citations
#355

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Zehan Qi, Xiao Liu, Iat Long Iong et al.

ICLR 2025arXiv:2411.02337
121
citations
#356

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Lirui Wang, Yiyang Ling, Zhecheng Yuan et al.

ICLR 2024spotlightarXiv:2310.01361
121
citations
#357

Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

Hyungjin Chung, Suhyeon Lee, Jong Chul YE

ICLR 2024arXiv:2303.05754
121
citations
#358

Text-to-3D with Classifier Score Distillation

Xin Yu, Yuan-Chen Guo, Yangguang Li et al.

ICLR 2024arXiv:2310.19415
121
citations
#359

Scaling Laws of RoPE-based Extrapolation

Xiaoran Liu, Hang Yan, Chenxin An et al.

ICLR 2024arXiv:2310.05209
121
citations
#360

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Fei Wang, XINGYU FU, James Y. Huang et al.

ICLR 2025oralarXiv:2406.09411
120
citations
#361

Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation

Shih-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao et al.

ICLR 2024arXiv:2309.14859
120
citations
#362

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Rui-Jie Zhu, Qihang Zhao, Jason Eshraghian et al.

ICLR 2025arXiv:2302.13939
120
citations
#363

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Hongjin SU, Howard Yen, Mengzhou Xia et al.

ICLR 2025arXiv:2407.12883
120
citations
#364

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Botao Ye, Sifei Liu, Haofei Xu et al.

ICLR 2025arXiv:2410.24207
119
citations
#365

RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches

Jiayuan Gu, Sean Kirmani, Paul Wohlhart et al.

ICLR 2024spotlightarXiv:2311.01977
119
citations
#366

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Gunho Park, baeseong park, Minsub Kim et al.

ICLR 2024arXiv:2206.09557
119
citations
#367

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Lijie Fan, Tianhong Li, Siyang Qin et al.

ICLR 2025arXiv:2410.13863
119
citations
#368

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

Weiyun Wang, Min Shi, Qingyun Li et al.

ICLR 2024arXiv:2308.01907
118
citations
#369

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

Sherwin Bahmani, Ivan Skorokhodov, Aliaksandr Siarohin et al.

ICLR 2025oralarXiv:2407.12781
118
citations
#370

Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models

Fei Shen, Hu Ye, Jun Zhang et al.

ICLR 2024arXiv:2310.06313
118
citations
#371

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Jiasheng Ye, Peiju Liu, Tianxiang Sun et al.

ICLR 2025arXiv:2403.16952
118
citations
#372

The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Pratyusha Sharma, Jordan Ash, Dipendra Kumar Misra

ICLR 2024arXiv:2312.13558
118
citations
#373

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Shaolei Zhang, Qingkai Fang, Yang et al.

ICLR 2025arXiv:2501.03895
117
citations
#374

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Ziyan Jiang, Rui Meng, Xinyi Yang et al.

ICLR 2025arXiv:2410.05160
117
citations
#375

Cameras as Rays: Pose Estimation via Ray Diffusion

Jason Zhang, Amy Lin, Moneish Kumar et al.

ICLR 2024arXiv:2402.14817
117
citations
#376

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling

Kaiwen Zheng, Yongxin Chen, Hanzi Mao et al.

ICLR 2025arXiv:2409.02908
116
citations
#377

Retrieval meets Long Context Large Language Models

Peng Xu, Wei Ping, Xianchao Wu et al.

ICLR 2024arXiv:2310.03025
116
citations
#378

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

Hanrong Zhang, Jingyuan Huang, Kai Mei et al.

ICLR 2025arXiv:2410.02644
116
citations
#379

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Min Shi, Fuxiao Liu, Shihao Wang et al.

ICLR 2025arXiv:2408.15998
116
citations
#380

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Zeyuan Allen-Zhu, Yuanzhi Li

ICLR 2025arXiv:2404.05405
116
citations
#381

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Xiaogeng Liu, Peiran Li, G. Edward Suh et al.

ICLR 2025arXiv:2410.05295
115
citations
#382

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang et al.

ICLR 2024arXiv:2309.14393
115
citations
#383

Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching

Ziyao Guo, Kai Wang, George Cazenavette et al.

ICLR 2024arXiv:2310.05773
114
citations
#384

Universal Jailbreak Backdoors from Poisoned Human Feedback

Javier Rando, Florian Tramer

ICLR 2024arXiv:2311.14455
114
citations
#385

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Suhas Kotha, Jacob Springer, Aditi Raghunathan

ICLR 2024arXiv:2309.10105
114
citations
#386

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Neel Jain, Ping-yeh Chiang, Yuxin Wen et al.

ICLR 2024arXiv:2310.05914
113
citations
#387

Tamper-Resistant Safeguards for Open-Weight LLMs

Rishub Tamirisa, Bhrugu Bharathi, Long Phan et al.

ICLR 2025arXiv:2408.00761
113
citations
#388

HelpSteer2-Preference: Complementing Ratings with Preferences

Zhilin Wang, Alexander Bukharin, Olivier Delalleau et al.

ICLR 2025arXiv:2410.01257
112
citations
#389

FasterViT: Fast Vision Transformers with Hierarchical Attention

Ali Hatamizadeh, Greg Heinrich, Hongxu Yin et al.

ICLR 2024arXiv:2306.06189
112
citations
#390

A Sanity Check for AI-generated Image Detection

Shilin Yan, Ouxiang Li, Jiayin Cai et al.

ICLR 2025arXiv:2406.19435
112
citations
#391

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Haiming Wang, Huajian Xin, Chuanyang Zheng et al.

ICLR 2024arXiv:2310.00656
112
citations
#392

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Yue Wu, Xuan Tang, Tom Mitchell et al.

ICLR 2024arXiv:2310.01557
111
citations
#393

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2024spotlightarXiv:2309.11489
111
citations
#394

EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE

Zeyi Liao, Lingbo Mo, Chejian Xu et al.

ICLR 2025arXiv:2409.11295
111
citations
#395

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Yingqing He, Shaoshu Yang, Haoxin Chen et al.

ICLR 2024spotlightarXiv:2310.07702
111
citations
#396

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Yuchen Duan, Weiyun Wang, Zhe Chen et al.

ICLR 2025arXiv:2403.02308
111
citations
#397

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

Chunming He, Kai Li, Yachao Zhang et al.

ICLR 2024arXiv:2308.03166
110
citations
#398

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

Longtao Zheng, Rundong Wang, Xinrun Wang et al.

ICLR 2024arXiv:2306.07863
110
citations
#399

Autoregressive Video Generation without Vector Quantization

Haoge Deng, Ting Pan, Haiwen Diao et al.

ICLR 2025oralarXiv:2412.14169
110
citations
#400

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

Linlu Qiu, Liwei Jiang, Ximing Lu et al.

ICLR 2024arXiv:2310.08559
110
citations