Research Alpha Leak - Rising Stars in Research

#1

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Zhuoyi Yang, Jiayan Teng, Wendi Zheng et al.

ICLR 2025

1,318

citations

#2

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Naman Jain, Han, Alex Gu et al.

ICLR 2025

1,016

citations

#3

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Jipeng Zhang, Hanze Dong, Tong Zhang et al.

ICLR 2025

642

citations

#4

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Haipeng Luo, Qingfeng Sun, Can Xu et al.

ICLR 2025

629

citations

#5

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Jinheng Xie, Weijia Mao, Zechen Bai et al.

ICLR 2025

455

citations

#6

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

Chenhao Tan, Robert Ness, Amit Sharma et al.

ICLR 2025

386

citations

#7

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

Maksym Andriushchenko, francesco croce, Nicolas Flammarion

ICLR 2025

375

citations

#8

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Songming Liu, Lingxuan Wu, Bangguo Li et al.

ICLR 2025

365

citations

#9

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

Xingyao Wang, Boxuan Li, Yufan Song et al.

ICLR 2025

351

citations

#10

Generative Verifiers: Reward Modeling as Next-Token Prediction

Lunjun Zhang, Arian Hosseini, Hritik Bansal et al.

ICLR 2025

348

citations

#11

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Sihyun Yu, Sangkyung Kwak, Huiwon Jang et al.

ICLR 2025

308

citations

#12

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Alexey Bochkovskiy, Amaël Delaunoy, Hugo Germain et al.

ICLR 2025

299

citations

#13

Scaling and evaluating sparse autoencoders

Leo Gao, Tom Dupre la Tour, Henk Tillman et al.

ICLR 2025

298

citations

#14

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Chunting Zhou, Lili Yu, Arun Babu et al.

ICLR 2025

294

citations

#15

Safety Alignment Should be Made More Than Just a Few Tokens Deep

Xiangyu Qi, Ashwinee Panda, Kaifeng Lyu et al.

ICLR 2025

277

citations

#16

Mixture-of-Agents Enhances Large Language Model Capabilities

Junlin Wang, Jue Wang, Ben Athiwaratkun et al.

ICLR 2025

274

citations

#17

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Chenglei Si, Diyi Yang, Tatsunori Hashimoto

ICLR 2025

272

citations

#18

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

Junyi Zhang, Charles Herrmann, Junhwa Hur et al.

ICLR 2025

262

citations

#19

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Samuel Marks, Can Rager, Eric Michaud et al.

ICLR 2025

252

citations

#20

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Zayne Sprague, Fangcong Yin, Juan Rodriguez et al.

ICLR 2025

239

citations

ICLR

Top Papers in ICLR 2025

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

Generative Verifiers: Reward Modeling as Next-Token Prediction

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Scaling and evaluating sparse autoencoders

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Safety Alignment Should be Made More Than Just a Few Tokens Deep

Mixture-of-Agents Enhances Large Language Model Capabilities

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning