Most Cited ICLR "ranking-based supervision" Papers

6,124 papers found • Page 4 of 31

#601

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation

Hong Chen, Yipeng Zhang, Simin Wu et al.

ICLR 2024arXiv:2305.03374
75
citations
#602

Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents

Yang Deng, Wenxuan Zhang, Wai Lam et al.

ICLR 2024arXiv:2311.00262
74
citations
#603

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Davide Paglieri, Bartłomiej Cupiał, Samuel Coward et al.

ICLR 2025arXiv:2411.13543
74
citations
#604

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Wei Chow, Jiageng Mao, Boyi Li et al.

ICLR 2025arXiv:2501.16411
74
citations
#605

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

Ziyue Jiang, Jinglin Liu, Yi Ren et al.

ICLR 2024arXiv:2307.07218
74
citations
#606

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time

Chenhui Deng, Zichao Yue, Zhiru Zhang

ICLR 2024arXiv:2403.01232
74
citations
#607

Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs

Yuxin Zhang, Lirui Zhao, Mingbao Lin et al.

ICLR 2024arXiv:2310.08915
74
citations
#608

The Geometry of Categorical and Hierarchical Concepts in Large Language Models

Kiho Park, Yo Joong Choe, Yibo Jiang et al.

ICLR 2025arXiv:2406.01506
74
citations
#609

FreDF: Learning to Forecast in the Frequency Domain

Hao Wang, Lichen Pan, Yuan Shen et al.

ICLR 2025arXiv:2402.02399
73
citations
#610

Overthinking the Truth: Understanding how Language Models Process False Demonstrations

Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt

ICLR 2024spotlightarXiv:2307.09476
73
citations
#611

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Jiayan Teng, Wendi Zheng, Ming Ding et al.

ICLR 2024spotlightarXiv:2309.03350
73
citations
#612

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Yunfei Xie, Ce Zhou, Lang Gao et al.

ICLR 2025arXiv:2408.02900
73
citations
#613

Planning in Natural Language Improves LLM Search for Code Generation

Evan Wang, Federico Cassano, Catherine Wu et al.

ICLR 2025arXiv:2409.03733
73
citations
#614

PromptTTS 2: Describing and Generating Voices with Text Prompt

Yichong Leng, ZHifang Guo, Kai Shen et al.

ICLR 2024arXiv:2309.02285
73
citations
#615

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

YEFEI HE, Jing Liu, Weijia Wu et al.

ICLR 2024oralarXiv:2310.03270
73
citations
#616

In-Context Learning through the Bayesian Prism

Madhur Panwar, Kabir Ahuja, Navin Goyal

ICLR 2024arXiv:2306.04891
72
citations
#617

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Zachary Ankner, Cody Blakeney, Kartik Sreenivasan et al.

ICLR 2025arXiv:2405.20541
72
citations
#618

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

Kuofeng Gao, Yang Bai, Jindong Gu et al.

ICLR 2024oralarXiv:2401.11170
72
citations
#619

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Melissa Hall, Michal Drozdzal, Oscar Mañas et al.

ICLR 2025arXiv:2403.17804
71
citations
#620

Frequency-Aware Transformer for Learned Image Compression

Han Li, Shaohui Li, Wenrui Dai et al.

ICLR 2024arXiv:2310.16387
71
citations
#621

Simple Hierarchical Planning with Diffusion

Chang Chen, Fei Deng, Kenji Kawaguchi et al.

ICLR 2024oralarXiv:2401.02644
71
citations
#622

Scalable Diffusion for Materials Generation

Sherry Yang, Kwanghwan Cho, Amil Merchant et al.

ICLR 2024arXiv:2311.09235
71
citations
#623

TorchRL: A data-driven decision-making library for PyTorch

Albert Bou, Matteo Bettini, Sebastian Dittert et al.

ICLR 2024spotlightarXiv:2306.00577
71
citations
#624

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Licong Lin, Yu Bai, Song Mei

ICLR 2024arXiv:2310.08566
70
citations
#625

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Antonis Antoniades, Albert Örwall, Kexun Zhang et al.

ICLR 2025arXiv:2410.20285
70
citations
#626

DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving

Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.

ICLR 2025oralarXiv:2503.07656
70
citations
#627

GameGen-X: Interactive Open-world Game Video Generation

Haoxuan Che, Xuanhua He, Quande Liu et al.

ICLR 2025arXiv:2411.00769
70
citations
#628

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Xiao Liu, Tianjie Zhang, Yu Gu et al.

ICLR 2025arXiv:2408.06327
70
citations
#629

InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior

Chenguo Lin, Yadong MU

ICLR 2024spotlightarXiv:2402.04717
70
citations
#630

On the Learnability of Watermarks for Language Models

Chenchen Gu, XIANG LI, Percy Liang et al.

ICLR 2024arXiv:2312.04469
70
citations
#631

Weak to Strong Generalization for Large Language Models with Multi-capabilities

Yucheng Zhou, Jianbing Shen, Yu Cheng

ICLR 2025
70
citations
#632

Circumventing Concept Erasure Methods For Text-To-Image Generative Models

Minh Pham, Kelly Marshall, Niv Cohen et al.

ICLR 2024arXiv:2308.01508
70
citations
#633

Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization

Yiyang Chen, Zhedong Zheng, Wei Ji et al.

ICLR 2024arXiv:2211.07394
70
citations
#634

Arithmetic Without Algorithms: Language Models Solve Math with a Bag of Heuristics

Yaniv Nikankin, Anja Reusch, Aaron Mueller et al.

ICLR 2025arXiv:2410.21272
70
citations
#635

MagicPIG: LSH Sampling for Efficient LLM Generation

Zhuoming Chen, Ranajoy Sadhukhan, Zihao Ye et al.

ICLR 2025arXiv:2410.16179
69
citations
#636

Accelerating Diffusion Transformers with Token-wise Feature Caching

Chang Zou, Xuyang Liu, Ting Liu et al.

ICLR 2025arXiv:2410.05317
69
citations
#637

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Cheng Yang, Chufan Shi, Yaxin Liu et al.

ICLR 2025arXiv:2406.09961
69
citations
#638

How do Language Models Bind Entities in Context?

Jiahai Feng, Jacob Steinhardt

ICLR 2024arXiv:2310.17191
69
citations
#639

CycleResearcher: Improving Automated Research via Automated Review

Yixuan Weng, Minjun Zhu, Guangsheng Bao et al.

ICLR 2025arXiv:2411.00816
69
citations
#640

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Ke Yang, Yao Liu, Sapana Chaudhary et al.

ICLR 2025arXiv:2410.13825
69
citations
#641

SolidGen: An Autoregressive Model for Direct B-rep Synthesis

Karl Willis, Joseph Lambourne, Nigel Morris et al.

ICLR 2024
69
citations
#642

Deep Confident Steps to New Pockets: Strategies for Docking Generalization

Gabriele Corso, Arthur Deng, Nicholas Polizzi et al.

ICLR 2024arXiv:2402.18396
69
citations
#643

Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram

Yeongyeon Na, Minje Park, Yunwon Tae et al.

ICLR 2024oralarXiv:2402.09450
69
citations
#644

Does Refusal Training in LLMs Generalize to the Past Tense?

Maksym Andriushchenko, Nicolas Flammarion

ICLR 2025arXiv:2407.11969
69
citations
#645

Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments

Hongjin SU, Ruoxi Sun, Jinsung Yoon et al.

ICLR 2025arXiv:2501.10893
69
citations
#646

Long Context Compression with Activation Beacon

Peitian Zhang, Zheng Liu, Shitao Xiao et al.

ICLR 2025arXiv:2401.03462
68
citations
#647

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

Zheng Chong, Xiao Dong, Haoxiang Li et al.

ICLR 2025arXiv:2407.15886
68
citations
#648

Grokking as the transition from lazy to rich training dynamics

Tanishq Kumar, Blake Bordelon, Samuel Gershman et al.

ICLR 2024arXiv:2310.06110
68
citations
#649

Multi-Source Diffusion Models for Simultaneous Music Generation and Separation

Giorgio Mariani, Irene Tallini, Emilian Postolache et al.

ICLR 2024arXiv:2302.02257
68
citations
#650

Recursive Generalization Transformer for Image Super-Resolution

Zheng Chen, Yulun Zhang, Jinjin Gu et al.

ICLR 2024arXiv:2303.06373
68
citations
#651

ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability

Zhongxiang Sun, Xiaoxue Zang, Kai Zheng et al.

ICLR 2025arXiv:2410.11414
68
citations
#652

Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement

Kai Xu, Rongyu Chen, Gianni Franchi et al.

ICLR 2024arXiv:2310.00227
68
citations
#653

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Jing Liu, Ruihao Gong, Xiuying Wei et al.

ICLR 2024arXiv:2310.08041
68
citations
#654

Scaling Laws for Precision

Tanishq Kumar, Zachary Ankner, Benjamin Spector et al.

ICLR 2025arXiv:2411.04330
68
citations
#655

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Tianchen Zhao, Tongcheng Fang, Haofeng Huang et al.

ICLR 2025arXiv:2406.02540
68
citations
#656

ImageFolder: Autoregressive Image Generation with Folded Tokens

Xiang Li, Kai Qiu, Hao Chen et al.

ICLR 2025arXiv:2410.01756
68
citations
#657

Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models

Fushuo Huo, Wenchao Xu, Zhong Zhang et al.

ICLR 2025arXiv:2408.02032
68
citations
#658

CATCH: Channel-Aware Multivariate Time Series Anomaly Detection via Frequency Patching

Xingjian Wu, Xiangfei Qiu, Zhengyu Li et al.

ICLR 2025arXiv:2410.12261
68
citations
#659

Tensor Programs VI: Feature Learning in Infinite Depth Neural Networks

Greg Yang, Dingli Yu, Chen Zhu et al.

ICLR 2024arXiv:2310.02244
68
citations
#660

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models

Mert Yuksekgonul, Varun Chandrasekaran, Erik Jones et al.

ICLR 2024arXiv:2309.15098
68
citations
#661

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Weize Chen, Ziming You, Ran Li et al.

ICLR 2025arXiv:2407.07061
68
citations
#662

Deep Temporal Graph Clustering

Meng Liu, Yue Liu, KE LIANG et al.

ICLR 2024oralarXiv:2305.10738
67
citations
#663

Process Reward Model with Q-value Rankings

Wendi Li, Yixuan Li

ICLR 2025arXiv:2410.11287
67
citations
#664

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Haotian Zhang, Mingfei Gao, Zhe Gan et al.

ICLR 2025arXiv:2409.20566
67
citations
#665

Successor Heads: Recurring, Interpretable Attention Heads In The Wild

Rhys Gould, Euan Ong, George Ogden et al.

ICLR 2024arXiv:2312.09230
67
citations
#666

Looped Transformers are Better at Learning Learning Algorithms

Liu Yang, Kangwook Lee, Robert Nowak et al.

ICLR 2024arXiv:2311.12424
67
citations
#667

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition

Feng Lu, Lijun Zhang, Xiangyuan Lan et al.

ICLR 2024arXiv:2402.14505
67
citations
#668

AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Teun van der Weij, Felix Hofstätter, Oliver Jaffe et al.

ICLR 2025arXiv:2406.07358
67
citations
#669

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Xiangyu Zeng, Kunchang Li, Chenting Wang et al.

ICLR 2025oralarXiv:2410.19702
67
citations
#670

Learning Dynamics of LLM Finetuning

YI REN, Danica Sutherland

ICLR 2025arXiv:2407.10490
67
citations
#671

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Nick Jiang, Anish Kachinthaya, Suzanne Petryk et al.

ICLR 2025arXiv:2410.02762
67
citations
#672

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

Andy K Zhang, Neil Perry, Riya Dulepet et al.

ICLR 2025arXiv:2408.08926
67
citations
#673

Matryoshka Diffusion Models

Jiatao Gu, Shuangfei Zhai, Yizhe Zhang et al.

ICLR 2024arXiv:2310.15111
67
citations
#674

Space Group Constrained Crystal Generation

Rui Jiao, Wenbing Huang, Yu Liu et al.

ICLR 2024arXiv:2402.03992
66
citations
#675

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ICLR 2024arXiv:2403.14119
66
citations
#676

Image and Video Tokenization with Binary Spherical Quantization

Yue Zhao, Yuanjun Xiong, Philipp Krähenbühl

ICLR 2025arXiv:2406.07548
66
citations
#677

Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps

Mingxiao Li, Tingyu Qu, Ruicong Yao et al.

ICLR 2024arXiv:2305.15583
66
citations
#678

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models

Jiuding Sun, Chantal Shaib, Byron Wallace

ICLR 2024spotlightarXiv:2306.11270
66
citations
#679

An Emulator for Fine-tuning Large Language Models using Small Language Models

Eric Mitchell, Rafael Rafailov, Archit Sharma et al.

ICLR 2024oralarXiv:2310.12962
66
citations
#680

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Hritik Bansal, Arian Hosseini, Rishabh Agarwal et al.

ICLR 2025arXiv:2408.16737
66
citations
#681

GIM: Learning Generalizable Image Matcher From Internet Videos

Xuelun Shen, zhipeng cai, Wei Yin et al.

ICLR 2024spotlightarXiv:2402.11095
66
citations
#682

On the Foundations of Shortcut Learning

Katherine Hermann, Hossein Mobahi, Thomas FEL et al.

ICLR 2024spotlightarXiv:2310.16228
66
citations
#683

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Hyungjoo Chae, Namyoung Kim, Kai Ong et al.

ICLR 2025arXiv:2410.13232
65
citations
#684

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

Liqiang Jing, Zhehui Huang, Xiaoyang Wang et al.

ICLR 2025arXiv:2409.07703
65
citations
#685

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences

Canyu Zhao, Mingyu Liu, Wen Wang et al.

ICLR 2025arXiv:2407.16655
65
citations
#686

Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization

Jin Zhou, Charles Staats, Wenda Li et al.

ICLR 2024arXiv:2403.18120
65
citations
#687

TabR: Tabular Deep Learning Meets Nearest Neighbors

Yury Gorishniy, Ivan Rubachev, Nikolay Kartashev et al.

ICLR 2024arXiv:2307.14338
65
citations
#688

GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs

Pengcheng Jiang, Cao Xiao, Adam Cross et al.

ICLR 2024arXiv:2305.12788
65
citations
#689

Variational Bayesian Last Layers

James Harrison, John Willes, Jasper Snoek

ICLR 2024spotlightarXiv:2404.11599
65
citations
#690

Robust agents learn causal world models

Jonathan Richens, Tom Everitt

ICLR 2024arXiv:2402.10877
65
citations
#691

Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control

Aleksandar Makelov, Georg Lange, Neel Nanda

ICLR 2025arXiv:2405.08366
65
citations
#692

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems

Guibin Zhang, Yanwei Yue, Zhixun Li et al.

ICLR 2025oralarXiv:2410.02506
64
citations
#693

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Shuai Tan, Biao Gong, Xiang Wang et al.

ICLR 2025oralarXiv:2410.10306
64
citations
#694

Making Pre-trained Language Models Great on Tabular Prediction

Jiahuan Yan, Bo Zheng, Hongxia Xu et al.

ICLR 2024spotlightarXiv:2403.01841
64
citations
#695

CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech

Jaehyeon Kim, Keon Lee, Seungjun Chung et al.

ICLR 2024arXiv:2404.02781
64
citations
#696

ImagenHub: Standardizing the evaluation of conditional image generation models

Max Ku, Tianle Li, Kai Zhang et al.

ICLR 2024arXiv:2310.01596
64
citations
#697

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Ranajoy Sadhukhan, Jian Chen, Zhuoming Chen et al.

ICLR 2025arXiv:2408.11049
64
citations
#698

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Chien-yu Huang, Wei-Chih Chen, Shu-wen Yang et al.

ICLR 2025arXiv:2411.05361
64
citations
#699

Masked Audio Generation using a Single Non-Autoregressive Transformer

Alon Ziv, Itai Gat, Gael Le Lan et al.

ICLR 2024arXiv:2401.04577
64
citations
#700

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Qingru Zhang, Chandan Singh, Liyuan Liu et al.

ICLR 2024arXiv:2311.02262
64
citations
#701

Sentence-level Prompts Benefit Composed Image Retrieval

Yang Bai, Xinxing Xu, Yong Liu et al.

ICLR 2024spotlightarXiv:2310.05473
63
citations
#702

Monte Carlo guided Denoising Diffusion models for Bayesian linear inverse problems.

Gabriel Cardoso, Yazid Janati el idrissi, Sylvain Le Corff et al.

ICLR 2024
63
citations
#703

Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization

Dinghuai Zhang, Ricky T. Q. Chen, Chenghao Liu et al.

ICLR 2024arXiv:2310.02679
63
citations
#704

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

Zihan Ding, Chi Jin

ICLR 2024arXiv:2309.16984
63
citations
#705

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Jiaming Liu, Senqiao Yang, Peidong Jia et al.

ICLR 2024arXiv:2306.04344
63
citations
#706

Toward effective protection against diffusion-based mimicry through score distillation

Haotian Xue, Chumeng Liang, Xiaoyu Wu et al.

ICLR 2024arXiv:2311.12832
63
citations
#707

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Vighnesh Subramaniam, Yilun Du, Joshua B Tenenbaum et al.

ICLR 2025arXiv:2501.05707
63
citations
#708

Matryoshka Multimodal Models

Mu Cai, Jianwei Yang, Jianfeng Gao et al.

ICLR 2025arXiv:2405.17430
63
citations
#709

Generative Pre-training for Speech with Flow Matching

Alexander Liu, Matthew Le, Apoorv Vyas et al.

ICLR 2024arXiv:2310.16338
63
citations
#710

PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks

Zhiyuan Zhao, Xueying Ding, B. Aditya Prakash

ICLR 2024oralarXiv:2307.11833
62
citations
#711

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Julian Parker, Anton Smirnov, Jordi Pons et al.

ICLR 2025arXiv:2411.19842
62
citations
#712

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

Shicong Cen, Jincheng Mei, Katayoon Goshvadi et al.

ICLR 2025arXiv:2405.19320
62
citations
#713

FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

Xunhao Lai, Jianqiao Lu, Yao Luo et al.

ICLR 2025arXiv:2502.20766
62
citations
#714

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Hanlin Tang, Yang Lin, Jing Lin et al.

ICLR 2025arXiv:2407.15891
62
citations
#715

PnP-Flow: Plug-and-Play Image Restoration with Flow Matching

Ségolène Martin, Anne Gagneux, Paul Hagemann et al.

ICLR 2025arXiv:2410.02423
62
citations
#716

DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

Junyuan Hong, Jiachen (Tianhao) Wang, Chenhui Zhang et al.

ICLR 2024spotlightarXiv:2312.03724
62
citations
#717

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Xue JIANG, Feng Liu, Zhen Fang et al.

ICLR 2024spotlightarXiv:2403.20078
62
citations
#718

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

Jaehong Yoon, Shoubin Yu, Vaidehi Ramesh Patil et al.

ICLR 2025arXiv:2410.12761
62
citations
#719

SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models

Dingli Yu, Simran Kaur, Arushi Gupta et al.

ICLR 2024arXiv:2310.17567
61
citations
#720

AgentSquare: Automatic LLM Agent Search in Modular Design Space

Yu Shang, Yu Li, Keyu Zhao et al.

ICLR 2025arXiv:2410.06153
61
citations
#721

SafeDiffuser: Safe Planning with Diffusion Probabilistic Models

Wei Xiao, Johnson (Tsun-Hsuan) Wang, Chuang Gan et al.

ICLR 2025arXiv:2306.00148
61
citations
#722

BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

Frederikke Marin, Felix Teufel, Marc Horlacher et al.

ICLR 2024arXiv:2311.12570
61
citations
#723

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Hung Le, Hailin Chen, Amrita Saha et al.

ICLR 2024arXiv:2310.08992
61
citations
#724

Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

Chunming He, Chengyu Fang, Yulun Zhang et al.

ICLR 2025arXiv:2311.11638
61
citations
#725

Compressing LLMs: The Truth is Rarely Pure and Never Simple

AJAY JAISWAL, Zhe Gan, Xianzhi Du et al.

ICLR 2024arXiv:2310.01382
61
citations
#726

Proteina: Scaling Flow-based Protein Structure Generative Models

Tomas Geffner, Kieran Didi, Zuobai Zhang et al.

ICLR 2025arXiv:2503.00710
61
citations
#727

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Ziru Chen, Shijie Chen, Yuting Ning et al.

ICLR 2025arXiv:2410.05080
61
citations
#728

Building Math Agents with Multi-Turn Iterative Preference Learning

Wei Xiong, Chengshuai Shi, Jiaming Shen et al.

ICLR 2025arXiv:2409.02392
61
citations
#729

See What You Are Told: Visual Attention Sink in Large Multimodal Models

Seil Kang, Jinyeong Kim, Junhyeok Kim et al.

ICLR 2025arXiv:2503.03321
61
citations
#730

LEAP: Liberate Sparse-View 3D Modeling from Camera Poses

Hanwen Jiang, Zhenyu Jiang, Yue Zhao et al.

ICLR 2024arXiv:2310.01410
61
citations
#731

LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving

Tianyu Li, Peijin Jia, Bangjun Wang et al.

ICLR 2024arXiv:2312.16108
61
citations
#732

Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks

Marc Rußwurm, Konstantin Klemmer, Esther Rolf et al.

ICLR 2024spotlightarXiv:2310.06743
61
citations
#733

The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing

Shen Nie, Hanzhong Guo, Cheng Lu et al.

ICLR 2024arXiv:2311.01410
61
citations
#734

Language Model Inversion

John X. Morris, Wenting Zhao, Justin Chiu et al.

ICLR 2024arXiv:2311.13647
60
citations
#735

Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation

Tiansheng Huang, Sihao Hu, Fatih Ilhan et al.

ICLR 2025arXiv:2409.01586
60
citations
#736

Differentially Private Synthetic Data via Foundation Model APIs 1: Images

Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni et al.

ICLR 2024arXiv:2305.15560
60
citations
#737

Repetition Improves Language Model Embeddings

Jacob Springer, Suhas Kotha, Daniel Fried et al.

ICLR 2025arXiv:2402.15449
60
citations
#738

Lemur: Integrating Large Language Models in Automated Program Verification

Haoze Wu, Clark Barrett, Nina Narodytska

ICLR 2024arXiv:2310.04870
60
citations
#739

Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning

Yu Fu, Zefan Cai, Abedelkadir Asi et al.

ICLR 2025arXiv:2410.19258
60
citations
#740

Self-Improvement in Language Models: The Sharpening Mechanism

Audrey Huang, Adam Block, Dylan Foster et al.

ICLR 2025arXiv:2412.01951
60
citations
#741

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

Hyeonho Jeong, Jong Chul YE

ICLR 2024oralarXiv:2310.01107
60
citations
#742

COLLIE: Systematic Construction of Constrained Text Generation Tasks

Shunyu Yao, Howard Chen, Austin Hanjie et al.

ICLR 2024arXiv:2307.08689
59
citations
#743

Massive Editing for Large Language Models via Meta Learning

Chenmien Tan, Ge Zhang, Jie Fu

ICLR 2024arXiv:2311.04661
59
citations
#744

Exploring Target Representations for Masked Autoencoders

xingbin liu, Jinghao Zhou, Tao Kong et al.

ICLR 2024arXiv:2209.03917
59
citations
#745

On Diffusion Modeling for Anomaly Detection

Victor Livernoche, Vineet Jain, Yashar Hezaveh et al.

ICLR 2024spotlightarXiv:2305.18593
59
citations
#746

Tell me about yourself: LLMs are aware of their learned behaviors

Jan Betley, Xuchan Bao, Martín Soto et al.

ICLR 2025oralarXiv:2501.11120
59
citations
#747

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Wei-Bang Jiang, Yansen Wang, Bao-liang Lu et al.

ICLR 2025oralarXiv:2409.00101
58
citations
#748

Magnushammer: A Transformer-Based Approach to Premise Selection

Maciej Mikuła, Szymon Tworkowski, Szymon Antoniak et al.

ICLR 2024arXiv:2303.04488
58
citations
#749

How to Evaluate Reward Models for RLHF

Evan Frick, Tianle Li, Connor Chen et al.

ICLR 2025arXiv:2410.14872
58
citations
#750

Multi-View Causal Representation Learning with Partial Observability

Dingling Yao, Danru Xu, Sébastien Lachapelle et al.

ICLR 2024spotlightarXiv:2311.04056
58
citations
#751

Towards Semantic Equivalence of Tokenization in Multimodal LLM

Shengqiong Wu, Hao Fei, Xiangtai Li et al.

ICLR 2025arXiv:2406.05127
58
citations
#752

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Zhengyao Lyu, Chenyang Si, Junhao Song et al.

ICLR 2025oralarXiv:2410.19355
58
citations
#753

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Xiang Li, Cristina Mata, Jongwoo Park et al.

ICLR 2025arXiv:2406.20095
58
citations
#754

OWL: A Large Language Model for IT Operations

Hongcheng Guo, Jian Yang, Jiaheng Liu et al.

ICLR 2024arXiv:2309.09298
58
citations
#755

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Mufei Li, Siqi Miao, Pan Li

ICLR 2025arXiv:2410.20724
58
citations
#756

What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?

Guangkai Xu, yongtao ge, Mingyu Liu et al.

ICLR 2025arXiv:2403.06090
58
citations
#757

EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

Jiaxiang Tang, Max Li, Zekun Hao et al.

ICLR 2025arXiv:2409.18114
58
citations
#758

MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images

Xurui Li, Ziming Huang, Feng Xue et al.

ICLR 2024arXiv:2401.16753
58
citations
#759

Hymba: A Hybrid-head Architecture for Small Language Models

Xin Dong, Yonggan Fu, Shizhe Diao et al.

ICLR 2025arXiv:2411.13676
58
citations
#760

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

Parshin Shojaee, Kazem Meidani, Shashank Gupta et al.

ICLR 2025arXiv:2404.18400
57
citations
#761

Controlling Space and Time with Diffusion Models

Daniel Watson, Saurabh Saxena, Lala Li et al.

ICLR 2025arXiv:2407.07860
57
citations
#762

Physics-Informed Diffusion Models

Jan-Hendrik Bastek, WaiChing Sun, Dennis Kochmann

ICLR 2025arXiv:2403.14404
57
citations
#763

An Unforgeable Publicly Verifiable Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu et al.

ICLR 2024arXiv:2307.16230
57
citations
#764

LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch

caigao jiang, Xiang Shu, Hong Qian et al.

ICLR 2025arXiv:2410.13213
57
citations
#765

Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking

Kaifeng Lyu, Jikai Jin, Zhiyuan Li et al.

ICLR 2024arXiv:2311.18817
57
citations
#766

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

Yinan Zheng, Jianxiong Li, Dongjie Yu et al.

ICLR 2024arXiv:2401.10700
56
citations
#767

Raidar: geneRative AI Detection viA Rewriting

Chengzhi Mao, Carl Vondrick, Hao Wang et al.

ICLR 2024arXiv:2401.12970
56
citations
#768

Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Seyedmorteza Sadat, Otmar Hilliges, Romann Weber

ICLR 2025arXiv:2410.02416
56
citations
#769

CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Guy Tevet, Sigal Raab, Setareh Cohan et al.

ICLR 2025arXiv:2410.03441
56
citations
#770

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Yangning Li, Yinghui Li, Xinyu Wang et al.

ICLR 2025arXiv:2411.02937
56
citations
#771

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction

Yilan Zhang, Yingxue XU, Jianqi Chen et al.

ICLR 2024spotlightarXiv:2401.01646
56
citations
#772

TabM: Advancing tabular deep learning with parameter-efficient ensembling

Yury Gorishniy, Akim Kotelnikov, Artem Babenko

ICLR 2025arXiv:2410.24210
56
citations
#773

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Xinhua Cheng, Tianyu Yang, Jianan Wang et al.

ICLR 2024arXiv:2310.11784
56
citations
#774

Controlled Text Generation via Language Model Arithmetic

Jasper Dekoninck, Marc Fischer, Luca Beurer-Kellner et al.

ICLR 2024spotlightarXiv:2311.14479
56
citations
#775

Simplifying Deep Temporal Difference Learning

Matteo Gallici, Mattie Fellows, Benjamin Ellis et al.

ICLR 2025oralarXiv:2407.04811
56
citations
#776

From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction

Nima Shoghi, Adeesh Kolluru, John Kitchin et al.

ICLR 2024arXiv:2310.16802
56
citations
#777

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

Francis Engelmann, Fabian Manhardt, Michael Niemeyer et al.

ICLR 2024arXiv:2404.03650
56
citations
#778

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph

Siru Ouyang, Wenhao Yu, Kaixin Ma et al.

ICLR 2025arXiv:2410.14684
56
citations
#779

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

Jeongyeol Kwon, Dohyun Kwon, Stephen Wright et al.

ICLR 2024spotlightarXiv:2309.01753
56
citations
#780

In-Context Learning Learns Label Relationships but Is Not Conventional Learning

Jannik Kossen, Yarin Gal, Tom Rainforth

ICLR 2024arXiv:2307.12375
56
citations
#781

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Hanan Gani, Shariq Bhat, Muzammal Naseer et al.

ICLR 2024arXiv:2310.10640
56
citations
#782

Towards Interpreting Visual Information Processing in Vision-Language Models

Clement Neo, Luke Ong, Philip Torr et al.

ICLR 2025arXiv:2410.07149
56
citations
#783

Seer: Language Instructed Video Prediction with Latent Diffusion Models

Xianfan Gu, Chuan Wen, Weirui Ye et al.

ICLR 2024oralarXiv:2303.14897
55
citations
#784

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology

Xiangyu Wang, Donglin Yang, ziqin wang et al.

ICLR 2025arXiv:2410.07087
55
citations
#785

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling

Haoyu Lu, Yuqi Huo, Guoxing Yang et al.

ICLR 2024arXiv:2302.06605
55
citations
#786

Test-Time Training on Nearest Neighbors for Large Language Models

Moritz Hardt, Yu Sun

ICLR 2024arXiv:2305.18466
55
citations
#787

LLM Unlearning via Loss Adjustment with Only Forget Data

Yaxuan Wang, Jiaheng Wei, Yuhao Liu et al.

ICLR 2025arXiv:2410.11143
55
citations
#788

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Jianhong Bai, Menghan Xia, Xintao WANG et al.

ICLR 2025arXiv:2412.07760
55
citations
#789

Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness

Bohang Zhang, Jingchu Gai, Yiheng Du et al.

ICLR 2024arXiv:2401.08514
55
citations
#790

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Bingchen Zhao, Haoqin Tu, Chen Wei et al.

ICLR 2024spotlightarXiv:2312.11420
55
citations
#791

SALMON: Self-Alignment with Instructable Reward Models

Zhiqing Sun, Yikang Shen, Hongxin Zhang et al.

ICLR 2024arXiv:2310.05910
55
citations
#792

Model merging with SVD to tie the Knots

George Stoica, Pratik Ramesh, Boglarka Ecsedi et al.

ICLR 2025arXiv:2410.19735
55
citations
#793

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks

Kaijing Ma, Xeron Du, Yunran Wang et al.

ICLR 2025arXiv:2410.06526
55
citations
#794

Energy-Based Diffusion Language Models for Text Generation

Minkai Xu, Tomas Geffner, Karsten Kreis et al.

ICLR 2025arXiv:2410.21357
55
citations
#795

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

chenjie cao, xinlin ren, Yanwei Fu

ICLR 2024arXiv:2401.11673
54
citations
#796

SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection

Han Shen, Pin-Yu Chen, Payel Das et al.

ICLR 2025arXiv:2410.07471
54
citations
#797

LoRA-Pro: Are Low-Rank Adapters Properly Optimized?

Zhengbo Wang, Jian Liang, Ran He et al.

ICLR 2025arXiv:2407.18242
54
citations
#798

MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Chenxi Wang, Xiang Chen, Ningyu Zhang et al.

ICLR 2025arXiv:2410.11779
54
citations
#799

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

Audrey Huang, Wenhao Zhan, Tengyang Xie et al.

ICLR 2025arXiv:2407.13399
54
citations
#800

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Yiheng Xu, Dunjie Lu, Zhennan Shen et al.

ICLR 2025arXiv:2412.09605
54
citations