Most Cited ICLR "generalization curves" Papers

6,124 papers found • Page 5 of 31

#801

SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters

Teng Xiao, Yige Yuan, Zhengyu Chen et al.

ICLR 2025posterarXiv:2502.00883
23
citations
#802

A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language

Ekdeep Singh Lubana, Kyogo Kawaguchi, Robert Dick et al.

ICLR 2025posterarXiv:2408.12578
23
citations
#803

Fantastic Copyrighted Beasts and How (Not) to Generate Them

Luxi He, Yangsibo Huang, Weijia Shi et al.

ICLR 2025posterarXiv:2406.14526
23
citations
#804

Some Fundamental Aspects about Lipschitz Continuity of Neural Networks

Grigory Khromov, Sidak Pal Singh

ICLR 2024posterarXiv:2302.10886
23
citations
#805

Instant Policy: In-Context Imitation Learning via Graph Diffusion

Vitalis Vosylius, Edward Johns

ICLR 2025posterarXiv:2411.12633
23
citations
#806

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation

Samuel Holt, Max Ruiz Luyten, Mihaela van der Schaar

ICLR 2024posterarXiv:2310.02003
23
citations
#807

Navigation-Guided Sparse Scene Representation for End-to-End Autonomous Driving

Peidong Li, Dixiao Cui

ICLR 2025oralarXiv:2409.18341
23
citations
#808

NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics

David Robinson, Marius Miron, Masato Hagiwara et al.

ICLR 2025posterarXiv:2411.07186
23
citations
#809

Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data

Florian Eddie Dorner, Vivian Nastl, Moritz Hardt

ICLR 2025poster
23
citations
#810

Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors

Weixuan Wang, JINGYUAN YANG, Wei Peng

ICLR 2025posterarXiv:2410.12299
23
citations
#811

The AdEMAMix Optimizer: Better, Faster, Older

Matteo Pagliardini, Pierre Ablin, David Grangier

ICLR 2025posterarXiv:2409.03137
23
citations
#812

Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping

Zijian Liu, Zhengyuan Zhou

ICLR 2025posterarXiv:2412.19529
23
citations
#813

JetFormer: An autoregressive generative model of raw images and text

Michael Tschannen, André Susano Pinto, Alexander Kolesnikov

ICLR 2025posterarXiv:2411.19722
23
citations
#814

OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning

Xiaoqiang Wang, Bang Liu

ICLR 2025posterarXiv:2410.18963
23
citations
#815

Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images

Sichen Zhu, Yuchen Zhu, Molei Tao et al.

ICLR 2025posterarXiv:2501.15598
23
citations
#816

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Hongxiang Li, Yaowei Li, Yuhang Yang et al.

ICLR 2025posterarXiv:2412.09349
23
citations
#817

Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient

George Wang, Jesse Hoogland, Stan van Wingerden et al.

ICLR 2025posterarXiv:2410.02984
23
citations
#818

Language Representations Can be What Recommenders Need: Findings and Potentials

Leheng Sheng, An Zhang, Yi Zhang et al.

ICLR 2025posterarXiv:2407.05441
23
citations
#819

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Jingyu Zhang, Ahmed Elgohary Ghoneim, Ahmed Magooda et al.

ICLR 2025posterarXiv:2410.08968
22
citations
#820

Understanding and Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention

Tianyun Yang, Ziniu Li, Juan Cao et al.

ICLR 2025poster
22
citations
#821

Residual Connections and Normalization Can Provably Prevent Oversmoothing in GNNs

Michael Scholkemper, Xinyi Wu, Ali Jadbabaie et al.

ICLR 2025posterarXiv:2406.02997
22
citations
#822

Optimizing $(L_0, L_1)$-Smooth Functions by Gradient Methods

Daniil Vankov, Anton Rodomanov, Angelia Nedich et al.

ICLR 2025posterarXiv:2410.10800
22
citations
#823

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks

Clementine Domine, Nicolas Anguita, Alexandra M Proca et al.

ICLR 2025poster
22
citations
#824

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

João Loula, Benjamin LeBrun, Li Du et al.

ICLR 2025posterarXiv:2504.13139
22
citations
#825

Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN

Pengxiang Li, Lu Yin, Shiwei Liu

ICLR 2025posterarXiv:2412.13795
22
citations
#826

LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures

Vimal Thilak, Chen Huang, Omid Saremi et al.

ICLR 2024spotlightarXiv:2312.04000
22
citations
#827

Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuning

Somnath Basu Roy Chowdhury, Krzysztof Choromanski, Arijit Sehanobish et al.

ICLR 2025posterarXiv:2406.16257
22
citations
#828

Do LLMs ``know'' internally when they follow instructions?

Juyeon Heo, Christina Heinze-Deml, Oussama Elachqar et al.

ICLR 2025posterarXiv:2410.14516
22
citations
#829

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

Zhaochong An, Guolei Sun, Yun Liu et al.

ICLR 2025posterarXiv:2410.22489
22
citations
#830

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Peiwen Sun, Sitong Cheng, Xiangtai Li et al.

ICLR 2025posterarXiv:2410.10676
22
citations
#831

Understanding Certified Training with Interval Bound Propagation

Yuhao Mao, Mark N Müller, Marc Fischer et al.

ICLR 2024posterarXiv:2306.10426
22
citations
#832

Audio Large Language Models Can Be Descriptive Speech Quality Evaluators

CHEN CHEN, Yuchen Hu, Siyin Wang et al.

ICLR 2025posterarXiv:2501.17202
22
citations
#833

Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs

Shane Bergsma, Nolan Dey, Gurpreet Gosal et al.

ICLR 2025posterarXiv:2502.15938
22
citations
#834

DenseMatcher: Learning 3D Semantic Correspondence for Category-Level Manipulation from a Single Demo

Junzhe Zhu, Yuanchen Ju, Junyi Zhang et al.

ICLR 2025posterarXiv:2412.05268
22
citations
#835

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Ben Eisner, Yi Yang, Todor Davchev et al.

ICLR 2024posterarXiv:2404.13478
22
citations
#836

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

Rylan Schaeffer, Dan Valentine, Luke Bailey et al.

ICLR 2025posterarXiv:2407.15211
22
citations
#837

Artificial Kuramoto Oscillatory Neurons

Takeru Miyato, Sindy Löwe, Andreas Geiger et al.

ICLR 2025oralarXiv:2410.13821
22
citations
#838

Towards Foundation Models for Mixed Integer Linear Programming

Sirui Li, Janardhan Kulkarni, Ishai Menache et al.

ICLR 2025posterarXiv:2410.08288
22
citations
#839

SONICS: Synthetic Or Not - Identifying Counterfeit Songs

Awsaf Rahman, Zaber Ibn Abdul Hakim, Najibul Haque Sarker et al.

ICLR 2025oralarXiv:2408.14080
22
citations
#840

LICO: Large Language Models for In-Context Molecular Optimization

Tung Nguyen, Aditya Grover

ICLR 2025posterarXiv:2406.18851
22
citations
#841

$\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models

Zhongwei Wan, Xinjian Wu, Yu Zhang et al.

ICLR 2025poster
22
citations
#842

Concept Bottleneck Large Language Models

Chung-En Sun, Tuomas Oikarinen, Berk Ustun et al.

ICLR 2025posterarXiv:2412.07992
22
citations
#843

DSPO: Direct Score Preference Optimization for Diffusion Model Alignment

Huaisheng Zhu, Teng Xiao, Vasant Honavar

ICLR 2025poster
22
citations
#844

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Yuzi Yan, Yibo Miao, Jialian Li et al.

ICLR 2025posterarXiv:2406.07327
22
citations
#845

Bayesian Neural Controlled Differential Equations for Treatment Effect Estimation

Konstantin Hess, Valentyn Melnychuk, Dennis Frauen et al.

ICLR 2024posterarXiv:2310.17463
22
citations
#846

Meaning Representations from Trajectories in Autoregressive Models

Tian Yu Liu, Matthew Trager, Alessandro Achille et al.

ICLR 2024posterarXiv:2310.18348
22
citations
#847

Towards General-Purpose Model-Free Reinforcement Learning

Scott Fujimoto, Pierluca D'Oro, Amy Zhang et al.

ICLR 2025posterarXiv:2501.16142
22
citations
#848

On the Provable Advantage of Unsupervised Pretraining

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024spotlightarXiv:2303.01566
22
citations
#849

Mixture Compressor for Mixture-of-Experts LLMs Gains More

Wei Huang, Yue Liao, Jianhui Liu et al.

ICLR 2025posterarXiv:2410.06270
22
citations
#850

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien et al.

ICLR 2025posterarXiv:2406.17746
22
citations
#851

Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

Logan Cross, Violet Xiang, Agam Bhatia et al.

ICLR 2025posterarXiv:2407.07086
22
citations
#852

SyllableLM: Learning Coarse Semantic Units for Speech Language Models

Alan Baade, Puyuan Peng, David Harwath

ICLR 2025posterarXiv:2410.04029
22
citations
#853

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Xiaosen Zheng, Tianyu Pang, Chao Du et al.

ICLR 2025posterarXiv:2410.07137
22
citations
#854

Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid

Mingxin Huang, Yuliang Liu, Dingkang Liang et al.

ICLR 2025posterarXiv:2408.02034
22
citations
#855

Towards Effective Evaluations and Comparisons for LLM Unlearning Methods

Qizhou Wang, Bo Han, Puning Yang et al.

ICLR 2025posterarXiv:2406.09179
21
citations
#856

DataGen: Unified Synthetic Dataset Generation via Large Language Models

Yue Huang, Siyuan Wu, Chujie Gao et al.

ICLR 2025posterarXiv:2406.18966
21
citations
#857

Harnessing Webpage UIs for Text-Rich Visual Understanding

Junpeng Liu, Tianyue Ou, Yifan Song et al.

ICLR 2025posterarXiv:2410.13824
21
citations
#858

Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion

Enrico Ventura, Beatrice Achilli, Gianluigi Silvestri et al.

ICLR 2025posterarXiv:2410.05898
21
citations
#859

Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders

Qichao Shentu, Beibu Li, Kai Zhao et al.

ICLR 2025posterarXiv:2405.15273
21
citations
#860

Flow: Modularized Agentic Workflow Automation

Boye Niu, Yiliao Song, Kai Lian et al.

ICLR 2025posterarXiv:2501.07834
21
citations
#861

Heavy-Tailed Diffusion Models

Kushagra Pandey, Jaideep Pathak, Yilun Xu et al.

ICLR 2025posterarXiv:2410.14171
21
citations
#862

Oscillatory State-Space Models

T. Konstantin Rusch, Daniela Rus

ICLR 2025posterarXiv:2410.03943
21
citations
#863

Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians

Ishan Amin, Sanjeev Raja, Aditi Krishnapriyan

ICLR 2025posterarXiv:2501.09009
21
citations
#864

GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering

Hongze CHEN, Zehong Lin, Jun Zhang

ICLR 2025posterarXiv:2410.02619
21
citations
#865

ElasticTok: Adaptive Tokenization for Image and Video

Wilson Yan, Volodymyr Mnih, Aleksandra Faust et al.

ICLR 2025posterarXiv:2410.08368
21
citations
#866

MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design

Xiang Fu, Tian Xie, Andrew Rosen et al.

ICLR 2024posterarXiv:2310.10732
21
citations
#867

Conditional Information Bottleneck Approach for Time Series Imputation

MinGyu Choi, Changhee Lee

ICLR 2024oral
21
citations
#868

Agent-Oriented Planning in Multi-Agent Systems

Ao LI, Yuexiang Xie, Songze Li et al.

ICLR 2025posterarXiv:2410.02189
21
citations
#869

Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks

Mario Lino, Tobias Pfaff, Nils Thuerey

ICLR 2025posterarXiv:2504.02843
21
citations
#870

OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Junjielong Xu, Qinan Zhang, Zhiqing Zhong et al.

ICLR 2025poster
21
citations
#871

Debiasing Algorithm through Model Adaptation

Tomasz Limisiewicz, David Mareček, Tomáš Musil

ICLR 2024posterarXiv:2310.18913
21
citations
#872

A Transfer Attack to Image Watermarks

Yuepeng Hu, Zhengyuan Jiang, Moyang Guo et al.

ICLR 2025posterarXiv:2403.15365
21
citations
#873

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

Zhaoyuan Yang, Zhengyang Yu, Zhiwei Xu et al.

ICLR 2024posterarXiv:2311.06792
21
citations
#874

Diverse Preference Learning for Capabilities and Alignment

Stewart Slocum, Asher Parker-Sartori, Dylan Hadfield-Menell

ICLR 2025posterarXiv:2511.08594
21
citations
#875

Is In-Context Learning Sufficient for Instruction Following in LLMs?

Hao Zhao, Maksym Andriushchenko, francesco croce et al.

ICLR 2025posterarXiv:2405.19874
21
citations
#876

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

Mohamed el amine Boudjoghra, Angela Dai, Jean Lahoud et al.

ICLR 2025posterarXiv:2406.02548
21
citations
#877

Pathologies of Predictive Diversity in Deep Ensembles

Geoff Pleiss, Taiga Abe, E. Kelly Buchanan et al.

ICLR 2024posterarXiv:2302.00704
21
citations
#878

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Ce Zhang, Zifu Wan, Zhehan Kan et al.

ICLR 2025posterarXiv:2502.06130
21
citations
#879

Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge

Aparna Elangovan, Lei Xu, Jongwoo Ko et al.

ICLR 2025posterarXiv:2410.03775
21
citations
#880

AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors

Ruoxuan Feng, Jiangyu Hu, Wenke Xia et al.

ICLR 2025posterarXiv:2502.12191
21
citations
#881

Lipschitz Singularities in Diffusion Models

Zhantao Yang, Ruili Feng, Han Zhang et al.

ICLR 2024posterarXiv:2306.11251
21
citations
#882

Variational Diffusion Posterior Sampling with Midpoint Guidance

Badr MOUFAD, Yazid Janati el idrissi, Lisa Bedin et al.

ICLR 2025posterarXiv:2410.09945
21
citations
#883

Image Clustering Conditioned on Text Criteria

Sehyun Kwon, Jaden Park, Minkyu Kim et al.

ICLR 2024posterarXiv:2310.18297
21
citations
#884

The Loss Landscape of Deep Linear Neural Networks: a Second-order Analysis

El Mehdi Achour, Francois Malgouyres, Sebastien Gerchinovitz

ICLR 2025posterarXiv:2107.13289
21
citations
#885

Monitoring Latent World States in Language Models with Propositional Probes

Jiahai Feng, Stuart Russell, Jacob Steinhardt

ICLR 2025posterarXiv:2406.19501
21
citations
#886

ConFIG: Towards Conflict-free Training of Physics Informed Neural Networks

Qiang Liu, Mengyu Chu, Nils Thuerey

ICLR 2025posterarXiv:2408.11104
21
citations
#887

When Semantic Segmentation Meets Frequency Aliasing

Linwei Chen, Lin Gu, Ying Fu

ICLR 2024posterarXiv:2403.09065
21
citations
#888

Halton Scheduler for Masked Generative Image Transformer

Victor Besnier, Mickael Chen, David Hurych et al.

ICLR 2025posterarXiv:2503.17076
21
citations
#889

Structure Language Models for Protein Conformation Generation

Jiarui Lu, Xiaoyin Chen, Stephen Lu et al.

ICLR 2025posterarXiv:2410.18403
21
citations
#890

Improving Semantic Understanding in Speech Language Models via Brain-tuning

Omer Moussa, Dietrich Klakow, Mariya Toneva

ICLR 2025posterarXiv:2410.09230
21
citations
#891

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Xingrun Xing, Boyan Gao, Zheng Liu et al.

ICLR 2025posterarXiv:2407.04752
21
citations
#892

AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models

Mintong Kang, Chejian Xu, Bo Li

ICLR 2025oralarXiv:2412.08608
21
citations
#893

Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models

Biao Yi, Tiansheng Huang, Sishuo Chen et al.

ICLR 2025posterarXiv:2506.16447
21
citations
#894

Selective Attention Improves Transformer

Yaniv Leviathan, Matan Kalman, Yossi Matias

ICLR 2025posterarXiv:2410.02703
20
citations
#895

Framer: Interactive Frame Interpolation

Wen Wang, Qiuyu Wang, Kecheng Zheng et al.

ICLR 2025posterarXiv:2410.18978
20
citations
#896

Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs

Barrett Tang, Zile Huang, Chengzhi Liu et al.

ICLR 2025poster
20
citations
#897

Hyper-Connections

Defa Zhu, Hongzhi Huang, Zihao Huang et al.

ICLR 2025posterarXiv:2409.19606
20
citations
#898

UniGEM: A Unified Approach to Generation and Property Prediction for Molecules

Shikun Feng, Yuyan Ni, Lu yan et al.

ICLR 2025posterarXiv:2410.10516
20
citations
#899

On the Variance of Neural Network Training with respect to Test Sets and Distributions

Keller Jordan

ICLR 2024posterarXiv:2304.01910
20
citations
#900

PORF: POSE RESIDUAL FIELD FOR ACCURATE NEURAL SURFACE RECONSTRUCTION

Jia-Wang Bian, Wenjing Bian, Victor Prisacariu et al.

ICLR 2024posterarXiv:2310.07449
20
citations
#901

Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning

Xiongye Xiao, Gengshuo Liu, Gaurav Gupta et al.

ICLR 2024posterarXiv:2404.09403
20
citations
#902

{$\tau$}-bench: A Benchmark for \underline{T}ool-\underline{A}gent-\underline{U}ser Interaction in Real-World Domains

Shunyu Yao, Noah Shinn, Pedram Razavi et al.

ICLR 2025poster
20
citations
#903

Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning

Minheng Ni, YuTao Fan, Lei Zhang et al.

ICLR 2025posterarXiv:2410.03321
20
citations
#904

Embarrassingly Simple Dataset Distillation

Yunzhen Feng, Shanmukha Ramakrishna Vedantam, Julia Kempe

ICLR 2024poster
20
citations
#905

STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning

Marius Memmel, Jacob Berg, Bingqing Chen et al.

ICLR 2025posterarXiv:2412.15182
20
citations
#906

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Zhaowei Zhang, Fengshuo Bai, Qizhi Chen et al.

ICLR 2025posterarXiv:2502.19148
20
citations
#907

How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension

Xinnan Dai, Haohao QU, Yifei Shen et al.

ICLR 2025posterarXiv:2410.05298
20
citations
#908

Domain Randomization via Entropy Maximization

Gabriele Tiboni, Pascal Klink, Jan Peters et al.

ICLR 2024posterarXiv:2311.01885
20
citations
#909

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Yutong Wang, Jiali Zeng, Xuebo Liu et al.

ICLR 2025posterarXiv:2410.08143
20
citations
#910

Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?

Letitia Parcalabescu, Anette Frank

ICLR 2025posterarXiv:2404.18624
20
citations
#911

GOAL: A Generalist Combinatorial Optimization Agent Learner

Darko Drakulić, Sofia Michel, Jean-Marc Andreoli

ICLR 2025posterarXiv:2406.15079
20
citations
#912

ConR: Contrastive Regularizer for Deep Imbalanced Regression

Mahsa Keramati, Lili Meng, R. Evans

ICLR 2024posterarXiv:2309.06651
20
citations
#913

Reflective Gaussian Splatting

Yuxuan Yao, Zixuan Zeng, Chun Gu et al.

ICLR 2025posterarXiv:2412.19282
20
citations
#914

Temporal Reasoning Transfer from Text to Video

Lei Li, Yuanxin Liu, Linli Yao et al.

ICLR 2025oralarXiv:2410.06166
20
citations
#915

Efficient Reinforcement Learning with Large Language Model Priors

Xue Yan, Yan Song, Xidong Feng et al.

ICLR 2025posterarXiv:2410.07927
20
citations
#916

CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation

Nikolai Kalischek, Michael Oechsle, Fabian Manhardt et al.

ICLR 2025posterarXiv:2501.17162
20
citations
#917

Exploring the Promise and Limits of Real-Time Recurrent Learning

Kazuki Irie, Anand Gopalakrishnan, Jürgen Schmidhuber

ICLR 2024posterarXiv:2305.19044
20
citations
#918

Is Your Multimodal Language Model Oversensitive to Safe Queries?

Xirui Li, Hengguang Zhou, Ruochen Wang et al.

ICLR 2025posterarXiv:2406.17806
20
citations
#919

ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis

Kensen Shi, Joey Hong, Yinlin Deng et al.

ICLR 2024posterarXiv:2307.13883
20
citations
#920

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

ICLR 2024posterarXiv:2310.02998
20
citations
#921

ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models

Jeonghoon Shim, Gyuhyeon Seo, Cheongsu Lim et al.

ICLR 2025posterarXiv:2503.00564
20
citations
#922

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

Michael Matthews, Michael Beukman, Chris Lu et al.

ICLR 2025posterarXiv:2410.23208
20
citations
#923

Modeling Complex System Dynamics with Flow Matching Across Time and Conditions

Martin Rohbeck, Edward De Brouwer, Charlotte Bunne et al.

ICLR 2025oral
20
citations
#924

Hierarchical World Models as Visual Whole-Body Humanoid Controllers

Nick Hansen, Jyothir S V, Vlad Sobal et al.

ICLR 2025posterarXiv:2405.18418
20
citations
#925

Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning

Youhan Lee, Hasun Yu, Jaemyung Lee et al.

ICLR 2024poster
20
citations
#926

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets

Darshil Doshi, Aritra Das, Tianyu He et al.

ICLR 2024posterarXiv:2310.13061
19
citations
#927

First-Person Fairness in Chatbots

Tyna Eloundou, Alex Beutel, David Robinson et al.

ICLR 2025posterarXiv:2410.19803
19
citations
#928

Learning Long Range Dependencies on Graphs via Random Walks

Dexiong Chen, Till Schulz, Karsten Borgwardt

ICLR 2025posterarXiv:2406.03386
19
citations
#929

SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Model

Yue Zhang, Zhiyang Xu, Ying Shen et al.

ICLR 2025posterarXiv:2410.03878
19
citations
#930

Automated Proof Generation for Rust Code via Self-Evolution

Tianyu Chen, Shuai Lu, Shan Lu et al.

ICLR 2025posterarXiv:2410.15756
19
citations
#931

Understanding Optimization in Deep Learning with Central Flows

Jeremy Cohen, Alex Damian, Ameet Talwalkar et al.

ICLR 2025posterarXiv:2410.24206
19
citations
#932

Cut Your Losses in Large-Vocabulary Language Models

Erik Wijmans, Brody Huval, Alexander Hertzberg et al.

ICLR 2025posterarXiv:2411.09009
19
citations
#933

Influence-Guided Diffusion for Dataset Distillation

Mingyang Chen, Jiawei Du, Bo Huang et al.

ICLR 2025poster
19
citations
#934

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

Yumeng Li, Margret Keuper, Dan Zhang et al.

ICLR 2024posterarXiv:2401.08815
19
citations
#935

Efficient Subgraph GNNs by Learning Effective Selection Policies

Beatrice Bevilacqua, Moshe Eliasof, Eli Meirom et al.

ICLR 2024posterarXiv:2310.20082
19
citations
#936

MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models

Mohammad Shahab Sepehri, Zalan Fabian, Maryam Soltanolkotabi et al.

ICLR 2025posterarXiv:2409.15477
19
citations
#937

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

Siddhant Arora, Zhiyun Lu, Chung-Cheng Chiu et al.

ICLR 2025posterarXiv:2503.01174
19
citations
#938

GameArena: Evaluating LLM Reasoning through Live Computer Games

Lanxiang Hu, Qiyu Li, Anze Xie et al.

ICLR 2025posterarXiv:2412.06394
19
citations
#939

Federated Q-Learning: Linear Regret Speedup with Low Communication Cost

Zhong Zheng, Fengyu Gao, Lingzhou Xue et al.

ICLR 2024posterarXiv:2312.15023
19
citations
#940

Deep Orthogonal Hypersphere Compression for Anomaly Detection

Yunhe Zhang, Yan Sun, Jinyu Cai et al.

ICLR 2024spotlightarXiv:2302.06430
19
citations
#941

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

Junmo Kang, Leonid Karlinsky, Hongyin Luo et al.

ICLR 2025posterarXiv:2406.12034
19
citations
#942

DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model

Yi Liu, Changran Xu, Yunhao Zhou et al.

ICLR 2025posterarXiv:2502.15832
19
citations
#943

Emergence of meta-stable clustering in mean-field transformer models

Giuseppe Bruno, Federico Pasqualotto, Andrea Agazzi

ICLR 2025posterarXiv:2410.23228
19
citations
#944

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Tianzhe Chu, Shengbang Tong, Tianjiao Ding et al.

ICLR 2024posterarXiv:2306.05272
19
citations
#945

COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training

Haocheng Xi, Han Cai, Ligeng Zhu et al.

ICLR 2025posterarXiv:2410.19313
19
citations
#946

Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antonis Antoniades, Yiyi Yu, Joe Canzano et al.

ICLR 2024oralarXiv:2311.00136
19
citations
#947

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Zheyang Xiong, Vasilis Papageorgiou, Kangwook Lee et al.

ICLR 2025posterarXiv:2406.19292
19
citations
#948

Benchmarking Agentic Workflow Generation

Shuofei Qiao, Runnan Fang, Zhisong Qiu et al.

ICLR 2025posterarXiv:2410.07869
19
citations
#949

Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

Kejun Tang, Jiayu Zhai, Xiaoliang Wan et al.

ICLR 2024posterarXiv:2305.18702
19
citations
#950

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Ziyu Liu, Yuhang Zang, Xiaoyi Dong et al.

ICLR 2025posterarXiv:2410.17637
19
citations
#951

Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning

Gang Liu, Michael Sun, Wojciech Matusik et al.

ICLR 2025posterarXiv:2410.04223
19
citations
#952

Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model

Jiarui Jin, Haoyu Wang, Hongyan Li et al.

ICLR 2025posterarXiv:2502.10707
19
citations
#953

A Rainbow in Deep Network Black Boxes

Florentin Guth, Brice Ménard, Gaspar Rochette et al.

ICLR 2025posterarXiv:2305.18512
19
citations
#954

Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Sijia Chen, Baochun Li, Di Niu

ICLR 2024posterarXiv:2402.11140
19
citations
#955

Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets

Zhen Liu, Tim Xiao, Weiyang Liu et al.

ICLR 2025posterarXiv:2412.07775
19
citations
#956

Distinguished In Uniform: Self-Attention Vs. Virtual Nodes

Eran Rosenbluth, Jan Tönshoff, Martin Ritzert et al.

ICLR 2024poster
19
citations
#957

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

ICLR 2024spotlightarXiv:2311.11321
19
citations
#958

Online Preference Alignment for Language Models via Count-based Exploration

Chenjia Bai, Yang Zhang, Shuang Qiu et al.

ICLR 2025posterarXiv:2501.12735
19
citations
#959

KGARevion: An AI Agent for Knowledge-Intensive Biomedical QA

Xiaorui Su, Yibo Wang, Shanghua Gao et al.

ICLR 2025posterarXiv:2410.04660
19
citations
#960

Effective Interplay between Sparsity and Quantization: From Theory to Practice

Simla Harma, Ayan Chakraborty, Elizaveta Kostenok et al.

ICLR 2025posterarXiv:2405.20935
19
citations
#961

E(n) Equivariant Topological Neural Networks

Claudio Battiloro, Ege Karaismailoglu, Mauricio Tec et al.

ICLR 2025posterarXiv:2405.15429
19
citations
#962

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Jingcun Wang, Yu-Guang Chen, Ing-Chao Lin et al.

ICLR 2025posterarXiv:2410.03765
19
citations
#963

Air Quality Prediction with Physics-Guided Dual Neural ODEs in Open Systems

jindong tian, Yuxuan Liang, Ronghui Xu et al.

ICLR 2025oralarXiv:2410.19892
18
citations
#964

Text2PDE: Latent Diffusion Models for Accessible Physics Simulation

Anthony Zhou, Zijie Li, Michael Schneier et al.

ICLR 2025oralarXiv:2410.01153
18
citations
#965

Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark

Mengxi Ya, Yiming Li, Tao Dai et al.

ICLR 2024poster
18
citations
#966

OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning

Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu

ICLR 2024posterarXiv:2402.04129
18
citations
#967

Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought

Jianhao Huang, Zixuan Wang, Jason Lee

ICLR 2025posterarXiv:2502.21212
18
citations
#968

Generalization through variance: how noise shapes inductive biases in diffusion models

John Vastola

ICLR 2025posterarXiv:2504.12532
18
citations
#969

SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints

Miruna Cretu, Charles Harris, Ilia Igashov et al.

ICLR 2025posterarXiv:2405.01155
18
citations
#970

Benchmarking Algorithms for Federated Domain Generalization

Ruqi Bai, Saurabh Bagchi, David Inouye

ICLR 2024spotlightarXiv:2307.04942
18
citations
#971

Perm: A Parametric Representation for Multi-Style 3D Hair Modeling

Chengan He, Xin Sun, Zhixin Shu et al.

ICLR 2025posterarXiv:2407.19451
18
citations
#972

Standard Gaussian Process is All You Need for High-Dimensional Bayesian Optimization

Zhitong Xu, Haitao Wang, Jeff Phillips et al.

ICLR 2025posterarXiv:2402.02746
18
citations
#973

Do as We Do, Not as You Think: the Conformity of Large Language Models

Zhiyuan Weng, Guikun Chen, Wenguan Wang

ICLR 2025posterarXiv:2501.13381
18
citations
#974

MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Bhavya, Stelian Coros, Andreas Krause et al.

ICLR 2025posterarXiv:2412.12098
18
citations
#975

CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

ZeMing Gong, Austin Wang, Xiaoliang Huo et al.

ICLR 2025posterarXiv:2405.17537
18
citations
#976

Learning to Discretize Denoising Diffusion ODEs

Vinh Tong, Trung-Dung Hoang, Anji Liu et al.

ICLR 2025posterarXiv:2405.15506
18
citations
#977

Flow Matching with Gaussian Process Priors for Probabilistic Time Series Forecasting

Marcel Kollovieh, Marten Lienen, David Lüdke et al.

ICLR 2025oralarXiv:2410.03024
18
citations
#978

Forking Paths in Neural Text Generation

Eric Bigelow, Ari Holtzman, Hidenori Tanaka et al.

ICLR 2025posterarXiv:2412.07961
18
citations
#979

BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics

Lukas Rauch, Raphael Schwinger, Moritz Wirth et al.

ICLR 2025posterarXiv:2403.10380
18
citations
#980

CrossMPT: Cross-attention Message-passing Transformer for Error Correcting Codes

Seong-Joon Park, Hee-Youl Kwak, Sang-Hyo Kim et al.

ICLR 2025posterarXiv:2405.01033
18
citations
#981

SemiReward: A General Reward Model for Semi-supervised Learning

Siyuan Li, Weiyang Jin, Zedong Wang et al.

ICLR 2024posterarXiv:2310.03013
18
citations
#982

KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models

Eunice Yiu, Maan Qraitem, Anisa Majhi et al.

ICLR 2025posterarXiv:2407.17773
18
citations
#983

Non-myopic Generation of Language Models for Reasoning and Planning

Chang Ma, Haiteng Zhao, Junlei Zhang et al.

ICLR 2025posterarXiv:2410.17195
18
citations
#984

Benchmarking Predictive Coding Networks -- Made Simple

Luca Pinchetti, Chang Qi, Oleh Lokshyn et al.

ICLR 2025posterarXiv:2407.01163
18
citations
#985

Block Verification Accelerates Speculative Decoding

Ziteng Sun, Uri Mendlovic, Yaniv Leviathan et al.

ICLR 2025posterarXiv:2403.10444
18
citations
#986

Scaling Optimal LR Across Token Horizons

Johan Bjorck, Alon Benhaim, Vishrav Chaudhary et al.

ICLR 2025posterarXiv:2409.19913
18
citations
#987

SELF-EVOLVED REWARD LEARNING FOR LLMS

Chenghua Huang, Zhizhen Fan, Lu Wang et al.

ICLR 2025posterarXiv:2411.00418
18
citations
#988

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Yukang Cao, Liang Pan, Kai Han et al.

ICLR 2025posterarXiv:2410.07164
18
citations
#989

Discretization-invariance? On the Discretization Mismatch Errors in Neural Operators

Wenhan Gao, Ruichen Xu, Yuefan Deng et al.

ICLR 2025poster
18
citations
#990

You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs

Yihong Luo, Xiaolong Chen, Xinghua Qu et al.

ICLR 2025posterarXiv:2403.12931
18
citations
#991

Unprocessing Seven Years of Algorithmic Fairness

André F. Cruz, Moritz Hardt

ICLR 2024posterarXiv:2306.07261
18
citations
#992

Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?

Seth Aycock, David Stap, Di Wu et al.

ICLR 2025posterarXiv:2409.19151
18
citations
#993

Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency

Jerry Yao-Chieh Hu, Wei-Po Wang, Ammar Gilani et al.

ICLR 2025posterarXiv:2411.16525
18
citations
#994

Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models

Hulingxiao He, Geng Li, Zijun Geng et al.

ICLR 2025posterarXiv:2501.15140
18
citations
#995

MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation

Min Zhang, Haoxuan Li, Fei Wu et al.

ICLR 2024posterarXiv:2404.19644
18
citations
#996

Cross-Embodiment Dexterous Grasping with Reinforcement Learning

Haoqi Yuan, Bohan Zhou, Yuhui Fu et al.

ICLR 2025posterarXiv:2410.02479
18
citations
#997

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping

Zijie Pan, Jiachen Lu, Xiatian Zhu et al.

ICLR 2024posterarXiv:2310.12474
18
citations
#998

Class Incremental Learning via Likelihood Ratio Based Task Prediction

Haowei Lin, Yijia Shao, Weinan Qian et al.

ICLR 2024posterarXiv:2309.15048
18
citations
#999

Boosting Neural Combinatorial Optimization for Large-Scale Vehicle Routing Problems

Fu Luo, Xi Lin, Yaoxin Wu et al.

ICLR 2025poster
18
citations
#1000

EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition

Issar Tzachor, Boaz Lerner, Matan Levy et al.

ICLR 2025posterarXiv:2405.18065
18
citations