ICLR Papers

6,124 papers found • Page 20 of 123

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.

ICLR 2025posterarXiv:2410.01930

DON’T STOP ME NOW: EMBEDDING BASED SCHEDULING FOR LLMS

Rana Shahout, Eran Malach, Chunwei Liu et al.

ICLR 2025poster
15
citations

Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models

Shaotian Yan, Chen Shen, Wenxiao Wang et al.

ICLR 2025posterarXiv:2503.11154

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

GUOJUN XIONG, Ujwal Dinesha, Debajoy Mukherjee et al.

ICLR 2025posterarXiv:2410.05527

Do Stochastic, Feel Noiseless: Stable Stochastic Optimization via a Double Momentum Mechanism

Tehila Dahan, Kfir Y Levy

ICLR 2025poster

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

Murong Yue, Wenlin Yao, Haitao Mi et al.

ICLR 2025posterarXiv:2410.03864

Doubly Optimal Policy Evaluation for Reinforcement Learning

Shuze Liu, Claire Chen, Shangtong Zhang

ICLR 2025posterarXiv:2410.02226
3
citations

Doubly robust identification of treatment effects from multiple environments

Piersilvio De Bartolomeis, Julia Kostin, Javier Abad et al.

ICLR 2025posterarXiv:2503.14459

Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?

Letitia Parcalabescu, Anette Frank

ICLR 2025posterarXiv:2404.18624
20
citations

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities

Zheyuan Zhang, Fengyuan Hu, Jayjun Lee et al.

ICLR 2025posterarXiv:2410.17385
40
citations

Do vision models perceive objects like toddlers ?

Arthur Aubret, Jochen Triesch

ICLR 2025poster

Do WGANs succeed because they minimize the Wasserstein Distance? Lessons from Discrete Generators

Ariel Elnekave, Yair Weiss

ICLR 2025poster

Do You Keep an Eye on What I Ask? Mitigating Multimodal Hallucination via Attention-Guided Ensemble Decoding

Yeongjae Cho, Keonwoo Kim, Taebaek Hwang et al.

ICLR 2025posterarXiv:2505.17529

DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle

Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.

ICLR 2025poster

DPLM-2: A Multimodal Diffusion Protein Language Model

Xinyou Wang, Zaixiang Zheng, Fei YE et al.

ICLR 2025posterarXiv:2410.13782

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Wenlong Wang, Ivana Dusparic, Yucheng Shi et al.

ICLR 2025posterarXiv:2410.08893
3
citations

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Weifeng Lin, Xinyu Wei, Ruichuan An et al.

ICLR 2025posterarXiv:2403.20271
86
citations

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Yuang Peng, Yuxin Cui, Haomiao Tang et al.

ICLR 2025posterarXiv:2406.16855
91
citations

DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Jiwook Kim, Seonho Lee, Jaeyo Shin et al.

ICLR 2025posterarXiv:2407.11394
5
citations

DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation

Brian Nlong Zhao, Yuhang Xiao, Jiashu Xu et al.

ICLR 2025posterarXiv:2312.14216
9
citations

Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination

Leonardo Barcellona, Andrii Zadaianchuk, Davide Allegro et al.

ICLR 2025posterarXiv:2412.14957
24
citations

Dreamweaver: Learning Compositional World Models from Pixels

Junyeob Baek, Yi-Fu Wu, Gautam Singh et al.

ICLR 2025posterarXiv:2501.14174
2
citations

DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing

Xinyu Ma, Yifeng Xu, Yang Lin et al.

ICLR 2025posterarXiv:2501.14371
8
citations

DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving

Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.

ICLR 2025oralarXiv:2503.07656
67
citations

DRL: Decomposed Representation Learning for Tabular Anomaly Detection

Hangting Ye, He Zhao, Wei Fan et al.

ICLR 2025poster
6
citations

DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints

Xia Jiang, Yaoxin Wu, Chenhao Zhang et al.

ICLR 2025poster
15
citations

DRoP: Distributionally Robust Data Pruning

Artem Vysogorets, Kartik Ahuja, Julia Kempe

ICLR 2025posterarXiv:2404.05579
4
citations

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Taishi Nakamura, Takuya Akiba, Kazuki Fujii et al.

ICLR 2025posterarXiv:2502.19261
8
citations

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

Liqiang Jing, Zhehui Huang, Xiaoyang Wang et al.

ICLR 2025posterarXiv:2409.07703
62
citations

DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models

Ruibing Song, Chuan Liu, Chunshu Wu et al.

ICLR 2025poster
2
citations

DSPO: Direct Score Preference Optimization for Diffusion Model Alignment

Huaisheng Zhu, Teng Xiao, Vasant Honavar

ICLR 2025poster
22
citations

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Andy (DiJia) Su, Sainbayar Sukhbaatar, Michael Rabbat et al.

ICLR 2025posterarXiv:2410.09918

DUALFormer: Dual Graph Transformer

Zhuo Jiaming, Yuwei Liu, Yintong Lu et al.

ICLR 2025poster
3
citations

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

Suraj Anand, Michael Lepori, Jack Merullo et al.

ICLR 2025posterarXiv:2406.00053
10
citations

DUET: Decentralized Bilevel Optimization without Lower-Level Strong Convexity

Zhen Qin, Zhuqing Liu, Songtao Lu et al.

ICLR 2025poster
1
citations

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.

ICLR 2025posterarXiv:2410.10819
165
citations

Duoduo CLIP: Efficient 3D Understanding with Multi-View Images

Han-Hung Lee, Yiming Zhang, Angel Chang

ICLR 2025posterarXiv:2406.11579

Durable Quantization Conditioned Misalignment Attack on Large Language Models

Peiran Dong, Haowei Li, Song Guo

ICLR 2025poster
1
citations

DyCAST: Learning Dynamic Causal Structure from Time Series

Yue Cheng, Bochen Lyu, Weiwei Xing et al.

ICLR 2025oral
1
citations

DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation

HAN SUN, Rui Gong, Ismail Nejjar et al.

ICLR 2025posterarXiv:2501.16410
1
citations

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Chengke Zou, Xingang Guo, Rui Yang et al.

ICLR 2025posterarXiv:2411.00836
82
citations

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Xingzhuo Guo, Yu Zhang, Baixu Chen et al.

ICLR 2025oralarXiv:2503.00951
6
citations

Dynamic Assortment Selection and Pricing with Censored Preference Feedback

Jung-hun Kim, Min-hwan Oh

ICLR 2025posterarXiv:2504.02324
1
citations

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes

Hengwei Bian, Lingdong Kong, Haozhe Xie et al.

ICLR 2025posterarXiv:2410.18084

Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment

Jinwoo Choi, Seung-Woo Seo

ICLR 2025oralarXiv:2504.14805
1
citations

Dynamic Diffusion Transformer

Wangbo Zhao, Yizeng Han, Jiasheng Tang et al.

ICLR 2025posterarXiv:2410.03456
34
citations

Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Dynamic Scenes

Isabella Liu, Hao Su, Xiaolong Wang

ICLR 2025oralarXiv:2404.12379
15
citations

Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification

Wenxuan Huang, Zijie Zhai, Yunhang Shen et al.

ICLR 2025posterarXiv:2412.00876
38
citations

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining

Daouda Sow, Herbert Woisetschläger, Saikiran Bulusu et al.

ICLR 2025posterarXiv:2502.06733
13
citations

Dynamic Low-Rank Sparse Adaptation for Large Language Models

Weizhong Huang, Yuxin Zhang, Xiawu Zheng et al.

ICLR 2025posterarXiv:2502.14816