ICLR Papers
6,124 papers found • Page 20 of 123
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.
DON’T STOP ME NOW: EMBEDDING BASED SCHEDULING FOR LLMS
Rana Shahout, Eran Malach, Chunwei Liu et al.
Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models
Shaotian Yan, Chen Shen, Wenxiao Wang et al.
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
GUOJUN XIONG, Ujwal Dinesha, Debajoy Mukherjee et al.
Do Stochastic, Feel Noiseless: Stable Stochastic Optimization via a Double Momentum Mechanism
Tehila Dahan, Kfir Y Levy
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue, Wenlin Yao, Haitao Mi et al.
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu, Claire Chen, Shangtong Zhang
Doubly robust identification of treatment effects from multiple environments
Piersilvio De Bartolomeis, Julia Kostin, Javier Abad et al.
Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?
Letitia Parcalabescu, Anette Frank
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities
Zheyuan Zhang, Fengyuan Hu, Jayjun Lee et al.
Do vision models perceive objects like toddlers ?
Arthur Aubret, Jochen Triesch
Do WGANs succeed because they minimize the Wasserstein Distance? Lessons from Discrete Generators
Ariel Elnekave, Yair Weiss
Do You Keep an Eye on What I Ask? Mitigating Multimodal Hallucination via Attention-Guided Ensemble Decoding
Yeongjae Cho, Keonwoo Kim, Taebaek Hwang et al.
DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle
Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.
DPLM-2: A Multimodal Diffusion Protein Language Model
Xinyou Wang, Zaixiang Zheng, Fei YE et al.
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang, Ivana Dusparic, Yucheng Shi et al.
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An et al.
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng, Yuxin Cui, Haomiao Tang et al.
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Jiwook Kim, Seonho Lee, Jaeyo Shin et al.
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation
Brian Nlong Zhao, Yuhang Xiao, Jiashu Xu et al.
Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination
Leonardo Barcellona, Andrii Zadaianchuk, Davide Allegro et al.
Dreamweaver: Learning Compositional World Models from Pixels
Junyeob Baek, Yi-Fu Wu, Gautam Singh et al.
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing
Xinyu Ma, Yifeng Xu, Yang Lin et al.
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.
DRL: Decomposed Representation Learning for Tabular Anomaly Detection
Hangting Ye, He Zhao, Wei Fan et al.
DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints
Xia Jiang, Yaoxin Wu, Chenhao Zhang et al.
DRoP: Distributionally Robust Data Pruning
Artem Vysogorets, Kartik Ahuja, Julia Kempe
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
Taishi Nakamura, Takuya Akiba, Kazuki Fujii et al.
DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
Liqiang Jing, Zhehui Huang, Xiaoyang Wang et al.
DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models
Ruibing Song, Chuan Liu, Chunshu Wu et al.
DSPO: Direct Score Preference Optimization for Diffusion Model Alignment
Huaisheng Zhu, Teng Xiao, Vasant Honavar
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
Andy (DiJia) Su, Sainbayar Sukhbaatar, Michael Rabbat et al.
DUALFormer: Dual Graph Transformer
Zhuo Jiaming, Yuwei Liu, Yintong Lu et al.
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Suraj Anand, Michael Lepori, Jack Merullo et al.
DUET: Decentralized Bilevel Optimization without Lower-Level Strong Convexity
Zhen Qin, Zhuqing Liu, Songtao Lu et al.
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Han-Hung Lee, Yiming Zhang, Angel Chang
Durable Quantization Conditioned Misalignment Attack on Large Language Models
Peiran Dong, Haowei Li, Song Guo
DyCAST: Learning Dynamic Causal Structure from Time Series
Yue Cheng, Bochen Lyu, Weiwei Xing et al.
DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation
HAN SUN, Rui Gong, Ismail Nejjar et al.
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
Chengke Zou, Xingang Guo, Rui Yang et al.
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
Xingzhuo Guo, Yu Zhang, Baixu Chen et al.
Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Jung-hun Kim, Min-hwan Oh
DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes
Hengwei Bian, Lingdong Kong, Haozhe Xie et al.
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment
Jinwoo Choi, Seung-Woo Seo
Dynamic Diffusion Transformer
Wangbo Zhao, Yizeng Han, Jiasheng Tang et al.
Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Dynamic Scenes
Isabella Liu, Hao Su, Xiaolong Wang
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
Wenxuan Huang, Zijie Zhai, Yunhang Shen et al.
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
Daouda Sow, Herbert Woisetschläger, Saikiran Bulusu et al.
Dynamic Low-Rank Sparse Adaptation for Large Language Models
Weizhong Huang, Yuxin Zhang, Xiawu Zheng et al.