Most Cited ICLR "deep reinforcement learning" Papers
6,124 papers found • Page 17 of 31
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
Bryan Bo Cao, Lawrence O'Gorman, Michael Coss et al.
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference
Anton Xue, Avishree Khare, Rajeev Alur et al.
Bad-PFL: Exploiting Backdoor Attacks against Personalized Federated Learning
Mingyuan Fan, Zhanyi Hu, Fuyi Wang et al.
Preserving Deep Representations in One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework
Ryan Lucas, Rahul Mazumder
Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Hung Quang Nguyen, Hieu Nguyen, Anh Ta et al.
CR-CTC: Consistency regularization on CTC for improved speech recognition
Zengwei Yao, Wei Kang, Xiaoyu Yang et al.
DEEM: Diffusion models serve as the eyes of large language models for image perception
Run Luo, Yunshui Li, Longze Chen et al.
TS-LIF: A Temporal Segment Spiking Neuron Network for Time Series Forecasting
Shibo Feng, Wanjin Feng, Xingyu Gao et al.
An Illustrated Guide to Automatic Sparse Differentiation
Adrian Hill, Guillaume Dalle, Alexis Montoison
Neural Functions for Learning Periodic Signal
Woojin Cho, Minju Jo, Kookjin Lee et al.
Measuring And Improving Persuasiveness Of Large Language Models
Somesh Singh, Yaman Singla, Harini S I et al.
Robust System Identification: Finite-sample Guarantees and Connection to Regularization
Hank Park, Grani A. Hanasusanto, Yingying Li
Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation
Jun Hyeong Kim, Seonghwan Kim, Seokhyun Moon et al.
A Statistical Framework for Ranking LLM-based Chatbots
Siavash Ameli, Siyuan Zhuang, Ion Stoica et al.
Intricacies of Feature Geometry in Large Language Models
Satvik Golechha, Lucius Bushnaq, Euan Ong et al.
Shape as Line Segments: Accurate and Flexible Implicit Surface Representation
Siyu Ren, Junhui Hou
NExUME: Adaptive Training and Inference for DNNs under Intermittent Power Environments
Cyan Subhra Mishra, Deeksha Chaudhary, Jack Sampson et al.
PEARL: Parallel Speculative Decoding with Adaptive Draft Length
Tianyu Liu, Yun Li, Qitan Lv et al.
Provably Robust Explainable Graph Neural Networks against Graph Perturbation Attacks
Jiate Li, Meng Pang, Yun Dong et al.
Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix
Yingyu Liang, Jiangxuan Long, Zhenmei Shi et al.
Capturing the Temporal Dependence of Training Data Influence
Jiachen (Tianhao) Wang, Dawn Song, James Y Zou et al.
CL-MFAP: A Contrastive Learning-Based Multimodal Foundation Model for Molecular Property Prediction and Antibiotic Screening
Gen Zhou, Sugitha Janarthanan, Yutong Lu et al.
Accelerating Goal-Conditioned Reinforcement Learning Algorithms and Research
Michał Bortkiewicz, Władysław Pałucki, Vivek Myers et al.
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Hongjin SU, Howard Yen, Mengzhou Xia et al.
Reconciling Model Multiplicity for Downstream Decision Making
Ally Du, Dung Daniel Ngo, Steven Wu
Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning
Shengyuan Hu, Yiwei Fu, Steven Wu et al.
Convergent Privacy Loss of Noisy-SGD without Convexity and Smoothness
Eli Chien, Pan Li
Explanations of GNN on Evolving Graphs via Axiomatic Layer edges
Yazheng Liu, Sihong Xie
ARB-LLM: Alternating Refined Binarizations for Large Language Models
Zhiteng Li, Xianglong Yan, Tianao Zhang et al.
Dynamic Modeling of Patients, Modalities and Tasks via Multi-modal Multi-task Mixture of Experts
Chenwei Wu, Zitao Shuai, Zhengxu Tang et al.
ASTrA: Adversarial Self-supervised Training with Adaptive-Attacks
Prakash Chandra Chhipa, Gautam Vashishtha, Jithamanyu Settur et al.
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods
Dawei Yang, Yuxuan Yue, Xing Hu et al.
OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
Xing Hu, Yuan Cheng, Dawei Yang et al.
GOttack: Universal Adversarial Attacks on Graph Neural Networks via Graph Orbits Learning
Zulfikar Alom, Tran Gia Bao Ngo, Murat Kantarcioglu et al.
Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Du, Lin Yang, Ruosong Wang
Regret-Optimal List Replicable Bandit Learning: Matching Upper and Lower Bounds
Michael Chen, A. Pavan, N. V. Vinodchandran et al.
Efficient Imitation under Misspecification
Nicolas Espinosa Dice, Sanjiban Choudhury, Wen Sun et al.
FreeCG: Free the Design Space of Clebsch-Gordan Transform for Machine Learning Force Fields
Shihao Shao, Haoran Geng, Zun Wang et al.
From Layers to States: A State Space Model Perspective to Deep Neural Network Layer Dynamics
Qinshuo Liu, Weiqin Zhao, Wei Huang et al.
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo, Yilun Du
Trajectory-LLM: A Language-based Data Generator for Trajectory Prediction in Autonomous Driving
Kairui Yang, Zihao Guo, Gengjie Lin et al.
Federated $Q$-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost
Zhong Zheng, Haochen Zhang, Lingzhou Xue
Robust-PIFu: Robust Pixel-aligned Implicit Function for 3D Human Digitalization from a Single Image
Kennard Chan, Fayao Liu, Guosheng Lin et al.
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing
Qi Le, Enmao Diao, Ziyan Wang et al.
Discrete Distribution Networks
Lei Yang
Adapt-$\infty$: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection
Adyasha Maharana, Jaehong Yoon, Tianlong Chen et al.
MLPs Learn In-Context on Regression and Classification Tasks
William Tong, Cengiz Pehlevan
Re-Evaluating the Impact of Unseen-Class Unlabeled Data on Semi-Supervised Learning Model
Rundong He, Yicong Dong, Lan-Zhe Guo et al.
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
Yunlong Yuan, Yuanfan Guo, Chunwei Wang et al.
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano et al.
Grid Cell-Inspired Fragmentation and Recall for Efficient Map Building
Jaedong Hwang, Zhang-Wei Hong, Eric Chen et al.
Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms
Parham Rezaei, Farzan Farnia, Cheuk Ting Li
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
Zuyan Liu, Yuhao Dong, Ziwei Liu et al.
DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes
Hengwei Bian, Lingdong Kong, Haozhe Xie et al.
Federated Granger Causality Learning For Interdependent Clients With State Space Representation
Ayush Mohanty, Nazal Mohamed, Paritosh Ramanan et al.
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba
Masakazu Yoshimura, Teruaki Hayashi, Yota Maeda
CBQ: Cross-Block Quantization for Large Language Models
Xin Ding, Xiaoyu Liu, Zhijun Tu et al.
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
Mingzhi Wang, Chengdong Ma, Qizhi Chen et al.
GenVP: Generating Visual Puzzles with Contrastive Hierarchical VAEs
Kalliopi Basioti, Pritish Sahu, Qingze Liu et al.
Learning-Augmented Frequent Directions
Anders Aamand, Justin Chen, Siddharth Gollapudi et al.
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie, Chun-Han Yao, Vikram Voleti et al.
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai, Haoran Sun, Huang Fang et al.
Structuring Benchmark into Knowledge Graphs to Assist Large Language Models in Retrieving and Designing Models
Hanmo Liu, Shimin Di, Jialiang Wang et al.
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
Yijie Guo, Bingjie Tang, Iretiayo Akinola et al.
Learning local equivariant representations for quantum operators
YinZhangHao Zhou, Zixi Gan, Shishir Pandey et al.
Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization
Zixuan Gong, Xiaolin Hu, Huayi Tang et al.
DynaPrompt: Dynamic Test-Time Prompt Tuning
Zehao Xiao, Shilin Yan, Jack Hong et al.
Forgetting Transformer: Softmax Attention with a Forget Gate
Zhixuan Lin, Evgenii Nikishin, Xu He et al.
Adaptive Retention & Correction: Test-Time Training for Continual Learning
Haoran Chen, Micah Goldblum, Zuxuan Wu et al.
ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability
Zhongxiang Sun, Xiaoxue Zang, Kai Zheng et al.
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives
Zeliang Zhang, Susan Liang, Daiki Shimada et al.
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Vindula Jayawardana, Baptiste Freydt, Ao Qu et al.
High-dimension Prototype is a Better Incremental Object Detection Learner
Yanjie Wang, Liqun Chen, Tianming Zhao et al.
A Simple yet Effective $\Delta\Delta G$ Predictor is An Unsupervised Antibody Optimizer and Explainer
Lirong Wu, Yunfan Liu, Haitao Lin et al.
Optimizing Neural Network Representations of Boolean Networks
Joshua Russell, Ignacio Gavier, Devdhar Patel et al.
On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization
Quanquan Gu, Jinghui Chen, Yuan Cao et al.
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Yoav Wald, Mark Goldstein, Yonathan Efroni et al.
Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Jun Luo, Chen Chen, Shandong Wu
LR0.FM: Low-Resolution Zero-Shot Classification Benchmark for Foundation Models
Priyank Pathak, Shyam Marjit, Shruti Vyas et al.
Robust Simulation-Based Inference under Missing Data via Neural Processes
Yogesh Verma, Ayush Bharti, Vikas Garg
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
Zhengyi Ho, Siyuan Liang, Sen Zhang et al.
Temporal Difference Learning: Why It Can Be Fast and How It Will Be Faster
Patrick Schnell, Luca Guastoni, Nils Thuerey
High-quality Text-to-3D Character Generation with SparseCubes and Sparse Transformers
Jiachen Qian, Hongye Yang, Shuang Wu et al.
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
Ziyan Jiang, Rui Meng, Xinyi Yang et al.
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang, Kuan Tian, Yonghang Guan et al.
EVA: Geometric Inverse Design for Fast Protein Motif-Scaffolding with Coupled Flow
Yufei Huang, Yunshu Liu, Lirong Wu et al.
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Zhifan Ye, Kejing Xia, Yonggan Fu et al.
CodePlan: Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning
Jiaxin Wen, Jian Guan, Hongning Wang et al.
Targeted Attack Improves Protection against Unauthorized Diffusion Customization
Boyang Zheng, Chumeng Liang, Xiaoyu Wu
Radar: Fast Long-Context Decoding for Any Transformer
Yongchang Hao, Mengyao Zhai, Hossein Hajimirsadeghi et al.
Learning the Optimal Stopping for Early Classification within Finite Horizons via Sequential Probability Ratio Test
Akinori F. Ebihara, Taiki Miyagawa, Kazuyuki Sakurai et al.
Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment
Dongping Chen, Ruoxi Chen, Shu Pu et al.
Active Learning for Continual Learning: Keeping the Past Alive in the Present
Jaehyun Park, Dongmin Park, Jae-Gil Lee
Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count
Hanseul Cho, Jaeyoung Cha, Srinadh Bhojanapalli et al.
Leave-One-Out Stable Conformal Prediction
Kiljae Lee, Yuan Zhang
DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale
Ziyang Zheng, Shan Huang, Jianyuan Zhong et al.
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Donghoon Kim, Minji Bae, Kyuhong Shim et al.
Tight Time Complexities in Parallel Stochastic Optimization with Arbitrary Computation Dynamics
Alexander Tyurin
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Yecheng Wu, Zhuoyang Zhang, Junyu Chen et al.
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Haotian Tang, Yecheng Wu, Shang Yang et al.
Classic but Everlasting: Traditional Gradient-Based Algorithms Converge Fast Even in Time-Varying Multi-Player Games
Yanzheng Chen, Jun Yu
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck, Fan Feng, Carl Qi et al.
$\text{I}^2\text{AM}$: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park, Hyeryung Jang
Aligned Datasets Improve Detection of Latent Diffusion-Generated Images
Anirudh Sundara Rajan, Utkarsh Ojha, Jedidiah Schloesser et al.
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
YiFan Zhang, Huanyu Zhang, Haochen Tian et al.
MuPT: A Generative Symbolic Music Pretrained Transformer
Xingwei Qu, Yuelin Bai, Yinghao Ma et al.
Shapley-Guided Utility Learning for Effective Graph Inference Data Valuation
Hongliang Chi, Qiong Wu, Zhengyi Zhou et al.
6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering
Zhongpai Gao, Benjamin Planche, Meng Zheng et al.
3D Vision-Language Gaussian Splatting
Qucheng Peng, Benjamin Planche, Zhongpai Gao et al.
Order-aware Interactive Segmentation
Bin Wang, Anwesa Choudhuri, Meng Zheng et al.
Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG
Bowen Jin, Jinsung Yoon, Jiawei Han et al.
Discovering Temporally Compositional Neural Manifolds with Switching Infinite GPFA
Changmin Yu, Maneesh Sahani, Máté Lengyel
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models
Seonghwan Park, Jaehyeon Jeong, Yongjun Kim et al.
Unlearning-based Neural Interpretations
Ching Lam Choi, Alexandre Duplessis, Serge Belongie
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach
Zechen Bai, Tianjun Xiao, Tong He et al.
A Truncated Newton Method for Optimal Transport
Mete Kemertas, Amir-massoud Farahmand, Allan Jepson
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke, Jack Morris, Kentaro Oguchi et al.
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training
Zhanpeng Zhou, Mingze Wang, Yuchen Mao et al.
Differentially private optimization for non-decomposable objective functions
Weiwei Kong, Andres Munoz Medina, Mónica Ribero
Multi-Robot Motion Planning with Diffusion Models
Yorai Shaoul, Itamar Mishani, Shivam Vats et al.
ImDy: Human Inverse Dynamics from Imitated Observations
Xinpeng Liu, Junxuan Liang, Zili Lin et al.
ReMatching Dynamic Reconstruction Flow
Sara Oblak, Despoina Paschalidou, Sanja Fidler et al.
Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks
Binghui Li, Zhixuan Pan, Kaifeng Lyu et al.
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen, Kenneth Marino, Rob Fergus
CFD: Learning Generalized Molecular Representation via Concept-Enhanced Feedback Disentanglement
Aming Wu, Cheng Deng
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito, Masanori Yamada, Atsutoshi Kumagai
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video
Xiaohao Xu, Tianyi Zhang, Shibo Zhao et al.
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
Syeda Nahida Akter, Shrimai Prabhumoye, John Kamalu et al.
Building Math Agents with Multi-Turn Iterative Preference Learning
Wei Xiong, Chengshuai Shi, Jiaming Shen et al.
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation
Satoki Ishikawa, Rio Yokota, Ryo Karakida
Linear Partial Gromov-Wasserstein Embedding
Yikun Bai, Abihith Kothapalli, Hengrong Du et al.
SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents
Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang et al.
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
Yuxuan Yao, Han Wu, Mingyang Liu et al.
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang, Wenfei Yang, Xiang Liu et al.
A primer on analytical learning dynamics of nonlinear neural networks
Rodrigo Carrasco-Davis, Erin Grant
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences
Niklas Schmidinger, Lisa Schneckenreiter, Philipp Seidl et al.
Enhancing Clustered Federated Learning: Integration of Strategies and Improved Methodologies
Yongxin Guo, Xiaoying Tang, Tao Lin
TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Yongxin Guo, Jingyu Liu, Mingda Li et al.
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting
Yong Liu, Guo Qin, Xiangdong Huang et al.
Neural Fluid Simulation on Geometric Surfaces
Haoxiang Wang, Tao Yu, Hui Qiao et al.
Recovering Manifold Structure Using Ollivier Ricci Curvature
Tristan L. Saidi, Abigail Hickok, Andrew J Blumberg
EgoSim: Egocentric Exploration in Virtual Worlds with Multi-modal Conditioning
Wei Yu, Songheng Yin, Steve Easterbrook et al.
PWM: Policy Learning with Multi-Task World Models
Ignat Georgiev, Varun Giridhar, Nick Hansen et al.
Breaking the $\log(1/\Delta_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
Tianyuan Jin, Qin Zhang, Dongruo Zhou
Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning
Théo Vincent, Fabian Wahren, Jan Peters et al.
Single-agent Poisoning Attacks Suffice to Ruin Multi-Agent Learning
Fan Yao, Yuwei Cheng, Ermin Wei et al.
Do Contemporary Causal Inference Models Capture Real-World Heterogeneity? Findings from a Large-Scale Benchmark
Haining Yu, Yizhou Sun
A Differentiable Rank-Based Objective for Better Feature Learning
Krunoslav Lehman Pavasovic, Giulio Biroli, Levent Sagun
Mixture of In-Context Prompters for Tabular PFNs
Derek Xu, Olcay Cirit, Reza Asadi et al.
Point-based Instance Completion with Scene Constraints
Wesley Khademi, Li Fuxin
Spectral Compressive Imaging via Unmixing-driven Subspace Diffusion Refinement
Haijin Zeng, Benteng Sun, Yongyong Chen et al.
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Yubo Wang, Jianting Tang, Liu et al.
Unifying Causal Representation Learning with the Invariance Principle
Dingling Yao, Dario Rancati, Riccardo Cadei et al.
Scalable Mechanistic Neural Networks
Jiale Chen, Dingling Yao, Adeel Pervez et al.
Deep Signature: Characterization of Large-Scale Molecular Dynamics
Tiexin Qin, Mengxu Zhu, Chunyang Li et al.
PICASO: Permutation-Invariant Context Composition with State Space Models
Tian Yu Liu, Alessandro Achille, Matthew Trager et al.
InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly
James Enouen, Yan Liu
LLMs' Potential Influences on Our Democracy: Challenges and Opportunities
Yujin Potter, David Rand, Yejin Choi et al.
Active Learning for Neural PDE Solvers
Daniel Musekamp, Marimuthu Kalimuthu, David Holzmüller et al.
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li, Cristina Mata, Jongwoo Park et al.
Solving hidden monotone variational inequalities with surrogate losses
Ryan D'Orazio, Danilo Vucetic, Zichu Liu et al.
Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos
Dayal Singh Kalra, Tianyu He, Maissam Barkeshli
Poisson-Dirac Neural Networks for Modeling Coupled Dynamical Systems across Domains
Razmik Khosrovian, Takaharu Yaguchi, Hiroaki Yoshimura et al.
AssembleFlow: Rigid Flow Matching with Inertial Frames for Molecular Assembly
Hongyu Guo, Yoshua Bengio, Shengchao Liu
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.
Dynamic Low-Rank Sparse Adaptation for Large Language Models
Weizhong Huang, Yuxin Zhang, Xiawu Zheng et al.
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim, Mineui Hong, Jeongho Park et al.
CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences
Ziran Qin, Yuchen Cao, Mingbao Lin et al.
Identifiability for Gaussian Processes with Holomorphic Kernels
Ameer Qaqish, Didong Li
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin et al.
U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models
Song Mei
Chemistry-Inspired Diffusion with Non-Differentiable Guidance
Yuchen Shen, Chenhao Zhang, Sijie Fu et al.
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq, Guangyuan Wang, Sami Islam et al.
Bootstrapping Language Models with DPO Implicit Rewards
Changyu Chen, Zichen Liu, Chao Du et al.
Hessian Free Efficient Single Loop Iterative Differentiation Methods for Bi-Level Optimization Problems
Peiran Yu, Junyi Li, Heng Huang
Is Your Video Language Model a Reliable Judge?
Ming Liu, Wensheng Zhang
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze, Zhenbo Song, Qiwei Wang et al.
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
Zhenyu Zhang, Zechun Liu, Yuandong Tian et al.
Param$\Delta$ for Direct Mixing: Post-Train Large Language Model At Zero Cost
Sheng Cao, Mingrui Wu, Karthik Prasad et al.
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
Andy (DiJia) Su, Sainbayar Sukhbaatar, Michael Rabbat et al.
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair
Mingjie Liu, Yun-Da Tsai, Wenfei Zhou et al.
Latent Bayesian Optimization via Autoregressive Normalizing Flows
Seunghun Lee, Jinyoung Park, Jaewon Chu et al.
Locality Sensitive Avatars From Video
Chunjin Song, Zhijie Wu, Shih-Yang Su et al.
Conditional Diffusion with Ordinal Regression: Longitudinal Data Generation for Neurodegenerative Disease Studies
Hyuna Cho, Ziquan Wei, Seungjoo Lee et al.
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Yuchen Duan, Weiyun Wang, Zhe Chen et al.
CREIMBO: Cross-Regional Ensemble Interactions in Multi-view Brain Observations
Noga Mudrik, Ryan Ly, Oliver Ruebel et al.
Understanding Model Calibration - A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)
Maja Pavlovic
$\sigma$-zero: Gradient-based Optimization of $\ell_0$-norm Adversarial Examples
Antonio Emanuele Cinà, Francesco Villani, Maura Pintor et al.
Is uniform expressivity too restrictive? Towards efficient expressivity of GNNs
Sammy Khalife, Josué Tonelli-Cueto
Tuning Frequency Bias of State Space Models
Annan Yu, Dongwei Lyu, Soon Hoe Lim et al.
Denoising Task Difficulty-based Curriculum for Training Diffusion Models
Jin-Young Kim, Hyojun Go, Soonwoo Kwon et al.
Interpreting the Second-Order Effects of Neurons in CLIP
Yossi Gandelsman, Alexei Efros, Jacob Steinhardt
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations
Nick Jiang, Anish Kachinthaya, Suzanne Petryk et al.
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Jiyeon Kim, Hyunji Lee, Hyowon Cho et al.
Model-agnostic meta-learners for estimating heterogeneous treatment effects over time
Dennis Frauen, Konstantin Hess, Stefan Feuerriegel
Interpretable Causal Representation Learning for Biological Data in the Pathway Space
Jesus de la Fuente Cedeño, Robert Lehmann, Carlos Ruiz-Arenas et al.
MCNC: Manifold-Constrained Reparameterization for Neural Compression
Chayne Thrash, Reed Andreas, Ali Abbasi et al.
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Amrith Setlur, Chirag Nagpal, Adam Fisch et al.
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters
Roberto Garcia, Jerry Liu, Daniel Sorvisto et al.
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation
Huimin Lu, Masaru Isonuma, Junichiro Mori et al.