Most Cited ICLR "rule-based reinforcement learning" Papers

6,124 papers found • Page 22 of 31

Filters:Most Cited ICLR rule-based reinforcement learning Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#4201

Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy

Ishank Juneja, Carlee Joe-Wong, Osman Yagan

ICLR 2025posterarXiv:2501.10290

#4202

gRNAde: Geometric Deep Learning for 3D RNA inverse design

Chaitanya Joshi, Arian Jamasb, Ramon Viñas et al.

ICLR 2025posterarXiv:2305.14749

#4203

From an LLM Swarm to a PDDL-empowered Hive: Planning Self-executed Instructions in a Multi-modal Jungle

Kaustubh Vyas, Damien Graux, Yijun Yang et al.

ICLR 2025posterarXiv:2412.12839

#4204

Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding

Akash Kumar, Zsolt Kira, Yogesh S Rawat

ICLR 2025oralarXiv:2501.17053

#4205

JPEG Inspired Deep Learning

Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.

ICLR 2025posterarXiv:2410.07081

#4206

Remove Symmetries to Control Model Expressivity and Improve Optimization

Liu Ziyin, Yizhou Xu, Isaac Chuang

ICLR 2025posterarXiv:2408.15495

#4207

TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice

Shen Yan, Xingyan Bin, Sijun Zhang et al.

ICLR 2025poster

#4208

Repurposing in AI: A Distinct Approach or an Extension of Creative Problem Solving?

Aissatou Diallo, Antonis Bikakis, Luke Dickens et al.

ICLR 2025poster

#4209

InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation

Zhixin Lai, Keqiang Sun, Fu-Yun Wang et al.

ICLR 2025poster

#4210

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li

ICLR 2025posterarXiv:2410.08198

#4211

Decoupled Subgraph Federated Learning

Javad Aliakbari, Johan Östman, Alexandre Graell i Amat

ICLR 2025posterarXiv:2402.19163

#4212

Diffusion Bridge Implicit Models

Kaiwen Zheng, Guande He, Jianfei Chen et al.

ICLR 2025posterarXiv:2405.15885

#4213

SGD Finds then Tunes Features in Two-Layer Neural Networks with near-Optimal Sample Complexity: A Case Study in the XOR problem

Margalit Glasgow

ICLR 2024spotlightarXiv:2309.15111

#4214

Beyond Worst-Case Dimensionality Reduction for Sparse Vectors

Sandeep Silwal, David Woodruff, Qiuyi (Richard) Zhang

ICLR 2025posterarXiv:2502.19865

#4215

Elucidating the Preconditioning in Consistency Distillation

Kaiwen Zheng, Guande He, Jianfei Chen et al.

ICLR 2025posterarXiv:2502.02922

#4216

Homomorphism Counts as Structural Encodings for Graph Learning

Linus Bao, Emily Jin, Michael Bronstein et al.

ICLR 2025posterarXiv:2410.18676

#4217

Chain-of-Thought Provably Enables Learning the (Otherwise) Unlearnable

Chenxiao Yang, Zhiyuan Li, David Wipf

ICLR 2025poster

#4218

Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization

Zhe Li, Bicheng Ying, Zidong Liu et al.

ICLR 2025posterarXiv:2405.15861

#4219

Improving Data Efficiency via Curating LLM-Driven Rating Systems

Jinlong Pang, Jiaheng Wei, Ankit Parag Shah et al.

ICLR 2025posterarXiv:2410.10877

#4220

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View

Kaiyue Wen, Zhiyuan Li, Jason Wang et al.

ICLR 2025poster

#4221

nGPT: Normalized Transformer with Representation Learning on the Hypersphere

Ilya Loshchilov, Cheng-Ping Hsieh, Simeng Sun et al.

ICLR 2025posterarXiv:2410.01131

#4222

A Coefficient Makes SVRG Effective

Yida Yin, Zhiqiu Xu, Zhiyuan Li et al.

ICLR 2025posterarXiv:2311.05589

#4223

PhysPDE: Rethinking PDE Discovery and a Physical HYpothesis Selection Benchmark

Mingquan Feng, Yixin Huang, Yizhou Liu et al.

ICLR 2025poster

#4224

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

Toshinori Kitamura, Tadashi Kozuno, Wataru Kumagai et al.

ICLR 2025posterarXiv:2408.16286

#4225

Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?

Maxime Méloux, Silviu Maniu, François Portet et al.

ICLR 2025posterarXiv:2502.20914

#4226

Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression

Juno Kim, Dimitri Meunier, Arthur Gretton et al.

ICLR 2025posterarXiv:2501.04898

#4227

ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference

Krzysztof Kacprzyk, Samuel Holt, Jeroen Berrevoets et al.

ICLR 2024spotlightarXiv:2403.10766

#4228

CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs

Dung Nguyen, Thang Phan, Nam Le Hai et al.

ICLR 2025posterarXiv:2410.01999

#4229

Compute-Optimal LLMs Provably Generalize Better with Scale

Marc Finzi, Sanyam Kapoor, Diego Granziol et al.

ICLR 2025posterarXiv:2504.15208

#4230

On the Benefits of Attribute-Driven Graph Domain Adaptation

Ruiyi Fang, Bingheng Li, zhao kang et al.

ICLR 2025posterarXiv:2502.06808

#4231

UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP

Wenzheng Pan, Hao Xiong, Jiale Ma et al.

ICLR 2025poster

#4232

Robust LLM safeguarding via refusal feature adversarial training

Lei Yu, Virginie Do, Karen Hambardzumyan et al.

ICLR 2025posterarXiv:2409.20089

#4233

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Hojoon Lee, Dongyoon Hwang, Donghu Kim et al.

ICLR 2025posterarXiv:2410.09754

#4234

TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation

Juntong Shi, Minkai Xu, Harper Hua et al.

ICLR 2025posterarXiv:2410.20626

#4235

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Peng Xu, Wei Ping, Xianchao Wu et al.

ICLR 2025posterarXiv:2407.14482

#4236

Boltzmann priors for Implicit Transfer Operators

Juan Viguera Diez, Mathias Schreiner, Ola Engkvist et al.

ICLR 2025posterarXiv:2410.10605

#4237

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang, Jianting Tang, Liu et al.

ICLR 2025posterarXiv:2502.16593

#4238

Adversarial Mixup Unlearning

Zhuoyi Peng, Yixuan Tang, Yi Yang

ICLR 2025posterarXiv:2502.10288

#4239

Biologically Constrained Barrel Cortex Model Integrates Whisker Inputs and Replicates Key Brain Network Dynamics

Tianfang Zhu, Dongli Hu, Jiandong Zhou et al.

ICLR 2025oral

#4240

Optimality of Matrix Mechanism on $\ell_p^p$-metric

Zongrui Zou, Jingcheng Liu, Jalaj Upadhyay

ICLR 2025posterarXiv:2406.02140

#4241

Login

ICLR 2024posterarXiv:1006.2411

#4242

Towards Understanding the Universality of Transformers for Next-Token Prediction

Michael Sander, Gabriel Peyré

ICLR 2025posterarXiv:2410.03011

#4243

ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering

Ilya Shenbin, Sergey Nikolenko

ICLR 2024posterarXiv:2406.00198

#4244

Adapting to Distribution Shift by Visual Domain Prompt Generation

Zhixiang Chi, Li Gu, Tao Zhong et al.

ICLR 2024posterarXiv:2405.02797

#4245

Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning

Menglong Zhang, Fuyuan Qian, Quanying Liu

ICLR 2025oralarXiv:2506.19785

#4246

Fine-Tuned Language Models Generate Stable Inorganic Materials as Text

Nate Gruver, Anuroop Sriram, Andrea Madotto et al.

ICLR 2024posterarXiv:2402.04379

#4247

Online Clustering with Nearly Optimal Consistency

T-H. Hubert Chan, Shaofeng Jiang, Tianyi Wu et al.

ICLR 2025poster

#4248

The Geometry of Categorical and Hierarchical Concepts in Large Language Models

Kiho Park, Yo Joong Choe, Yibo Jiang et al.

ICLR 2025posterarXiv:2406.01506

#4249

On the Stability of Iterative Retraining of Generative Models on their own Data

Quentin Bertrand, Joey Bose, Alexandre Duplessis et al.

ICLR 2024spotlightarXiv:2310.00429

#4250

One-shot Empirical Privacy Estimation for Federated Learning

Galen Andrew, Peter Kairouz, Sewoong Oh et al.

ICLR 2024posterarXiv:2302.03098

#4251

DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines

Omar Khattab, Arnav Singhvi, Paridhi Maheshwari et al.

ICLR 2024spotlight

#4252

Efficient Multi-agent Reinforcement Learning by Planning

Qihan Liu, Jianing Ye, Xiaoteng Ma et al.

ICLR 2024posterarXiv:2405.11778

#4253

Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy

Wang, Zongqing Lu

ICLR 2025poster

#4254

RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches

Jiayuan Gu, Sean Kirmani, Paul Wohlhart et al.

ICLR 2024spotlightarXiv:2311.01977

#4255

Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns

Hongbin Huang, Minghua Chen, Xiao Qiao

ICLR 2024oral

#4256

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

Wanpeng Zhang, Zilong Xie, Yicheng Feng et al.

ICLR 2025posterarXiv:2410.02155

#4257

RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment

Kevin Yang, Dan Klein, Asli Celikyilmaz et al.

ICLR 2024poster

#4258

Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs

Yuxin Zhang, Lirui Zhao, Mingbao Lin et al.

ICLR 2024posterarXiv:2310.08915

#4259

Learning to Act from Actionless Videos through Dense Correspondences

Po-Chen Ko, Jiayuan Mao, Yilun Du et al.

ICLR 2024spotlightarXiv:2310.08576

#4260

ZipIt! Merging Models from Different Tasks without Training

George Stoica, Daniel Bolya, Jakob Bjorner et al.

ICLR 2024posterarXiv:2305.03053

#4261

Information Bottleneck Analysis of Deep Neural Networks via Lossy Compression

Ivan Butakov, Aleksandr Tolmachev, Sofia Malanchuk et al.

ICLR 2024posterarXiv:2305.08013

#4262

In-context Autoencoder for Context Compression in a Large Language Model

Tao Ge, Hu Jing, Lei Wang et al.

ICLR 2024posterarXiv:2307.06945

#4263

Lost in Prediction: Why Social Media Narratives Don't Help Macroeconomic Forecasting?

Almog Gueta, Roi Reichart, Amir Feder et al.

ICLR 2025poster

#4264

DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning

Jing Xiong, Zixuan Li, Chuanyang Zheng et al.

ICLR 2024posterarXiv:2310.02954

#4265

Rayleigh Quotient Graph Neural Networks for Graph-level Anomaly Detection

Xiangyu Dong, Xingyi Zhang, Sibo WANG

ICLR 2024posterarXiv:2310.02861

#4266

GIM: Learning Generalizable Image Matcher From Internet Videos

Xuelun Shen, zhipeng cai, Wei Yin et al.

ICLR 2024spotlightarXiv:2402.11095

#4267

DiffEnc: Variational Diffusion with a Learned Encoder

Beatrix M. G. Nielsen, Anders Christensen, Andrea Dittadi et al.

ICLR 2024posterarXiv:2310.19789

#4268

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

Youliang Yuan, Wenxiang Jiao, Wenxuan Wang et al.

ICLR 2024posterarXiv:2308.06463

#4269

LEGO-Prover: Neural Theorem Proving with Growing Libraries

Haiming Wang, Huajian Xin, Chuanyang Zheng et al.

ICLR 2024posterarXiv:2310.00656

#4270

Procedural Synthesis of Synthesizable Molecules

Michael Sun, Alston Lo, Minghao Guo et al.

ICLR 2025posterarXiv:2409.05873

#4271

Conditional Testing based on Localized Conformal $p$-values

Xiaoyang Wu, Lin Lu, Zhaojun Wang et al.

ICLR 2025posterarXiv:2409.16829

#4272

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

Yang Jin, Kun Xu, Kun Xu et al.

ICLR 2024posterarXiv:2309.04669

#4273

Function Vectors in Large Language Models

Eric Todd, Millicent Li, Arnab Sen Sharma et al.

ICLR 2024posterarXiv:2310.15213

#4274

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Qihang Zhang, Yinghao Xu, Chaoyang Wang et al.

ICLR 2025posterarXiv:2405.18424

#4275

The False Promise of Imitating Proprietary Language Models

Arnav Gudibande, Eric Wallace, Charlie Snell et al.

ICLR 2024spotlight

#4276

Retrieval-Enhanced Contrastive Vision-Text Models

Ahmet Iscen, Mathilde Caron, Alireza Fathi et al.

ICLR 2024posterarXiv:2306.07196

#4277

Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

Frederik Pahde, Maximilian Dreyer, Moritz Weckbecker et al.

ICLR 2025posterarXiv:2202.03482

#4278

The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Pratyusha Sharma, Jordan Ash, Dipendra Kumar Misra

ICLR 2024posterarXiv:2312.13558

#4279

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Mengzhou Xia, Tianyu Gao, Zhiyuan Zeng et al.

ICLR 2024posterarXiv:2310.06694

#4280

Time-to-Event Pretraining for 3D Medical Imaging

Zepeng Frazier Huo, Jason Fries, Alejandro Lozano et al.

ICLR 2025oralarXiv:2411.09361

#4281

Denoising Autoregressive Transformers for Scalable Text-to-Image Generation

Jiatao Gu, Yuyang Wang, Yizhe Zhang et al.

ICLR 2025posterarXiv:2410.08159

#4282

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement

Bryan Bo Cao, Lawrence OGorman, Michael Coss et al.

ICLR 2025posterarXiv:2411.01099

#4283

Bad-PFL: Exploiting Backdoor Attacks against Personalized Federated Learning

Mingyuan Fan, Zhanyi Hu, Fuyi Wang et al.

ICLR 2025poster

#4284

On the Role of General Function Approximation in Offline Reinforcement Learning

Chenjie Mao, Qiaosheng Zhang, Zhen Wang et al.

ICLR 2024spotlight

#4285

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction

Yilan Zhang, Yingxue XU, Jianqi Chen et al.

ICLR 2024spotlightarXiv:2401.01646

#4286

Preserving Deep Representations in One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework

Ryan Lucas, Rahul Mazumder

ICLR 2025posterarXiv:2411.18376

#4287

Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

Hung Quang Nguyen, Hieu Nguyen, Anh Ta et al.

ICLR 2025posterarXiv:2407.10825

#4288

CR-CTC: Consistency regularization on CTC for improved speech recognition

Zengwei Yao, Wei Kang, Xiaoyu Yang et al.

ICLR 2025oralarXiv:2410.05101

#4289

Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning

Chengxing Jia, Chen-Xiao Gao, Hao Yin et al.

ICLR 2024poster

#4290

An Illustrated Guide to Automatic Sparse Differentiation

Adrian Hill, Guillaume Dalle, Alexis Montoison

ICLR 2025poster

#4291

Language Model Self-improvement by Reinforcement Learning Contemplation

Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li et al.

ICLR 2024posterarXiv:2305.14483

#4292

Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost

Yuan Gao, WEIZHONG ZHANG, Wenhan Luo et al.

ICLR 2024posterarXiv:2405.05695

#4293

Robust System Identification: Finite-sample Guarantees and Connection to Regularization

Hank Park, Grani A. Hanasusanto, Yingying Li

ICLR 2025poster

#4294

Imitation Learning from Observation with Automatic Discount Scheduling

Yuyang Liu, Weijun Dong, Yingdong Hu et al.

ICLR 2024posterarXiv:2310.07433

#4295

Offline RL with Observation Histories: Analyzing and Improving Sample Complexity

Joey Hong, Anca Dragan, Sergey Levine

ICLR 2024posterarXiv:2310.20663

#4296

Predictive, scalable and interpretable knowledge tracing on structured domains

Hanqi Zhou, Robert Bamler, Charley Wu et al.

ICLR 2024spotlightarXiv:2403.13179

#4297

Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation

Jun Hyeong Kim, Seonghwan Kim, Seokhyun Moon et al.

ICLR 2025posterarXiv:2410.01500

#4298

A Statistical Framework for Ranking LLM-based Chatbots

Siavash Ameli, Siyuan Zhuang, Ion Stoica et al.

ICLR 2025posterarXiv:2412.18407

#4299

The LLM Surgeon

Tycho van der Ouderaa, Markus Nagel, Mart van Baalen et al.

ICLR 2024posterarXiv:2312.17244

#4300

Can Transformers Capture Spatial Relations between Objects?

Chuan Wen, Dinesh Jayaraman, Yang Gao

ICLR 2024posterarXiv:2403.00729

#4301

InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior

Chenguo Lin, Yadong MU

ICLR 2024spotlightarXiv:2402.04717

#4302

NExUME: Adaptive Training and Inference for DNNs under Intermittent Power Environments

Cyan Subhra Mishra, Deeksha Chaudhary, Jack Sampson et al.

ICLR 2025poster

#4303

PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Tianyu Liu, Yun Li, Qitan Lv et al.

ICLR 2025posterarXiv:2408.11850

#4304

A General Framework for User-Guided Bayesian Optimization

Carl Hvarfner, Frank Hutter, Luigi Nardi

ICLR 2024spotlightarXiv:2311.14645

#4305

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Yue Wu, Xuan Tang, Tom Mitchell et al.

ICLR 2024posterarXiv:2310.01557

#4306

RingAttention with Blockwise Transformers for Near-Infinite Context

Hao Liu, Matei Zaharia, Pieter Abbeel

ICLR 2024poster

#4307

Capturing the Temporal Dependence of Training Data Influence

Jiachen (Tianhao) Wang, Dawn Song, James Y Zou et al.

ICLR 2025oralarXiv:2412.09538

#4308

Active Test-Time Adaptation: Theoretical Analyses and An Algorithm

Shurui Gui, Xiner Li, Shuiwang Ji

ICLR 2024posterarXiv:2404.05094

#4309

Unveiling Options with Neural Network Decomposition

Mahdi Alikhasi, Levi Lelis

ICLR 2024oral

#4310

Reconciling Model Multiplicity for Downstream Decision Making

Ally Du, Dung Daniel Ngo, Steven Wu

ICLR 2025posterarXiv:2405.19667

#4311

Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning

Shengyuan Hu, Yiwei Fu, Steven Wu et al.

ICLR 2025posterarXiv:2406.13356

#4312

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Tri Dao

ICLR 2024posterarXiv:2307.08691

#4313

Are Models Biased on Text without Gender-related Language?

Catarina Belém, Preethi Seshadri, Yasaman Razeghi et al.

ICLR 2024posterarXiv:2405.00588

#4314

On Stationary Point Convergence of PPO-Clip

Ruinan Jin, Shuai Li, Baoxiang Wang

ICLR 2024poster

#4315

Convergent Privacy Loss of Noisy-SGD without Convexity and Smoothness

Eli Chien, Pan Li

ICLR 2025posterarXiv:2410.01068

#4316

The Human-AI Substitution game: active learning from a strategic labeler

Tom Yan, Chicheng Zhang

ICLR 2024poster

#4317

ARB-LLM: Alternating Refined Binarizations for Large Language Models

Zhiteng Li, Xianglong Yan, Tianao Zhang et al.

ICLR 2025posterarXiv:2410.03129

#4318

Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks

David Robin, Kevin Scaman, marc lelarge

ICLR 2024posterarXiv:2501.05930

#4319

OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting

Xing Hu, Yuan Cheng, Dawei Yang et al.

ICLR 2025posterarXiv:2501.13987

#4320

GOttack: Universal Adversarial Attacks on Graph Neural Networks via Graph Orbits Learning

Zulfikar Alom, Tran Gia Bao Ngo, Murat Kantarcioglu et al.

ICLR 2025poster

#4321

Masks, Signs, And Learning Rate Rewinding

Advait Gadhikar, Rebekka Burkholz

ICLR 2024spotlightarXiv:2402.19262

#4322

Regret-Optimal List Replicable Bandit Learning: Matching Upper and Lower Bounds

Michael Chen, A. Pavan, N. V. Vinodchandran et al.

ICLR 2025poster

#4323

Efficient Imitation under Misspecification

Nicolas Espinosa Dice, Sanjiban Choudhury, Wen Sun et al.

ICLR 2025posterarXiv:2503.13162

#4324

RAIN: Your Language Models Can Align Themselves without Finetuning

Yuhui Li, Fangyun Wei, Jinjing Zhao et al.

ICLR 2024posterarXiv:2309.07124

#4325

Learning From Simplicial Data Based on Random Walks and 1D Convolutions

Florian Frantzen, Michael Schaub

ICLR 2024posterarXiv:2404.03434

#4326

Multimodal Molecular Pretraining via Modality Blending

Qiying Yu, Yudi Zhang, yuyan ni et al.

ICLR 2024posterarXiv:2307.06235

#4327

Sample-Efficient Multi-Agent RL: An Optimization Perspective

Nuoya Xiong, Zhihan Liu, Zhaoran Wang et al.

ICLR 2024posterarXiv:2310.06243

#4328

From Latent Graph to Latent Topology Inference: Differentiable Cell Complex Module

Claudio Battiloro, Indro Spinelli, Lev Telyatinkov et al.

ICLR 2024posterarXiv:2305.16174

#4329

Project and Probe: Sample-Efficient Adaptation by Interpolating Orthogonal Features

Annie Chen, Yoonho Lee, Amrith Setlur et al.

ICLR 2024spotlight

#4330

Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality

Xuxi Chen, Yu Yang, Zhangyang Wang et al.

ICLR 2024posterarXiv:2310.06982

#4331

Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

Qi Le, Enmao Diao, Ziyan Wang et al.

ICLR 2025posterarXiv:2502.15618

#4332

Discrete Distribution Networks

Lei Yang

ICLR 2025posterarXiv:2401.00036

#4333

MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning

Zohar Rimon, Tom Jurgenson, Orr Krupnik et al.

ICLR 2024posterarXiv:2403.09859

#4334

Beyond IID weights: sparse and low-rank deep Neural Networks are also Gaussian Processes

Thiziri Nait Saada, Alireza Naderi, Jared Tanner

ICLR 2024posterarXiv:2310.16597

#4335

Re-Evaluating the Impact of Unseen-Class Unlabeled Data on Semi-Supervised Learning Model

Rundong He, Yicong Dong, Lan-Zhe Guo et al.

ICLR 2025posterarXiv:2503.00884

#4336

RETSim: Resilient and Efficient Text Similarity

Marina Zhang, Owen Vallis, Aysegul Bumin et al.

ICLR 2024posterarXiv:2311.17264

#4337

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano et al.

ICLR 2025posterarXiv:2410.13837

#4338

Grid Cell-Inspired Fragmentation and Recall for Efficient Map Building

Jaedong Hwang, Zhang-Wei Hong, Eric Chen et al.

ICLR 2025posterarXiv:2307.05793

#4339

Neural Spectral Methods: Self-supervised learning in the spectral domain

Yiheng Du, Nithin Chalapathi, Aditi Krishnapriyan

ICLR 2024oralarXiv:2312.05225

#4340

Learning the greatest common divisor: explaining transformer predictions

François Charton

ICLR 2024spotlightarXiv:2308.15594

#4341

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes

Hengwei Bian, Lingdong Kong, Haozhe Xie et al.

ICLR 2025posterarXiv:2410.18084

#4342

Separating common from salient patterns with Contrastive Representation Learning

Robin Louiset, Edouard Duchesnay, Grigis Antoine et al.

ICLR 2024posterarXiv:2402.11928

#4343

Fusion Is Not Enough: Single Modal Attacks on Fusion Models for 3D Object Detection

Zhiyuan Cheng, Hongjun Choi, Shiwei Feng et al.

ICLR 2024posterarXiv:2304.14614

#4344

MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba

Masakazu Yoshimura, Teruaki Hayashi, Yota Maeda

ICLR 2025posterarXiv:2411.03855

#4345

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

Chunming He, Kai Li, Yachao Zhang et al.

ICLR 2024posterarXiv:2308.03166

#4346

CBQ: Cross-Block Quantization for Large Language Models

Xin Ding, Xiaoyu Liu, Zhijun Tu et al.

ICLR 2025posterarXiv:2312.07950

#4347

Identifying Representations for Intervention Extrapolation

Sorawit (James) Saengkyongam, Elan Rosenfeld, Pradeep K Ravikumar et al.

ICLR 2024posterarXiv:2310.04295

#4348

SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases

Yang Liu, Jiashun Cheng, Haihong Zhao et al.

ICLR 2024spotlightarXiv:2308.13212

#4349

DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$

Allan Jabri, Sjoerd van Steenkiste, Emiel Hoogeboom et al.

ICLR 2024poster

#4350

Learning-Augmented Frequent Directions

Anders Aamand, Justin Chen, Siddharth Gollapudi et al.

ICLR 2025posterarXiv:2503.00937

#4351

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Yekun Chai, Haoran Sun, Huang Fang et al.

ICLR 2025oralarXiv:2410.02743

#4352

Structuring Benchmark into Knowledge Graphs to Assist Large Language Models in Retrieving and Designing Models

Hanmo Liu, Shimin Di, Jialiang Wang et al.

ICLR 2025poster

#4353

Chameleon: Increasing Label-Only Membership Leakage with Adaptive Poisoning

Harsh Chaudhari, Giorgio Severi, Alina Oprea et al.

ICLR 2024posterarXiv:2310.03838

#4354

Locality-Aware Graph Rewiring in GNNs

Federico Barbero, Ameya Velingker, Amin Saberi et al.

ICLR 2024poster

#4355

Adaptive Instrument Design for Indirect Experiments

Yash Chandak, Shiv Shankar, Vasilis Syrgkanis et al.

ICLR 2024posterarXiv:2312.02438

#4356

Learning 3D Particle-based Simulators from RGB-D Videos

William Whitney, Tatiana Lopez-Guevara, Tobias Pfaff et al.

ICLR 2024posterarXiv:2312.05359

#4357

Space and time continuous physics simulation from partial observations

Steeven Janny, Madiha Nadri, Julie Digne et al.

ICLR 2024oralarXiv:2401.09198

#4358

Adaptive Retention & Correction: Test-Time Training for Continual Learning

Haoran Chen, Micah Goldblum, Zuxuan Wu et al.

ICLR 2025posterarXiv:2405.14318

#4359

Optimal Sample Complexity for Average Reward Markov Decision Processes

Shengbo Wang, Jose Blanchet, Peter Glynn

ICLR 2024posterarXiv:2310.08833

#4360

IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning

Vindula Jayawardana, Baptiste Freydt, Ao Qu et al.

ICLR 2025posterarXiv:2410.15221

#4361

Interpreting CLIP's Image Representation via Text-Based Decomposition

Yossi Gandelsman, Alexei Efros, Jacob Steinhardt

ICLR 2024posterarXiv:2310.05916

#4362

Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods

Zijian Liu, Zhengyuan Zhou

ICLR 2024posterarXiv:2312.08531

#4363

Time-Efficient Reinforcement Learning with Stochastic Stateful Policies

Firas Al-Hafez, Guoping Zhao, Jan Peters et al.

ICLR 2024posterarXiv:2311.04082

#4364

NeurRev: Train Better Sparse Neural Network Practically via Neuron Revitalization

Gen Li, Lu Yin, Jie Ji et al.

ICLR 2024poster

#4365

Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do

Yoav Wald, Mark Goldstein, Yonathan Efroni et al.

ICLR 2025posterarXiv:2503.15890

#4366

GraphChef: Decision-Tree Recipes to Explain Graph Neural Networks

Peter Müller, Lukas Faber, Karolis Martinkus et al.

ICLR 2024poster

#4367

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

Karsten Roth, Lukas Thede, A. Sophia Koepke et al.

ICLR 2024spotlightarXiv:2310.17653

#4368

LR0.FM: LOW-RESOLUTION ZERO-SHOT CLASSIFICATION BENCHMARK FOR FOUNDATION MODELS

Priyank Pathak, Shyam Marjit, Shruti Vyas et al.

ICLR 2025poster

#4369

On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks

Shengjie Zhou, Lue Tao, Yuzhou Cao et al.

ICLR 2024poster

#4370

Efficient Score Matching with Deep Equilibrium Layers

Yuhao Huang, Qingsong Wang, Akwum Onwunta et al.

ICLR 2024poster

#4371

Rethinking Label Poisoning for GNNs: Pitfalls and Attacks

Vijay Chandra Lingam, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski

ICLR 2024poster

#4372

A path-norm toolkit for modern networks: consequences, promises and challenges

Antoine Gonon, Nicolas Brisebarre, Elisa Riccietti et al.

ICLR 2024spotlightarXiv:2310.01225

#4373

LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement

Zhifan Ye, Kejing Xia, Yonggan Fu et al.

ICLR 2025posterarXiv:2504.16053

#4374

Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

Ian Gemp, Luke Marris, Georgios Piliouras

ICLR 2024posterarXiv:2310.06689

#4375

Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation

Shih-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao et al.

ICLR 2024posterarXiv:2309.14859

#4376

Radar: Fast Long-Context Decoding for Any Transformer

Yongchang Hao, Mengyao Zhai, Hossein Hajimirsadeghi et al.

ICLR 2025posterarXiv:2503.10571

#4377

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Yuren Cong, Mengmeng Xu, Christian Simon et al.

ICLR 2024oralarXiv:2310.05922

#4378

LabelDP-Pro: Learning with Label Differential Privacy via Projections

Badih Ghazi, Yangsibo Huang, Pritish Kamath et al.

ICLR 2024poster

#4379

Learning the Optimal Stopping for Early Classification within Finite Horizons via Sequential Probability Ratio Test

Akinori F. Ebihara, Taiki Miyagawa, Kazuyuki Sakurai et al.

ICLR 2025posterarXiv:2501.18059

#4380

Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models

Jung Hwan Heo, Jeonghoon Kim, Beomseok Kwon et al.

ICLR 2024posterarXiv:2309.15531

#4381

Robust Similarity Learning with Difference Alignment Regularization

Shuo Chen, Gang Niu, Chen Gong et al.

ICLR 2024poster

#4382

Diving Segmentation Model into Pixels

Chen Gan, Zihao Yin, Kelei He et al.

ICLR 2024poster

#4383

A Cognitive Model for Learning Abstract Relational Structures from Memory-based Decision-Making Tasks

Haruo Hosoya

ICLR 2024oral

#4384

Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

Gabriele Corso, Yilun Xu, Valentin De Bortoli et al.

ICLR 2024posterarXiv:2310.13102

#4385

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models

Donghoon Kim, Minji Bae, Kyuhong Shim et al.

ICLR 2025posterarXiv:2505.08622

#4386

Tight Time Complexities in Parallel Stochastic Optimization with Arbitrary Computation Dynamics

Alexander Tyurin

ICLR 2025posterarXiv:2408.04929

#4387

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents

Jake Grigsby, Jim Fan, Yuke Zhu

ICLR 2024spotlightarXiv:2310.09971

#4388

Improving Offline RL by Blending Heuristics

Sinong Geng, Aldo Pacchiano, Andrey Kolobov et al.

ICLR 2024spotlightarXiv:2306.00321

#4389

Delta-AI: Local objectives for amortized inference in sparse graphical models

Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin et al.

ICLR 2024posterarXiv:2310.02423

#4390

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Yichao Shen, Zigang Geng, YUHUI YUAN et al.

ICLR 2024posterarXiv:2308.04409

#4391

Lagrangian Flow Networks for Conservation Laws

Fabricio Arend Torres, Marcello Negri, Marco Inversi et al.

ICLR 2024spotlightarXiv:2305.16846

#4392

WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions

Can Xu, Qingfeng Sun, Kai Zheng et al.

ICLR 2024posterarXiv:2304.12244

#4393

Looking Backward: Streaming Video-to-Video Translation with Feature Banks

Feng Liang, Akio Kodaira, Chenfeng Xu et al.

ICLR 2025oralarXiv:2405.15757

#4394

Why Does the Effective Context Length of LLMs Fall Short?

Chenxin An, Jun Zhang, Ming Zhong et al.

ICLR 2025posterarXiv:2410.18745

#4395

Modelling complex vector drawings with stroke-clouds

Alexander Ashcroft, Ayan Das, Yulia Gryaditskaya et al.

ICLR 2024poster

#4396

Multi-View Causal Representation Learning with Partial Observability

Dingling Yao, Danru Xu, Sébastien Lachapelle et al.

ICLR 2024spotlightarXiv:2311.04056

#4397

PILOT: An $\mathcal{O}(1/K)$-Convergent Approach for Policy Evaluation with Nonlinear Function Approximation

Zhuqing Liu, Xin Zhang, Jia Liu et al.

ICLR 2024spotlight

#4398

Universal Humanoid Motion Representations for Physics-Based Control

Zhengyi Luo, Jinkun Cao, Josh Merel et al.

ICLR 2024spotlightarXiv:2310.04582

#4399

Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning

Caleb Chuck, Fan Feng, Carl Qi et al.

ICLR 2025posterarXiv:2505.03172

#4400

$\text{I}^2\text{AM}$: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps

Junseo Park, Hyeryung Jang

ICLR 2025poster

← Previous

1...20 21 22 23 24...31