ICLR Papers

6,124 papers found • Page 107 of 123

On Harmonizing Implicit Subpopulations

Feng Hong, Jiangchao Yao, YUEMING LYU et al.

ICLR 2024poster
8
citations

Online Continual Learning for Interactive Instruction Following Agents

Byeonghwi Kim, Minhyuk Seo, Jonghyun Choi

ICLR 2024posterarXiv:2403.07548

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024spotlightarXiv:2403.09953
14
citations

Online Information Acquisition: Hiring Multiple Agents

Federico Cacciamani, Matteo Castiglioni, Nicola Gatti

ICLR 2024posterarXiv:2307.06210

Online Stabilization of Spiking Neural Networks

Yaoyu Zhu, Jianhao Ding, Tiejun Huang et al.

ICLR 2024spotlight

Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling

Aadirupa Saha, Branislav Kveton

ICLR 2024posterarXiv:2303.09033

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

Jeongyeol Kwon, Dohyun Kwon, Stephen Wright et al.

ICLR 2024spotlightarXiv:2309.01753

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Rishabh Agarwal, Nino Vieillard, Yongchao Zhou et al.

ICLR 2024posterarXiv:2306.13649

On Representation Complexity of Model-based and Model-free Reinforcement Learning

Hanlin Zhu, Baihe Huang, Stuart Russell

ICLR 2024posterarXiv:2310.01706

On Stationary Point Convergence of PPO-Clip

Ruinan Jin, Shuai Li, Baoxiang Wang

ICLR 2024poster

On the Analysis of GAN-based Image-to-Image Translation with Gaussian Noise Injection

Chaohua Shi, Kexin Huang, Lu Gan et al.

ICLR 2024poster

On the Effect of Batch Size in Byzantine-Robust Distributed Learning

Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li

ICLR 2024poster

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning

Rohan Subramani, Marcus Williams, Max Heitmann et al.

ICLR 2024oralarXiv:2310.11840

On the Fairness ROAD: Robust Optimization for Adversarial Debiasing

Vincent Grari, Thibault Laugel, Tatsunori Hashimoto et al.

ICLR 2024posterarXiv:2310.18413

On the Foundations of Shortcut Learning

Katherine Hermann, Hossein Mobahi, Thomas FEL et al.

ICLR 2024spotlightarXiv:2310.16228

On the Generalization and Approximation Capacities of Neural Controlled Differential Equations

Linus Bleistein, Agathe Guilloux

ICLR 2024posterarXiv:2305.16791

On the generalization capacity of neural networks during generic multimodal reasoning

Takuya Ito, Soham Dan, Mattia Rigotti et al.

ICLR 2024posterarXiv:2401.15030

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Ziyi Chen, Yi Zhou, Heng Huang

ICLR 2024poster

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024spotlightarXiv:2401.01869
12
citations

On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback

Ziwei Guan, Yi Zhou, Yingbin Liang

ICLR 2024poster

On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs

Jen-tse Huang, Wenxuan Wang, Eric John Li et al.

ICLR 2024poster

On the Joint Interaction of Models, Data, and Features

Yiding Jiang, Christina Baek, J Kolter

ICLR 2024posterarXiv:2306.04793
4
citations

On the Learnability of Watermarks for Language Models

Chenchen Gu, XIANG LI, Percy Liang et al.

ICLR 2024posterarXiv:2312.04469
68
citations

On the Limitations of Temperature Scaling for Distributions with Overlaps

Muthu Chidambaram, Rong Ge

ICLR 2024posterarXiv:2306.00740
8
citations

On the Markov Property of Neural Algorithmic Reasoning: Analyses and Methods

Montgomery Bohde, Meng Liu, Alexandra Saxton et al.

ICLR 2024spotlightarXiv:2403.04929

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Runqi Lin, Chaojian Yu, Bo Han et al.

ICLR 2024posterarXiv:2310.08847

On the Parameterization of Second-Order Optimization Effective towards the Infinite Width

Satoki Ishikawa, Ryo Karakida

ICLR 2024posterarXiv:2312.12226

On the Posterior Distribution in Denoising: Application to Uncertainty Quantification

Hila Manor, Tomer Michaeli

ICLR 2024posterarXiv:2309.13598

On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters

Matthias Lanzinger, Pablo Barcelo

ICLR 2024posterarXiv:2309.17053

On the Provable Advantage of Unsupervised Pretraining

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024spotlightarXiv:2303.01566
22
citations

On the Reliability of Watermarks for Large Language Models

John Kirchenbauer, Jonas Geiping, Yuxin Wen et al.

ICLR 2024posterarXiv:2306.04634
176
citations

On the Role of Discrete Tokenization in Visual Representation Learning

Tianqi Du, Yifei Wang, Yisen Wang

ICLR 2024spotlightarXiv:2407.09087

On the Role of General Function Approximation in Offline Reinforcement Learning

Chenjie Mao, Qiaosheng Zhang, Zhen Wang et al.

ICLR 2024spotlight

On the Sample Complexity of Lipschitz Constant Estimation

Stephen Roberts, Julien Huang, Jan-Peter Calliess

ICLR 2024poster

On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks

Zi Wang, Bin Hu, Aaron Havens et al.

ICLR 2024poster

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024posterarXiv:2310.02579

On the Stability of Iterative Retraining of Generative Models on their own Data

Quentin Bertrand, Joey Bose, Alexandre Duplessis et al.

ICLR 2024spotlightarXiv:2310.00429

On the Variance of Neural Network Training with respect to Test Sets and Distributions

Keller Jordan

ICLR 2024posterarXiv:2304.01910
20
citations

On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks

Shengjie Zhou, Lue Tao, Yuzhou Cao et al.

ICLR 2024poster

On Trajectory Augmentations for Off-Policy Evaluation

Ge Gao, Qitong Gao, Xi Yang et al.

ICLR 2024poster

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Guan Wang, Sijie Cheng, Xianyuan Zhan et al.

ICLR 2024posterarXiv:2309.11235
309
citations

Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy

Simon Ging, Maria A. Bravo, Thomas Brox

ICLR 2024spotlightarXiv:2402.07270

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

Francis Engelmann, Fabian Manhardt, Michael Niemeyer et al.

ICLR 2024posterarXiv:2404.03650

OpenTab: Advancing Large Language Models as Open-domain Table Reasoners

Kezhi Kong, Jiani Zhang, Zhengyuan Shen et al.

ICLR 2024posterarXiv:2402.14361
36
citations

Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

Ge Li, Hongyi Zhou, Dominik Roth et al.

ICLR 2024oralarXiv:2401.11437
11
citations

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

Keiran Paster, Marco Dos Santos, Zhangir Azerbayev et al.

ICLR 2024posterarXiv:2310.06786

Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime

Keita Suzuki, Taiji Suzuki

ICLR 2024poster

OPTIMAL ROBUST MEMORIZATION WITH RELU NEURAL NETWORKS

Lijia Yu, XIAOSHAN GAO, Lijun Zhang

ICLR 2024spotlight

Optimal Sample Complexity for Average Reward Markov Decision Processes

Shengbo Wang, Jose Blanchet, Peter Glynn

ICLR 2024posterarXiv:2310.08833

Optimal Sample Complexity of Contrastive Learning

Noga Alon, Dmitrii Avdiukhin, Dor Elboim et al.

ICLR 2024spotlightarXiv:2312.00379
11
citations