ICLR Spotlight Papers

345 papers found • Page 7 of 7

Submodular Reinforcement Learning

Manish Prajapat, Mojmir Mutny, Melanie Zeilinger et al.

ICLR 2024 • spotlight • arXiv:2307.13372

Subtractive Mixture Models via Squaring: Representation and Learning

Lorenzo Loconte, Aleksanteri Sladek, Stefan Mengel et al.

ICLR 2024 • spotlight • arXiv:2310.00724

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho et al.

ICLR 2024 • spotlight • arXiv:2309.07311

SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS

Yameng Peng, Andy Song, Haytham Fayek et al.

ICLR 2024 • spotlight • arXiv:2403.04161 • 16 citations

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024 • spotlight • arXiv:2312.01127 • 13 citations

Synaptic Weight Distributions Depend on the Geometry of Plasticity

Roman Pogodin, Jonathan Cornford, Arna Ghosh et al.

ICLR 2024 • spotlight • arXiv:2305.19394

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Yuan Liu, Cheng Lin, Zijiao Zeng et al.

ICLR 2024 • spotlight • arXiv:2309.03453

Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning

Yucheng Yang, Tianyi Zhou, Qiang He et al.

ICLR 2024 • spotlight • arXiv:2506.10629

TD-MPC2: Scalable, Robust World Models for Continuous Control

Nicklas Hansen, Hao Su, Xiaolong Wang

ICLR 2024 • spotlight • arXiv:2310.16828 • 293 citations

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Sam Toyer, Olivia Watkins, Ethan Mendes et al.

ICLR 2024 • spotlight • arXiv:2311.01011

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Tianbao Xie, Siheng Zhao, Chen Henry Wu et al.

ICLR 2024 • spotlight • arXiv:2309.11489

The Consensus Game: Language Model Generation via Equilibrium Search

Athul Jacob, Yikang Shen, Gabriele Farina et al.

ICLR 2024 • spotlight • arXiv:2310.09139 • 34 citations

The Effective Horizon Explains Deep RL Performance in Stochastic Environments

Cassidy Laidlaw, Banghua Zhu, Stuart Russell et al.

ICLR 2024 • spotlight • arXiv:2312.08369 • 5 citations

The False Promise of Imitating Proprietary Language Models

Arnav Gudibande, Eric Wallace, Charlie Snell et al.

ICLR 2024 • spotlight

Thin-Shell Object Manipulations With Differentiable Physics Simulations

Yian Wang, Juntian Zheng, Zhehuan Chen et al.

ICLR 2024 • spotlight • arXiv:2404.00451

Time Travel in LLMs: Tracing Data Contamination in Large Language Models

Shahriar Golchin, Mihai Surdeanu

ICLR 2024 • spotlight • arXiv:2308.08493

Tool-Augmented Reward Modeling

Lei Li, Yekun Chai, Shuohuan Wang et al.

ICLR 2024 • spotlight • arXiv:2310.01045

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Yujia Qin, Shihao Liang, Yining Ye et al.

ICLR 2024 • spotlight • arXiv:2307.16789 • 1128 citations

TorchRL: A data-driven decision-making library for PyTorch

Albert Bou, Matteo Bettini, Sebastian Dittert et al.

ICLR 2024 • spotlight • arXiv:2306.00577

Towards Energy Efficient Spiking Neural Networks: An Unstructured Pruning Framework

Xinyu Shi, Jianhao Ding, Zecheng Hao et al.

ICLR 2024 • spotlight • 35 citations

Towards LLM4QPE: Unsupervised Pretraining of Quantum Property Estimation and A Benchmark

Yehui Tang, Hao Xiong, Nianzu Yang et al.

ICLR 2024 • spotlight

Towards Meta-Pruning via Optimal Transport

Alexander Theus, Olin Geimer, Friedrich Wicke et al.

ICLR 2024 • spotlight • arXiv:2402.07839

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Haolin Liu, Chen-Yu Wei, Julian Zimmert

ICLR 2024 • spotlight • arXiv:2310.11550

Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling Benign Features

Xiong Xu, Kunzhe Huang, Yiming Li et al.

ICLR 2024 • spotlight

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

Xu Zheng, Farhad Shirani, Tianchun Wang et al.

ICLR 2024 • spotlight • arXiv:2310.01820 • 15 citations

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024 • spotlight • arXiv:2310.12955

Towards Robust Out-of-Distribution Generalization Bounds via Sharpness

Yingtian Zou, Kenji Kawaguchi, Yingnan Liu et al.

ICLR 2024 • spotlight • arXiv:2403.06392

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

Tom Sherborne, Naomi Saphra, Pradeep Dasigi et al.

ICLR 2024 • spotlight • arXiv:2310.03646

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Bingchen Zhao, Haoqin Tu, Chen Wei et al.

ICLR 2024 • spotlight • arXiv:2312.11420

Unbiased Watermark for Large Language Models

Zhengmian Hu, Lichang Chen, Xidong Wu et al.

ICLR 2024 • spotlight • arXiv:2310.10669

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

Hao Chen, Jindong Wang, Ankit Parag Shah et al.

ICLR 2024 • spotlight • arXiv:2309.17002

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression

Runtian Zhai, Bingbin Liu, Andrej Risteski et al.

ICLR 2024 • spotlight • arXiv:2306.00788 • 17 citations

Uni3D: Exploring Unified 3D Representation at Scale

Junsheng Zhou, Jinsheng Wang, Baorui Ma et al.

ICLR 2024 • spotlight • arXiv:2310.06773 • 165 citations

Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Zeqi Xiao, Tai Wang, Jingbo Wang et al.

ICLR 2024 • spotlight • arXiv:2309.07918 • 100 citations

Universal Humanoid Motion Representations for Physics-Based Control

Zhengyi Luo, Jinkun Cao, Josh Merel et al.

ICLR 2024 • spotlight • arXiv:2310.04582

Unleashing the Potential of Fractional Calculus in Graph Neural Networks with FROND

Qiyu Kang, Kai Zhao, Qinxu Ding et al.

ICLR 2024 • spotlight • arXiv:2404.17099 • 16 citations

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.

ICLR 2024 • spotlight • arXiv:2305.01521 • 9 citations

Variational Bayesian Last Layers

James Harrison, John Willes, Jasper Snoek

ICLR 2024 • spotlight • arXiv:2404.11599

Variational Inference for SDEs Driven by Fractional Noise

Rembert Daems, Manfred Opper, Guillaume Crevecoeur et al.

ICLR 2024 • spotlight • arXiv:2310.12975 • 10 citations

Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation

Kimia Hamidieh, Haoran Zhang, Swami Sankaranarayanan et al.

ICLR 2024 • spotlight • arXiv:2406.18562

Vision-Language Foundation Models as Effective Robot Imitators

Xinghang Li, Minghuan Liu, Hanbo Zhang et al.

ICLR 2024 • spotlight • arXiv:2311.01378 • 310 citations

What does automatic differentiation compute for neural networks?

Sejun Park, Sanghyuk Chun, Wonyeol Lee

ICLR 2024 • spotlight

What does the Knowledge Neuron Thesis Have to do with Knowledge?

Jingcheng Niu, Andrew Liu, Zining Zhu et al.

ICLR 2024 • spotlight • arXiv:2405.02421 • 47 citations

What's In My Big Data?

Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.

ICLR 2024 • spotlight • arXiv:2310.20707

Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models

Ziyu Wang, Lejun Min, Gus Xia

ICLR 2024 • spotlight • arXiv:2405.09901 • 24 citations