Peter Stone

23
Papers
78
Total Citations

Papers (23)

Longhorn: State Space Models are Amortized Online Learners

ICLR 2025
29
citations

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

AAAI 2024 (arXiv)
19
citations

Learning Optimal Advantage from Preferences and Mistaking It for Reward

AAAI 2024 (arXiv)
15
citations

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

AAAI 2024 (arXiv)
9
citations

RLZero: Direct Policy Inference from Language Without In-Domain Supervision

NeurIPS 2025 (arXiv)
3
citations

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

ICLR 2024
2
citations

Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory

ICLR 2025
1
citations

Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles

CVPR 2022 (arXiv)
0
citations

Argus: A Compact and Versatile Foundation Model for Vision

CVPR 2025
0
citations

ELDEN: Exploration via Local Dependencies

NeurIPS 2023
0
citations

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

NeurIPS 2023
0
citations

FAMO: Fast Adaptive Multitask Optimization

NeurIPS 2023
0
citations

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search

ICML 2016
0
citations

Data-Efficient Policy Evaluation Through Behavior Policy Search

ICML 2017
0
citations

Importance Sampling Policy Evaluation with an Estimated Behavior Policy

ICML 2019
0
citations

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

NeurIPS 2020
0
citations

Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks

NeurIPS 2020
0
citations

Adversarial Intrinsic Motivation for Reinforcement Learning

NeurIPS 2021
0
citations

Conflict-Averse Gradient Descent for Multi-task learning

NeurIPS 2021
0
citations

Machine versus Human Attention in Deep Reinforcement Learning Tasks

NeurIPS 2021
0
citations

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

NeurIPS 2022
0
citations

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

NeurIPS 2022
0
citations

f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences

NeurIPS 2023
0
citations