Peter Stone
23
Papers
78
Total Citations
Papers (23)
Longhorn: State Space Models are Amortized Online Learners
ICLR 2025
29
citations
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
AAAI 2024arXiv
19
citations
Learning Optimal Advantage from Preferences and Mistaking It for Reward
AAAI 2024arXiv
15
citations
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
AAAI 2024arXiv
9
citations
RLZero: Direct Policy Inference from Language Without In-Domain Supervision
NeurIPS 2025arXiv
3
citations
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
ICLR 2024
2
citations
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
ICLR 2025
1
citations
Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles
CVPR 2022arXiv
0
citations
Argus: A Compact and Versatile Foundation Model for Vision
CVPR 2025
0
citations
ELDEN: Exploration via Local Dependencies
NeurIPS 2023
0
citations
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
NeurIPS 2023
0
citations
FAMO: Fast Adaptive Multitask Optimization
NeurIPS 2023
0
citations
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search
ICML 2016
0
citations
Data-Efficient Policy Evaluation Through Behavior Policy Search
ICML 2017
0
citations
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
ICML 2019
0
citations
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
NeurIPS 2020
0
citations
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks
NeurIPS 2020
0
citations
Adversarial Intrinsic Motivation for Reinforcement Learning
NeurIPS 2021
0
citations
Conflict-Averse Gradient Descent for Multi-task learning
NeurIPS 2021
0
citations
Machine versus Human Attention in Deep Reinforcement Learning Tasks
NeurIPS 2021
0
citations
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
NeurIPS 2022
0
citations
BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach
NeurIPS 2022
0
citations
f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences
NeurIPS 2023
0
citations