Pieter Abbeel

91
Papers
8,229
Total Citations

Papers (91)

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

NeurIPS 2016arXiv
4,424
citations

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

NeurIPS 2017arXiv
826
citations

One-Shot Imitation Learning

NeurIPS 2017arXiv
721
citations

Value Iteration Networks

NeurIPS 2016arXiv
675
citations

Learning to Poke by Poking: Experiential Learning of Intuitive Physics

NeurIPS 2016arXiv
595
citations

Learning Interactive Real-World Simulators

ICLR 2024
334
citations

Backprop KF: Learning Discriminative Deterministic State Estimators

NeurIPS 2016arXiv
213
citations

World Model on Million-Length Video And Language With Blockwise RingAttention

ICLR 2025arXiv
144
citations

Video Language Planning

ICLR 2024
144
citations

VIME: Variational Information Maximizing Exploration

NeurIPS 2016arXiv
80
citations

Combinatorial Energy Learning for Image Segmentation

NeurIPS 2016arXiv
27
citations

ElasticTok: Adaptive Tokenization for Image and Video

ICLR 2025
21
citations

Prioritized Generative Replay

ICLR 2025
9
citations

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

ICLR 2024
8
citations

Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners

NeurIPS 2025
6
citations

Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction

CVPR 2025
2
citations

Gradient Estimation Using Stochastic Computation Graphs

NeurIPS 2015arXiv
0
citations

Sim-to-Real 6D Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking

ECCV 2022
0
citations

Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction

ECCV 2022
0
citations

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

ICCV 2021arXiv
0
citations

VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

CVPR 2023arXiv
0
citations

Zero-Shot Text-Guided Object Generation With Dream Fields

CVPR 2022arXiv
0
citations

Bottleneck Transformers for Visual Recognition

CVPR 2021arXiv
0
citations

Cooperative Inverse Reinforcement Learning

NeurIPS 2016arXiv
0
citations

Inverse Reward Design

NeurIPS 2017arXiv
0
citations

Learning to Model the World With Language

ICML 2024
0
citations

Position: Video as the New Language for Real-World Decision Making

ICML 2024
0
citations

Visual Representation Learning with Stochastic Frame Prediction

ICML 2024
0
citations

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings

ICML 2024
0
citations

Learning a Diffusion Model Policy from Rewards via Q-Score Matching

ICML 2024
0
citations

Masked Autoencoding for Scalable and Generalizable Decision Making

NeurIPS 2022
0
citations

Deep Hierarchical Planning from Pixels

NeurIPS 2022
0
citations

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

NeurIPS 2022
0
citations

Unsupervised Reinforcement Learning with Contrastive Intrinsic Control

NeurIPS 2022
0
citations

Chain of Thought Imitation with Procedure Cloning

NeurIPS 2022
0
citations

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

NeurIPS 2023
0
citations

Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment

NeurIPS 2023
0
citations

Blockwise Parallel Transformers for Large Context Models

NeurIPS 2023
0
citations

Learning Universal Policies via Text-Guided Video Generation

NeurIPS 2023
0
citations

Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration

NeurIPS 2023
0
citations

Video Prediction Models as Rewards for Reinforcement Learning

NeurIPS 2023
0
citations

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation

NeurIPS 2023
0
citations

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

NeurIPS 2023
0
citations

Alpha-Beta Divergences Discover Micro and Macro Structures in Data

ICML 2015
0
citations

Trust Region Policy Optimization

ICML 2015
0
citations

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization

ICML 2016
0
citations

Benchmarking Deep Reinforcement Learning for Continuous Control

ICML 2016
0
citations

Constrained Policy Optimization

ICML 2017
0
citations

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

ICML 2017
0
citations

Reinforcement Learning with Deep Energy-Based Policies

ICML 2017
0
citations

Prediction and Control with Temporal Segment Models

ICML 2017
0
citations

PixelSNAIL: An Improved Autoregressive Generative Model

ICML 2018
0
citations

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings

ICML 2018
0
citations

Automatic Goal Generation for Reinforcement Learning Agents

ICML 2018
0
citations

Latent Space Policies for Hierarchical Reinforcement Learning

ICML 2018
0
citations

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

ICML 2018
0
citations

Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control

ICML 2018
0
citations

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

ICML 2019
0
citations

Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules

ICML 2019
0
citations

Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables

ICML 2019
0
citations

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference

ICML 2019
0
citations

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning

ICML 2019
0
citations

Learning Plannable Representations with Causal InfoGAN

NeurIPS 2018
0
citations

Meta-Reinforcement Learning of Structured Exploration Strategies

NeurIPS 2018
0
citations

Evolved Policy Gradients

NeurIPS 2018
0
citations

The Importance of Sampling inMeta-Reinforcement Learning

NeurIPS 2018
0
citations

Compositional Plan Vectors

NeurIPS 2019
0
citations

Evaluating Protein Transfer Learning with TAPE

NeurIPS 2019
0
citations

Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

NeurIPS 2019
0
citations

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies

NeurIPS 2019
0
citations

Geometry-Aware Neural Rendering

NeurIPS 2019
0
citations

Goal-conditioned Imitation Learning

NeurIPS 2019
0
citations

Guided Meta-Policy Search

NeurIPS 2019
0
citations

On the Utility of Learning about Humans for Human-AI Coordination

NeurIPS 2019
0
citations

Compression with Flows via Local Bits-Back Coding

NeurIPS 2019
0
citations

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

NeurIPS 2020
0
citations

AvE: Assistance via Empowerment

NeurIPS 2020
0
citations

Sparse Graphical Memory for Robust Planning

NeurIPS 2020
0
citations

Denoising Diffusion Probabilistic Models

NeurIPS 2020
0
citations

Automatic Curriculum Learning through Value Disagreement

NeurIPS 2020
0
citations

Generalized Hindsight for Reinforcement Learning

NeurIPS 2020
0
citations

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

NeurIPS 2020
0
citations

Reinforcement Learning with Augmented Data

NeurIPS 2020
0
citations

Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL

NeurIPS 2021
0
citations

Teachable Reinforcement Learning via Advice Distillation

NeurIPS 2021
0
citations

Decision Transformer: Reinforcement Learning via Sequence Modeling

NeurIPS 2021
0
citations

Behavior From the Void: Unsupervised Active Pre-Training

NeurIPS 2021
0
citations

Reinforcement Learning with Latent Flow

NeurIPS 2021
0
citations

Mastering Atari Games with Limited Data

NeurIPS 2021
0
citations

Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings

NeurIPS 2021
0
citations

Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions

NeurIPS 2022
0
citations