Search filters

List of works by Sergey Levine

Backprop KF: Learning Discriminative Deterministic State Estimators

scientific article published in January 2016

Causal Confusion in Imitation Learning

scientific article published in January 2019

Compositional Plan Vectors

scientific article published in January 2019

Conservative Q-Learning for Offline Reinforcement Learning

scientific article published in November 2020

Continual Learning of Control Primitives : Skill Discovery via Reset-Games

scientific article published in November 2020

Data-Efficient Hierarchical Reinforcement Learning

scholarly article by Ofir Nachum et al published 2018 in Advances in Neural Information Processing Systems 31

Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

scholarly article by Aviral Kumar et al published November 2020 in Advances in Neural Information Processing Systems 33

EX2: Exploration with Exemplar Models for Deep Reinforcement Learning

scientific article published in January 2017

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design

scientific article published in November 2020

End-to-End Training of Deep Visuomotor Policies

scientific article published in 2016

Feature Construction for Inverse Reinforcement Learning

scholarly article by Sergey Levine et al published 2010 in Advances in Neural Information Processing Systems 23

Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction

scientific article published in November 2020

Gradient Surgery for Multi-Task Learning

scientific article published in November 2020

Guided Meta-Policy Search

Guided Policy Search via Approximate Mirror Descent

scientific article published in January 2016

Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning

Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics

scientific article published in January 2014

Learning to Poke by Poking: Experiential Learning of Intuitive Physics

scientific article published in January 2016

Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies

MOPO: Model-based Offline Policy Optimization

scientific article published in November 2020

Meta-Learning with Implicit Gradients

Meta-Reinforcement Learning of Structured Exploration Strategies

scholarly article by Abhishek Gupta et al published 2018 in Advances in Neural Information Processing Systems 31

Model Inversion Networks for Model-Based Optimization

scientific article published in November 2020

Nonlinear Inverse Reinforcement Learning with Gaussian Processes

scientific article published in January 2011

Off-Policy Evaluation via Off-Policy Classification

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL

Planning with Goal-Conditioned Policies

scientific article published in January 2019

Probabilistic Model-Agnostic Meta-Learning

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

scientific article published in November 2020

Search on the Replay Buffer: Bridging Planning and Reinforcement Learning

scientific article published in January 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

scientific article published in November 2020

Unsupervised Curricula for Visual Meta-Reinforcement Learning

scholarly article by Allan Jabri et al published 2019 in Advances in Neural Information Processing Systems 32

Unsupervised Learning for Physical Interaction through Video Prediction

scientific article published in January 2016

Value Iteration Networks

scholarly article

Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition

Variational Policy Search via Trajectory Optimization

Visual Memory for Robust Path Following

Visual Reinforcement Learning with Imagined Goals

scholarly article by Ashvin V. Nair et al published 2018 in Advances in Neural Information Processing Systems 31

Wasserstein Dependency Measure for Representation Learning

scientific article published in January 2019

When to Trust Your Model: Model-Based Policy Optimization

article by Michael Janner et al published 2019 in Advances in Neural Information Processing Systems 32

Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior

article