Continuous control with deep reinforcement learning

Lillicrap, Timothy; Hunt, Jonathan J.; Pritzel, Alexander; Heess, Nicolas; Erez, Tom; Tassa, Yuval; Silver, David; Wierstra, Daan

Article · arXiv (Cornell University) · Jul 22, 2016 · Green OA

Google (United States) · Google DeepMind (United Kingdom)

Abstract

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for…
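
As a rough illustration of the actor-critic idea named in the abstract, the sketch below applies a deterministic policy gradient step to a toy problem. It is not the paper's implementation. Everything in it is an assumption made for illustration: the one-step task and its `reward` function, the scalar linear actor `mu(s) = theta * s`, the quadratic-feature critic fitted by least squares (standing in for a neural network), and all names and hyper-parameters. The critic is fitted from sampled rewards only, and the actor is then moved along the gradient of the critic with respect to the action, multiplied by the gradient of the action with respect to the actor parameter.

```python
import numpy as np

def reward(s, a):
    # Hypothetical one-step task: the best action for state s is a = 2 * s.
    return -(a - 2.0 * s) ** 2

def critic_features(s, a):
    # Quadratic features, so Q(s, a) = w . phi(s, a) can represent the reward.
    return np.stack([a ** 2, a * s, s ** 2], axis=1)

def train(iterations=50, batch_size=256, actor_lr=0.5, noise_std=0.3, seed=0):
    rng = np.random.default_rng(seed)
    theta = 0.0       # actor parameter: mu(s) = theta * s
    w = np.zeros(3)   # critic weights
    for _ in range(iterations):
        s = rng.uniform(-1.0, 1.0, size=batch_size)
        # Act with the deterministic actor plus Gaussian exploration noise.
        a = theta * s + rng.normal(0.0, noise_std, size=batch_size)
        r = reward(s, a)
        # Critic step: regress sampled rewards onto the features (model-free:
        # only observed (s, a, r) samples are used).
        w, *_ = np.linalg.lstsq(critic_features(s, a), r, rcond=None)
        # Actor step: chain rule dQ/da * dmu/dtheta, evaluated at a = mu(s).
        a_pi = theta * s
        dq_da = 2.0 * w[0] * a_pi + w[1] * s
        dmu_dtheta = s
        theta += actor_lr * np.mean(dq_da * dmu_dtheta)
    return theta, w
```

Calling `train()` returns the learned actor parameter and critic weights. In this toy setting the actor parameter should approach 2, the value that maximizes the assumed reward. Because the task has a single step, the critic needs no bootstrapped target; a multi-step problem would need one.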

Citation impact

Total citations: 6,778

FWCI: 503.28
Percentile: 100%
References: 31

Authors: 8

Topics & keywords

Keywords

Reinforcement learning
Computer science
Domain (mathematical analysis)
Artificial intelligence
Action (physics)
Control (management)
Swing
Architecture
