Playing Atari with Deep Reinforcement Learning

Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Graves, Alex; Antonoglou, Ioannis; Wierstra, Daan; Riedmiller, Martin

doi:10.48550/arxiv.1312.5602

Preprint, arXiv (Cornell University), Dec 19, 2013, Green OA

Indexed in: arXiv, DataCite

Abstract

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
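The abstract says only that the network is trained with "a variant of Q-learning" and outputs a value function estimating future rewards. As a rough illustration of that idea, the sketch below computes the standard one-step Q-learning target, r + gamma * max over a' of Q(s', a'), and a squared error against the current estimate. It is not the paper's exact training procedure: the function names, the discount factor `gamma`, and all numbers are assumptions made for illustration, and a plain list of action values stands in for the convolutional network's output.

```python
from typing import Sequence


def q_learning_target(reward: float,
                      next_q_values: Sequence[float],
                      gamma: float = 0.99,
                      terminal: bool = False) -> float:
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    `next_q_values` holds the estimated value of each action in the
    next state (in a deep variant, the network's output for s').
    """
    if terminal:
        # No future reward after the episode ends.
        return reward
    return reward + gamma * max(next_q_values)


def squared_td_error(q_value: float, target: float) -> float:
    """Squared difference between the target and the current estimate."""
    return (target - q_value) ** 2


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
next_q = [0.5, 2.0, 1.0]      # assumed Q(s', a') for three actions
target = q_learning_target(reward=1.0, next_q_values=next_q, gamma=0.9)
loss = squared_td_error(q_value=2.5, target=target)
```

In a function-approximation setting, a loss of this form would be minimized with respect to the network's parameters by gradient descent; the list `next_q` here simply plays the role of the network's per-action outputs.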

Citation impact

Total citations: 5,121

FWCI: —
Percentile: —
References: 29

Authors: 7

Topics & keywords

Keywords

Reinforcement learning
Computer science
Artificial intelligence
Convolutional neural network
Deep learning
Function (biology)
Bellman equation
Value (mathematics)
