Dota 2 with Large Scale Deep Reinforcement Learning

OpenAI; :; Berner, Christopher; Brockman, Greg; Chan, Brooke; Cheung, Vicki; Przemysław, Dębiak,; Dennison, Christy; Farhi, David; Fischer, Quirin; Hashme, Shariq; Chris, Hesse,; Józefowicz, Rafał; Gray, Scott; Olsson, Catherine; Pachocki, Jakub; Petrov, Michael; O., Pinto, Henrique P. d.; Raiman, Jonathan; Salimans, Tim; Schlatter, Jeremy; Schneider, Jonas; Sidor, Szymon; Sutskever, Ilya; Tang, Jie; Wolski, Filip; Zhang, Susan

doi:10.48550/arxiv.1912.06680

preprintarXiv (Cornell University)Dec 13, 2019GREEN OA

Dota 2 with Large Scale Deep Reinforcement Learning

OOpenAI::CBChristopher Berner GBGreg Brockman BCBrooke Chan

Indexed inarxivdatacite

Abstract

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months. By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play…

Citation impact

1,045

total citations

FWCI: —
Percentile: —
References: 37

Citations per year

Authors

27

Topics & keywords

Topics

Keywords

Reinforcement learning
Champion
Task (project management)
Artificial intelligence
Computer science
Scale (ratio)
DOTA
Management

No related works found for this paper.