Unifying count-based exploration and intrinsic motivation

Bellemare, Marc G.; Srinivasan, Sriram; Ostrovski, Georg; Schaul, Tom; Saxton, David; Munos, Rémi

Article · Neural Information Processing Systems · Dec 5, 2016 · Closed access

Google DeepMind (United Kingdom) · Google (United Kingdom)

Abstract

We consider an agent's uncertainty about its environment and the problem of generalizing this uncertainty across states. Specifically, we focus on the problem of exploration in non-tabular reinforcement learning. Drawing inspiration from the intrinsic motivation literature, we use density models to measure uncertainty, and propose a novel algorithm for deriving a pseudo-count from an arbitrary density model. This technique enables us to generalize count-based exploration algorithms to the non-tabular case. We apply our ideas to Atari 2600 games, providing sensible pseudo-counts from raw pixels. We transform these pseudo-counts into exploration bonuses and obtain significantly improved exploration in a number…
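The abstract does not spell out how a pseudo-count is derived, so the following is only a minimal sketch. It assumes the pseudo-count is computed from two quantities supplied by the density model: `rho`, the probability the model assigns to a state before observing it, and `rho_prime`, the probability it assigns to the same state after one additional observation of it. The function names, the bonus form `beta / sqrt(n_hat + eps)`, and the default values of `beta` and `eps` are illustrative assumptions, not details taken from this record.

```python
import math


def pseudo_count(rho: float, rho_prime: float) -> float:
    """Pseudo-count from a density model's probability of a state
    before (rho) and after (rho_prime) one more observation of it.

    Assumed form: n_hat = rho * (1 - rho_prime) / (rho_prime - rho).
    """
    if not (0.0 < rho < rho_prime < 1.0):
        # The model must assign more mass to a state after seeing it again.
        raise ValueError("expected 0 < rho < rho_prime < 1")
    return rho * (1.0 - rho_prime) / (rho_prime - rho)


def exploration_bonus(n_hat: float, beta: float = 0.05, eps: float = 0.01) -> float:
    """Count-based bonus that shrinks as the pseudo-count grows.

    beta and eps are illustrative defaults (assumptions).
    """
    return beta / math.sqrt(n_hat + eps)


# Sanity check with an empirical (tabular) density model, where the
# pseudo-count should recover the true visit count.
visits = 3          # times the state has been seen
total = 4           # total observations so far
rho = visits / total                      # probability before the new visit
rho_prime = (visits + 1) / (total + 1)    # probability after the new visit

n_hat = pseudo_count(rho, rho_prime)
bonus_rare = exploration_bonus(pseudo_count(0.01, 0.02))
bonus_common = exploration_bonus(n_hat)
```

With the empirical model in the sanity check, `n_hat` matches the true visit count up to floating-point error, and the rarely seen state receives the larger bonus. For a learned density model over raw pixels, `rho` and `rho_prime` would come from evaluating the model before and after a single update on the observed frame.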

Citation impact

Total citations: 672
FWCI: 52.37
Percentile: 100%
References: 20

Authors: 6

Topics & keywords

Keywords

Computer science
Focus (optics)
Reinforcement learning
Artificial intelligence
Pixel
Measure (data warehouse)
Data mining
Physics
