Humans use directed and random exploration to solve the explore–exploit dilemma.
Neuroscience Institute · Princeton University
Abstract
All adaptive organisms face the fundamental tradeoff between pursuing a known reward (exploitation) and sampling lesser-known options in search of something better (exploration). Theory suggests at least two strategies for solving this dilemma: a directed strategy in which choices are explicitly biased toward information seeking, and a random strategy in which decision noise leads to exploration by chance. In this work we investigated the extent to which humans use these two strategies. In our "Horizon task," participants made explore-exploit decisions in two contexts that differed in the number of choices that they would make in the future (the time horizon). Participants were allowed to make either a single…
Citation impact
- FWCI
- 30.43
- Percentile
- 100%
- References
- 34
Authors
5Topics & keywords
- Dilemma
- Exploit
- Task (project management)
- Psychology
- Time horizon
- Horizon
- Prisoner's dilemma
- Stochastic game
- Peace, Justice and strong institutions