Apprenticeship learning via inverse reinforcement learning

Abbeel, Pieter; Ng, Andrew Y.

doi:10.1145/1015330.1015430

articleJan 1, 2004Closed access

Apprenticeship learning via inverse reinforcement learning

PAPieter Abbeel AYAndrew Y. Ng

Stanford University

Indexed incrossref

Abstract

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we want to learn to perform. This setting is useful in applications (such as the task of driving) where it may be difficult to write down an explicit reward function specifying exactly how different desiderata should be traded off. We think of the expert as trying to maximize a reward function that is expressible as a linear combination of known features, and give an algorithm for learning the task demonstrated by the expert. Our algorithm is based on using "inverse reinforcement learning" to try to recover the unknown reward function. We…

Citation impact

2,864

total citations

FWCI: 14.80
Percentile: 100%
References: 14

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Markov decision process
Task (project management)
Reinforcement learning
Function (biology)
Artificial intelligence
Machine learning
Process (computing)

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.

Funding

DA
Defense Advanced Research Projects Agency