Bayesian Inverse Reinforcement Learning
University of Illinois Urbana-Champaign · Cornell University
Abstract
Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an expert. IRL is motivated by situations where knowledge of the rewards is a goal by itself (as in preference elicitation) and by the task of apprenticeship learning (learning policies from an expert). In this paper we show how to combine prior knowledge and evidence from the expert's actions to derive a probability distribution over the space of reward functions. We present efficient algorithms that find solutions for the reward learning and apprenticeship learning tasks that generalize well over these distributions. Experimental…
Citation impact
- FWCI
- 32.56
- Percentile
- 100%
- References
- 16
Authors
1- VKVikram KrishnamurthyCorresponding
University of Illinois Urbana-Champaign, Cornell University
Topics & keywords
- Artificial intelligence
- Machine learning
- Markov decision process
- Computer science
- Reinforcement learning
- Task (project management)
- Heuristic
- Bayesian probability
- Peace, Justice and strong institutions