Maximum Entropy Semi-Supervised Inverse Reinforcement Learning
École Normale Supérieure Paris-Saclay · Centre National de la Recherche Scientifique · +1 more institution
Abstract
A popular approach to apprenticeship learning (AL) is to formulate it as an inverse reinforcement learning (IRL) problem. The MaxEnt-IRL algorithm successfully integrates the maximum entropy principle into IRL and unlike its predecessors, it resolves the ambiguity arising from the fact that a possibly large number of policies could match the expert's behavior. In this paper, we study an AL setting in which in addition to the expert's trajectories, a number of unsupervised trajectories is available. We introduce MESSI, a novel algorithm that combines MaxEnt-IRL with principles coming from semi-supervised learning. In particular, MESSI integrates the unsupervised data into the MaxEnt-IRL framework using a…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 12
Authors
4Topics & keywords
- Principle of maximum entropy
- Pairwise comparison
- Computer science
- Ambiguity
- Reinforcement learning
- Artificial intelligence
- Unsupervised learning
- Machine learning