preprintarXiv (Cornell University)Apr 22, 2026GREEN OA

Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

École Normale Supérieure Paris-Saclay · Centre National de la Recherche Scientifique · +1 more institution

Indexed inarxivdatacite

Abstract

A popular approach to apprenticeship learning (AL) is to formulate it as an inverse reinforcement learning (IRL) problem. The MaxEnt-IRL algorithm successfully integrates the maximum entropy principle into IRL and unlike its predecessors, it resolves the ambiguity arising from the fact that a possibly large number of policies could match the expert's behavior. In this paper, we study an AL setting in which in addition to the expert's trajectories, a number of unsupervised trajectories is available. We introduce MESSI, a novel algorithm that combines MaxEnt-IRL with principles coming from semi-supervised learning. In particular, MESSI integrates the unsupervised data into the MaxEnt-IRL framework using a…

Citation impact

20
total citations
FWCI
Percentile
References
12
Citations per year

Authors

4

Topics & keywords

Keywords
  • Principle of maximum entropy
  • Pairwise comparison
  • Computer science
  • Ambiguity
  • Reinforcement learning
  • Artificial intelligence
  • Unsupervised learning
  • Machine learning
No related works found for this paper.