Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

Audiffren, Julien; Vaľko, Michal; Lazaric, Alessandro; Ghavamzadeh, Mohammad

doi:10.48550/arxiv.2604.20074

preprintarXiv (Cornell University)Apr 22, 2026GREEN OA

Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

JAJulien Audiffren MVMichal Vaľko ALAlessandro Lazaric MGMohammad Ghavamzadeh

École Normale Supérieure Paris-Saclay · Centre National de la Recherche Scientifique · +1 more institution

Indexed inarxivdatacite

Abstract

A popular approach to apprenticeship learning (AL) is to formulate it as an inverse reinforcement learning (IRL) problem. The MaxEnt-IRL algorithm successfully integrates the maximum entropy principle into IRL and unlike its predecessors, it resolves the ambiguity arising from the fact that a possibly large number of policies could match the expert's behavior. In this paper, we study an AL setting in which in addition to the expert's trajectories, a number of unsupervised trajectories is available. We introduce MESSI, a novel algorithm that combines MaxEnt-IRL with principles coming from semi-supervised learning. In particular, MESSI integrates the unsupervised data into the MaxEnt-IRL framework using a…

Citation impact

20

total citations

FWCI: —
Percentile: —
References: 12

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Principle of maximum entropy
Pairwise comparison
Computer science
Ambiguity
Reinforcement learning
Artificial intelligence
Unsupervised learning
Machine learning

No related works found for this paper.