Robust Control of Markov Decision Processes with Uncertain Transition Matrices

Nilim, Arnab; Ghaoui, Laurent El

doi:10.1287/opre.1050.0216

articleOperations ResearchOct 1, 2005Closed access

Robust Control of Markov Decision Processes with Uncertain Transition Matrices

ANArnab Nilim LELaurent El Ghaoui

University of California, Berkeley

Indexed incrossref

Abstract

Optimal solutions to Markov decision problems may be very sensitive with respect to the state transition probabilities. In many practical problems, the estimation of these probabilities is far from accurate. Hence, estimation errors are limiting factors in applying Markov decision processes to real-world problems. We consider a robust control problem for a finite-state, finite-action Markov decision process, where uncertainty on the transition matrices is described in terms of possibly nonconvex sets. We show that perfect duality holds for this problem, and that as a consequence, it can be solved with a variant of the classical dynamic programming algorithm, the “robust dynamic programming” algorithm. We show…

Citation impact

695

total citations

FWCI: 32.77
Percentile: 100%
References: 32

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Markov decision process
Mathematical optimization
Recursion (computer science)
Mathematics
Robustness (evolution)
Markov chain
Dynamic programming
Entropy (arrow of time)

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.