The Complexity of Decentralized Control of Markov Decision Processes

Bernstein, Daniel S.; Givan, Robert; Immerman, Neil; Zilberstein, Shlomo

doi:10.1287/moor.27.4.819.297

articleMathematics of Operations ResearchNov 1, 2002Closed access

The Complexity of Decentralized Control of Markov Decision Processes

DSDaniel S. Bernstein RGRobert Givan NINeil Immerman SZShlomo Zilberstein

University of Massachusetts Amherst · Purdue University West Lafayette

Indexed incrossref

Abstract

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore,…

Citation impact

1,180

total citations

FWCI: 10.86
Percentile: 100%
References: 25

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Nondeterministic algorithm
Markov decision process
Mathematics
Mathematical optimization
Partially observable Markov decision process
Observable
Computational complexity theory
Markov chain

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.