Strategy complexity of finite-horizon Markov decision processes and simple stochastic games

Chatterjee K, Ibsen-Jensen R. 2013. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. Mathematical and Engineering Methods in Computer Science. MEMICS: Mathematical and Engineering Methods in Computer Science, LNCS, vol. 7721, 106–117.

Download (ext.)

Conference Paper | Published | English

Scopus indexed

Corresponding author has ISTA affiliation

Department
Series Title
LNCS
Abstract
Markov decision processes (MDPs) and simple stochastic games (SSGs) provide a rich mathematical framework to study many important problems related to probabilistic systems. MDPs and SSGs with finite-horizon objectives, where the goal is to maximize the probability to reach a target state in a given finite time, is a classical and well-studied problem. In this work we consider the strategy complexity of finite-horizon MDPs and SSGs. We show that for all ε > 0, the natural class of counter-based strategies require at most log log/(1/ε) + n + 1 memory states, and memory of size Omega(log log(1/ε) + n) is required, for ε-optimality, where n is the number of states of the MDP (resp. SSG). Thus our bounds are asymptotically optimal. We then study the periodic property of optimal strategies, and show a sub-exponential lower bound on the period for optimal strategies.
Publishing Year
Date Published
2013-01-17
Proceedings Title
Mathematical and Engineering Methods in Computer Science
Publisher
Springer Nature
Acknowledgement
Work of the second author supported by the Sino-Danish Center for the Theory of Interactive Computation, funded by the Danish National Research Foundation and the National Science Foundation of China (under the grant 61061130540). The second author acknowledge support from the Center for research in the Foundations of Electronic Markets (CFEM), supported by the Danish Strategic Research Council. The first author was supported by FWF Grant No P 23499-N23, FWF NFN Grant No S11407-N23 (RiSE), ERC Start grant (279307: Graph Games), and Microsoft faculty fellows award.
Volume
7721
Page
106-117
Conference
MEMICS: Mathematical and Engineering Methods in Computer Science
Conference Location
Znojmo, Czech Republic
Conference Date
2012-10-25 – 2012-10-28
ISSN
eISSN
IST-REx-ID

Cite this

Chatterjee K, Ibsen-Jensen R. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. In: Mathematical and Engineering Methods in Computer Science. Vol 7721. Springer Nature; 2013:106-117. doi:10.1007/978-3-642-36046-6_11
Chatterjee, K., & Ibsen-Jensen, R. (2013). Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. In Mathematical and Engineering Methods in Computer Science (Vol. 7721, pp. 106–117). Znojmo, Czech Republic: Springer Nature. https://doi.org/10.1007/978-3-642-36046-6_11
Chatterjee, Krishnendu, and Rasmus Ibsen-Jensen. “Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games.” In Mathematical and Engineering Methods in Computer Science, 7721:106–17. Springer Nature, 2013. https://doi.org/10.1007/978-3-642-36046-6_11.
K. Chatterjee and R. Ibsen-Jensen, “Strategy complexity of finite-horizon Markov decision processes and simple stochastic games,” in Mathematical and Engineering Methods in Computer Science, Znojmo, Czech Republic, 2013, vol. 7721, pp. 106–117.
Chatterjee K, Ibsen-Jensen R. 2013. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. Mathematical and Engineering Methods in Computer Science. MEMICS: Mathematical and Engineering Methods in Computer Science, LNCS, vol. 7721, 106–117.
Chatterjee, Krishnendu, and Rasmus Ibsen-Jensen. “Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games.” Mathematical and Engineering Methods in Computer Science, vol. 7721, Springer Nature, 2013, pp. 106–17, doi:10.1007/978-3-642-36046-6_11.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
OA Open Access

Export

Marked Publications

Open Data ISTA Research Explorer

Sources

arXiv 1209.3617

Search this title in

Google Scholar
ISBN Search