Strategy complexity of finite-horizon Markov decision processes and simple stochastic games
Chatterjee K, Ibsen-Jensen R. 2013. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. Mathematical and Engineering Methods in Computer Science. MEMICS: Mathematical and Engineering Methods in Computer Science, LNCS, vol. 7721, 106–117.
Download (ext.)
Conference Paper
| Published
| English
Scopus indexed
Corresponding author has ISTA affiliation
Department
Grant
Series Title
LNCS
Abstract
Markov decision processes (MDPs) and simple stochastic games (SSGs) provide a rich mathematical framework to study many important problems related to probabilistic systems. MDPs and SSGs with finite-horizon objectives, where the goal is to maximize the probability to reach a target state in a given finite time, is a classical and well-studied problem. In this work we consider the strategy complexity of finite-horizon MDPs and SSGs. We show that for all ε > 0, the natural class of counter-based strategies require at most log log/(1/ε) + n + 1 memory states, and memory of size Omega(log log(1/ε) + n) is required, for ε-optimality, where n is the number of states of the MDP (resp. SSG). Thus our bounds are asymptotically optimal. We then study the periodic property of optimal strategies, and show a sub-exponential lower bound on the period for optimal strategies.
Publishing Year
Date Published
2013-01-17
Proceedings Title
Mathematical and Engineering Methods in Computer Science
Publisher
Springer Nature
Acknowledgement
Work of the second author supported by the Sino-Danish Center for the Theory of Interactive Computation, funded by the Danish National Research Foundation and the National Science Foundation of China (under the grant 61061130540). The second author acknowledge support from the Center for research in the Foundations of Electronic Markets (CFEM), supported by the Danish Strategic Research Council. The first author was supported by FWF Grant No P 23499-N23, FWF NFN Grant No S11407-N23 (RiSE), ERC Start grant (279307: Graph Games), and Microsoft faculty fellows award.
Volume
7721
Page
106-117
Conference
MEMICS: Mathematical and Engineering Methods in Computer Science
Conference Location
Znojmo, Czech Republic
Conference Date
2012-10-25 – 2012-10-28
ISBN
ISSN
eISSN
IST-REx-ID
Cite this
Chatterjee K, Ibsen-Jensen R. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. In: Mathematical and Engineering Methods in Computer Science. Vol 7721. Springer Nature; 2013:106-117. doi:10.1007/978-3-642-36046-6_11
Chatterjee, K., & Ibsen-Jensen, R. (2013). Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. In Mathematical and Engineering Methods in Computer Science (Vol. 7721, pp. 106–117). Znojmo, Czech Republic: Springer Nature. https://doi.org/10.1007/978-3-642-36046-6_11
Chatterjee, Krishnendu, and Rasmus Ibsen-Jensen. “Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games.” In Mathematical and Engineering Methods in Computer Science, 7721:106–17. Springer Nature, 2013. https://doi.org/10.1007/978-3-642-36046-6_11.
K. Chatterjee and R. Ibsen-Jensen, “Strategy complexity of finite-horizon Markov decision processes and simple stochastic games,” in Mathematical and Engineering Methods in Computer Science, Znojmo, Czech Republic, 2013, vol. 7721, pp. 106–117.
Chatterjee K, Ibsen-Jensen R. 2013. Strategy complexity of finite-horizon Markov decision processes and simple stochastic games. Mathematical and Engineering Methods in Computer Science. MEMICS: Mathematical and Engineering Methods in Computer Science, LNCS, vol. 7721, 106–117.
Chatterjee, Krishnendu, and Rasmus Ibsen-Jensen. “Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games.” Mathematical and Engineering Methods in Computer Science, vol. 7721, Springer Nature, 2013, pp. 106–17, doi:10.1007/978-3-642-36046-6_11.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Open Access
Export
Marked PublicationsOpen Data ISTA Research Explorer
Sources
arXiv 1209.3617
