Value iteration with guessing for Markov chains and Markov decision processes
Chatterjee K, Jafariraviz M, Saona Urmeneta RJ, Svoboda J. 2025. Value iteration with guessing for Markov chains and Markov decision processes. 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems. TACAS: Tools and Algorithms for the Construction and Analysis of Systems, LNCS, vol. 15697, 217–236.
Download
Conference Paper
| Published
| English
Scopus indexed
Author
Corresponding author has ISTA affiliation
Department
Series Title
LNCS
Abstract
Two standard models for probabilistic systems are Markov chains (MCs) and Markov decision processes (MDPs). Classic objectives for such probabilistic models for control and planning problems are reachability and stochastic shortest path. The widely studied algorithmic approach for these problems is the Value Iteration (VI) algorithm which iteratively applies local updates called Bellman updates. There are many practical approaches for VI in the literature but they all require exponentially many Bellman updates for MCs in the worst case. A preprocessing step is an algorithm that is discrete, graph-theoretical, and requires linear space. An important open question is whether, after a polynomial-time preprocessing, VI can be achieved with sub-exponentially many Bellman updates. In this work, we present a new approach for VI based on guessing values. Our theoretical contributions are twofold. First, for MCs, we present an almost-linear-time preprocessing algorithm after which, along with guessing values, VI requires only subexponentially many Bellman updates. Second, we present an improved analysis of the speed of convergence of VI for MDPs. Finally, we present a practical algorithm for MDPs based on our new approach. Experimental results show that our approach provides a considerable improvement over existing VI-based approaches on several benchmark examples from the literature.
Publishing Year
Date Published
2025-05-01
Proceedings Title
31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems
Publisher
Springer Nature
Acknowledgement
This research was partially supported by the ERC CoG 863818 (ForM-SMArt) grant and Austrian Science Fund (FWF) 10.55776/COE12 grant.
Volume
15697
Page
217-236
Conference
TACAS: Tools and Algorithms for the Construction and Analysis of Systems
Conference Location
Hamilton, ON, Canada
Conference Date
2025-05-03 – 2025-05-08
ISBN
ISSN
eISSN
IST-REx-ID
Cite this
Chatterjee K, Jafariraviz M, Saona Urmeneta RJ, Svoboda J. Value iteration with guessing for Markov chains and Markov decision processes. In: 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems. Vol 15697. Springer Nature; 2025:217-236. doi:10.1007/978-3-031-90653-4_11
Chatterjee, K., Jafariraviz, M., Saona Urmeneta, R. J., & Svoboda, J. (2025). Value iteration with guessing for Markov chains and Markov decision processes. In 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems (Vol. 15697, pp. 217–236). Hamilton, ON, Canada: Springer Nature. https://doi.org/10.1007/978-3-031-90653-4_11
Chatterjee, Krishnendu, Mahdi Jafariraviz, Raimundo J Saona Urmeneta, and Jakub Svoboda. “Value Iteration with Guessing for Markov Chains and Markov Decision Processes.” In 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, 15697:217–36. Springer Nature, 2025. https://doi.org/10.1007/978-3-031-90653-4_11.
K. Chatterjee, M. Jafariraviz, R. J. Saona Urmeneta, and J. Svoboda, “Value iteration with guessing for Markov chains and Markov decision processes,” in 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, Hamilton, ON, Canada, 2025, vol. 15697, pp. 217–236.
Chatterjee K, Jafariraviz M, Saona Urmeneta RJ, Svoboda J. 2025. Value iteration with guessing for Markov chains and Markov decision processes. 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems. TACAS: Tools and Algorithms for the Construction and Analysis of Systems, LNCS, vol. 15697, 217–236.
Chatterjee, Krishnendu, et al. “Value Iteration with Guessing for Markov Chains and Markov Decision Processes.” 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, vol. 15697, Springer Nature, 2025, pp. 217–36, doi:10.1007/978-3-031-90653-4_11.
All files available under the following license(s):
Creative Commons Attribution 4.0 International Public License (CC-BY 4.0):
Main File(s)
File Name
2025_TACAS_Chatterjee.pdf
557.48 KB
Access Level

Date Uploaded
2025-06-02
MD5 Checksum
45da6efbcbed20aada16c48c8e55e2d6
Export
Marked PublicationsOpen Data ISTA Research Explorer
Sources
arXiv 2505.06769