Value iteration with guessing for Markov chains and Markov decision processes

Chatterjee, Krishnendu; Jafariraviz, Mahdi; Saona Urmeneta, Raimundo J; Svoboda, Jakub

Value iteration with guessing for Markov chains and Markov decision processes

Chatterjee K, Jafariraviz M, Saona Urmeneta RJ, Svoboda J. 2025. Value iteration with guessing for Markov chains and Markov decision processes. 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems. TACAS: Tools and Algorithms for the Construction and Analysis of Systems, LNCS, vol. 15697, 217–236.

Download

2025_TACAS_Chatterjee.pdf 557.48 KB [Published Version]

DOI

10.1007/978-3-031-90653-4_11

Conference Paper | Published | English

Scopus indexed

Author

Chatterjee, Krishnendu^ISTA ; Jafariraviz, Mahdi; Saona, Raimundo^ISTA ; Svoboda, Jakub^ISTA

Corresponding author has ISTA affiliation

Department

Chatterjee Group

Grant

Formal Methods for Stochastic Models: Algorithms and Applications

Series Title

LNCS

Abstract

Two standard models for probabilistic systems are Markov chains (MCs) and Markov decision processes (MDPs). Classic objectives for such probabilistic models for control and planning problems are reachability and stochastic shortest path. The widely studied algorithmic approach for these problems is the Value Iteration (VI) algorithm which iteratively applies local updates called Bellman updates. There are many practical approaches for VI in the literature but they all require exponentially many Bellman updates for MCs in the worst case. A preprocessing step is an algorithm that is discrete, graph-theoretical, and requires linear space. An important open question is whether, after a polynomial-time preprocessing, VI can be achieved with sub-exponentially many Bellman updates. In this work, we present a new approach for VI based on guessing values. Our theoretical contributions are twofold. First, for MCs, we present an almost-linear-time preprocessing algorithm after which, along with guessing values, VI requires only subexponentially many Bellman updates. Second, we present an improved analysis of the speed of convergence of VI for MDPs. Finally, we present a practical algorithm for MDPs based on our new approach. Experimental results show that our approach provides a considerable improvement over existing VI-based approaches on several benchmark examples from the literature.

Publishing Year

2025

Date Published

2025-05-01

Proceedings Title

31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems

Publisher

Springer Nature

Acknowledgement

This research was partially supported by the ERC CoG 863818 (ForM-SMArt) grant and Austrian Science Fund (FWF) 10.55776/COE12 grant.

Volume

15697

Page

217-236

Conference

TACAS: Tools and Algorithms for the Construction and Analysis of Systems

Conference Location

Hamilton, ON, Canada

Conference Date

2025-05-03 – 2025-05-08

ISBN

9783031906527

ISSN

0302-9743

eISSN

1611-3349

IST-REx-ID

19740

Cite this

Chatterjee K, Jafariraviz M, Saona Urmeneta RJ, Svoboda J. Value iteration with guessing for Markov chains and Markov decision processes. In: 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems. Vol 15697. Springer Nature; 2025:217-236. doi:10.1007/978-3-031-90653-4_11

Chatterjee, K., Jafariraviz, M., Saona Urmeneta, R. J., & Svoboda, J. (2025). Value iteration with guessing for Markov chains and Markov decision processes. In 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems (Vol. 15697, pp. 217–236). Hamilton, ON, Canada: Springer Nature. https://doi.org/10.1007/978-3-031-90653-4_11

Chatterjee, Krishnendu, Mahdi Jafariraviz, Raimundo J Saona Urmeneta, and Jakub Svoboda. “Value Iteration with Guessing for Markov Chains and Markov Decision Processes.” In 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, 15697:217–36. Springer Nature, 2025. https://doi.org/10.1007/978-3-031-90653-4_11.

K. Chatterjee, M. Jafariraviz, R. J. Saona Urmeneta, and J. Svoboda, “Value iteration with guessing for Markov chains and Markov decision processes,” in 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, Hamilton, ON, Canada, 2025, vol. 15697, pp. 217–236.

Chatterjee, Krishnendu, et al. “Value Iteration with Guessing for Markov Chains and Markov Decision Processes.” 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems, vol. 15697, Springer Nature, 2025, pp. 217–36, doi:10.1007/978-3-031-90653-4_11.

All files available under the following license(s):

Creative Commons Attribution 4.0 International Public License (CC-BY 4.0):