Solving robust Markov decision processes: Generic, reliable, efficient

Meggendorfer T, Weininger M, Wienhöft P. 2025. Solving robust Markov decision processes: Generic, reliable, efficient. Proceedings of the AAAI Conference on Artificial Intelligence. AAAI: Conference on Artificial Intelligence vol. 39, 26631–26641.

Download
No fulltext has been uploaded. References only!

Conference Paper | Published | English

Scopus indexed
Department
Abstract
Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In *robust* MDP (RMDP), every action is associated with an *uncertainty set* of probability distributions, modelling that transition probabilities are not known precisely. Based on the known theoretical connection to stochastic games, we provide a framework for solving RMDPs that is generic, reliable, and efficient. It is *generic* both with respect to the model, allowing for a wide range of uncertainty sets, including but not limited to intervals, L1- or L2-balls, and polytopes; and with respect to the objective, including long-run average reward, undiscounted total reward, and stochastic shortest path. It is *reliable*, as our approach not only converges in the limit, but provides precision guarantees at any time during the computation. It is *efficient* because -- in contrast to state-of-the-art approaches -- it avoids explicitly constructing the underlying stochastic game. Consequently, our prototype implementation outperforms existing tools by several orders of magnitude and can solve RMDPs with a million states in under a minute.
Publishing Year
Date Published
2025-04-11
Proceedings Title
Proceedings of the AAAI Conference on Artificial Intelligence
Publisher
Association for the Advancement of Artificial Intelligence
Acknowledgement
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No. 101034413, the ERC CoG 863818 (ForM-SMArt), and the DFG through the Cluster of Excellence EXC 2050/1 (CeTI, project ID 390696704, as part of Germany’s Excellence Strategy) and the TRR 248 (see https://perspicuous-computing.science, project ID 389792660).
Volume
39
Issue
25
Page
26631-26641
Conference
AAAI: Conference on Artificial Intelligence
Conference Location
Philadelphia, PA, United States
Conference Date
2025-02-25 – 2025-03-04
ISSN
eISSN
IST-REx-ID

Cite this

Meggendorfer T, Weininger M, Wienhöft P. Solving robust Markov decision processes: Generic, reliable, efficient. In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol 39. Association for the Advancement of Artificial Intelligence; 2025:26631-26641. doi:10.1609/aaai.v39i25.34865
Meggendorfer, T., Weininger, M., & Wienhöft, P. (2025). Solving robust Markov decision processes: Generic, reliable, efficient. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, pp. 26631–26641). Philadelphia, PA, United States: Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v39i25.34865
Meggendorfer, Tobias, Maximilian Weininger, and Patrick Wienhöft. “Solving Robust Markov Decision Processes: Generic, Reliable, Efficient.” In Proceedings of the AAAI Conference on Artificial Intelligence, 39:26631–41. Association for the Advancement of Artificial Intelligence, 2025. https://doi.org/10.1609/aaai.v39i25.34865.
T. Meggendorfer, M. Weininger, and P. Wienhöft, “Solving robust Markov decision processes: Generic, reliable, efficient,” in Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, United States, 2025, vol. 39, no. 25, pp. 26631–26641.
Meggendorfer T, Weininger M, Wienhöft P. 2025. Solving robust Markov decision processes: Generic, reliable, efficient. Proceedings of the AAAI Conference on Artificial Intelligence. AAAI: Conference on Artificial Intelligence vol. 39, 26631–26641.
Meggendorfer, Tobias, et al. “Solving Robust Markov Decision Processes: Generic, Reliable, Efficient.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 25, Association for the Advancement of Artificial Intelligence, 2025, pp. 26631–41, doi:10.1609/aaai.v39i25.34865.

Export

Marked Publications

Open Data ISTA Research Explorer

Sources

arXiv 2412.10185

Search this title in

Google Scholar