---
OA_place: publisher
OA_type: green
_id: '18974'
abstract:
- lang: eng
  text: Reinforcement Learning (RL) from temporal logical specifications is a fundamental
    problem in sequential decision making. One of the basic and core such specification
    is the reachability specification that requires a target set to be eventually
    visited. Despite strong empirical results for RL from such specifications, the
    theoretical guarantees are bleak, including the impossibility of Probably Approximately
    Correct (PAC) guarantee for reachability specifications. Given the impossibility
    result, in this work we consider the problem of RL from reachability specifications
    along with the information of expected conditional distance (ECD). We present
    (a) lower bound results which establish the necessity of ECD information for PAC
    guarantees and (b) an algorithm that establishes PAC-guarantees given the ECD
    information. To the best of our knowledge, this is the first RL from reachability
    specifications that does not make any assumptions on the underlying environment
    to learn policies.
alternative_title:
- PMLR
article_processing_charge: No
author:
- first_name: Jakub
  full_name: Svoboda, Jakub
  id: 130759D2-D7DD-11E9-87D2-DE0DE6697425
  last_name: Svoboda
  orcid: 0000-0002-1419-3267
- first_name: Suguman
  full_name: Bansal, Suguman
  last_name: Bansal
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
citation:
  ama: 'Svoboda J, Bansal S, Chatterjee K. Reinforcement learning from reachability
    specifications: PAC guarantees with expected conditional distance. In: <i>41st
    International Conference on Machine Learning</i>. Vol 235. ML Research Press;
    2024:47331-47344.'
  apa: 'Svoboda, J., Bansal, S., &#38; Chatterjee, K. (2024). Reinforcement learning
    from reachability specifications: PAC guarantees with expected conditional distance.
    In <i>41st International Conference on Machine Learning</i> (Vol. 235, pp. 47331–47344).
    Vienna, Austria: ML Research Press.'
  chicago: 'Svoboda, Jakub, Suguman Bansal, and Krishnendu Chatterjee. “Reinforcement
    Learning from Reachability Specifications: PAC Guarantees with Expected Conditional
    Distance.” In <i>41st International Conference on Machine Learning</i>, 235:47331–44.
    ML Research Press, 2024.'
  ieee: 'J. Svoboda, S. Bansal, and K. Chatterjee, “Reinforcement learning from reachability
    specifications: PAC guarantees with expected conditional distance,” in <i>41st
    International Conference on Machine Learning</i>, Vienna, Austria, 2024, vol.
    235, pp. 47331–47344.'
  ista: 'Svoboda J, Bansal S, Chatterjee K. 2024. Reinforcement learning from reachability
    specifications: PAC guarantees with expected conditional distance. 41st International
    Conference on Machine Learning. ICML: International Conference on Machine Learning,
    PMLR, vol. 235, 47331–47344.'
  mla: 'Svoboda, Jakub, et al. “Reinforcement Learning from Reachability Specifications:
    PAC Guarantees with Expected Conditional Distance.” <i>41st International Conference
    on Machine Learning</i>, vol. 235, ML Research Press, 2024, pp. 47331–44.'
  short: J. Svoboda, S. Bansal, K. Chatterjee, in:, 41st International Conference
    on Machine Learning, ML Research Press, 2024, pp. 47331–47344.
conference:
  end_date: 2024-07-27
  location: Vienna, Austria
  name: 'ICML: International Conference on Machine Learning'
  start_date: 2024-07-21
corr_author: '1'
date_created: 2025-01-30T07:45:22Z
date_published: 2024-07-29T00:00:00Z
date_updated: 2025-01-30T07:46:16Z
day: '29'
department:
- _id: KrCh
intvolume: '       235'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://openreview.net/forum?id=mXUDDL4r1Q
month: '07'
oa: 1
oa_version: Preprint
page: 47331-47344
publication: 41st International Conference on Machine Learning
publication_status: published
publisher: ML Research Press
quality_controlled: '1'
scopus_import: '1'
status: public
title: 'Reinforcement learning from reachability specifications: PAC guarantees with
  expected conditional distance'
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 235
year: '2024'
...
---
OA_place: publisher
OA_type: hybrid
_id: '17283'
abstract:
- lang: eng
  text: 'We consider the problems of statically refuting equivalence and similarity
    of output distributions defined by a pair of probabilistic programs. Equivalence
    and similarity are two fundamental relational properties of probabilistic programs
    that are essential for their correctness both in implementation and in compilation.
    In this work, we present a new method for static equivalence and similarity refutation.
    Our method refutes equivalence and similarity by computing a function over program
    outputs whose expected value with respect to the output distributions of two programs
    is different. The function is computed simultaneously with an upper expectation
    supermartingale and a lower expectation submartingale for the two programs, which
    we show to together provide a formal certificate for refuting equivalence and
    similarity. To the best of our knowledge, our method is the first approach to
    relational program analysis to offer the combination of the following desirable
    features: (1) it is fully automated, (2) it is applicable to infinite-state probabilistic
    programs, and (3) it provides formal guarantees on the correctness of its results.
    We implement a prototype of our method and our experiments demonstrate the effectiveness
    of our method to refute equivalence and similarity for a number of examples collected
    from the literature.'
acknowledgement: "This research was partially supported by the ERC CoG 863818 (ForM-SMArt)
  grant. Petr Novotný\r\nis supported by the Czech Science Foundation grant no. GA23-06963S.\r\n"
article_number: '232'
article_processing_charge: Yes (via OA deal)
article_type: original
arxiv: 1
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Ehsan
  full_name: Kafshdar Goharshadi, Ehsan
  id: 103b4fa0-896a-11ed-bdf8-87b697bef40d
  last_name: Kafshdar Goharshadi
  orcid: 0000-0002-8595-0587
- first_name: Petr
  full_name: Novotný, Petr
  id: 3CC3B868-F248-11E8-B48F-1D18A9856A87
  last_name: Novotný
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: Chatterjee K, Goharshady E, Novotný P, Zikelic D. Equivalence and similarity
    refutation for probabilistic programs. <i>Proceedings of the ACM on Programming
    Languages</i>. 2024;8. doi:<a href="https://doi.org/10.1145/3656462">10.1145/3656462</a>
  apa: Chatterjee, K., Goharshady, E., Novotný, P., &#38; Zikelic, D. (2024). Equivalence
    and similarity refutation for probabilistic programs. <i>Proceedings of the ACM
    on Programming Languages</i>. Association for Computing Machinery. <a href="https://doi.org/10.1145/3656462">https://doi.org/10.1145/3656462</a>
  chicago: Chatterjee, Krishnendu, Ehsan Goharshady, Petr Novotný, and Dorde Zikelic.
    “Equivalence and Similarity Refutation for Probabilistic Programs.” <i>Proceedings
    of the ACM on Programming Languages</i>. Association for Computing Machinery,
    2024. <a href="https://doi.org/10.1145/3656462">https://doi.org/10.1145/3656462</a>.
  ieee: K. Chatterjee, E. Goharshady, P. Novotný, and D. Zikelic, “Equivalence and
    similarity refutation for probabilistic programs,” <i>Proceedings of the ACM on
    Programming Languages</i>, vol. 8. Association for Computing Machinery, 2024.
  ista: Chatterjee K, Goharshady E, Novotný P, Zikelic D. 2024. Equivalence and similarity
    refutation for probabilistic programs. Proceedings of the ACM on Programming Languages.
    8, 232.
  mla: Chatterjee, Krishnendu, et al. “Equivalence and Similarity Refutation for Probabilistic
    Programs.” <i>Proceedings of the ACM on Programming Languages</i>, vol. 8, 232,
    Association for Computing Machinery, 2024, doi:<a href="https://doi.org/10.1145/3656462">10.1145/3656462</a>.
  short: K. Chatterjee, E. Goharshady, P. Novotný, D. Zikelic, Proceedings of the
    ACM on Programming Languages 8 (2024).
corr_author: '1'
date_created: 2024-07-21T22:01:01Z
date_published: 2024-06-20T00:00:00Z
date_updated: 2025-04-14T07:52:47Z
day: '20'
ddc:
- '000'
department:
- _id: KrCh
- _id: GradSch
doi: 10.1145/3656462
ec_funded: 1
external_id:
  arxiv:
  - '2404.03430'
file:
- access_level: open_access
  checksum: 8cbf220f284a4a87d093db5320c5afdd
  content_type: application/pdf
  creator: dernst
  date_created: 2024-07-22T07:17:14Z
  date_updated: 2024-07-22T07:17:14Z
  file_id: '17290'
  file_name: 2024_ACMProgLang_Chatterjee.pdf
  file_size: 355421
  relation: main_file
  success: 1
file_date_updated: 2024-07-22T07:17:14Z
has_accepted_license: '1'
intvolume: '         8'
language:
- iso: eng
month: '06'
oa: 1
oa_version: Published Version
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Proceedings of the ACM on Programming Languages
publication_identifier:
  eissn:
  - 2475-1421
publication_status: published
publisher: Association for Computing Machinery
quality_controlled: '1'
scopus_import: '1'
status: public
title: Equivalence and similarity refutation for probabilistic programs
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 8
year: '2024'
...
---
_id: '17328'
abstract:
- lang: eng
  text: "We study selfish mining attacks in longest-chain blockchains like Bitcoin,
    but where the proof of work is replaced with efficient proof systems - like proofs
    of stake or proofs of space - and consider the problem of computing an optimal
    selfish mining attack which maximizes expected relative revenue of the adversary,
    thus minimizing the chain quality. To this end, we propose a novel selfish mining
    attack that aims to maximize this objective and formally model the attack as a
    Markov decision process (MDP). We then present a formal analysis procedure which
    computes an ϵ-tight lower bound on the optimal expected relative revenue in the
    MDP and a strategy that achieves this ϵ-tight lower bound, where ϵ > 0 may be
    any specified precision. Our analysis is fully automated and provides formal guarantees
    on the correctness. We evaluate our selfish mining attack and observe that it
    achieves superior expected relative revenue compared to two considered baselines.\r\nIn
    concurrent work [Sarenche FC'24] does an automated analysis on selfish mining
    in predictable longest-chain blockchains based on efficient proof systems. Predictable
    means the randomness for the challenges is fixed for many blocks (as used e.g.,
    in Ouroboros), while we consider unpredictable (Bitcoin-like) chains where the
    challenge is derived from the previous block."
acknowledgement: "This work was supported in part by the ERC-2020-CoG 863818 (FoRM-SMArt)
  grant and the MOE-T2EP20122-0014 (Data-Driven Distributed Algorithms) grant.\r\n"
article_processing_charge: Yes (via OA deal)
arxiv: 1
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Amirali
  full_name: Ebrahimzadeh, Amirali
  last_name: Ebrahimzadeh
- first_name: Mehrdad
  full_name: Karrabi, Mehrdad
  id: 67638922-f394-11eb-9cf6-f20423e08757
  last_name: Karrabi
- first_name: Krzysztof Z
  full_name: Pietrzak, Krzysztof Z
  id: 3E04A7AA-F248-11E8-B48F-1D18A9856A87
  last_name: Pietrzak
  orcid: 0000-0002-9139-1654
- first_name: Michelle X
  full_name: Yeo, Michelle X
  id: 2D82B818-F248-11E8-B48F-1D18A9856A87
  last_name: Yeo
  orcid: 0009-0001-3676-4809
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: 'Chatterjee K, Ebrahimzadeh A, Karrabi M, Pietrzak KZ, Yeo MX, Zikelic D. Fully
    automated selfish mining analysis in efficient proof systems blockchains. In:
    <i> Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed
    Computing</i>. Association for Computing Machinery; 2024:268-278. doi:<a href="https://doi.org/10.1145/3662158.3662769">10.1145/3662158.3662769</a>'
  apa: 'Chatterjee, K., Ebrahimzadeh, A., Karrabi, M., Pietrzak, K. Z., Yeo, M. X.,
    &#38; Zikelic, D. (2024). Fully automated selfish mining analysis in efficient
    proof systems blockchains. In <i> Proceedings of the 43rd Annual ACM Symposium
    on Principles of Distributed Computing</i> (pp. 268–278). Nantes, France: Association
    for Computing Machinery. <a href="https://doi.org/10.1145/3662158.3662769">https://doi.org/10.1145/3662158.3662769</a>'
  chicago: Chatterjee, Krishnendu, Amirali Ebrahimzadeh, Mehrdad Karrabi, Krzysztof
    Z Pietrzak, Michelle X Yeo, and Dorde Zikelic. “Fully Automated Selfish Mining
    Analysis in Efficient Proof Systems Blockchains.” In <i> Proceedings of the 43rd
    Annual ACM Symposium on Principles of Distributed Computing</i>, 268–78. Association
    for Computing Machinery, 2024. <a href="https://doi.org/10.1145/3662158.3662769">https://doi.org/10.1145/3662158.3662769</a>.
  ieee: K. Chatterjee, A. Ebrahimzadeh, M. Karrabi, K. Z. Pietrzak, M. X. Yeo, and
    D. Zikelic, “Fully automated selfish mining analysis in efficient proof systems
    blockchains,” in <i> Proceedings of the 43rd Annual ACM Symposium on Principles
    of Distributed Computing</i>, Nantes, France, 2024, pp. 268–278.
  ista: 'Chatterjee K, Ebrahimzadeh A, Karrabi M, Pietrzak KZ, Yeo MX, Zikelic D.
    2024. Fully automated selfish mining analysis in efficient proof systems blockchains.  Proceedings
    of the 43rd Annual ACM Symposium on Principles of Distributed Computing. PODC:
    Symposium on Principles of Distributed Computing, 268–278.'
  mla: Chatterjee, Krishnendu, et al. “Fully Automated Selfish Mining Analysis in
    Efficient Proof Systems Blockchains.” <i> Proceedings of the 43rd Annual ACM Symposium
    on Principles of Distributed Computing</i>, Association for Computing Machinery,
    2024, pp. 268–78, doi:<a href="https://doi.org/10.1145/3662158.3662769">10.1145/3662158.3662769</a>.
  short: K. Chatterjee, A. Ebrahimzadeh, M. Karrabi, K.Z. Pietrzak, M.X. Yeo, D. Zikelic,
    in:,  Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed
    Computing, Association for Computing Machinery, 2024, pp. 268–278.
conference:
  end_date: 2024-06-21
  location: Nantes, France
  name: 'PODC: Symposium on Principles of Distributed Computing'
  start_date: 2024-06-17
corr_author: '1'
date_created: 2024-07-28T22:01:10Z
date_published: 2024-06-17T00:00:00Z
date_updated: 2025-04-14T07:52:47Z
day: '17'
ddc:
- '000'
department:
- _id: KrCh
- _id: KrPi
doi: 10.1145/3662158.3662769
ec_funded: 1
external_id:
  arxiv:
  - '2405.04420'
file:
- access_level: open_access
  checksum: 6122bd97b42751ff81c452a19970f67d
  content_type: application/pdf
  creator: dernst
  date_created: 2024-07-29T07:18:12Z
  date_updated: 2024-07-29T07:18:12Z
  file_id: '17334'
  file_name: 2024_ACM_Chatterjee.pdf
  file_size: 832034
  relation: main_file
  success: 1
file_date_updated: 2024-07-29T07:18:12Z
has_accepted_license: '1'
language:
- iso: eng
month: '06'
oa: 1
oa_version: Published Version
page: 268-278
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: ' Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed
  Computing'
publication_identifier:
  isbn:
  - '9798400706684'
publication_status: published
publisher: Association for Computing Machinery
quality_controlled: '1'
scopus_import: '1'
status: public
title: Fully automated selfish mining analysis in efficient proof systems blockchains
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2024'
...
---
_id: '17329'
abstract:
- lang: eng
  text: 'We initiate the study of game dynamics in the population protocol model:
    n agents each maintain a current local strategy and interact in pairs uniformly
    at random. Upon each interaction, the agents play a two-person game and receive
    a payoff from an underlying utility function, and they can subsequently update
    their strategies according to a fixed local algorithm. In this setting, we ask
    how the distribution over agent strategies evolves over a sequence of interactions,
    and we introduce a new distributional equilibrium concept to quantify the quality
    of such distributions. As an initial example, we study a class of repeated prisoner''s
    dilemma games, and we consider a family of simple local update algorithms that
    yield non-trivial dynamics over the distribution of agent strategies. We show
    that these dynamics are related to a new class of high-dimensional Ehrenfest random
    walks, and we derive exact characterizations of their stationary distributions,
    bounds on their mixing times, and prove their convergence to approximate distributional
    equilibria. Our results highlight trade-offs between the local state space of
    each agent, and the convergence rate and approximation factor of the underlying
    dynamics. Our approach opens the door towards the further characterization of
    equilibrium computation for other classes of games and dynamics in the population
    setting.'
acknowledgement: This work was supported in part by the ERC-2020-CoG 863818 (FoRM-SMArt)
  grant. We thank James Aspnes and Thomas Sauerwald for several helpful discussions
  on Ehrenfest random walks.
article_processing_charge: Yes (via OA deal)
author:
- first_name: Dan-Adrian
  full_name: Alistarh, Dan-Adrian
  id: 4A899BFC-F248-11E8-B48F-1D18A9856A87
  last_name: Alistarh
  orcid: 0000-0003-3650-940X
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Mehrdad
  full_name: Karrabi, Mehrdad
  id: 67638922-f394-11eb-9cf6-f20423e08757
  last_name: Karrabi
- first_name: John M
  full_name: Lazarsfeld, John M
  id: 17ce3656-183e-11ef-84c3-8932383e1b23
  last_name: Lazarsfeld
citation:
  ama: 'Alistarh D-A, Chatterjee K, Karrabi M, Lazarsfeld JM. Game dynamics and equilibrium
    computation in the population protocol model. In: <i>Proceedings of the 43rd Annual
    ACM Symposium on Principles of Distributed Computing</i>. Association for Computing
    Machinery; 2024:40-49. doi:<a href="https://doi.org/10.1145/3662158.3662768">10.1145/3662158.3662768</a>'
  apa: 'Alistarh, D.-A., Chatterjee, K., Karrabi, M., &#38; Lazarsfeld, J. M. (2024).
    Game dynamics and equilibrium computation in the population protocol model. In
    <i>Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed Computing</i>
    (pp. 40–49). Nantes, France: Association for Computing Machinery. <a href="https://doi.org/10.1145/3662158.3662768">https://doi.org/10.1145/3662158.3662768</a>'
  chicago: Alistarh, Dan-Adrian, Krishnendu Chatterjee, Mehrdad Karrabi, and John
    M Lazarsfeld. “Game Dynamics and Equilibrium Computation in the Population Protocol
    Model.” In <i>Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed
    Computing</i>, 40–49. Association for Computing Machinery, 2024. <a href="https://doi.org/10.1145/3662158.3662768">https://doi.org/10.1145/3662158.3662768</a>.
  ieee: D.-A. Alistarh, K. Chatterjee, M. Karrabi, and J. M. Lazarsfeld, “Game dynamics
    and equilibrium computation in the population protocol model,” in <i>Proceedings
    of the 43rd Annual ACM Symposium on Principles of Distributed Computing</i>, Nantes,
    France, 2024, pp. 40–49.
  ista: 'Alistarh D-A, Chatterjee K, Karrabi M, Lazarsfeld JM. 2024. Game dynamics
    and equilibrium computation in the population protocol model. Proceedings of the
    43rd Annual ACM Symposium on Principles of Distributed Computing. PODC: Symposium
    on Principles of Distributed Computing, 40–49.'
  mla: Alistarh, Dan-Adrian, et al. “Game Dynamics and Equilibrium Computation in
    the Population Protocol Model.” <i>Proceedings of the 43rd Annual ACM Symposium
    on Principles of Distributed Computing</i>, Association for Computing Machinery,
    2024, pp. 40–49, doi:<a href="https://doi.org/10.1145/3662158.3662768">10.1145/3662158.3662768</a>.
  short: D.-A. Alistarh, K. Chatterjee, M. Karrabi, J.M. Lazarsfeld, in:, Proceedings
    of the 43rd Annual ACM Symposium on Principles of Distributed Computing, Association
    for Computing Machinery, 2024, pp. 40–49.
conference:
  end_date: 2024-06-21
  location: Nantes, France
  name: 'PODC: Symposium on Principles of Distributed Computing'
  start_date: 2024-06-17
corr_author: '1'
date_created: 2024-07-28T22:01:10Z
date_published: 2024-06-17T00:00:00Z
date_updated: 2025-04-14T07:52:47Z
day: '17'
ddc:
- '000'
department:
- _id: DaAl
- _id: KrCh
doi: 10.1145/3662158.3662768
ec_funded: 1
file:
- access_level: open_access
  checksum: 65a40437f83373fa79dd999d5287509e
  content_type: application/pdf
  creator: dernst
  date_created: 2024-07-29T07:37:31Z
  date_updated: 2024-07-29T07:37:31Z
  file_id: '17335'
  file_name: 2024_ACMPODC_Alistarh.pdf
  file_size: 750908
  relation: main_file
  success: 1
file_date_updated: 2024-07-29T07:37:31Z
has_accepted_license: '1'
language:
- iso: eng
month: '06'
oa: 1
oa_version: Published Version
page: 40-49
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Proceedings of the 43rd Annual ACM Symposium on Principles of Distributed
  Computing
publication_identifier:
  isbn:
  - '9798400706684'
publication_status: published
publisher: Association for Computing Machinery
quality_controlled: '1'
scopus_import: '1'
status: public
title: Game dynamics and equilibrium computation in the population protocol model
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2024'
...
---
_id: '17402'
abstract:
- lang: eng
  text: We present version 2.0 of the Partial Exploration Tool (PET), a tool for verification
    of probabilistic systems. We extend the previous version by adding support for
    stochastic games, based on a recent unified framework for sound value iteration
    algorithms. Thereby, PET2 is the first tool implementing a sound and efficient
    approach for solving stochastic games with objectives of the type reachability/safety
    and mean payoff. We complement this approach by developing and implementing a
    partial-exploration based variant for all three objectives. Our experimental evaluation
    shows that PET2 offers the most efficient partial-exploration based algorithm
    and is the most viable tool on SGs, even outperforming unsound tools.
acknowledgement: M. Weininger has received funding from the EU’s Horizon 2020 research
  and innovation programme under the Marie Skłodowska-Curie grant agreement No. 101034413.
alternative_title:
- LNCS
article_processing_charge: Yes (in subscription journal)
arxiv: 1
author:
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
- first_name: Maximilian
  full_name: Weininger, Maximilian
  id: 02ab0197-cc70-11ed-ab61-918e71f56881
  last_name: Weininger
citation:
  ama: 'Meggendorfer T, Weininger M. Playing games with your PET: Extending the Partial
    Exploration Tool to stochastic games. In: <i>36th International Conference on
    Computer Aided Verification</i>. Vol 14683. Springer Nature; 2024:359-372. doi:<a
    href="https://doi.org/10.1007/978-3-031-65633-0_16">10.1007/978-3-031-65633-0_16</a>'
  apa: 'Meggendorfer, T., &#38; Weininger, M. (2024). Playing games with your PET:
    Extending the Partial Exploration Tool to stochastic games. In <i>36th International
    Conference on Computer Aided Verification</i> (Vol. 14683, pp. 359–372). Montreal,
    Canada: Springer Nature. <a href="https://doi.org/10.1007/978-3-031-65633-0_16">https://doi.org/10.1007/978-3-031-65633-0_16</a>'
  chicago: 'Meggendorfer, Tobias, and Maximilian Weininger. “Playing Games with Your
    PET: Extending the Partial Exploration Tool to Stochastic Games.” In <i>36th International
    Conference on Computer Aided Verification</i>, 14683:359–72. Springer Nature,
    2024. <a href="https://doi.org/10.1007/978-3-031-65633-0_16">https://doi.org/10.1007/978-3-031-65633-0_16</a>.'
  ieee: 'T. Meggendorfer and M. Weininger, “Playing games with your PET: Extending
    the Partial Exploration Tool to stochastic games,” in <i>36th International Conference
    on Computer Aided Verification</i>, Montreal, Canada, 2024, vol. 14683, pp. 359–372.'
  ista: 'Meggendorfer T, Weininger M. 2024. Playing games with your PET: Extending
    the Partial Exploration Tool to stochastic games. 36th International Conference
    on Computer Aided Verification. CAV: Computer Aided Verification, LNCS, vol. 14683,
    359–372.'
  mla: 'Meggendorfer, Tobias, and Maximilian Weininger. “Playing Games with Your PET:
    Extending the Partial Exploration Tool to Stochastic Games.” <i>36th International
    Conference on Computer Aided Verification</i>, vol. 14683, Springer Nature, 2024,
    pp. 359–72, doi:<a href="https://doi.org/10.1007/978-3-031-65633-0_16">10.1007/978-3-031-65633-0_16</a>.'
  short: T. Meggendorfer, M. Weininger, in:, 36th International Conference on Computer
    Aided Verification, Springer Nature, 2024, pp. 359–372.
conference:
  end_date: 2024-07-27
  location: Montreal, Canada
  name: 'CAV: Computer Aided Verification'
  start_date: 2024-07-24
corr_author: '1'
date_created: 2024-08-09T11:24:54Z
date_published: 2024-07-01T00:00:00Z
date_updated: 2025-09-08T08:53:55Z
day: '01'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1007/978-3-031-65633-0_16
ec_funded: 1
external_id:
  arxiv:
  - '2405.03885'
  isi:
  - '001307897000016'
file:
- access_level: open_access
  checksum: c888231d0a47b55786b7b4c0f02216bb
  content_type: application/pdf
  creator: dernst
  date_created: 2024-08-12T08:39:12Z
  date_updated: 2024-08-12T08:39:12Z
  file_id: '17419'
  file_name: 2024_CAV_Meggendorfer.pdf
  file_size: 368487
  relation: main_file
  success: 1
file_date_updated: 2024-08-12T08:39:12Z
has_accepted_license: '1'
intvolume: '     14683'
isi: 1
language:
- iso: eng
month: '07'
oa: 1
oa_version: Published Version
page: 359-372
project:
- _id: fc2ed2f7-9c52-11eb-aca3-c01059dda49c
  call_identifier: H2020
  grant_number: '101034413'
  name: 'IST-BRIDGE: International postdoctoral program'
publication: 36th International Conference on Computer Aided Verification
publication_identifier:
  eisbn:
  - '9783031656330'
  eissn:
  - 1611-3349
  isbn:
  - '9783031656323'
  issn:
  - 0302-9743
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
scopus_import: '1'
status: public
title: 'Playing games with your PET: Extending the Partial Exploration Tool to stochastic
  games'
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 317138e5-6ab7-11ef-aa6d-ffef3953e345
volume: 14683
year: '2024'
...
---
OA_place: publisher
OA_type: hybrid
_id: '17474'
abstract:
- lang: eng
  text: Entropic risk (ERisk) is an established risk measure in finance, quantifying
    risk by an exponential re-weighting of rewards. We study ERisk for the first time
    in the context of turn-based stochastic games with the total reward objective.
    This gives rise to an objective function that demands the control of systems in
    a risk-averse manner. We show that the resulting games are determined and, in
    particular, admit optimal memoryless deterministic strategies. This contrasts
    risk measures that previously have been considered in the special case of Markov
    decision processes and that require randomization and/or memory. We provide several
    results on the decidability and the computational complexity of the threshold
    problem, i.e. whether the optimal value of ERisk exceeds a given threshold. Furthermore,
    an approximation algorithm for the optimal value of ERisk is provided.
acknowledgement: Krishnendu Chatterjee reports financial support was provided by European
  Research Council.
article_number: '105214'
article_processing_charge: Yes (in subscription journal)
article_type: original
arxiv: 1
author:
- first_name: Christel
  full_name: Baier, Christel
  last_name: Baier
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
- first_name: Jakob
  full_name: Piribauer, Jakob
  last_name: Piribauer
citation:
  ama: Baier C, Chatterjee K, Meggendorfer T, Piribauer J. Entropic risk for turn-based
    stochastic games. <i>Information and Computation</i>. 2024;301. doi:<a href="https://doi.org/10.1016/j.ic.2024.105214">10.1016/j.ic.2024.105214</a>
  apa: Baier, C., Chatterjee, K., Meggendorfer, T., &#38; Piribauer, J. (2024). Entropic
    risk for turn-based stochastic games. <i>Information and Computation</i>. Elsevier.
    <a href="https://doi.org/10.1016/j.ic.2024.105214">https://doi.org/10.1016/j.ic.2024.105214</a>
  chicago: Baier, Christel, Krishnendu Chatterjee, Tobias Meggendorfer, and Jakob
    Piribauer. “Entropic Risk for Turn-Based Stochastic Games.” <i>Information and
    Computation</i>. Elsevier, 2024. <a href="https://doi.org/10.1016/j.ic.2024.105214">https://doi.org/10.1016/j.ic.2024.105214</a>.
  ieee: C. Baier, K. Chatterjee, T. Meggendorfer, and J. Piribauer, “Entropic risk
    for turn-based stochastic games,” <i>Information and Computation</i>, vol. 301.
    Elsevier, 2024.
  ista: Baier C, Chatterjee K, Meggendorfer T, Piribauer J. 2024. Entropic risk for
    turn-based stochastic games. Information and Computation. 301, 105214.
  mla: Baier, Christel, et al. “Entropic Risk for Turn-Based Stochastic Games.” <i>Information
    and Computation</i>, vol. 301, 105214, Elsevier, 2024, doi:<a href="https://doi.org/10.1016/j.ic.2024.105214">10.1016/j.ic.2024.105214</a>.
  short: C. Baier, K. Chatterjee, T. Meggendorfer, J. Piribauer, Information and Computation
    301 (2024).
corr_author: '1'
date_created: 2024-09-01T22:01:07Z
date_published: 2024-12-01T00:00:00Z
date_updated: 2025-09-08T09:10:06Z
day: '01'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1016/j.ic.2024.105214
external_id:
  arxiv:
  - '2307.06611'
  isi:
  - '001301143400001'
file:
- access_level: open_access
  checksum: f68e0c2f46f9b9c86815406bcf2ee2d4
  content_type: application/pdf
  creator: dernst
  date_created: 2025-01-09T13:49:03Z
  date_updated: 2025-01-09T13:49:03Z
  file_id: '18817'
  file_name: 2024_InformationComputation_Baier.pdf
  file_size: 724703
  relation: main_file
  success: 1
file_date_updated: 2025-01-09T13:49:03Z
has_accepted_license: '1'
intvolume: '       301'
isi: 1
language:
- iso: eng
month: '12'
oa: 1
oa_version: Published Version
publication: Information and Computation
publication_identifier:
  eissn:
  - 1090-2651
  issn:
  - 0890-5401
publication_status: published
publisher: Elsevier
quality_controlled: '1'
related_material:
  record:
  - id: '14417'
    relation: earlier_version
    status: public
scopus_import: '1'
status: public
title: Entropic risk for turn-based stochastic games
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 317138e5-6ab7-11ef-aa6d-ffef3953e345
volume: 301
year: '2024'
...
---
_id: '18155'
abstract:
- lang: eng
  text: We study the classical problem of verifying programs with respect to formal
    specifications given in the linear temporal logic (LTL). We first present novel
    sound and complete witnesses for LTL verification over imperative programs. Our
    witnesses are applicable to both verification (proving) and refutation (finding
    bugs) settings. We then consider LTL formulas in which atomic propositions can
    be polynomial constraints and turn our focus to polynomial arithmetic programs,
    i.e. programs in which every assignment and guard consists only of polynomial
    expressions. For this setting, we provide an efficient algorithm to automatically
    synthesize such LTL witnesses. Our synthesis procedure is both sound and semi-complete.
    Finally, we present experimental results demonstrating the effectiveness of our
    approach and that it can handle programs which were beyond the reach of previous
    state-of-the-art tools.
acknowledgement: This work was supported in part by the ERC-2020-CoG 863818 (FoRM-SMArt)
  and the Hong Kong Research Grants Council ECS Project Number 26208122.
alternative_title:
- LNCS
article_processing_charge: Yes (in subscription journal)
arxiv: 1
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Amir Kafshdar
  full_name: Goharshady, Amir Kafshdar
  id: 391365CE-F248-11E8-B48F-1D18A9856A87
  last_name: Goharshady
  orcid: 0000-0003-1702-6584
- first_name: Ehsan
  full_name: Goharshady, Ehsan
  last_name: Goharshady
- first_name: Mehrdad
  full_name: Karrabi, Mehrdad
  id: 67638922-f394-11eb-9cf6-f20423e08757
  last_name: Karrabi
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: 'Chatterjee K, Goharshady AK, Goharshady E, Karrabi M, Zikelic D. Sound and complete
    witnesses for template-based verification of LTL properties on polynomial programs.
    In: <i>Lecture Notes in Computer Science (Including Subseries Lecture Notes in
    Artificial Intelligence and Lecture Notes in Bioinformatics)</i>. Vol 14933. Springer
    Nature; 2024:600-619. doi:<a href="https://doi.org/10.1007/978-3-031-71162-6_31">10.1007/978-3-031-71162-6_31</a>'
  apa: 'Chatterjee, K., Goharshady, A. K., Goharshady, E., Karrabi, M., &#38; Zikelic,
    D. (2024). Sound and complete witnesses for template-based verification of LTL
    properties on polynomial programs. In <i>Lecture Notes in Computer Science (including
    subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</i>
    (Vol. 14933, pp. 600–619). Milan, Italy: Springer Nature. <a href="https://doi.org/10.1007/978-3-031-71162-6_31">https://doi.org/10.1007/978-3-031-71162-6_31</a>'
  chicago: Chatterjee, Krishnendu, Amir Kafshdar Goharshady, Ehsan Goharshady, Mehrdad
    Karrabi, and Dorde Zikelic. “Sound and Complete Witnesses for Template-Based Verification
    of LTL Properties on Polynomial Programs.” In <i>Lecture Notes in Computer Science
    (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes
    in Bioinformatics)</i>, 14933:600–619. Springer Nature, 2024. <a href="https://doi.org/10.1007/978-3-031-71162-6_31">https://doi.org/10.1007/978-3-031-71162-6_31</a>.
  ieee: K. Chatterjee, A. K. Goharshady, E. Goharshady, M. Karrabi, and D. Zikelic,
    “Sound and complete witnesses for template-based verification of LTL properties
    on polynomial programs,” in <i>Lecture Notes in Computer Science (including subseries
    Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</i>,
    Milan, Italy, 2024, vol. 14933, pp. 600–619.
  ista: 'Chatterjee K, Goharshady AK, Goharshady E, Karrabi M, Zikelic D. 2024. Sound
    and complete witnesses for template-based verification of LTL properties on polynomial
    programs. Lecture Notes in Computer Science (including subseries Lecture Notes
    in Artificial Intelligence and Lecture Notes in Bioinformatics). FM: Formal Methods,
    LNCS, vol. 14933, 600–619.'
  mla: Chatterjee, Krishnendu, et al. “Sound and Complete Witnesses for Template-Based
    Verification of LTL Properties on Polynomial Programs.” <i>Lecture Notes in Computer
    Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture
    Notes in Bioinformatics)</i>, vol. 14933, Springer Nature, 2024, pp. 600–19, doi:<a
    href="https://doi.org/10.1007/978-3-031-71162-6_31">10.1007/978-3-031-71162-6_31</a>.
  short: K. Chatterjee, A.K. Goharshady, E. Goharshady, M. Karrabi, D. Zikelic, in:,
    Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial
    Intelligence and Lecture Notes in Bioinformatics), Springer Nature, 2024, pp.
    600–619.
conference:
  end_date: 2024-09-13
  location: Milan, Italy
  name: 'FM: Formal Methods'
  start_date: 2024-09-09
corr_author: '1'
date_created: 2024-09-29T22:01:37Z
date_published: 2024-09-11T00:00:00Z
date_updated: 2025-09-08T09:51:34Z
day: '11'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1007/978-3-031-71162-6_31
ec_funded: 1
external_id:
  arxiv:
  - '2403.05386'
  isi:
  - '001336893300031'
file:
- access_level: open_access
  checksum: 223845be9e754681ee218866827c95e7
  content_type: application/pdf
  creator: dernst
  date_created: 2024-10-01T09:56:54Z
  date_updated: 2024-10-01T09:56:54Z
  file_id: '18165'
  file_name: 2024_LNCS_Chatterjee.pdf
  file_size: 650495
  relation: main_file
  success: 1
file_date_updated: 2024-10-01T09:56:54Z
has_accepted_license: '1'
intvolume: '     14933'
isi: 1
language:
- iso: eng
month: '09'
oa: 1
oa_version: Published Version
page: 600-619
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Lecture Notes in Computer Science (including subseries Lecture Notes
  in Artificial Intelligence and Lecture Notes in Bioinformatics)
publication_identifier:
  eissn:
  - 1611-3349
  isbn:
  - '9783031711619'
  issn:
  - 0302-9743
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
scopus_import: '1'
status: public
title: Sound and complete witnesses for template-based verification of LTL properties
  on polynomial programs
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 317138e5-6ab7-11ef-aa6d-ffef3953e345
volume: 14933
year: '2024'
...
---
_id: '18159'
abstract:
- lang: eng
  text: "Markov Decision Processes (MDPs) are a classical model for decision making
    in the presence of uncertainty. Often they are viewed as state transformers with
    planning objectives defned with respect to paths over MDP states. An increasingly\r\npopular
    alternative is to view them as distribution transformers, giving rise to a sequence
    of probability distributions over MDP states. For instance, reachability and safety
    properties in modeling robot swarms or chemical reaction networks are naturally
    defned in terms of probability distributions over states. Verifying such distributional
    properties is known to be hard and often beyond the reach of classical state-based
    verifcation techniques. In this work, we consider the problems of certifed policy
    (i.e. controller) verifcation and synthesis in MDPs under distributional reach-avoidance
    specifcations. By certifed we mean that, along with a policy, we also aim to synthesize
    a (checkable) certifcate ensuring that the MDP indeed satisfes the property. Thus,
    given the target set of distributions and an unsafe set of distributions over
    MDP states, our goal is to either synthesize a certifcate for a given policy or
    synthesize a policy along with a certifcate, proving that the target distribution
    can be reached while avoiding unsafe distributions. To solve this problem, we
    introduce the novel notion of distributional reach-avoid certifcates and present
    automated procedures for (1) synthesizing a certifcate for a given policy, and
    (2) synthesizing a policy together with the certifcate, both providing formal
    guarantees on certifcate correctness. Our experimental evaluation demonstrates
    the ability of our method to solve several non-trivial examples, including a multi-agent
    robot-swarm model, to synthesize certifed policies and to certify existing policies. "
acknowledgement: This work was supported in part by the ERC-2020-CoG 863818 (FoRM-SMArt),
  the Singapore Ministry of Education (MOE) Academic Research Fund (AcRF) Tier 1 grant,
  Google Research Award 2023 and the SBI Foundation Hub for Data and Analytics.
article_processing_charge: No
arxiv: 1
author:
- first_name: S
  full_name: Akshay, S
  last_name: Akshay
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: 'Akshay S, Chatterjee K, Meggendorfer T, Zikelic D. Certified policy verification
    and synthesis for MDPs under distributional reach-avoidance properties. In: <i>Proceedings
    of the Thirty-Third International Joint Conference on Artificial Intelligence</i>.
    International Joint Conferences on Artificial Intelligence; 2024:3-12. doi:<a
    href="https://doi.org/10.24963/ijcai.2024/1">10.24963/ijcai.2024/1</a>'
  apa: 'Akshay, S., Chatterjee, K., Meggendorfer, T., &#38; Zikelic, D. (2024). Certified
    policy verification and synthesis for MDPs under distributional reach-avoidance
    properties. In <i>Proceedings of the Thirty-Third International Joint Conference
    on Artificial Intelligence</i> (pp. 3–12). Jeju, Korea: International Joint Conferences
    on Artificial Intelligence. <a href="https://doi.org/10.24963/ijcai.2024/1">https://doi.org/10.24963/ijcai.2024/1</a>'
  chicago: Akshay, S, Krishnendu Chatterjee, Tobias Meggendorfer, and Dorde Zikelic.
    “Certified Policy Verification and Synthesis for MDPs under Distributional Reach-Avoidance
    Properties.” In <i>Proceedings of the Thirty-Third International Joint Conference
    on Artificial Intelligence</i>, 3–12. International Joint Conferences on Artificial
    Intelligence, 2024. <a href="https://doi.org/10.24963/ijcai.2024/1">https://doi.org/10.24963/ijcai.2024/1</a>.
  ieee: S. Akshay, K. Chatterjee, T. Meggendorfer, and D. Zikelic, “Certified policy
    verification and synthesis for MDPs under distributional reach-avoidance properties,”
    in <i>Proceedings of the Thirty-Third International Joint Conference on Artificial
    Intelligence</i>, Jeju, Korea, 2024, pp. 3–12.
  ista: 'Akshay S, Chatterjee K, Meggendorfer T, Zikelic D. 2024. Certified policy
    verification and synthesis for MDPs under distributional reach-avoidance properties.
    Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.
    IJCAI: International Joint Conference on Artificial Intelligence, 3–12.'
  mla: Akshay, S., et al. “Certified Policy Verification and Synthesis for MDPs under
    Distributional Reach-Avoidance Properties.” <i>Proceedings of the Thirty-Third
    International Joint Conference on Artificial Intelligence</i>, International Joint
    Conferences on Artificial Intelligence, 2024, pp. 3–12, doi:<a href="https://doi.org/10.24963/ijcai.2024/1">10.24963/ijcai.2024/1</a>.
  short: S. Akshay, K. Chatterjee, T. Meggendorfer, D. Zikelic, in:, Proceedings of
    the Thirty-Third International Joint Conference on Artificial Intelligence, International
    Joint Conferences on Artificial Intelligence, 2024, pp. 3–12.
conference:
  end_date: 2024-08-09
  location: Jeju, Korea
  name: 'IJCAI: International Joint Conference on Artificial Intelligence'
  start_date: 2024-08-03
corr_author: '1'
date_created: 2024-09-29T22:01:38Z
date_published: 2024-09-01T00:00:00Z
date_updated: 2025-04-14T07:52:46Z
day: '01'
department:
- _id: KrCh
doi: 10.24963/ijcai.2024/1
ec_funded: 1
external_id:
  arxiv:
  - '2405.04015'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://doi.org/10.48550/arXiv.2405.04015
month: '09'
oa: 1
oa_version: Preprint
page: 3-12
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Proceedings of the Thirty-Third International Joint Conference on Artificial
  Intelligence
publication_identifier:
  isbn:
  - '9781956792041'
  issn:
  - 1045-0823
publication_status: published
publisher: International Joint Conferences on Artificial Intelligence
quality_controlled: '1'
scopus_import: '1'
status: public
title: Certified policy verification and synthesis for MDPs under distributional reach-avoidance
  properties
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2024'
...
---
OA_place: repository
OA_type: green
_id: '18160'
abstract:
- lang: eng
  text: 'Markov decision processes (MDPs) provide a standard framework for sequential
    decision making under uncertainty. However, MDPs do not take uncertainty in transition
    probabilities into account. Robust Markov decision processes (RMDPs) address this
    shortcoming of MDPs by assigning to each transition an uncertainty set rather
    than a single probability value. In this work, we consider polytopic RMDPs in
    which all uncertainty sets are polytopes and study the problem of solving long-run
    average reward polytopic RMDPs. We present a novel perspective on this problem
    and show that it can be reduced to solving long-run average reward turn-based
    stochastic games with finite state and action spaces. This reduction allows us
    to derive several important consequences that were hitherto not known to hold
    for polytopic RMDPs. First, we derive new computational complexity bounds for
    solving long-run average reward polytopic RMDPs, showing for the first time that
    the threshold decision problem for them is in NP∩CONP and that they admit a randomized
    algorithm with sub-exponential expected runtime. Second, we present Robust Polytopic
    Policy Iteration (RPPI), a novel policy iteration algorithm for solving long-run
    average reward polytopic RMDPs. Our experimental evaluation shows that RPPI is
    much more efficient in solving long-run average reward polytopic RMDPs compared
    to state-of-the-art methods based on value iteration. '
acknowledgement: "This work was supported in part by the ERC-2020-CoG 863818 (FoRM-SMArt)
  and the Czech Science Foundation\r\ngrant no. GA23-06963S."
article_processing_charge: No
arxiv: 1
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Ehsan
  full_name: Kafshdar Goharshadi, Ehsan
  id: 103b4fa0-896a-11ed-bdf8-87b697bef40d
  last_name: Kafshdar Goharshadi
  orcid: 0000-0002-8595-0587
- first_name: Mehrdad
  full_name: Karrabi, Mehrdad
  id: 67638922-f394-11eb-9cf6-f20423e08757
  last_name: Karrabi
- first_name: Petr
  full_name: Novotný, Petr
  id: 3CC3B868-F248-11E8-B48F-1D18A9856A87
  last_name: Novotný
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: 'Chatterjee K, Goharshady E, Karrabi M, Novotný P, Zikelic D. Solving long-run
    average reward robust MDPs via stochastic games. In: <i>33rd International Joint
    Conference on Artificial Intelligence</i>. International Joint Conferences on
    Artificial Intelligence; 2024:6707-6715. doi:<a href="https://doi.org/10.24963/ijcai.2024/741">10.24963/ijcai.2024/741</a>'
  apa: 'Chatterjee, K., Goharshady, E., Karrabi, M., Novotný, P., &#38; Zikelic, D.
    (2024). Solving long-run average reward robust MDPs via stochastic games. In <i>33rd
    International Joint Conference on Artificial Intelligence</i> (pp. 6707–6715).
    Jeju, South Korea: International Joint Conferences on Artificial Intelligence.
    <a href="https://doi.org/10.24963/ijcai.2024/741">https://doi.org/10.24963/ijcai.2024/741</a>'
  chicago: Chatterjee, Krishnendu, Ehsan Goharshady, Mehrdad Karrabi, Petr Novotný,
    and Dorde Zikelic. “Solving Long-Run Average Reward Robust MDPs via Stochastic
    Games.” In <i>33rd International Joint Conference on Artificial Intelligence</i>,
    6707–15. International Joint Conferences on Artificial Intelligence, 2024. <a
    href="https://doi.org/10.24963/ijcai.2024/741">https://doi.org/10.24963/ijcai.2024/741</a>.
  ieee: K. Chatterjee, E. Goharshady, M. Karrabi, P. Novotný, and D. Zikelic, “Solving
    long-run average reward robust MDPs via stochastic games,” in <i>33rd International
    Joint Conference on Artificial Intelligence</i>, Jeju, South Korea, 2024, pp.
    6707–6715.
  ista: 'Chatterjee K, Goharshady E, Karrabi M, Novotný P, Zikelic D. 2024. Solving
    long-run average reward robust MDPs via stochastic games. 33rd International Joint
    Conference on Artificial Intelligence. IJCAI: International Joint Conference on
    Artificial Intelligence, 6707–6715.'
  mla: Chatterjee, Krishnendu, et al. “Solving Long-Run Average Reward Robust MDPs
    via Stochastic Games.” <i>33rd International Joint Conference on Artificial Intelligence</i>,
    International Joint Conferences on Artificial Intelligence, 2024, pp. 6707–15,
    doi:<a href="https://doi.org/10.24963/ijcai.2024/741">10.24963/ijcai.2024/741</a>.
  short: K. Chatterjee, E. Goharshady, M. Karrabi, P. Novotný, D. Zikelic, in:, 33rd
    International Joint Conference on Artificial Intelligence, International Joint
    Conferences on Artificial Intelligence, 2024, pp. 6707–6715.
conference:
  end_date: 2024-08-09
  location: Jeju, South Korea
  name: 'IJCAI: International Joint Conference on Artificial Intelligence'
  start_date: 2024-08-03
corr_author: '1'
date_created: 2024-09-29T22:01:39Z
date_published: 2024-09-01T00:00:00Z
date_updated: 2025-04-14T07:52:46Z
day: '01'
department:
- _id: KrCh
doi: 10.24963/ijcai.2024/741
ec_funded: 1
external_id:
  arxiv:
  - '2312.13912'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://doi.org/10.48550/arXiv.2312.13912
month: '09'
oa: 1
oa_version: Preprint
page: 6707-6715
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: 33rd International Joint Conference on Artificial Intelligence
publication_identifier:
  isbn:
  - '9781956792041'
  issn:
  - 1045-0823
publication_status: published
publisher: International Joint Conferences on Artificial Intelligence
quality_controlled: '1'
scopus_import: '1'
status: public
title: Solving long-run average reward robust MDPs via stochastic games
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2024'
...
---
OA_type: closed access
_id: '18266'
abstract:
- lang: eng
  text: Matrix games are the most basic model in game theory, and yet robustness with
    respect to small perturbations of the matrix entries is not fully understood.
    In this paper, we introduce value positivity and uniform value positivity, two
    properties that refine the notion of optimality in the context of polynomially
    perturbed matrix games. The first concept captures how the value depends on the
    perturbation parameter, and the second consists of the existence of a fixed strategy
    that guarantees the value of the unperturbed matrix game for every sufficiently
    small positive parameter. We provide polynomial-time algorithms to check whether
    a polynomially perturbed matrix game satisfies these properties. We further provide
    the functional form for a parameterized optimal strategy and the value function.
    Finally, we translate our results to linear programming and stochastic games,
    where value positivity is related to the existence of robust solutions.
acknowledgement: This research was supported by Fondation CFM pour la Recherche, the
  H2020 European Research Council [Grant ERC-CoG-863818 (ForM-SMArt)], the Austrian
  Science Fund [Grant 10.55776/COE12], ANID Chile [Grant ACT210005], and Agence Nationale
  de la Recherche [Grant ANR-21-CE40-0020].
article_processing_charge: No
article_type: original
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Miquel
  full_name: Oliu-Barton, Miquel
  last_name: Oliu-Barton
- first_name: Raimundo J
  full_name: Saona Urmeneta, Raimundo J
  id: BD1DF4C4-D767-11E9-B658-BC13E6697425
  last_name: Saona Urmeneta
  orcid: 0000-0001-5103-038X
citation:
  ama: Chatterjee K, Oliu-Barton M, Saona Urmeneta RJ. Value-positivity for matrix
    games. <i>Mathematics of Operations Research</i>. 2024;50(4):2433-3282. doi:<a
    href="https://doi.org/10.1287/moor.2022.0332">10.1287/moor.2022.0332</a>
  apa: Chatterjee, K., Oliu-Barton, M., &#38; Saona Urmeneta, R. J. (2024). Value-positivity
    for matrix games. <i>Mathematics of Operations Research</i>. Institute for Operations
    Research and the Management Sciences. <a href="https://doi.org/10.1287/moor.2022.0332">https://doi.org/10.1287/moor.2022.0332</a>
  chicago: Chatterjee, Krishnendu, Miquel Oliu-Barton, and Raimundo J Saona Urmeneta.
    “Value-Positivity for Matrix Games.” <i>Mathematics of Operations Research</i>.
    Institute for Operations Research and the Management Sciences, 2024. <a href="https://doi.org/10.1287/moor.2022.0332">https://doi.org/10.1287/moor.2022.0332</a>.
  ieee: K. Chatterjee, M. Oliu-Barton, and R. J. Saona Urmeneta, “Value-positivity
    for matrix games,” <i>Mathematics of Operations Research</i>, vol. 50, no. 4.
    Institute for Operations Research and the Management Sciences, pp. 2433–3282,
    2024.
  ista: Chatterjee K, Oliu-Barton M, Saona Urmeneta RJ. 2024. Value-positivity for
    matrix games. Mathematics of Operations Research. 50(4), 2433–3282.
  mla: Chatterjee, Krishnendu, et al. “Value-Positivity for Matrix Games.” <i>Mathematics
    of Operations Research</i>, vol. 50, no. 4, Institute for Operations Research
    and the Management Sciences, 2024, pp. 2433–3282, doi:<a href="https://doi.org/10.1287/moor.2022.0332">10.1287/moor.2022.0332</a>.
  short: K. Chatterjee, M. Oliu-Barton, R.J. Saona Urmeneta, Mathematics of Operations
    Research 50 (2024) 2433–3282.
corr_author: '1'
date_created: 2024-10-09T07:02:20Z
date_published: 2024-10-01T00:00:00Z
date_updated: 2026-04-07T12:31:21Z
day: '01'
department:
- _id: GradSch
- _id: KrCh
doi: 10.1287/moor.2022.0332
ec_funded: 1
external_id:
  isi:
  - '001328875900001'
intvolume: '        50'
isi: 1
issue: '4'
language:
- iso: eng
month: '10'
oa_version: None
page: 2433-3282
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Mathematics of Operations Research
publication_identifier:
  eissn:
  - 1526-5471
  issn:
  - 0364-765X
publication_status: published
publisher: Institute for Operations Research and the Management Sciences
quality_controlled: '1'
related_material:
  record:
  - id: '20234'
    relation: dissertation_contains
    status: public
scopus_import: '1'
status: public
title: Value-positivity for matrix games
type: journal_article
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 50
year: '2024'
...
---
_id: '12676'
abstract:
- lang: eng
  text: Turn-based stochastic games (aka simple stochastic games) are two-player zero-sum
    games played on directed graphs with probabilistic transitions. The goal of player-max
    is to maximize the probability to reach a target state against the adversarial
    player-min. These games lie in NP ∩ coNP and are among the rare combinatorial
    problems that belong to this complexity class for which the existence of polynomial-time
    algorithm is a major open question. While randomized sub-exponential time algorithm
    exists, all known deterministic algorithms require exponential time in the worst-case.
    An important open question has been whether faster algorithms can be obtained
    parametrized by the treewidth of the game graph. Even deterministic sub-exponential
    time algorithm for constant treewidth turn-based stochastic games has remain elusive.
    In this work our main result is a deterministic algorithm to solve turn-based
    stochastic games that, given a game with n states, treewidth at most t, and the
    bit-complexity of the probabilistic transition function log D, has running time
    O ((tn2 log D)t log n). In particular, our algorithm is quasi-polynomial time
    for games with constant or poly-logarithmic treewidth.
acknowledgement: This research was partially supported by the ERC CoG 863818 (ForM-SMArt)
  grant.
article_processing_charge: No
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
- first_name: Raimundo J
  full_name: Saona Urmeneta, Raimundo J
  id: BD1DF4C4-D767-11E9-B658-BC13E6697425
  last_name: Saona Urmeneta
  orcid: 0000-0001-5103-038X
- first_name: Jakub
  full_name: Svoboda, Jakub
  id: 130759D2-D7DD-11E9-87D2-DE0DE6697425
  last_name: Svoboda
  orcid: 0000-0002-1419-3267
citation:
  ama: 'Chatterjee K, Meggendorfer T, Saona Urmeneta RJ, Svoboda J. Faster algorithm
    for turn-based stochastic games with bounded treewidth. In: <i>Proceedings of
    the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms</i>. Society for Industrial
    and Applied Mathematics; 2023:4590-4605. doi:<a href="https://doi.org/10.1137/1.9781611977554.ch173">10.1137/1.9781611977554.ch173</a>'
  apa: 'Chatterjee, K., Meggendorfer, T., Saona Urmeneta, R. J., &#38; Svoboda, J.
    (2023). Faster algorithm for turn-based stochastic games with bounded treewidth.
    In <i>Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms</i>
    (pp. 4590–4605). Florence, Italy: Society for Industrial and Applied Mathematics.
    <a href="https://doi.org/10.1137/1.9781611977554.ch173">https://doi.org/10.1137/1.9781611977554.ch173</a>'
  chicago: Chatterjee, Krishnendu, Tobias Meggendorfer, Raimundo J Saona Urmeneta,
    and Jakub Svoboda. “Faster Algorithm for Turn-Based Stochastic Games with Bounded
    Treewidth.” In <i>Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete
    Algorithms</i>, 4590–4605. Society for Industrial and Applied Mathematics, 2023.
    <a href="https://doi.org/10.1137/1.9781611977554.ch173">https://doi.org/10.1137/1.9781611977554.ch173</a>.
  ieee: K. Chatterjee, T. Meggendorfer, R. J. Saona Urmeneta, and J. Svoboda, “Faster
    algorithm for turn-based stochastic games with bounded treewidth,” in <i>Proceedings
    of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms</i>, Florence, Italy,
    2023, pp. 4590–4605.
  ista: 'Chatterjee K, Meggendorfer T, Saona Urmeneta RJ, Svoboda J. 2023. Faster
    algorithm for turn-based stochastic games with bounded treewidth. Proceedings
    of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms. SODA: Symposium
    on Discrete Algorithms, 4590–4605.'
  mla: Chatterjee, Krishnendu, et al. “Faster Algorithm for Turn-Based Stochastic
    Games with Bounded Treewidth.” <i>Proceedings of the 2023 Annual ACM-SIAM Symposium
    on Discrete Algorithms</i>, Society for Industrial and Applied Mathematics, 2023,
    pp. 4590–605, doi:<a href="https://doi.org/10.1137/1.9781611977554.ch173">10.1137/1.9781611977554.ch173</a>.
  short: K. Chatterjee, T. Meggendorfer, R.J. Saona Urmeneta, J. Svoboda, in:, Proceedings
    of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial
    and Applied Mathematics, 2023, pp. 4590–4605.
conference:
  end_date: 2023-01-25
  location: Florence, Italy
  name: 'SODA: Symposium on Discrete Algorithms'
  start_date: 2023-01-22
corr_author: '1'
date_created: 2023-02-24T12:20:47Z
date_published: 2023-02-01T00:00:00Z
date_updated: 2026-06-18T17:28:38Z
day: '01'
ddc:
- '000'
department:
- _id: GradSch
- _id: KrCh
doi: 10.1137/1.9781611977554.ch173
ec_funded: 1
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://doi.org/10.1137/1.9781611977554.ch173
month: '02'
oa: 1
oa_version: Published Version
page: 4590-4605
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms
publication_identifier:
  isbn:
  - '9781611977554'
publication_status: published
publisher: Society for Industrial and Applied Mathematics
quality_controlled: '1'
status: public
title: Faster algorithm for turn-based stochastic games with bounded treewidth
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2023'
...
---
_id: '12706'
abstract:
- lang: eng
  text: Allometric settings of population dynamics models are appealing due to their
    parsimonious nature and broad utility when studying system level effects. Here,
    we parameterise the size-scaled Rosenzweig-MacArthur differential equations to
    eliminate prey-mass dependency, facilitating an in depth analytic study of the
    equations which incorporates scaling parameters’ contributions to coexistence.
    We define the functional response term to match empirical findings, and examine
    situations where metabolic theory derivations and observation diverge. The dynamical
    properties of the Rosenzweig-MacArthur system, encompassing the distribution of
    size-abundance equilibria, the scaling of period and amplitude of population cycling,
    and relationships between predator and prey abundances, are consistent with empirical
    observation. Our parameterisation is an accurate minimal model across 15+ orders
    of mass magnitude.
acknowledgement: "This research was supported by an Australian Government Research
  Training Program\r\n(RTP) Scholarship to JCM (https://www.dese.gov.au), and LB is
  supported by the Centre de\r\nrecherche sur le vieillissement Fellowship Program.
  The funders had no role in study design, data collection and analysis, decision
  to publish, or preparation of the manuscript."
article_processing_charge: No
article_type: original
author:
- first_name: Jody C.
  full_name: Mckerral, Jody C.
  last_name: Mckerral
- first_name: Maria
  full_name: Kleshnina, Maria
  id: 4E21749C-F248-11E8-B48F-1D18A9856A87
  last_name: Kleshnina
- first_name: Vladimir
  full_name: Ejov, Vladimir
  last_name: Ejov
- first_name: Louise
  full_name: Bartle, Louise
  last_name: Bartle
- first_name: James G.
  full_name: Mitchell, James G.
  last_name: Mitchell
- first_name: Jerzy A.
  full_name: Filar, Jerzy A.
  last_name: Filar
citation:
  ama: Mckerral JC, Kleshnina M, Ejov V, Bartle L, Mitchell JG, Filar JA. Empirical
    parameterisation and dynamical analysis of the allometric Rosenzweig-MacArthur
    equations. <i>PLoS One</i>. 2023;18(2):e0279838. doi:<a href="https://doi.org/10.1371/journal.pone.0279838">10.1371/journal.pone.0279838</a>
  apa: Mckerral, J. C., Kleshnina, M., Ejov, V., Bartle, L., Mitchell, J. G., &#38;
    Filar, J. A. (2023). Empirical parameterisation and dynamical analysis of the
    allometric Rosenzweig-MacArthur equations. <i>PLoS One</i>. Public Library of
    Science. <a href="https://doi.org/10.1371/journal.pone.0279838">https://doi.org/10.1371/journal.pone.0279838</a>
  chicago: Mckerral, Jody C., Maria Kleshnina, Vladimir Ejov, Louise Bartle, James
    G. Mitchell, and Jerzy A. Filar. “Empirical Parameterisation and Dynamical Analysis
    of the Allometric Rosenzweig-MacArthur Equations.” <i>PLoS One</i>. Public Library
    of Science, 2023. <a href="https://doi.org/10.1371/journal.pone.0279838">https://doi.org/10.1371/journal.pone.0279838</a>.
  ieee: J. C. Mckerral, M. Kleshnina, V. Ejov, L. Bartle, J. G. Mitchell, and J. A.
    Filar, “Empirical parameterisation and dynamical analysis of the allometric Rosenzweig-MacArthur
    equations,” <i>PLoS One</i>, vol. 18, no. 2. Public Library of Science, p. e0279838,
    2023.
  ista: Mckerral JC, Kleshnina M, Ejov V, Bartle L, Mitchell JG, Filar JA. 2023. Empirical
    parameterisation and dynamical analysis of the allometric Rosenzweig-MacArthur
    equations. PLoS One. 18(2), e0279838.
  mla: Mckerral, Jody C., et al. “Empirical Parameterisation and Dynamical Analysis
    of the Allometric Rosenzweig-MacArthur Equations.” <i>PLoS One</i>, vol. 18, no.
    2, Public Library of Science, 2023, p. e0279838, doi:<a href="https://doi.org/10.1371/journal.pone.0279838">10.1371/journal.pone.0279838</a>.
  short: J.C. Mckerral, M. Kleshnina, V. Ejov, L. Bartle, J.G. Mitchell, J.A. Filar,
    PLoS One 18 (2023) e0279838.
date_created: 2023-03-05T23:01:05Z
date_published: 2023-02-27T00:00:00Z
date_updated: 2023-10-17T12:53:30Z
day: '27'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1371/journal.pone.0279838
external_id:
  isi:
  - '000996122900022'
  pmid:
  - '36848357'
file:
- access_level: open_access
  checksum: 798ed5739a4117b03173e5d56e0534c9
  content_type: application/pdf
  creator: cchlebak
  date_created: 2023-03-07T10:26:45Z
  date_updated: 2023-03-07T10:26:45Z
  file_id: '12712'
  file_name: 2023_PLOSOne_Mckerral.pdf
  file_size: 1257003
  relation: main_file
  success: 1
file_date_updated: 2023-03-07T10:26:45Z
has_accepted_license: '1'
intvolume: '        18'
isi: 1
issue: '2'
language:
- iso: eng
month: '02'
oa: 1
oa_version: Published Version
page: e0279838
pmid: 1
publication: PLoS One
publication_identifier:
  eissn:
  - 1932-6203
publication_status: published
publisher: Public Library of Science
quality_controlled: '1'
scopus_import: '1'
status: public
title: Empirical parameterisation and dynamical analysis of the allometric Rosenzweig-MacArthur
  equations
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 18
year: '2023'
...
---
_id: '12787'
abstract:
- lang: eng
  text: "Populations evolve in spatially heterogeneous environments. While a certain
    trait might bring a fitness advantage in some patch of the environment, a different
    trait might be advantageous in another patch. Here, we study the Moran birth–death
    process with two types of individuals in a population stretched across two patches
    of size N, each patch favouring one of the two types. We show that the long-term
    fate of such populations crucially depends on the migration rate μ\r\n between
    the patches. To classify the possible fates, we use the distinction between polynomial
    (short) and exponential (long) timescales. We show that when μ is high then one
    of the two types fixates on the whole population after a number of steps that
    is only polynomial in N. By contrast, when μ is low then each type holds majority
    in the patch where it is favoured for a number of steps that is at least exponential
    in N. Moreover, we precisely identify the threshold migration rate μ⋆ that separates
    those two scenarios, thereby exactly delineating the situations that support long-term
    coexistence of the two types. We also discuss the case of various cycle graphs
    and we present computer simulations that perfectly match our analytical results."
acknowledgement: J.S. and K.C. acknowledge support from the ERC CoG 863818 (ForM-SMArt)
article_number: '20220685'
article_processing_charge: No
article_type: original
author:
- first_name: Jakub
  full_name: Svoboda, Jakub
  id: 130759D2-D7DD-11E9-87D2-DE0DE6697425
  last_name: Svoboda
  orcid: 0000-0002-1419-3267
- first_name: Josef
  full_name: Tkadlec, Josef
  id: 3F24CCC8-F248-11E8-B48F-1D18A9856A87
  last_name: Tkadlec
  orcid: 0000-0002-1097-9684
- first_name: Kamran
  full_name: Kaveh, Kamran
  last_name: Kaveh
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
citation:
  ama: 'Svoboda J, Tkadlec J, Kaveh K, Chatterjee K. Coexistence times in the Moran
    process with environmental heterogeneity. <i>Proceedings of the Royal Society
    A: Mathematical, Physical and Engineering Sciences</i>. 2023;479(2271). doi:<a
    href="https://doi.org/10.1098/rspa.2022.0685">10.1098/rspa.2022.0685</a>'
  apa: 'Svoboda, J., Tkadlec, J., Kaveh, K., &#38; Chatterjee, K. (2023). Coexistence
    times in the Moran process with environmental heterogeneity. <i>Proceedings of
    the Royal Society A: Mathematical, Physical and Engineering Sciences</i>. The
    Royal Society. <a href="https://doi.org/10.1098/rspa.2022.0685">https://doi.org/10.1098/rspa.2022.0685</a>'
  chicago: 'Svoboda, Jakub, Josef Tkadlec, Kamran Kaveh, and Krishnendu Chatterjee.
    “Coexistence Times in the Moran Process with Environmental Heterogeneity.” <i>Proceedings
    of the Royal Society A: Mathematical, Physical and Engineering Sciences</i>. The
    Royal Society, 2023. <a href="https://doi.org/10.1098/rspa.2022.0685">https://doi.org/10.1098/rspa.2022.0685</a>.'
  ieee: 'J. Svoboda, J. Tkadlec, K. Kaveh, and K. Chatterjee, “Coexistence times in
    the Moran process with environmental heterogeneity,” <i>Proceedings of the Royal
    Society A: Mathematical, Physical and Engineering Sciences</i>, vol. 479, no.
    2271. The Royal Society, 2023.'
  ista: 'Svoboda J, Tkadlec J, Kaveh K, Chatterjee K. 2023. Coexistence times in the
    Moran process with environmental heterogeneity. Proceedings of the Royal Society
    A: Mathematical, Physical and Engineering Sciences. 479(2271), 20220685.'
  mla: 'Svoboda, Jakub, et al. “Coexistence Times in the Moran Process with Environmental
    Heterogeneity.” <i>Proceedings of the Royal Society A: Mathematical, Physical
    and Engineering Sciences</i>, vol. 479, no. 2271, 20220685, The Royal Society,
    2023, doi:<a href="https://doi.org/10.1098/rspa.2022.0685">10.1098/rspa.2022.0685</a>.'
  short: 'J. Svoboda, J. Tkadlec, K. Kaveh, K. Chatterjee, Proceedings of the Royal
    Society A: Mathematical, Physical and Engineering Sciences 479 (2023).'
date_created: 2023-04-02T22:01:09Z
date_published: 2023-03-29T00:00:00Z
date_updated: 2026-04-07T11:49:11Z
day: '29'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1098/rspa.2022.0685
ec_funded: 1
external_id:
  isi:
  - '000957125500002'
file:
- access_level: open_access
  checksum: 13953d349fbefcb5d21ccc6b303297eb
  content_type: application/pdf
  creator: dernst
  date_created: 2023-04-03T06:25:29Z
  date_updated: 2023-04-03T06:25:29Z
  file_id: '12796'
  file_name: 2023_ProceedingsRoyalSocietyA_Svoboda.pdf
  file_size: 827784
  relation: main_file
  success: 1
file_date_updated: 2023-04-03T06:25:29Z
has_accepted_license: '1'
intvolume: '       479'
isi: 1
issue: '2271'
language:
- iso: eng
month: '03'
oa: 1
oa_version: Published Version
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
publication: 'Proceedings of the Royal Society A: Mathematical, Physical and Engineering
  Sciences'
publication_identifier:
  eissn:
  - 1471-2946
  issn:
  - 1364-5021
publication_status: published
publisher: The Royal Society
quality_controlled: '1'
related_material:
  link:
  - relation: research_data
    url: https://doi.org/10.6084/m9.figshare.21261771.v1
  record:
  - id: '20138'
    relation: dissertation_contains
    status: public
scopus_import: '1'
status: public
title: Coexistence times in the Moran process with environmental heterogeneity
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 4359f0d1-fa6c-11eb-b949-802e58b17ae8
volume: 479
year: '2023'
...
---
_id: '12833'
abstract:
- lang: eng
  text: 'The input to the token swapping problem is a graph with vertices v1, v2,
    . . . , vn, and n tokens with labels 1,2, . . . , n, one on each vertex. The goal
    is to get token i to vertex vi for all i= 1, . . . , n using a minimum number
    of swaps, where a swap exchanges the tokens on the endpoints of an edge.Token
    swapping on a tree, also known as “sorting with a transposition tree,” is not
    known to be in P nor NP-complete. We present some partial results: 1. An optimum
    swap sequence may need to perform a swap on a leaf vertex that has the correct
    token (a “happy leaf”), disproving a conjecture of Vaughan. 2. Any algorithm that
    fixes happy leaves—as all known approximation algorithms for the problem do—has
    approximation factor at least 4/3. Furthermore, the two best-known 2-approximation
    algorithms have approximation factor exactly 2. 3. A generalized problem—weighted
    coloured token swapping—is NP-complete on trees, but solvable in polynomial time
    on paths and stars. In this version, tokens and vertices have colours, and colours
    have weights. The goal is to get every token to a vertex of the same colour, and
    the cost of a swap is the sum of the weights of the two tokens involved.'
acknowledgement: "This work was begun at the University of Waterloo and was partially
  supported by the Natural Sciences and Engineering Council of Canada (NSERC).\r\n"
article_number: '9'
article_processing_charge: No
article_type: original
arxiv: 1
author:
- first_name: Ahmad
  full_name: Biniaz, Ahmad
  last_name: Biniaz
- first_name: Kshitij
  full_name: Jain, Kshitij
  last_name: Jain
- first_name: Anna
  full_name: Lubiw, Anna
  last_name: Lubiw
- first_name: Zuzana
  full_name: Masárová, Zuzana
  id: 45CFE238-F248-11E8-B48F-1D18A9856A87
  last_name: Masárová
  orcid: 0000-0002-6660-1322
- first_name: Tillmann
  full_name: Miltzow, Tillmann
  last_name: Miltzow
- first_name: Debajyoti
  full_name: Mondal, Debajyoti
  last_name: Mondal
- first_name: Anurag Murty
  full_name: Naredla, Anurag Murty
  last_name: Naredla
- first_name: Josef
  full_name: Tkadlec, Josef
  id: 3F24CCC8-F248-11E8-B48F-1D18A9856A87
  last_name: Tkadlec
  orcid: 0000-0002-1097-9684
- first_name: Alexi
  full_name: Turcotte, Alexi
  last_name: Turcotte
citation:
  ama: Biniaz A, Jain K, Lubiw A, et al. Token swapping on trees. <i>Discrete Mathematics
    and Theoretical Computer Science</i>. 2023;24(2). doi:<a href="https://doi.org/10.46298/DMTCS.8383">10.46298/DMTCS.8383</a>
  apa: Biniaz, A., Jain, K., Lubiw, A., Masárová, Z., Miltzow, T., Mondal, D., … Turcotte,
    A. (2023). Token swapping on trees. <i>Discrete Mathematics and Theoretical Computer
    Science</i>. EPI Sciences. <a href="https://doi.org/10.46298/DMTCS.8383">https://doi.org/10.46298/DMTCS.8383</a>
  chicago: Biniaz, Ahmad, Kshitij Jain, Anna Lubiw, Zuzana Masárová, Tillmann Miltzow,
    Debajyoti Mondal, Anurag Murty Naredla, Josef Tkadlec, and Alexi Turcotte. “Token
    Swapping on Trees.” <i>Discrete Mathematics and Theoretical Computer Science</i>.
    EPI Sciences, 2023. <a href="https://doi.org/10.46298/DMTCS.8383">https://doi.org/10.46298/DMTCS.8383</a>.
  ieee: A. Biniaz <i>et al.</i>, “Token swapping on trees,” <i>Discrete Mathematics
    and Theoretical Computer Science</i>, vol. 24, no. 2. EPI Sciences, 2023.
  ista: Biniaz A, Jain K, Lubiw A, Masárová Z, Miltzow T, Mondal D, Naredla AM, Tkadlec
    J, Turcotte A. 2023. Token swapping on trees. Discrete Mathematics and Theoretical
    Computer Science. 24(2), 9.
  mla: Biniaz, Ahmad, et al. “Token Swapping on Trees.” <i>Discrete Mathematics and
    Theoretical Computer Science</i>, vol. 24, no. 2, 9, EPI Sciences, 2023, doi:<a
    href="https://doi.org/10.46298/DMTCS.8383">10.46298/DMTCS.8383</a>.
  short: A. Biniaz, K. Jain, A. Lubiw, Z. Masárová, T. Miltzow, D. Mondal, A.M. Naredla,
    J. Tkadlec, A. Turcotte, Discrete Mathematics and Theoretical Computer Science
    24 (2023).
date_created: 2023-04-16T22:01:08Z
date_published: 2023-01-18T00:00:00Z
date_updated: 2025-01-20T14:05:09Z
day: '18'
ddc:
- '000'
department:
- _id: KrCh
- _id: HeEd
- _id: UlWa
doi: 10.46298/DMTCS.8383
external_id:
  arxiv:
  - '1903.06981'
file:
- access_level: open_access
  checksum: 439102ea4f6e2aeefd7107dfb9ccf532
  content_type: application/pdf
  creator: dernst
  date_created: 2023-04-17T08:10:28Z
  date_updated: 2023-04-17T08:10:28Z
  file_id: '12844'
  file_name: 2022_DMTCS_Biniaz.pdf
  file_size: 2072197
  relation: main_file
  success: 1
file_date_updated: 2023-04-17T08:10:28Z
has_accepted_license: '1'
intvolume: '        24'
issue: '2'
language:
- iso: eng
month: '01'
oa: 1
oa_version: Published Version
publication: Discrete Mathematics and Theoretical Computer Science
publication_identifier:
  eissn:
  - 1365-8050
  issn:
  - 1462-7264
publication_status: published
publisher: EPI Sciences
quality_controlled: '1'
related_material:
  record:
  - id: '7950'
    relation: earlier_version
    status: public
scopus_import: '1'
status: public
title: Token swapping on trees
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 3E5EF7F0-F248-11E8-B48F-1D18A9856A87
volume: 24
year: '2023'
...
---
_id: '12861'
abstract:
- lang: eng
  text: The field of indirect reciprocity investigates how social norms can foster
    cooperation when individuals continuously monitor and assess each other’s social
    interactions. By adhering to certain social norms, cooperating individuals can
    improve their reputation and, in turn, receive benefits from others. Eight social
    norms, known as the “leading eight," have been shown to effectively promote the
    evolution of cooperation as long as information is public and reliable. These
    norms categorize group members as either ’good’ or ’bad’. In this study, we examine
    a scenario where individuals instead assign nuanced reputation scores to each
    other, and only cooperate with those whose reputation exceeds a certain threshold.
    We find both analytically and through simulations that such quantitative assessments
    are error-correcting, thus facilitating cooperation in situations where information
    is private and unreliable. Moreover, our results identify four specific norms
    that are robust to such conditions, and may be relevant for helping to sustain
    cooperation in natural populations.
acknowledgement: 'This work was supported by the European Research Council CoG 863818
  (ForM-SMArt) (to K.C.) and the European Research Council Starting Grant 850529:
  E-DIRECT (to C.H.). L.S. received additional partial support by the Austrian Science
  Fund (FWF) under grant Z211-N23 (Wittgenstein Award), and also thanks the support
  by the Stochastic Analysis and Application Research Center (SAARC) under National
  Research Foundation of Korea grant NRF-2019R1A5A1028324. The authors additionally
  thank Stefan Schmid for providing access to his lab infrastructure at the University
  of Vienna for the purpose of collecting simulation data.'
article_number: '2086'
article_processing_charge: No
article_type: original
author:
- first_name: Laura
  full_name: Schmid, Laura
  id: 38B437DE-F248-11E8-B48F-1D18A9856A87
  last_name: Schmid
  orcid: 0000-0002-6978-7329
- first_name: Farbod
  full_name: Ekbatani, Farbod
  last_name: Ekbatani
- first_name: Christian
  full_name: Hilbe, Christian
  id: 2FDF8F3C-F248-11E8-B48F-1D18A9856A87
  last_name: Hilbe
  orcid: 0000-0001-5116-955X
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
citation:
  ama: Schmid L, Ekbatani F, Hilbe C, Chatterjee K. Quantitative assessment can stabilize
    indirect reciprocity under imperfect information. <i>Nature Communications</i>.
    2023;14. doi:<a href="https://doi.org/10.1038/s41467-023-37817-x">10.1038/s41467-023-37817-x</a>
  apa: Schmid, L., Ekbatani, F., Hilbe, C., &#38; Chatterjee, K. (2023). Quantitative
    assessment can stabilize indirect reciprocity under imperfect information. <i>Nature
    Communications</i>. Springer Nature. <a href="https://doi.org/10.1038/s41467-023-37817-x">https://doi.org/10.1038/s41467-023-37817-x</a>
  chicago: Schmid, Laura, Farbod Ekbatani, Christian Hilbe, and Krishnendu Chatterjee.
    “Quantitative Assessment Can Stabilize Indirect Reciprocity under Imperfect Information.”
    <i>Nature Communications</i>. Springer Nature, 2023. <a href="https://doi.org/10.1038/s41467-023-37817-x">https://doi.org/10.1038/s41467-023-37817-x</a>.
  ieee: L. Schmid, F. Ekbatani, C. Hilbe, and K. Chatterjee, “Quantitative assessment
    can stabilize indirect reciprocity under imperfect information,” <i>Nature Communications</i>,
    vol. 14. Springer Nature, 2023.
  ista: Schmid L, Ekbatani F, Hilbe C, Chatterjee K. 2023. Quantitative assessment
    can stabilize indirect reciprocity under imperfect information. Nature Communications.
    14, 2086.
  mla: Schmid, Laura, et al. “Quantitative Assessment Can Stabilize Indirect Reciprocity
    under Imperfect Information.” <i>Nature Communications</i>, vol. 14, 2086, Springer
    Nature, 2023, doi:<a href="https://doi.org/10.1038/s41467-023-37817-x">10.1038/s41467-023-37817-x</a>.
  short: L. Schmid, F. Ekbatani, C. Hilbe, K. Chatterjee, Nature Communications 14
    (2023).
date_created: 2023-04-23T22:01:03Z
date_published: 2023-04-12T00:00:00Z
date_updated: 2025-04-15T06:26:15Z
day: '12'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1038/s41467-023-37817-x
ec_funded: 1
external_id:
  isi:
  - '001003644100020'
  pmid:
  - '37045828'
file:
- access_level: open_access
  checksum: a4b3b7b36fbef068cabf4fb99501fef6
  content_type: application/pdf
  creator: dernst
  date_created: 2023-04-25T09:13:53Z
  date_updated: 2023-04-25T09:13:53Z
  file_id: '12868'
  file_name: 2023_NatureComm_Schmid.pdf
  file_size: 1786475
  relation: main_file
  success: 1
file_date_updated: 2023-04-25T09:13:53Z
has_accepted_license: '1'
intvolume: '        14'
isi: 1
language:
- iso: eng
month: '04'
oa: 1
oa_version: Published Version
pmid: 1
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
- _id: 25F42A32-B435-11E9-9278-68D0E5697425
  call_identifier: FWF
  grant_number: Z211
  name: Formal methods for the design and analysis of complex systems
publication: Nature Communications
publication_identifier:
  eissn:
  - 2041-1723
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
scopus_import: '1'
status: public
title: Quantitative assessment can stabilize indirect reciprocity under imperfect
  information
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 4359f0d1-fa6c-11eb-b949-802e58b17ae8
volume: 14
year: '2023'
...
---
_id: '13139'
abstract:
- lang: eng
  text: A classical problem for Markov chains is determining their stationary (or
    steady-state) distribution. This problem has an equally classical solution based
    on eigenvectors and linear equation systems. However, this approach does not scale
    to large instances, and iterative solutions are desirable. It turns out that a
    naive approach, as used by current model checkers, may yield completely wrong
    results. We present a new approach, which utilizes recent advances in partial
    exploration and mean payoff computation to obtain a correct, converging approximation.
alternative_title:
- LNCS
article_processing_charge: No
arxiv: 1
author:
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
citation:
  ama: 'Meggendorfer T. Correct approximation of stationary distributions. In: <i>TACAS
    2023: Tools and Algorithms for the Construction and Analysis of Systems</i>. Vol
    13993. Springer Nature; 2023:489-507. doi:<a href="https://doi.org/10.1007/978-3-031-30823-9_25">10.1007/978-3-031-30823-9_25</a>'
  apa: 'Meggendorfer, T. (2023). Correct approximation of stationary distributions.
    In <i>TACAS 2023: Tools and Algorithms for the Construction and Analysis of Systems</i>
    (Vol. 13993, pp. 489–507). Paris, France: Springer Nature. <a href="https://doi.org/10.1007/978-3-031-30823-9_25">https://doi.org/10.1007/978-3-031-30823-9_25</a>'
  chicago: 'Meggendorfer, Tobias. “Correct Approximation of Stationary Distributions.”
    In <i>TACAS 2023: Tools and Algorithms for the Construction and Analysis of Systems</i>,
    13993:489–507. Springer Nature, 2023. <a href="https://doi.org/10.1007/978-3-031-30823-9_25">https://doi.org/10.1007/978-3-031-30823-9_25</a>.'
  ieee: 'T. Meggendorfer, “Correct approximation of stationary distributions,” in
    <i>TACAS 2023: Tools and Algorithms for the Construction and Analysis of Systems</i>,
    Paris, France, 2023, vol. 13993, pp. 489–507.'
  ista: 'Meggendorfer T. 2023. Correct approximation of stationary distributions.
    TACAS 2023: Tools and Algorithms for the Construction and Analysis of Systems.
    TACAS: Tools and Algorithms for the Construction and Analysis of Systems, LNCS,
    vol. 13993, 489–507.'
  mla: 'Meggendorfer, Tobias. “Correct Approximation of Stationary Distributions.”
    <i>TACAS 2023: Tools and Algorithms for the Construction and Analysis of Systems</i>,
    vol. 13993, Springer Nature, 2023, pp. 489–507, doi:<a href="https://doi.org/10.1007/978-3-031-30823-9_25">10.1007/978-3-031-30823-9_25</a>.'
  short: 'T. Meggendorfer, in:, TACAS 2023: Tools and Algorithms for the Construction
    and Analysis of Systems, Springer Nature, 2023, pp. 489–507.'
conference:
  end_date: 2023-04-27
  location: Paris, France
  name: 'TACAS: Tools and Algorithms for the Construction and Analysis of Systems'
  start_date: 2023-04-22
corr_author: '1'
date_created: 2023-06-18T22:00:46Z
date_published: 2023-04-22T00:00:00Z
date_updated: 2025-09-09T12:28:12Z
day: '22'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1007/978-3-031-30823-9_25
external_id:
  arxiv:
  - '2301.08137'
  isi:
  - '001288688000025'
file:
- access_level: open_access
  checksum: 59f707a3949c03793251b0d04c62542a
  content_type: application/pdf
  creator: dernst
  date_created: 2023-06-19T07:18:40Z
  date_updated: 2023-06-19T07:18:40Z
  file_id: '13148'
  file_name: 2023_LNCS_Meggendorfer.pdf
  file_size: 521951
  relation: main_file
  success: 1
file_date_updated: 2023-06-19T07:18:40Z
has_accepted_license: '1'
intvolume: '     13993'
isi: 1
language:
- iso: eng
month: '04'
oa: 1
oa_version: Published Version
page: 489-507
publication: 'TACAS 2023: Tools and Algorithms for the Construction and Analysis of
  Systems'
publication_identifier:
  eissn:
  - 1611-3349
  isbn:
  - '9783031308222'
  issn:
  - 0302-9743
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
related_material:
  record:
  - id: '14990'
    relation: research_data
    status: public
scopus_import: '1'
status: public
title: Correct approximation of stationary distributions
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 317138e5-6ab7-11ef-aa6d-ffef3953e345
volume: 13993
year: '2023'
...
---
_id: '13142'
abstract:
- lang: eng
  text: Reinforcement learning has received much attention for learning controllers
    of deterministic systems. We consider a learner-verifier framework for stochastic
    control systems and survey recent methods that formally guarantee a conjunction
    of reachability and safety properties. Given a property and a lower bound on the
    probability of the property being satisfied, our framework jointly learns a control
    policy and a formal certificate to ensure the satisfaction of the property with
    a desired probability threshold. Both the control policy and the formal certificate
    are continuous functions from states to reals, which are learned as parameterized
    neural networks. While in the deterministic case, the certificates are invariant
    and barrier functions for safety, or Lyapunov and ranking functions for liveness,
    in the stochastic case the certificates are supermartingales. For certificate
    verification, we use interval arithmetic abstract interpretation to bound the
    expected values of neural network functions.
acknowledgement: This work was supported in part by the ERC-2020-AdG 101020093, ERC
  CoG 863818 (FoRM-SMArt) and the European Union’s Horizon 2020 research and innovation
  programme under the Marie Skłodowska-Curie Grant Agreement No. 665385.
alternative_title:
- LNCS
article_processing_charge: No
author:
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Thomas A
  full_name: Henzinger, Thomas A
  id: 40876CD8-F248-11E8-B48F-1D18A9856A87
  last_name: Henzinger
  orcid: 0000-0002-2985-7724
- first_name: Mathias
  full_name: Lechner, Mathias
  id: 3DC22916-F248-11E8-B48F-1D18A9856A87
  last_name: Lechner
- first_name: Dorde
  full_name: Zikelic, Dorde
  id: 294AA7A6-F248-11E8-B48F-1D18A9856A87
  last_name: Zikelic
  orcid: 0000-0002-4681-1699
citation:
  ama: 'Chatterjee K, Henzinger TA, Lechner M, Zikelic D. A learner-verifier framework
    for neural network controllers and certificates of stochastic systems. In: <i>Tools
    and Algorithms for the Construction and Analysis of Systems </i>. Vol 13993. Springer
    Nature; 2023:3-25. doi:<a href="https://doi.org/10.1007/978-3-031-30823-9_1">10.1007/978-3-031-30823-9_1</a>'
  apa: 'Chatterjee, K., Henzinger, T. A., Lechner, M., &#38; Zikelic, D. (2023). A
    learner-verifier framework for neural network controllers and certificates of
    stochastic systems. In <i>Tools and Algorithms for the Construction and Analysis
    of Systems </i> (Vol. 13993, pp. 3–25). Paris, France: Springer Nature. <a href="https://doi.org/10.1007/978-3-031-30823-9_1">https://doi.org/10.1007/978-3-031-30823-9_1</a>'
  chicago: Chatterjee, Krishnendu, Thomas A Henzinger, Mathias Lechner, and Dorde
    Zikelic. “A Learner-Verifier Framework for Neural Network Controllers and Certificates
    of Stochastic Systems.” In <i>Tools and Algorithms for the Construction and Analysis
    of Systems </i>, 13993:3–25. Springer Nature, 2023. <a href="https://doi.org/10.1007/978-3-031-30823-9_1">https://doi.org/10.1007/978-3-031-30823-9_1</a>.
  ieee: K. Chatterjee, T. A. Henzinger, M. Lechner, and D. Zikelic, “A learner-verifier
    framework for neural network controllers and certificates of stochastic systems,”
    in <i>Tools and Algorithms for the Construction and Analysis of Systems </i>,
    Paris, France, 2023, vol. 13993, pp. 3–25.
  ista: 'Chatterjee K, Henzinger TA, Lechner M, Zikelic D. 2023. A learner-verifier
    framework for neural network controllers and certificates of stochastic systems.
    Tools and Algorithms for the Construction and Analysis of Systems . TACAS: Tools
    and Algorithms for the Construction and Analysis of Systems, LNCS, vol. 13993,
    3–25.'
  mla: Chatterjee, Krishnendu, et al. “A Learner-Verifier Framework for Neural Network
    Controllers and Certificates of Stochastic Systems.” <i>Tools and Algorithms for
    the Construction and Analysis of Systems </i>, vol. 13993, Springer Nature, 2023,
    pp. 3–25, doi:<a href="https://doi.org/10.1007/978-3-031-30823-9_1">10.1007/978-3-031-30823-9_1</a>.
  short: K. Chatterjee, T.A. Henzinger, M. Lechner, D. Zikelic, in:, Tools and Algorithms
    for the Construction and Analysis of Systems , Springer Nature, 2023, pp. 3–25.
conference:
  end_date: 2023-04-27
  location: Paris, France
  name: 'TACAS: Tools and Algorithms for the Construction and Analysis of Systems'
  start_date: 2023-04-22
corr_author: '1'
date_created: 2023-06-18T22:00:47Z
date_published: 2023-04-22T00:00:00Z
date_updated: 2025-09-09T12:29:26Z
day: '22'
ddc:
- '000'
department:
- _id: KrCh
- _id: ToHe
doi: 10.1007/978-3-031-30823-9_1
ec_funded: 1
external_id:
  isi:
  - '001288688000001'
file:
- access_level: open_access
  checksum: 3d8a8bb24d211bc83360dfc2fd744307
  content_type: application/pdf
  creator: dernst
  date_created: 2023-06-19T08:29:30Z
  date_updated: 2023-06-19T08:29:30Z
  file_id: '13150'
  file_name: 2023_LNCS_Chatterjee.pdf
  file_size: 528455
  relation: main_file
  success: 1
file_date_updated: 2023-06-19T08:29:30Z
has_accepted_license: '1'
intvolume: '     13993'
isi: 1
language:
- iso: eng
month: '04'
oa: 1
oa_version: Published Version
page: 3-25
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
- _id: 2564DBCA-B435-11E9-9278-68D0E5697425
  call_identifier: H2020
  grant_number: '665385'
  name: International IST Doctoral Program
publication: 'Tools and Algorithms for the Construction and Analysis of Systems '
publication_identifier:
  eissn:
  - 1611-3349
  isbn:
  - '9783031308222'
  issn:
  - 0302-9743
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
scopus_import: '1'
status: public
title: A learner-verifier framework for neural network controllers and certificates
  of stochastic systems
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: conference
user_id: 317138e5-6ab7-11ef-aa6d-ffef3953e345
volume: 13993
year: '2023'
...
---
_id: '13258'
abstract:
- lang: eng
  text: Many human interactions feature the characteristics of social dilemmas where
    individual actions have consequences for the group and the environment. The feedback
    between behavior and environment can be studied with the framework of stochastic
    games. In stochastic games, the state of the environment can change, depending
    on the choices made by group members. Past work suggests that such feedback can
    reinforce cooperative behaviors. In particular, cooperation can evolve in stochastic
    games even if it is infeasible in each separate repeated game. In stochastic games,
    participants have an interest in conditioning their strategies on the state of
    the environment. Yet in many applications, precise information about the state
    could be scarce. Here, we study how the availability of information (or lack thereof)
    shapes evolution of cooperation. Already for simple examples of two state games
    we find surprising effects. In some cases, cooperation is only possible if there
    is precise information about the state of the environment. In other cases, cooperation
    is most abundant when there is no information about the state of the environment.
    We systematically analyze all stochastic games of a given complexity class, to
    determine when receiving information about the environment is better, neutral,
    or worse for evolution of cooperation.
acknowledgement: 'This work was supported by the European Research Council CoG 863818
  (ForM-SMArt) (to K.C.), the European Research Council Starting Grant 850529: E-DIRECT
  (to C.H.), the European Union’s Horizon 2020 research and innovation program under
  the Marie Sklodowska-Curie Grant Agreement #754411 and the French Agence Nationale
  de la Recherche (under the Investissement d’Avenir programme, ANR-17-EURE-0010)
  (to M.K.).'
article_number: '4153'
article_processing_charge: Yes
article_type: original
author:
- first_name: Maria
  full_name: Kleshnina, Maria
  id: 4E21749C-F248-11E8-B48F-1D18A9856A87
  last_name: Kleshnina
- first_name: Christian
  full_name: Hilbe, Christian
  id: 2FDF8F3C-F248-11E8-B48F-1D18A9856A87
  last_name: Hilbe
  orcid: 0000-0001-5116-955X
- first_name: Stepan
  full_name: Simsa, Stepan
  id: 409d615c-2f95-11ee-b934-90a352102c1e
  last_name: Simsa
  orcid: 0000-0001-6687-1210
- first_name: Krishnendu
  full_name: Chatterjee, Krishnendu
  id: 2E5DCA20-F248-11E8-B48F-1D18A9856A87
  last_name: Chatterjee
  orcid: 0000-0002-4561-241X
- first_name: Martin A.
  full_name: Nowak, Martin A.
  last_name: Nowak
citation:
  ama: Kleshnina M, Hilbe C, Simsa S, Chatterjee K, Nowak MA. The effect of environmental
    information on evolution of cooperation in stochastic games. <i>Nature Communications</i>.
    2023;14. doi:<a href="https://doi.org/10.1038/s41467-023-39625-9">10.1038/s41467-023-39625-9</a>
  apa: Kleshnina, M., Hilbe, C., Simsa, S., Chatterjee, K., &#38; Nowak, M. A. (2023).
    The effect of environmental information on evolution of cooperation in stochastic
    games. <i>Nature Communications</i>. Springer Nature. <a href="https://doi.org/10.1038/s41467-023-39625-9">https://doi.org/10.1038/s41467-023-39625-9</a>
  chicago: Kleshnina, Maria, Christian Hilbe, Stepan Simsa, Krishnendu Chatterjee,
    and Martin A. Nowak. “The Effect of Environmental Information on Evolution of
    Cooperation in Stochastic Games.” <i>Nature Communications</i>. Springer Nature,
    2023. <a href="https://doi.org/10.1038/s41467-023-39625-9">https://doi.org/10.1038/s41467-023-39625-9</a>.
  ieee: M. Kleshnina, C. Hilbe, S. Simsa, K. Chatterjee, and M. A. Nowak, “The effect
    of environmental information on evolution of cooperation in stochastic games,”
    <i>Nature Communications</i>, vol. 14. Springer Nature, 2023.
  ista: Kleshnina M, Hilbe C, Simsa S, Chatterjee K, Nowak MA. 2023. The effect of
    environmental information on evolution of cooperation in stochastic games. Nature
    Communications. 14, 4153.
  mla: Kleshnina, Maria, et al. “The Effect of Environmental Information on Evolution
    of Cooperation in Stochastic Games.” <i>Nature Communications</i>, vol. 14, 4153,
    Springer Nature, 2023, doi:<a href="https://doi.org/10.1038/s41467-023-39625-9">10.1038/s41467-023-39625-9</a>.
  short: M. Kleshnina, C. Hilbe, S. Simsa, K. Chatterjee, M.A. Nowak, Nature Communications
    14 (2023).
corr_author: '1'
date_created: 2023-07-23T22:01:11Z
date_published: 2023-07-12T00:00:00Z
date_updated: 2025-04-14T07:43:55Z
day: '12'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.1038/s41467-023-39625-9
ec_funded: 1
external_id:
  isi:
  - '001029450400031'
  pmid:
  - '37438341'
file:
- access_level: open_access
  checksum: 5aceefdfe76686267b93ae4fe81899f1
  content_type: application/pdf
  creator: dernst
  date_created: 2023-07-31T11:32:36Z
  date_updated: 2023-07-31T11:32:36Z
  file_id: '13337'
  file_name: 2023_NatureComm_Kleshnina.pdf
  file_size: 1601682
  relation: main_file
  success: 1
file_date_updated: 2023-07-31T11:32:36Z
has_accepted_license: '1'
intvolume: '        14'
isi: 1
language:
- iso: eng
month: '07'
oa: 1
oa_version: Published Version
pmid: 1
project:
- _id: 0599E47C-7A3F-11EA-A408-12923DDC885E
  call_identifier: H2020
  grant_number: '863818'
  name: 'Formal Methods for Stochastic Models: Algorithms and Applications'
- _id: 260C2330-B435-11E9-9278-68D0E5697425
  call_identifier: H2020
  grant_number: '754411'
  name: ISTplus - Postdoctoral Fellowships
publication: Nature Communications
publication_identifier:
  eissn:
  - 2041-1723
publication_status: published
publisher: Springer Nature
quality_controlled: '1'
related_material:
  record:
  - id: '13336'
    relation: research_data
    status: public
scopus_import: '1'
status: public
title: The effect of environmental information on evolution of cooperation in stochastic
  games
tmp:
  image: /images/cc_by.png
  legal_code_url: https://creativecommons.org/licenses/by/4.0/legalcode
  name: Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
  short: CC BY (4.0)
type: journal_article
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 14
year: '2023'
...
---
_id: '13336'
article_processing_charge: No
author:
- first_name: Maria
  full_name: Kleshnina, Maria
  id: 4E21749C-F248-11E8-B48F-1D18A9856A87
  last_name: Kleshnina
citation:
  ama: 'Kleshnina M. kleshnina/stochgames_info: The effect of environmental information
    on evolution of cooperation in stochastic games. 2023. doi:<a href="https://doi.org/10.5281/ZENODO.8059564">10.5281/ZENODO.8059564</a>'
  apa: 'Kleshnina, M. (2023). kleshnina/stochgames_info: The effect of environmental
    information on evolution of cooperation in stochastic games. Zenodo. <a href="https://doi.org/10.5281/ZENODO.8059564">https://doi.org/10.5281/ZENODO.8059564</a>'
  chicago: 'Kleshnina, Maria. “Kleshnina/Stochgames_info: The Effect of Environmental
    Information on Evolution of Cooperation in Stochastic Games.” Zenodo, 2023. <a
    href="https://doi.org/10.5281/ZENODO.8059564">https://doi.org/10.5281/ZENODO.8059564</a>.'
  ieee: 'M. Kleshnina, “kleshnina/stochgames_info: The effect of environmental information
    on evolution of cooperation in stochastic games.” Zenodo, 2023.'
  ista: 'Kleshnina M. 2023. kleshnina/stochgames_info: The effect of environmental
    information on evolution of cooperation in stochastic games, Zenodo, <a href="https://doi.org/10.5281/ZENODO.8059564">10.5281/ZENODO.8059564</a>.'
  mla: 'Kleshnina, Maria. <i>Kleshnina/Stochgames_info: The Effect of Environmental
    Information on Evolution of Cooperation in Stochastic Games</i>. Zenodo, 2023,
    doi:<a href="https://doi.org/10.5281/ZENODO.8059564">10.5281/ZENODO.8059564</a>.'
  short: M. Kleshnina, (2023).
corr_author: '1'
date_created: 2023-07-31T11:30:46Z
date_published: 2023-06-20T00:00:00Z
date_updated: 2025-04-15T06:54:58Z
day: '20'
ddc:
- '000'
department:
- _id: KrCh
doi: 10.5281/ZENODO.8059564
main_file_link:
- open_access: '1'
  url: https://doi.org/10.5281/zenodo.8059564
month: '06'
oa: 1
oa_version: Published Version
publisher: Zenodo
related_material:
  record:
  - id: '13258'
    relation: used_in_publication
    status: public
status: public
title: 'kleshnina/stochgames_info: The effect of environmental information on evolution
  of cooperation in stochastic games'
type: research_data_reference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
year: '2023'
...
---
_id: '13967'
abstract:
- lang: eng
  text: 'A classic solution technique for Markov decision processes (MDP) and stochastic
    games (SG) is value iteration (VI). Due to its good practical performance, this
    approximative approach is typically preferred over exact techniques, even though
    no practical bounds on the imprecision of the result could be given until recently.
    As a consequence, even the most used model checkers could return arbitrarily wrong
    results. Over the past decade, different works derived stopping criteria, indicating
    when the precision reaches the desired level, for various settings, in particular
    MDP with reachability, total reward, and mean payoff, and SG with reachability.In
    this paper, we provide the first stopping criteria for VI on SG with total reward
    and mean payoff, yielding the first anytime algorithms in these settings. To this
    end, we provide the solution in two flavours: First through a reduction to the
    MDP case and second directly on SG. The former is simpler and automatically utilizes
    any advances on MDP. The latter allows for more local computations, heading towards
    better practical efficiency.Our solution unifies the previously mentioned approaches
    for MDP and SG and their underlying ideas. To achieve this, we isolate objective-specific
    subroutines as well as identify objective-independent concepts. These structural
    concepts, while surprisingly simple, form the very essence of the unified solution.'
acknowledgement: This research was funded in part by DFG projects 383882557 “SUV”
  and 427755713 “GOPro”.
article_processing_charge: No
arxiv: 1
author:
- first_name: Jan
  full_name: Kretinsky, Jan
  id: 44CEF464-F248-11E8-B48F-1D18A9856A87
  last_name: Kretinsky
  orcid: 0000-0002-8122-2881
- first_name: Tobias
  full_name: Meggendorfer, Tobias
  id: b21b0c15-30a2-11eb-80dc-f13ca25802e1
  last_name: Meggendorfer
  orcid: 0000-0002-1712-2165
- first_name: Maximilian
  full_name: Weininger, Maximilian
  id: 02ab0197-cc70-11ed-ab61-918e71f56881
  last_name: Weininger
citation:
  ama: 'Kretinsky J, Meggendorfer T, Weininger M. Stopping criteria for value iteration
    on stochastic games with quantitative objectives. In: <i>38th Annual ACM/IEEE
    Symposium on Logic in Computer Science</i>. Vol 2023. Institute of Electrical
    and Electronics Engineers; 2023. doi:<a href="https://doi.org/10.1109/LICS56636.2023.10175771">10.1109/LICS56636.2023.10175771</a>'
  apa: 'Kretinsky, J., Meggendorfer, T., &#38; Weininger, M. (2023). Stopping criteria
    for value iteration on stochastic games with quantitative objectives. In <i>38th
    Annual ACM/IEEE Symposium on Logic in Computer Science</i> (Vol. 2023). Boston,
    MA, United States: Institute of Electrical and Electronics Engineers. <a href="https://doi.org/10.1109/LICS56636.2023.10175771">https://doi.org/10.1109/LICS56636.2023.10175771</a>'
  chicago: Kretinsky, Jan, Tobias Meggendorfer, and Maximilian Weininger. “Stopping
    Criteria for Value Iteration on Stochastic Games with Quantitative Objectives.”
    In <i>38th Annual ACM/IEEE Symposium on Logic in Computer Science</i>, Vol. 2023.
    Institute of Electrical and Electronics Engineers, 2023. <a href="https://doi.org/10.1109/LICS56636.2023.10175771">https://doi.org/10.1109/LICS56636.2023.10175771</a>.
  ieee: J. Kretinsky, T. Meggendorfer, and M. Weininger, “Stopping criteria for value
    iteration on stochastic games with quantitative objectives,” in <i>38th Annual
    ACM/IEEE Symposium on Logic in Computer Science</i>, Boston, MA, United States,
    2023, vol. 2023.
  ista: 'Kretinsky J, Meggendorfer T, Weininger M. 2023. Stopping criteria for value
    iteration on stochastic games with quantitative objectives. 38th Annual ACM/IEEE
    Symposium on Logic in Computer Science. LICS: Logic in Computer Science vol. 2023.'
  mla: Kretinsky, Jan, et al. “Stopping Criteria for Value Iteration on Stochastic
    Games with Quantitative Objectives.” <i>38th Annual ACM/IEEE Symposium on Logic
    in Computer Science</i>, vol. 2023, Institute of Electrical and Electronics Engineers,
    2023, doi:<a href="https://doi.org/10.1109/LICS56636.2023.10175771">10.1109/LICS56636.2023.10175771</a>.
  short: J. Kretinsky, T. Meggendorfer, M. Weininger, in:, 38th Annual ACM/IEEE Symposium
    on Logic in Computer Science, Institute of Electrical and Electronics Engineers,
    2023.
conference:
  end_date: 2023-06-29
  location: Boston, MA, United States
  name: 'LICS: Logic in Computer Science'
  start_date: 2023-06-26
corr_author: '1'
date_created: 2023-08-06T22:01:10Z
date_published: 2023-07-01T00:00:00Z
date_updated: 2025-07-10T11:50:43Z
day: '01'
department:
- _id: KrCh
doi: 10.1109/LICS56636.2023.10175771
external_id:
  arxiv:
  - '2304.09930'
  isi:
  - '001036707700042'
intvolume: '      2023'
isi: 1
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://doi.org/10.48550/arXiv.2304.09930
month: '07'
oa: 1
oa_version: Preprint
publication: 38th Annual ACM/IEEE Symposium on Logic in Computer Science
publication_identifier:
  isbn:
  - '9798350335873'
  issn:
  - 1043-6871
publication_status: published
publisher: Institute of Electrical and Electronics Engineers
quality_controlled: '1'
scopus_import: '1'
status: public
title: Stopping criteria for value iteration on stochastic games with quantitative
  objectives
type: conference
user_id: 2DF688A6-F248-11E8-B48F-1D18A9856A87
volume: 2023
year: '2023'
...