Unilateral incentive alignment in two-agent stochastic games
Mcavoy A, Sehwag UM, Hilbe C, Chatterjee K, Barfuss W, Su Q, Leonard NE, Plotkin JB. 2025. Unilateral incentive alignment in two-agent stochastic games. Proceedings of the National Academy of Sciences. 122(25), e2319927121.
Download
              
            
            
            
            Journal Article
            
            
            
            | Published
            
            
              |              English
              
            
          
        Scopus indexed
Author
        
      Mcavoy, Alex;
      Sehwag, Udari Madhushani;
      Hilbe, ChristianISTA  ;
      Chatterjee, KrishnenduISTA
;
      Chatterjee, KrishnenduISTA  ;
      Barfuss, Wolfram;
      Su, Qi;
      Leonard, Naomi Ehrich;
      Plotkin, Joshua B.
;
      Barfuss, Wolfram;
      Su, Qi;
      Leonard, Naomi Ehrich;
      Plotkin, Joshua B.
 ;
      Chatterjee, KrishnenduISTA
;
      Chatterjee, KrishnenduISTA  ;
      Barfuss, Wolfram;
      Su, Qi;
      Leonard, Naomi Ehrich;
      Plotkin, Joshua B.
;
      Barfuss, Wolfram;
      Su, Qi;
      Leonard, Naomi Ehrich;
      Plotkin, Joshua B.Department
    Abstract
    Multiagent learning is challenging when agents face mixed-motivation interactions, where conflicts of interest arise as agents independently try to optimize their respective outcomes. Recent advancements in evolutionary game theory have identified a class of “zero-determinant” strategies, which confer an agent with significant unilateral control over outcomes in repeated games. Building on these insights, we present a comprehensive generalization of zero-determinant strategies to stochastic games, encompassing dynamic environments. We propose an algorithm that allows an agent to discover strategies enforcing predetermined linear (or approximately linear) payoff relationships. Of particular interest is the relationship in which both payoffs are equal, which serves as a proxy for fairness in symmetric games. We demonstrate that an agent can discover strategies enforcing such relationships through experience alone, without coordinating with an opponent. In finding and using such a strategy, an agent (“enforcer”) can incentivize optimal and equitable outcomes, circumventing potential exploitation. In particular, from the opponent’s viewpoint, the enforcer transforms a mixed-motivation problem into a cooperative problem, paving the way for more collaboration and fairness in multiagent systems.
    
  Publishing Year
    
  Date Published
    2025-06-24
  Journal Title
    Proceedings of the National Academy of Sciences
  Publisher
    National Academy of Sciences
  Acknowledgement
    We gratefully acknowledge the support from the European Research Council (Starting Grant 850529: E-DIRECT) and the Max Planck Society (C.H.), the European Research Council (Consolidator Grant 863818: ForM-SMArt) (K.C.), the Shanghai Pujiang Program (No. 23PJ1405500) (Q.S.), the Army Research Office (Grant No. W911NF-18-1-0325) (N.E.L.), and the John Templeton Foundation (Grant No. 62281) (J.B.P.).
  Volume
      122
    Issue
      25
    Article Number
      e2319927121
    ISSN
    
  eISSN
    
  IST-REx-ID
    
  Cite this
Mcavoy A, Sehwag UM, Hilbe C, et al. Unilateral incentive alignment in two-agent stochastic games. Proceedings of the National Academy of Sciences. 2025;122(25). doi:10.1073/pnas.2319927121
    Mcavoy, A., Sehwag, U. M., Hilbe, C., Chatterjee, K., Barfuss, W., Su, Q., … Plotkin, J. B. (2025). Unilateral incentive alignment in two-agent stochastic games. Proceedings of the National Academy of Sciences. National Academy of Sciences. https://doi.org/10.1073/pnas.2319927121
    Mcavoy, Alex, Udari Madhushani Sehwag, Christian Hilbe, Krishnendu Chatterjee, Wolfram Barfuss, Qi Su, Naomi Ehrich Leonard, and Joshua B. Plotkin. “Unilateral Incentive Alignment in Two-Agent Stochastic Games.” Proceedings of the National Academy of Sciences. National Academy of Sciences, 2025. https://doi.org/10.1073/pnas.2319927121.
    A. Mcavoy et al., “Unilateral incentive alignment in two-agent stochastic games,” Proceedings of the National Academy of Sciences, vol. 122, no. 25. National Academy of Sciences, 2025.
    Mcavoy A, Sehwag UM, Hilbe C, Chatterjee K, Barfuss W, Su Q, Leonard NE, Plotkin JB. 2025. Unilateral incentive alignment in two-agent stochastic games. Proceedings of the National Academy of Sciences. 122(25), e2319927121.
    Mcavoy, Alex, et al. “Unilateral Incentive Alignment in Two-Agent Stochastic Games.” Proceedings of the National Academy of Sciences, vol. 122, no. 25, e2319927121, National Academy of Sciences, 2025, doi:10.1073/pnas.2319927121.
  
      All files available under the following license(s):
      
      
        
          
        
      
      
    
  
            Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0):
          
        
      Main File(s)
    
  File Name
    
        
          
          
            2025_PNAS_McAvoy.pdf
          
        
       29.53 MB
    
  Access Level
     Open Access
 Open Access
    Date Uploaded
    
      2025-07-08
    
  MD5 Checksum
    
      3b35befd959a3e37aa9080a64a6afaf3
    
  Export
Marked PublicationsOpen Data ISTA Research Explorer
Web of Science
View record in Web of Science®Sources
 PMID: 40523172
PMID: 40523172
	    PubMed | Europe PMC

 Google Scholar
Google Scholar