extensive form games
Recently Published Documents


TOTAL DOCUMENTS

157
(FIVE YEARS 23)

H-INDEX

15
(FIVE YEARS 1)

Games ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 2
Author(s):  
Valeria Zahoransky ◽  
Julian Gutierrez ◽  
Paul Harrenstein ◽  
Michael Wooldridge

We introduce a non-cooperative game model in which players’ decision nodes are partially ordered by a dependence relation, which directly captures informational dependencies in the game. In saying that a decision node v is dependent on decision nodes v1,…,vk, we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v1,…,vk. Although partial order games are no more expressive than extensive form games of imperfect information (we show that any partial order game can be reduced to a strategically equivalent extensive form game of imperfect information, though possibly at the cost of an exponential blowup in the size of the game), they provide a more natural and compact representation for many strategic settings of interest. After introducing the game model, we investigate the relationship to extensive form games of imperfect information, the problem of computing Nash equilibria, and conditions that enable backwards induction in this new model.


Author(s):  
Andrea Celli ◽  
Alberto Marchesi ◽  
Gabriele Farina ◽  
Nicola Gatti

The existence of uncoupled no-regret learning dynamics converging to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form games generalize normal-form games by modeling both sequential and simultaneous moves, as well as imperfect information. Because of the sequential nature and the presence of private information, correlation in extensive-form games possesses significantly different properties than in normal-form games. The extensive-form correlated equilibrium (EFCE) is the natural extensive-form counterpart to the classical notion of correlated equilibrium in normal-form games. Compared to the latter, the constraints that define the set of EFCEs are significantly more complex, as the correlation device ({\em a.k.a.} mediator) must take into account the evolution of beliefs of each player as they make observations throughout the game. Due to this additional complexity, the existence of uncoupled learning dynamics leading to an EFCE has remained a challenging open research question for a long time. In this article, we settle that question by giving the first uncoupled no-regret dynamics which provably converge to the set of EFCEs in n-player general-sum extensive-form games with perfect recall. We show that each iterate can be computed in time polynomial in the size of the game tree, and that, when all players play repeatedly according to our learning dynamics, the empirical frequency of play after T game repetitions is guaranteed to be a O(T^-1/2)-approximate EFCE with high probability, and an EFCE almost surely in the limit.


2021 ◽  
Vol 14 ◽  
pp. 257-272
Author(s):  
Denis Kuzyutin ◽  
◽  
Yulia Skorodumova ◽  
Nadezhda Smirnova ◽  
◽  
...  

A novel approach to sustainable cooperation called subgameperfect core (S-P Core) was introduced by P. Chander and M. Wooders in 2020 for n-person extensive-form games with terminal payoffs. This solution concept incorporates both subgame perfection and cooperation incentives and implies certain distribution of the total players' payoff at the terminal node of the cooperative history. We use in the paper an extension of the S-P Core to the class of extensive games with payoffs defined at all nodes of the game tree that is based on designing an appropriate payoff distribution procedure β and its implementation when a game unfolds along the cooperative history. The difference is that in accordance with this so-called β-subgameperfect core the players can redistribute total current payoff at each node in the cooperative path. Moreover, a payoff distribution procedure from the β-S-P Core satisfies a number of good properties such as subgame efficiency, non-negativity and strict balance condition. In the paper, we examine different properties of the β-S-P Core, introduce several refinements of this cooperative solution and provide examples of its implementation in extensive-form games. Finally, we consider an application of the β-S-P Core to the symmetric discrete-time alternating-move model of fishery management.


Author(s):  
Jiří Čermák ◽  
Viliam Lisý ◽  
Branislav Bošanský

Information abstraction is one of the methods for tackling large extensive-form games (EFGs). Removing some information available to players reduces the memory required for computing and storing strategies. We present novel domain-independent abstraction methods for creating very coarse abstractions of EFGs that still compute strategies that are (near) optimal in the original game. First, the methods start with an arbitrary abstraction of the original game (domain-specific or the coarsest possible). Next, they iteratively detect which information is required in the abstract game so that a (near) optimal strategy in the original game can be found and include this information into the abstract game. Moreover, the methods are able to exploit imperfect-recall abstractions where players can even forget the history of their own actions. We present two algorithms that follow these steps -- FPIRA, based on fictitious play, and CFR+IRA, based on counterfactual regret minimization. The experimental evaluation confirms that our methods can closely approximate Nash equilibrium of large games using abstraction with only 0.9% of information sets of the original game.


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
Aviad Heifetz ◽  
Martin Meier ◽  
Burkhard C. Schipper

AbstractWe define a cautious version of extensive-form rationalizability for generalized extensive-form games with unawareness that we call prudent rationalizability. It is an extensive-form analog of iterated admissibility. In each round of the procedure, for each tree and each information set of a player a surviving strategy of hers is required to be rational vis-a-vis a belief system with a full-support belief on the opponents' previously surviving strategies that reach that information set. We demonstrate the applicability of prudent rationalizability. In games of disclosure of verifiable information, we show that prudent rationalizability yields unraveling under full awareness but unraveling might fail under unawareness. We compare prudent rationalizability to extensive-form rationalizability. We show that prudent rationalizability may not refine extensive-form rationalizability strategies but conjecture that the paths induced by prudent rationalizable strategy profiles (weakly) refine the set of paths induced by extensive-form rationalizable strategies.


Sign in / Sign up

Export Citation Format

Share Document