Potential game

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

In game theory, a game is said to be a potential game if the incentive of all players to change their strategy can be expressed using a single global function called the potential function. Robert W. Rosenthal created the concept of a congestion game in 1973. Dov Monderer and Lloyd Shapley [1] created the concept of a potential game and proved that every congestion game is a potential game.

The properties of several types of potential games have since been studied. Games can be either ordinal or cardinal potential games. In cardinal games, the difference in individual payoffs for each player from individually changing one's strategy ceteris paribus has to have the same value as the difference in values for the potential function. In ordinal games, only the signs of the differences have to be the same.

The potential function is a useful tool to analyze equilibrium properties of games, since the incentives of all players are mapped into one function, and the set of pure Nash equilibria can be found by locating the local optima of the potential function. Convergence and finite-time convergence of an iterated game towards a Nash equilibrium can also be understood by studying the potential function.

Definition

We will define some notation required for the definition. Let N be the number of players, A the set of action profiles over the action sets A_{i} of each player and u be the payoff function.

A game G=(N,A=A_{1}\times\ldots\times A_{N}, u: A \rightarrow \reals^N) is:

  • an exact potential game if there is a function \Phi: A \rightarrow \reals such that  \forall {a_{-i}\in A_{-i}},\ \forall {a'_{i},\ a''_{i}\in A_{i}},
 \Phi(a'_{i},a_{-i})-\Phi(a''_{i},a_{-i}) = u_{i}(a'_{i},a_{-i})-u_{i}(a''_{i},a_{-i})
That is: when player i switches from action a' to action a'', the change in the potential equals the change in the utility of that player.
  • a weighted potential game if there is a function \Phi: A \rightarrow \reals and a vector w \in \reals_{++}^N such that  \forall {a_{-i}\in A_{-i}},\ \forall {a'_{i},\ a''_{i}\in A_{i}},
 \Phi(a'_{i},a_{-i})-\Phi(a''_{i},a_{-i}) = w_{i}(u_{i}(a'_{i},a_{-i})-u_{i}(a''_{i},a_{-i}))
  • an ordinal potential game if there is a function \Phi: A \rightarrow \reals such that  \forall {a_{-i}\in A_{-i}},\ \forall {a'_{i},\ a''_{i}\in A_{i}},
 u_{i}(a'_{i},a_{-i})-u_{i}(a''_{i},a_{-i})>0 \Leftrightarrow
 \Phi(a'_{i},a_{-i})-\Phi(a''_{i},a_{-i})>0
  • a generalized ordinal potential game if there is a function \Phi: A \rightarrow \reals such that  \forall {a_{-i}\in A_{-i}},\  \forall {a'_{i},\ a''_{i}\in A_{i}},
 u_{i}(a'_{i},a_{-i})-u_{i}(a''_{i},a_{-i})>0 \Rightarrow
 \Phi(a'_{i},a_{-i})-\Phi(a''_{i},a_{-i}) >0
  • a best-response potential game if there is a function \Phi: A \rightarrow \reals such that \forall i\in N,\ \forall {a_{-i}\in A_{-i}},
b_i(a_{-i})=\arg\max_{a_i\in A_i} \Phi(a_i,a_{-i})

where b_i(a_{-i}) is the best payoff for player i given a_{-i}.

A simple example

+1 –1
+1 +b1+w, +b2+w +b1–w, –b2–w
–1 –b1–w, +b2–w –b1+w, –b2+w
Fig. 1: Potential game example

In a 2-player, 2-strategy game with externalities, individual players' payoffs are given by the function ui(si, sj) = bi si + w si sj, where si is players i's strategy, sj is the opponent's strategy, and w is a positive externality from choosing the same strategy. The strategy choices are +1 and −1, as seen in the payoff matrix in Figure 1.

This game has a potential function P(s1, s2) = b1 s1 + b2 s2 + w s1 s2.

If player 1 moves from −1 to +1, the payoff difference is Δu1 = u1(+1, s2) – u1(–1, s2) = 2 b1 + 2 w s2.

The change in potential is ΔP = P(+1, s2) – P(–1, s2) = (b1 + b2 s2 + w s2) – (–b1 + b2 s2 – w s2) = 2 b1 + 2 w s2 = Δu1.

The solution for player 2 is equivalent. Using numerical values b1 = 2, b2 = −1, w = 3, this example transforms into a simple battle of the sexes, as shown in Figure 2. The game has two pure Nash equilibria, (+1, +1) and (−1, −1). These are also the local maxima of the potential function (Figure 3). The only stochastically stable equilibrium is (+1, +1), the global maximum of the potential function.

+1 –1
+1 5, 2 –1, –2
–1 –5, –4 1, 4
Fig. 2: Battle of the sexes (payoffs)
+1 –1
+1 4 0
–1 –6 2
Fig. 3: Battle of the sexes (potentials)

A 2-player, 2-strategy game cannot be a potential game unless


[u_{1}(+1,-1)+u_1(-1,+1)]-[u_1(+1,+1)+u_1(-1,-1)] =
[u_{2}(+1,-1)+u_2(-1,+1)]-[u_2(+1,+1)+u_2(-1,-1)]

Equilibrium Selection

The existence of pure strategy Nash equilibrium is guaranteed in potential games, and multiple Nash equilibria may exist. Learning algorithms such as "best response" and "better response" can only guarantee that the iterative learning process can converge to one of the Nash equilibria (if multiple). Equilibrium selective learning algorithms aim to design a strategy where convergence to the best Nash equilibrium, with respect to the potential function, is guaranteed. In,[2] the authors propose an equilibrium selective algorithm named MaxLogit, which provably converges to the best Nash equilibrium at the fastest speed in its class, using mixing rate analysis of induced Markovian chains. In a special case where every player shares the same objective function (hence the potential function), and possibly the same action set, the problem is equivalent to distributed combinatorial optimization which arises in many engineering applications. Equilibrium selective learning algorithms such as MaxLogit can be used in such combinatorial optimizations, even in a distributed fashion.

Bounded Rational Models

A logit equilibrium, the Gibbs measure from statistical mechanics, was shown to be the equilibrium of a finite-player potential game,[3] where players are assumed to be bounded-rational in one of two ways. Dynamically, players follow the gradient of the potential on pure strategy space, perturbed by a random variable (motivated by the inherent behavior strategy randomness used to justify a classical mixed-strategy Nash equilibrium). Alternately, a static notion of equilibrium can be used, based on agents arbitraging information out of the system to adapt and improve, as gauged by (Shannon) information entropy.

For a finite number of agents, both result in the same Gibbs equilibrium measure, where the potential exactly corresponds to the negative of the "energy" in physics. There are some important corollaries to this:

  • even a single player is complex in the sense that her endogenous randomness results in her motion being that for an irreversible dissipative system (convective), which converges to a steady-state Gibbs equilibrium (the concept of equilibrium is qualitatively irreversible, otherwise agents would pass right through it as though it were any other point),
  • any model in economics that uses, a-priori, a Gibbsian-derived model (standard or mean-field interacting particle systems, such as Curie-Weiss) from statistical mechanics, has a potential (the negative of the Hamiltonian/energy of the a-priori model) and can thus be interpreted in the context of a bounded-rational potential game, and
  • as a potential refines the Nash equilibriums (eliminates local maximums that aren't global), statistical mechanics can refine the potential by singling out multiple global maximums via symmetry breaking (much as minimum free-energy "all up" or "all down" can be selected in a ferromagnet).

This model satisfies the (Bohr) Correspondence Principle for any finite number or infinite number of players, since the Gibbs measure yields the (refined) Nash equilibrium in the limit of zero-temperature perfect rational dynamics; i.e., this model has a limiting classical behavior.

Phase transitions never occur for a finite number of players, but can occur in the infinite-player games, with spontaneous symmetry breaking and multiple equilibriums below a critical "temperature" (degree of non-rational behavior). For high enough non-rational behavior, i.e., high "temperature", there will always be a unique equilibrium state (e.g., Dobrushin uniqueness theorem). These phase transitions give rise to the emergence of self-organized patterns (i.e., phases) which, for example, correspond to different macroscopic buying/selling patterns of agents in a particular Cournot competition.

An economic interpretation of other parameters in the Gibbs formalism, such as "entropy", "magnetization", "susceptibility", etc., as well as scaling interactions (local[4] , power-law decay, global competition or collusion, mixtures of local/global coopetition), are explained in [3] as well as in an application to a speculative and hedging model.[5]

In this speculative and hedging model, two interdependent markets are examined, with bounded rationality assumptions. The existence of multiple equilibriums is shown to be dependent on certain parameters in the model; i.e., equilibrium(s) depend on the phase of the model (c.f., Sonnenschein–Mantel–Debreu theorem).

Other models assume an agent has the ability to compute her instantaneous expected payoff by conditionally averaging out other agents' distributions. This results in a mean-field-type model for which the equilibrium is obtained by finding fixed points.[6]

References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. 3.0 3.1 Lua error in package.lua at line 80: module 'strict' not found.
  4. Lua error in package.lua at line 80: module 'strict' not found.
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. Lua error in package.lua at line 80: module 'strict' not found.

External links