Study of Evolutionary Game Theory

About This Presentation

Title:

Study of Evolutionary Game Theory

Description:

... how a population evolves over time ... General questions in GT and EGT. 1. t. Ratio of Group A. Ratio of Group C ... Individuals can only be programmed to play ... – PowerPoint PPT presentation

Number of Views:236

Avg rating:3.0/5.0

Slides: 50

Provided by: CHL68

Category:

more less

Transcript and Presenter's Notes

Title: Study of Evolutionary Game Theory

1
Study of Evolutionary Game Theory
2

Introduction of game theory (GT)
Nash equilibrium
Mixed strategy Nash equilibrium
Evolutionary game theory (EGT)
Difference between GT and EGT
Evolutionary stable strategy
Replicator dynamics
Formalization of EGT
Dynamics in a single population
Dynamics in multiple populations

3
Introduction to Game Theory (GT)

The game theory is the model to describe
decision-making in the game, which involves
players, each of whom performs his behavior to
gain payoff according to given conditions
The game theory answers the question, What is
the best behavior to gain payoff when the result
of a players decision is affected by other
players decision involving in the game?
Behaviors a player can perform is called
strategies
The game is denoted as G (N, X, P)
N is the number of players
X Xi where Xi S1, , SM is a set of
strategies for player i
M is the number of strategies
P Pi where Pi is a payoff matrix for player i
A payoff matrix specifies the amount of gain for
players when they have decided their strategies

4
Type of Strategy

There are two types of strategies Pure and Mixed
Pure strategy
A player selects one of the strategies
Mixed strategy
A player selects the strategies according to the
probabilistic distribution
X (x, 1-x) indicates mixed strategy, where a
player selects S1 with probability x and S2 with
probability 1-x
Note For different mixed strategy (which has
different distribution), we can use Y (y, 1-y),
which means that a player selects S1 with
probability y and S2 with probability 1-y

5
Expected payoff

Expected payoff for the strategy
Pure strategy
U1(S1, S1) indicates payoff for player 1 when
player 1 selects S1 and player 2 selects S1,
which is aA
Similarly, U2(S1, S1) aB, U1(S1, S2) bA,
U2(S1, S2) bA,
Mixed strategy
Ui(X, Y) indicates payoff for player i when
player 1 use mixed strategy X and player 2 use
mixed strategy Y
x y aA (1-y) bA (1-x) y cA
(1-y) dA

Go back
6
Nash Equilibrium

Nash Equilibrium is a strategy pair that no
player can increase his payoff by changing his
strategy when the other players have decided
their strategies
By using expected payoff
S is called as Nash Equilibrium if
For all i (players) and all m (strategies),
Ui(S) gt Ui(Si, S-i)
where S is a strategy pair of players
Si is a strategy for player 1
S-i is a strategy for other players
Note Ui(S) is equal to Ui(Si, S-i)
There is no better strategy to increase payoff
when the strategies for the other players are
fixed

7
Example of Nash Equilibrium

Chicken race game
2 players game
A player drives a car toward a wall
If a player stops a car closer to the wall, then
he wins
Strategy
S1 keep running until the other player stops
S2 stop driving a car before the other player
Player 1 plays
Player 2 plays
Payoff matrix shows scores a player can gain

8
Example of Nash Equilibrium

Now, player 1 wants to compute x, which maximizes
U1
We can denote such x as x(y)
Find x (between 0 and 1)
which satisfy dU1/dx 0 ? 3/5 y 0 ? y
3/5
Similarly, player 2 wants to compute y(x),
maximizes U2

Nash Equilibria
9
Evolutionary Game Theory (EGT)

Evolutionary game theory is a model to describe
the dynamics of strategy change in the repeated
game
In EGT, we analyze how a population evolves over
time
EGT formalizes the evolution of population by
using the concept of GT
A population indicates a set of multiple players
Each player has its own strategy
A set of players who play the same strategy is
called a group
EGT assumes that the strategy, which a player
performs, is pre-programmed before the game
E.g., Player1 plays Sa when timelt10 and Sb at
timegt10
Player1 plays a different strategy of
Player2 at time t-1
Player plays Sa with probability p and Sb
with 1-p
At each play during a game, a player plays
against another player randomly drawn in a
population
The opponent may be from the same group or
different group
The winner will be replicated, and the loser will
be dead (or, we can say the loser change its
strategy to that of the winner)
Generally, it is assume that the replication rate
is proportional to payoffs from the game

10
Differences between GT and EGT

GT
Focus on a single game play of a player vs. a
player
Observe how a player behaves (micro)
Evaluate the best rational strategy for a player
in the game
Interested in the properties of stable state
e.g., Nash Equilibria

EGT
Focus on population, consisting of multiple
players, and multiple plays of the repeated game
among the players
Observe how population behave (macro)
Evaluate how the group (a set of players with the
same strategy) shares the population over time
Interested in the dynamics of state change of
population
e.g., Replicator dynamics

11
General questions in GT and EGT
A player1 (from group A) plays against a player2
(from group B) What is the reasonable strategy
for player 1 to maximize his gain? How about the
strategy of the player 2?
GT
A population consists of three types of players A
player plays against randomly selected player in
a population How a population share by groups
changes over time? Does it converge or not? Which
initial conditions conclude the equilibrium of
all groups?
EGT
1
Ratio of Group A
Ratio of Group B
0.5
Ratio of Group C
0.25
t
12
Key Concepts in EGT

Two key concepts in EGT
Evolutionary stable strategy (ESS) and Replicator
dynamics (RD)
ESS is a strategy (group) which cannot be invaded
by any alternative strategy (group) in a
population over time
This is the refinement of Nash equilibrium for
EGT
Replicator dynamics specifies how population
state changes over time
Population state indicates the population shares
by different groups
A population state represents population shares
by groups
A population state at time T is defined by the
vector of the size of groups N1(T), N2(T), ,
Nk(T)

13
Evolutionary Stable Strategy

An evolutionary stable strategy (ESS) is a
strategy which cannot be invaded by any
alternative strategy over time
Assume there are two groups of players in a
population
If the payoff of a group with alternative
strategy is less than that of another group with
a particular strategy, then the alternative
strategy cannot be propagated into a population
over time
Such the particular strategy is called ESS

14
Definition of ESS
y
1-y
x

Lets formalize ESS in the case that
A group A with strategy appears in a
population together with a group B with strategy
The ratio of group A and group B is (1e, e)
Expected payoff for players of a group A
Expected payoff for players of a group B
Strategy is evolutionary stable if

1-x
15
Definition of ESS
y
1-y
x

Lets formalize ESS in the case that
A group A with strategy appears in a
population together with a group B with strategy
The ratio of group A and group B is (1e, e)
When e? 0
When ,
?
?
therefore,

1-x
(1)
(2)
16
Replicator Dynamics (RD)

Replicator dynamics is a model to analyze the
dynamics of population in a evolutionary game
mathematically
A way to formalize how population evolve over
time
i.e. how population state changes over time
Replicator dynamics are represented as a set of
differential equations
There are two types of Replicator dynamics in
terms of interactions between individuals
Interactions among individuals in a same group
Interactions between individuals in different
groups

17
RD in a single group

Again, a single group RD is a model to analyze
how the population state (i.e., strategy
distribution) change over time
Game involves N players
Two Strategies
At time t (generation)
The total number of players at time t
where Ni(t) of players with strategy si
The ratio of group i to the population size
The population state (i.e. strategy distribution)

18
RD in a single group

Rule of the game
of players at time t is proportional to the
payoff gain at the previous generation at time
t-1
At time t1, of players with strategy ei (e1
S1, e2 S2)
From

19
RD in a single group
On the other hand
Therefore
20
Example of a single group RD

Consider how Beetle swarm evolves
There are two types (classes) of beetle Big and
Small
Which Beetle will survive, or stable ?
Beetles are contesting a resource with a value 12
Strategy
S1 having Big body
S2 having Small body
Rule
Beetles with the same strategy share the resource
(6)
But, Big v.s. Big ? they get 5 (decline in value)
When one is S1 and the other is S2
The Big wins, but get 4 (because of fight)
Let x be the ratio of Big beetles to total of
Beetle
By analyzing RD , Lets see which Beetle
will survive

21
Example of a single group RD

Expected payoff
Replicator equation

22
Example of a single group RD

Three equilibria point at
Stability analysis
x0, (eq 3) -2 lt0 (stable)
x1, (eq 3) -5 lt0 (stable)
x2/7, (eq 3) 10/7 gt0 (unstable)
If Beetle swarm contains Big beetlesmore than
2/7, then the Big beetles willoccupy swarm over
time

(3)
23
(No Transcript)
24
Example of ESS

Hawk-Dove game
Animals are contesting a resource with value of v
Strategy
S1 aggressive strategy (hawk)
Escalate and continue to fight until injured or
until the opponent retreats
S2 accommodating strategy (dove)
Display and retreat immediately if the opponent
escalates
Assumption under the conflict
When hawk meets, winning probability is 0.5
When the conflict occurs, the cost c has to be
paid
When dove meets, retreating probability is 0.5
Payoff matrix
When they fight, they get (v-c)/2
When both avoid fighting, both get v/2
When one tries to fight but the other not, the
winner gets v

25
Example of ESS
Calculation of the expected payoff
26
Example of ESS
Assume a group A with strategy S (v/c, 1-(v/c))
First condition
Expected payoff when an animal meets the animal
with different strategy
Second condition
SEES (v/c, 1-(v/c)) is evolutionary stable
strategy
27
(No Transcript)
28
RD in a single group

Assume that at each time t (generation), of
individuals constantly increase with natural
birth rate B and decrease with natural death rate
D
? Gain by natural birth, and dis loss by natural
death
At time t1, of individuals with strategy ei
From and

29
RD in a single group
30
(No Transcript)
31
(No Transcript)
32
Example of EGT

Two players two strategies game
A player has two cards A and B
Players open one of the cards at the same time
They gain money written in the matrix according
to the result of opened cards
In game theory (consider one time game)
A player tries to maximize his own gain (selfish
player)
If both players select card B, then they both
gain benefit
But, there is a risk to be betrayed (i.e. loosing
money)
The state (A, A) is Nash equilibrium
It ends up no gaining
In evolutionary game theory (consider multiple
games)
How can players maximize their benefit?
They may start to think about cooperation
(altruistic player)
They can gain 100 every game if they cooperate
Is that really good strategy for both players?

33
Example of EGT
q
1-q
p
1-p

Assumption
Player 1 selects a card A with probability p
Player 2 selects a card B with probability q
Note this assumption is equivalent to Player 1
selects a card A pN times during N game plays
Expected value of gain for each strategy
For player 1, E(1SA) q lu1 (1-q) ru1 4
(1-q)
E(1SB) q lb1 (1-q) rb1 1 2q
For player 2, E(2SA) p lu2 (1-p) lb2 4
(1-p)
E(2SB) p ru2 (1-p) rb2 1 2p
Expected value of gain for each player
For player 1, E(P1) p E(1SA) (1-p) E(1SB)
For player 2, E(P2) q E(1SA) (1-q) E(1SB)

34
Example of EGT
q
1-q
p
1-p

Expected value of gain for each player
For player 1, E(P1) p E(1SA) (1-p) E(1SB)
For player 2, E(P2) q E(1SA) (1-q) E(1SB)
A player tries to maximize his gain to control p
or q
For player 1, find p that maximize E(P1)
-gt d E(P1) / dp 0
0 E(1SA) E(1SB) q (lu1 lb1)
(1-q) (ru1 rb1)
0 q 3 (1 q) 1 4q ? q ¼
For player 2, find p that maximize E(P2)
-gt d E(P2) / dq 0
0 E(2SA) E(2SB) p (lu2 lb2)
(1-p) (ru2 rb2)
0 p 3 (1 p) 1 4p ? p ¼

35
Mixed strategy Nash equilibrium
q
1-q
p

When players select card A with probability¼
(card B with probability ¾), they both canget
more gain than the strategy concludedby Nash
equilibrium
E(P1) E(P2) ¼ (¾ 4) ¾ (- ¼ ¾ 1)
9/8
This mixed strategy is better than cooperation
strategy
Mixed strategy Nash equilibrium is the
equilibrium where a players mixed strategy
yields the player as high expected payoff as any
other mixed strategy of the player, given the
mixed strategies of the other players.
We can call state(strategy(p, q)(¼, ¼)) as mixed
strategy Nash equilibrium
Note there

1-p
36
(No Transcript)
37
Replicator Dynamics

Mixed strategy x (x1, x2, , xn) is interpreted
as population state, where each element xi is the
share of individuals who play strategy i in a
population
Payoff represents fitness, the degree of
survivability to the next play (e.g., next
generation) in the game
Replicators are strategies, which can be copied
from parent to child

38
The Replicator Dynamics Model

There are n pure strategies in the whole
population. Individuals can only be programmed to
play pure strategies.
A mixed strategy x (x1,x2,,xn) is interpreted
as a population state, each component xi is the
population share of individuals who play pure
strategy i.
Payoff represents fitness (the number of
offsprings), and each offspring inherits its
single parents strategy.
Replicators are pure strategies, which can be
copied without error from parent to child.
Reproduction takes place continuously over time.

39
Payoff in replicator dynamics
ei pure strategy i. xi population share of
pure strategy ei. (equivalent to a component of
mixed strategy x.) u(ei, x) expected payoff
(fitness) of strategy ei in a random match with a
random player when the population is in state
x(x1, , xn). (equivalent to payoff of strategy
ei against mixed strategy x.) u(x, x)
population expected payoff (fitness) is the
expected payoff to an individual drawn at random
from the population. (equivalent to the payoff of
mixed strategy x against mixed strategy x.)
40
Replicator Dynamics Model

xi population share of pure strategy i.
ei pure strategy i.
u(ei, x) expected payoff (fitness) of strategy
i at a random match when the population is in
state x(x1, , xn).
u(x, x) population expected payoff (fitness) is
the expected payoff to an individual drawn at
random from the population

41
Example replicator dynamics for a doubly
symmetric game
Fitness of strategy 1
A
Mixed strategy
Average population fitness
42
Stability Concepts in Nonlinear System
Nonlinear system with state variable
x(t)(x1(t),,xn(t))

Lyapunov Stability a state x is stable or
Lyapunov stable if no small perturbation of the
state induces a movement away from x(x1,,xn).
no push away from x
Asymptotical Stability a state x is
asymptotical stable if it is Lyapunov stable
and all sufficiently small perturbations of the
state induce a movement back toward x.

43
ESS and Replicator Dynamics

ESS x asymptotical stability of population
state x.
proved by choosing a Lyapunov function, which is
a relative-entropy function in this case.
converse may be not true.

44
Example Rock-Scissors-Paper (RSP) Game

Unique NE strategy x(1/3, 1/3, 1/3) is NOT ESS !
How about the Replicator Dynamics?

45
Example Rock-Scissors-Paper (RSP) Game
A is the payoff matrix of one player
Replicator Dynamics
46
Rock-Scissors-Paper (RSP) Game
NE strategy x(1/3, 1/3, 1/3), but not ESS NE
strategy is Lyapunov stable, but not
asymptotically stable
Replicator Dynamics
Start from any initial state, the system moves
forever along a closed curve!
Paper (x3)
Rock (x1)
Scissors (x2)
47
Evolutionary Game Theory and Computer Networking

An evolutionary game perspective to ALOHA with
power control.
E. Altman, N. Bonneau, M. Debbah, and G. Caire.
Proceedings of the 19th International Teletraffic
Congress, 2005.
An evolutionary game-theoretic approach to
congestion control.
D.S. Menasch a, D.R. Figueiredob, E. de Souza e
Silvaa. Performance Evaluation 62 (2005)

48
Example of Replicator Equation