Title: The Evolution of Cooperation
1The Evolution of Cooperation
Robert Axelrods
A Computer Game for Political Science
- Kentaro Toyama
- Microsoft Research India
- Indian Institute of Science August 10, 2005
2Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
3Robert Axelrod
- Professor of Political Science and Public Policy
at U Michigan, Ann Arbor - First paper on cooperation published in 1980.
- Book (left) published in 1984 to wide acclaim.
- Best known for this and related work still
active in this area and publishing new research. - http//www-personal.umich.edu/axe/
4Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
5The Prisoners Dilemma
- Two-player game
- Non-zero-sum
- Model for many real-world scenarios
- Story based on two criminals caught by police and
interrogated separately
6The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
-2
-2
0
-5
Cooperate
Player A
0
-5
-4
-4
Defect
Think of payoffs as number of years of life lost,
spent in jail.
7The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
3
5
0
Cooperate
Player A
5
0
1
1
Defect
(For ease of thinking, add 5 to each payoff.
The larger the payoff, the better.)
8The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
3
5
0
Cooperate
Player A
5
0
1
1
Defect
If Player A cooperates, Player B should defect.
9The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
3
5
0
Cooperate
Player A
5
0
1
1
Defect
If Player A cooperates, Player B should defect.
If Player A defects, Player B should defect.
10The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
3
5
0
Cooperate
Player A
5
0
1
1
Defect
No matter what the other player does, a rational,
self-interested player will defect. (This is a
Nash equilibrium.)
11The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
3
5
0
Cooperate
Player A
5
0
1
1
Defect
No matter what the other player does, a rational,
self-interested player will defect. (This is a
Nash equilibrium.)
The Dilemma There is a joint strategy that
could result in better payoffs for both players.
(The Nash equilibrium is not Pareto-optimal.)
12Other Kinds of Games
C
D
C
D
1 1
C
1 2
4 4
C
2 3
0 0
0 1
3 2
1 1
D
D
Exploitation
Linked Fates
Swerve
Straight
Heads
Tails
2 2
Swerve
1 3
1 -1
Heads
-1 1
3 1
0 0
-1 1
1 -1
Straight
Tails
Chicken
Matching Coins
13The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
Reward for cooperation
Temptation to defect
3
5
Cooperate
Player A
Punishment for defection
Suckers payoff
0
1
Defect
T gt R gt P gt S
R gt (S T) / 2
14The Prisoners Dilemma
Player B
Cooperate
Defect
Payoff Matrix
3
30
5
0
Cooperate
Player A
50
0
1
10
Defect
T gt R gt P gt S
R gt (S T) / 2
Payoffs do not have to be symmetrical.
15PD as a Model for Real-Life Scenarios
16Iterated Prisoners Dilemma
- Two players
- Prisoners Dilemma played repeatedly
- History of previous interactions remembered by
each player - No other outside knowledge
17Iterated Prisoners Dilemma
No matter what the other player does, a rational,
self-interested player will defect on the second
(last) game.
18Iterated Prisoners Dilemma
No matter what the other player does, a rational,
self-interested player will defect on the second
(last) game.
Both players know this, so on the first game,
both players will defect, as well.
19Iterated Prisoners Dilemma
No matter what the other player does, a rational,
self-interested player will defect on the second
game.
Both players know this, so on the first game,
both players will defect, as well.
A rational, self-interested player should defect
all N times.
20Iterated Prisoners Dilemma
- If number of iterations uncertain
?
?
?
?
Best strategy is no longer clear! Unlike, e.g.,
chess, there is no single best strategy it
depends on the strategy of the other player.
21Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
22Contest 1
- Call for entries to game theorists
- All entrants told of preliminary experiments
- 15 strategies
- 14 entries 1 RANDOM
- Round-robin tournament against all other players
and twin - Each game 200 iterations
- Games run 5 times against each strategy
- Scores averaged over all games
23And, the winner is
- TIT FOR TAT
- Cooperate on first move, thereafter reciprocate
opponents previous action - Shortest program submitted
- By psychologist, Anatol Rapoport
24Analysis Nice Guys Finish First
- Top 8 strategies never defect first.
25Analysis To Forgive, Divine
- Top two rules are willing to cooperate even after
defections, if other player is contrite
DOWNING - Kingmaker - Tries to learn
behavior of other player starts by defecting
twice. - Hurts strategies that are unforgiving.
26Other Interesting Strategies
- TIT FOR TWO TATS
- Retaliate only if previous two are Ds
- Could have won tournament, if entered
- NICE DOWNING
- Like DOWNING, but start with Cs
- Could have won tournament, if entered
- Variations on TIT FOR TAT
- Did well, but none beat TIT FOR TAT
27Contest 2
- Same set up as Contest 1, except
- Entries from first-round contestants as well as
open call in magazine - 63 strategies
- 62 entries 1 RANDOM
- Each game iterated an uncertain number of
iterations, with probability 0.00346 of ending
28And, the winner is
- TIT FOR TAT, again!
- (Again, by Anatol Rapoport)
29Analysis Contest 1 Lessons Validated
- 14 of top 15 strategies never defect first.
- 14 of bottom 15 strategies were not nice.
- Forgiveness important.
30Analysis Be Retaliatory
- Some entrees tried to take advantage of nice
strategies - TRANQUILIZER cooperate first, if other
cooperates, too, throw in a few defections. - TESTER defect first, if other doesnt
retaliate, cooperate twice, then alternate
defection and cooperation. If other ever
defects, do TIT FOR TAT. - Strategies that were unresponsive to defections
get taken advantage of. - Top strategies retaliate quickly.
31Analysis Sneaki-ness Doesnt Pay
- Entrees that try to take advantage of nice
strategies, dont gain as much as they lose. - TRANQUILIZER 27th place in tournament.
- TESTER 46th place in tournament (out of 63).
32Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
33Robustness of TIT FOR TAT
- In six variations of Contest 2, TIT FOR TAT took
first place in five and second place in one. - In a population simulation with 63 strategies
(right), TIT FOR TAT emerges as the winner. - In an genetic algorithm experiment (1987),
TIT-FOR-TAT-like algorithms prevailed.
34Stability of TIT FOR TAT
- A population of TIT FOR TAT strategists cannot be
invaded by a single strategy. - Nor can a population of ALWAYS DEFECT
strategists. - But! A cluster of TIT FOR TATs can invade ALWAYS
DEFECT, while the converse is not true.
35Stability of TIT FOR TAT
- A population of TIT FOR TAT strategists cannot be
invaded by a single strategy. - Nor can a population of ALWAYS DEFECT
strategists. - But! A cluster of TIT FOR TATs can invade ALWAYS
DEFECT, while the converse is not true.
36Stability of TIT FOR TAT
- A population of TIT FOR TAT strategists cannot be
invaded by a single strategy. - Nor can a population of ALWAYS DEFECT
strategists. - But! A cluster of TIT FOR TATs can invade ALWAYS
DEFECT, while the converse is not true.
Under certain conditions that imply that the
future is sufficiently important for all players.
37General Lessons
- Dont be envious. (It doesnt matter if others
win.) - TIT FOR TAT never scores more than the other
player. - Be nice. (Dont defect first.)
- The best way to do well is to cooperate with
others who are also nice. - Retaliate swiftly.
- Or, others will take advantage.
- Forgive.
- Feuds are costly. Defections shouldnt prevent
cooperation later on. - Dont be too clever.
- Too much cleverness looks RANDOM.
38Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
39Trench Warfare
- Common form of battle in World War I
- Armies in deep trenches on either side of battle
line - Machine guns and artillery
- Prolonged engagement with same group of enemy
troops
40Trench Warfare is an IPD
You
Cooperate
Shoot to Kill
Payoff Matrix
You live and win a medal.
Cooperate
Both live.
Them
You die and they win.
Shoot to Kill
Both die.
For a single round, no matter what the enemy
does, its better to shoot to kill.
But, for an indefinite number of rounds?
41Trench Warfare
- Cooperation spontaneously evolved
- If the British shelled the Germans, the Germans
replied, and the damage was equal. - A British staff officer was astonished to
observe German soldiers walking about within
rifle range - These people did not know there was a war on.
Both sides believed in live and let live. - Suddenly a salvo arrived but did no damage.
Naturally both sides got down and our men started
swearing at the Germans, when all at once a brave
German got on to his parapet and shouted out We
are very sorry about that we hope no one was
hurt. It is not our fault, it is that damned
Prussian artillery.
42Biological Mutualism
43Fig Tree and Fig Wasp
Wasp
Lay eggs without pollinating
Lay eggs and pollinate
Payoff Matrix
Bear fruit breed good wasps.
No fruit breed bad wasps.
Let fig ripen
Live, and have kids.
Die, but have kids.
Tree
No fruit.
Bear fruit.
Cut off fig
Die, no kids.
Live, no kids.
For a single round, trees should cut off figs,
wasps should lay eggs without pollinating.
But, for an indefinite number of rounds?
44Fig Tree and Fig Wasp Mutualism
45Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
46Agent-Based Modeling
- Simulation as a scientific method
- Simulation allows hypothesis discovery,
verification, and prediction.
Simulation is particularly valuable for
interactions of many agents and the agents are
expected to adapt.
47Other Modeled Social Theories
- 1963 Cyert March Behavioral theory of the
firm - 1974 Schelling Segregated neighborhoods
- 1980 Axelrod Cooperation
- 2003 Axelrod Ethnocentrism
48Outline
- Prisoners Dilemma
- Two Contests
- Some Analysis
- Real-World Scenarios
- Agent-Based Simulation
- Discussion
49Summary
- Iterated Prisoners Dilemma as a model for many
different types of interaction - There is no single optimal strategy in an IPD
game, but TIT FOR TAT is strong, robust, and
stable. - In real-world IPD scenarios, TIT-FOR-TAT-like
strategies naturally evolve, even among
antagonists and unintelligent players. - Agent-based modeling is a powerful tool for
modeling populations in social and biological
sciences.
50TIT FOR TAT and Ethics
- Mahabharata (3000 BC)
- One should not behave towards others in a way
which is disagreeable to oneself. This is the
essence of morality. All other activities are due
to selfish desire. - Hammurabis Code (1750 BC)
- If a man put out the eye of another man, his eye
shall be put out. If he break another man's
bone, his bone shall be broken. - The Golden Rule (30 AD)
- Do unto others as you would have them do unto
you. - Kants Categorical Imperative
- Act so that the maxim of action may be capable
of becoming a universal law. - Garrett Hardin (The Tragedy of the Commons,
1968) - Conscience is self-eliminating.
51Thank you!