News and Notes 3/18 - PowerPoint PPT Presentation

1 / 21

About This Presentation

Title:

News and Notes 3/18

Description:

e.g. 1/3 rock, 1/3 paper, 1/3 scissors. Might also have 2 players ... Alternative interpretation for C V: the Kansas Cornfield Intersection game (a.k.a. Chicken) ... – PowerPoint PPT presentation

Number of Views:34

Avg rating:3.0/5.0

Slides: 22

Provided by: CIS4

Learn more at: https://www.cis.upenn.edu

Category:

Tags: news | notes

more less

Transcript and Presenter's Notes

Title: News and Notes 3/18

1
News and Notes 3/18

Two readings in game theory assigned
Short lecture today due to 10 AM fire drill
HW 2 handed back today, midterm handed back
Tuesday
No MK OHs today

2
Introduction to Game Theory

Networked Life
CSE 112
Spring 2004
Prof. Michael Kearns

3
Game Theory

A mathematical theory designed to model
how rational individuals should behave
when individual outcomes are determined by
collective behavior
strategic behavior
Rational usually means selfish --- but not always
Rich history, flourished during the Cold War
Traditionally viewed as a subject of economics
Subsequently applied by many fields
evolutionary biology, social psychology
Perhaps the branch of pure math most widely
examined outside of the hard sciences

4
Prisoners Dilemma
cooperate defect
cooperate -1, -1 -10, -0.25
defect -0.25, -10 -8, -8

Cooperate deny the crime defect confess
guilt of both
Claim that (defect, defect) is an equilibrium
if I am definitely going to defect, you choose
between -10 and -8
so you will also defect
same logic applies to me
Note unilateral nature of equilibrium
I fix a behavior or strategy for you, then choose
my best response
Claim no other pair of strategies is an
equilibrium
But we would have been so much better off
cooperating
Looking ahead what do people actually do?

5
Penny Matching
heads tails
heads 1, 0 0, 1
tails 0, 1 1, 0

What are the equilibrium strategies now?
There are none!
if I play heads then you will of course play
tails
but that makes me want to play tails too
which in turn makes you want to play heads
etc. etc. etc.
But what if we can each (privately) flip coins?
the strategy pair (1/2, 1/2) is an equilibrium
Such randomized strategies are called mixed
strategies

6
The World According to Nash

If gt 2 actions, mixed strategy is a distribution
on them
e.g. 1/3 rock, 1/3 paper, 1/3 scissors
Might also have gt 2 players
A general mixed strategy is a vector P (P1,
P2, Pn)
Pi is a distribution over the actions for
player i
assume everyone knows all the distributions Pj
but the coin flips used to select from Pi
known only to i
P is an equilibrium if
for every i, Pi is a best response to all the
other Pj
Nash 1950 every game has a mixed strategy
equilibrium
no matter how many rows and columns there are
in fact, no matter how many players there are
Thus known as a Nash equilibrium
A major reason for Nashs Nobel Prize in
economics

7
Facts about Nash Equilibria

While there is always at least one, there might
be many
zero-sum games all equilibria give the same
payoffs to each player
non zero-sum different equilibria may give
different payoffs!
Equilibrium is a static notion
does not suggest how players might learn to play
equilibrium
does not suggest how we might choose among
multiple equilibria
Nash equilibrium is a strictly competitive notion
players cannot have pre-play communication
bargains, side payments, threats, collusions,
etc. not allowed
Computing Nash equilibria for large games is
difficult

8
Hawks and Doves
hawk dove
hawk (V-C)/2, (V-C)/2 V, 0
dove 0, V V/2, V/2

Two parties confront over a resource of value V
May simply display aggression, or actually have a
fight
Cost of losing a fight C gt V
Assume parties are equally likely to win or lose
There are three Nash equilibria
(hawk, dove), (dove, hawk) and (V/C hawk, V/C
hawk)
Alternative interpretation for C gtgt V
the Kansas Cornfield Intersection game (a.k.a.
Chicken)
hawk speed through intersection, dove yield

9
Board Games and Game Theory

What does game theory say about richer games?
tic-tac-toe, checkers, backgammon, go,
these are all games of complete information with
state
incomplete information poker
Imagine an absurdly large game matrix for
chess
each row/column represents a complete strategy
for playing
strategy a mapping from every possible board
configuration to the next move for the player
number of rows or columns is huge --- but finite!
Thus, a Nash equilibrium for chess exists!
its just completely infeasible to compute it
note can often push randomization inside the
strategy

10
Repeated Games

Nash equilibrium analyzes one-shot games
we meet for the first time, play once, and
separate forever
Natural extension repeated games
we play the same game (e.g. Prisoners Dilemma)
many times in a row
like a board game, where the state is the
history of play so far
strategy a mapping from the history so far to
your next move
So repeated games also have a Nash equilibrium
may be different from the one-shot equilibrium!
depends on the game and details of the setting
We are approaching learning in games
natural to adapt your behavior (strategy) based
on play so far

11
Repeated Prisoners Dilemma

If we play for R rounds, and both know R
(always defect, always defect) still the only
Nash equilibrium
argue by backwards induction
If uncertainty about R is introduced (e.g. random
stopping)
cooperation and tit-for-tat can become equilibria
If computational restrictions are placed on our
strategies
as long as were too feeble to count, cooperative
equilibria arise
formally lt log(R) states in a finite automaton
a form of bounded rationality

12
The Folk Theorem

Take any one-shot, two-player game
Suppose that (u,v) are the (expected) payoffs
under some mixed strategy pair (P1,P2) for
the two players
(P1, P2) not necessarily a Nash equilibrium
but (u,v) gives better payoffs than the security
levels
security level what a player can get no matter
what the other does
example sec. level is (-8, -8) in Prisoners
Dilemma (-1,-1) is better
Then there is always a Nash equilibrium for the
infinite repeated game giving payoffs (u,v)
makes use of the concept of threats
Partial resolution of the difficulties of Nash
equilibria

13
Correlated Equilibrium

In a Nash equilibrium (P1,P2)
player 2 knows the distribution P1
but doesnt know the random bits player 1 uses
to select from P1
equilibrium relies on private randomization
Suppose now we also allow public (shared)
randomization
so strategy might say things like if private
bits 100110 and shared bits 110100110, then
play hawk
Then two strategies are in correlated equilibrium
if
knowing only your strategy and the shared bits,
my strategy is a best response, and vice-versa
Nash is the special case of no shared bits

14
Hawks and Doves Revisited
hawk dove
hawk (V-C)/2, (V-C)/2 V, 0
dove 0, V V/2, V/2

There are three Nash equilibria
(hawk, dove), (dove, hawk) and (V/C hawk, V/C
hawk)
Alternative interpretation for C gtgt V
the Kansas Cornfield Intersection game (a.k.a.
Chicken)
hawk speed through intersection, dove yield
Correlated equilibrium the traffic signal
if the shared bit is green to me, I am playing
hawk
if the shared bit is red to me, I will play dove
you play the symmetric strategy
splits waiting time between us --- a different
outcome than Nash

15
Correlated Equilibrium Facts

Always exists
all Nash equilibria are correlated equilibria
all probability distributions over Nash
equilibria are C.E.
and some more things are C.E. as well
a broader concept than Nash
Technical advantages of correlated equilibria
often easier to compute than Nash
Conceptual advantages
correlated behavior is a fact of the real world
model a limited form of cooperation
more general cooperation becomes extremely
complex and messy
Breaking news (late 90s now)
CE is the natural convergence notion for
rational learning in games!

16
A More Complex SettingBargaining

Convex set S of possible payoffs
Players must bargain to settle on a solution
(x,y) in S
What should the solution be?
Want a general answer
A function F(S) mapping S to a solution (x,y) in
S
Nashs axioms for F
choose on red boundary (Pareto)
scale invariance
symmetry in the role of x and y
independence of irrelevant alternatives
if green solution was contained in smaller red
set, must also be red solution

17
Nashs Bargaining Solution

Theres only one choice of F that satisfies all
these axioms
And the winner is
choose (x,y) on the boundary of S that maximizes
xy
Example rich-poor bargaining

Cash 1 (rich) Cash 2 (poor) Utility 1 (rich) Utility 2 (poor) U1 x U2
0 100 0.00 1.00 0.000
25 75 0.25 0.98 0.245
50 50 0.50 0.90 0.450
75 25 0.75 0.73 0.548
100 0 1.00 0.00 0.000
18
Social Choice Theory

Suppose we must collectively choose between
alternatives
e.g. Bush, Kerry, Nader, Sharpton,
Under current voting scheme, gamesmanship
encouraged
e.g. prefer Nader to Gore, but Gore is more
electable
not a truth revealing mechanism
An idealized voting scheme
we each submit a complete ordering on the
candidates
e.g. Sharpton gt Bush gt Nader gt Kerry
then combine the orderings to choose a global
ordering (outcome)
we would like the outcome to be fair and
reasonable
What do fair and reasonable mean?
Again take an axiomatic approach

19
Social Choice Axioms

Lets call F the mapping from preferences to
outcome
Suppose for some preferences, x gt y in the global
outcome
then if we move x up in all preferences, F still
has x gt y
Suppose we look at some subset S of alternatives
e.g. S Kerry, Sharpton
suppose we modify preferences only outside of S
Fs ranking of S should remain unchanged
(irrelevant alternatives)
Always some way of making F output x gt y, for any
x,y
otherwise F is ignoring the preferences!
Non-dictatorship
no single individual determines output of F

20
Arrows Impossibility Theorem

There is no mapping F satisfying all four axioms!
Long history of alternate axioms, (im)possibility
research
A mathematical demonstration of the difficulty of
selecting collective outcomes from individual
preferences

21
Next Up