News and Notes 3/18 - PowerPoint PPT Presentation

1 / 21
About This Presentation
Title:

News and Notes 3/18

Description:

e.g. 1/3 rock, 1/3 paper, 1/3 scissors. Might also have 2 players ... Alternative interpretation for C V: the Kansas Cornfield Intersection game (a.k.a. Chicken) ... – PowerPoint PPT presentation

Number of Views:34
Avg rating:3.0/5.0
Slides: 22
Provided by: CIS4
Category:
Tags: news | notes

less

Transcript and Presenter's Notes

Title: News and Notes 3/18


1
News and Notes 3/18
  • Two readings in game theory assigned
  • Short lecture today due to 10 AM fire drill
  • HW 2 handed back today, midterm handed back
    Tuesday
  • No MK OHs today

2
Introduction to Game Theory
  • Networked Life
  • CSE 112
  • Spring 2004
  • Prof. Michael Kearns

3
Game Theory
  • A mathematical theory designed to model
  • how rational individuals should behave
  • when individual outcomes are determined by
    collective behavior
  • strategic behavior
  • Rational usually means selfish --- but not always
  • Rich history, flourished during the Cold War
  • Traditionally viewed as a subject of economics
  • Subsequently applied by many fields
  • evolutionary biology, social psychology
  • Perhaps the branch of pure math most widely
    examined outside of the hard sciences

4
Prisoners Dilemma
cooperate defect
cooperate -1, -1 -10, -0.25
defect -0.25, -10 -8, -8
  • Cooperate deny the crime defect confess
    guilt of both
  • Claim that (defect, defect) is an equilibrium
  • if I am definitely going to defect, you choose
    between -10 and -8
  • so you will also defect
  • same logic applies to me
  • Note unilateral nature of equilibrium
  • I fix a behavior or strategy for you, then choose
    my best response
  • Claim no other pair of strategies is an
    equilibrium
  • But we would have been so much better off
    cooperating
  • Looking ahead what do people actually do?

5
Penny Matching
heads tails
heads 1, 0 0, 1
tails 0, 1 1, 0
  • What are the equilibrium strategies now?
  • There are none!
  • if I play heads then you will of course play
    tails
  • but that makes me want to play tails too
  • which in turn makes you want to play heads
  • etc. etc. etc.
  • But what if we can each (privately) flip coins?
  • the strategy pair (1/2, 1/2) is an equilibrium
  • Such randomized strategies are called mixed
    strategies

6
The World According to Nash
  • If gt 2 actions, mixed strategy is a distribution
    on them
  • e.g. 1/3 rock, 1/3 paper, 1/3 scissors
  • Might also have gt 2 players
  • A general mixed strategy is a vector P (P1,
    P2, Pn)
  • Pi is a distribution over the actions for
    player i
  • assume everyone knows all the distributions Pj
  • but the coin flips used to select from Pi
    known only to i
  • P is an equilibrium if
  • for every i, Pi is a best response to all the
    other Pj
  • Nash 1950 every game has a mixed strategy
    equilibrium
  • no matter how many rows and columns there are
  • in fact, no matter how many players there are
  • Thus known as a Nash equilibrium
  • A major reason for Nashs Nobel Prize in
    economics

7
Facts about Nash Equilibria
  • While there is always at least one, there might
    be many
  • zero-sum games all equilibria give the same
    payoffs to each player
  • non zero-sum different equilibria may give
    different payoffs!
  • Equilibrium is a static notion
  • does not suggest how players might learn to play
    equilibrium
  • does not suggest how we might choose among
    multiple equilibria
  • Nash equilibrium is a strictly competitive notion
  • players cannot have pre-play communication
  • bargains, side payments, threats, collusions,
    etc. not allowed
  • Computing Nash equilibria for large games is
    difficult

8
Hawks and Doves
hawk dove
hawk (V-C)/2, (V-C)/2 V, 0
dove 0, V V/2, V/2
  • Two parties confront over a resource of value V
  • May simply display aggression, or actually have a
    fight
  • Cost of losing a fight C gt V
  • Assume parties are equally likely to win or lose
  • There are three Nash equilibria
  • (hawk, dove), (dove, hawk) and (V/C hawk, V/C
    hawk)
  • Alternative interpretation for C gtgt V
  • the Kansas Cornfield Intersection game (a.k.a.
    Chicken)
  • hawk speed through intersection, dove yield

9
Board Games and Game Theory
  • What does game theory say about richer games?
  • tic-tac-toe, checkers, backgammon, go,
  • these are all games of complete information with
    state
  • incomplete information poker
  • Imagine an absurdly large game matrix for
    chess
  • each row/column represents a complete strategy
    for playing
  • strategy a mapping from every possible board
    configuration to the next move for the player
  • number of rows or columns is huge --- but finite!
  • Thus, a Nash equilibrium for chess exists!
  • its just completely infeasible to compute it
  • note can often push randomization inside the
    strategy

10
Repeated Games
  • Nash equilibrium analyzes one-shot games
  • we meet for the first time, play once, and
    separate forever
  • Natural extension repeated games
  • we play the same game (e.g. Prisoners Dilemma)
    many times in a row
  • like a board game, where the state is the
    history of play so far
  • strategy a mapping from the history so far to
    your next move
  • So repeated games also have a Nash equilibrium
  • may be different from the one-shot equilibrium!
  • depends on the game and details of the setting
  • We are approaching learning in games
  • natural to adapt your behavior (strategy) based
    on play so far

11
Repeated Prisoners Dilemma
  • If we play for R rounds, and both know R
  • (always defect, always defect) still the only
    Nash equilibrium
  • argue by backwards induction
  • If uncertainty about R is introduced (e.g. random
    stopping)
  • cooperation and tit-for-tat can become equilibria
  • If computational restrictions are placed on our
    strategies
  • as long as were too feeble to count, cooperative
    equilibria arise
  • formally lt log(R) states in a finite automaton
  • a form of bounded rationality

12
The Folk Theorem
  • Take any one-shot, two-player game
  • Suppose that (u,v) are the (expected) payoffs
    under some mixed strategy pair (P1,P2) for
    the two players
  • (P1, P2) not necessarily a Nash equilibrium
  • but (u,v) gives better payoffs than the security
    levels
  • security level what a player can get no matter
    what the other does
  • example sec. level is (-8, -8) in Prisoners
    Dilemma (-1,-1) is better
  • Then there is always a Nash equilibrium for the
    infinite repeated game giving payoffs (u,v)
  • makes use of the concept of threats
  • Partial resolution of the difficulties of Nash
    equilibria

13
Correlated Equilibrium
  • In a Nash equilibrium (P1,P2)
  • player 2 knows the distribution P1
  • but doesnt know the random bits player 1 uses
    to select from P1
  • equilibrium relies on private randomization
  • Suppose now we also allow public (shared)
    randomization
  • so strategy might say things like if private
    bits 100110 and shared bits 110100110, then
    play hawk
  • Then two strategies are in correlated equilibrium
    if
  • knowing only your strategy and the shared bits,
    my strategy is a best response, and vice-versa
  • Nash is the special case of no shared bits

14
Hawks and Doves Revisited
hawk dove
hawk (V-C)/2, (V-C)/2 V, 0
dove 0, V V/2, V/2
  • There are three Nash equilibria
  • (hawk, dove), (dove, hawk) and (V/C hawk, V/C
    hawk)
  • Alternative interpretation for C gtgt V
  • the Kansas Cornfield Intersection game (a.k.a.
    Chicken)
  • hawk speed through intersection, dove yield
  • Correlated equilibrium the traffic signal
  • if the shared bit is green to me, I am playing
    hawk
  • if the shared bit is red to me, I will play dove
  • you play the symmetric strategy
  • splits waiting time between us --- a different
    outcome than Nash

15
Correlated Equilibrium Facts
  • Always exists
  • all Nash equilibria are correlated equilibria
  • all probability distributions over Nash
    equilibria are C.E.
  • and some more things are C.E. as well
  • a broader concept than Nash
  • Technical advantages of correlated equilibria
  • often easier to compute than Nash
  • Conceptual advantages
  • correlated behavior is a fact of the real world
  • model a limited form of cooperation
  • more general cooperation becomes extremely
    complex and messy
  • Breaking news (late 90s now)
  • CE is the natural convergence notion for
    rational learning in games!

16
A More Complex SettingBargaining
  • Convex set S of possible payoffs
  • Players must bargain to settle on a solution
    (x,y) in S
  • What should the solution be?
  • Want a general answer
  • A function F(S) mapping S to a solution (x,y) in
    S
  • Nashs axioms for F
  • choose on red boundary (Pareto)
  • scale invariance
  • symmetry in the role of x and y
  • independence of irrelevant alternatives
  • if green solution was contained in smaller red
    set, must also be red solution

17
Nashs Bargaining Solution
  • Theres only one choice of F that satisfies all
    these axioms
  • And the winner is
  • choose (x,y) on the boundary of S that maximizes
    xy
  • Example rich-poor bargaining

Cash 1 (rich) Cash 2 (poor) Utility 1 (rich) Utility 2 (poor) U1 x U2
0 100 0.00 1.00 0.000
25 75 0.25 0.98 0.245
50 50 0.50 0.90 0.450
75 25 0.75 0.73 0.548
100 0 1.00 0.00 0.000
18
Social Choice Theory
  • Suppose we must collectively choose between
    alternatives
  • e.g. Bush, Kerry, Nader, Sharpton,
  • Under current voting scheme, gamesmanship
    encouraged
  • e.g. prefer Nader to Gore, but Gore is more
    electable
  • not a truth revealing mechanism
  • An idealized voting scheme
  • we each submit a complete ordering on the
    candidates
  • e.g. Sharpton gt Bush gt Nader gt Kerry
  • then combine the orderings to choose a global
    ordering (outcome)
  • we would like the outcome to be fair and
    reasonable
  • What do fair and reasonable mean?
  • Again take an axiomatic approach

19
Social Choice Axioms
  • Lets call F the mapping from preferences to
    outcome
  • Suppose for some preferences, x gt y in the global
    outcome
  • then if we move x up in all preferences, F still
    has x gt y
  • Suppose we look at some subset S of alternatives
  • e.g. S Kerry, Sharpton
  • suppose we modify preferences only outside of S
  • Fs ranking of S should remain unchanged
    (irrelevant alternatives)
  • Always some way of making F output x gt y, for any
    x,y
  • otherwise F is ignoring the preferences!
  • Non-dictatorship
  • no single individual determines output of F

20
Arrows Impossibility Theorem
  • There is no mapping F satisfying all four axioms!
  • Long history of alternate axioms, (im)possibility
    research
  • A mathematical demonstration of the difficulty of
    selecting collective outcomes from individual
    preferences

21
Next Up
  • Have so far examined simple games between two
    players
  • Strategic interaction on the smallest network
  • two vertices with a single link between them
  • much richer interaction than just info
    transmission, messages, etc.
  • Classical game theory generalizes to many players
  • e.g. Nash equilibria always exist in multi-player
    matrix games
  • but this fails to capture/exploit/examine
    structured interaction
  • We need specific models for networked games
  • games on networks local interaction
  • shared information economies, financial markets
  • voting systems
  • evolutionary games
Write a Comment
User Comments (0)
About PowerShow.com