Mini Quiche - PowerPoint PPT Presentation

1 / 25

About This Presentation

Title:

Mini Quiche

Description:

... (G nets) are a novel class of game representations and associated ... for M N, let XM:= k M Xk , and let x 0=(xM0, xN -M 0) be an arbitrary reference point. ... – PowerPoint PPT presentation

Number of Views:45

Avg rating:3.0/5.0

Slides: 26

Provided by: pierol

Category:

more less

Transcript and Presenter's Notes

Title: Mini Quiche

1
Mini Quiche
Prepare a double batch of pie crust or purchase a
pie crust mix. Roll out and cut in circles using
a donut cutter or cooking cutter. Place circles
into bottom of muffin pan so that pie crust curls
up the sides by about 1/4 inch. Fill with any
combo of filling. I used bacon, mushrooms,
onions, and swiss for one filling. The other was
broccoli, onions and cheddar cheese. Place just
enough filling so it does not go above lip of
crust. Beat 2 eggs and 3/4 cup sour cream.
Spoon 1 teaspoon of egg mixture over each
quiche. Bake for 20-25 min in 375 oven. Reheat
defrosted quiche for 10 min in a 400
oven. Recipe Archive at Carnegie Mellon's School
of Computer Science (SCS)
2
Game Networks Piero La Mura Stanford University
Motivation providing a natural and
computationally efficient framework for
strategic inference in multi-agent decision
problems
3
Beer/Quiche Game
4
A Nash Equilibrium
5
Another Nash Equilibrium
6

Existing frameworks for strategic inference
include
normal forms (NF)
extensive forms (EF)
sequence forms (SF)
influence diagrams (ID)
expected utility networks (EUN).
Of these,
EF, SF and ID are dynamic, i.e. capture the
causal flow of events in the game
NF, EF and SF are multi-agent representations
EUN capture strategic independencies.
G nets are a dynamic, multi-agent generalization
of EUN.

7
Multi-agent Influence Diagrams?
Player1
Player2
This doesnt work. (of course)
8
G nets, informally Game networks (G nets) are a
novel class of game representations and
associated inference procedures, which can
exploit strategic separabilities in order to
simplify the inference process. G nets come
with two novel inference procedures one which
selects a single strategic equilibrium as a
function of the game payoffs, and one which
identifies all equilibria .
9

Decision-theoretic background
Let S be a set of states, A a Boolean algebra on
S, and
u S? R a strictly positive (utility) function.
An expected utility function is a pair (p,u),
where p is a probability function on A.
A conditional probability system is a family of
conditional probability functions p( . F) such
that, for all E?F?G,
p(EG)p(EF)p(FG)
p( . F) is well defined even when p(FG)0,
i.e. even when a condition F has zero
probability.
A conditional expected utility function is a pair
(p,u) where p is a conditional probability system.

10
The expected utility of a possible event E is
defined as u(E)? u(s) p(s E) and the
conditional expected utility of E given F is
defined as u(E F) u(E ? F) / u(F). Finally,
the conditional value of E given F is defined
as v(E F)p(E F) u(E F).
11
Ceteris Paribus Comparisons
Let Xkk?N be a set of variables, for M?N, let
XM?k?M Xk , and let x 0(xM0, xN -M 0) be an
arbitrary reference point. Ceteris Paribus
Comparisons for probabilities q(xMxN) p(xM ,
xN-M) / p(xM0 , xN-M) Ceteris Paribus
Comparisons for utilities w(xMxN) u(xM , xN-M)
/ u(xM0 , xN-M)
12
Conditional independence of probabilities and
utilities Let A,B,C be a partition of N and
let x0(a0,b0,c0) be an arbitrary reference
point. We say that A is p-independent of B given
C if and only if p(ab,c)p(ab0,c) for all
(a,b,c)?(A,B,C). We say that A is u-independent
of B given C if and only if w(ab,c)w(ab0,c),
where w(ab,c)u(a,b,c)/u(a0,b,c). Finally,
we say that A is strategically independent of B
given C if A is both p- and u- independent of B
given C.
13
Health and Wealth
Shohams Additive Utility Independence u(H,W) -
u(-H,-W) u(-H,W) - u(-H,-W)
u(H,-W)-u(-H,-W) u(H,W) u(-H,-W) u(-H,W)
u(H,-W) u-independence u(H,W) /u(-H,-W)
u(-H,W) / u(-H,-W) u(H,-W)/u(-H,-W) u(H,W)
u(-H,-W) u(-H,W) u(H,-W)
14
Consequences of strategic independence If
strategic independence holds, decisions regarding
A and B given C can be effectively decentralized
the agent who decides on A does not need to know
about B in order to choose its optimal action
given C, and vice versa. (La Mura and Shoham
1999)
15

A G net is comprised of the following elements
a finite, ordered set of nodes XN
(N1,2,...,n) corresponding to a set of
strategically relevant variables
a partition I of N which determines the
identity of the agent responsible for the
decision at each node (including Nature)
a set of directed (probability) arcs, with no
cycles, representing causal dependencies
a set of undirected (utility) arcs representing
payoff dependencies
for each Xk, two functions p(xkxPP(k)) and
w(xkxUN(k)), where XPP(k) are the probability
parents of Xk and XUN(k) its utility neighbors.

16
Example the Beer/Quiche game
reference point -S,-B,-F (not strong, no beer,
no fight) payoffs utility relative to
reference point, times 12.
p(S)0.9 w1(S-B,-F)1 w1(SB,-F)1 w1(S-B,F)1 w
1(SB,F)1 w2(S-F)1
S
w1(FS)1/2 w1(F-S)1/4 w2(FS)1/2 w2(F-S)2
w1(BS)3/2 w1(B-S)2/3
B
F
17
Another Example a multi-stage game with
observed actions (e.g., the repeated prisoners
dilemma)
A1H0
B1H0
A2H1
B2H1
A1
A2
A3
A4
H1
H2
H3
H0
B1
B2
B3
B4
18
Bayesian rationality in G nets A G frame is an
incompletely specified G net, where only the
probabilities of Natures actions (but not those
of the other agents) are specified. Theorem Any
finite game in extensive form has a G frame
representation. A G net satisfies Bayesian
rationality if, for all k?N and xPP(k) ? XPP(k),
ui(k)(xkxPP(k))gt ui(k)(xkxPP(k)) implies
p(xkxPP(k))0. Corollary For any G frame
there exists a corresponding G net which
satisfies Bayesian rationality.
19

Strategic equilibrium in G nets
A Nash equilibrium (in the agent-strategic form)
is a conditional probability system p?? such
that, for all q ??,
?Xk p(xkxPP(k)) ui(k)(p)(xkxPP(k)) ? ?Xk
q(xkxPP(k)) ui(k)(p)(xkxPP(k)).
Let f? ?? ? be defined by f? ?z (1- ?)v,
where
z is the conditional probability system which
gives uniform probability to all the possible
actions, and
v(p)(xkxPP(k)) p(xkxPP(k))
ui(k)(p)(xkxPP(k)).
Then f? has a fixed point by Brouwers theorem. A
limit point of a sequence of fixed points of f?
as ? goes to zero is a Nash (indeed, a
sequential) equilibrium.

20
Global convergence to equilibrium in G nets Nash
equilibria are zeros of the function F(p),
defined by F(p)(xkxPP(k)) p(xkxPP(k)) -
v(p)(xkxPP(k)). Consider the perturbed
problem F?(p)(xkxPP(k)) p(xkxPP(k)) -
f?(p)(xkxPP(k)). This can be rewritten as
?F0(p) (1- ?)F(p), where F is the target system
whose zeros we want to find, and F0 is the
trivial system p(xkxPP(k)) - z(xkxPP(k)), whose
unique solution is p z. Then F? defines a
convex-linear homotopy h(p,t) F1-t. Its
homotopy paths are very well behaved, and can be
tracked using standard computational methods
(Morgan 1987).
21

First equilibrium
For G nets with generic payoffs, the end point
of the homotopy path generated by h(p,t)
identifies a unique sequential equilibrium (first
equilibrium).
The first equilibrium is uniquely determined by
the G net structure, and can be computed using
standard path-tracking techniques.
Most importantly, strategically independent
variables do not affect the values of
F(p)(xkxPP(k)). It follows that, in the presence
of strategic independencies, convergence to the
zeros of a large system can be reduced to
convergence to the zeros of smaller,
strategically independent subsystems.

22
Computing all equilibria Let G(p) be defined by
G(p)(xkxPP(k)) ui(k)(p)(xPP(k))
F(p)(xkxPP(k)). Since u is strictly positive,
the zeros of G coincide with the zeros of F.
Note that G(p) is a vector of polynomial
functions, whose zeros include all the Nash
equilibria. A Nash equilibrium may not have any
homotopy path converging to it in Rn yet, as it
turns out, one can always get at all of them in
the complex space Cn.
23

Let G0 be the start system defined by
G0j(p) ?jdj pjdj - ?jdj,
where j (xkxPP(k)), dj is the degree of Gj(p),
and ?j and ?j are generic complex constants. Then
G0(p) 0 has ?j dj solutions.
Let h(p,t) be the homotopy defined by h(p,t)(1 -
t) G0(p) t G(p) then the following result
holds.
Theorem (Morgan 1987) Given G, there are sets of
measure zero, A? and A? in Cn such that, if ? ?
A? and ? ? A?, then
the solution set (p,t)? Cn ? 0,1) h(p,t)0
is a collection of non-overlapping, smooth
paths
the paths move from t0 to t1 without
backtracking in t
each geometrically isolated solution of G0 of
multiplicity m has exactly m continuation paths
converging to it.

The Nash equilibria are the real solutions of G
0 in the region delimited by ui(k)(p)(xkxPP(k))
? 1
The mixed-strategy equilibria are the solutions
with multiplicity higher than one
The homotopy paths are very well-behaved, and
can be tracked using standard computational
methods
The procedure applies to general games yet, as
in the single-equilibrium case, one can exploit
the strategic separabilities in the G net
structure in order to simplify the computations.

Conclusions
G nets are a novel multi-agent framework for
strategic inference
G nets are more modular and compact than
standard game-theoretic representations, thanks
to a structured representation of probabilities,
utilities and expected utilities
moreover, they come with two novel computational
procedures which can take advantage of the extra
structure in order to simplify the inference
process
the first procedure selects a unique sequential
equilibrium as a function of the G net structure
while the second identifies all Nash equilibria.