Asymptotic Analysis for Large Scale Dynamic Stochastic Games - PowerPoint PPT Presentation

About This Presentation
Title:

Asymptotic Analysis for Large Scale Dynamic Stochastic Games

Description:

Asymptotic Analysis for Large Scale Dynamic Stochastic Games Sachin Adlakha, Ramesh Johari, Gabriel Weintraub and Andrea Goldsmith DARPA ITMANET Meeting – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 13
Provided by: Rame84
Learn more at: http://www.mit.edu
Category:

less

Transcript and Presenter's Notes

Title: Asymptotic Analysis for Large Scale Dynamic Stochastic Games


1
Asymptotic Analysis for Large Scale Dynamic
Stochastic Games
  • Sachin Adlakha, Ramesh Johari, Gabriel Weintraub
    and Andrea Goldsmith
  • DARPA ITMANET Meeting
  • September 13-14, 2009.

2
Asymptotic Analysis for Large Scale Dynamic
Stochastic Games S. Adlakha, R. Johari, G.
Weintraub, A. Goldsmith
  • MAIN RESULT Taxonomy of Stochastic Games
  • HOW IT WORKS
  • Existence results for competitive model are based
    on continuity arguments.
  • AME property for a competitive model is derived
    from the fact that opponents at higher states
    lead to lower payoff.
  • ASSUMPTIONS AND LIMITATIONS
  • Mean field requires all nodes to interact with
    each other applies to Dense networks only
  • Coordination model requires different existence
    proof

  • General Framework for interaction of multiple
    devices
  • Further, our results
  • provide common thread to analyze both
    competitive and coordination models.
  • provide exogenous conditions for existence and
    AME for competitive models
  • provide results on a special class of
    coordination model linear quadratic tracking
    games.

Many cognitive radio models do notaccount for
reaction of other devicesto a single devices
action. In prior work, we developed a
generalstochastic game model to tractably
capture interactions of many devices.
In principle, tracking state of other
devices is complex. We approximate state of other
devices via a mean field limit.
Provide existence and AME results for general
class of coordination games. Our main goal is
to develop arelated model that applies when a
single node interacts with a small number of
other nodes each period.
New Paradigm for analyzing large scale
competitive and coordination games
3
Modeling Interaction between Devices
  • Wireless spectrum sharing
  • Nodes interact with each other.
  • The environment for a single node comprises of
    active devices.
  • Nodes operate in a reactive environment.
  • Markov Perfect Equilibrium (MPE)
  • Standard solution concept for stochastic games.
  • The action of each player depends on the state of
    everyone.
  • Problems
  • MPE is hard to compute.
  • Requires excessive information exchange.

4
Oblivious Equilibrium (OE)
  • Mean field equilibrium concept.
  • Each device reacts to an average state of other
    players.
  • Requires little information exchange.
  • Easy to compute and implement.
  • Questions
  • When does such policies exist?
  • How close is OE to MPE in terms of payoff
    received to a device?

5
Taxonomy of Stochastic Games
  • Competitive models
  • Non-cooperative games
  • Payoff characterized by non-increasing
    differences between own state and opponent states
    sub modular structure.
  • Opponents at higher state leads to lower payoff.
  • Coordination models
  • Cooperative games
  • Payoff has increasing differences between own
    state and opponent state super modular
    structure.
  • Payoff depends on how close are nodes to other
    players.
  • Contribution

6
State of the Art
  • Generalized the idea of OE to general stochastic
    games Allerton 07.
  • Unified existing models, such as LQG games, via
    our framework CDC 08.
  • Exogenous conditions for approximating MPE using
    OE for linear dynamics and separable payoffs
    Allerton 08.
  • Current Results
  • Exogenous conditions on model primitives which
  • Prove the existence of an oblivious equilibrium
    for competitive models.
  • Show that OE is close to MPE asymptotically for
    competitive models.

7
Our model
  • m players
  • State of player i is xi action of player i is ai
  • State evolution
  • Payoff
  • where f-i empirical distribution of other
    players states

8
Common Assumptions
  • A1 The state transition function is concave in
    state and action and has decreasing differences
    in state and action.
  • A2 For any action, is a non-increasing
    function of state, non-decreasing function of
    action and has negative drift at zero action.
  • A3 The payoff function is jointly concave in
    state and action and has decreasing differences
    in state and action.
  • A4 The first derivative of the payoff function
    w.r.t state becomes negative as the state
    increases.

These assumptions imply that the optimal policy
is non-increasing and asymptotically goes to zero.
9
Competitive Model - Assumptions
  • A5 The payoff function has decreasing
    difference between state and f-i and between
    action and f-i .
  • Ordering relation on f-i first order stochastic
    dominance.
  • A6 The payoff decreases as f increases. That
    is,
  • if f1 f2, then ¼(x, a, f1) ¼(x, a, f2).
  • A7 The logarithm of the payoff is Gateaux
    differentiable w.r.t. f-i.
  • Define
  • A8 Assume that the payoff function is such
    that g(y) O(yK) for some K.

10
Main Result Competitive Model
  • Under A1-A8, OE exists for competitive models
    and OE payoff is approximately optimal over
    Markov policies, as m ? 1.
  • In other words, OE is approximately an MPE.
  • The key point here is that no single player is
    overly influential and the true state
    distribution is close to the time averageso
    knowledge of other players policies does not
    significantly improve payoff.
  • Advantage
  • Each player can use oblivious policy without loss
    in performance.

11
Competitive vs. Coordination Models
  • Competitive and coordination models have
    significant differences.
  • Existence in competitive models comes from
    continuity arguments.
  • For coordination model, assumption A6 does not
    holds - requires different existence proof.
  • This dichotomy exists even in single shot games.
  • We have results for a special class of
    coordination games linear quadratic tracking
    models (a generalization of model by Caines et.
    al.)

12
Main Contributions and Future Work
  • Provide common thread to analyze both competitive
    and coordination model.
  • Provide exogenous conditions for existence and
    AME property for competitive model.
  • Existence results are important for these models
    to be meaningful.
  • Provide results on a special class of
    coordination model linear quadratic tracking
    games.
  • Future Work
  • Provide exogenous conditions for existence and
    AME property for general coordination games.
  • Develop similar models where a single node
    interacts with a small set of nodes at each time
    period.
  • Apply these models to interfering transmissions
    between energy constrained nodes.
Write a Comment
User Comments (0)
About PowerShow.com