Marco Adelfio - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Marco Adelfio

Description:

Create systems to play arbitrary games (given formal game definitions) ... 10,000 Grand Prize. General Game Playing. Questions: ... – PowerPoint PPT presentation

Number of Views:31
Avg rating:3.0/5.0
Slides: 18
Provided by: Mar5513
Learn more at: http://www.cs.umd.edu
Category:
Tags: adelfio | marco

less

Transcript and Presenter's Notes

Title: Marco Adelfio


1
General Game Playing (GGP)
  • Marco Adelfio
  • CMSC 828N Spring 2009

2
Classic Game Playing AI
Deep Blue
TD-Gammon
Poki
3
General Game Playing AI
GGP Agent
4
General Game Playing
  • GGP Goals
  • Create systems to play arbitrary games (given
    formal game definitions)
  • Eliminate game-specific strategies
  • Emphasize generic strategy formulation
  • Competition created by Stanford Logic Group
  • Hosted during AAAI conference since 2005
  • 10,000 Grand Prize

5
General Game Playing
  • Questions
  • What additional challenges arise for GGP agents?
  • How should a GGP agent evaluate game states?
  • Can a GGP agent transfer knowledge between games?

6
General Game Playing
  • Finitely many players, states
  • Game play controlled by Game Manager over network
  • Players act synchronously (noops allowed)
  • Time limits enforced
  • Basic agent must
  • Understand rule specification
  • Respond to game states with legal actions
  • Recognize a terminal state and its payoffs

7
Game Definition Language
  • A game definition must logically define
  • Set of states in the game
  • Legal actions for each player from a given game
    state
  • Transition function
  • Initial state
  • Terminal states and their payoffs

8
Game Definition Language - Example
  • (role p1)
  • (role p2)
  • (init (cell 1 1 b))
  • (init (cell 1 2 b))
  • (init (control p1)
  • (lt (legal ?w (mark ?x ?y))
  • (true (cell ?x ?y b))
  • (true (control ?w)))
  • (lt (next (cell ?m ?n x))
  • (does p1 (mark ?m ?n))
  • (true (cell ?m ?n b)))

(lt (row ?m ?x) (true (cell ?m 1 ?x))
(true (cell ?m 2 ?x)) (true (cell ?m 3
?x))) (lt (line ?x) (row ?m ?x)) (lt (line ?x)
(column ?m ?x)) (lt (line ?x) (diagonal
?x)) (lt (goal p1 100) (line x)) (lt (goal
p1 0) (line o) (lt terminal (line x))
9
Game Communication
Game Manager Message Game Player Response
(START MATCH.435 WHITE description 90 30) READY
(PLAY MATCH.435 (NIL NIL)) (MARK 2 2)
(PLAY MATCH.435 ((MARK 2 2) NOOP))) NOOP
(PLAY MATCH.435 (NOOP (MARK 1 3)) (MARK 1 2)
(PLAY MATCH.435 ((MARK 1 2) NOOP)) NOOP
... ...
(STOP MATCH.435 ((MARK 3 3) NOOP) DONE
10
General Game Playing
  • Design Challenges
  • Indeterminacy
  • Size
  • Multi-game Commonalities
  • Opponent Recognition

11
AAAI Competition Past Winners
  • 2005 - ClunePlayer (UCLA)
  • 2006 - FluxPlayer (Technical University of
    Dresden)
  • 2007 - CADIA (Reykjavik University)
  • 2008 - CADIA (Reykjavik University)

12
Agent 1 ClunePlayer
  • Approach Minimax
  • Problem Needs to assign values to intermediate
    game states in arbitrary games.
  • Solution
  • Calculate a vector of generic features at each
    node
  • Simulate games to determine which features are
    stable and correlated with either payoff or
    control
  • When running minimax, use a combination of those
    scores as the evaluation heuristic

13
Agent 2 CADIA-Player
  • Approach UCT (Variant of Monte Carlo simulation)
  • Monte Carlo
  • Pick random actions for each player to descend
    the tree
  • After reaching a terminal state, update expected
    payoff Q(s,a) for each visited state s and action
    a
  • Introduces explore/exploit tradeoff

14
Agent 2 CADIA-Player
  • UCT (Upper Confidence bound for Trees)
  • Balance exploration and exploitation
  • Give bonus to less travelled paths

15
Agent 3 UTexas LARG
  • Approach Knowledge Transfer
  • Uses lessons from past games to improve play in
    new games
  • War Games!
  • Determines whether a new game is isomorphic or
    similar to a previous game. If so, transfer
    estimated rewards

16
Summary
  • General Game Playing introduces a different set
    of challenges than designing game-specific AI
  • Biggest challenge is evaluating states in a novel
    game
  • Better understanding of general strategy
    formation has many applications

17
References
  • GGP Website http//games.stanford.edu/
  • Hilmar Finnson. CADIA-Player A General Game
    Playing Agent. MSc Thesis, School of Computer
    Science, Reykjavik University. 2007.
  • Kuhlmann, Gregory and Peter Stone. Graph-Based
    Domain Mapping for Transfer Learning in General
    Games. Lecture Notes in Computer Science, Volume
    4701/2007.
Write a Comment
User Comments (0)
About PowerShow.com