Bayesian Ranking: From Xbox Live to Computer Go - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Bayesian Ranking: From Xbox Live to Computer Go

Description:

Title: Bayesian Ranking: From Xbox Live to Computer Go Author: Ralf Herbrich Thore Graepel Last modified by: Dr Fabien A. P. Petitcolas Created Date – PowerPoint PPT presentation

Number of Views:50
Avg rating:3.0/5.0
Slides: 31
Provided by: RalfHerbr
Category:

less

Transcript and Presenter's Notes

Title: Bayesian Ranking: From Xbox Live to Computer Go


1
Bayesian RankingFrom Xbox Live to Computer Go
  • Ralf Herbrich and Thore Graepel

2
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

3
Motivation
  • Microsoft is the leader in online video gaming
    (Xbox Live).
  • Centralised game-independent service.
  • Gamercard, Achievements, TrueSkill, etc.
  • What makes playing online games fun?
  • Good network connection (Broadband).
  • Seamless setup (Xbox Live).
  • Competitive matches (Ranking!).

4
Ranking in Video Games
  • Problem Setting
  • k teams of n1,,nk many players compete.
  • The outcome is a ranking between the teams
    (including draws).
  • Questions
  • Skill si of each player such that
  • Global ranking between all players.
  • High quality of match between k teams.

5
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

6
The Bayesian Approach
  • Classical logic deals with certain statements.
  • In the real world, uncertainty is abundant.
  • Degree of Belief (logic 0 or 100).

Player is skill is between 35 and 40
and Player js skill is between 30 and 35
Player i won against Player j
Bayesian Approach Probability for Logic under
Uncertainty P(AB) P(BA) P(A) / P(B)
Prior
Posterior
Likelihood
Evidence
7
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

8
TrueSkill Skill Belief
  • Track two numbers per player
  • µ Average skill of player
  • s Uncertainty about skill of player
  • Benefits
  • Faster Skill Learning
  • Better Matchmaking
  • Accurate Prediction of Game Outcomes

s
Belief in Skill Level
µ

10
15
20
25
30
35
40
Skill Level
9
TrueSkill Likelihood
  • Likelihood
  • Game outcomes are all permutations including
    draws between pairs of teams.
  • Latent performance model for players

P(game outcomes1,,sn) P(tis are in game
outcome order s1,,sn)
  • Team performance, ti, is sum of players
    performances in the team.

10
Likelihood Example Two Players
0.2
7
Player 2 wins
0.18
6
Players 1 and 2 draw
0.16
5
0.14
0.12
4
Probability density
Performance of Player 2
0.1
3
0.08
0.06
2
Players 1 and 2 draw
0.04
Player 1 wins
1
0.02
Player 2 wins
Player 1 wins
0
0
0
1
2
3
4
5
6
7
-8
-6
-4
-2
0
2
4
x
- x
Performance of Player 1
1
2
11
TrueSkill Skill Updates
Game Outcome
Belief in Skill Level
0
10
20
30
40
50
12
Two Team Update Algorithm
Draw
Win\Loss
bi 1 (for players in winning team) bi -1
(for players in losing team) bi 0 (for all
other players)
13
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

14
The True Skill System Applications
  • Leaderboard
  • (Conservative) skill estimate µ - 3s
  • Matchmaking
  • Competitive game Fun game!
  • Match quality Probability of a draw
  • Team Balancing
  • Maximise match quality by greedy search.

15
TrueSkill Matchmaking
Lobby
Possible Matches
?
16
Alternative Ranking Systems ELO
  • Only 2 players and disregard draws.
  • Assumptions
  • Use a moving average estimator for pi,j
  • Assume a constant average skill ? each game is a
    zero-sum game.
  • Make a linear approximation of F.

17
ELO Properties
  • ELO is approximation of the TrueSkill system.
  • ELO deteriorates without matchmaking!
  • ELO only maintains one point the mode.
  • ELO has a fixed step-size per update.
  • ELO cannot deal with team games or multiple (gt 2)
    player games.
  • People need a provisional ranking in ELO starting
    at the mid-point of the scale.

18
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

19
Results Halo 2 Multiplayer Beta
  • 5 different hoppers
  • Free-For-All 60261 games (5946 players)
  • 1 vs. 1 6240 games (1672 players)
  • 5 maps, 3 different game variants.
  • Matchmaking was relaxed to level gap 9.
  • Parameters in all experiments
  • Performance variation factor 60
  • Draw Probability 5
  • Dynamics variation factor 2

20
Free-For-All char vs. SQLwildman?
40
35
30
25
Level
20
15
char (
)
TrueSkill
10
)
SQLwildman (
TrueSkill
char (Halo 2)
5
SQLwildman (Halo 2)
0
0
100
200
300
400
Number of games played
21
Free-For-All char vs. SQLwildman?
100
char wins
SQLwildman wins
80
Both players draw
60
Winning probability
40
20
5/8 games won by char
0
0
100
200
300
400
500
Number of games played
22
Near Normal Level Distribution 650,000 Players
  • TrueSkill Analysis of Halo 2 (Nov. Dec. 2004)

50
40
30
Level
20
10
1
0
0.5
1
1.5
2
2.5
3
Level Occupancy
4
x 10
x 10
23
TrueSkill
  • Skill based ranking instead of experience based
    ranking for better matchmaking.
  • TrueSkill system is
  • a generalisation of ELO
  • tracks a belief distribution
  • can deal with multiple team/players/draws
  • Every Xbox 360 Live game uses TrueSkill ranking
    matchmaking!

24
Overview
  • Motivation Ranking in Video Games
  • Bayesian Player Ranking TrueSkill
  • Skill Belief, Likelihood, Update Equation
  • Applications in Online Gaming
  • Numerical Results on Halo 2 data
  • Ranking Moves Computer Go
  • Conclusion

25
Liberty Fast Pattern Based Computer Go
  • Go Simple rules yet very complex game.
  • Computer Go
  • Infancy (best programs at weak amateur level).
  • Problems Evaluation and Branching factor (250).
  • New grand challenge of AI (replacing Chess).
  • Idea Learning good moves from expert play.
  • Applications
  • Reduce branching factor for search.
  • Fast pattern based Go engine.

26
From Local Patterns to Probability of Moves
?25, ? 5.2
Not in database!
Black to move
27
Harvesting and Learning
  • Two processes
  • Harvesting patterns.
  • Ranking patterns.
  • We learn the value of these patterns using a
    modified Bayesian TrueSkill ranking system.
  • Partial ranking from every expert move
  • Move made wins over any other move available on
    the board.
  • Nothing is known about the ranking of the
    un-played moves.

28
Better than State-of-the-Art
1
Liberty (120K games)
Liberty (20K games)
Liberty (20K games)
Werf et al. (2002)
0.8
0.6
55 in top 5
cumulative probability
0.4
32 top
0.2
0
1
5
10
15
20
25
30
expert move rank
29
Better than State-of-the-Art
1
Liberty (120K games)
Liberty (20K games)
Liberty (20K games)
Werf et al. (2002)
0.8
0.6
cumulative probability
0.4
0.2
10 alternative moves
50 alternative moves
max. 50 alternative moves
0
expert move rank
1
5
10
15
20
25
30
30
Better Prediction Early
31
Bigger Patterns Early
32
Bigger Patterns ? Better Predictions
33
Bayesian Ranking for Go
  • Bayesian ranking makes full use of the
    information available from expert moves.
  • Simple features used in the approach already
    beats state-of-the-art prediction methods.
  • Approach is ideal for server-side Go AI
  • Very fast at move selection time.
  • Large memory footprint.
  • Planned extension to 1,000,000 game records and
    context-aware patterns.

34
Conclusions
  • Bayesian Ranking is a powerful technique
  • TrueSkill generalises ELO and has a large
    influence on the online gaming experience of Xbox
    gamers.
  • Provides a principled and efficient way for
    learning the value of local patterns in the game
    of Go.
Write a Comment
User Comments (0)
About PowerShow.com