Title: Mixing
1Mixing
A tutorial on Markov chains
- Dana Randall
- Georgia Tech
( Slides at www.math.gatech.edu/randall )
2Outline
- Fundamentals for designing a Markov chain
- Bounding running times (convergence rates)
- Connections to statistical physics
3Markov chains for sampling
Given A large set (matchings, colorings,
independent sets,)
Main Q What do typical elements
look like?
4Markov chains
Andrei Andreyevich Markov 1856-1922
5Sampling using Markov chains
State space ?
( ? cn )
6Sampling using Markov chains
State space ?
( ? cn )
- Step 1. Connect the state space.
7Basics of Markov chains
x
y
H
8The stationary distribution p
9Sampling from non-uniform distributions
Q What if we want to sample from some other
distribution?
- Step 2. Carefully define the
- transition probabilities.
10 The Metropolis Algorithm
(MRRTT 53)
Propose a move from x to y as before, but accept
with probability
min (1, p(y)/p(x))
(with remaining probability stay at x).
11Basics continued
- Step 1. Connect the state space.
- Step 2. Carefully define the
- transition probabilities.
Starting at any state x0, take a random walk for
some number of steps . . . and output the final
state (from p?).
12The mixing rate
Defn The total variation distance is
Pt,p max __ ? Pt(x,y) - p(x).
1
2
xÎ ?
yÎ ?
A Markov chain is rapidly mixing if t(e)
is poly (n, log(e-1)).
13Spectral gap
Let 1 l1 gt l2 l? be
the eigenvalues of P.
Defn Gap(P) 1-l2 is the
spectral gap.
14Outline
- Fundamentals for designing a Markov chain
- Bounding running times (convergence rates)
- Connections to statistical physics
15Outline for rest of talk
- Techniques
- Coupling
- Flows and paths
- Indirect methods
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
16Coupling
17Coupling
Simulate 2 processes
18Coupling
Defn A coupling is a MC on ? x ?
- Each process Xt, Yt is a faithful copy of
the original MC, - If Xt Yt, then Xt1 Yt1.
19Ex1 Walk on the hypercube
- MCCUBE
- Start at v0(0,0,,0).
- Repeat
- - Pick i Î n, b Î 0,1.
- - Set vi b.
Symmetric, ergodic p is uniform.
Mixing time? Use coupling
20Outline
- Techniques
- Coupling
- - path coupling
- Flows and paths
- Indirect methods
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
21Ex 2 Colorings
Given A graph G (max deg d), k gt 1. Goal Find
a random k-coloring of G.
- MCCOL (Single point replacement)
- Starting at some k-coloring C0
- Repeat
- - With prob 1/2 do nothing.
- - Pick v Î V, c Î k
- - Recolor v with c, if possible.
The lazy chain
If k d 2, then the state
space is connected.
(Therefore p is uniform.)
22Path Coupling
Bubley,Dyer,Greenhill97-8
Coupling Show for all x,y Î W, E D
(dist(x,y)) lt 0.
-
Path coupling Show for all u,v s.t.
dist(u,v)1, that E D (dist(u,v)) lt
0.
-
23Path coupling for MCCOL
Thm MCCOL is rapidly mixing if k 3d.
(Jerrum 95)
Pf Use path coupling dist(x,y) 1.
24Summary Coupling
Pros Can yield very easy proofs
Cons Demands a lot from the chain
- Extensions
- Careful coupling (k 2d) (Jerrum95)
- Change the MC (Luby-R-Sinclair95)
- Macromoves
- - burn in (Dyer-Frieze01, Molloy02)
- - non-Markovian couplings
- (Hayes-Vigoda03)
25Outline
- Techniques
- Coupling
- Flows and paths
- Indirect methods
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
26Conductance and flows
(Jerrum-Sinclair88)
27Min cut Max flow
(Sinclair92)
paths gxy from xÎ?, to yÎ?, x ? y,
carrying p(x)p(y) units of flow.
?
28Ex 3 Back to the hypercube
- Define a canonical path from s to t.
- Bound the number of paths through (u,v) Î E.
29Outline
- Techniques
- Coupling
- Flows and paths
- Indirect methods
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
30Ex 4 Sampling matchings
31Ex 4 Sampling matchings
- MCMATCH
- Starting at M0, repeat
- Pick e (u,v) Î E
- If e Î M, remove e - If u and v unmatched in
M, add e - If u matched (by e) and v
unmatched (or vice versa), add e and remove
e - Otherwise do nothing.
Thm Coupling wont work!
(Kumar-Ramesh99)
32Mixing time of MCMATCH
s
t
Å
33Outline
- Techniques
- Coupling
- Flows and paths
- Indirect methods
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
34Ex 5 Independent Sets
Goal Given l, sample ind. set I
with prob p(I) lI/Z, Z ?J
lJ.
35Slow mixing of MCIND (large l)
(Even)
(Odd)
36Summary Flows
Pros Offers a combinatorial approach to
mixing especially useful for proving slow
mixing.
Cons Requires global knowledge of the chain
to spread out paths.
Extensions Balanced flows
(Morris-Sinclair99) MCMC -- Major
highlights - The permanent
(Jerrum-Sinclair-Vigoda02) -
Volume of a convex polytope
(Dyer-Frieze-Kannan89, )
37Outline
- Techniques
- Coupling
- Flows and paths
- Indirect methods
- - Comparison
- - Decomposition
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
38Comparison
(Diaconis,Saloff-Coste93)
39Comparison
x
y
known P
_
_
(x,y) Î P gx,y (using P) G(z,w) is the
set of paths gx,y using (z,w)
w
unknown P
z
_
1
Thm Gap(P) Gap(P).
A
40Comparison, aka . . .
Adjacency . . . The Matrix Reloaded
41Disjoint decomposition
(Madras-R.96, Martin-R.00)
A2
A1
P
A5
A3
A4
A6
?
42Ex 6 MCIND on small ind. sets
For G(V,E)
Let ? ind. sets of G ?k ind.
sets of size k.
43Ind. sets w/bounded size (cont.)
Thm MCIND is rapidly mixing on
K
Ç
?k , where KV/2(?1).
k 1
MCSWAP
?0 ?1 ?2 . . . ?K-1 ?K
Projection
Restrictions
a0 a1 a2 . . .aK-1 aK
?k
44The Restrictions of MCswap
Projection
Restrictions
45Summary Indirect methods
Pros Offer a top down approach allow hybrid
methods to be used..
Cons Can increase the complexity.
Extensions Comparison thm for log-Sobolev
(Diaconis-Saloff-Coste96)
Comparison for Glauber dynamics
(R.-Tetali 98)
Decomposition for log-Sobolev
(Jerrum-Son-Tetali-Vigoda 02)
46Outline
- Techniques
- Coupling
- Flows and paths
- Hybrid methods
-
- Problems
- Walk on the hypercube
- Colorings
- Matchings
- Independent sets
- Connections with statistical physics
- - problems
- - algorithms
- - physical insights
47Why Statistical Physics?
- They have a need for sampling
- Use many interesting heuristics
- Great intuition
- Experts on large data sets
- Microscopic Macroscopic
- details behavior
- (i.e., phase transitions)
48 49Models from statistical physics
Hardcore model
Potts model
(3-colorings) (Independent
sets) (Matchings)
(Min cut)
-
-
-
-
Dimer model
50Models (cont.)
51Models (The physics perspective)
Given A physical system ? s Define A
Gibbs measure as follows
H(s) (the Hamiltonian),
b 1/kT (inverse temperature),
p(s) e-bH(s)/ Z,
52Physics perspective (cont.)
Q What about on the infinite lattice?
Use conditional probabilities
53Phase transitions Ind. sets
Low temperature long range effects
High temperature ? effects die out
TC indicates a phase transition.
54Slow mixing of MCIND revisited
R/B
8
55 Group by of fault lines
Fault lines are vacant paths of width 2 from top
to bottom (or left to
right).
56Peierls Argument
57Peierls Argument cont.
2n/2 3l
S1
SB
( l - n/2 more points)
(and similarly for S2, S3, ) x
58Conclusions
Techniques
59Conclusions
Open problems
- Sampling 4,5,6-colorings on the grid.
- Sampling perfect matchings on
- non-bipartite graphs.
- Sampling acyclic orientations in a graph.
- Sampling configurations of the Potts
- model (a generalization
- of Ising, but with more colors).
- How can we further exploit phase
- transitions? Other physical intuition?
-
...
60(No Transcript)