1
Semi-Supervised Learning Using Randomized Mincuts
  • Avrim Blum, John Lafferty, Raja Reddy, Mugizi
    Rwebangira
  • Carnegie Mellon

2
Motivation
  • Often have little labeled data but lots of
    unlabeled data.
  • We want to use the relationships between the
    unlabeled examples to guide our predictions.
  • Assumption: Similar examples should generally
    be labeled similarly.

3
Learning using Graph Mincuts (Blum and Chawla, ICML 2001)
4
Construct an (unweighted) Graph
5
Add auxiliary super-nodes
6
Obtain s-t mincut
(Figure: the s-t mincut drawn between the + super-node side and the - super-node side.)
7
Classification

(Figure: nodes on the + side of the mincut are labeled +, nodes on the - side are labeled -.)
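The procedure on slides 3-7 can be summarized in a few lines of code. The following is a minimal sketch, not the authors' implementation: it assumes a k-NN similarity graph built with scikit-learn, uses networkx's max-flow/min-cut routine (networkx >= 3.0 API), and the helper name and parameters are illustrative.

```python
# A minimal sketch (not the authors' code) of mincut-based semi-supervised
# learning: build a k-NN graph, attach +/- super-nodes, take the s-t mincut.
import networkx as nx
import numpy as np
from sklearn.neighbors import kneighbors_graph

def mincut_classify(X, y, n_neighbors=5):
    """X: feature matrix; y: +1 / -1 for labeled examples, 0 for unlabeled."""
    # Unweighted k-NN graph over all examples (labeled and unlabeled).
    A = kneighbors_graph(X, n_neighbors=n_neighbors, mode='connectivity')
    G = nx.from_scipy_sparse_array(A)      # assumes networkx >= 3.0
    nx.set_edge_attributes(G, 1, 'capacity')

    # Auxiliary super-nodes: a source tied to every '+' example and a sink
    # tied to every '-' example, with infinite capacity on those ties.
    for i, label in enumerate(y):
        if label == +1:
            G.add_edge('s', i, capacity=float('inf'))
        elif label == -1:
            G.add_edge(i, 't', capacity=float('inf'))

    # The s-t mincut partitions the graph; the source side is labeled '+'.
    _, (source_side, _) = nx.minimum_cut(G, 's', 't')
    return np.array([+1 if i in source_side else -1 for i in range(len(y))])
```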
8
  • Problem
  • Plain mincut gives no indication of its
    confidence on different examples.
  • Solution
  • Add random weights to the edges.
  • Run plain mincut and obtain a classification.
  • Repeat the above process several times.
  • For each unlabeled example, take a majority vote.
  • The margin of the vote gives a measure of
    confidence (a code sketch of this procedure
    follows below).
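To make the recipe concrete, here is a rough sketch (not the authors' code) that reuses a graph G built as in the earlier mincut sketch, with unit capacities and 's'/'t' super-nodes; the noise range, number of rounds, and seed are illustrative choices rather than values from the paper.

```python
# A rough sketch of the randomized-mincut vote over a prebuilt graph G.
import numpy as np
import networkx as nx

def randomized_mincut_vote(G, n_examples, n_rounds=20, seed=0):
    rng = np.random.default_rng(seed)
    votes = np.zeros(n_examples)
    for _ in range(n_rounds):
        H = G.copy()
        # Add small random weights to every example-example edge.
        for u, v, attr in H.edges(data=True):
            if u not in ('s', 't') and v not in ('s', 't'):
                attr['capacity'] = 1.0 + rng.random()
        # Re-run plain mincut on the perturbed graph and record the labels.
        _, (source_side, _) = nx.minimum_cut(H, 's', 't')
        for i in range(n_examples):
            votes[i] += 1.0 if i in source_side else -1.0
    # The sign of the vote is the prediction; the normalized margin of the
    # vote serves as a confidence estimate for each example.
    predictions = np.where(votes >= 0, +1, -1)
    margins = np.abs(votes) / n_rounds
    return predictions, margins
```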

9
Before adding random weights

(Figure: the unweighted graph and the mincut it induces.)
10
After adding random weights

(Figure: the same graph with random edge weights and the resulting mincut.)
11
  • PAC-Bayes
  • PAC-Bayes bounds show that the average of
    several hypotheses that are all consistent with
    the training data will probably be more accurate
    than any single hypothesis.
  • In our case, each distinct cut corresponds to a
    different hypothesis.
  • Hence the average of these cuts will probably be
    more accurate than any single cut.

12
  • Markov Random Fields
  • Ideally we would like to assign a weight to each
    cut in the graph (a higher weight to small cuts)
    and then take a weighted vote over all the cuts
    in the graph.
  • This corresponds to a Markov Random Field model.
  • We don't know how to do this efficiently, but we
    can view randomized mincuts as an approximation
    (one way to write the weighting is sketched below).
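One common way to make "a higher weight to small cuts" precise, offered here as an illustrative formulation rather than the exact model from the talk, is an Ising-style Markov random field over labelings y that agree with the labeled data:

```latex
% cut(y) counts the graph edges whose endpoints receive different labels.
P(y) \;\propto\; \exp\bigl(-\beta \,\mathrm{cut}(y)\bigr),
\qquad
\mathrm{cut}(y) \;=\; \sum_{(i,j)\in E} \mathbf{1}\bigl[\,y_i \neq y_j\,\bigr].
```

Under this reading, the weighted vote for an unlabeled node i is the marginal probability that y_i = +1, which is hard to compute exactly; randomized mincut can be viewed as approximating it by sampling a handful of small cuts.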

13
Related Work: Gaussian Fields
  • Zhu, Ghahramani, and Lafferty (ICML 2003).
  • Each unlabeled example receives a label that is
    the average of its neighbors.
  • Equivalent to minimizing the squared difference
    of the labels across edges (the energy is written
    out below).
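Written out, in the standard harmonic-function formulation of Zhu, Ghahramani, and Lafferty (2003), the labels f minimize a quadratic energy over the edge weights w_ij, with f clamped to the given labels on the labeled nodes; the minimizer is harmonic, i.e. each unlabeled node takes the weighted average of its neighbors:

```latex
E(f) \;=\; \tfrac{1}{2} \sum_{(i,j)\in E} w_{ij}\,\bigl(f_i - f_j\bigr)^2,
\qquad
f_i \;=\; \frac{\sum_{j \sim i} w_{ij}\, f_j}{\sum_{j \sim i} w_{ij}}
\quad \text{for each unlabeled node } i.
```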

14
  • How to construct the graph?
  • k-NN
  • Graph may not have small balanced cuts.
  • How to learn k?
  • Connect all points within distance d
  • Can have disconnected components.
  • How to learn d?
  • Minimum Spanning Tree
  • No parameters to learn.
  • Gives connected, sparse graph.
  • Seems to work well on most datasets (all three
    constructions are sketched below).
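A rough sketch of the three constructions, assuming scikit-learn and SciPy; the helper names and default parameters are illustrative, and the MST variant symmetrizes the tree and drops edge weights to match the unweighted graphs used earlier.

```python
# Illustrative sketches of the three graph constructions above; helper names
# and defaults are assumptions, not from the paper.
import scipy.sparse as sp
from scipy.sparse.csgraph import minimum_spanning_tree
from sklearn.metrics import pairwise_distances
from sklearn.neighbors import kneighbors_graph, radius_neighbors_graph

def knn_graph(X, k=5):
    # k-NN: may lack small balanced cuts, and k must be chosen.
    return kneighbors_graph(X, n_neighbors=k, mode='connectivity')

def distance_graph(X, d=1.0):
    # Distance threshold: may leave the graph disconnected, and d must be chosen.
    return radius_neighbors_graph(X, radius=d, mode='connectivity')

def mst_graph(X):
    # Minimum spanning tree: parameter-free, connected, and sparse.
    D = pairwise_distances(X)                     # Euclidean by default
    T = minimum_spanning_tree(sp.csr_matrix(D))   # tree as a sparse matrix
    T = T + T.T                                   # symmetrize into an undirected graph
    T.data[:] = 1.0                               # drop weights: unweighted edges
    return T
```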

15
Experiments
  • ONE vs. TWO: 1128 examples
    (8 x 8 array of integers, Euclidean distance).
  • ODD vs. EVEN: 4000 examples
    (16 x 16 array of integers, Euclidean distance).
  • PC vs. MAC: 1943 examples
    (20 newsgroups dataset, TFIDF distance).

16
ONE vs. TWO
17
ODD vs. EVEN
18
PC vs. MAC
19
Accuracy vs. coverage: PC vs. MAC (12 labeled)
20
  • Conclusions
  • We can get useful estimates of the confidence of
    our predictions.
  • Often get better accuracy than plain mincut.
  • Minimum spanning tree gives good results across
    different datasets.

21
  • Future Work
  • Sample complexity lower bounds (i.e. how much
    unlabeled data do we need to see?).
  • More principled way of sampling cuts?

22
THE END
23
Questions?