Title: EM Algorithm and Mixture of Gaussians
1 EM Algorithm and Mixture of Gaussians
- Collard Fabien - 20046056
- Kim Jinsik - 20043152
- Joo Chanhye - 20043595
2 Summary
- Hidden Factors
- EM Algorithm
- Principles
- Formalization
- Mixture of Gaussians
- Generalities
- Processing
- Formalization
- Other Issues
- Bayesian Network with hidden variables
- Hidden Markov models
- Bayes net structures with hidden variables
3 The Problem: Hidden Factors
Hidden factors
- Unobservable / Latent / Hidden
- Modelled as additional variables
- They keep the model simple (fewer parameters)
4 Simplicity details (Graph 1)
Hidden factors
[Diagram: Smoking, Diet, Exercise (2 values each) - 708 priors!]
5 Simplicity details (Graph 2)
Hidden factors
[Diagram: Smoking, Diet, Exercise (2 values each) and Symptom 1, Symptom 2, Symptom 3 (6 values each) - 78 priors]
6 A Solution: EM Algorithm
EM Algorithm
7 Principles: Generalities
EM Algorithm
- Given
- Causes (or factors / components)
- Evidence (the observed data)
- Compute
- The probabilities in the tables connecting causes and evidence
8 Principles: The two steps
EM Algorithm
Parameters: P(effects | causes) and P(causes)
9 Principles: the E-Step
EM Algorithm
- Perception Step
- For each piece of evidence and each cause
- Compute the probabilities
- Find the probable relationships
10 Principles: the M-Step
EM Algorithm
- Learning Step
- Recompute the probabilities
- For each cause event / evidence event pair
- Sum over all evidence events
- Maximize the log likelihood
- Modify the model parameters
11 Formulae: Notations
EM Algorithm
- Terms
- θ: the underlying probability distribution
- x: the observed data
- z: the unobserved (hidden) data
- h: the current hypothesis for θ
- h': the revised hypothesis
- q: a distribution over the hidden variables
- Task: estimate θ from x
- E-step
- M-step (both steps written out below)
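In this notation, a standard way to write the two steps is the variational form used on the following slides, where A(q,h) is the auxiliary function defined there:

    \text{E-step:}\quad q'(z) = \arg\max_q A(q, h) = p(z \mid x, h)
    \text{M-step:}\quad h' = \arg\max_h A(q', h)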
12 Formulae: the Log Likelihood
EM Algorithm
- L(h) measures how well the parameters h fit the data x, given the hidden variables z
- Jensen's inequality holds for any distribution q(z) over the hidden states
- It defines the auxiliary function A(q,h)
- A lower bound on the log likelihood
- This is what we want to optimize (written out below)
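Spelled out (assuming discrete hidden states z; sums become integrals in the continuous case):

    L(h) = \log p(x \mid h) = \log \sum_z p(x, z \mid h) \;\ge\; \sum_z q(z) \log \frac{p(x, z \mid h)}{q(z)} \;=\; A(q, h)

The inequality is Jensen's inequality applied to the concave logarithm.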
13 Formulae: the E-step
EM Algorithm
- Lower bound on the log likelihood
- H(q) is the entropy of q(z)
- Optimize A(q,h) with respect to q
- By distributing the data over the hidden variables (see below)
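One standard way to write the bound and its maximizer over q (holding h fixed):

    A(q, h) = \sum_z q(z) \log p(x, z \mid h) + H(q), \qquad H(q) = -\sum_z q(z) \log q(z)

    q'(z) = \arg\max_q A(q, h) = p(z \mid x, h)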
14 Formulae: the M-step
EM Algorithm
- Maximize A(q,h) with respect to h
- By choosing the optimal parameters
- Equivalent to optimizing the expected complete-data log likelihood, since H(q) does not depend on h (see below)
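In symbols (a standard statement, with q fixed from the E-step):

    h' = \arg\max_h A(q, h) = \arg\max_h \sum_z q(z) \log p(x, z \mid h)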
15 Formulae: Convergence (1/2)
EM Algorithm
- EM increases the log likelihood of the data at every iteration
- Kullback-Leibler (KL) divergence
- Non-negative
- Equals 0 iff q(z) = p(z | x, h) (see the identity below)
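The identity behind this argument, in the notation above (a standard decomposition of the log likelihood):

    L(h) = A(q, h) + \mathrm{KL}\big(q(z) \,\|\, p(z \mid x, h)\big), \qquad \mathrm{KL}(q \,\|\, p) = \sum_z q(z) \log \frac{q(z)}{p(z)}

The E-step drives the KL term to zero so the bound A touches L; the M-step then raises A, and with it L.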
16 Formulae: Convergence (2/2)
- Likelihood increases at each iteration
- Usually, EM converges to a local optimum of L
17 Problems with the likelihood
- It can be a high-dimensional integral
- Latent variables add extra dimensions
- The likelihood term can be complicated
18 The Issue: Mixture of Gaussians
Mixture of Gaussians
- Unsupervised clustering
- A set of data points (the evidence)
- Data generated from a mixture distribution
- Continuous data: mixture of Gaussians
- Not easy to handle
- The number of parameters grows with the square of the dimension (full covariance matrices)
19 Gaussian Mixture model (2/2)
Mixture of Gaussians
- Distribution
- Likelihood of a Gaussian distribution
- Likelihood given a GMM (both written out below)
- N: the number of Gaussians
- w_i: the weight of Gaussian i
- All weights positive
- Weights sum to 1
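Written out, with d the dimension of the data (a standard form of the two likelihoods listed above):

    \mathcal{N}(x; \mu, \Sigma) = \frac{1}{(2\pi)^{d/2} |\Sigma|^{1/2}} \exp\!\left(-\tfrac{1}{2}(x - \mu)^{\top} \Sigma^{-1} (x - \mu)\right)

    p(x \mid h) = \sum_{i=1}^{N} w_i \, \mathcal{N}(x; \mu_i, \Sigma_i), \qquad w_i \ge 0, \quad \sum_{i=1}^{N} w_i = 1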
20 EM for Gaussian Mixture Model
- What for?
- Find the parameters
- Weights w_i = P(C_i)
- Means μ_i
- Covariances Σ_i
- How? (see the sketch after this list)
- Guess the prior distribution
- Guess the components (classes, or causes)
- Guess the distribution function
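The whole procedure fits in a few lines of NumPy. The sketch below is only an illustration of the loop described on the following slides, written for 1-D data with scalar variances; the function name em_gmm and the constants are my own choices, not from the slides.

    import numpy as np

    def em_gmm(x, n_components, n_iter=100, seed=0):
        """Fit a 1-D Gaussian mixture to the data x with plain EM."""
        rng = np.random.default_rng(seed)
        n = len(x)
        # Initialization: uniform weights, random means, a common variance
        w = np.full(n_components, 1.0 / n_components)          # mixture weights
        mu = rng.choice(x, size=n_components, replace=False)   # means
        var = np.full(n_components, np.var(x))                 # variances

        for _ in range(n_iter):
            # E-step: responsibilities P_ij = P(component j | x_i)
            dens = np.exp(-0.5 * (x[:, None] - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)
            resp = w * dens
            resp /= resp.sum(axis=1, keepdims=True)

            # M-step: re-estimate weights, means and variances from the responsibilities
            nj = resp.sum(axis=0)
            w = nj / n
            mu = (resp * x[:, None]).sum(axis=0) / nj
            var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nj
            var = np.maximum(var, 1e-6)  # guard against a component collapsing to zero variance

        return w, mu, var

    # Example: two well-separated 1-D clusters
    data = np.concatenate([np.random.normal(-2.0, 0.5, 300), np.random.normal(3.0, 1.0, 300)])
    print(em_gmm(data, n_components=2))

Each pass of the loop is exactly the E-step and M-step detailed on the next slides.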
21 Processing: EM Initialization
Mixture of Gaussians
- Initialization
- Assign random values to the parameters
22 Processing: the E-Step (1/2)
Mixture of Gaussians
- Expectation
- Pretend the parameters are known
- Assign each data point to a component
23 Processing: the E-Step (2/2)
Mixture of Gaussians
- Competition of hypotheses
- Compute the expected values P_ij of the hidden indicator variables
- Each P_ij gives a membership weight to a data point
- Normalization
- The weights reflect the relative likelihood of class membership
24 Processing: the M-Step (1/2)
Mixture of Gaussians
- Maximization
- Fit each component's parameters to its set of points
25 Processing: the M-Step (2/2)
Mixture of Gaussians
- For each hypothesis
- Find the new parameter values that maximize the log likelihood
- Based on
- The weight of the points in the class
- The location of the points
- Hypotheses are pulled toward the data
26 Applied formulae: the E-Step
Mixture of Gaussians
- Find the responsible Gaussian for every data point
- Use Bayes' rule (see below)
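A standard way to write this application of Bayes' rule, with P_ij the membership weight of point x_i in component C_j:

    P_{ij} = P(C_j \mid x_i) = \frac{w_j \, \mathcal{N}(x_i; \mu_j, \Sigma_j)}{\sum_{k=1}^{N} w_k \, \mathcal{N}(x_i; \mu_k, \Sigma_k)}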
27 Applied formulae: the M-Step
Mixture of Gaussians
- Maximize A(q,h)
- For each parameter of h, set the derivative of A to zero and solve
- Results (see below)
- μ
- σ²
- w
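The resulting updates, written for the one-dimensional case (σ² becomes the covariance matrix Σ_j in higher dimensions), with n data points and n_j = Σ_i P_ij:

    w_j = \frac{n_j}{n}, \qquad \mu_j = \frac{1}{n_j} \sum_i P_{ij}\, x_i, \qquad \sigma_j^2 = \frac{1}{n_j} \sum_i P_{ij}\, (x_i - \mu_j)^2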
28 Potential problems
Mixture of Gaussians
- A Gaussian component shrinks
- Variance → 0
- Likelihood → infinity
- Gaussian components merge
- Same parameter values
- They share the same data points
- A solution: reasonable prior values (a small sketch follows)
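One simple way to act on the "reasonable prior values" idea is to regularize the M-step variance update so it can never reach zero. A sketch only; the prior variance prior_var and the pseudo-count alpha are illustrative choices, not values from the slides:

    import numpy as np

    def regularized_variance(resp_j, x, mu_j, prior_var=1.0, alpha=1e-2):
        """M-step variance update blended with a weak prior to prevent collapse."""
        nj = resp_j.sum()
        empirical = (resp_j * (x - mu_j) ** 2).sum() / nj
        # alpha acts as a pseudo-count of points drawn from the prior variance
        return (nj * empirical + alpha * prior_var) / (nj + alpha)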
29 Bayesian Networks
Other Issues
30 Hidden Markov models
Other Issues
- Forward-Backward Algorithm
- Smoothing rather than filtering
31 Bayes net with hidden variables
Other Issues
- Pretend that the data is complete
- Or invent a new hidden variable
- It has no label or predefined meaning
32 Conclusion
- Widely applicable
- Diagnosis
- Classification
- Distribution Discovery
- Does not work well for complex models
- High dimensions
- → Structural EM