Title: CS 59000 Statistical Machine Learning, Lecture 4
- Yuan (Alan) Qi (alanqi_at_cs.purdue.edu)
- Sept. 2, 2008
Slide 2: Binary Variables (1)
- Coin flipping: heads = 1, tails = 0
- Bernoulli Distribution
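As a minimal sketch of the slide's definition (the function name is my own, not from the slides), the Bernoulli distribution Bern(x | μ) = μ^x (1 − μ)^(1 − x) assigns probability μ to heads (x = 1) and 1 − μ to tails (x = 0):

```python
def bernoulli_pmf(x, mu):
    """Bern(x | mu) = mu**x * (1 - mu)**(1 - x) for x in {0, 1}."""
    assert x in (0, 1) and 0.0 <= mu <= 1.0
    return mu**x * (1.0 - mu)**(1 - x)

# A biased coin with P(heads) = 0.7:
p_heads = bernoulli_pmf(1, 0.7)  # 0.7
p_tails = bernoulli_pmf(0, 0.7)  # ~0.3
```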
Slide 3: Binary Variables (2)
- N coin flips
- Binomial Distribution
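For N independent flips, the binomial distribution Bin(m | N, μ) gives the probability of observing m heads. A self-contained sketch (names are mine, not the slides'):

```python
from math import comb

def binomial_pmf(m, N, mu):
    """Bin(m | N, mu) = C(N, m) * mu**m * (1 - mu)**(N - m):
    probability of m heads in N flips with P(heads) = mu."""
    return comb(N, m) * mu**m * (1.0 - mu)**(N - m)

# One head in two fair flips:
p = binomial_pmf(1, 2, 0.5)  # 0.5
```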
Slide 4: ML Parameter Estimation for Bernoulli (1)
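The slide's equations are not reproduced here, but the standard maximum-likelihood result for the Bernoulli is μ_ML = m/N, the observed fraction of heads. A sketch:

```python
def bernoulli_ml(flips):
    """Maximum-likelihood estimate of mu for Bernoulli data:
    mu_ML = m / N, the fraction of heads (1s) in the flips."""
    return sum(flips) / len(flips)

# Three heads out of four flips:
mu_ml = bernoulli_ml([1, 0, 1, 1])  # 0.75
```

Note the well-known failure mode: three heads in three flips gives μ_ML = 1, i.e. the ML estimate predicts tails can never occur, which motivates the Bayesian treatment on the following slides.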
Slide 5: Beta Distribution
Slide 6: Bayesian Bernoulli
The Beta distribution provides the conjugate
prior for the Bernoulli distribution.
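Conjugacy means the Beta(a, b) prior and the Bernoulli likelihood combine into a posterior that is again Beta, with the observed counts simply added to the prior parameters. A sketch of this update (helper name is mine):

```python
def beta_posterior(a, b, flips):
    """Update a Beta(a, b) prior on mu with observed Bernoulli flips.

    m heads and l tails give the posterior Beta(a + m, b + l):
    the prior parameters a and b act as pseudo-counts of
    previously seen heads and tails.
    """
    m = sum(flips)          # number of heads
    l = len(flips) - m      # number of tails
    return a + m, b + l

a_n, b_n = beta_posterior(2, 2, [1, 1, 0, 1])  # (5, 3)
```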
Slide 7: Prediction under the Posterior
What is the probability that the next coin toss
will land heads up?
Predictive posterior distribution
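Averaging the Bernoulli likelihood over the Beta posterior gives the standard predictive result p(x = 1 | D) = (m + a)/(m + a + l + b), the posterior mean of μ. A sketch (function name is mine):

```python
def predictive_heads(a, b, flips):
    """P(next flip = heads | data) under a Beta(a, b) prior:
    the posterior mean (m + a) / (m + a + l + b)."""
    m = sum(flips)          # heads observed
    l = len(flips) - m      # tails observed
    return (m + a) / (m + a + l + b)

# With a uniform Beta(1, 1) prior and 2 heads out of 3 flips,
# this is Laplace's rule of succession: (2 + 1) / (3 + 2) = 0.6
p = predictive_heads(1, 1, [1, 1, 0])
```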
Slide 8: The Gaussian Distribution
Slide 9: Central Limit Theorem
- The distribution of the sum of N i.i.d. random
variables becomes increasingly Gaussian as N
grows.
- Example: N uniform [0, 1] random variables.
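The uniform example can be simulated directly. A sketch (names and parameters are my own): the mean of N Uniform(0, 1) draws has mean 1/2 and variance 1/(12N), and its histogram looks increasingly Gaussian as N grows.

```python
import random

def mean_of_uniforms(N, trials=20000, seed=0):
    """Sample the mean of N i.i.d. Uniform(0, 1) variables, `trials` times."""
    rng = random.Random(seed)
    return [sum(rng.random() for _ in range(N)) / N for _ in range(trials)]

# For N = 1 the histogram of these samples is flat; for N = 10 it is
# already close to a Gaussian centered at 0.5 with variance 1/120.
samples = mean_of_uniforms(10)
```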
Slide 10: Geometry of the Multivariate Gaussian
Slide 11: Moments of the Multivariate Gaussian (1)
thanks to anti-symmetry of z
Slide 12: Moments of the Multivariate Gaussian (2)
Slide 13: Partitioned Gaussian Distributions
Slide 14: Partitioned Conditionals and Marginals
Slide 15: Partitioned Conditionals and Marginals
Slide 16: Bayes' Theorem for Gaussian Variables
Slide 17: Maximum Likelihood for the Gaussian (1)
- Given i.i.d. data, the log likelihood
function is given by
- Sufficient statistics
Slide 18: Maximum Likelihood for the Gaussian (2)
- Set the derivative of the log likelihood
function to zero, and solve to obtain
- Similarly
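The solutions referred to above are the standard ML estimates: the sample mean, and the variance computed with a 1/N normalizer. A sketch (names are mine):

```python
def gaussian_ml(xs):
    """Maximum-likelihood estimates for a univariate Gaussian:
    mu_ML  = (1/N) * sum(x_n)
    var_ML = (1/N) * sum((x_n - mu_ML)**2)   (note the biased 1/N factor)
    """
    N = len(xs)
    mu = sum(xs) / N
    var = sum((x - mu) ** 2 for x in xs) / N
    return mu, var

mu_ml, var_ml = gaussian_ml([1.0, 2.0, 3.0])  # (2.0, 2/3)
```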
Slide 19: Maximum Likelihood for the Gaussian (3)
- Under the true distribution, E[μ_ML] = μ but E[σ²_ML] = ((N − 1)/N) σ², so the ML
variance estimate is biased. Hence define the unbiased estimate σ̃² = (N/(N − 1)) σ²_ML.
Slide 20: Sequential Estimation
Contribution of the Nth data point, x_N
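The sequential form of the ML mean isolates that contribution: μ_N = μ_{N−1} + (1/N)(x_N − μ_{N−1}), so each new point only enters through its residual against the current estimate. A sketch:

```python
def sequential_mean(xs):
    """Update the ML mean one data point at a time:
    mu_N = mu_{N-1} + (1/N) * (x_N - mu_{N-1}).
    Equivalent to the batch sample mean, but needs no stored history."""
    mu = 0.0
    for n, x in enumerate(xs, start=1):
        mu += (x - mu) / n
    return mu
```

This matches the batch estimate exactly, which is what makes it usable for streaming data.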
Slide 21: Bayesian Inference for the Gaussian (1)
- Assume σ² is known. Given i.i.d. data,
the likelihood function for μ is given by
- This has a Gaussian shape as a function of μ (but
it is not a distribution over μ).
Slide 22: Bayesian Inference for the Gaussian (2)
- Combined with a Gaussian prior over μ,
- this gives the posterior
- Completing the square over μ, we see that
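Completing the square yields the standard Gaussian posterior over the mean: 1/σ_N² = 1/σ₀² + N/σ² and μ_N = σ_N² (μ₀/σ₀² + Σx_n/σ²). A sketch (names are mine):

```python
def gaussian_mean_posterior(mu0, var0, var, xs):
    """Posterior N(mu | mu_N, var_N) for the mean of a Gaussian with
    known noise variance `var` and prior N(mu | mu0, var0):
      1/var_N = 1/var0 + N/var      (precisions add)
      mu_N    = var_N * (mu0/var0 + sum(xs)/var)
    """
    N = len(xs)
    var_N = 1.0 / (1.0 / var0 + N / var)
    mu_N = var_N * (mu0 / var0 + sum(xs) / var)
    return mu_N, var_N
```

With no data the posterior is just the prior; with a very broad prior the posterior mean approaches the sample mean.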
Slide 23: Bayesian Inference for the Gaussian (3)
Slide 24: Bayesian Inference for the Gaussian (4)
- Example: for N = 0, 1, 2 and 10.
Data points are sampled from a Gaussian of mean
0.8, variance 0.1.
Slide 25: Bayesian Inference for the Gaussian (5)
- Sequential Estimation
- The posterior obtained after observing N − 1 data
points becomes the prior when we observe the Nth
data point.
Slide 26: Bayesian Inference for the Gaussian (6)
- Now assume μ is known. The likelihood function
for λ = 1/σ² is given by
- This has a Gamma shape as a function of λ.
Slide 27: Bayesian Inference for the Gaussian (7)
Slide 28: Bayesian Inference for the Gaussian (8)
- Now we combine a Gamma prior with the
likelihood function for λ to obtain
- which we recognize as a Gamma distribution
with
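The resulting Gamma posterior has the standard parameters a_N = a₀ + N/2 and b_N = b₀ + ½ Σ(x_n − μ)². A sketch of the update (function name is mine):

```python
def gamma_precision_posterior(a0, b0, mu, xs):
    """Posterior Gam(lambda | a_N, b_N) for the precision lambda = 1/sigma^2
    of a Gaussian with known mean mu and prior Gam(lambda | a0, b0):
      a_N = a0 + N/2
      b_N = b0 + (1/2) * sum((x_n - mu)**2)
    so the prior acts like 2*a0 "effective" prior observations.
    """
    N = len(xs)
    a_N = a0 + N / 2.0
    b_N = b0 + 0.5 * sum((x - mu) ** 2 for x in xs)
    return a_N, b_N

a_N, b_N = gamma_precision_posterior(1.0, 1.0, 0.0, [1.0, -1.0])  # (2.0, 2.0)
```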
Slide 29: Bayesian Inference for the Gaussian (9)
- If both μ and λ are unknown, the joint likelihood
function is given by
- We need a prior with the same functional
dependence on μ and λ.
Slide 30: Bayesian Inference for the Gaussian (10)
- The Gaussian-gamma distribution
Slide 31: Bayesian Inference for the Gaussian (11)
- The Gaussian-gamma distribution
Slide 32: Bayesian Inference for the Gaussian (12)
- Multivariate conjugate priors
- μ unknown, Λ known: p(μ) is Gaussian.
- Λ unknown, μ known: p(Λ) is Wishart.
- Λ and μ both unknown: p(μ, Λ) is Gaussian-Wishart.
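As a reference sketch (standard forms of these conjugate priors, not reproduced from the slides; the particular parameter symbols are my own choice):

```latex
% mu unknown, Lambda known: Gaussian prior on the mean
p(\boldsymbol{\mu}) = \mathcal{N}\!\left(\boldsymbol{\mu} \mid \boldsymbol{\mu}_0,\ \boldsymbol{\Lambda}_0^{-1}\right)
% Lambda unknown, mu known: Wishart prior on the precision matrix
p(\boldsymbol{\Lambda}) = \mathcal{W}\!\left(\boldsymbol{\Lambda} \mid \mathbf{W},\ \nu\right)
% both unknown: Gaussian-Wishart prior on the pair
p(\boldsymbol{\mu}, \boldsymbol{\Lambda})
  = \mathcal{N}\!\left(\boldsymbol{\mu} \mid \boldsymbol{\mu}_0,\ (\beta \boldsymbol{\Lambda})^{-1}\right)
    \, \mathcal{W}\!\left(\boldsymbol{\Lambda} \mid \mathbf{W},\ \nu\right)
```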