Title: Bayes
1 Bayes' Theorem
2 Remember Language ID?
- Let p(X) = probability of text X in English
- Let q(X) = probability of text X in Polish
- Which probability is higher?
- (we'd also like a bias toward English since it's more likely a priori; ignore that for now)
- Horses and Łukasiewicz are on the curriculum.
- p(x1=h, x2=o, x3=r, x4=s, x5=e, x6=s, ...)
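The letter-by-letter probability above can be sketched concretely. A minimal sketch, assuming made-up character-unigram tables (the numbers are illustrative, not real English or Polish statistics):

```python
# Score "horses" under two hypothetical character-unigram models.
# The probability tables are invented for illustration only.
p_english = {"h": 0.06, "o": 0.08, "r": 0.06, "s": 0.07, "e": 0.12}
p_polish  = {"h": 0.01, "o": 0.08, "r": 0.05, "s": 0.04, "e": 0.09}

def score(text, model):
    """p(x1, x2, ...) as a product of per-character probabilities."""
    prob = 1.0
    for ch in text:
        prob *= model.get(ch, 1e-6)  # tiny floor for unseen characters
    return prob

print(score("horses", p_english))  # higher under the English model
print(score("horses", p_polish))
```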
3 Bayes' Theorem
- p(A | B) = p(B | A) p(A) / p(B)
- Easy to check by removing syntactic sugar
- Use 1: Converts p(B | A) to p(A | B)
- Use 2: Updates p(A) to p(A | B)
- Stare at it so you'll recognize it later
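The identity is easy to check numerically as well. A small sketch on an arbitrary made-up joint distribution:

```python
# Verify p(A|B) = p(B|A) p(A) / p(B) on a tiny made-up joint distribution.
joint = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}  # p(A=a, B=b)

# Marginals by summing out the other variable.
p_A = {a: sum(p for (a2, b), p in joint.items() if a2 == a) for a in (0, 1)}
p_B = {b: sum(p for (a2, b2), p in joint.items() if b2 == b) for b in (0, 1)}

direct = joint[(1, 1)] / p_B[1]                         # p(A=1 | B=1) by definition
via_bayes = (joint[(1, 1)] / p_A[1]) * p_A[1] / p_B[1]  # p(B|A) p(A) / p(B)
assert abs(direct - via_bayes) < 1e-12
```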
4 Language ID
- Given a sentence x, I suggested comparing its prob in different languages
- p(SENT=x | LANG=english)  (i.e., p_english(SENT=x))
- p(SENT=x | LANG=polish)  (i.e., p_polish(SENT=x))
- p(SENT=x | LANG=xhosa)  (i.e., p_xhosa(SENT=x))
- But surely for language ID we should compare
- p(LANG=english | SENT=x)
- p(LANG=polish | SENT=x)
- p(LANG=xhosa | SENT=x)
5 Language ID
- For language ID we should compare
- p(LANG=english | SENT=x)
- p(LANG=polish | SENT=x)
- p(LANG=xhosa | SENT=x)
- For ease, multiply by p(SENT=x) and compare
- p(LANG=english, SENT=x)
- p(LANG=polish, SENT=x)
- p(LANG=xhosa, SENT=x)
- Must know prior probabilities; then rewrite as
- p(LANG=english) p(SENT=x | LANG=english)
- p(LANG=polish) p(SENT=x | LANG=polish)
- p(LANG=xhosa) p(SENT=x | LANG=xhosa)
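A minimal sketch of this comparison; the priors and likelihoods below are made-up numbers (chosen so the joint probabilities come out to the values used on the later slides):

```python
# Compare p(LANG) * p(SENT=x | LANG) across languages.
# Priors and likelihoods are invented illustrative numbers.
prior = {"english": 0.7, "polish": 0.2, "xhosa": 0.1}          # p(LANG)
likelihood = {"english": 1e-5, "polish": 4e-5, "xhosa": 5e-5}  # p(SENT=x | LANG)

joint = {lang: prior[lang] * likelihood[lang] for lang in prior}  # p(LANG, SENT=x)
best = max(joint, key=joint.get)
print(best, joint[best])
```

Note that the winner need not have the highest likelihood: xhosa assigns x the highest likelihood here, but its low prior drags the joint down.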
6 Let's try it!
First we pick a random LANG, then we roll a random SENT with the LANG dice.
- p(SENT=x | LANG=english)
- p(SENT=x | LANG=polish)
- p(SENT=x | LANG=xhosa)
[Diagram: the three likelihoods, annotated "best", "best", and "best compromise"; summing the total over all ways of getting SENT=x gives p(SENT=x).]
600.465 - Intro to NLP - J. Eisner
7 Let's try it!
First we pick a random LANG, then we roll a random SENT with the LANG dice.
- p(LANG=english, SENT=x) = 0.000007
- p(LANG=polish, SENT=x) = 0.000008  (best compromise)
- p(LANG=xhosa, SENT=x) = 0.000005
- p(SENT=x): total probability of getting SENT=x one way or another! Add up: 0.000020
- Normalize (divide by a constant so they'll sum to 1): given the evidence SENT=x, the possible languages sum to 1, and the best one is the one with the highest posterior.
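The normalization step above can be sketched directly with the slide's numbers:

```python
# Normalize the joint probabilities from the slide into posteriors.
joint = {"english": 0.000007, "polish": 0.000008, "xhosa": 0.000005}
p_sent = sum(joint.values())                       # p(SENT=x): add up all ways
posterior = {lang: p / p_sent for lang, p in joint.items()}

# Given the evidence SENT=x, the possible languages now sum to 1,
# and polish has the highest posterior (0.000008 / 0.000020 = 0.40).
assert abs(sum(posterior.values()) - 1.0) < 1e-12
print(posterior)
```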
8 Let's try it!
- p(LANG=english, SENT=x) = 0.000007
- p(LANG=polish, SENT=x) = 0.000008  (best compromise)
- p(LANG=xhosa, SENT=x) = 0.000005
- p(SENT=x) = 0.000020: total over all ways of getting x
9 General Case (noisy channel)
- Noisy channel: mess up a into b
- a → b
- Examples: language → text, text → speech, spelled → misspelled, English → French
- Maximize p(A=a | B=b) = p(A=a) p(B=b | A=a) / p(B=b)
  = p(A=a) p(B=b | A=a) / Σ_a′ p(A=a′) p(B=b | A=a′)
10 Language ID
- For language ID we should compare
- p(LANG=english | SENT=x)
- p(LANG=polish | SENT=x)
- p(LANG=xhosa | SENT=x)
- For ease, multiply by p(SENT=x) and compare
- p(LANG=english, SENT=x)
- p(LANG=polish, SENT=x)
- p(LANG=xhosa, SENT=x)
- which we find as follows (we need prior probs!)
- p(LANG=english) p(SENT=x | LANG=english)
- p(LANG=polish) p(SENT=x | LANG=polish)
- p(LANG=xhosa) p(SENT=x | LANG=xhosa)
11 General Case (noisy channel)
- Want most likely A to have generated evidence B
- p(A=a1 | B=b)
- p(A=a2 | B=b)
- p(A=a3 | B=b)
- For ease, multiply by p(B=b) and compare
- p(A=a1, B=b)
- p(A=a2, B=b)
- p(A=a3, B=b)
- which we find as follows (we need prior probs!)
- p(A=a1) p(B=b | A=a1)
- p(A=a2) p(B=b | A=a2)
- p(A=a3) p(B=b | A=a3)
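The general recipe can be sketched as one small decoder; `prior` and `channel` here are placeholders with made-up numbers, to be filled in for a real task:

```python
# Generic noisy-channel decoder sketch: pick the a maximizing p(a) * p(b | a).
def decode(b, prior, channel):
    """Return argmax_a p(A=a) * p(B=b | A=a), plus its posterior p(a | b)."""
    joint = {a: prior[a] * channel[a].get(b, 0.0) for a in prior}
    total = sum(joint.values())        # p(B=b), summed over all a
    best = max(joint, key=joint.get)
    return best, joint[best] / total

# Tiny made-up example: two candidate sources, one observed b.
prior = {"a1": 0.6, "a2": 0.4}
channel = {"a1": {"b": 0.1}, "a2": {"b": 0.3}}
print(decode("b", prior, channel))
```

Multiplying by p(B=b) does not change which a wins, which is why comparing joints is as good as comparing posteriors.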
12 Speech Recognition
- For baby speech recognition we should compare
- p(MEANING=gimme | SOUND=uhh)
- p(MEANING=changeme | SOUND=uhh)
- p(MEANING=loveme | SOUND=uhh)
- For ease, multiply by p(SOUND=uhh) and compare
- p(MEANING=gimme, SOUND=uhh)
- p(MEANING=changeme, SOUND=uhh)
- p(MEANING=loveme, SOUND=uhh)
- which we find as follows (we need prior probs!)
- p(MEAN=gimme) p(SOUND=uhh | MEAN=gimme)
- p(MEAN=changeme) p(SOUND=uhh | MEAN=changeme)
- p(MEAN=loveme) p(SOUND=uhh | MEAN=loveme)
13 Life or Death!
Does Epitaph have hoof-and-mouth disease? He tested positive. Oh no! False positive rate is only 5%.
- p(hoof) = 0.001, so p(¬hoof) = 0.999
- p(positive test | ¬hoof) = 0.05  (false pos)
- p(negative test | hoof) = x ≈ 0  (false neg)
- so p(positive test | hoof) = 1 − x ≈ 1
- What is p(hoof | positive test)?
- Don't panic: still very small! < 1/51 for any x
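The arithmetic can be checked directly; a small sketch with the slide's numbers (the sample values of x are just for illustration):

```python
# p(hoof | positive test) via Bayes, for unknown false-negative rate x.
P_HOOF = 0.001                 # prior p(hoof)
P_POS_GIVEN_NOT = 0.05         # false positive rate, p(positive | no hoof)

def posterior(x):
    """x = false negative rate, p(negative test | hoof)."""
    num = P_HOOF * (1.0 - x)                        # p(hoof) p(positive | hoof)
    den = num + (1.0 - P_HOOF) * P_POS_GIVEN_NOT    # p(positive test)
    return num / den

# The posterior is largest when the test never misses sick animals (x = 0),
# and even then it is only about 0.0196, roughly 1/51.
assert all(posterior(x) <= posterior(0.0) for x in (0.0, 0.25, 0.5, 0.9))
print(posterior(0.0))
```

The prior is so small (0.001) that even a fairly accurate test leaves the posterior tiny: the 5% false positives among the 99.9% healthy animals swamp the true positives.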