1
Maximum Entropy (ME) Theory and Examples
  • Mihai Rotaru
  • ITSPOKE Presentation

2
Introduction
  • Maximum entropy
  • Estimate a probability distribution given a set
    of constraints
  • Principle
  • Model what is known
  • Assume nothing else
  • (Berger et al., 1996) example
  • Model translation of the word "in" from English
    to French
  • Need to model P(word_French)
  • Constraints (worked out in equations below)
  • 1. Possible translations: dans, en, à, au cours
    de, pendant
  • 2. dans or en used 30% of the time
  • 3. dans or à used 50% of the time
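These constraints written out as equations (a sketch following the Berger et al., 1996 example; the equations are reconstructed here, not copied from the slides):

  \begin{aligned}
  p(\text{dans}) + p(\text{en}) + p(\text{à}) + p(\text{au cours de}) + p(\text{pendant}) &= 1 \\
  p(\text{dans}) + p(\text{en}) &= 3/10 \\
  p(\text{dans}) + p(\text{à}) &= 1/2
  \end{aligned}

With only the first two constraints, the flattest distribution can be read off directly: p(dans) = p(en) = 3/20, and p(à) = p(au cours de) = p(pendant) = 7/30. Once the third constraint is added, the flattest choice is no longer obvious, which is what motivates the iterative procedure on the next slide.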

3
Theory
  • Model what is known (conditions)
  • Feature functions (subspaces)
  • Expected value of feature functions
  • Assume nothing else
  • → Flattest distribution
  • → Distribution with the maximum entropy
  • Entropy
  • Maximizing entropy = minimizing the
    Kullback-Leibler distance from the uniform
    distribution
  • Unique solution guaranteed
  • Generalized Iterative Scaling algorithm
  • Weights (λi) interpretation (model form and GIS
    update sketched below)
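A standard formulation of the conditional ME model, its constraints, and the Generalized Iterative Scaling update (the notation below is the usual textbook one and is assumed here, not taken from the slides):

  p_\lambda(y \mid x) = \frac{1}{Z_\lambda(x)} \exp\Big(\sum_i \lambda_i f_i(x, y)\Big),
  \qquad Z_\lambda(x) = \sum_{y'} \exp\Big(\sum_i \lambda_i f_i(x, y')\Big)

  \text{subject to } E_{p_\lambda}[f_i] = E_{\tilde p}[f_i] \text{ for every feature } f_i

  \text{GIS update, with } C = \max_{x,y} \sum_i f_i(x, y):\qquad
  \lambda_i \leftarrow \lambda_i + \frac{1}{C} \log \frac{E_{\tilde p}[f_i]}{E_{p_\lambda}[f_i]}

Each weight λi can then be read as how strongly feature fi pushes probability toward (λi > 0) or away from (λi < 0) the outcomes it fires on, relative to the flat baseline.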

4
ME in practice
  • Expected values of feature functions
  • Computed from the empirical distribution
  • Conditional form of ME (see the sketch below)
  • What features to use?
  • Feature templates
  • Feature selection algorithms
  • Cutoffs
  • Basic Feature Selection (Berger et al., 1996)
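A minimal Python sketch of the two quantities above: the conditional ME probability p(y | x) and the empirical expected values of the feature functions. All names (p_conditional, feature_funcs, ...) are illustrative, not from any particular ME package.

import math

def p_conditional(x, y, labels, feature_funcs, weights):
    # p(y | x) = exp(sum_i w_i * f_i(x, y)) / Z(x)
    def score(label):
        return sum(w * f(x, label) for f, w in zip(feature_funcs, weights))
    z = sum(math.exp(score(label)) for label in labels)  # normalizing constant Z(x)
    return math.exp(score(y)) / z

def empirical_expectations(data, feature_funcs):
    # E_~p[f_i]: average value of each feature over the (x, y) training pairs
    n = len(data)
    return [sum(f(x, y) for x, y in data) / n for f in feature_funcs]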

5
ME applications
  • Part of Speech (POS) Tagging (Ratnaparkhi, 1996)
  • P(POS tag | context) (feature template sketch
    below)
  • Information sources
  • Word window (4)
  • Word features (prefix, suffix, capitalization)
  • Previous POS tags
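An illustrative feature-template sketch in the spirit of Ratnaparkhi (1996); the window size, exact templates, and cutoffs in the actual paper differ, and all names here are made up for the example.

def pos_features(words, prev_tags, i):
    # Binary features describing word i and its context, keyed by string name.
    w = words[i]
    feats = {
        f"word={w}": 1,
        f"prefix3={w[:3]}": 1,                    # word-internal features
        f"suffix3={w[-3:]}": 1,
        f"capitalized={w[0].isupper()}": 1,
    }
    if i > 0:
        feats[f"prev_word={words[i-1]}"] = 1      # word window
        feats[f"prev_tag={prev_tags[i-1]}"] = 1   # previous POS tags
    if i + 1 < len(words):
        feats[f"next_word={words[i+1]}"] = 1
    return feats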

6
ME applications (continued)
  • Abbreviation expansion (Pakhomov, 2002)
  • Information sources
  • Word window (4)
  • Document title
  • Word Sense Disambiguation (WSD) (Chao & Dyer,
    2002)
  • Information sources
  • Word window (4)
  • Structurally related words (4)
  • Sentence Boundary Detection (Reynar &
    Ratnaparkhi, 1997)
  • Information sources
  • Token features (prefix, suffix, capitalization,
    abbreviation)
  • Word window (2)

7
ME applications (continued)
  • Machine translation
  • Word translation model - P(French word | English
    word)
  • Information sources
  • Word window (6)

8
ME applications (continued)
  • Machine translation (continued)
  • Full ME modeling P(English sentence | French
    sentence) (Och & Ney, 2002)
  • Source-channel model as a simplification of ME
  • Information sources
  • Sentence Length Model
  • Conventional Lexicon
  • Additional English language models

9
Why ME?
  • Advantages
  • Combine multiple knowledge sources
  • Local
  • Word prefix, suffix, capitalization (POS:
    Ratnaparkhi, 1996)
  • Word POS, POS class, suffix (WSD: Chao & Dyer,
    2002)
  • Token prefix, suffix, capitalization,
    abbreviation (Sentence Boundary: Reynar &
    Ratnaparkhi, 1997)
  • Global
  • N-grams (Rosenfeld, 1997)
  • Word window
  • Document title (Pakhomov, 2002)
  • Structurally related words (Chao & Dyer, 2002)
  • Sentence length, conventional lexicon (Och &
    Ney, 2002)
  • Combine dependent knowledge sources

10
Why ME? (continued)
  • Advantages
  • Add additional knowledge sources
  • Implicit smoothing
  • Disadvantages
  • Computational
  • Expected values at each iteration (see the
    sketch below)
  • Normalizing constant
  • Overfitting
  • Feature selection
  • Cutoffs
  • Basic Feature Selection (Berger et al., 1996)
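A sketch of where this cost comes from: the model expectation of every feature needs the normalizing sum over all candidate labels for every training example, repeated at every iteration. (p_conditional is the illustrative function from the earlier sketch; nothing here is from a specific ME implementation.)

def model_expectations(data, labels, feature_funcs, weights):
    # E_p[f_i] under the current model: a sum over every label (i.e. the
    # normalizing constant Z(x)) for every training x, at every iteration.
    n = len(data)
    expectations = [0.0] * len(feature_funcs)
    for x, _ in data:
        for y in labels:
            p = p_conditional(x, y, labels, feature_funcs, weights)
            for i, f in enumerate(feature_funcs):
                expectations[i] += p * f(x, y) / n
    return expectations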

11
References
  • See my comprehensive write-up and reading list
  • http://www.cs.pitt.edu/mrotaru/comp/
  • ME software available on the net
  • YASMET (http://www.fjoch.com/YASMET.html)
  • yasmetFS (http://www.isi.edu/ravichan/YASMET/)
  • OpenNLP MaxEnt (http://maxent.sourceforge.net/)