1
Statistical Language Modelling Part I
Observable Models
  • Simon Lucas

2
Summary
  • Applications
  • The fundamentals
  • Observable v. hidden (latent) models
  • N-gram and scanning n-tuple models
  • Incremental classifiers and LOO optimisation
  • Evaluation methods
  • Results
  • Conclusions and further work

3
Statistical Language Models
  • Compute p(x|M), the probability of a sequence x
    given the model M
  • Java interface:

    public interface LanguageModel {
        public void train(SequenceDataset sd);
        public double p(int[] seq);
    }

4
Sequence Dataset
    public interface SequenceDataset {
        public int nSymbols();
        public int nSequences();
        public int[] getSequence(int i);
    }
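As a concrete illustration (not from the original slides), a minimal array-backed implementation of this interface might look like the following; the class and field names are my own:

    // Hypothetical array-backed SequenceDataset (a sketch, not from the deck).
    public class ArraySequenceDataset implements SequenceDataset {
        private final int[][] seqs;  // each sequence is an array of symbol indices
        private final int nSymbols;  // size of the symbol alphabet

        public ArraySequenceDataset(int[][] seqs, int nSymbols) {
            this.seqs = seqs;
            this.nSymbols = nSymbols;
        }
        public int nSymbols() { return nSymbols; }
        public int nSequences() { return seqs.length; }
        public int[] getSequence(int i) { return seqs[i]; }
    }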

5
Evaluating Language Models
  • Standard
  • Test-set perplexity (see the sketch below)
  • Preferred (by me!)
  • Recognition accuracy
  • Dictionary extrapolation
  • Perplexity assumes all models are playing by the
    same rules
  • The other evaluation methods make no such
    assumption
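For reference, test-set perplexity can be computed directly against the LanguageModel interface above. A minimal sketch, assuming the usual base-2, per-symbol convention (not stated on the slide):

    // Perplexity = 2^(-(1/N) * sum of log2 p(x|M)),
    // where N is the total number of symbols in the test set.
    public static double perplexity(LanguageModel m, SequenceDataset testSet) {
        double logProbSum = 0;
        int n = 0;
        for (int i = 0; i < testSet.nSequences(); i++) {
            int[] seq = testSet.getSequence(i);
            logProbSum += Math.log(m.p(seq)) / Math.log(2);  // log2 p(x|M)
            n += seq.length;
        }
        return Math.pow(2, -logProbSum / n);
    }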

6
Distributed Mode Evaluation
  • Use Algoval evaluation server
  • Currently at http://ace.essex.ac.uk
  • Download the developer pack
  • Configure model or write your own
  • Specify test parameters
  • Run tests
  • View results immediately on web site!

7
Sequence Recognition
  • Given a statistical language model
  • Can easily deploy it for sequence recognition
  • Build a model for each class
  • Assign the pattern to the class with the highest
    posterior
  • Better still, return the vector of posteriors
    for soft recognition (see the sketch below)
  • Interesting to compare these models against
    simple nearest-neighbour classifiers using LD and
    WLD (plain and weighted Levenshtein distance)
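A minimal sketch of this recipe, assuming equal class priors (the normalisation step and all names are my additions):

    // One trained LanguageModel per class; the posterior is the
    // normalised likelihood under the equal-priors assumption.
    public class SequenceRecognizer {
        private final LanguageModel[] classModels;

        public SequenceRecognizer(LanguageModel[] classModels) {
            this.classModels = classModels;
        }

        // Soft recognition: the vector of class posteriors.
        public double[] posteriors(int[] seq) {
            double[] post = new double[classModels.length];
            double sum = 0;
            for (int c = 0; c < classModels.length; c++) {
                post[c] = classModels[c].p(seq);  // likelihood p(x|M_c)
                sum += post[c];
            }
            for (int c = 0; c < post.length; c++) {
                post[c] /= sum;  // normalise so the posteriors sum to 1
            }
            return post;
        }

        // Hard recognition: the class with the highest posterior.
        public int classify(int[] seq) {
            double[] post = posteriors(seq);
            int best = 0;
            for (int c = 1; c < post.length; c++) {
                if (post[c] > post[best]) best = c;
            }
            return best;
        }
    }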

8
App 1: Recognising OCR Chain Codes
9
Results (OLD!)
10
SN-Tuple Method: Current Status for OCR
  • Actively being researched at IBM TJ Watson
  • See Ratzlaff, Proc. ICDAR 2001, pages 18-22 (on
    djvu.com; note NOT dejavu.com!)
  • Concludes that the sn-tuple is a viable method
    for on-line handwriting recognition

11
App 2: Contextual OCR

12
Dictionary Extrapolation
  • Previous slide showed how well we can do with
    noisy images, with the aid of dictionary context
  • BUT suppose the dictionary only has 50% coverage
  • Need a trainable model that can extrapolate from
    the given data
  • How to evaluate such a model?

13
Left-Out Rank Estimate
  • For each word in the dictionary
  • Create a new dictionary with that word left out
  • Create a set of words neighbouring the left-out
    word
  • Get the model to evaluate the likelihood of each
    neighbouring word and of the left-out word
  • Return a rank-based score between 1.0 and 0.0
    (from top to bottom of the list), as sketched
    below
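A sketch of the scoring step for a single left-out word; how the neighbouring words are generated is not specified here, so they are passed in, and the final estimate would average this score over every word in the dictionary:

    // Rank-based score for one left-out word: 1.0 if the model ranks it
    // above every neighbour, 0.0 if it ranks below all of them.
    public static double leftOutRank(LanguageModel model, int[] leftOutWord,
                                     int[][] neighbours) {
        double pLeftOut = model.p(leftOutWord);  // model trained without this word
        int beaten = 0;
        for (int[] neighbour : neighbours) {
            if (model.p(neighbour) < pLeftOut) beaten++;
        }
        return (double) beaten / neighbours.length;
    }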

14
App 3: Human Chromosome Recognition (Banded Images)
15
Example Data: 22 Human Chromosomes
  • Chromosome 10
  • / 1802 3 10 55 19 / AAaBaDdDe
    BbBbAaa
  • / 3843 84 10 55 18 / ABaAaDdDd
    CbAcAaa
  • / 7231 158 10 55 20 / ABaCaAcDd
    CbBdAaa
  • / 787 15 10 55 18 / ABaAaBbDe
    AaAaAaAaa
  • / 2459 60 10 54 19 / ABaBaAaCcCd
    CbAcAaa
  • / 3290 21 10 54 19 / ABaAaBbDc
    BbAcAaa
  • / 5591 122 10 54 17 / AAaAaAaEd
    BbAbAaa
  • Chromosome 15
  • / 1447 5 15 43 10 / AAaDbCd
    AaAba
  • / 2120 32 15 43 11 / BaEcAaCd
    AaAaAba
  • / 2759 16 15 43 9 / AADaAaAc
    AaAca

16
N-gram Recognizers
  • Bigram
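The bigram equation/diagram on the original slide is not reproduced in this transcript. As a stand-in, here is a minimal bigram model behind the LanguageModel interface, scoring a sequence as the product of the conditional probabilities p(x_t | x_{t-1}); the add-one smoothing is my assumption, not from the slide:

    // Sketch: bigram model with add-one smoothing (an assumption).
    public class BigramModel implements LanguageModel {
        private int[][] counts;   // counts[a][b] = times symbol b follows a
        private int[] rowTotals;  // rowTotals[a] = total successors of a
        private int nSymbols;

        public void train(SequenceDataset sd) {
            nSymbols = sd.nSymbols();
            counts = new int[nSymbols][nSymbols];
            rowTotals = new int[nSymbols];
            for (int i = 0; i < sd.nSequences(); i++) {
                int[] seq = sd.getSequence(i);
                for (int t = 1; t < seq.length; t++) {
                    counts[seq[t - 1]][seq[t]]++;
                    rowTotals[seq[t - 1]]++;
                }
            }
        }

        // p(x|M) as a product of smoothed bigram probabilities; the first
        // symbol's unconditional probability is omitted for brevity.
        public double p(int[] seq) {
            double p = 1.0;
            for (int t = 1; t < seq.length; t++) {
                p *= (counts[seq[t - 1]][seq[t]] + 1.0)
                   / (rowTotals[seq[t - 1]] + nSymbols);
            }
            return p;
        }
    }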

17
Leave One Out Error
  • Generally a good estimate of test-set error
  • Especially fast to compute for incremental
    classifiers: O(n)
  • As opposed to O(n²) for non-incremental ones

18
Incremental Classifiers
  • Can learn new patterns on demand without access
    to rest of training set
  • Can forget or unlearn patterns on demand also
  • Incremental: n-gram, n-tuple, nearest neighbour
    (memory- or counting-based methods); see the
    sketch below
  • Non-incremental: MLP, HMM, (SVM?) (latent-variable
    re-estimation methods)
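For a counting model such as the BigramModel sketch above, unlearning is just decrementing the counts, which is what makes leave-one-out O(n): each pattern can be unlearned, tested, and relearned in time independent of the training-set size. The untrain method below is my addition to that sketch:

    // Forget one sequence from the bigram counts (exact inverse of
    // training on it); add to the BigramModel class above.
    public void untrain(int[] seq) {
        for (int t = 1; t < seq.length; t++) {
            counts[seq[t - 1]][seq[t]]--;
            rowTotals[seq[t - 1]]--;
        }
    }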

19
Statistical Model Servers
  • A client/server model for statistical models
  • Each server supports a range of models
  • Each model can have many instances
  • Each instance can be invoked for training or
    estimation
  • Now we can independently evaluate the service,
    not just the model!

20
Results
  • Bioinformatics
  • Dictionary modelling

21
Statistical Language Modelling Part II
  • Ensembles of observable models
  • Latent variable models
  • HMM
  • SCFG
  • Category n-gram
  • Other applications: robot sensors?