Probabilistic Language Processing - PowerPoint PPT Presentation

1
Probabilistic Language Processing
  • Chapter 23

2
Probabilistic Language Models
  • Goal: define a probability distribution over a set
    of strings
  • Unigram, bigram, n-gram models
  • Count from a corpus, but the counts need smoothing
  • Add-one smoothing
  • Linear interpolation
  • Evaluate with the perplexity measure
  • E.g., segment "segmentwordswithoutspaces" into words with Viterbi
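
The add-one scheme above can be sketched in a few lines (a toy model over a tiny made-up corpus; the function name and data are illustrative, not from the slides):

```python
from collections import Counter

def train_bigram_addone(corpus):
    """Train an add-one (Laplace) smoothed bigram model from tokenized sentences."""
    unigrams, bigrams = Counter(), Counter()
    for sent in corpus:
        toks = ["<s>"] + sent + ["</s>"]
        unigrams.update(toks)
        bigrams.update(zip(toks, toks[1:]))
    vocab = len(unigrams)
    def prob(w_prev, w):
        # Add-one smoothing: every bigram gets one pseudo-count,
        # so unseen bigrams keep nonzero probability.
        return (bigrams[(w_prev, w)] + 1) / (unigrams[w_prev] + vocab)
    return prob

corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
p = train_bigram_addone(corpus)
print(round(p("the", "cat"), 3))  # seen bigram: 0.25
print(round(p("the", "the"), 3))  # unseen bigram gets smoothed mass: 0.125
```

Linear interpolation would mix this bigram estimate with a unigram estimate instead of relying on add-one alone.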

3
PCFGs
  • Rewrite rules have probabilities.
  • The probability of a string is the sum of the
    probabilities of its parse trees.
  • Context-freedom means no lexical constraints.
  • Tends to prefer short sentences (fewer rule applications).
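
The per-tree part of that definition can be sketched directly: a tree's probability is the product of the probabilities of the rules used in it (the grammar and its rule probabilities below are made up for illustration):

```python
# Toy PCFG: rule probabilities for each left-hand side sum to 1 (assumed grammar).
pcfg = {
    ("S", ("NP", "VP")): 1.0,
    ("NP", ("she",)): 0.6,
    ("NP", ("fish",)): 0.4,
    ("VP", ("eats", "NP")): 1.0,
}

def tree_prob(tree):
    """Probability of a parse tree = product of its rules' probabilities.
    A tree is (label, children) for a nonterminal, a plain string for a word."""
    if isinstance(tree, str):
        return 1.0
    label, children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = pcfg[(label, rhs)]
    for c in children:
        p *= tree_prob(c)
    return p

t = ("S", [("NP", ["she"]), ("VP", ["eats", ("NP", ["fish"])])])
print(tree_prob(t))  # 1.0 * 0.6 * 1.0 * 0.4 = 0.24
```

Summing `tree_prob` over all parses of an ambiguous string gives the string's probability.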

4
Learning PCFGs
  • Parsed corpus -- count rule uses in the trees.
  • Unparsed corpus:
  • Rule structure known -- use EM (the inside-outside
    algorithm)
  • Rules unknown -- learn in Chomsky normal form, which is problematic in practice.

5
Information Retrieval
  • Goal (think Google): find docs relevant to the
    user's needs.
  • An IR system has a document collection, a query in some
    language, a set of results, and a presentation of the
    results.
  • Ideally we would parse docs into a knowledge base -- too hard.

6
IR 2
  • Boolean keyword model -- each doc is in or out
  • Problem -- a single bit of relevance
  • Boolean combinations are a bit mysterious to users
  • How to compute P(R = true | D, Q)?
  • Estimate a language model for each doc, compute the
    prob of the query given the model.
  • Can rank documents by the odds P(R = true | D, Q) / P(R = false | D, Q)
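
The query-likelihood ranking can be sketched as follows (a toy unigram model with add-one smoothing; the documents, query, and function name are invented for the demo):

```python
from collections import Counter

def query_likelihood(query, doc, vocab_size):
    """P(query | doc's unigram model), with add-one smoothing so
    unseen query words don't zero out the whole score."""
    counts = Counter(doc)
    p = 1.0
    for w in query:
        p *= (counts[w] + 1) / (len(doc) + vocab_size)
    return p

docs = {
    "d1": "the cat sat on the mat".split(),
    "d2": "dogs chase cats in the park".split(),
}
vocab = len({w for d in docs.values() for w in d})
query = "cat mat".split()
ranked = sorted(docs, key=lambda d: query_likelihood(query, docs[d], vocab),
                reverse=True)
print(ranked)  # d1 contains both query words, so it ranks first
```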

7
IR3
  • For this we need a model of how queries are related
    to docs: bag of words (frequency of words in the doc),
    naïve Bayes.
  • Good example on pp. 842-843.

8
Evaluating IR
  • Precision is the proportion of results that are
    relevant.
  • Recall is the proportion of relevant docs that are in
    the results.
  • ROC curve (there are several varieties): the standard
    is to plot false negatives vs. false positives.
  • More practical for the web: reciprocal rank of the
    first relevant result, or just time to answer.
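
These three measures are simple enough to spell out (doc ids and relevance judgments below are made up):

```python
def precision_recall(results, relevant):
    """Precision: fraction of returned results that are relevant.
    Recall: fraction of relevant docs that were returned."""
    results, relevant = set(results), set(relevant)
    hits = results & relevant
    return len(hits) / len(results), len(hits) / len(relevant)

def reciprocal_rank(results, relevant):
    """1/rank of the first relevant result (0 if none) -- the web-style metric."""
    for rank, doc in enumerate(results, start=1):
        if doc in relevant:
            return 1 / rank
    return 0.0

prec, rec = precision_recall(["d1", "d2", "d3", "d4"], {"d2", "d5", "d6"})
print(prec, round(rec, 3))  # 0.25 0.333
print(reciprocal_rank(["d1", "d2", "d3"], {"d2"}))  # 0.5
```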

9
IR Refinements
  • Case
  • Stems
  • Synonyms
  • Spelling correction
  • Metadata -- keywords

10
IR Presentation
  • Give the list in order of relevance; deal with
    duplicates
  • Cluster results into classes
  • Agglomerative
  • K-means
  • How to describe automatically generated clusters?
    A word list? The title of the centroid doc?
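
The k-means option can be sketched in plain Python (a toy version: 2-d points stand in for document vectors, Euclidean distance stands in for the tf-idf/cosine setup a real system would use, and the init is deterministic for the demo):

```python
def kmeans(vectors, k, iters=10):
    """Plain k-means on dense vectors."""
    centroids = [list(v) for v in vectors[:k]]  # deterministic init for the demo
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        # Assign each vector to its nearest centroid.
        for v in vectors:
            nearest = min(range(k),
                          key=lambda i: sum((a - b) ** 2
                                            for a, b in zip(v, centroids[i])))
            clusters[nearest].append(v)
        # Move each centroid to the mean of its members.
        for i, members in enumerate(clusters):
            if members:
                centroids[i] = [sum(col) / len(members) for col in zip(*members)]
    return clusters

# Two obvious groups of points standing in for doc vectors.
points = [[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 4.9]]
clusters = kmeans(points, 2)
print(sorted(len(c) for c in clusters))  # [2, 2]
```

Agglomerative clustering would instead start with one cluster per doc and repeatedly merge the closest pair.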

11
IR Implementation
  • CSC172!
  • Lexicon with a stop list
  • Inverted index: where each word occurs
  • Match with vectors: the vector of word frequencies dotted
    with the query terms.
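
The lexicon-plus-inverted-index pair can be sketched like this (the stop list and documents are made up for illustration):

```python
from collections import defaultdict

STOP = {"the", "a", "of", "in", "on"}  # toy stop list

def build_index(docs):
    """Inverted index: word -> set of doc ids containing it (stop words dropped)."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            if word not in STOP:
                index[word].add(doc_id)
    return index

docs = {
    "d1": "the cat sat on the mat",
    "d2": "the dog sat in the yard",
}
index = build_index(docs)
print(sorted(index["sat"]))  # ['d1', 'd2']
print(sorted(index["cat"]))  # ['d1']
```

Ranking would then dot each candidate doc's frequency vector with the query terms, but the index is what makes finding the candidates fast.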

12
Information Extraction
  • Goal: create database entries from docs.
  • Emphasis on massive data, speed, stylized
    expressions
  • Regular expression grammars are OK if the text is stylized enough
  • Cascaded finite-state transducers -- stages of
    grouping and structure-finding
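
A single regular-expression stage can be sketched as below (the pattern and sentences are invented; a real cascade would run several such stages, each grouping the output of the last):

```python
import re

# A stylized pattern for merger announcements: "X acquires/buys Y".
MERGER = re.compile(r"(?P<buyer>[A-Z]\w+) (?:acquires|buys) (?P<target>[A-Z]\w+)")

text = "Acme acquires Widgetco for 2 billion. Later, Foo buys Bar."
records = [m.groupdict() for m in MERGER.finditer(text)]
print(records)
# [{'buyer': 'Acme', 'target': 'Widgetco'}, {'buyer': 'Foo', 'target': 'Bar'}]
```

Each dict is a candidate database entry, which is exactly the "docs to database entries" goal above.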

13
Machine Translation Goals
  • Rough translation (e.g., p. 851)
  • Restricted domain (mergers, weather)
  • Pre-edited (Caterpillar or Xerox English)
  • Literary translation -- not yet!
  • Interlingua -- a canonical semantic
    representation like Conceptual Dependency
  • Basic problem: different languages, different categories

14
MT in Practice
  • Transfer -- uses a database of rules for
    translating small units of language
  • Memory-based -- memorize sentence pairs
  • Good diagram on p. 853

15
Statistical MT
  • Bilingual corpus
  • Find the most likely translation given the corpus.
  • argmax_F P(F | E) = argmax_F P(E | F) P(F)
  • P(F) is the language model
  • P(E | F) is the translation model
  • Lots of interesting problems: fertility ("home" vs.
    "a la maison").
  • Horribly drastic simplifications and hacks work
    pretty well!
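
The noisy-channel argmax can be sketched with toy tables (all probabilities below are invented for illustration; a real system would sum over alignments and fertilities instead of looking up whole phrases):

```python
def best_translation(english, candidates, lm, tm):
    """Noisy-channel MT: pick the F maximizing P(E | F) * P(F).
    lm[f] plays the language model P(F); tm[(e, f)] plays the
    translation model P(E | F)."""
    return max(candidates,
               key=lambda f: tm.get((english, f), 0.0) * lm.get(f, 0.0))

# Made-up numbers: "chez soi" translates "home" slightly better,
# but "a la maison" is more probable French, and the product decides.
lm = {"a la maison": 0.02, "chez soi": 0.01}
tm = {("home", "a la maison"): 0.3, ("home", "chez soi"): 0.4}
print(best_translation("home", ["a la maison", "chez soi"], lm, tm))
# a la maison
```

This also shows why both models are needed: the translation model alone would pick the other candidate.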

16
Learning and MT
  • Statistical MT needs a language model, a fertility model,
    a word-choice model, and an offset model.
  • Millions of parameters
  • Counting, estimation, EM.