Chapter1 Introduction to NLP, CL, and Speech Recognition - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

Chapter1 Introduction to NLP, CL, and Speech Recognition

Description:

study what goes into getting computers to perform useful and interesting tasks ... Q-systems and metamorphosis grammars (Colmerauer, 1970, 1975) ... – PowerPoint PPT presentation

Number of Views:130
Avg rating:3.0/5.0
Slides: 36
Provided by: nlpKo
Category:

less

Transcript and Presenter's Notes

Title: Chapter1 Introduction to NLP, CL, and Speech Recognition


1
Chapter1Introduction to NLP, CL, and Speech
Recognition
  • Hae-Chang Rim

2
speech and language processing
  • 1.1 Knowledge in SLP
  • 1.2 Ambiguity
  • 1.3 Models and Algorithms
  • 1.4 Language, Thought and Understanding
  • 1.5 The State of the Art and the near Future
  • 1.6 Brief history

3
What should we study?
  • study what goes into getting computers to perform
    useful and interesting tasks involving human
    languages?
  • Consider HAL, the computer from 2001 A Space
    Odyssey

4
What should we study?
  • Such an artificial agent interacts with humans
    via languages
  • understanding humans via speech recognition and
    natural language understanding
  • communicating with humans via natural language
    generation and speech synthesis
  • replying to humans via information retrieval,
    information extraction, and inference

5
Speech Langue Processing
  • Solving these language-related problems,
  • Natural Language Processing
  • Computational Linguistics speech language
  • Speech Recognition Synthesis processing

6
Whats needed?
  • categories of linguistic knowledge in SLP
  • phonetics(???) phonology(???) production of
    speech sounds, patterns/rules of sounds
    (phonemes)
  • morphology(???) shape of word/morpheme,
    meaningful components of words and behavior of
    words in contexts
  • syntax(???) properly order and group words
    together to make phrases, clauses, and sentences
    (structural relationships between words)

7
Whats needed?
  • categories of linguistic knowledge in SLP(cont.)
  • semantics(???) lexical semantics(the meaning of
    the component words), compositional semantics(how
    the components combine to form larger meanings)
  • pragmatics(???) appropriate use of language, in
    terms of their context of use (background
    knowledge, beliefs of speaker and hearer,
    relevant answer), how language is used to
    accomplish goals
  • discourse(??) structured conversation, the
    study of linguistic units larger than a single
    utterance

8
What Should You Care?
  • all tasks in SLP can be viewed as resolving
    ambiguity at one of six levels

9
Ambiguity
  • Consider the spoken sentence I made her duck.
  • five interpretations
  • (1.1) I cooked waterfowl for her.
  • (1.2) I cooked waterfowl belonging to her.
  • (1.3) I created the (plaster?) duck she owns.
  • (1.4) I caused her to quickly lower her head or
    body.
  • (1.5) I waved my magic wand and turned her into
    undifferentiated waterfowl.

10
Ambiguity
  • Ambiguities of I made her duck
  • duck verb, noun (morphologically ambiguous) ?
    POS tagging
  • her dative pronoun, a possessive pronoun
    (morphologically or syntactically ambiguous) ?
    syntactic disambiguation
  • make create, cook (semantically ambiguous) ?
    word sense disambiguation
  • make taking a single object (transitive), taking
    two objects (ditransitive) (syntactically
    ambiguous) ?syntactic disambiguation

11
Resolving Ambiguities
  • Lexical disambiguation
  • Part-of-speech tagging
  • Word sense disambiguation
  • Syntactic disambiguation
  • E.g. probabilistic parsing
  • Speech act interpretation
  • Given sentence is statement or a question

12
Models and Algorithms
  • models the formalisms that are used to capture
    the various kinds of linguistic facts (knowledge)
    we need
  • State machines, formal rule systems, logic, etc.
  • Algorithms used to search or manipulate input
    representations to create the structures that are
    needed
  • Depth first search, best-first search, etc.

13
Models in SLP
  • State Machines formal models that consist of
    states, transitions among states and an input
    representations
  • deterministic/non-deterministic FSA, FST,
    weighted automata, (hidden) markov models
  • Formal rule systems
  • regular grammar, regular relations, context-free
    grammars, feature-augmented grammars, and their
    probabilistic variants
  • algorithms associated with both state-machines
    and formal rule systems
  • search algorithm In most problems, the input
    spaces are normally too large to exhaustively
    explore, depth-first search, best-first, A
  • dynamic algorithm redundant computations are
    avoided

14
Models in SLP
  • Logical formalisms
  • first-order logic (predicate-calculus),
    feature-structures, semantic networks,
    conceptual-dependency
  • Probability theory to solve the many kinds of
    ambiguity problems (choose the most probable one)
  • Each of the other models (state-machines, formal
    rule systems, and logic) can be augmented with
    probabilities
  • Machine learning tools focus on ways to
    automatically learn the various representations
    automata, rule systems, search heuristics,
    classifiers
  • trained on large corpora

15
language, thought and understanding
  • SLP has an AI-ish flavor
  • the effective use of language is intertwined with
    our general cognitive abilities
  • machine and think
  • Turing test (1950) .en empirical test in which a
    computers use of language would form the basis
    for determining if it could think
  • ELIZA program(1966) early natural language
    processing system capable of carrying on a
    limited form of conversation with a user, make
    use of simple pattern-matching to mimic a
    psychotherapist

16
Turing Test
INTERFACE CONTROLLED BY JUDGE
INTELLIGENT SUBJECT
HUMAN
JUDGE
QUESTION
ANSWER
QUESTION
HUMAN
ANSWER
QUESTION
MACHINE
ANSWER
  • The goal of the machine is to fool the judge into
    believing that it is the person.
  • If the machine succeeds at this, then we will
    conclude that the machine can think.

17
The state of the art
  • recent commercialization of robust speech
    recognition systems and the rise of the Web
  • SLP in spotlight a plethora of exciting
    possible applications
  • current applications
  • METEO project broadcast weather reports in
    English and French (Chandioux, 1976)
  • Babel Fish translation system from Systran
    operating on Alta Vista search engine
  • VOYAGER system spoken language interface system
    can answer a number of different types of
    questions concerning navigation within a city, as
    well as provide certain information about
    hotels, restaurants, libraries (Zue et al., 1991)

18
The state of the art
  • current applications (cont.)
  • IEA system scoring written essays by computer
    (Landauer et al., 1997)
  • project LISTENs Reading Tutor helps children
    learn to read, uses speech recognition to listen
    to them read and responds with spoken and
    graphical feedback (Mostow and Aist 1999).
  • VITRA system (visual translator) watch a short
    video clip of a soccer match and provide a
    natural language report (integrating vision
    processing and natural language processing)
    (Wahlster 1989)
  • intelligent communication aids for people with
    disabilities (Newell et al., 1998 McCoy et al.,
    1998)

19
Some brief history
  • SLP is interdisciplinary.., has different
    historical threads
  • computational linguistics in linguistics,
  • natural language processing in computer science,
  • speech recognition in electrical engineering,
  • computational psycholinguistics in psychology.

20
Some brief history
  • Foundational insights(1940s and 1950s)
    intensive work on two paradigms the automaton
    and probabilistic or information-theoretic models
  • automaton (Turing 1936), McCulloch-Pitts neuron
    (McCulloch and Pitts 1943)
  • probabilistic models of discrete Markov processes
    to automata for language (Shannon 1948)
  • finite-state grammar (Chomsky 1956)
  • noisy channel, decoding, entropy (Shannon)
  • first a statistical machine speech recognizer
    that recognize any of the 10 digits from a single
    speaker (Bell Labs, Davis et al., 1952)

21
Some brief history
  • 19571970 two paradigms symbolic and
    stochastic
  • symbolic paradigm
  • took off from two lines of linguistic research
    the work of Chomsky, work on formal language
    theory and generative syntax
  • many works on parsing top-down, bottom-up,
    dynamic programming, e.g. Harriss parser (1962)
  • AI-related works (reasoning and logic,
    knowledge-representation, general problem solver)
    John McCathy, Marvin Minsky, Claude Shannon,
    Newell

22
Some brief history
  • 19571970 two paradigms symbolic and
    stochastic (cont.)
  • stochastic paradigm
  • took hold mainly in statistics and electrical
    engineering
  • Bayesian methods were applied to optical
    character recognition and text recognition
    (Browning, 1959 Mosteller and Wallace, 1964)
  • first on-line corpora, one-line dictionary
  • Brown corpus a 1 million word collection of
    samples (Kucera and Francis, 1967 1979 1982)
  • DOC on-line Chinese dialect dictionary

23
Some brief history
  • 1970-1983 Four paradigms (stochastic,
    logic-based, natural language understanding,
    discourse modeling)
  • stochastic paradigm
  • played a huge role in the development of speech
    recognition algorithms, particularly the Hidden
    Markov Model, noisy channel, and decoding
  • SR research group
  • IBMs TJ Watson Research group (Jelinek, Bahl,
    Mercer)
  • CMU group (Baker)
  • ATT Bell Lab. (Rabiner and Juang)

24
Some brief history
  • 1970-1983 Four paradigms (cont.)
  • logic-based paradigm
  • Q-systems and metamorphosis grammars (Colmerauer,
    1970, 1975)
  • Definite Clause Grammars (Pereira and Warren,
    1980)
  • Functional grammar (Kay, 1979)
  • LFG and feature structure unification (Bresnan
    and Kaplan, 1982)

25
Some brief history
  • 1970-1983 Four paradigms (cont.)
  • natural language understanding paradigm
  • SHRDLU system which simulated a robot embedded in
    a world of toy blocks by accepting natural
    language text commands (Winograd, 1972)
  • Conceptual knowledge representation researches
    such as scripts, plans, goals, and human memory
    organization (Schank and his colleagues, 1972,
    1975, 1979)
  • Network based semantics (Quillian, 1968
    Rumelhart, 1975 Fillmore, 1968 Simmons, 1973)
  • LUNAR QA system (Woods, 1973)

26
Some brief history
  • 1970-1982 Four paradigms (cont.)
  • discourse modeling paradigm
  • focused on four key areas in discourse
  • study of substructure in discourse (Groz, 1977)
  • study of discourse focus (Sidner, 1983)
  • study of automatic reference resolution (Hobbs,
    1978)
  • study of BDI (belief-desire-intention) framework
    and speech acts (Perrault and Allen, 1980 Cohen
    and Perrault 1979)

27
Some brief history
  • 1983-1993 Empiricism and Finite State Models
    Redux
  • Finite State Models
  • finite-state phonology and morphology (Kaplan and
    Kay, 1981)
  • finite-state models of syntax (Church, 1980)
  • Return of empiricism
  • the rise of probabilistic models throughout
    speech and language processing
  • probabilistic methods and data-driven approaches
    spread into POS tagging, parsing, attachment
    disambiguation
  • connectionist approaches

28
Some brief history
  • 1994-1999 the field comes together
  • probabilistic and data-driven models had become
    quite standard throughout natural language
    processing
  • the increases in the speed and memory of
    computers had allowed commercial exploitation of
    a number of SLP speech recognition, and spelling
    grammar checking
  • SLP algorithms began to be applied to
    Augmentative and Alternative Communication (AAC)
  • the rise of the Web emphasized the need for
    language-based information retrieval and
    information extraction

29
Summary
  • A good way to understand the concerns of SLP
    processing research is to consider what it would
    take to create an intelligent agent like HAL from
    2001 A Space Odyssey.
  • Speech and language technology relies on formal
    models, or representations, of knowledge of
    language at the 6 levels of phonology and
    phonetics, morphology, syntax, semantics,
    pragmatics and discourse

30
Summary
  • The foundations of speech and language technology
    lie in computer science, linguistics,
    mathematics, electrical engineering and
    psychology.
  • The critical connection between language and
    thought has placed speech and language processing
    technology at the center of debate over
    intelligent machines.
  • Revolutionary applications of speech and language
    processing are currently in use around the world.
  • Recent advances in speech recognition and the
    creation of the World-Wide Web will lead to many
    more applications

31
bibliographical and historical notes
  • NLP-related conferences
  • ACL/EACL/NAACL
  • COLING
  • IJCNLP
  • ANLP(Applied Natural Language Processing)
  • EMNLP(Empirical Methods in Natural Language
    Processing
  • IR-related conferences
  • SIGIR
  • AIRS
  • TREC
  • NLP-related journal
  • Computational Linguistics
  • Natural Language Engineering

32
bibliographical and historical notes
  • speech-related conferences
  • ICSLP (International Conference on Spoken
    Language Processing)
  • EUROSPEECH
  • IEEE ICASSP(IEEE International Conference on
    Acoustics, Speech, and Signal Processing)
  • speech-related journal
  • Speech Communication
  • Computer Speech and Language
  • IEEE Transactions on Pattern Analysis and Machine
    Intelligence

33
bibliographical and historical notes
  • AI-related conferences
  • AAAI (American Association for Artificial
    Intelligence)
  • IJCAI (International Joint Conference on
    Artificial Intelligence)
  • AI-related journal
  • Artificial Intelligence
  • Computational Intelligence
  • IEEE Transactions on Intelligent Systems
  • Journal of Artificial Intelligence Research

34
bibliographical and historical notes
  • Cognitive Science-related Workshops
  • DARPA Speech and Natural Language Processing
    Workshop
  • ARPA Workshop on Human Language Technology
  • Cognitive Science-related journal
  • Cognitive Science

35
bibliographical and historical notes
  • Textbooks
  • Foundations of Statistical Language Processing
    (Manning and Schütze, 1999)
  • Statistical Language Learning (Charniak, 1993)
  • Natural Language Understanding (Allen, 1995)
  • Natural Language Processing in Lisp/Prolog
    (Gazdar and Mellish, 1989)
  • Readings in Natural Language Processing (Grosz et
    al., 1986)
Write a Comment
User Comments (0)
About PowerShow.com