Using the NASA Thesaurus to Support the Indexing of Streaming Media

Transcript and Presenter's Notes

1
Using the NASA Thesaurus to Support the Indexing
of Streaming Media
  • Gail Hodge
  • Information International Associates, Inc.
  • Janet Ormes, Patrick Healey
  • NASA Goddard Space Flight Center Library

2
Historic Context
  • The Library has collected and circulated the
    Center's colloquia on audio or video since 1967
  • A catalog of these holdings has been posted on
    the Library's web site since 2001
  • Patrons were required to come to the Library,
    which limited the accessibility of recorded
    colloquia
  • The Streaming Media Center Project began in 2001
    as part of the Library's response to Knowledge
    Management initiatives

3
Introducing the GSFC Media Center
4
Streaming Media
  • Streaming media: video that is encoded for
    delivery across the internet/intranet
  • Encoding: computer processing of video into a
    format for webcasting
  • Webcasting: the act of delivering audio and video
    content across the internet/intranet
  • Content can be delivered live or on demand

5
The Goddard Library Streaming Media Center
  • The Streaming Media Center is now available from
    the Library website (http://library.gsfc.nasa.gov)
  • Can be included in personalized portals
  • Library has collected >350 hours of video
  • >100 hours indexed
  • Currently broadcasting 2 hours daily for the
    Earth Observing Systems Knowledge Management Pilot

6
Access Issues
  • Current Needs
      • Users need to know the overall topic of the
        video
      • Users are more likely to remember the topic,
        presenter, date or series
  • Permanent Access
      • Users are less likely to remember the video's
        metadata
      • Users are more likely to want specific
        information
      • Terminology may change over time

7
Indexing Video Content
  • Video indexing is similar to a back-of-the-book
    index for specific information
  • Entering a keyword leads you to the specific
    location of the subject
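
The back-of-the-book analogy can be sketched as an inverted index over a time-stamped transcript. This is a minimal illustration, not the Media Center's implementation: the `build_index` function, the segment format, and the sample transcript are all assumptions made for the example.

```python
from collections import defaultdict

def build_index(segments):
    """Map each lowercase word to the sorted start times (seconds) where it is spoken."""
    index = defaultdict(list)
    for start_seconds, text in segments:
        # set() so a word repeated inside one segment is recorded once
        for word in set(text.lower().split()):
            index[word].append(start_seconds)
    return {word: sorted(times) for word, times in index.items()}

# Hypothetical transcript segments: (start time in seconds, recognized text)
segments = [
    (0, "welcome to the colloquium"),
    (95, "the aurora on jupiter"),
    (610, "auroral emissions and the aurora oval"),
]

index = build_index(segments)
print(index["aurora"])  # [95, 610]
```

Looking up a keyword returns the offsets at which playback could start, much as an index entry returns page numbers.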

8
Features of Selected Software
  • Compares recognized speech with stored default
    terminology
  • Uses speaker inflection to identify meaningful
    intervals
  • Indexing and Search components included
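
One way a comparison between recognized speech and a stored term list could work is approximate string matching. The sketch below uses Python's standard `difflib` module. It is an assumption for illustration only, not a description of the selected software, and the vocabulary, the misrecognized word, and the `match_terms` helper are invented.

```python
import difflib

def match_terms(recognized_words, vocabulary, cutoff=0.8):
    """Return (position, recognized word, matched term) for words close to a vocabulary term."""
    matches = []
    for position, word in enumerate(recognized_words):
        close = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=cutoff)
        if close:
            matches.append((position, word, close[0]))
    return matches

# Hypothetical stored terminology and speech-recognizer output
vocabulary = ["aurora", "jupiter", "magnetosphere"]
recognized = ["the", "aurorra", "of", "jupiter"]

matches = match_terms(recognized, vocabulary)
print(matches)  # [(1, 'aurorra', 'aurora'), (3, 'jupiter', 'jupiter')]
```

A higher `cutoff` admits fewer near-misses, trading recall for precision.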

9
Incorporation of NASA Thesaurus
  • Added specific scientific terminology
  • Incorporated terms and their narrower-term (NT),
    related-term (RT) and UF/USE (used for / use)
    relationships
  • Used text from the Astrophysics Data System to
    provide terms in grammatical structures
  • Provides query expansion and improves relevancy
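
The relationships named above could be held in a small record per term, as in the following sketch. The `ThesaurusEntry` class and `build_use_map` helper are hypothetical, and the sample entries are illustrative rather than copied from the NASA Thesaurus. In particular, which of the two satellite names is the preferred term is an assumption.

```python
from dataclasses import dataclass, field

@dataclass
class ThesaurusEntry:
    term: str                                      # preferred term
    narrower: list = field(default_factory=list)   # NT
    related: list = field(default_factory=list)    # RT
    used_for: list = field(default_factory=list)   # UF (non-preferred synonyms)

def build_use_map(entries):
    """Invert the UF lists into a USE lookup: any known form -> preferred term."""
    use_map = {}
    for entry in entries:
        use_map[entry.term.lower()] = entry.term
        for variant in entry.used_for:
            use_map[variant.lower()] = entry.term
    return use_map

# Illustrative entries only (assumed, not verified against the thesaurus)
entries = [
    ThesaurusEntry("SCATHA satellite", used_for=["P78-2 satellite"]),
    ThesaurusEntry("auroras", related=["magnetospheres"]),
]

use_map = build_use_map(entries)
print(use_map["p78-2 satellite"])  # SCATHA satellite
```

Normalizing both spoken and typed terms through such a map is one way to let a search term and a synonym meet at the same index entry.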

10
Query Expansion
  • Saturn Moons
      • Ios
      • Triton
  • Or
  • Scatha Satellite
      • P78-2 Satellite
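
Query expansion of this kind can be sketched as a lookup that appends a term's synonyms, narrower terms and related terms to the original query. The dictionary layout and the `expand_query` function are assumptions for illustration. The satellite pair comes from the slide, while the "aurora" relationships are invented placeholders.

```python
def expand_query(query, thesaurus):
    """Return the query plus its UF, NT and RT terms, without duplicates."""
    key = query.lower()
    entry = thesaurus.get(key, {})
    expanded = [key]
    for relation in ("UF", "NT", "RT"):
        for term in entry.get(relation, []):
            if term not in expanded:
                expanded.append(term)
    return expanded

# Toy thesaurus keyed by lowercase term; relationship types are assumed
thesaurus = {
    "scatha satellite": {"UF": ["p78-2 satellite"]},
    "aurora": {"NT": ["polar aurora"], "RT": ["magnetosphere"]},  # invented
}

print(expand_query("Scatha Satellite", thesaurus))
# ['scatha satellite', 'p78-2 satellite']
```

A term that is absent from the thesaurus is returned unchanged, so the search degrades to a plain keyword lookup.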

11
Query Expansion (Illustrated)
Sample search ("aurora") on the same one-hour lecture,
entitled "Jupiter's Aurora". One file was indexed
using the NASA Thesaurus; the other was indexed
using a more basic scientific word list.
Benefits
  • GREATER overall understanding of relevance
  • MORE relevant content found (2 min vs. 20 sec)
  • Ignores IRRELEVANT content (speech recognition
    error)

12
Relevance Interval Creation
  • Relevance Interval Creation links related
    concepts within media files, which drives
    Relevance Intervals
  • External knowledge from the thesaurus improves
    the accuracy of the creation process because the
    knowledge stated explicitly in the text is
    incomplete
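
One plausible reading of interval creation is that nearby mentions of a concept, including thesaurus-related terms, are merged into a single playable span. The sketch below assumes that reading. The `relevance_intervals` function, the `max_gap` and `padding` parameters, and all timestamps are made up for illustration and do not reproduce the software's algorithm or any measured result.

```python
def relevance_intervals(hit_times, max_gap=30.0, padding=5.0):
    """Merge hit timestamps (seconds) into padded (start, end) intervals.

    Hits separated by at most max_gap seconds fall into the same interval.
    """
    merged = []
    for t in sorted(hit_times):
        if merged and t - merged[-1][1] <= max_gap:
            merged[-1][1] = t          # extend the current interval
        else:
            merged.append([t, t])      # start a new interval
    return [(max(0.0, start - padding), end + padding) for start, end in merged]

# Invented timestamps: exact keyword hits, then hits on related terms
keyword_hits = [100.0, 110.0]
related_hits = [130.0, 150.0, 400.0]

print(relevance_intervals(keyword_hits))
# [(95.0, 115.0)]
print(relevance_intervals(keyword_hits + related_hits))
# [(95.0, 155.0), (395.0, 405.0)]
```

With related terms included, the first interval grows and a second passage surfaces that a keyword-only search would miss.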

13
Relevance Interval (Illustrated)
14
Benefits
  • Identify relevant pieces of content within a
    longer video
  • Stream more relevant, specific information
    intervals to users
  • Minimize manual processing
  • Ultimately improve reuse of information and
    increase opportunities for knowledge sharing