MMSEM background - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

MMSEM background

Description:

MMSEM F2F meeting, Amsterdam, 10/07/2006. 3 ... MMSEM F2F meeting, Amsterdam, 10/07/2006. 13. CIL: 3-D Graphics retrieval based on shape ... – PowerPoint PPT presentation

Number of Views:129
Avg rating:3.0/5.0
Slides: 36
Provided by: Aias
Category:

less

Transcript and Presenter's Notes

Title: MMSEM background


1
MMSEM background
Institute of Informatics Telecommunications NCSR
Demokritos, Athens, Greece
Dr Ioannis Pratikakis
MMSEM F2F meeting Amsterdam, 10 July 2006
2
NCSR Demokritos - Athens, GREECE
  • The largest self-governing research organisation,
    under the supervision of the Greek Government
  • It is composed of the following Institutes
  • Biology
  • Materials Science
  • Microelectronics
  • Informatics Telecommunications
  • Nuclear Technology Radiation Protection
  • Nuclear Physics
  • Radioisotopes Radiodiagnostic Producrs
  • Physical Chemistry

3
Institute of Informatics and Telecommunications
(IIT)
Informatics Section
SKEL Software Knowledge Engineering Laboratory
CIL Computational Intelligence Laboratory
4
SKEL profile
Information Integration
User-friendly information access
Ontology Creation and Maintenance
SKEL researchers aim to develop knowledge
technologies that will enable the efficient,
cost-effective and user-adaptive management and
presentation of information
5
Basic Research
  • Grammar induction
  • Active learning of classifiers
  • Focused crawling
  • Wrapper induction
  • Information extraction
  • Natural language generation
  • Evolving summarization
  • Ontology population and enrichment
  • Web usage mining

6
Applied Research
  • The general-purpose language engineering platform
    Ellogon (http//www.ellogon.org/)
  • Language processing tools and resources
  • The i-DIP platform for developing web content
    collection and extraction systems
  • The QUATRO proxy server, for validating RDF
    labels of web resources
  • The FILTRON e-mail filter, that blocks
    unsolicited commercial e-mail (spam messages)
  • The FilterX Web proxy filter, that blocks obscene
    Web content
  • Tools for creating and maintaining ontologies
  • The PServer general-purpose server for
    personalization
  • The KOINOTHTES system for knowledge discovery
    from web usage data
  • An authoring tool for porting language generation
    systems to new domains and languages

7
CIL profile
Biologically inspired modelling
Neural Networks
Computational Intelligence- Pattern recognition
background
Multimedia Information Processing, Semantic
analysis Retrieval
Bayesian networks
Support Vector Machines
Image
3D Graphics
Video
8
CIL Platform for intelligent information
processing
  • Preprocessing and feature extraction methods
  • Machine learning (neural networks, statistical,
    support vector machines)
  • Novel algorithm development and testing
  • Biologically inspired algorithms and architectures

9
CIL Processing and Recognition of old
manuscripts
Recognition
Feature extraction
10
  • Camera Based Document
  • Analysis Recognition

Page Segmentation
Text Identification in Web images
Table Detection
11
CIL Word spotting-Image based search in early
handwritten and printed documents
12
CIL Content Based Image Retrieval
Query view
Results and relative similarity to the query
13
CIL 3-D Graphics retrieval based on shape
Query 3D Model
First 12 answers
14
CIL Human Tracking
  • Tracker initialisation through
  • Face detection
  • Separation from background
  • Motion field calculation
  • Tracking methods
  • CAMSHIFT
  • Snakes
  • Features to use for tracking
  • Skin color
  • Clothing color - texture

15
CIL Human Behavior Analysis
  • Behavior modeling using
  • Bayesian Networks
  • Hidden Markov Models
  • Application case Violence detection in video

Automatic violence detection
16
BOEMIE Bootstrapping Ontology Evolution with
Multimedia Information Extraction
Institute of Informatics Telecommunications NCSR
Demokritos, Athens, Greece
Dr Ioannis Pratikakis
MMSEM F2F meeting Amsterdam, 10 July 2006
17
Contents
  • Consortium
  • Motivation
  • BOEMIE proposal
  • Application scenario
  • Concluding remarks

18
BOEMIE project
  • Bootstrapping Ontology Evolution with Multimedia
    Information Extraction
  • STRP, IST-2004-2.4.7 Semantic-based Knowledge
    and Content Systems
  • Started 01/03/2006, Duration 36 months
  • Consortium
  • Inst. of Informatics Telecommunications, NCSR
    Demokritos (SKEL CIL), Greece (Coordinator)
  • Fraunhofer Institute for Media Communication
    (NetMedia), Germany
  • Dip. di Informatica e Comunicazione, University
    of Milano (ISLab), Italy
  • Inst. of Telematics and Informatics CERTH (IPL),
    Greece
  • Hamburg University of Technology (STS), Germany
  • Tele Atlas, Belgium

19
Multimedia Content Analysis - I
  • Multimedia content grows with increasing rates
  • Hard to provide semantic indexing of multimedia
    content
  • Significant advances in automatic extraction of
    low-level features from visual content
  • Little progress in the identification of
    high-level semantic features

20
Multimedia Content Analysis - II
  • Inadequate the analysis of single modalities
  • Little progress in the effective combination of
    semantic features from different modalities.
  • Significant effort in producing ontologies for
    semantic webs.
  • Hard to build and maintain domain-specific
    multimedia ontologies.

21
Existing approaches - I
  • Combination of modalities may serve as a
    verification method, a method compensating for
    inaccuracies, or as an additional information
    source
  • Combination methods may be iterated allowing for
    incremental use of context
  • Major open issues in combination concern
  • the efficient utilization of prior knowledge,
  • the specification of open architecture for the
    integration of information from multiple sources,
    and
  • the use of inference tools

22
Existing approaches - II
  • Most of the extraction approaches are based on
    machine learning methods
  • With the advent of promising methodologies in
    multimedia ontology engineering
  • knowledge-based approaches are expected to gain
    in popularity and
  • be combined with the machine learning methods

23
Existing approaches III
  • Use of Ontologies to drive the information
    extraction process
  • providing high-level semantic information that
    helps disambiguating the labels assigned to MM
    objects
  • Major open issues in building and maintaining MM
    ontologies concern
  • automatic mapping between low level audio-visual
    features and high level domain concepts,
  • automated population and enrichment from
    unconstrained content,
  • employing of ontology coordination techniques
    when multiple ontologies are present

24
Existing approaches - IV
  • Synergy between information extraction and
    ontology learning through a bootstrapping
    process
  • to improve both the conceptual model and the
    extraction system through iterative refinement
  • Applied so far in knowledge acquisition from
    textual content
  • bootstrapping starts with an information
    extraction system that uses a domain ontology, or
  • bootstrapping starts with a seed ontology,
    usually small

25
BOEMIE proposal - I
  • Driven by domain-specific multimedia ontologies,
    BOEMIE systems will be able to identify
    high-level semantic features in image, video,
    audio and text and fuse these features for
    optimal extraction.
  • The ontologies will be continuously populated and
    enriched using the extracted semantic content.
  • This is a bootstrapping process, since the
    enriched ontologies will in turn be used to drive
    the multimedia information extraction system.

26
BOEMIE Proposal - II
27
BOEMIE proposal - III
  • Semantics extraction
  • Emphasis to visual content, from images and
    video, due to its richness and the difficulty of
    extracting useful information.
  • Non-visual content, audio/speech and text, will
    provide supportive evidence, to improve
    extraction precision.
  • Fusing information from multiple media sources is
    needed since
  • no single modality is powerful enough to
    encompass all aspects of the content and identify
    concepts precisely.

28
BOEMIE proposal - IV
  • Multimedia Semantic Model
  • development of a unifying representation, a
    multimedia semantic model to integrate
  • a multimedia ontology which
  • describes the structure of multimedia content
    (content objects, such as a segment in a static
    image, a time window in audio, a video shot,
    ...),
  • describes visual characteristics of content
    objects in terms of low-level features (colour,
    shape, texture, motion, )
  • a domain ontology which contains knowledge about
    the selected application domain, and
  • a geographic ontology which contains additional
    knowledge about the locations to be used

29
BOEMIE proposal V
  • Ontology evolution involves
  • ontology population and enrichment, i.e.,
    addition of concepts, relations, properties and
    instances,
  • coordination of
  • homogeneous ontologies e.g. when more than one
    ontology for the same domain are available, and
  • heterogeneous ontologies, e.g., updating the
    links between a modified domain ontology and a
    multimedia descriptor ontology,
  • maintenance of semantic consistency
  • any of the above changes may generate
    inconsistencies in other parts of the same
    ontology, in the linked ontologies or in the
    annotated content base.

30
Application scenario - I
  • Enrichment of digital maps with semantic
    information
  • Domain sport events in a given area (big cities)
  • Sub-domain initially selected athletics
    (running, jumping and throwing events)
  • Cities will be selected taking into account
    number and frequency of sports events,
    availability of multimedia coverage in English of
    these events, availability of map and landmark
    data for the city
  • BOEMIE will collect multimedia coverage for sport
    events and strive to extract as much knowledge
    from the extracted features as possible, using
    and evolving the corresponding domain ontologies
  • The identified entities and their properties,
    will be linked to geographical locations and
    stored in a content server
  • The user will be provided with immediate access
    to the annotated content

31
Application scenario - II
  • Querying
  • The prototype will perform reasoning using
    knowledge from the domain ontology and
    geographical knowledge to deduce further
    information and answer user queries.
  • The user will be able to perform the following
    queries
  • events in a time frame
  • events of a particular type
  • events at a certain location
  • persons related to events
  • events similar to a given one
  • events at nearby venues
  • points of interest near a venue
  • combinations of the above

32
Application scenario - II
  • Querying an example
  • Find out the location of the venues in which
    Athlete A has participated in a high jump
    competition in the city X.
  • From transcribed radio commentary, the BOEMIE
    system knows that in 2001, the World
    Championships in Athletics were held in city X in
    venue Y. From the geographical data, it knows the
    exact location of venue Y in city X.
  • It has further analyzed a video snippet and
    identified it as a high jump event. From the meta
    data of the video, the system knows its date of
    recording in 2001, and in the audio of this
    snippet, the keywords X and A's name were
    spotted.
  • Therefore, the system can deduce that A has
    indeed participated in a high jump competition in
    city X, namely the World Championships in
    Athletics 2001.
  • As a result, the BOEMIE system presents all used
    multimedia assets as prove for its answer and
    gives the exact location of the venue where the
    World Championship in Athletics took place.

33
Concluding remarks - I
  • BOEMIE work aims to initiate a discussion on the
    problem of knowledge acquisition and the synergy
    of information extraction and ontology evolution
  • Several open issues
  • the role of ontology in fusing information from
    multiple media
  • ways to learn the optimal combination of features
    derived from MM content
  • how existing ontology languages can be extended
    to tackle the requirements of MM content analysis
  • the application of existing ontology learning and
    inference techniques in the context of MM content
  • the application of the coordination task in a new
    context which involves not only homogeneous
    ontologies, but also heterogeneous ones

34
Concluding remarks - II
  • The main measurable objective of BOEMIE
    initiative is to improve significantly the
    performance of existing single-modality
    approaches in terms of scalability and precision.
  • Towards that goal, our aim is to
  • develop a new methodology for extraction and
    evolution, using a rich multimedia semantic
    model, and
  • realize it as an open architecture that will be
    coupled with the appropriate set of tools.

35
BOEMIE Bootstrapping Ontology Evolution with
Multimedia Information Extraction
http//www.boemie.org
THANK YOU !!!
Write a Comment
User Comments (0)
About PowerShow.com