Title: Logics for Data and Knowledge Representation
1Logics for Data and KnowledgeRepresentation
- Applications of ClassL Lightweight Ontologies
2Outline
- Ontologies
- Descriptive and classification ontologies
- Real world and classification semantics
- Lightweight Ontologies
- Converting classifications into Lightweight
Ontologies - Applications on Lightweight Ontologies
- Document Classification
- Query-answering
- Semantic Matching
2
3Ontologies
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Ontologies are explicit specifications of
conceptualizations - Gruber, 1993
- They are often thought of as directed graphs
whose nodes represent concepts and whose edges
represent relations between concepts
4Concepts and Relations between them
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- CONCEPT it represents a set of objects or
individuals - EXTENSION the set above is called the concept
extension or the concept interpretation - Concepts are often lexically defined, i.e. they
have natural language names which are used to
describe the concept extensions (e.g. Animal,
Lion, Rome), often with an additional description
(gloss) - RELATION a link from the source concept to the
target concept - The backbone structure of an ontology graph is a
taxonomy in which the relations are is-a,
part-of and instance-of, whereas the
remaining structure of the graph supplies
auxiliary information about the modeled domain
and may include relations like located-in,
eats, ant, etc. They are respectively called
hierarchical (BT/NT) and associative (RT)
relations in Library Science.
5Ontology as a graph a mathematical definition
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- An ontology is an ordered pair
- O ltV, Egt
- V is the set of vertices describing the concepts
- E is the set of edges describing relations
6Tree-like Ontologies
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Take the ontology in the previous slide and
remove those auxiliary relations - we get a tree-like ontology consisting of its
backbone structure with is-a and part-of
relations (), that is an informal lightweight
ontology. - () Notice that in some cases we can obtain more
complex structures like DAGs or even with cycles
7ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Classification vs. Descriptive Ontologies
- Classification ontologies
- They are used to classify things, such as books,
documents, web pages, etc. the purpose is to
provide domain specific terminology and organize
individuals accordingly. Such ontologies usually
take the form of classifications with (BT\NT\RT)
or without explicit relations. - Descriptive ontologies
- They are used to describe a piece of world, such
as the Gene ontology, Industry ontology, etc.
the purpose is to offer an unambiguous
description of the world. Relations are typically
explicit (e.g. is-a) and can be of any kind.
8ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Classification vs. Real World semantics
- Classification ontologies are in classification
semantics - In classification ontologies, the extension of
each concept (label of a node) is the set of
documents about the entities or individual
objects described by the label of the concept.
For example, the extension of the concept animal
is the set of documents about animals of any
kind. - Descriptive ontologies are in real world
semantics - In descriptive ontologies, concepts represent
real world entities. - For example, the extension of the concept animal
is the set of real world animals, which can be
connected via relations of the proper kind.
9ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Classification ontologies
10ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Descriptive ontologies
11Why Lightweight Ontologies?
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- The majority of existing ontologies are simple
taxonomies or classifications, i.e.,
hierarchically organized categories used to
classify resources. - Ontologies with arbitrary relations do exist, but
no intuitive and efficient reasoning techniques
support such ontologies in general. - so we need lightweight ontologies.
12Lightweight Ontologies
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- A (formal) lightweight ontology is a triple
- O ltN,E,Cgt
- where
- N is a finite set of nodes,
- E is a set of edges on N, such that ltN,Egt is a
rooted tree, - C is a finite set of concepts expressed in a
formal language F, such that for any node ni ? N,
there is one and only one concept ci ? C, and, if
ni is the parent node for nj, then cj ? ci. - NOTE lightweight ontologies are in
classification semantics
13Converting tree-like structures into LOs
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- For a descriptive ontology, the backbone taxonomy
of is-a and instance-of is intuitively
coincident with the subsumption (?) relation in
LOs. - NOTE part-of relations correspond to
subsumption only if transitive. For instance the
following chain cannot be translated - handle part-of door part-of school part-of
school system - For a classification ontology, the extension of
each node is the set of documents (books,
websites, etc.) that should be classified under
the node. Therefore, the links has to be
interpreted as subset relations and can be
transformed directly into subsumption in the
target LOs.
14Descriptive and classification ontologies
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- (a) and (b) are two descriptive ontologies. The
corresponding classification ontologies are
obtained by substituting all the relations with
subset. - (a) and (b) can be converted into lightweight
ontologies by substituting the relations into
subsumptions. However, the semantics changes from
real world to classification semantics.
14
15Populated (Lightweight) Ontologies
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- In Information Retrieval, the term classification
is seen as the process of arranging a set of
objects (e.g., documents) into a set of
categories or classes. - A classification ontology is said populated if a
set of objects has been classified under proper
nodes. - Thus a populated (lightweight) ontology includes
(explicit or implicit) instance-of relations
16Example of a Populated Ontology
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Animal
?
?
?
?
Head
Body
Bird
Mammal
?
?
?
Predator
Herbivore
Chicken
Instance-of
?
?
?
Chicken Soup
Instance-of
Goat
Tiger
Cat
How to Raise Chicken
Instance-of
Instance-of
Instance-of
Tom and Jerry
www.protectTiger.org
17Lightweight Ontologies in ClassL TBox
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Subsumption terminologies. Recall
- C is a finite set of concepts expressed in a
formal language F, such that for any node ni?N,
there is one and only one concept ci?C, and, if
ni is the parent node for nj ,then cj ? ci. - Bird ? Animal
- Mammal ? Animal
- Chicken ? Bird
- Cat ? Predator
-
- NOTE a tree-like ontology can be transformed
into a lightweight ontology, but not vice versa.
This is because we loose information during the
translation.
18Populated LOs in ClassL TBox ABox
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- instance-of links are encoded into concept
assertions - Chicken(ChickenSoup)
- Cat(TomAndJerry)
-
- Instances are the elements of the domain, namely
the documents classified in the categories.
19Classifications are
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Easy to use for humans
- Pervasive (Google, Yahoo, Amazon, our PC
directories, email folders, address book, etc.). - Largely used in commercial applications (Google,
Yahoo, eBay, Amazon, BBC, CNN, libraries, etc.). - Have been studied for very long time (e.g.,
Dewey Decimal Classification system - DDC,
Library of Congress Classification system - LCC,
etc.).
20Classification Example Yahoo! Directory
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
21Classification Example Email Folders
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
22Classification Example E-Commerce Category
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
23Label Semantics
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
Level
- Natural language words are often ambiguous
- E.g. Java (an island, a beverage, a programming
language) - When used with other words in a label, improper
senses can be pruned - E.g., Java Language only the 3rd sense of
Java is preserved - We translate node labels into unambiguous
propositions in ClassL in classification
semantics - This can be done by using NLP (Natural Language
Processing) techniques
4
24Link semantics
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Get-specific principle Child nodes in a
classification are always considered in the
context of their parent nodes. As a consequence
they specialize the meaning of the parent nodes. - Subsumption relation (a) the extension of the
child node is a proper subset of the parent node.
The meaning of node 2 is B. - General intersection relation (b) the extension
of the child node is a subset of the parent node.
The meaning of node 2 is C A ? B. - We generalize to (b). The meaning of the node is
what we call the concept at node.
25Concept at node
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
In ClassL C4 Ceurope ? Cpictures ? Citaly
26Document Classification
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Document concept each document d in a
classification is assigned a proposition Cd in
ClassL, build from d in two steps - keywords are retrieved from d by using standard
text mining techniques. - keywords are converted into propositions by using
the methodology discussed above to translate node
labels. - Automatic classification For any given document
d and its concept Cd we classify d in each node
ni such that - ? Cd ? Ci,
- and there is no node nj (j ? i), for which ? Cj ?
Ci and ? Cd ? Cj. -
- In other words we always classify in the node
with the most specific concept.
27Query-answering
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Query-answering on a hierarchy of documents based
on a query q as a set of keywords is defined in
two steps - The ClassL proposition Cq is build from q by
converting qs keywords as said above. - The set of answers (retrieval set) to q is
defined as a set of subsumption checking problems
in ClassL -
- Aq d ? document T ? Cd ? Cq
28Semantic Matching Why?
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Most popular knowledge can be represented as
graphs. The heterogeneity between knowledge
graphs demands the exposition of relations, such
as semantically equivalent. - Some popular situations that can be modeled as a
matching problem are - Concept matching in semantic networks.
- Schema matching in distributed databases.
- Ontology matching (ontology alignment) in the
Semantic Web.
28
29The Matching Problem
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
- Matching Problem given two finite graphs, finds
all nodes in the two graphs that syntactically or
semantically correspond to each other. - Given two graph-like structures (e.g.,
classifications, XML and database schemas,
ontologies), a matching operator produces a
mapping between the nodes of the graphs. - Solution A possible solution Giunchiglia
Shvaiko, 2003, consists in the conversion of the
two graphs in input into lightweight ontologies
and then matching them semantically.
29
30A Matching Problem
ONTOLOGIES LIGHTWEIGHT ONTOLOGIES
CLASSIFICATIONS APPLICATIONS ON LIGHTWEIGHT
ONTOLOGIES
30