Semantic Web - PowerPoint PPT Presentation

1 / 68
About This Presentation
Title:

Semantic Web

Description:

'book', 'publication', 'greyhound', 'dog' And their relationships 'book is-a-kind-of publication' 'greyhound is-a-kind-of dog' Dublin Core. One well-known ontology ... – PowerPoint PPT presentation

Number of Views:25
Avg rating:3.0/5.0
Slides: 69
Provided by: senaar
Category:
Tags: semantic | web

less

Transcript and Presenter's Notes

Title: Semantic Web


1
Semantic Web
2
What is it?
  • Many things to many people...

3
Semantic Web
  • Web of named relationships amongst named objects
    (Tim-Berners Lee).

Researcher
W3C Activity
is a
Budak Arpinar
Semantic Web
is a
working on
4
Current Web
  • Hypertext a set of nodes and links.

5
Not machine-readable
  • There is very little machine-readable information
    there.
  • The meaning of the documents is clear to those
    with a grasp of (normally) English, and the
    significance of the links is only evident from
    the context around the link.

6
Current Web
  • Current Web represents information using
  • Natural language (e.g., English)
  • Graphics, multimedia
  • Page layout
  • Okay for human understanding
  • Difficult for machine processing

7
Analogy
  • "Stay off the couch now, Ginger! You hear me?
    Ginger, stay off of the couch!"
  • What Dogs Understand
  • "Blah blah blah blah blah GINGER blah blah blah
    GINGER blah blah blah blah blah"
  • Semantic Web Dog might understand
  • "Blah blah COUCH blah GINGER blah blah blah
    GINGER blah blah blah COUCH"

8
What we say to computers?
  • "Stay off the couch now, Ginger! You hear me?
    Ginger, stay off of the couch!"
  • What Computers Understand
  • "Blah blah blah blah blah blah ltA HREF...gt blah
    blah blah . . . ."

9
Enabling machine processing
  • Two approaches
  • Smarter machines
  • Smarter data

10
Appr. 1 Smarter machines
  • Teach computers to understand the meaning of Web
    data
  • Natural language processing
  • Image recognition
  • Etc.
  • The Artificial Intelligence (AI) approach

11
Smarter machines
  • Not the Semantic Web approach

12
Approach 2 Smarter data
  • Make data easier for machines to understand
  • Express meaning in a machine processable format
  • Example metadata
  • The Semantic Web approach

13
Smarter data
  • The Semantic Web approach

14
The Current Web
  • Minimal machine-processable information -- dumb
    links

15
The Semantic Web - An extension of the current Web
  • More machine-processable information

16
How Google works?
  • Links into page determine importance
  • "Importance" is cumulative
  • Links are machine processable
  • Links have (Minimal) semantics
  • Amazing results from minimal semantics

17
Why is machine processing difficult?
  • Identifying the key problems
  • Ambiguity
  • Complexity of information formats
  • Solving the ambiguity problem
  • URIs
  • Ontologies

18
Problem 1 Ambiguity
  • Budak Arpinar owns VIN 2775534."
  • Which Budak Arpinar"?
  • Vehicle 2775534?
  • Vinyl siding order 2775534?
  • Need to identify things
  • Unambiguously, in a
  • Uniform
  • Web-friendly way

19
Kinds of things to identify
  • Three kinds of things in the universe
  • Web resources
  • Non-Web resources
  • Physical objects
  • Cars, people, houses, etc.
  • Abstract concepts
  • Sizes, colors, verbs, "love", etc.
  • "Creator" (e.g., the creator of a document)
  • "Location"
  • "Airline reservation"
  • "Airline reservation service"

20
Unambiguously identifying Web resources
  • Solution (trivial) URLs
  • http//www.example.org/index.html

21
Unambiguously identifying physical objects
  • Many human systems
  • Vehicle Identification Numbers (VIN)
  • Product serial numbers
  • UPC product codes
  • Employee numbers
  • Etc.
  • Problems
  • Too many formats
  • Most are not global in scope
  • Solution Convert to URIs
  • http//www.example.com/employeeid/85740

22
Unambiguously identifying abstract concepts
  • Solution Use URIs
  • Problem Which URIs?
  • Need to agree on common vocabulary
  • Solution Ontology

23
Ontology
  • "Formal description of concepts and their
    relationships"
  • In other words
  • Vocabulary of terms
  • "book", "publication", "greyhound", "dog"
  • And their relationships
  • "book is-a-kind-of publication"
  • "greyhound is-a-kind-of dog"

24
Dublin Core
  • One well-known ontology
  • Defines 14(?) basic terms for documents and
    publishing
  • "title", "creator", "subject", "publisher" 
  • Each term unambiguously identified by URI
  • http//purl.org/dc/elements/1.1/creator

25
One global ontology?
  • No.  Not realistic.
  • Multiple ontologies will co-exist
  • Often specialized for problem domain
  • But
  • Can be merged later
  • "Popularity contest"

26
Example of unambiguous identification
  • To say "Web page foo.html  was created by  John
    Smith"
  • Need to unambiguously identify 3 things
  • Web pagehttp//www.example.org/foo.html
  • "was created by"http//purl.org/dc/elements/1.1/
    creator
  • "John Smith"http//www.example.org/staffid/85740

27
Entities and relations
  • Documents describe real objects and imaginary
    concepts, and give particular relationships
    between them.
  • A document might describe a person.
  • The title document to a house describes a house
    and also the ownership relation with a person.
  • A program could search for a house and negotiate
    transfer of ownership of the house to a new
    owner.

28
Semantic Web goals
  • Realizing the full potential of the Web
  • Making it cost-effective for people to
    effectively record their knowledge.
  • Focus on machine consumption.
  • The Semantic Web is an extension of the current
    web in which information is given well-defined
    meaning, better enabling computers and people to
    work in cooperation.
  • Ultimate goal - effective and efficient global
    knowledge exchange.

29
Semantic Web goals
  • Ultimate goal - effective and efficient global
    knowledge exchange.
  • Allow you to find, share, and combine information
    more easily

30
Complexity of information formats
  • Web pages use complex information formats
  • English grammar, page layout, etc.
  • Easy for human to parse / understand
  • Hard for machine to parse / "understand"
  • Example "Time flies like an arrow"
  • How to parse?
  • Which is Subject?  Verb?  Object?
  • Need a common, machine-processable information
    format

31
Important characteristics for a
machine-processable format
  • Scalable (the whole Web!)
  • General
  • Allow any info to be expressed
  • Extremely flexible
  • Allow new data to be added
  • From any source
  • Without breaking existing data/systems
  • Allow any kind of query
  • Easily combine/join data in new ways
  • Solution RDF

32
Enabling standard RDF
  • RDF Resource Description Framework
  • Resources things that can be named with URIs
  • Description statements about the properties of
    these resources
  • RDF aims to build a Web of overlapping metadata
    vocabularies
  • Use URIs to define metadata vocabularies
  • Build graphs using these vocabularies to say
    things

33
RDF
  • W3C Recommendation
  • Language for making statements about things
  • Primarily for metadata
  • Author, title, subject, date-of-last-access
  • Can be used for any kind of statements
  • Has XML syntax "RDF/XML"

34
RDF Triples
  • All info expressed as triples
  • ltsubjectgt ltverbgt ltobjectgt
  • ltsubjectgt ltpropertygt ltvaluegt

35
Example triple
  • (Not RDF/XML syntax)
  • http//www.example.org/foo.htm (Subject)  
  • http//purl.org/dc/elements/1.1/creator
    (Verb/Property)    
  • http//www.example.org/staffid/85740
    (Object/Value)
  • Meaning "Web page foo.html  was created by  John
    Smith"

36
Another example
  • ltrdfRDF
  • xmlnsrdf"http//www.w3.org/1999/02/22-rdf-syn
    tax-ns"
  • xmlnslove"http//love.example.org/terms/" gt
  • ltrdfDescription rdfabout"http//aaronsw.co
    m/"gt
  • ltlovereallyLikes rdfresource"http//www.
    w3.org/
  • People/Berners-Lee/Weaving/" /gt
  • lt/rdfDescriptiongt
  • lt/rdfRDFgt

Difficult to create by humans
37
Joining triples to create a graph
  • Triples can be viewed as links in a graph
  • Equivalent of "joining" in relational database
  • Joining is automatic in RDF, because
  • Nodes are URIs (unambiguous)

38
Joining triples to create a graph
39
Joining data from multiple sources
  • Trivial Same URI gt same node.
  • How about extracted data?

40
Point vs. general solutions
  • Any specific problem can be solved by a point
    solution 
  • Many conceptually similar problems, different in
    details
  • Approach doesn't scale well
  • NN solutions required?
  • Inflexible Point solutions don't facilitate new
    uses
  • Conclusion Need general solution

41
Application Integration XML Versus RDF
NN complexity
N1 complexity
42
What information could be machine processable?
Ideally All Web data.  (Not realistic)
"RDF/mappable" RDF or RDF-mappable
43
Semantic Web building blocks
44
Schemas and ontologies
  • Any system hard-coded to understand certain terms
    will likely to go out of date
  • New terms can be invented and defined
  • Rate books on a scale 1-10 instead of just saying
    someone reallyLikes them.
  • Schemas and ontologies help computer systems to
    use terms more easily and decide how to convert
    between them.
  • RDF Schema and DARPA Agent Markup Language with
    Ontology Inference Layer (DAMLOIL)

45
Example
  • dccreator rdfssubClassOf dccontributor.
  • Creators and contributors of various documents
  • Old way lthttpgt is dccreator of lthttpgt
  • New way lthttpgt edhasAuthor lthttpgt
  • Bridge the gap dccreator damlinverse
    edhasAuthor.

46
Semantic Web future
47
Logic and proofs
  • Current semantic Web research
  • Good systems can understand basic concepts
    (subclass, inverse etc.)
  • Better if we could state any logical principles
    we wanted to.
  • Logical statements (rules) that allow the
    computer to make inferences and deductions.

48
Logic
  • I am an employee of MemberCo.
  • MemberCo is a member of W3C.
  • MemberCo has GET access to http//www.w3.org/Membe
    r/.
  • I (therefor) have access to http//www.w3.org/Memb
    er/.

49
Example (deduction)
  • If someone sell more than 100 products then they
    are a member of Super Salesman club.
  • John sold 102 things therefore John is a member
    of the Super Salesman club.
  • More complex rules and inference engines
    explored.

50
Proof
  • Different people can write logic statements.
  • Machines can follow semantic links to prove facts
  • Prove John is a Super Salesman
  • Sales John sold 55 widgets 47 sprockets
  • Widgets sprockets company products
  • 55 47 102
  • 102 gt 100
  • Super Salesman rule
  • Proved John is a Super Salesman
  • A Web of information processors (e.g. P2P)

51
Proof
  • MemberCo's document employList lists me as an
    employee.
  • W3C'c member list includes MemberCo.
  • The ACLs for http//www.w3.org/Member/ assert
    that employees of members have GET access.

52
Information processors
53
Trust
  • Useless if anyone can say whatever they want
  • Digital signatures provide proof that a certain
    person wrote (or agrees with) a document or
    statement
  • Digitally sign all RDF statements
  • Tell programs whom to trust

54
Trust
  • MemberCo's document employList is signed by a
    private key that W3C trusts to make such
    assertions.
  • W3C'c member list is trusted by the access
    control mechanism.
  • The ACLs for http//www.w3.org/Member/ were set
    by an agent trusted by the access control
    mechanism.

55
Web of trust
  • I trust my best friend Robert
  • Robert trusts quite a number of people, and so
    on
  • Robert can trust Wendy a whole lot, but Sally
    only a little

56
Ontology-based trust policies
Ms. Ys trust policy trust(msy, Person,
Information) - about (Information, south asia
based terrorist groups or their political
sympathizers), says(Person, Information),
person(Person), name(Person, Jim Hoagland),
affiliation(Person, Washington
Post). trust(msy, Person, Information) - about
(Information, south asia based terrorist groups
or their political sympathizers), says(Person,
Information), person(Person), says(Person1,
expert(Person, Information)), name(Person1, Jim
Hoagland), affiliation(Person1, Washington
Post). Mr. Xs trust policy trust(mrx, Person,
Information) - trust(msy, Person, Information).
57
Layers of semantic Web
58
More information
  • Semantic Web Home Page http//www.w3.org/2001/sw/
  • RDF Home Page http//www.w3.org/rdf/

59
Reading
  • The Semantic Web A new form of Web content that
    is meaningful to computers will unleash a
    revolution of new possibilities, By Tim
    Berners-Lee, James Hendler and Ora Lassila
  • May 2001 issue

60
Web searches today
WEB SEARCHES TODAY typically turn up innumerable
completely irrelevant "hits," requiring much
manual filtering by the user. If you search using
the keyword "cook," for example, the computer has
no way of knowing whether you are looking for a
chef, information about how to cook something, or
simply a place, person, business or some other
entity with "cook" in its name. The problem is
that the word "cook" has no meaning, or semantic
content, to the computer.
61
Intelligent Agents
62
Elaborate, Precise Automated Searches
63
The semantic Web triangle
Software Knowledge Engineering (Software
Components, Agents, Process Modeling)
Libraries of Components, Interoperation for Web
Services
Reasoning, Planning, DAML-S
AI (Knowledge Representation, Ontologies)
DB (Semi-structured data, Interoperability)
Ontology Languages Semi-structured
DataOntology Transformation
64
Research Issues
  • Ontology Development
  • Top-down approach
  • Bottom-up approach
  • Specification and Languages
  • RDF(S), DAMLOIL, OWL
  • Multiple Ontologies
  • Ontology merging
  • Entity disambiguation

65
Research Issues
  • Meta-data Creation
  • Top-down approach Annotation
  • Bottom-approach Extractors
  • Classification and Clustering
  • Logic and Rules
  • Inference engines, RuleML etc.
  • Trust, Provenance, and Reputation

66
Research Issues
  • Algebra and Query Languages
  • RQL etc.
  • System Issues
  • RDF Databases, Jena etc.
  • Applications
  • Knowledge Discovery
  • Semantic Associations, Similarity
  • Collaboration

67
Research Issues
  • Semantic Processes
  • Semantic Web Services
  • Top-down approaches DAML-S etc.
  • Bottom-up approaches METEOR-S etc.
  • Discovery and Composition

68
Book
  • Spinning the Semantic Web Bringing the World
    Wide Web to Its Full Potentialby Dieter Fensel
    (Editor), Wolfgang Wahlster, Henry Lieberman,
    James Hendler, MIT Press (November 15, 2002)
Write a Comment
User Comments (0)
About PowerShow.com