Towards Telesophy: Federating All the World's Knowledge

1
Towards Telesophy: Federating All the World's Knowledge
Bruce R. Schatz
CANIS Laboratory, Department of Medical Information Science, Department of Computer Science
University of Illinois at Urbana-Champaign
schatz_at_uiuc.edu, www.canis.uiuc.edu
Google Tech Talk, Mountain View, CA, July 11, 2007
2
Telesophy Session 1985
3
Half-way to Hive-mind
  • Access: Fetching
  • Organization: Searching
  • Analysis: Comparing
  • Synthesis: Combining

4
OUTLINE
  • Point of View
  • Scalable Semantics
  • Concept Navigation
  • Hero experiment

5
Cyberspace Visions
6
THE THIRD WAVE OF NET EVOLUTION
CONCEPTS
OBJECTS
PACKETS
7
Linguistics Levels and Universal Units
  • 1985: Syntax, Files (wholes)
  • 1995: Structure, Records (parts)
  • 2005: Semantics, Concepts (meaning)
  • 2015: Pragmatics, Features (reality)

8
Federation Levels and Functions
  • 1985: Syntax Federation (e.g. Telesophy)
  • -uniform formats, duplicate elimination
  • 1995: Structure Federation (e.g. DeLIver)
  • -uniform markups, tag-value equivalence
  • 2005: Semantics Federation (e.g. BeeSpace)
  • -phrase typing, concept switching
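
The first two federation levels can be illustrated with a minimal sketch. It assumes that "tag-value equivalence" means renaming source-specific tags to shared ones, and that "duplicate elimination" means dropping records whose normalized key fields coincide. The function names, the tag maps, and the sample records are all hypothetical and are not taken from Telesophy or DeLIver.

```python
def normalize(record, tag_map):
    """Rename source-specific tags to shared ones and canonicalize values
    (lowercase, collapsed whitespace) so equivalent records compare equal."""
    return {
        tag_map.get(tag, tag): " ".join(str(value).lower().split())
        for tag, value in record.items()
    }


def federate(sources, tag_maps, key_fields=("title", "year")):
    """Merge records from several sources, keeping the first copy of each
    record as identified by its normalized key fields."""
    seen = set()
    merged = []
    for source_name, records in sources.items():
        for record in records:
            uniform = normalize(record, tag_maps.get(source_name, {}))
            key = tuple(uniform.get(field) for field in key_fields)
            if key not in seen:
                seen.add(key)
                merged.append(uniform)
    return merged


# Hypothetical sample data: two sources using different tag names.
sources = {
    "A": [{"title": "Towards Telesophy", "year": 2007}],
    "B": [{"ti": "towards  telesophy", "yr": "2007"},
          {"ti": "Another Talk", "yr": "1999"}],
}
tag_maps = {"B": {"ti": "title", "yr": "year"}}
merged = federate(sources, tag_maps)
```

Semantic federation (phrase typing, concept switching) is not covered by this sketch, since it needs the concept extraction described in later slides.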

9
SCALABLE SEMANTICS
  • Surface versus Deep Structure
  • Broad Context beats Deep Meaning
  • Parsing from Phrases to Entities
  • Co-occurring from Pairs to Graphs
  • Think Globally, Act Locally

10
LEVELS OF INDEXES
11
Towards Typed Entities
  • Hand Tagged XML (Semantic Web)
  • Domain Dependent DTDs (Entity Types)
  • Machine Tagged with Training Sets
  • Using Phrases and Parts of Speech
  • Names: Persons, Places, Things

12
Functional Phrases
  • <gene> encodes <chemical>
  • Sokolowski and colleagues demonstrated in
    Drosophila melanogaster that the foraging gene
    (for) encodes a cGMP dependent protein kinase
    (PKG).
  • The dg2 gene encodes a cyclic guanosine
    monophosphate (cGMP)- dependent protein kinase
    (PKG).
  • <chemical> affects/causes <behavior>
  • Thus, PKG levels affected food-search behavior.
  • cGMP treatment elevated PKG activity and caused
    foraging behavior.
  • <gene> regulates <behavior>
  • Amfor, an ortholog of the Drosophila for gene, is
    involved in the regulation of age at onset of
    foraging in honey bees.
  • This idea is supported by results for malvolio
    (mvl), which encodes a manganese transporter and
    is involved in regulating Drosophila feeding and
    age at onset of foraging in honey bees.
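
The typed templates above can be sketched as pattern matching over small entity lexicons. This is an illustration only: the lexicons, the verb lists, and the gap limit are assumptions made for the example, and the slides do not say how phrases are actually typed in BeeSpace.

```python
import re

# Hypothetical mini-lexicons built from the example sentences on the slide.
ENTITIES = {
    "gene": ["foraging gene", "dg2 gene", "Amfor", "malvolio"],
    "chemical": ["cGMP dependent protein kinase", "manganese transporter",
                 "cGMP", "PKG"],
    "behavior": ["food-search behavior", "foraging behavior", "foraging"],
}

# (subject type, verbs, object type), following the slide's templates.
TEMPLATES = [
    ("gene", ["encodes"], "chemical"),
    ("chemical", ["affected", "caused"], "behavior"),
]


def alternation(terms):
    """Regex alternation with longer terms first, so that
    'foraging behavior' is preferred over 'foraging'."""
    ordered = sorted(terms, key=len, reverse=True)
    return "|".join(re.escape(term) for term in ordered)


def match_templates(sentence, max_gap=40):
    """Return (subj_type, subj, verb, obj_type, obj) for each template hit.
    At most max_gap characters may separate subject, verb, and object."""
    hits = []
    for subj_type, verbs, obj_type in TEMPLATES:
        pattern = re.compile(
            rf"(?P<subj>{alternation(ENTITIES[subj_type])})"
            rf".{{0,{max_gap}}}?\b(?P<verb>{alternation(verbs)})\b"
            rf".{{0,{max_gap}}}?(?P<obj>{alternation(ENTITIES[obj_type])})"
        )
        for m in pattern.finditer(sentence):
            hits.append((subj_type, m["subj"], m["verb"], obj_type, m["obj"]))
    return hits


s1 = ("The dg2 gene encodes a cyclic guanosine monophosphate (cGMP)- "
      "dependent protein kinase (PKG).")
s2 = "Thus, PKG levels affected food-search behavior."
for sentence in (s1, s2):
    print(match_templates(sentence))
```

A fixed lexicon handles the "easy" systematic names well, but not the idiosyncratic behavior and phenotype phrases that the next slide calls hard. The "regulates" template is left out because the example sentences express it indirectly ("is involved in the regulation of"), which a plain verb list would miss.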

13
Biology Entities
  • Easy (Systematic Names)
  • Organism / Chemical
  • Medium (Some Variations)
  • Gene / Bodypart
  • Hard (Always Idiosyncratic)
  • Behavior / Phenotype

14
Towards Concept Spaces
  • Extract Units automatically from text
  • Compute Context Graphs from units
  • Co-occurrence Frequency pairwise
  • Mutual Information all pairwise links
  • Bandpass Filters and Domain Weights
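
The steps above can be sketched in a few lines, under stated assumptions. Each document is taken to be already reduced to a set of units (for example noun phrases); pointwise mutual information stands in for the slide's "mutual information"; and the "bandpass filter" is read as dropping units whose document frequency is too low or too high. Domain weights are omitted. The function and parameter names are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def concept_graph(docs, min_df=1, max_df_ratio=1.0):
    """Build a co-occurrence graph from documents given as sets of units.

    Returns {(unit_a, unit_b): (cooccurrence_count, pmi)} with unit_a < unit_b.
    """
    n = len(docs)
    # Document frequency of each unit.
    df = Counter(unit for doc in docs for unit in set(doc))
    # Assumed "bandpass" filter: keep units that are neither too rare
    # nor too common across the collection.
    keep = {u for u, c in df.items() if c >= min_df and c / n <= max_df_ratio}

    pair_counts = Counter()
    for doc in docs:
        for a, b in combinations(sorted(set(doc) & keep), 2):
            pair_counts[(a, b)] += 1

    graph = {}
    for (a, b), c in pair_counts.items():
        # PMI = log2( P(a,b) / (P(a) P(b)) ) with probabilities estimated
        # as document fractions, which simplifies to c*n / (df_a * df_b).
        pmi = math.log2(c * n / (df[a] * df[b]))
        graph[(a, b)] = (c, pmi)
    return graph


# Hypothetical toy collection of four documents.
docs = [
    {"foraging gene", "PKG", "foraging behavior"},
    {"foraging gene", "PKG"},
    {"PKG", "foraging behavior"},
    {"honey bee", "foraging behavior"},
]
graph = concept_graph(docs)
filtered = concept_graph(docs, min_df=2)
```

Counting all pairs is quadratic in the number of units per document, which is one reason the larger collections on the following slides needed supercomputer time.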

15
COMPUTING CONCEPTS
  • '92: 4,000 (molecular biology)
  • '93: 40,000 (molecular biology)
  • '95: 400,000 (electrical engineering)
  • '96: 4,000,000 (engineering)
  • '98: 40,000,000 (medicine)
16
Medical Concept Spaces (1998)
  • Medical Literature (Medline, 10M abstracts)
  • Partition with Medical Subject Headings (MeSH)
  • Community is all abstracts classified by core
    term
  • 40M abstracts containing 280M concepts
  • computation is 2 days on NCSA Origin 2000
  • Simulating World of Medical Communities
  • 10K repositories with > 1K abstracts
  • (1K with > 10K)
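
The partitioning step can be sketched as grouping abstracts by subject heading. This assumes that an abstract joins the community of every heading assigned to it, so communities may overlap; the slide does not spell this out. Under that assumption the sizes summed over all communities can exceed the number of distinct abstracts, which would be consistent with 10M Medline abstracts appearing as 40M abstracts across communities. The names and the sample data are hypothetical.

```python
from collections import defaultdict


def partition_by_heading(abstracts, min_size=1):
    """Group abstract ids into communities, one per subject heading.

    abstracts: iterable of (abstract_id, list_of_headings).
    Communities smaller than min_size are dropped.
    """
    communities = defaultdict(set)
    for abstract_id, headings in abstracts:
        for heading in headings:
            communities[heading].add(abstract_id)
    return {h: ids for h, ids in communities.items() if len(ids) >= min_size}


# Hypothetical sample: three abstracts, each tagged with one or two headings.
abstracts = [
    (1, ["Neoplasms", "Genetics"]),
    (2, ["Neoplasms"]),
    (3, ["Genetics", "Behavior"]),
]
communities = partition_by_heading(abstracts)
large_only = partition_by_heading(abstracts, min_size=2)
total_memberships = sum(len(ids) for ids in communities.values())
```

The `min_size` threshold mirrors the slide's size cutoffs (repositories with more than 1K or 10K abstracts), scaled down for the toy data.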

17
Small World Graph
  • Community Structure enables Dynamic Clustering
    with High Coherence

18
CONCEPT NAVIGATION
  • Manual by Humans
  • Interaction: user navigating
  • Classification: collection tagging
  • Automatic by Computers
  • Federation: search bridges
  • Integration: results links

19
Towards the Interspace
  • from Objects to Concepts
  • from Syntax to Semantics
  • Infrastructure is Interaction with Abstraction

Internet is packet transmission across computers
Interspace is concept navigation across repositories
20
Concept Navigation
23
Concept Navigation in BeeSpace
24
BeeSpace General Bioinformatics
  • Bioinformatics of Genes and Behavior
  • Using scalable semantics technology
  • Using General Expressions and Literatures
  • Annotation Pipelines from Sequence and Text
  • Creating and Merging multiple SPACES
  • Where REGIONS are semantically created
  • And useful regions become shared spaces

25
Analysis Environment Functions
  • SPACE is a Paradigm not a Metaphor!
  • Point of View for YOUR Problem
  • Externally
  • -Dynamically describe custom Region of Space
  • -Merge Regions to form Hypothesis Space
  • -Differentially express genes against Space

26
Analysis Environment Structures
  • Concepts and Genes are Universal Entities
  • Uniformly Represented
  • Uniformly Manipulated
  • Internally
  • -Extract and Index Concepts within Collections
  • -Navigate Concepts within Documents
  • -Follow Genes from Documents into Databases

27
BeeSpace Semantic Operations
  • Extract S → R
  • Map R → S
  • Merge (S1,S2) into S3
  • Summarize (S) into Gene classify

28
BeeSpace v3 Example
  • Refining and Merging Space Regions
  • Cross bee species differential gene expression
    for behavioral maturation into adult forager
  • Comparative Analysis for Similar Situation
  • Behavioral Maturation merge into
  • Cross-Species Comparisons

44
Towards the Interspace
  • The Analysis Environment technology is GENERAL!
  • BirdSpace? BeeSpace?
  • PigSpace? CowSpace?
  • BrainSpace? BehaviorSpace?
  • BioSpace
  • Interspace

45
THE DISTRIBUTED WORLD
  • Community Repositories in the Interspace
  • Peer to Peer Networking Infrastructure
  • Every Person performs Every Role

  • USER: request
  • LIBRARIAN: reference
  • INDEXER: classify
  • PUBLISHER: quality
  • AUTHOR: generate
46
ISPACE (Illinois Interspace)
  • TEXT (library, courses)
  • CONTEXT (conversations, relationships)
  • Meta-Analysis to Forge useful Links
  • Google Books plus Rice Connexions
  • GMAIL plus GPHONE
  • Text plus Message plus Voice
  • Internal: Federate plus Integrate
  • External: Science plus Scholarship
  • University Environment via Social Networks

47
Today the Hive, Tomorrow the HiveMind