Title: Federal Enterprise Architecture Data Reference Model Taxonomies and Ontologies
1Federal Enterprise Architecture Data Reference Model Taxonomies and Ontologies
- Presentation to the NIH Forum on Informatics Solutions, October 7, 2005
- Brand Niemann, U.S. Environmental Protection Agency and Chair, Semantic Interoperability Community of Practice
- http://colab.cim3.net/cgi-bin/wiki.pl?SICoP and http://web-services.gov/
2Introduction
- A Semantics Lesson
- With no context:
- Enterprise Architecture
- Enterprise: A Starship
- Architecture: Blueprints
- So You Might Infer the Relationship: Blueprints of the Starship Enterprise!!!
- With context (and subject matter expertise and up-to-date information as of yesterday):
- Federal Enterprise Architecture
- But not a Real Architecture: Only four Reference Models that are taxonomies
- Really just paper documents that we have fostered ontologies of.
- Evolving it to a Real and Target Architecture
- So I Inferred the Relationship: Data Reference Model that we have been developing collaboratively for the past seven months that has a taxonomy (even an ontology) and is a real target architecture that is implementable.
OMB Chief Architect, Dick Burk, Chief Architect Forum, October 6, 2005.
3Introduction
- Bio-informatics
- You become both the subject matter expert and the computer scientist to build your own information system or collaborate with others to do so.
- The use of Information Technology is moving towards "do your own" with open collaboration with open standards.
- So how can we work efficiently and effectively both locally and globally?
- What is your personal and/or enterprise collaboration architecture?
- I have adopted one (see slides 5-6).
- What is a recent government Best Practice example?
- The Geospatial Intelligence Community! (see slide 7)
4Introduction
- And you should ask me, what is the Data Reference Model Metamodel and how do I learn about it?
- Just in case you did not think to ask, it is coming next (see slide 8).
- And if I had more time I would give you an overview tutorial on how to deploy RDF and OWL.
- See XML 2005 Conference Presentation
- http://web-services.gov/SICoPXML2005.ppt
- See recent Collaboration Expedition Workshop (hands-on)
- http://colab.cim3.net/cgi-bin/wiki.pl?ExpeditionWorkshop/DesigningTheDRM_DataAccessibility_2005_08_16
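As a small taste of what such an RDF tutorial covers, the sketch below models RDF statements as subject-predicate-object triples in plain Python and applies one RDFS-style rule: an instance of a subclass is also an instance of its superclasses. This is only an illustration under stated assumptions; the `ex:` names are hypothetical, no RDF library is used, and a real deployment would rely on an RDF toolkit and published namespaces.

```python
# Minimal sketch: RDF-style triples and one RDFS-style inference rule.
# All "ex:" identifiers are hypothetical examples, not part of the DRM.

TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"

triples = {
    ("ex:ReferenceModel", SUBCLASS_OF, "ex:Taxonomy"),
    ("ex:Taxonomy", SUBCLASS_OF, "ex:KnowledgeOrganizationSystem"),
    ("ex:DRM", TYPE, "ex:ReferenceModel"),
}


def infer_types(graph):
    """Return the graph plus every rdf:type triple implied by rdfs:subClassOf."""
    inferred = set(graph)
    changed = True
    while changed:  # repeat until no new triple can be derived
        changed = False
        for s, p, o in list(inferred):
            if p != TYPE:
                continue
            for s2, p2, o2 in list(inferred):
                if p2 == SUBCLASS_OF and s2 == o:
                    new = (s, TYPE, o2)
                    if new not in inferred:
                        inferred.add(new)
                        changed = True
    return inferred


closure = infer_types(triples)
drm_types = sorted(o for s, p, o in closure if s == "ex:DRM" and p == TYPE)
print(drm_types)
```

The explicit statement says only that `ex:DRM` is a reference model; the other two types are derived, which is the kind of inference a taxonomy alone does not give you.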
5Introduction
Sir Tim Berners-Lee at the SWANS Conference, April 7, 2005, on the constant tension between two approaches: Keep a wise balance. The semantic web allows a mixture of the two approaches, and smooth transitions between them.
6Introduction
Ontologies on Web Servers
[Figure: Web Ontology Language Architecture. Components shown: Standard namespaces (XMLS Datatype, OWL Specification, RDFS Specification, RDF Specification); Ontology Steward's Web Server (OWL ontologies being used/extended); Information Publisher's Web Server (OWL ontology, ontology-specific datatypes, RDF instance data). Relationships shown: namespace references, imports, compliant with.]
Source: Lee Lacy, OWL: Representing Information Using the Web Ontology Language, Trafford, 2005, page 144.
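The import relationships in this architecture can be sketched as a dependency walk: before instance data can be interpreted, the publisher's ontology and everything it imports must be resolved. The following is a minimal sketch in Python, not a description of any particular tool; the URLs and the `IMPORTS` table are hypothetical stand-ins for `owl:imports` statements, and nothing is fetched from the network.

```python
# Minimal sketch of resolving an ontology's transitive imports.
# The URLs below are hypothetical; IMPORTS stands in for owl:imports statements.

IMPORTS = {
    "http://publisher.example/agency.owl": ["http://steward.example/core.owl"],
    "http://steward.example/core.owl": ["http://steward.example/upper.owl"],
    "http://steward.example/upper.owl": [],
}


def import_closure(ontology, imports=IMPORTS):
    """Return every ontology reachable through imports, in discovery order."""
    seen = []
    stack = [ontology]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # guards against import cycles
        seen.append(current)
        stack.extend(imports.get(current, []))
    return seen


needed = import_closure("http://publisher.example/agency.owl")
print(needed)
```

The publisher's ontology names only one import, but the walk also picks up the steward's upper ontology, which is why stewards and publishers can sit on separate web servers.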
7The DRM Implementation Through Iteration and Testing (IT2) Strategy
- FEA Reference Models and Profiles: PRM, BRM, SRM, TRM, Security, Privacy, Records Management, Geospatial, Other, E-Gov Section 207(d)
- Communities of Interest: IC MWG, NIEM, GEOINT, NICS, NARA, State of Pennsylvania, ISO 11179/XMDR, UBL, UDEF, etc.
- Phase 1: Taxonomy/Ontology
- Phase 2: Metadata Interoperability
- Phase 3: Executable Data Interoperability
- Semantic Interoperability / DRM Core / Semantic Interoperability
The Focus of the August 16th DRM Workshop.
8DRM IT2 Education Pilot
Relationships and associations
- Precise definitions of constructs and rules needed for abstraction, generalization, and semantic models.
- Relationships between the data and its metadata.
- Data about the data.
- Facts or figures from which conclusions can be inferred.
Source: Professor Andreas Tolk, August 16, 2005
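One way to make these four definitions concrete is the sketch below. It assumes they describe, in order, the metamodel, the model, the metadata, and the data (the labels that appear on the later slide for this pilot); that mapping is an assumption, and the class names, field names, and sample values are hypothetical.

```python
# Minimal sketch of the data / metadata / model / metamodel layering.
# Class names, field names, and sample values are hypothetical.
from dataclasses import dataclass

# Metamodel: the constructs and rules every metadata record must follow.
REQUIRED_METADATA_FIELDS = ("name", "unit", "source")


@dataclass
class Metadata:
    """Data about the data."""
    name: str
    unit: str
    source: str


@dataclass
class Datum:
    """A fact or figure from which conclusions can be inferred."""
    value: float


# Model: the relationship between each datum and its metadata.
model = [
    (Datum(42.0), Metadata("sample_count", "records", "example survey")),
]


def conforms(metadata):
    """Check a metadata record against the metamodel's required fields."""
    return all(getattr(metadata, f, "") for f in REQUIRED_METADATA_FIELDS)


all_valid = all(conforms(meta) for _, meta in model)
print(all_valid)
```

Keeping the rules in one place (here, `REQUIRED_METADATA_FIELDS`) is what lets the same check run against any agency's records.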
9DRM IT2 Education Pilot
- Use DRM Version 1.5 itself as a pilot project for education and training and use in FEA DRM-related information sharing.
- Use the simple framework in the previous slide to both illustrate and demonstrate the relationships, associations, and searches.
- Show how this addresses the E-Gov Act Section 207(d) requirements and the GSA/OMB RFI questions.
- Efficient and Effective Information Retrieval and Sharing.
10DRM IT2 Education Pilot
- The Emerging Technology Component Break Through Performance Life Cycle of Vivisimo.Com
- A product of Phases I and II of the National Science Foundation's SBIR (Small Business Innovation Research Program).
- A product of the Phase III SBIR from Innovation Works Associated with the NASVF (National Association of Seed and Venture Funds).
- Highly Recommended by the NSF SBIR Program Manager for Our October 20th First Quarterly Conference.
- An Outstanding Presentation and Answers to Questions.
- Sets the Standard for Break Through Performance for eGov
- Sustainable Business Model/Profitable (Vivisimo well over 1 million/year within two years).
- Open Standards/Interoperable/Reusable (e.g. works with FirstGov and supports eGov Act of 2002 need for categorization of government information!)
- Product Commercialization and Procurement (Available through GSA Schedule-SBIR Phase II).
- Publicity (e.g. Washington Post Express, January 6, 2004, "Googles to Come.")
- FirstGov.gov to Partner with Vivisimo, MSN to Build Next-Generation Search Portal (September 23, 2005).
11DRM IT2 Education Pilot
12DRM IT2 Education Pilot
This Data Architecture Provides the Three Ss: Structure, Searchability, and Semantics.
National Infrastructure for Community Statistics Pilot
[Figure labels: Metamodel, Model, Figures, Metadata, Data Stories, Data]
13DRM IT2 Education Pilot
14DRM IT2 Education Pilot
15DRM IT2 Education Pilot
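Search Context Continuum (Mills Davis)
[Figure: a continuum from Weak Semantics to Strong Semantics, with Increasing Metadata on one axis and Increasing Search Capability on the other.
- Labels along the continuum, listed from the Strong Semantics end toward the Weak Semantics end: Modal Logic; First Order Logic; Logical Theory; Description Logic; OWL; Conceptual Model; UML; RDF/S; Topic Map; Thesaurus; ER Model; DB Schema, XML Schema; Taxonomy; Relational Model, XML; Glossary; Controlled Vocabulary
- Interoperability bands: Semantic Interoperability, Structural Interoperability, Syntactic Interoperability
- Search capability labels: Recovery, Discovery, Intelligence, QA, Reasoning]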
16DRM IT2 Education Pilot
[Figure labels:
- Axes: Level of Expressivity, Level of Complexity, Amount of Content with Metadata
- Representations: Ontology Domain, KR System, OO Software Model, Entity-Relationship Model, Concept Map, Topic Map, Database Schema, XML Schema, Hierarchical Taxonomy, Simple Taxonomy, Glossary
- Interoperability: Semantic Interoperability, Syntactical Interoperability
- Metadata levels: No metadata, Minimal metadata, Registry metadata, RDF/OWL metadata
- Some search and knowledge computing examples (see next slide): Google, Vivisimo, etc.; Siderian Cerebra; BioCAD]
Partial Source: The Model-Driven Semantic Web: Emerging Technologies Implementation Strategies, Elisa Kendall, Sandpiper Software, August 16, 2005.
17DRM IT2 Education Pilot
The Tradeoff Between Search and Knowledge Computing: How would you really accomplish the seven scenarios in the GSA/OMB RFI?
See the next slide for the BioCAD example.
18DRM IT2 Education Pilot
Source: Knowledge Computing Example in The Business Value of Semantic Technology: Exploiting New Value Paradigms, Mills Davis, Joint CAF, PMCoP, SICoP Meeting, Reagan Center, September 21, 2005, http://colab.cim3.net/file/work/SICoP/2005-09-21/BizValue050921.pdf
19DRM IT2 Education Pilot
- Some Nice Things We Have Found About Ontologies
- Both the Architecture and the Main Component of An Information System
- See Nicola Guarino, Formal Ontology and Information Systems, Proceedings of the FOIS '98, Trento, Italy, 6-8 June 1998.
- Collaborative Governance Commitment to Make the Content Semantically Interoperable
- Broadstrokes Group Pilot (SWANS Conference, April 7-8, 2005).
- Execution in Composite Application Platform
- Digital Harbor and TopQuadrant Use Cases Pilot (in process).
- Respected in the Enterprise Architecture Community
- Continuity of Communications for the Federal Executive Branch Using Ontological Engineering by Roy Roebuck.
- Executable Enterprise Models (The FEA Reference Model Ontology) by Irene Polikoff and Robert Coyne, Published in the Premier Issue of the Journal of Enterprise Architecture.
- Also at the 4th International Semantic Web Conference, November 6-10, 2005.
20DRM IT2 Education Pilot
- Pat Cassidy, Chair, Ontology and Taxonomy Coordinating WG First Meeting, October 5, 2005, Recommended:
- The Common Semantic Model (COSMO)
- An inventory of logically defined higher-level concepts adequate to specify the meanings of the terms and concepts in all domain Knowledge Classification Systems used by participants.
- Structured as a set of precisely interrelated ontologies without duplicated concepts and with a set of logically consistent default core concepts.
See http://colab.cim3.net/cgi-bin/wiki.pl?OntologyandTaxonomyCoordinatingWGMeeting_2005_10_05. This is remarkably similar to the recent European Commission report (see next slide).
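The "without duplicated concepts" requirement can be checked mechanically once the contributing ontologies are inventoried. The following is a minimal sketch, assuming each ontology has been reduced to a set of concept labels. The ontology names and concepts are hypothetical, and matching on labels is a simplification: a real comparison would look at the logical definitions, not just the names.

```python
# Minimal sketch: find concepts defined in more than one ontology.
# Ontology names and concept labels are hypothetical.
from collections import defaultdict

ontologies = {
    "geo": {"Location", "Region", "Organization"},
    "health": {"Patient", "Organization", "Location"},
    "records": {"Document", "Agency"},
}


def duplicated_concepts(inventory):
    """Map each concept found in two or more ontologies to those ontologies."""
    owners = defaultdict(list)
    for ontology, concepts in sorted(inventory.items()):
        for concept in concepts:
            owners[concept].append(ontology)
    return {c: names for c, names in owners.items() if len(names) > 1}


duplicates = duplicated_concepts(ontologies)
for concept in sorted(duplicates):
    print(concept, duplicates[concept])
```

Each duplicate found this way is a candidate for promotion into a shared core that the domain ontologies then reference instead of redefining.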
21DRM IT2 Education Pilot
- European Commission IDABC Content Interoperability Strategy
- Semantic Interoperability Assets
- Dictionaries
- Thesauri
- Multilingual thesauri
- Nomenclatures
- Cross-references and mapping tables
- Ontologies
- Service registries
- Milestones
- Production of a pivot ontology of life business events connected to administrative activities.
- Publication of the ontology on the Clearinghouse server.
IDABC: Interoperable Delivery of European eGovernment Services to public Administrations, Business and Citizens.
22DRM IT2 Education Pilot
Site Map is a Hierarchical Taxonomy! Note similarity to our slide 13! (We are on the same track)
http://europa.eu.int/idabc/en/sitemap
23The DRM Implementation Through Iteration and Testing Strategy
- The DRM Implementation Through Iteration and Testing Strategy includes five key activities over the next year:
- Education and Training in DRM Version 1.5 and use in FEA DRM-based Information Sharing Pilots (started June 13th).
- Testing of XML Schemas and OWL Ontologies by NIST and the National Center for Ontological Research, respectively, among others (beginning after October 27th).
- Inventory/Repository of Semantic Interoperability Assets and Development of a Common Semantic Model (COSMO) by the new Ontology and Taxonomy Coordinating Work Group (ONTACWG) (started October 5th).
- Continued early implementation of DRM 1.5 concepts and artifacts by industry in open collaboration with open standards pilot projects and workshops (started July 19th).
- Fostering champions of DRM Best Practices to improve (1) agency data architectures within agencies and (2) cross-agency data sharing across agencies in funded projects (in process).
24Peter Yim, President and CEO of CIM Engineering, Inc., and Mark Musen, Stanford Medical Informatics, Stanford University
Presented at the 2005 SICoP Annual Meeting, September 14, 2005, at the MITRE Corporation, McLean, VA, by SICoP Chair, Brand Niemann, U.S. EPA.