Title: The National Center for Biomedical Ontology
1The National Center for Biomedical Ontology
- Stanford Berkeley Mayo Victoria Buffalo
UCSF Oregon Cambridge
2Ontologies are essential to make sense of
biomedical data
3A biological ontology is
- A machine interpretable representation of some
aspect of biological reality
- what kinds of things exist?
sense organ
eye disc
develops from
is_a
- what are the relationships between these things?
eye
part_of
ommatidium
4The Foundational Model of Anatomy
5Knowledge workers seem trapped in a
pre-industrial age
- Most ontologies are
- Of relatively small scale
- Built by small groups working arduously in
isolation - Success rests heavily on the particular talents
of individual artisans, rather than on SOPs and
best practices - There are few technologies available to make this
process faster, better, cheaper
6A Portion of the OBO Library
7National Center for Biomedical Ontology
Capture and index experimental results
Open Biomedical Ontologies (OBO)
Open Biomedical Data (OBD)
BioPortal
Revise biomedicalunderstanding
Relate experimental data to results from other
sources
8- Stanford Tools for ontology alignment,
indexing, and management (Cores 1, 47 Mark
Musen) - LawrenceBerkeley Labs Tools to use ontologies
for data annotation (Cores 2, 57 Suzanna Lewis) - Mayo Clinic Tools for access to large
controlled terminologies (Core 1 Chris Chute) - Victoria Tools for ontology and data
visualization (Cores 1 and 2 Margaret-Anne
Story) - University at Buffalo Dissemination of best
practices for ontology engineering (Core 6 Barry
Smith)
9cBio Driving Biological Projects
- Trial Bank UCSF, Ida Sim
- Flybase Cambridge, Michael Ashburner
- ZFIN Oregon, Monte Westerfield
10The National Center for Biomedical Ontology
- Core 3 Driving Biological Projects
- Monte Westerfield
11Animal disease models
Animal models
Mutant Gene Mutant or missing
ProteinMutant Phenotype
12 Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease)
Animal disease models
Humans
Animal models
Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease model)
13 Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease)
Animal disease models
Humans
Animal models
Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease model)
14 Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease)
Animal disease models
Humans
Animal models
Mutant Gene Mutant or missing
ProteinMutant Phenotype (disease model)
15SHH-/
SHH-/-
shh-/
shh-/-
16 Phenotype (clinical sign) entity
attribute
17 Phenotype (clinical sign) entity
attribute P1 eye hypoteloric
18 Phenotype (clinical sign) entity
attribute P1 eye hypoteloric P2
midface hypoplastic
19 Phenotype (clinical sign) entity
attribute P1 eye hypoteloric P2
midface hypoplastic P3 kidney
hypertrophied
20 Phenotype (clinical sign) entity
attribute P1 eye hypoteloric P2
midface hypoplastic P3 kidney
hypertrophied
PATO hypoteloric hypoplastic
hypertrophied
ZFIN eye midface kidney
21 Phenotype (clinical sign) entity
attribute
Anatomy ontology Cell tissue ontology
Developmental ontology Gene ontology
biological process molecular function
cellular component
PATO (phenotype and trait ontology)
22 Phenotype (clinical sign) entity
attribute P1 eye hypoteloric P2
midface hypoplastic P3 kidney
hypertrophied
Syndrome P1 P2 P3 (disease)
holoprosencephaly
23Human holo- prosencephaly
Zebrafish shh
Zebrafish oep
24Human holo- prosencephaly
Zebrafish shh
Zebrafish oep
25ZFIN mutant genes
26OMIM genes
ZFIN mutant genes
27OMIM genes
ZFIN mutant genes
FlyBase mutant genes
28(No Transcript)
29National Center for Biomedical Ontology
Capture and index experimental results
Open Biomedical Ontologies (OBO)
Open Biomedical Data (OBD)
BioPortal
Revise biomedicalunderstanding
Relate experimental data to results from other
sources
30The National Center for Biomedical Ontology
- Core 2 Bioinformatics
- Suzanna Lewis
31cBio Bioinformatics Goals
- Apply ontologies
- Software toolkit for annotation
- Manage data
- Databases and interfaces to store and view
annotations - Investigate and compare
- Linking human diseases to genetic models
- Maintain
- Ongoing reconciliation of ontologies with
annotations
32cBio Bioinformatics Goals
- Apply ontologies
- Software toolkit for annotation
- Manage data
- Databases and interfaces to store and view
annotations - Investigate and compare
- Linking human diseases to genetic models
- Maintain
- Ongoing reconciliation of ontologies with
annotations
33Elicitation of Requirements for Annotation Tools
- Applications pull from pioneer users in Core 3
- ZFIN
- FlyBase
- Trial Bank
- Study these groups currently annotate data
- Determine how our Core 2 tools can integrate with
existing data flows and databases - Evaluate the commonalities and differences among
approaches
34Development of Data-Annotation Tool
- Develop plug-in architecture
- Default user interface for generic
data-annotation tasks - Custom-tailored interfaces for particular
biomedical domains - Enable interoperability with existing
ontology-management platforms - Integrate ontology-annotation tool with BioPortal
- Access ontologies for data annotation from OBO
- Store data annotations in OBD
35Phenotype as an observation
context
The class of thing observed
evidence
publication
environment
figures
assay
genetic
sequence ID
ontology
36Phenotype from published evidence
37Ontologies enable users to describe assays
38Phenotype as an observation
context
The class of thing observed
evidence
publication
environment
figures
assay
genetic
sequence ID
ontology
39Ontologies enable users to describe environments
40Phenotype as an observation
context
The class of thing observed
evidence
publication
environment
figures
assay
genetic
sequence ID
ontology
41Ontologies enable users to describe genotypes
42Phenotypes as collections
- Coincidence
- Same organism, same time
- Relative
- Reduced, enhanced
- Same focus of observation
- All left hands
- Differing levels of scale
- Molecular, cellular, organismal
- Recognizable patterns
- Set of observations that describe a disease
43(No Transcript)
44National Center for Biomedical Ontology
Capture and index experimental results
Open Biomedical Ontologies (OBO)
Open Biomedical Data (OBD)
BioPortal
Revise biomedicalunderstanding
Relate experimental data to results from other
sources
45The National Center for Biomedical Ontology
- Core 1 Computer Science
- Mark Musen
46E-science needs technologies
- To help build and extend ontologies
- To locate ontologies and to relate them to one
another - To visualize relationships and to aid
understanding - To facilitate evaluation and annotation of
ontologies
47We need to relate ontologies to one another
- We keep reinventing the wheel
- We dont even know whats out there!
- We need to make comparisons between ontologies
automatically - We need to keep track of ontology history and to
compare versions
48We need to compute both similarities and
differences
- Similarities
- Merging ontologies
- Mapping ontologies
- Differences
- Versioning
49Ontology engineering requires management of
complexity
- How can we
- keep track of hundreds of relationships?
- understand the implications of changes to a large
ontology? - know where ontologies are underspecified? And
where they are over constrained?
50(No Transcript)
51E-science needs technologies
- To help build and extend ontologies
- To locate ontologies and to relate them to one
another - To visualize relationships and to aid
understanding - To facilitate evaluation and annotation of
ontologies
52Core 1 Components
53Core 1 Contributors
- Stanford Tools for ontology management,
alignment, versioning, metadata management,
automated critiquing, and peer review - Mayo LexGrid technology for access to large
controlled terminologies, ontology indexing,
Soundex, search - Victoria Technology for ontology visualization
54National Center for Biomedical Ontology
Capture and index experimental results
Open Biomedical Ontologies (OBO)
Open Biomedical Data (OBD)
BioPortal
Revise biomedicalunderstanding
Relate experimental data to results from other
sources
55Core 4 Infrastructure
- Builds on existing IT infrastructure at Stanford
and at our collaborating institutions - Adds
- Online resources and technical support for the
user community - Collaboration tools to link all participating
sites
56Core 5 Education and Training
- Builds on existing, strong informatics training
programs at Stanford, Berkeley, UCSF,
Mayo/Minnesota, and Buffalo - New postdoctoral positions at Stanford, Berkeley,
and Buffalo - New visiting scholars program
57Core 6 Dissemination
- Active relationships with relevant professional
societies and agencies (e.g., HL7, IEEE, WHO,
NIH) - Internet-based resources for discussing,
critiquing, and annotating ontologies in OBO - Cooperation with other NCBCs to offer a library
of open-source software tools - Training workshops to aid biomedical scientists
in ontology development
58Upcoming cBio Dissemination Workshops
- Image Ontology Workshop Stanford CA, March
2425, 2006 - Training in Biomedical Ontology Schloss
Dagstuhl, May 2124, 2006 - Training in Biomedical Ontology Baltimore,
November 68, 2006 (in association with FOIS and
AMIA conferences)
59Core 7 Administration
- Project management shared between Stanford and
Berkeley - Executive committee (PI, co-PI, Center director,
and Center associate director) provides
day-to-day management and oversight - Council (All site PIs, including PIs of DBPs)
provides guidance and coordination of work plans - Each Core has a designated lead selected from
the Council
60cBiO Organization Chart
61Ontologies are essential to make sense of
biomedical data