Title: Introduction to Bio-Ontologies
1Introduction to Bio-Ontologies
- Barry Smith
- http//ontology.buffalo.edu/smith
2Outline
- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
3- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
4Who am I?
- Foundational Model of Anatomy Ontology (FMA)
- Common Anatomy Reference Ontology (CARO)
- Protein Ontology (PRO)
- Infectious Disease Ontology (IDO)
- Ontology for General Medical Science (OGMS)
- Plant Ontology (PO)
- Biometrics Upper Ontology
5- NCBO National Center for Biomedical Ontology
(NIH Roadmap Center)
- Stanford Biomedical Informatics Research
- The Mayo Clinic
- University at Buffalo Department of Philosophy
- http//bioportal.bioontology.org
6(No Transcript)
7National Cancer Institute Thesaurus
Preferred Name (Preferred_Name)
Wood Definitions (DEFINITION) The hard, fibrous
substance composing most of the stem and branches
of a tree or shrub, and lying beneath the bark
the xylem. Full Id http//ncicb.nci.nih.gov/xml/
owl/EVS/Thesaurus.owlWood Alt Definition
Fibrous plant material under the bark that is
created by lateral cell division from the
vascular cambium. Noted for high content
cellulose, hemicellulose, and lignin in the cell
walls.FDA
8- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
9(No Transcript)
10(No Transcript)
11(No Transcript)
12The Infinite Monkey (Fortuitous
Interoperability) strategy to resolve data silos
13How to find your data?
- How to find and integrate other peoples data?
- How to reason with data when you find it?
- How to understand the significance of the data
you collected 3 years earlier? - Part of the solution must involve
consensus-based, standardized terminologies and
coding schemes
14 NIH Mandates for Sharing of Research
Data Investigators submitting an NIH application
seeking 500,000 or more in any single year are
expected to include a plan for data sharing
(http//grants.nih.gov/grants/policy/data_sharin
g)
15Making data (re-)usable through standards
- Standards provide
- common structure and terminology
- single data source for review (less redundant
data) - Standards allow
- use of common tools and techniques
- common training
- single validation of data
16Problems with standards
- Standards involve considerable costs of
re-tooling, maintenance, training, ... - Not all standards are of equal quality
- Bad standards create lasting problems
17Ontology success stories, and some reasons for
failure
Linked Open Data in the Semantic Web
18etc.
19The more ontology is successful, the more it fails
- As ontologies (controlled vocabularies) become
easier to create, and to use - more and more ontologies are constructed
- thereby recreating the very silo problems
ontologies were designed to solve - How to solve this problem?
20- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
21How to do biology across the genome?
- MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYED
EKSGLIKVVKFRTGAMDRKRSFEKVVISVMVGKNVKKFLTFVEDEPDFQG
GPISKYLIPKKINLMVYTLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSF
RVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAP
YMFLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQI
CNDMMNKKDRLGDILHIILRACALNFGAGPRGGAGDEEDRSITNEEPIIP
SVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDDNN
GVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYV
DDTTRRIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALG
NSYDAFNHDPWMDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEYA
TFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGSRFETDLYESATSEL
MANHSVQTGRNIYGVDFSLTSVSGTTATLLQERASERWIQWLGLESDYHC
SFSSTRNAEDV
22- MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYED
EKSGLIKVVKFRTGAMDRKRSFEKVVISVMVGKNVKKFLTFVEDEPDFQG
GPIPSKYLIPKKINLMVYTLFQVHTLKFNRKDYDTLSLFYLNRGYYNELS
FRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLA
PYMFLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQ
ICNDMMNKKDRLGDILHIILRACALNFGAGPRGGAGDEEDRSITNEEPII
PSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDDN
NGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVY
VDDTTRRIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICAL
GNSYDAFNHDPWMDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEY
ATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGSRFETDLYESATSE
LMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQERASERWIQWLGLESDY
HCSFSSTRNAEDVVAGEAASSNHHQKISRVTRKRPREPKSTNDILVAGQK
LFGSSFEFRDLHQLRLCYEIYMADTPSVAVQAPPGYGKTELFHLPLIALA
SKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAPVRNFIEEGYDGVTD
LYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLGYLIVDEFHNFETE
VYRQSQFGGITNLDFDAFEKAIFLSGTAPEAVADAALQRIGLTGLAKKSM
DINELKRSEDLSRGLSSYPTRMFNLIKEKSEVPLGHVHKIRKKVESQPEE
ALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRVVWIHGKLGAA
EKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDNRLNIIELIQ
GVGRLRDGGLCYLLSRKNSWAARNRKGELPPKEGCITEQVREFYGLESKK
GKKGQHVGCCGSRTDLSADTVELIERMDRLAEKQATASMSIVALPSSFQE
SNSSDRYRKYCSSDEDSNTCIHGSANASTNASTNAITTASTNVRTNATTN
ASTNATTNASTNASTNATTNASTNATTNSSTNATTTASTNVRTSATTTAS
INVRTSATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSATTTASIN
VRTSATTTESTNSSTSATTTASINVRTSATTTKSINSSTNATTTESTNSN
TNATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSAATTESTNSNTS
ATTTESTNASAKEDANKDGNAEDNRFHPVTDINKESYKRKGSQMVLLERK
KLKAQFPNTSENMNVLQFLGFRSDEIKHLFLYGIDIYFCPEGVFTQYGLC
KGCQKMFELCVCWAGQKVSYRRIAWEALAVERMLRNDEEYKEYLEDIEPY
HGDPVGYLKYFSVKRREIYSQIQRNYAWYLAITRRRETISVLDSTRGKQG
SQVFRMSGRQIKELYFKVWSNLRESKTEVLQYFLNWDEKKCQEEWEAKDD
TVVVEALEKGGVFQRLRSMTSAGLQGPQYVKLQFSRHHRQLRSRYELSLG
MHLRDQIALGVTPSKVPHWTAFLSMLIGLFYNKTFRQKLEYLLEQISEVW
LLPHWLDLANVEVLAADDTRVPLYMLMVAVHKELDSDDVPDGRFDILLCR
DSSREVGE
23Biomedical Ontology in PubMed
24By far the most successful GO (Gene Ontology)
25Clark et al., 2005
is_a
part_of
26(No Transcript)
27 The Gene Ontology
28the GO works through annotation of data
what cellular component?
what molecular function?
what biological process?
29three types of data
what cellular component?
what molecular function?
what biological process?
30Gene Ontology Consortium
- WormBase
- Gramene
- FlyBase
- Rat Genome Database
- DictyBase
- Mouse Genome Database
- The Arabidopsis Information Resource
- The Zebrafish Information Network
- Berkeley Drosophila Genome Project
- Saccharomyces Genome Database
- ...
31Benefits of GO
- rooted in basic experimental biology
- links people to data and to literature
- links data to data
- across species (human, mouse, yeast, fly ...)
- across granularities (molecule, cell, organ,
organism, population) - links medicine to biological science
- cumulation of scientific knowledge in
algorithmically tractable form
32A strategy for translational medicine
- Sjöblöm T, et al. analyzed 13,023 genes in 11
breast and 11 colorectal cancers - using functional information captured by GO
identified 189 genes as being mutated at
significant frequency and thus as providing
targets for diagnostic and therapeutic
intervention. - Science. 2006 Oct 13314(5797)268-74.
33- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine Open Biomedical
Ontologies - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
34(No Transcript)
35Ontology Scope URL Custodians
Cell Ontology (CL) cell types from prokaryotes to mammals obo.sourceforge.net/cgi- bin/detail.cgi?cell Jonathan Bard, Michael Ashburner, Oliver Hofman
Chemical Entities of Bio- logical Interest (ChEBI) molecular entities ebi.ac.uk/chebi Paula Dematos, Rafael Alcantara
Common Anatomy Refer- ence Ontology (CARO) anatomical structures in human and model organisms (under development) Melissa Haendel, Terry Hayamizu, Cornelius Rosse, David Sutherland,
Foundational Model of Anatomy (FMA) structure of the human body fma.biostr.washington. edu JLV Mejino Jr., Cornelius Rosse
Functional Genomics Investigation Ontology (FuGO) design, protocol, data instrumentation, and analysis fugo.sf.net FuGO Working Group
Gene Ontology (GO) cellular components, molecular functions, biological processes www.geneontology.org Gene Ontology Consortium
Phenotypic Quality Ontology (PaTO) qualities of anatomical structures obo.sourceforge.net/cgi -bin/ detail.cgi? attribute_and_value Michael Ashburner, Suzanna Lewis, Georgios Gkoutos
Protein Ontology (PrO) protein types and modifications (under development) Protein Ontology Consortium
Relation Ontology (RO) relations obo.sf.net/relationship Barry Smith, Chris Mungall
RNA Ontology (RnaO) three-dimensional RNA structures (under development) RNA Ontology Consortium
Sequence Ontology (SO) properties and features of nucleic sequences song.sf.net Karen Eilbeck
36 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO)
http//obofoundry.org
37Community / Population Ontology
- family, clan
- ethnicity
- religion
- diet
- social networking
- education (literacy ...)
- healthcare (economics ...)
- household forms
- demography
- public health
- ...
38 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO)
http//obofoundry.org
39 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
ORGAN AND ORGANISM Family, Community, Deme, Population Family, Community, Deme, Population Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO)
http//obofoundry.org
40 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
COMPLEX OF ORGANISMS Family, Community, Deme, Population Family, Community, Deme, Population Organ Function (FMP, CPRO) Population Phenotype Population Process
ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO)
http//obofoundry.org
41 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
COMPLEX OF ORGANISMS Family, Community, Deme, Population Family, Community, Deme, Population Organ Function (FMP, CPRO) Population Phenotype Population Process
ORGAN AND ORGANISM Organism (NCBI Taxonomy) (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cell Com-ponent (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO)
E N V I R O N M E N T
http//obofoundry.org
42 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT INDEPENDENT INDEPENDENT
COMPLEX OF ORGANISMS Family, Community, Deme, Population Family, Community, Deme, Population Environment of population
ORGAN AND ORGANISM Organism (NCBI Taxonomy) (FMA, CARO) Environment of single organism
CELL AND CELLULAR COMPONENT Cell (CL) Cell Com-ponent (FMA, GO) Environment of cell
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular environment
E N V I R O N M E N T
http//obofoundry.org
43 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT INDEPENDENT INDEPENDENT
COMPLEX OF ORGANISMS Family, Community, Deme, Population Family, Community, Deme, Population Environment of population
ORGAN AND ORGANISM Organism (NCBI Taxonomy) (FMA, CARO) Environment of single organism
CELL AND CELLULAR COMPONENT Cell (CL) Cell Com-ponent (FMA, GO) Environment of cell
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular environment
E N V I R O N M E N T
The sum total of the conditions and elements
that make up the surroundings and influence the
development and actions of an individual.
44 RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT CONTINUANT CONTINUANT CONTINUANT OCCURRENT OCCURRENT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT INDEPENDENT DEPENDENT DEPENDENT
COMPLEX OF ORGANISMS Family, Community, Deme, Population Family, Community, Deme, Population Organ Function (FMP, CPRO) Population Phenotype Population Process
ORGAN AND ORGANISM Organism (NCBI Taxonomy) (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality(PaTO) Biological Process (GO)
CELL AND CELLULAR COMPONENT Cell (CL) Cell Com-ponent (FMA, GO) Cellular Function (GO) Phenotypic Quality(PaTO) Biological Process (GO)
MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Function (GO) Molecular Process (GO) Molecular Process (GO)
Plant Anatomy
Plant Growth and Developmental Stage
http//obofoundry.org
45Goal of the OBO Foundry
- all biomedical research data should cumulate to
form a single, algorithmically processable, whole - Smith, et al. Nature Biotechnology, Nov 2007
46CRITERIA
The ontology is open and available to be used by
all. The ontology is instantiated in, a common
formal language and shares a common formal
architecture The developers of the ontology agree
in advance to collaborate with developers of
other OBO Foundry ontology where domains overlap.
OBO FOUNDRY CRITERIA
47- The developers of each ontology commit to its
maintenance in light of scientific advance, and
to soliciting community feedback for its
improvement. - They commit to working with other Foundry members
to ensure that, for any particular domain, there
is community convergence on a single controlled
vocabulary.
CRITERIA
48(No Transcript)
49Current OBO Foundry Ontologies
- Biological process (GO)
- Cellular component (GO)
- Chemical entities of biological interest
- Molecular function (GO)
- Phenotypic quality
- PRotein Ontology (PRO)
- Xenopus Anatomy and Development
- Zebrafish Anatomy and Development
50Foundry ontologies under review
- Cell Ontology (CL)
- Infectious Disease Ontology (IDO)
- Ontology for Biomedical Investigations (OBI)
- Plant Ontology (PO)
51Ontologies under construction
- Allergy Ontology
- Environment Ontology (EnvO)
- Immunology Ontology (IDO)
- Mental Functioning Ontology (MFO)
- Emotion Ontology (MFO-EM)
- Pain Ontology
- Mental Disease Ontology (MDO)
- Neurological Disease Ontology (ND)
- Vaccine Ontology
52- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
53Anatomy Ontologies
- Fish Multi-Species Anatomy Ontology (NSF funding
received) - Ixodidae and Argasidae (Tick) Anatomy Ontology
- Mosquito Anatomy Ontology (MAO)
- Spider Anatomy Ontology (SPD)
- Xenopus Anatomy Ontology (XAO)
- undergoing reform Drosophila and Zebrafish
Anatomy Ontologies
54Ontologies facilitate grouping of annotations
brain 20 hindbrain 15
rhombomere 10
Query brain without ontology 20 Query brain
with ontology 45
55Anatomical Space
Anatomical Structure
Organ Cavity Subdivision
Organ Cavity
Organ
Serous Sac
Organ Component
Serous Sac Cavity
Tissue
Serous Sac Cavity Subdivision
is_a
Pleural Sac
Pleura(Wall of Sac)
Pleural Cavity
part_of
Parietal Pleura
Visceral Pleura
Interlobar recess
Mediastinal Pleura
Mesothelium of Pleura
56Basic Formal Ontology (Top Level)
http//www.ifomis.org/bfo/
Continuant
Occurrent
Process Stage
Independent Continuant
Dependent Continuant
Anatomical Structure
Quality
57Independent Continuant
58OBO Foundry organized in terms of Basic Formal
Ontology
- through the methodology of downward population
- Each Foundry ontology can be seen as an
extension of a single upper level ontology (BFO) -
59Example The Cell Ontology
60Continuant
Independent Continuant
Dependent Continuant
Quality
Disposition
..... .....
61depends_on
Continuant
Occurrent process, event
Independent Continuant thing
Dependent Continuant quality
temperature depends on bearer
.... ..... .......
62Phenotype Ontology (PATO)
63color
anatomical structure
is_a
is_a
red
eye
instantiates
instantiates
an instance of an eye (in a particular fly)
the particular case of redness (of a particular
fly eye)
depends on
64Phase transitions
portion of water
portion of ice
portion of liquid water
portion of gas
instantiates at t1
instantiates at t2
instantiates at t3
this portion of H20
65Phase transitions
plant
zygote
embryo
seed
instantiates at t1
instantiates at t2
instantiates at t3
this plant
66human
embryo
fetus
adult
neonate
infant
child
instantiates at t1
instantiates at t2
instantiates at t3
instantiates at t4
instantiates at t5
instantiates at t6
John (exists continuously)
67temperature
in nature, no sharp boundaries here
37ºC
37.1ºC
37.5ºC
37.2ºC
37.3ºC
37.4ºC
instantiates at t1
instantiates at t2
instantiates at t3
instantiates at t4
instantiates at t5
instantiates at t6
Johns temperature (exists continuously)
68coronary heart disease
asymptomatic (silent) infarction
early lesions and small fibrous plaques
stable angina
surface disruption of plaque
unstable angina
instantiates at t1
instantiates at t2
instantiates at t3
instantiates at t4
instantiates at t5
Johns coronary heart disease (exists
continuously)
time
69- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- IDO The Infectious Disease Ontology
- The Environment Ontology
70We have data
- TBDB Tuberculosis Database, including Microarray
data - VFDB Virulence Factor DB
- TropNetEurop Dengue Case Data
- ISD Influenza Sequence Database at LANL
- PathPort Pathogen Portal Project
- ...
71We need to annotate this data
- to allow retrieval and integration of
- sequence and protein data for pathogens
- case report data for patients
- clinical trial data for drugs, vaccines
- epidemiological data for surveillance, prevention
- ...
- Goal to make data deriving from different
sources comparable and computable
72IDO needs to work with
- Disease Ontology (DO) SNOMED CT
- Gene Ontology Immunology Branch
- Phenotypic Quality Ontology (PATO)
- Protein Ontology (PRO)
- Sequence Ontology (SO)
- ...
73We need common controlled vocabularies to
describe these data in ways that will assure
comparability and cumulation
- What content is needed to adequately cover the
infectious domain? - Host-related terms (e.g. carrier, susceptibility)
- Pathogen-related terms (e.g. virulence)
- Vector-related terms (e.g. reservoir,
- Terms for the biology of disease pathogenesis
(e.g. evasion of host defense) - Population-level terms (e.g. epidemic, endemic,
pandemic, )
74IDO Processes
75IDO Qualities
76IDO Roles
77IDO provides a common template
- IDO contains terms (like pathogen, vector,
host) which apply to organisms of all species
involved in infectious disease and its
transmission - Disease- and organism-specific ontologies built
as refinements of the IDO core
78Disease-specific IDO test projects
- MITRE, Mount Sinai, UTSouthwestern Influenza
- Stuart Sealfon, Joanne Luciano,
- IMBB/VectorBase Vector borne diseases (A.
gambiae, A. aegypti, I. scapularis, C. pipiens,
P. humanus) - Kristos Louis
- Colorado State University Dengue Fever
- Saul Lozano-Fuentes
- Duke Tuberculosis
- Carol Dukes-Hamilton
- Cleveland Clinic Infective Endocarditis
- Sivaram Arabandi
- University of Michigan Brucilosis
- Yongqun He
79- Who am I?
- How to find your data
- How to do biology across the genome
- How to extend the GO methodology to clinical and
translational medicine - Anatomy Ontologies An OBO Foundry success story
- The Infectious Disease Ontology
- The Environment Ontology
80RELATION TO TIME GRANULARITY CONTINUANT CONTINUANT
RELATION TO TIME GRANULARITY INDEPENDENT INDEPENDENT
COMPLEX OF ORGANISMS biome / biotope, territory, habitat, neighborhood, ...
COMPLEX OF ORGANISMS work environment, home environment host/symbiont environment ...
ORGAN AND ORGANISM work environment, home environment host/symbiont environment ...
CELL AND CELLULAR COMPONENT extracellular matrix chemokine gradient ...
MOLECULE hydrophobic surface virus localized to cellular substructure active site on protein pharmacophore ...
E N V I R O N M E N T
http//obofoundry.org
81The Environment Ontology
OBO Foundry Genomic Standards Consortium National
Environment Research Council (UK) USDA, Gramene,
J. Craig Venter Institute ...
82Applications of EnvO in biology
83How EnvO currently works for information retrieval
- Retrieve all experiments on organisms obtained
from - deep-sea thermal vents
- arctic ice cores
- rainforest canopy
- alpine melt zone
- Retrieve all data on organisms sampled from
- hot and dry environments
- cold and wet environments
- a height above 5,000 meters
- Retrieve all the omic data from soil organisms
subject to - moderate heavy metal contamination
84extending EnvO to clinical and translational
research
- we have public heath, community and population
data - we need to make this data available for search
and algorithmic processing - we create a consensus-based ontology which can
interoperate with ontologies for neighboring
domains of medicine and basic biology -
85Environment totality of circumstances external
to a living organism or group of organisms
- pH
- evapotranspiration
- turbidity
- available light
- predominant vegetation
- predatory pressure
- nutrient limitation
86extend EnvO to the clinical domain
- dietary patterns (Food Ontology FAO, USDA) ...
allergies - neighborhood patterns
- built environment, living conditions
- climate
- social networking
- crime, transport
- education, religion, work
- health, hygiene
- disease patterns
- bio-environment (bacteriological, ...)
- patterns of disease transmission (links to IDO)
87(No Transcript)
88with thanks to
- BFO Fabian Neuhaus (NIST), Melissa Haendel
(Oregon), David Sutherland (Flybase) - EnvO Dawn Field, Norman Morrison (NERC)
- FMA Cornelius Rosse, J. L. E. Mejino (Seattle)
- IDO Lindsay Cowell, Albert Goldfain (Dallas)
- OBO Foundry Michael Ashburner, Suzanna Lewis,
Chris Mungall (Flybase, GO), Alan Ruttenberg
(Buffalo, Neurocommons) - NCBO NIH RFA-RM-04-022
- PRO NIH R01 GM080646-01
- PO The Plant Ontology Consortium