Title: The Scientific
1The Scientific Regulatory Drivers for Omic
Data Standards What Wider Community Benefits can
be Derived? Dr Jason Snape AstraZeneca Global SHE
eGenomes, Cambridge, September 2005
2NERC Genomic Research Effort
- Directed Programmes- Environmental Genomics-
Post-Genomics Proteomics - Knowledge Transfer- Connect B- NERC/ EA
Fellowship - Responsive Mode
- Advanced Fellows- includes Environmental
Metabolomics
3Environmental Genomics
- Identification of important phenotypes
genotypes- ecologically driven- bottom-up
fitness approach (genes to populations) - Determine the ecological significance of
molecular variation- natural variation (spatial/
temporal)- consequence of mans activities
(pollution/ climate change) - Determine the impact the environment has on
phenotype- map to genotype (phenotypic
anchoring)- basis and theory of adaptive
evolution - Genome architecture- quantitative traits-
comparative mapping (model vs. wildlife sentinel) - Data management Analysis (min. 10 of budget)
4Post-Genomics Proteomics
- Integrating post-genomics and proteomics to
studyhost-parasite interactions in aquatic and
terrestrial hosts - What makes species, populations and taxa
resistant or vulnerable to environmental change
and environmentalimpact? - Plant responses to
abiotic stress at range margins- Molecular
ecotoxicology in aquatic vertebrate populations - Nutrient flux and biogeochemistry
- Environmental regulation of the proteome
- Data management Analysis (min. 10 of budget)
Systems-level biology funded
5NERC Directed Programmes
CCGCC
BBSRC MRC EPSRC ESRC
NERC has a genomic data policy
6Data Standards
- Data standards creep
- MIAME for everything
- Minimum perceived as too detailed/ time consuming
- Detracts from experiment work
- Ms not spent on science
- What value do they add?
7Scientific Drivers
- Multivariate post-genomic analyses have diverse
- applications
- Pathway discovery/ functionality
- Chemicals management (e.g. preclinical
drugsafety assessment, toxicology
ecotoxicology) - Disease diagnosis/ susceptibility
- Environmental science
8Scientific Drivers
Wide variety of analytical techniques
(transcripts,proteins metabolites) and
diversity of data typesdictates core Standards
to facilitate communication/ collaboration
betweendifferent fields of activity and fulfill
the needs ofjournal editors regulatory
agencies
- More about accessibility than accountability
- Minimise duplication
- Foster collaborative science
- Realise potential for comparative genomics
9Approaches to Omic Science
From Kell (2003)
10Approaches to Omic Science
Increase in - assumption free- hypothesis
generating- fishing
proposals
For hypothesis-driven science the PI should
knowwhat metadata should be collected For
hypothesis-generating science, should the
PI collect all possible metadata?
How do we collect the data in an added value
format?
Standards need to be community driven
11Regulatory Drivers
RCUK, including NERC, must demonstrate effective
Knowledge Transfer One aspect of KT is science
into policy Chemicals management as an
example- toxicology - ecotoxicology
12Toxicological/ Ecotoxicological Drivers
- EU REACH Directive- Registration, Evaluation
Authorisation of Chemicals- 30,000 chemicals
(little or no data/ pre-legislation)- 11 years-
gt7.5B - Animal welfare concerns- in vitro possibilities
Industry regulatory agencies are looking to
omics- cheap effective screen to assist
prioritisation
Industry regulators work in a standards
environment- this ethos is being pushed hard
(e.g. SMRS)
13OECD 210 Fish Early Life Stage
14OECD 210 Fish Early Life Stage
- MIAME/ Tox has been developed to build
confidencein toxicogenomics - Platform issues still not resolved validation
problems
15Toxicological/ Ecotoxicological Drivers
Cheap effective screen to assist prioritisation-
target chronic testing- opp. for pathway
discovery pathway biomarkers - no false
positives- approaches must be validated (Corvi,
EHP, 2005)- platform independent- outcomes must
be associated with validity criteria-
transparent QA/ QC- reproducible (withstand ring
testing - interlab)
All this against the background of poor
arrayreproducibility
16Community Benefits
- Omic science is still data rich knowledge poor
- Open community-driven standards are required to
- instil confidence (hype is over, it is delivery
time) - address sceptics
- facilitate/ maximise the value of the data
- improve knowledge generation
- assist cross-species extrapolation to realise
comparative genomics
Lessons from array-based work must be applied to
proteomics,metabolomics metagenomics
Knowledge
Data
17Environmental Genomics
The Challenge
18Environmental Genomics
Earth System Science
Systems Biology
NEBC versus other NERC DCs
19(No Transcript)
20Regulatory Drivers