Computational Characterization of Short Environmental DNA Fragments - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Computational Characterization of Short Environmental DNA Fragments

Description:

Alexander Goesmann1, Scott Kelley2, Alfred P hler1. 1Bielefeld University ... Institution of Oceanography, San Diego, USA) Waste Water Treatment Plant. plasmid sample ... – PowerPoint PPT presentation

Number of Views:60
Avg rating:3.0/5.0
Slides: 30
Provided by: lutzk2
Category:

less

Transcript and Presenter's Notes

Title: Computational Characterization of Short Environmental DNA Fragments


1
Computational Characterization of Short
Environmental DNA Fragments
  • Jens Stoye1, Lutz Krause1, Robert A.
    Edwards2,Forest Rohwer2, Naryttza N.
    Diaz1,Alexander Goesmann1, Scott Kelley2, Alfred
    Pühler1

1Bielefeld University 2San Diego State University
2
Computational Characterization of Short
Environmental DNA Fragments
  • Jens Stoye1, Lutz Krause1, Robert A.
    Edwards2,Forest Rohwer2, Naryttza N.
    Diaz1,Alexander Goesmann1, Scott Kelley2, Alfred
    Pühler1

1Bielefeld University 2San Diego State University
3
Metagenomics
4
CARMA - A Pipeline for Characterizing Short-Read
Metagenomes
  • Quantitative analysis of metagenomes
  • Which microbes live in an environment?
  • What are they doing?
  • What are the differences between communities from
    different environments?

5
(A) Functional Analysis
  • Reads directly analyzed without prior assembly
  • Protein family fragments used as Environmental
    Gene Tags (EGTs) for quantitative analysis of
    gene content
  • GO-term profiles characterize genetic diversity
    and potential metabolism of underlying communities

6
Heat Map Comparing GO-Term Frequencies
  • Comparative analysis reveals genetic and
    metabolic trends
  • Significantly overrepresented GO-terms identified
    with G-test

7
(B) Analyzing the Community Structure
  • EGTs assigned to taxonomic groups based on a
    phylogenetic analysis
  • Taxonomic profiles characterize the composition
    of the underlying communities

8
Taxonomic Classification of Short Environmental
Gene Tags
Phylogenetic tree reconstructed for each matching
Pfam family
Multiple alignment of known family
members (downloaded from Pfam web site)
Krause et al., submitted
9
Taxonomic Classification of Short Environmental
Gene Tags
Phylogenetic tree reconstructed for each matching
Pfam family
PF1-PF7 Known family members EGT1-EGT3
Environmental Gene Tags matching family
Identified EGTs matching family added to full
multiple alignment
Krause et al., submitted
10
Taxonomic Classification of Short Environmental
Gene Tags
Phylogenetic tree reconstructed for each matching
Pfam family
  • Multiple alignment used to calculate distance
    matrix
  • Pairwise distance sequence identity in aligned
    region
  • Missing values determined with additive
    estimation (Landry et al., 1996)

11
Taxonomic Classification of Short Environmental
Gene Tags
  • Distance matrix used to reconstruct phylogenetic
    tree (with Neighbor Joining)
  • EGTs classified based on their location in tree

Krause et al., submitted
12
Performance Evaluation Creating Standard of Truth
  • Test set 77 complete genomes
  • 2 Superkingdoms (Archaea and Bacteria)
  • 10 Phyla
  • 29 Classes
  • 62 Genera
  • 77 Species
  • Test set excluded from reference set
  • (Pfam members from any of the 77 species omitted
    from
  • full multiple alignments)

13
Performance Evaluation Creating Standard of Truth
  • 77 genomes fragmentized with ReadSim (Schmid et
    al., submitted)
  • Simulates sequencing using 454 pyrosequencing
  • Fragments randomly sampled (2x)
  • Fragment length 80-120bp, mean 100bp
  • Simulates sequencing errors at homopolymers

14
Classification Accuracy for Short Environmental
Gene Tags
Sens Sensitivity, fraction of correctly
classified EGTs Spec Specificity, reliability of
predictions FNrate False negative rate,
proportion of wrongly classified EGTs Urate
Unknown rate, proportion of EGTs not assigned to
any taxonomic group
15
Application Example Comparative Analysis of Four
Microbial Coral Reef Communities
  • In cooperation with Rob Edwards
  • and Forest Rohwer
  • (San Diego State University, California)

Dinsdale, et al., submitted
16
Influence of Human Activities on Coral Reef
Microbial Communities
Kingman
Palmyra
Tabuaeran
Kiritimati
  • Northern Line Islands

17
GO-Term Profiles Indicate Transition in Metabolic
Activities
Significantly different (p lt 0.01)
Color indicates abundance of GO-terms in each
sample
18
Community Structure
19
Taxonomic Profiles Indicate Transition from
Prochlorococcus to Synechococcus
(most abundant marine Cyanobacteria)
20
Application Example Comparative Analysis of
Three Aquatic Microbial Communities
L. Krause, N. N. Diaz, A. Goesmann, F. Rohwer, S.
Kelley, R. A. Edwards and J. Stoye. Taxonomic
classification of short environmental DNA
fragments. submitted
21
Sampling Locations
Rios Mesquites stromatolites, Mexico
Kingman coral reef, Northern Line Islands
San Diego solar salterns, USA
Sample data provided by Forest Rohwer and Robert
Edwards
22
Community Structure
pEGTs prokaryotic fraction of EGTs
23
Community Structure
Genus
pEGTs prokaryotic fraction of EGTs
24
Taxonomic Diversity
H' Diversity, including richness and evenness
(Shannon index)
J Evenness, relative commonness and rarity of
organisms
25
Further Applications of CARMA
  • Diversity of coral reef viruses
  • (in cooperation with Stuart Sandin, Scripps
  • Institution of Oceanography, San Diego, USA)
  • Waste Water Treatment Plant
  • plasmid sample
  • (in cooperation with Andreas Schlüter,
  • Bielefeld University)

26
Conclusions
  • Gene fragments identified using Pfam profile
    hidden Markov models
  • Fragments can be assigned to functional role and
    taxonomic origin
  • Profiling allows detection of trends in species
    composition, metabolism, and genetic potential
  • Pyrosequencing combined with profiling techniques
    enables rapid and cost-effective assay of
    microbial communities

27
Acknowledgements
  • Co-authors
  • Lutz Krause1, Robert A. Edwards2, Forest
  • Rohwer2, Naryttza N. Diaz1, Alexander
  • Goesmann1, Scott Kelley2, and Alfred Pühler1
  • Also many thanks to
  • Andreas Schlüter1, Elisabeth Dinsdale2, Scott
    Kelley2, Beltran Rodriguez-Brito2, and Christelle
    Desnues2

1Bielefeld University 2San Diego State University
28
  • Thank you for your attention!!!!

29
Taxonomic Diversity
pi proportion of EGTs classified into i-th
taxonomic group Hmax total number of
taxa found
Diversity
Evenness
Write a Comment
User Comments (0)
About PowerShow.com