Towards Personal Genomics - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

Towards Personal Genomics

Description:

HuRef Project: Genome of an Individual ... Genome of a Single Individual: Goals ... Tina McIntosh, Karen Beeson, Dana Busam, Alexia Tsiamouri, Celera Genomics. ... – PowerPoint PPT presentation

Number of Views:46
Avg rating:3.0/5.0
Slides: 24
Provided by: YLin4
Category:

less

Transcript and Presenter's Notes

Title: Towards Personal Genomics


1
Towards Personal Genomics
  • Tools for Navigating the
  • Genome of an Individual
  • Saul A. Kravitz
  • J. Craig Venter Institute
  • Rockville, MD

Bio-IT World 2008
2
Personal Genomics The future is now
3
Outline
  • HuRef Project Genome of an Individual
  • HuRef Research Highlights
  • The HuRef Browser http//huref.jcvi.org
  • Towards Personal Genomics Browsers
  • Conclusions and Credits

4
Genome of a Single Individual Goals
  • Provide a diploid genome that could serve as a
    reference for future individualized genomics
  • Characterize the individuals genetic variation
  • HuRef vs NCBI
  • HuRef haplotypes
  • Understand the individuals risk profile based on
    their genomic data

5
How does HuRef Differ?
  • NCBI Genome
  • Multiple individuals
  • Collapsed Haploid Sequence of a Diploid Genome
  • No haplotype phasing or inference possible
  • HuRef
  • Single individual
  • Can reconstruct haplotypes of diploid genome
  • Haplotype Blocks
  • Segment of DNA inherited from one parent

6
The HuRef Genome
PLoS Biology 2007 5e254 September 4, 2007
7
The HuRef Genome
  • DNA from a single individual
  • De Novo Assembly
  • 7.5x Coverage Sanger Reads
  • Diploid Reconstruction
  • Half of genome is in haplotype blocks of gt200kb
  • HuRef Data Released
  • NCBI Genome Project 19621
  • JCVI http//huref.jcvi.org

8
Variants NCBI-36 vs HuRef
  • NCBI-36 vs HuRef yields Homozygous Variants

SNP
MNP
Insertion
Deletion
NCBI
HuRef
variant
variant
variant G/A
variant TA/AT
9
Computing Allelic Contributions
  • Consensus generation conflates alleles

10
Computing Allelic Contributions
  • Consensus generation modified to separate
    alleles
  • Bioinformatics. 2008 Apr 1524(8)1035-40
  • Consensus generation conflates alleles

11
Computing Allelic Contributions
  • Modified Consensus generation separates allele
  • Compare HuRef alleles to identify SNP, MNP,
    Indel Variants

MNP Variant
12
HuRef Variations
  • 4.1 Million Variations (12.3 Mbp)
  • 1.2 Million Novel
  • Many non-synonymous changes
  • 700 indels and 10,000 total SNPs
  • Indels and non-SNP Sequence Variation
  • 22 of all variant events, 74 of all variant
    bases
  • 0.5-1.0 difference between haploid genomes
  • 5-10x higher than previous estimates

13
HuRef Browser
  • Why do this?
  • Research tool focused on variation
  • Verify assembly and variants
  • Show ALL the evidence
  • High Perfomance
  • Features
  • Use HuRef or NCBI as reference
  • Genome vs Genome Comparison
  • Drill down from chromosome to reads and
    alignments
  • Overlay of Ensembl and NCBI Annotation
  • Links from HuRef features in NCBI (e.g., dbSNP)
  • Export of data for further analysis

14
http//huref.jcvi.org
Search by Feature ID or coordinates
15
Zinc Finger ProteinChr1957564487-57581356
Variations
Assembly Structure
16
chr1957578700-57581000
Protein Truncated by 476 bp Insertion
17
Assembly Structure
18
Drill Down toMulti-sequence Alignment
Validation of Phased A/C Heterozygous SNPs in
HuRef
19
14kbp Inversion Spanning TNFRSF14chr12469149-249
6613
20
Browser for Multiple Genomes
  • Expand on existing features
  • Variants and haplotype blocks in individuals
  • Structural variation among individuals
  • Genetic traits of variants related to diseases
  • Required Features
  • Which genome/haplotype is the reference?
  • Correlation with phenotypic, medical, and
    population data
  • Correlation within families

21
Future Challenges
  • Data volumes
  • read data included from new technologies
  • Multiplication of genomes
  • Enormous number of potential comparisons
  • Populations, individuals, variants
  • Dynamic generation of views in web time
  • Use cases are evolving

22
Conclusion
  • A high performance visualization tool for an
    individual genome
  • Validation of variants
  • Comparison with NCBI-36
  • Planned extensions for multi-genome era
  • Website http//huref.jcvi.org
  • Contact hurefhelp_at_jcvi.org

23
Acknowledgements
HuRef Browser Nelson Axelrod, Yuan Lin, and
Jonathan Crabtree Scientific Leadership Sam
Levy, Craig Venter, Robert Strausberg, Marvin
Frazier Sequence Data Generation and Indel
Validation Yu-Hui Rogers, John Gill, Jon
Borman, JTC Production, Tina McIntosh, Karen
Beeson, Dana Busam, Alexia Tsiamouri, Celera
Genomics. Data Analysis Sam Levy,
Granger Sutton, Pauline Ng, Aaron Halpern, Brian
Walenz, Nelson Axelrod, Yuan Lin, Jiaqi Huang,
Ewen Kirkness, Gennady Denisov, Tim Stockwell,
Vikas Basal, Vineet Bafna, Karin Remington, and
Josep Abril CNV, Genotyping, FISH mapping
Steve Scherer, Lars Feuk, Andy Wing Chun Pang,
Jeff MacDonald Funding J. Craig Venter
Foundation DNA J. Craig Venter
Write a Comment
User Comments (0)
About PowerShow.com