How to access genomic information using Ensembl

1
How to access genomic information using Ensembl
Damian Smedley and Xosé Fernández
Ensembl Project
European Bioinformatics Institute
Cambridge, UK
November 2004
2
Schedule
Today
  • Introduction to the Ensembl system
  • Hands-on examples to introduce the system
  • Evaluating genes and transcripts
  • Variation in Ensembl (SNPs, haplotypes)
Tomorrow
  • Data mining with EnsMart
  • Comparative genomics and proteomics in Ensembl
  • BioMart
  • Advanced topics (Upload your own data, DAS)
3
Our goal
4
Assembly
Other ordering data
non-redundant, virtual contig view
5
Mapping and Sequencing the human genome
6
Status of the human sequence
  • finished (red/orange): 96% (99.999% accurate)
  • 30-40% repetitive elements (e.g. Alpha satellite,
    Alu repeats)
  • all known genes correctly identified (99.74%)
  • heterochromatin (grey): 4%

Assembled draft sequence totals 2.85 Gb
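
To put the accuracy figure on this slide in concrete terms, here is a small arithmetic sketch. It assumes that "99.999" is a per-base accuracy percentage and, as a rough simplification, applies that rate to the whole 2.85 Gb total; the slide itself does not state how the two numbers relate.

```python
from fractions import Fraction

# Assumption: "99.999" on the slide is a percentage accuracy per base.
accuracy = Fraction(99999, 100000)
error_rate = 1 - accuracy            # expected errors per base
bases_per_error = 1 / error_rate     # one error every N bases

# Assumption: the rate is applied to the full 2.85 Gb total.
total_bases = 2_850_000_000
expected_errors = total_bases * error_rate

print(f"about one error per {int(bases_per_error):,} bases")
print(f"roughly {int(expected_errors):,} errors across {total_bases:,} bases")
```

Exact fractions are used instead of floats so that the "one error per 100,000 bases" figure does not pick up rounding noise.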
7
Human genome Current status
  • 22,287 gene loci defined, consisting of 19,599
    protein-coding genes in the human genome and
    2,188 additional DNA segments predicted to be
    protein-coding genes
  • 1183 genes were born in the last 60-100 My
  • 30 genes died in a similar time period

Finishing the euchromatic sequence of the human
genome, Nature 431:931-45 (2004)
8
Ensembl - project aims
  • funded to provide metazoan genomes to the world
  • aims to provide the world's best automated genome
    annotation
  • a leading group for human and mouse analysis
  • all software, data and results freely available

9
Ensembl - project background
  • group split between EBI and Sanger
  • mainly Wellcome Trust funded
  • largest dedicated compute in biology in Europe
  • developer community > 100 people, including
    companies

10
Ensembl Open source
  • Freely-available
  • Community development.
  • > 51 Ensembl installs worldwide.
  • Both public and commercial,
  • e.g. Gramene (CSHL)
  • Fugu-sg (ICMB)
  • Ciona-sg (Temasek)

11
Ensembl
Supporting Databases
Final DB
Analysis DB
CPU
12
Genome browsing: why present the whole genome?
  • Explore what is in a chromosome region
  • See features in and around a specific gene
  • Search and retrieve across the whole genome
  • Investigate genome organization
  • Compare to other genomes
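
The first two bullets above (exploring a chromosome region, seeing features in and around a gene) come down to an interval-overlap lookup. The sketch below illustrates that idea only; the `Feature` class, the `features_in_region` helper, and all coordinates are made up for illustration and are not part of Ensembl.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """A hypothetical annotated feature with 1-based inclusive coordinates."""
    name: str
    chromosome: str
    start: int
    end: int


def features_in_region(features, chromosome, start, end):
    """Return the features that overlap [start, end] on the given chromosome."""
    return [
        f for f in features
        if f.chromosome == chromosome and f.start <= end and f.end >= start
    ]


# Toy data with invented names and positions.
features = [
    Feature("gene_A", "20", 1_000, 5_000),
    Feature("marker_1", "20", 4_800, 4_900),
    Feature("gene_B", "20", 9_000, 12_000),
    Feature("gene_C", "X", 2_000, 3_000),
]

hits = features_in_region(features, "20", 4_500, 9_500)
print([f.name for f in hits])
```

Two intervals overlap exactly when each one starts before the other ends, which is the single condition tested in the list comprehension.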

13
Genome browsers
  • Ensembl: public site and installable system
  • UCSC Human Genome Browser
  • NCBI Map Viewer

14
Introduction to the Ensembl web site
Ensembl
  • takes genomic sequence assemblies: human build 34,
    mouse, rat, Fugu, mosquito
  • adds annotation and links (automated process)
  • presents all the data on a web site
15
Annotation: genes
Known genes
  • where?
  • genomic structure?
  • transcript(s)?
  • protein(s)?
  • orthologues?
  • attach useful links
Novel genes
  • how to predict?
  • require evidence
  • transcript(s)?
  • protein(s)?
  • orthologues?
  • attach useful links

16
Annotation: other features
  • markers and SNPs
  • cytogenetic bands
  • repeated sequences
  • ESTs and other sequence records
  • where do they show sequence similarity?
  • regions homologous to other species

17
How to get started
  • Species homepage
  • Site map
  • Map View
  • Text search
  • BLAST
  • SSAHA
  • Disease View

18
Homepage
19
Site map
20
MapView
21
BLAST and SSAHA
22
BLAST and SSAHA
23
Regions, maps and markers
  • ContigView
  • CytoView
  • SyntenyView
  • MultiContigView
  • MarkerView
  • SNPView
24
Ensembl ContigView
25
ContigView close-up
Pop-up menu
26
ContigView - Chromosome 20 close-up
Manual annotation via Vega
Forward strand
Ensembl predictions
Reverse strand
Ensembl EST-based predictions
Other chromosomes with manual annotation from
http://vega.sanger.ac.uk: 6, 7, 9, 10, 13, 14,
20, 22, X
27
CytoView
28
GeneSNPView
29
MarkerView
SNPView
30
SyntenyView
31
MultiContigView
32
Genes and gene products
  • GeneView
  • TransView
  • ExonView
  • ProteinView
  • FamilyView
  • DomainView
  • GOView
  • DiseaseView
33
Ensembl GeneView
34
TransView
ExonView
35
ProteinView
36
FamilyView
37
GOView
38
DiseaseView
39
Data retrieval
  • EnsMart
  • ExportView
  • Data sets on ftp site
  • MySQL queries of databases
  • Perl API access to databases
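
As a rough illustration of the "MySQL queries of databases" route, the sketch below runs a SQL query and writes the result as tab-separated lines. It is a toy under stated assumptions: Python's built-in `sqlite3` stands in for a MySQL server so the example runs offline, and the `toy_gene` table, its columns, and its rows are hypothetical and do not reproduce the real Ensembl schema or data.

```python
import sqlite3

# In-memory database standing in for a MySQL server (assumption for runnability).
conn = sqlite3.connect(":memory:")

# Hypothetical table; NOT the real Ensembl schema.
conn.execute(
    """
    CREATE TABLE toy_gene (
        stable_id  TEXT PRIMARY KEY,
        chromosome TEXT NOT NULL,
        start_pos  INTEGER NOT NULL,
        end_pos    INTEGER NOT NULL,
        strand     INTEGER NOT NULL
    )
    """
)

# Invented rows for illustration only.
conn.executemany(
    "INSERT INTO toy_gene VALUES (?, ?, ?, ?, ?)",
    [
        ("TOY0003", "20", 9_000, 12_000, 1),
        ("TOY0001", "20", 1_000, 5_000, 1),
        ("TOY0002", "20", 6_000, 7_500, -1),
        ("TOY0004", "X", 2_000, 3_000, 1),
    ],
)


def export_genes_tsv(conn, chromosome):
    """Return tab-separated lines for all genes on one chromosome, ordered by start."""
    rows = conn.execute(
        "SELECT stable_id, chromosome, start_pos, end_pos, strand "
        "FROM toy_gene WHERE chromosome = ? ORDER BY start_pos",
        (chromosome,),
    )
    return ["\t".join(str(value) for value in row) for row in rows]


lines = export_genes_tsv(conn, "20")
print("\n".join(lines))
```

The query uses a bound parameter (`?`) rather than string formatting, which is the usual way to pass user-supplied values such as a chromosome name into SQL.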
40
ExportView
41
EnsMart
42
Mouse differences
  • Genomic sequence assembly based on whole genome
    shotgun, with finished stitched BACs
  • BACs are shown in CytoView (FPC map), but for
    most no sequence is available

43
Mouse CytoView
44
Help!
  • context sensitive help pages - click
  • access other documentation via generic home page
  • email the helpdesk

HelpDesk / Suggestions
45
Thanks
Ensembl Team
46
Ensembl Team
November 2004