Title: The UCSC Genome Browser


1
The UCSC Genome Browser
  • From Men to Mice

WJ Kent, C Sugnet, T Furey, T Pringle, M
Schwartz, R Baertsch, R Weber, K Roskin, D
Thomas, S Rogic, M Diekhans, F Hsu, D Karolchik,
D Haussler
2
(No Transcript)
3
Cardiac Troponin T2
4
Comparative Genomics at BMP10
5
Normalized eScores
6
Mouse/Human Synteny
7
Track Options & Filters
Mini-buttons bring up track options such as
those for the spliced EST track below.
8
Which EST to Sequence?
9
MGC ESTs Drawn in Red
10
DNA Coloring
11
Coloring CRYGD Start
gctcgttcaggggtaaaggtgtattctagatCCACAACAAGCCCCGTGGT
CTAGCACAGC AAAGAGAAAAAAAGAGAACACGAAAATGCCCTTGCTCCC
CTCCGGGGGCCCCTTTTGTGC GGTTCTTGCCAACGCAGCAGCCCTCCTG
CTATATAGCCCGCCGCGCCgCAGCCCCACCCG
CTCAGCGCCGCCGCCCCACCAGCTCAGCACCGCCGTGCGCCCAGCCAGCC
ATGGGGAAGG TGAGCCCAGCCTGCGCCCCGGGACCCCGGAGCTTCCTCC
ATCGCGGGGGCCAGAGACTGG GGCAGGAGCAGGCCTGTGAGACCTCGCC
TTGTCCCGCCTTGCCTTGCAGATCACCCTCTA
CGAGGACCGGGGCTTCCAGGGCCGCCACTATGAATGCAGCAGCGACCACC
CCAACCTGCA GCCCTACTTGAGCCGCTGCAACTCGGCGCGCGTGGACAG
CGGCTGCTGGATGCTCTATGA GCAGCCCAACTACTCGGGCCTCCAGTAC
TTCCTGCGCCGCGGCGACTATGCCGACCACCA
GCAGTGGATGGGCCTCAGCGACTCGGTCCGCTCCTGCCGCCTCATCCCCC
ACGTGAGTAC ATCCTCAAGTCAGGACCCAGGCCCTCAGGACACTCACTG
GAtgGTTTCAAGCAAAAGTTA AACATTAGAAGTAGTGATCAGTcacaat
aaCTGAGAGTGGACAAAAGATGAACTATAGTG
GATTAAGTCAATAGagttTGCTCCCCACATAAGCAAAGTATTACCCAGAC
AcCAGTTAAT caCAATTAATCCACAAATATGTATTGAGTAGGAATGTGT
CTCCTGCCctAGGGGTTGTAT
12
Gene Expression Tracks
13
Alt Splicing Tracks
14
Complex Transcription
15
Add Your Own Tracks
  • Users can extend the browser with their own
    tracks.
  • User tracks can be private or public.
  • No programming required.
  • GFF, GTF, PSL, or BED formats supported.
  • Example BED lines (columns: chrom, start, end, name, score;
    an optional strand column would follow score):
      chr1 1302347 1302357 SP1 800
      chr1 1504778 1504787 SP2 980
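
The two example BED lines on this slide can be read with a few lines of Python. This is a minimal sketch, not the browser's own parser: `BedRecord` and `parse_bed_line` are hypothetical names introduced here, and the sketch assumes whitespace-separated fields in the standard BED order, where score comes before the optional strand column.

```python
from typing import NamedTuple, Optional


class BedRecord(NamedTuple):
    """One BED line; only the first three fields are required."""
    chrom: str
    start: int                 # 0-based start position
    end: int                   # end position, not included in the feature
    name: Optional[str] = None
    score: Optional[int] = None
    strand: Optional[str] = None


def parse_bed_line(line: str) -> BedRecord:
    """Parse one whitespace-separated BED line (hypothetical helper)."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"BED line needs at least 3 fields: {line!r}")
    start, end = int(fields[1]), int(fields[2])
    if start > end:
        raise ValueError(f"start is after end: {line!r}")
    return BedRecord(
        chrom=fields[0],
        start=start,
        end=end,
        name=fields[3] if len(fields) > 3 else None,
        score=int(fields[4]) if len(fields) > 4 else None,
        strand=fields[5] if len(fields) > 5 else None,
    )


# The two example lines from the slide.
lines = [
    "chr1 1302347 1302357 SP1 800",
    "chr1 1504778 1504787 SP2 980",
]
records = [parse_bed_line(line) for line in lines]
for rec in records:
    # Feature length is end - start because the end is exclusive.
    print(rec.chrom, rec.name, rec.end - rec.start, rec.score)
```

Neither example line carries a strand, so `strand` stays `None` for both records.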

16
The Underlying Database
  • Power users and bioinformaticians sometimes want
    the underlying database.
  • There is a table for each track.
  • Larger tracks have a table for each chromosome.
  • Format of a track table is generally similar to
    the add-your-own track formats.
  • Pieces of the database are available from the
    table browser.
  • Whole database available as tab-separated files.
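
As a rough illustration of working with a tab-separated table dump, the sketch below reads rows and groups them by chromosome, mirroring the per-chromosome split described above. Everything specific here is an assumption: the four-column layout (chrom, start, end, name), the made-up sample rows, and the helper name `split_by_chromosome` are illustrative only and do not reflect the actual schema of any UCSC table.

```python
import csv
import io
from collections import defaultdict

# Hypothetical dump contents; the column layout and rows are assumptions.
DUMP = (
    "chr1\t1302347\t1302357\tSP1\n"
    "chr1\t1504778\t1504787\tSP2\n"
    "chr2\t500\t520\tSP3\n"
)


def split_by_chromosome(handle):
    """Group tab-separated rows into one list per chromosome."""
    tables = defaultdict(list)
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # skip blank lines
        chrom, start, end, name = row[0], int(row[1]), int(row[2]), row[3]
        tables[chrom].append((start, end, name))
    return dict(tables)


# io.StringIO stands in for an opened dump file.
tables = split_by_chromosome(io.StringIO(DUMP))
for chrom in sorted(tables):
    print(chrom, len(tables[chrom]), "rows")
```

With a real dump, the same function would take a file opened with `open(path, newline="")` in place of the `io.StringIO` object.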

17
Parasol and Kilo Cluster
  • UCSC cluster has 1000 CPUs running Linux
  • 1,000,000 BLASTZ jobs in 25 hours for mouse/human
    alignment
  • We wrote the Parasol job scheduler to keep up.
  • Very fast and free.
  • Jobs are organized into batches.
  • Error checking at job and at batch level.
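
The figures on this slide imply a rough per-job cost, and the batch idea can be sketched in a few lines. The arithmetic assumes all 1000 CPUs were busy for the full 25 hours, which the slide does not state. The batch runner is a toy, serial illustration of checking errors per job and per batch; it is not Parasol's interface, and `run_job` and `run_batch` are hypothetical names. On a real cluster each job would be a command line rather than a Python callable.

```python
# Throughput implied by the slide, assuming every CPU stayed busy.
jobs, hours, cpus = 1_000_000, 25, 1000
jobs_per_cpu = jobs / cpus                        # jobs each CPU handled
seconds_per_job = hours * 3600 / jobs_per_cpu     # average wall time per job
print(f"{jobs_per_cpu:.0f} jobs per CPU, about {seconds_per_job:.0f} s each")


def run_job(job):
    """Job-level check: a job fails if it raises an exception."""
    try:
        job()
        return True
    except Exception:
        return False


def run_batch(batch):
    """Run every job in the batch and return the names of failed jobs.

    Batch-level check: the batch succeeded only if this list is empty.
    """
    return [name for name, job in batch if not run_job(job)]


def good_job():
    return "aligned"


def bad_job():
    raise RuntimeError("simulated job failure")


batch = [("job1", good_job), ("job2", bad_job), ("job3", good_job)]
failed = run_batch(batch)
batch_ok = not failed
print("failed jobs:", failed, "batch ok:", batch_ok)
```

Collecting the failed jobs instead of stopping at the first error lets the rest of the batch finish, so only the failures need to be rerun.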

18
Acknowledgements
NHGRI, The Wellcome Trust, HHMI, NCI, and
taxpayers in the US and worldwide. Whitehead,
Sanger, Wash U, Baylor, Stanford, DOE, and the
international sequencing centers. NCBI, Penn
State, Ensembl, Genoscope, The SNP Consortium, UC
Berkeley, LBL, LLL, Riken, The Mammalian Gene
Collection, Softberry, IMIM, Affymetrix,
Perlegen, Rosetta, and the Mouse Homology Group. The
thousands of people who worked on the sequence
and annotations.