Nothing in (computational) biology makes sense except in the light of evolution

Author: Koonin. Last modified by: Michael Fetchko. Created: 11/17/2002 2:33:30 AM. Presentation format: On-screen Show.

1
Nothing in (computational) biology makes sense
except in the light of evolution
after Theodosius Dobzhansky (1970)
2
A brief history and some central principles
of evolutionary (computational) genomics
11
J Mol Biol. 1982 Dec 25;162(4):729-73. Nucleotide
sequence of bacteriophage lambda DNA. Sanger F,
Coulson AR, Hong GF, Hill DF, Petersen GB. The
DNA in its circular form contains 48,502 pairs of
nucleotides. Open reading frames were
identified and, where possible, ascribed to
genes by comparing with the previously determined
genetic map. The reading frames for 46 genes were
clearly identified. There are about 20 other
unidentified reading frames that may code for
proteins. Neither protein sequence comparison nor
homology is mentioned in this paper...
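To make "open reading frames were identified" concrete, here is a minimal sketch of a naive ORF scan. It is not the procedure used in the 1982 paper. The function names, the ATG-only start rule, the standard stop codons, and the `min_codons` cutoff are all assumptions made for illustration.

```python
from typing import List, Tuple

# Assumption: standard genetic code stop codons, DNA alphabet ACGT only.
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, so the other strand can be scanned too."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 30) -> List[Tuple[int, int, int]]:
    """Scan the three forward frames for ATG...stop stretches.

    Returns (frame, start, end) tuples with 0-based coordinates; `start` is
    the A of ATG and `end` is one past the last base of the stop codon.
    `min_codons` counts codons before the stop (hypothetical cutoff).
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame ATG opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((frame, start, i + 3))
                start = None  # look for the next ATG after this stop
    return orfs
```

Calling `find_orfs(reverse_complement(seq))` covers the opposite strand. A scan like this only proposes candidate reading frames; assigning them to genes, as the paper did with the genetic map, needs independent evidence.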
21
Growth of the number of completely sequenced
genomes
30
Figure 1.2. The current state of annotation of
some genomes. The data were derived from the
original genome sequencing papers
31
Nothing in (computational) biology makes sense
except in the light of evolution
after Theodosius Dobzhansky (1970)
32
Homology: common ancestry of genes or portions
thereof (a qualitative notion, as opposed to
similarity)
(Diagram labels: Species 1, Species 2, Species 3)
33
Evolution by gene duplication, 1970
Gene duplication with subsequent diversification
- the principal path to innovation in evolution
35
(Chart: number of proteins in COGs and not in COGs)
The majority of the proteins in each prokaryote,
but only 1/3 of yeast proteins, belong to COGs -
ancient conserved families
36
MOST OF THE COGs ARE REPRESENTED ONLY IN A SMALL
NUMBER OF CLADES: MAJOR ROLE OF
HORIZONTAL GENE TRANSFER AND CLADE-SPECIFIC GENE
LOSS IN EVOLUTION
37
(Diagram labels: ancestor, speciation, descendants, gene loss)
Non-orthologous displacement: two unrelated (or
distantly related) proteins for the same
essential function
38
Figure 2.3. Structural alignment of goose
lysozyme (PDB code 153L), chicken egg white
lysozyme (3LZT) and lysozymes from E. coli
bacteriophages λ (1AM7) and T4 (1L92).
39
153L .GEKLC.VE.PAVIAGIISRESHAG..KVLK....NGWGD...R..........
3LZT gLDNYRgYS.LGNWVCAAKFESNFN.........tQATNR...N..........
1AM7 .mvEIN.NQrKAFLDMLAWSEGTDngrQKTRnhgyDVIVGgelftdysdhprkl
1L92 ..........MNIFEMLRIDEG...........lrlKIYKdteG..........

153L ........GNGFGLMQVDKRSH...............KP........QG..TWN
3LZT .....tdgsTDYGILQINSRWWcndgrtpgsrnlcniPC........SAllSSD
1AM7 vtlnpklkSTGAGRYQLLSRWW...............DayrkqlglkDF..SP.
1L92 ........YYTIG.IGHLLT.........kspslnaakseldkaigrntngvIT

153L .GEVHITQGTTILINF.IKTIQK...KFPS.WTKD..QQLKGGISAYNAGAGNVR
3LZT ITASVNCAKKIVSDG.N.........................GMNAWV.......
1AM7 ..KSQDAVALQQIKERgALPM...........idR..GDIRQAIDRCSN....iw
1L92 .KDEAEKLFNQDVDAA.VRGILRnakLKPVyDSLDavRRAAIINMVFQMGETGVA

153L .SYARMDIGT....................THDDYANDVV....ARAQYYKQHGY
3LZT .................................awRNRCK...gTDVQAWIRGCr
1AM7 .aslpGAGY...................gqfEHKA.DSLI....AKFKEAGgtvr
1L92 .gftnslrmlqqkrwdeaavnlaksrwynqTPNRAkrvittfrtgtwDAYK....
 
Structure-based sequence alignment of goose
lysozyme (153L), chicken egg white lysozyme
(3LZT) and lysozymes from E. coli bacteriophages
λ (1AM7) and T4 (1L92).
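One way to put a number on how much sequence identity survives in an alignment like this is to count identical residues over the structurally aligned columns. The sketch below assumes a common convention that the slide itself does not state: uppercase letters mark structurally equivalent residues, while lowercase letters and `.` mark unaligned residues and gaps. The function name is hypothetical.

```python
from typing import Tuple


def aligned_identity(row_a: str, row_b: str) -> Tuple[int, int]:
    """Count (aligned, identical) columns between two alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue; lowercase letters and '.' are skipped.
    """
    if len(row_a) != len(row_b):
        raise ValueError("alignment rows must have the same length")
    aligned = identical = 0
    for a, b in zip(row_a, row_b):
        if a.isupper() and b.isupper():
            aligned += 1
            if a == b:
                identical += 1
    return aligned, identical
```

Passing two rows from the same alignment block (for example the 153L and 3LZT rows) gives the number of structurally equivalent positions and how many of them carry the same amino acid. Dividing the second by the first gives percent identity over aligned positions only, which is a different quantity from identity over full sequence length.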
40
Only a small fraction of amino acid residues is
directly involved in protein function (including
enzymatic); the rest of the protein serves
largely as a structural scaffold.
Significant sequence conservation is evidence of
homology.
Proteins with different structural folds can
perform the same function: non-orthologous
displacement.
Proteins (domains) with the same fold are most
likely to be homologous.
Convergence does not produce significant
sequence or structural similarity.