Title: Nothing in (computational) biology makes
1Nothing in (computational) biology makes sense
except in the light of evolution
after Theodosius Dobzhansky (1970)
2A brief history and some central principles
of evolutionary (computational) genomics
3(No Transcript)
4(No Transcript)
5(No Transcript)
6(No Transcript)
7(No Transcript)
8(No Transcript)
9(No Transcript)
10(No Transcript)
11J. Mol Biol 1982 Dec 25162(4)729-73 Nucleotide
sequence of bacteriophage lambda DNA.Sanger F,
Coulson AR, Hong GF, Hill DF, Petersen GB. The
DNA in its circular form contains 48,502 pairs of
nucleotides. Open reading frames were
identified and, where possible, ascribed to
genes by comparing with the previously determined
genetic map. The reading frames for 46 genes were
clearly identified There are about 20 other
unidentified reading frames that may code for
proteins. Protein sequence comparison or
homology are not mentioned in this paper...
12(No Transcript)
13(No Transcript)
14(No Transcript)
15(No Transcript)
16(No Transcript)
17(No Transcript)
18(No Transcript)
19(No Transcript)
20(No Transcript)
21Growth of the number of completely sequenced
genomes
22(No Transcript)
23(No Transcript)
24(No Transcript)
25(No Transcript)
26(No Transcript)
27(No Transcript)
28(No Transcript)
29(No Transcript)
30Figure 1.2. The current state of annotation of
some genomes. The data were derived from the
original genome sequencing papers
31Nothing in (computational) biology makes sense
except in the light of evolution
after Theodosius Dobzhansky (1970)
32Homology common ancestry of genes or portions
thereof (a qualitative notion as opposed to
similarity)
Species 1
Species 3
Species 2
33Evolution by gene duplication, 1970
Gene duplication with subsequent diversification
- the principal path to innovation in evolution
34(No Transcript)
35Number of proteins
in COGs
not in COGs
The majority of the proteins in each prokaryote,
but only 1/3 of yeast proteins belong to COGs -
ancient conserved families
36MOST OF THE COGs ARE REPRESENTED ONLY IN A SMALL
NUMBER OF CLADES MAJOR ROLE OF
HORIZONTAL GENE TRANSFER AND CLADE-SPECIFIC GENE
LOSS IN EVOLUTION
37Gene loss
speciation
descendants
ancestor
Gene loss
Non-orthologous displacement two unrelated (or
distantly related) proteins for the same
essential function
38Figure 2.3. Structural alignment of goose
lysozyme (PDB code 153L), chicken egg white
lysozyme (3LZT) and lysozymes from E. coli
bacteriophages l (1AM7) and T4 (1L92).
39 153L .GEKLC.VE.PAVIAGIISRESHAG..KVLK....NGWGD.
..R.......... 3LZT gLDNYRgYS.LGNWVCAAKFESNFN...
......tQATNR...N.......... 1AM7
.mvEIN.NQrKAFLDMLAWSEGTDngrQKTRnhgyDVIVGgelftdysdh
prkl 1L92 ..........MNIFEMLRIDEG...........lrlK
IYKdteG.......... 153L ........GNGFGLMQVDKRSH
...............KP........QG..TWN 3LZT
.....tdgsTDYGILQINSRWWcndgrtpgsrnlcniPC........SAl
lSSD 1AM7 vtlnpklkSTGAGRYQLLSRWW...............
DayrkqlglkDF..SP. 1L92 ........YYTIG.IGHLLT....
.....kspslnaakseldkaigrntngvIT 153L
.GEVHITQGTTILINF.IKTIQK...KFPS.WTKD..QQLKGGISAYNAG
AGNVR 3LZT ITASVNCAKKIVSDG.N...................
.....GMNAWV....... 1AM7 ..KSQDAVALQQIKERgALPM...
........idR..GDIRQAIDRCSN....iw 1L92
.KDEAEKLFNQDVDAA.VRGILRnakLKPVyDSLDavRRAAIINMVFQMG
ETGVA 153L .SYARMDIGT....................THDDY
ANDVV....ARAQYYKQHGY 3LZT .....................
...........awRNRCK...gTDVQAWIRGCr 1AM7
.aslpGAGY...................gqfEHKA.DSLI....AKFKEA
Ggtvr 1L92 .gftnslrmlqqkrwdeaavnlaksrwynqTPNRAkr
vittfrtgtwDAYK....
Structure-based sequence alignment of goose
lysozyme (153L), chicken egg white lysozyme
(3LZT) and lysozymes from E. coli bacteriophages
l (1AM7) and T4 (1L92).
40Only a small fraction of amino acid residues is
directly involved in protein function (including
enzymatic) the rest of the protein serves
largely as structural scaffold
Significant sequence conservation is evidence of
homology
Proteins with different structural folds can
perform the same function - non-orthologous
displacement
Proteins (domains) with the same fold are most
likely to be homologous
Convergence does not produce significant
sequence or structural similarity