Title: Substitution Patterns
1Substitution Patterns
- An Introduction to Phylogeny
2Genes and Mutations
- Organism the optimal state for its gene
products - Usually
- Mechanisms to prevent harmful errors in DNA
- Proofreading
- Wobble
- DNA repair
3Mutations
- Mutations still occur
- Types of mutations
- Base Substitution
- Missense
- Nonesense
- Silent
- Indel
- Effects on Organism
- Deleterious
- Neutral
- Uncertain which happens more (deleterious or
neutral) - Advantageous
- Least often
4(No Transcript)
5Rates of Mutation
- rK/(2T)
- rrate of substitution ( substitutions/site/year)
- Knumber of nucleic acid substitutions between 2
homologous sequences - Ttime since divergence (multiplied by 2 because
must account for changes in both sequences) - Estimated from fossil record
- Rather simplistic
- Doesnt account for back mutations
6An Estimate of Time
7Average Rate of divergence
Mutation/Substitution whats the difference?
- Natural Selection
- Removal of deleterious mutations
- Which changes least often in a DNA sequence?
- Functional Constraint
- Portions responsible for structure and function
- Are they functionally constrained?
- Introns
- Exons
- Downstream of poly A tail (3 flanking region)
- Promoter and regulator regions (5 untranslated
sequence) - Upstream of promoter and regulator (5 flanking)
- Sequence immediately following codon region for
stop codon (3 untranslated)
8- What is the difference between a mutation and a
substitution?
9Globin substitution rates(/site/109 years)
3.48
- Introns
- Exons
- Downstream of poly A tail (3 flanking region)
- Promoter and regulator regions (5 untranslated
sequence) - Upstream of promoter and regulator (5 flanking)
- Sequence immediately following coding region for
stop codon (3 untranslated)
Rates Different for other genes
1.58
3.60
1.86
3.39
3.00
Functionally Constrained!
10Synonymous VS Non-synonymous
- Synonymous substitutions
- Occur from silent mutation
- Accepted slightly more often than non-synonymous
- Non-synonymous substitutions
- From missense or nonesense mutations
- Nondegenerate position
- 1st or 2nd position usually
- UUU (F), CUU (L), AUU (I), GUU (V)
- Twofold degenerate position
- Third position usually
- GAU and GAC (D), GAA and GAG (E)
- Fourfold degenerate position
- Typically synonymous!
- Third position
Substitution rates 0.56 1.67 2.35
11Indels
- Indels occur 10X less than substitutions
- DNA repair mechanisms
12Variation in substitution rates
- There is a large difference between synonymous
and nonsynonymous substitution rates in a gene.
However the two rates should theoretically be the
same, or similar, since mutations occur random. - The difference is attributed to the intensity of
purifying selection. Mutations that result in an
amino acid substitution have a higher chance of
causing deleterious effects and be removed from
the population and never fix. - Nonsynonymous substitutions may have a chance of
improving the function of a protein. If
advantageous (positive) selection plays a major
role in a particular protein, the rate of
nonsynonymous substitution should exceed that of
synonymous substitution in that protein in this
scenario. - However, most often, the stronger the functional
constraint of a protein, the slower the rate of
evolution (most nonsynonymous substitutions are
not advantageous).
13Homology
- Paralogous genes
- Gene duplication events
- Tandem copies which evolve independently
- Duplicated genes are paralogous from each other
- Dont necessarily have same function
- Orthologous genes
- Homologous genes that share common ancestry and
function in the absence of gene duplication
14Orthologs versus paralogs
http//www.ncbi.nlm.nih.gov/Education/BLASTinfo/Or
thology.html
15- Pseudogenes
- Paralogous homology?
- Well before the divergent forms a new function it
is not transcriptionally active?pseudogene - 4 substitutions every 109 years
- High substitution frequency
- Actual mutation rate is believed to be fairly
similar to substitution rates of pseudogenes and
synonymous substitutions.
16More Evolution Terms
- Chimeric genes
- Some regions of gene are homologous and others
are not (the whole gene is not homologous) - Transposition
- Horizontal gene transfer
- Xenologous
- When individual genes do not share same
evolutionary origin as the organisms overall
evolutionary origin - Horizontal gene transfer or symbiosis
- EX) Eukaryotic cell and mitochondrial DNA
- EX) viral transduction
- EX)conjugation
- EX) transformation
17Measuring Substitution Patterns
- Jukes and Cantor Approach (69)
- any nucleotide is just as likely to change into
any other - Back mutations are accounted for
- An oversimplification
- Transitions occur 3X more than transversions
- Kimuras 2 parameter model (80)
- Takes different rates of transitions and
transversions into account - Account for 2 different rates
- Back mutations accounted for still
- Still an oversimplification
1812 parameter model
- Assume that any nucleotide substitution changes
at a different rate - A?T different rate than T?A
- 12 scenarios
- Determine substitution frequencies within an
alignment and base individuals frequencies on a
those probabilities - Problem
- ideally its great
- Realistically
- Different genes result in different substitution
rates therefore, there is no one reliable
matrix. - Two parameter is most reliable of the three, but
not necessarily the most realistic
19Nucleotide Substitutions in Alu-Y
20Protein Sequence Substitution
- Recall Factors that come into play
- The number of nucleic acid changes influences the
substitution rate - Ser?pro (one amino acid change for UCU?CCU)
- Ser?Ile (two amino acid changes for UCU?AUU)
- Conservative changes in aa effect function less
than radical changes in aa - Use the PAM or BLOSUM matrix to determine
frequencies
21Evolutionary Rate Variation
- Histone genes
- Code for proteins that bind DNA in Euks
- Every amino acid associated with DNA binding
- Slowest evolving proteins
- HLA genes
- Code for proteins that help WBCs recognize
foreign antigens - Under pressure to diversify
- Better protection from pathogens
- Non-synonymous sub rate is actually greater than
synonymous sub rate
22Molecular Clocks
- Different proteins have different rates of
evolutionSURE - However, amount of change between 2 homologs is
well correlated with time since divergence - Can use substitution rates as an evolutionary
clock - Molecular phylogeny
- Using molecular comparison of several sequences
(multiple sequence alignments) to model
evolutionary changes mathematically. - Zuckerkandl and Pauling (60s)
- Classical Evolutionists have problems
23Classical Evolutionary Beliefs vs
- Classical Evolutionary models
- Comparison of phenotype only
- Bulk changes
- Cannot account for synonymous changes
- Convergent evolution cannot be deciphered!
- Not all organisms have easily studied phenotypes
- Cant compare unrelated organisms
- Small changes not noticed
- Many missing links in fossil record
- Molecular Evolutionary Models
- Study of substitution rate
- Model changes
- Synonymous
- non-synonymous and neutral
- non-synonymous and advantageous
- Comparison of phenotype
- Molecular changes still correlate well with
classical speciation models
24Phylogeny
- Rather than bother correlating divergence times
to fossil record use relative rates - Divergence dates are questionable
- Phylogenetic trees
- Outgroup (lineage 3)
- Lineage 1 and lineage 2
- Common ancestor A
1
2
3
A