Title: DNA Sequence Analysis
1DNA Sequence Analysis
3Mitochondrial / Nuclear
[Figure: two panels, Mitochondrial and Nuclear, tracing A through Generation 1, Generation 2, and Generation 3; a mutation introduces B in Generation 3.]
4Northern Alaska
AACCAACCTCCCTAAGACTCAAGGAAGAAGCTATAGCCCCACTATCAACA
CCCAAAGCTGAAGTTCTATTTAAACTATTCCCTGGCGCATATTAATATAG
TTCCACAAAATTCAAGAACCTTATCAGTATTAAATTCTTAAAAATCTTTA
ACAATTTAATACAGTTTTGTACTCAACAGCCATATTAATATTTCATATAC
CATTAATTACACTAAGTACATAATGATTTATATGCATCAGTACTCCATAC
GGGTATAGTACATAACATTAATGTATCAAGACATATTATGTATAATAGTA
CATTACACTATTTACCCCATGCTTATAAGCAAGTACATGACATTATTAAT
AGTACATAGTACATATTATTATTGATCGTACATAGCACATTATGTCAAAT
CCATTCTTGTCAACATGCGTATCCCGTCCATTAGATCACGAGCTTAATCA
CCATGCCGCGTGAAACCAACAACCCGCTTGGCAAGGATCCCTCTTCTCGC
TCCGGGCCCATGCACTGTGGGGGTAGCTATTTCATGAACTTTATCAGACA
TCTG
Statewide Alaska
AACCAACCTCCCTAAGACTCAAGGAAGAAGCTATAGCCCCACTATCAACA
CCCAAAGCTGAAGTTCTATTTAAACTATTCCCTGGCGCATATTAATATAG
TTCCACAAAATTCAAGAACCTTATCAGTATTAAATTCTTAAAAATCTTTA
ACAATTTAATACAGTTTTGTACTCAACAGCCATATTAATATTTCATATAC
CATTAATTACACTAAGTACATAATGATTTATATGCATCAGTACTCCATAC
GGGTATAGTACATAACATTAATGTATCAAGACATATTATGTATAATAGTA
CATTACACTATTTACCCCATGCTTATAAGCAAGTACATGACATTATTAAT
AGTACATAGTACATATTATTATTGATCGTACATAGCACATTATGTCAAAT
CCATTCTTGCCAACATGCGTATCCCGTCCATTAGATCACGAGCTTAATCA
CCATGCCGCGTGAAACCAACAACCCGCTTGGCAAGGATCCCTCTTCTCGC
TCCGGGCCCATGCACTGTGGGGGTAGCTATTTCATGAACTTTATCAGACA
TCTG
5Distance measures
- Number of differences
- d = number of different sites
- p-distance
- Proportion of sites that differ
- p = n_d / L, where n_d = number of different sites and L = number of sites
- Neither of these corrects for multiple hits
- Both assume equal rates of substitution among all pairs of nucleotides
- Both assume equal rates among sites
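The two uncorrected measures above can be sketched in a few lines of Python. This is a minimal illustration, assuming the sequences are already aligned to equal length and contain no gaps; the function names and the 10-site toy alignment are made up for the example and are not from the lecture.

```python
def count_differences(seq1: str, seq2: str) -> int:
    """Number of aligned sites at which the two sequences differ (n_d)."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    return sum(a != b for a, b in zip(seq1.upper(), seq2.upper()))


def p_distance(seq1: str, seq2: str) -> float:
    """Proportion of sites that differ: p = n_d / L."""
    if not seq1:
        raise ValueError("sequences must not be empty")
    return count_differences(seq1, seq2) / len(seq1)


# Hypothetical toy alignment: 10 sites, mismatches at the 6th and 10th site.
a = "AACCTTGGAC"
b = "AACCTAGGAT"
print(count_differences(a, b))  # 2
print(p_distance(a, b))         # 0.2
```

Neither value says anything about sites that changed more than once, which is what the corrected distances on the following slides address.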
6Jukes-Cantor
- d = -¾ ln(1 - (4/3)p), where p is the p-distance
- Assumes
- Rates of substitution between all possible nucleotides are equal
- Equal nucleotide frequencies
- Equal rates among sites
- Corrects for multiple hits
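As a rough sketch of how the Jukes-Cantor correction behaves, the formula can be evaluated for a few p-distances. The function name and the sample p values are illustrative assumptions; the formula itself is undefined for p at or above 0.75, so the sketch rejects those inputs.

```python
import math


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor distance: d = -(3/4) * ln(1 - (4/3) * p)."""
    if not 0 <= p < 0.75:
        raise ValueError("Jukes-Cantor is only defined for 0 <= p < 0.75")
    return -0.75 * math.log(1 - 4 * p / 3)


# Illustrative p-distances (not data from the lecture).
for p in (0.01, 0.10, 0.30, 0.60):
    print(f"p = {p:.2f}  ->  d = {jukes_cantor(p):.4f}")
```

The corrected distance is always at least as large as p, and the gap widens as p grows, which is the "multiple hits" correction at work.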
7Tajima-Nei
- Assumes
- Rates of substitution between all possible nucleotides are equal
- Equal rates among sites
- Allows for unequal nucleotide frequencies
- Corrects for multiple hits
8Kimura 2-parameter (K2P)
- Assumes
- Equal nucleotide frequencies
- Equal rates among sites
- Corrects for multiple hits
- Allows different rates between transitions and transversions
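The slide gives no formula for K2P, so the sketch below uses the standard textbook form, d = -½ ln(1 - 2P - Q) - ¼ ln(1 - 2Q), where P and Q are the proportions of sites showing transitions and transversions. The helper names and the toy alignment are assumptions for illustration, and the counting function assumes the sequences contain only A, C, G, and T.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def ts_tv_proportions(seq1: str, seq2: str) -> tuple[float, float]:
    """Return (P, Q): proportions of transitional and transversional sites.

    Assumes aligned, gap-free sequences made up of A, C, G, T only.
    """
    if len(seq1) != len(seq2) or not seq1:
        raise ValueError("sequences must be non-empty and of equal length")
    transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1    # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    length = len(seq1)
    return transitions / length, transversions / length


def k2p(P: float, Q: float) -> float:
    """Kimura 2-parameter distance from transition (P) and transversion (Q) proportions."""
    if 1 - 2 * P - Q <= 0 or 1 - 2 * Q <= 0:
        raise ValueError("P and Q are too large for the K2P correction")
    return -0.5 * math.log(1 - 2 * P - Q) - 0.25 * math.log(1 - 2 * Q)


# Hypothetical toy alignment: one transversion (T/A) and one transition (C/T).
P, Q = ts_tv_proportions("AACCTTGGAC", "AACCTAGGAT")
print(P, Q, k2p(P, Q))
```

When transitions and transversions occur in a 1:2 ratio (P = p/3, Q = 2p/3), this expression reduces to the Jukes-Cantor distance.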
9Tamura 3-parameter
- Assumes
- Equal rates among sites
- Corrects for multiple hits
- Allows different rates between transitions and transversions
- Corrects for GC-content bias
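The slide does not state the Tamura 3-parameter formula. A minimal sketch using the commonly cited form, d = -h ln(1 - P/h - Q) - ½(1 - h) ln(1 - 2Q) with h = 2θ(1 - θ), is shown here. It assumes a single G+C fraction θ pooled over both sequences; the function names are hypothetical.

```python
import math


def gc_content(seq1: str, seq2: str) -> float:
    """Pooled G+C fraction over both sequences (assumption: one shared theta)."""
    joined = (seq1 + seq2).upper()
    if not joined:
        raise ValueError("sequences must not be empty")
    return sum(base in "GC" for base in joined) / len(joined)


def tamura_3p(P: float, Q: float, theta: float) -> float:
    """Tamura 3-parameter distance.

    P, Q  : proportions of transitional and transversional differences
    theta : G+C content, strictly between 0 and 1
    """
    if not 0 < theta < 1:
        raise ValueError("theta must be strictly between 0 and 1")
    h = 2 * theta * (1 - theta)
    if 1 - P / h - Q <= 0 or 1 - 2 * Q <= 0:
        raise ValueError("P and Q are too large for this correction")
    return -h * math.log(1 - P / h - Q) - 0.5 * (1 - h) * math.log(1 - 2 * Q)


# Illustrative values only: the same P and Q under balanced and GC-poor content.
print(tamura_3p(0.10, 0.10, 0.50))
print(tamura_3p(0.10, 0.10, 0.30))
```

With θ = 0.5 the expression collapses to K2P, so the third parameter only matters when base composition is skewed.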
10Tamura-Nei (similar to HKY)
- Assumes
- Equal rates among sites
- Corrects for multiple hits
- Accounts for different nucleotide frequencies
- Accounts for difference in rates between transitions and transversions
- Accounts for different rates of substitution in transitions of purines (α1) versus pyrimidines (α2)
11Which to choose?
- A range of complexity
- All models are accurate at low genetic distances
- More complex models are more accurate when distances are large
- Why not use them all the time? Why keep talking about p-distances or K2P?
- Complex models have higher variance
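The trade-off in this list can be made concrete by comparing the p-distance with the Jukes-Cantor distance. The sketch uses the usual large-sample approximations Var(p) ≈ p(1 - p)/L and Var(d_JC) ≈ p(1 - p) / (L (1 - 4p/3)²). The sequence length L = 500 and the p values are arbitrary assumptions chosen for illustration.

```python
import math


def jc_distance(p: float) -> float:
    """Jukes-Cantor distance for an observed p-distance."""
    return -0.75 * math.log(1 - 4 * p / 3)


def var_p(p: float, length: int) -> float:
    """Binomial sampling variance of the p-distance."""
    return p * (1 - p) / length


def var_jc(p: float, length: int) -> float:
    """Large-sample (delta-method) variance of the Jukes-Cantor distance."""
    return p * (1 - p) / (length * (1 - 4 * p / 3) ** 2)


L = 500  # hypothetical number of aligned sites
for p in (0.01, 0.05, 0.20, 0.50):
    print(
        f"p = {p:.2f}  d_JC = {jc_distance(p):.4f}  "
        f"SE(p) = {math.sqrt(var_p(p, L)):.4f}  "
        f"SE(d_JC) = {math.sqrt(var_jc(p, L)):.4f}"
    )
```

At small p the two distances and their standard errors are nearly identical, which is why simple measures remain useful for closely related sequences. At large p the correction matters more, but its standard error grows faster than that of p.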
12When rates differ among sites
- Variation in rates can be modeled
- Gamma distribution (Γ)
- Gamma parameter (shape parameter) α
- Can be applied to any distance measure
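As one example of applying a gamma correction to a distance measure, the sketch below uses the standard gamma form of the Jukes-Cantor distance, d = ¾ α [(1 - 4p/3)^(-1/α) - 1]. The slide does not give this formula; the function names and the values of p and α are assumptions for illustration.

```python
import math


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor distance with equal rates among sites."""
    return -0.75 * math.log(1 - 4 * p / 3)


def jukes_cantor_gamma(p: float, alpha: float) -> float:
    """Jukes-Cantor distance with gamma-distributed rates among sites.

    alpha is the gamma shape parameter; smaller alpha means stronger
    rate variation among sites.
    """
    if not 0 <= p < 0.75:
        raise ValueError("p must satisfy 0 <= p < 0.75")
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    return 0.75 * alpha * ((1 - 4 * p / 3) ** (-1 / alpha) - 1)


# Illustrative values only: the same p under different shape parameters.
p = 0.30
print(f"equal rates: {jukes_cantor(p):.4f}")
for alpha in (0.5, 1.0, 5.0, 100.0):
    print(f"alpha = {alpha:>5}: {jukes_cantor_gamma(p, alpha):.4f}")
```

As α becomes large the gamma distance approaches the equal-rates value, and as α shrinks the estimated distance increases for the same observed p.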
13Genetic rescue of Norfolk Island boobook
Cytochrome b sequence differences
- NI boobook vs. NZ boobook: 2
- NI boobook vs. Tas boobook: 8
- NZ boobook vs. Tas boobook: 8
- NI boobook vs. powerful owl: 21
- NI boobook vs. rufous owl: 23
- Powerful owl vs. rufous owl: 13
19Phylogeography
27For next time
- Read 2 beluga papers posted on website