Parsimony continued - PowerPoint PPT Presentation

1 / 34

About This Presentation

Title:

Parsimony continued

Description:

Tree score calculation. Ltot = Ln Wn. I=n. I=1 ... How is the minimum number of steps calculated? Postorder traversal algorithm: ... – PowerPoint PPT presentation

Number of Views:57

Avg rating:3.0/5.0

Slides: 35

Provided by: David51

Category:

more less

Transcript and Presenter's Notes

Title: Parsimony continued

1
Parsimony continued
2
Many issues glossed over

What if characters disagree?
How is the tree score determined?
How can we root the trees?
How do we find the optimal tree?
How can we evaluate the robustness of our
conclusions?

3
Tree score calculation
In
Ltot ? Ln Wn
I1
The tree score is the sum of the minimum number
of weighted steps (Ln) for each character
multiplied by the weight of that character (Wn)
4
How is the minimum number of steps calculated?

Postorder traversal algorithm
The tree is arbitrarily rooted
Each internal node is inspected to see if there
is an intersection in the possible states of its
descendant nodes
If not, length is increased
It is not necessary to identify all ancestral
state reconstructions (this requires a preorder
traversal)

5
Why weight characters?

If we think some characters are less prone to
homoplasy, we can upweight them
Character weights are multiplied by the character
length

6
We can also weight character state transitions

Unordered, flat-weighted Fitch parsimony

To state
From state
Step matrix
7
We can also weight character state transitions

Common examples
Ordered character states (morphology)

To state
From state
Step matrix
8
We can also weight character state transitions

Common examples
Transitions vs. transversions

To state
From state
Step matrix
9
We can also weight character state transitions

Common examples
Gains less likely than loss (restriction sites)

To state
From state
Step matrix (Asymmetric)
10
The weighting game

When should you weight characters/character-states
?
If you think that they differ in evidential power
How much should you modify weights?
There is no simple formula
It is probably better to err on the side of less
extreme weights
Often sensible to try a range of weights

11
Many issues glossed over

What if characters disagree?
How is the tree score determined?
How can we root the trees?
How do we find the optimal tree?
How can we evaluate the robustness of our
conclusions?

12
Rooting methods for parsimony

Outgroup method (time reversible models)
Include one or more taxa that are known from
prior information to be outside the ingroup the
group that is the focus of study
Assume near clock-like behavior
Midpoint rooting

13
Rooting methods for parsimony

Outgroup method (time reversible models)
Include one or more taxa that are known from
prior information to be outside the ingroup the
group that is the focus of study
Assume near clock-like behavior
Midpoint rooting
Include characters with asymmetric step-matrices
(time irreversible models)
Pick the root that results in the shortest tree

14
1-gt 0
L 1

A
B
C
D
F
E
1-gt 0
1-gt 0
L 2
15
Many issues glossed over

What if characters disagree?
How is the tree score determined?
How can we root the trees?
How do we find the optimal tree?
How can we evaluate the robustness of our
conclusions?

16
Rooted trees
Polytomy
Binary/dichotomous/fully-resolved
polyotomous/unresolved
17
Unrooted trees
Polytomy
18
Number of unrooted fully resolved trees for t taxa
i t

? (2i-5)

i 3
19
How many places can you add another taxon?

Two taxa
Three taxa
Four Taxa
Five taxa

20
Number of rooted trees?

The root is just one more taxon same formula but
t number of taxa 1

21
The number of trees gets big

Number of binary unrooted trees
1
3
15
105
2,027,025
2.2 x 1020
2.8 x 1074
1 x 101074

Number of tips
3
4
5
6
10
20
50
500

22
How do you find the optimal tree?

Exhaustive (lt12 taxa)

23
How do you find the optimal tree?

Exhaustive (lt12 taxa)
Branch-and-bound (lt18 taxa)

Obtain the length of a random tree (initial upper
bound)
As trees are built determine length
If length exceeds upper bound then that tree and
all its descendant trees are ignored

24
How do you find the optimal tree?

Exhaustive (lt12 taxa)
Branch-and-bound (lt18 taxa)
Heuristic search (unlimited?)

25
Heuristic searches

Search for optimal trees by finding good trees
and then rearranging them in the hopes of finding
an even better tree

26
Heuristic search
Suboptimal island of trees
Global optimum
Starting trees
Treespace
27
Getting starting trees

Random tree - not done
User tree (e.g., a NJ tree)
Build a tree by adding taxa to the location that
is optimal
Can hold more than one tree at each step

28
Taxon addition order

As-is
In the order of the matrix (not done for
parsimony)
Simple taxon addition
use a distance algorithm to decide order
Closest taxon addition
Add the taxon that makes the optimal tree
Random taxon addition order
Repeat many times

29
Branch swapping

Nearest-neighbour interchange (NNI)

30
Branch swapping

Subtree pruning and regrafting (SPR)

31
Branch swapping

Tree-bisection reconnection (TBR)

32
Many issues glossed over

What if characters disagree?
How is the tree score determined?
How can we root the trees?
How do we find the optimal tree?
How can we evaluate the robustness of our
conclusions?

33
Even if the shortest trees is the best estimate
of the true tree - the true tree might not be the
shortest
We should consider suboptimal trees
We should use statistical tests to help us
determine what to actually believe
34
Questions we can ask