Download presentation
Presentation is loading. Please wait.
1
Phylogenetic Trees Tutorial 6
2
Measuring distance Bottom-up algorithm (Neighbor Joining) –Distance based algorithm –Relative distance based Phylogenetic Trees Tutorial 6
3
Problem: unrelated sequences approach a fraction of difference expected by chance The distance measure converges. Jukes-Cantor Measuring Distance
4
Measuring Distance (cont) Euclidean Distance: Given a multiple sequence alignment, calculate the square root of the sum of the score at every position between two sequences the score increases proportionally to the extent of dissimilarity between residues
5
Star Structure Assumption: Divergence of sequences is assumed to occur at constant rate Distance to root equals a d c b
6
6 abcd a0875 b8039 c7308 d5980 a d c b Basic Algorithm Initial star diagramDistance matrix
7
7 abcd a0875 b8039 c7308 d5980 a d c b Choose the nodes with the shortest distance and fuse them. Selection step
8
8 a d c,b e a a,d c e b D ce D de f d a c e b D af D de f D ce D bf abcd a0875 b8039 c7308 d5980 12 3
9
9 Neighbor Joining Algorithm Constructs unrooted tree.
10
Step by step summary: 1.Calculate all pairwise distances. 2.Pick two nodes (i and j) for which the distance is minimal. 3.Define a new node (x) and re-calculate the distances from the free nodes to the new node. 4.Calculate D ix and D jx - the distance of the chosen nodes I and J to the new node X, as well as the distance from X to all other nodes. 5.Continue until two nodes remain – connect with edge. Neighbor Joining’ (merging close sequences – not the actual algorithm)
11
Pick two nodes for which the distance is minimal (i,j)
12
Node 10 is a new node. 5,6
13
Re-calculate the distances from new node I,j : the fused nodes (5,6) X :a new added node (node 10) m :the remaining nodes in the star
14
Calculate D ix and D jx r : ~average distance to nodes L : number of leaves left in the tree (leaves nodes representing taxa, sequences,etc)
15
Calculate Dix and Djx r 5 =ΣD 5k /(L-2)= 3.22406/(9-2)=0.46058 r 6 =ΣD 6k /(L-2)= 3.22758/(9-2)=0.461083 ΣD 5k ΣD 6k
16
Calculate Dix and Djx D 10,5 =(D 5,6 +r 5 -r 6 )/2=(0.06088+0.46058-0.461083)/2) = 0.0301886 D 10,6 =D 5,6 -D 10,5 =0.06088-0.0301886=0.0306914
17
0.0301886 0.0306914
18
Step 2 0.080375 0.044625
19
Step 3 0.069258 0.040447
20
Step 4
21
Step 5
22
Step 6
23
Step 7
24
Problems 0.1 0.4 4 3 1 2
25
Step by step summary: 1.Calculate all pairwise distances. 2.Pick two nodes (i and j) for which the relative distance is minimal (lowest). 3.Define a new node (x) and re-calculate the distances from the free nodes to the new node. 4.Calculate D ix and D jx - the distance of the chosen nodes I and J to the new node X, as well as the distance from X to all other nodes. 5.Continue until two nodes remain – connect with edge. Neighbor Joining (Not assuming equal divergence)
26
Step 2. Pick two nodes (i and j) for which the relative distance is minimal (lowest).
27
Negative values As the average distance from the common ancestor to the rest of the nodes increases, M ij has a lower value. Select pair that produce lowest value Reevaluate M with every iteration J I X M
28
0.1 0.4 4 3 1 2
29
0.1 0.4 4 3 1 2
30
Re-calculate the distances from new node J I X M
31
31 EXAMPLE A B C D E B 5 C 4 7 D 7 10 7 E 6 9 6 5 F 8 11 8 9 8 A B C D E B -13 C -11 D -10 -10.5 E -10 -11-13 F -10.5 -11 -11.5 Original distance MatrixRelative Distance Matrix (Mij) The Mij Table is used only to choose the closest pairs and not for calculating the distances
32
Bacillus E.coli Pseudomonas Salmonella Aeromonas Lechevaliera Burkholderias Problems with phylogenetic trees
33
Software PHYLIP PAUP MEGA3 http://evolution.gs.washington.edu/phylip.html http://paup.csit.fsu.edu/ http://www.megasoftware.net/ http://evolution.genetics.washington.edu/phylip/software.html More
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.