5 - 1 Chap 5 The Evolution Trees. 5 - 2 Evolutionary Tree.

Slides:



Advertisements
Similar presentations
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Advertisements

O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture10.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
Neighbour joining method The neighbor joining method is a greedy heuristic which joins at each step, the two closest sub-trees that are not already joined.
Graphs Chapter 12. Chapter Objectives  To become familiar with graph terminology and the different types of graphs  To study a Graph ADT and different.
Lecture 13 CS5661 Phylogenetics Motivation Concepts Algorithms.
Minimum Spanning Trees
Tree Reconstruction.
1 Discrete Structures & Algorithms Graphs and Trees: II EECE 320.
UPGMA Algorithm.  Main idea: Group the taxa into clusters and repeatedly merge the closest two clusters until one cluster remains  Algorithm  Add a.
. Computational Genomics 5a Distance Based Trees Reconstruction (cont.) Modified by Benny Chor, from slides by Shlomo Moran and Ydo Wexler (IIT)
. Phylogeny II : Parsimony, ML, SEMPHY. Phylogenetic Tree u Topology: bifurcating Leaves - 1…N Internal nodes N+1…2N-2 leaf branch internal node.
The Evolution Trees From: Computational Biology by R. C. T. Lee S. J. Shyu Department of Computer Science Ming Chuan University.
. Multiple Sequence Alignment Tutorial #4 © Ilan Gronau.
Graphs Chapter 12. Chapter 12: Graphs2 Chapter Objectives To become familiar with graph terminology and the different types of graphs To study a Graph.
Spring 2010CS 2251 Graphs Chapter 10. Spring 2010CS 2252 Chapter Objectives To become familiar with graph terminology and the different types of graphs.
4 - 1 Chap 4 The Sequence Alignment Problem The Sequence Alignment Problem Introduction –What, Who, Where, Why, When, How The Sequence Alignment.
. Multiple Sequence Alignment Tutorial #4 © Ilan Gronau.
Minimal Spanning Trees. Spanning Tree Assume you have an undirected graph G = (V,E) Spanning tree of graph G is tree T = (V,E T E, R) –Tree has same set.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Phylogenetic Trees Tutorial 6. Measuring distance Bottom-up algorithm (Neighbor Joining) –Distance based algorithm –Relative distance based Phylogenetic.
Tirgul 13. Unweighted Graphs Wishful Thinking – you decide to go to work on your sun-tan in ‘ Hatzuk ’ beach in Tel-Aviv. Therefore, you take your swimming.
Chapter 5 The Evolution Trees.
Fall 2007CS 2251 Graphs Chapter 12. Fall 2007CS 2252 Chapter Objectives To become familiar with graph terminology and the different types of graphs To.
9-1 Chapter 9 Approximation Algorithms. 9-2 Approximation algorithm Up to now, the best algorithm for solving an NP-complete problem requires exponential.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
. Multiple Sequence Alignment Tutorial #4 © Ilan Gronau.
Phylogenetic trees Tutorial 6. Distance based methods UPGMA Neighbor Joining Tools Mega phylogeny.fr DrewTree Phylogenetic Trees.
Phylogenetic trees Sushmita Roy BMI/CS 576
Multiple Sequence Alignment S 1 = AGGTC S 2 = GTTCG S 3 = TGAAC Possible alignment A-TA-T GGGGGG G--G-- TTATTA -TA-TA CCCCCC -G--G- AG-AG- GTTGTT GTGGTG.
Escaping local optimas Accept nonimproving neighbors – Tabu search and simulated annealing Iterating with different initial solutions – Multistart local.
Phylogenetic Analysis. 2 Introduction Intension –Using powerful algorithms to reconstruct the evolutionary history of all know organisms. Phylogenetic.
Terminology of phylogenetic trees
1 Generalized Tree Alignment: The Deferred Path Heuristic Stinus Lindgreen
PHYLOGENETIC TREES Dwyane George February 24,
COSC 2007 Data Structures II Chapter 14 Graphs III.
BINF6201/8201 Molecular phylogenetic methods
1 Section 1.4 Graphs and Trees A graph is set of objects called vertices or nodes where some pairs of objects may be connected by edges. (A directed graph.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Benjamin Loyle 2004 Cse 397 Solving Phylogenetic Trees Benjamin Loyle March 16, 2004 Cse 397 : Intro to MBIO.
Using traveling salesman problem algorithms for evolutionary tree construction Chantal Korostensky and Gaston H. Gonnet Presentation by: Ben Snider.
Evolutionary tree reconstruction
Lectures on Greedy Algorithms and Dynamic Programming
Introduction to Graph Theory
Union-Find  Application in Kruskal’s Algorithm  Optimizing Union and Find Methods.
Graphs Chapter 12. Chapter 12: Graphs2 Chapter Objectives To become familiar with graph terminology and the different types of graphs To study a Graph.
A* optimality proof, cycle checking CPSC 322 – Search 5 Textbook § 3.6 and January 21, 2011 Taught by Mike Chiang.
CSE 421 Algorithms Richard Anderson Winter 2009 Lecture 5.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Construction of Optimal Data Aggregation Trees for Wireless Sensor Networks Deying Li, Jiannong Cao, Ming Liu, and Yuan Zheng Computer Communications and.
Spanning Trees Dijkstra (Unit 10) SOL: DM.2 Classwork worksheet Homework (day 70) Worksheet Quiz next block.
1 Minimum Routing Cost Tree Definition –For two nodes u and v on a tree, there is a path between them. –The sum of all edge weights on this path is called.
Graph Search Applications, Minimum Spanning Tree
Greedy Technique.
Inferring a phylogeny is an estimation procedure.
The Evolution Trees (Part I)
COMP 6/4030 ALGORITHMS Prim’s Theorem 10/26/2000.
Clustering methods Tree building methods for distance-based trees
Multiple Alignment and Phylogenetic Trees
Minimal Spanning Trees
Graph Algorithm.
Minimum Spanning Tree.
BNFO 602 Phylogenetics Usman Roshan.
Introduction Wireless Ad-Hoc Network
Phylogeny.
More Graphs Lecture 19 CS2110 – Fall 2009.
Presentation transcript:

5 - 1 Chap 5 The Evolution Trees

5 - 2 Evolutionary Tree

5 - 3 Tree Topology Rooted tree Unrooted tree

5 - 4 Distance Matrix and Rooted Tree s1s1 s2s2 s3s3 s4s4 s5s5 s1s s2s s3s s4s s5s

5 - 5 Distances Relation dt(s i, s j ): the distance between species s i and s j in an evolution tree d(s i, s j ): the distance between species s i and s j in the distance matrix dt(s i, s j )  d(s i, s j ) s 1 = agctccca s 2 = agccccca s' 1 = agcaccca d(s 1, s 2 ) = 1s 2 = agccccca dt(s 1, s 2 ) = 2

5 - 6

7 Numbers about Unrooted Tree Number of edges of an unrooted evolurion tree NE(n) = 2n  3 Number of unrooted evolution trees for n species TU(n + 1) = (2n  3)  TU(n) TU(n) = (2n  5)  (2n  7)    1

5 - 8

9 An Unrooted Evolution Tree with an Outlier Species

Different Tree Specifications Minimax evolution trees –The maximum of (dt(s i, s j )  d(s i, s j )) is minimized. Minisum evolution trees –The total sum of all pairs of distances among leaf nodes is minimized. Minisize evolution trees –The total length of the tree is minimized.

Complexities of Evolution Tree Problems MinimaxMinisumMinisize UnrootedNP-complete Unknown RootedO(n2)O(n2)NP-complete

The Rooted Minimax Evolution Tree Algorithm (1) Find the longest distance in the distance matrix s1s1 s2s2 s3s3 s4s4 s1s s2s s3s3 01 s4s4 0

Example of the Rooted Minimax Evolution Tree Algorithm (2) Construct a minimal spanning tree

Example of the Rooted Minimax Evolution Tree Algorithm (3) Break the longest edge in path from s 2 to s 4

Example of the Rooted Minimax Evolution Tree Algorithm (4) Construct rooted subtrees

Example of the Rooted Minimax Evolution Tree Algorithm (5) Combine subtrees by making sure that dt(s 2, s 4 ) = d(s 2, s 4 )

Weights Determination for a Tree with a Given Topology Unrooted evolution tree

Weights Determination for a Tree with a Given Topology Rooted evolution tree

UPGMA for Rooted Evolution Trees Unweighted pair group method with arithmetic mean Finding a rooted evolution tree with a given distance matrix Greedy method Heuristic solution

UPGMA (1) Select the pair of species with the smallest distance s1s1 s2s2 s3s3 s4s4 s1s s2s2 065 s3s3 02 s4s4 0

UPGMA (2) Consider (s 3, s 4 ) as a new species. d(s 1, (s 3, s 4 )) = ½(d(s 1, s 3 ) + d(s 1, s 4 )) = ½(4+3) = 3.5 d(s 2, (s 3, s 4 )) = ½(d(s 2, s 3 ) + d(s 2, s 4 )) = ½(6+5) = 5.5 d(s 1, s 2 ) = 4 s1s1 s2s2 (s 3, s 4 ) s1s s2s (s 3, s 4 )0

UPGMA (3) Select the pair of species (s 1, (s 3, s 4 )) with the smallest distance s1s1 s2s2 (s 3, s 4 ) s1s s2s (s 3, s 4 )0

UPGMA (4) Obtain the final evolution tree Then use linear programming technique to produce an evolution tree for a given criteria

The Neighbor Joining Method for Unrooted Evolution Trees Finding an unrooted evolution tree with a given distance matrix Greedy method Heuristic solution

Neighbor Joining Method (1) Distance matrix s1s1 s2s2 s3s3 s4s4 s1s s2s s3s s4s4 3520

Neighbor Joining Method (2) We first construct a 1-star

Neighbor Joining Method (3) Select a pair of species, insert an internal node

Neighbor Joining Method (4) Calculate the new connection cost NC Calculate the weights of edges

Neighbor Joining Method (5) Select a pair of species, insert an internal node

Neighbor Joining Method (6) Calculate the saved costs of all pairs The cost saved by pairing s 1 with s 4 is 2 The cost saved by pairing s 1 with s 2 is 2.34 The cost saved by pairing s 1 with s 3 is The cost saved by pairing s 2 with s 3 is 1.5 The cost saved by pairing s 2 with s 4 is 1.67 The cost saved by pairing s 3 with s 4 is 2.67

Neighbor Joining Method (7) The final tree structure

An Approximation Algorithm for an Unrooted Minisize Evolution Tree Finding an unrooted evolution tree with a given distance matrix This algorithm is based upon minimal spanning tree The approximate solution is never larger than twice of the size of an optimal solution

The Approximation Algorithm (1) Distance matrix s1s1 s2s2 s3s3 s4s4 s1s s2s2 065 s3s3 02 s4s4 0

The Approximation Algorithm (2) We first construct a minimal spanning tree out of distance matrix BFS order : s 4, s 3, s 1, s 2

Example of this Approximation Algorithm (3) Breadth first search BFS order : e, b, g, j, f, a, c, d, h, i

Example of this Approximation Algorithm (4) Add nodes one by one s 4, s 3, s 1, s 2

Example of this Approximation Algorithm (5) An unrooted evolution tree transformed from the minimal spanning tree s 4, s 3, s 1, s 2

Proof(1) We will prove that the total length of this unrooted evolution tree is less than or equal to twice of the length of an optimal unrooted minisize evolution tree.

Proof(2) |MST|<|TSP| APP= |MST|<|TSP| TSP is to find a Hamiltonian cycle with the smallest length.

Proof(3) Original evolution tree The result of duplicating every edge in the tree

Proof(4) |ET|=2|OPT| |ET|=dt(s 1,s 2 )+ dt(s 2,s 3 )+...+ dt(s n-1,s n )+ dt(s n,s 1 ) |CET|= d(s 1,s 2 )+ d(s 2,s 3 )+...+ d(s n-1,s n )+ d(s n,s 1 ) |CET| |ET| |TSP| |CET| |ET|=2|OPT| APP= |MST|<|TSP| APP<2|OPT|

The Minimal Spanning Tree Preservation Approach for Evolution Construction Finding an unrooted evolution tree with a given distance matrix The condition for our minimal spanning tree approach for the evolution tree construction problem is that MST(D) is an MST(D t )

Example (1) A new distance matrix

Example (2) A minimal spanning tree constructed out of the new distance matrix e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (3) Construct the evolution tree e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (4) Construct the evolution tree e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (5) Construct the evolution tree e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (6) distance matrix

Example (7) Construct the evolution tree e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (8) distance matrix

Example (9) Construct the evolution tree e(4,5)=2, e(1,2)=3, e(2,3)=4, e(5,6)=5, e(3,4)=7

Example (10) A new distance matrix

Example (11) The distance matrix Dt Based on the evolution tree

Example (12) A minimal spanning tree based on Dt