Lecture 14 CS5661 Neighbor Joining Generates unrooted tree, allowing for unequal branches Given: Distance matrix for sequences Steps: Repeat 1-3 till all.

Slides:



Advertisements
Similar presentations
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Neighbour joining method The neighbor joining method is a greedy heuristic which joins at each step, the two closest sub-trees that are not already joined.
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Lecture 13 CS5661 Phylogenetics Motivation Concepts Algorithms.
Brandon Andrews CS6030.  What is a phylogenetic tree?  Goals in a phylogenetic tree generator  Distance based method  Fitch-Margoliash Method Example.
Molecular Evolution Revised 29/12/06
UPGMA Algorithm.  Main idea: Group the taxa into clusters and repeatedly merge the closest two clusters until one cluster remains  Algorithm  Add a.
Lecture 7 – Algorithmic Approaches Justification: Any estimate of a phylogenetic tree has a large variance. Therefore, any tree that we can demonstrate.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Phylogenetic Reconstruction: Distance Matrix Methods Anders Gorm Pedersen Molecular Evolution Group Center for.
. Computational Genomics 5a Distance Based Trees Reconstruction (cont.) Modified by Benny Chor, from slides by Shlomo Moran and Ydo Wexler (IIT)
Finding Orthologous Groups René van der Heijden. What is this lecture about? What is ‘orthology’? Why do we study gene-ancestry/gene-trees (phylogenies)?
Topological Data Analysis MATH 800 Fall Topological Data Analysis (TDA) An ε-chain is a finite sequence of points x 1,..., x n such that |x i –
Pattern Recognition Introduction to bioinformatics 2005 Lecture 4.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
CISC667, F05, Lec15, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Phylogenetic Trees (II) Distance-based methods.
Phylogenetic Trees Tutorial 6. Measuring distance Bottom-up algorithm (Neighbor Joining) –Distance based algorithm –Relative distance based Phylogenetic.
Multiple sequence alignment
Lecture 24 Inferring molecular phylogeny Distance methods
Phylogenetic Trees Tutorial 6. Measuring distance Bottom-up algorithm (Neighbor Joining) –Distance based algorithm –Relative distance based Phylogenetic.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Phylogenetic Reconstruction: Distance Matrix Methods Anders Gorm Pedersen Molecular Evolution Group Center for.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Distance Matrix Methods Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis.
Phylogenetic trees Tutorial 6. Distance based methods UPGMA Neighbor Joining Tools Mega phylogeny.fr DrewTree Phylogenetic Trees.
Introduction to Management Science
NetworkModel-1 Network Optimization Models. NetworkModel-2 Network Terminology A network consists of a set of nodes and arcs. The arcs may have some flow.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
REMAINING SERVICE LIFE Lecture 14. n Concept of remaining service life n How remaining service life is used and its importance n How remaining life is.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Pattern Recognition Introduction to bioinformatics 2006 Lecture 4.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Phylogenetic Trees Tutorial 5. Agenda How to construct a tree using Neighbor Joining algorithm Phylogeny.fr tool Cool story of the day: Horizontal gene.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Calculating branch lengths from distances. ABC A B C----- a b c.
Using Traveling Salesman Problem Algorithms to Determine Multiple Sequence Alignment Orders Weiwei Zhong.
8.1.4 Can it still be factored? Factoring Completely I can factor out a common factor.
Hierarchical Clustering Produces a set of nested clusters organized as a hierarchical tree Can be visualized as a dendrogram – A tree like diagram that.
Tutorial 5 Phylogenetic Trees.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
7-1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Network Flow Models Chapter 7.
Fitch-Margoliash Algorithm 1.From the distance matrix find the closest pair, e.g., A & B 2.Treat the rest of the sequences as a single composite sequence.
CSCE555 Bioinformatics Lecture 13 Phylogenetics II Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Phylogeny - based on whole genome data
Multiple sequence alignment (msa)
Multiplication table. x
Inferring a phylogeny is an estimation procedure.
Distance and Midpoint In The Coordinate Plane
Graphing / Plotting Points Review
Multiple Sequence Alignment
Goals of Phylogenetic Analysis
Clicker #1 You add mL of water to mL of a 2.00 M sugar solution. How many of the following will change? I. total volume of the solution II. moles.
The Tree of Life From Ernst Haeckel, 1891.
Clicker #1 Which of the three solutions is the most concentrated?
ОПЕРАТИВНА ПРОГРАМА “ИНОВАЦИИ И КОНКУРЕНТОСПОСОБНОСТ“ „Подобряване на производствения капацитет в МСП“
Objective - To graph ordered pairs on the coordinate plane.
PRIME FACTORIZATION USING FACTOR TREES!.
A car is decelerated to 20 m/s in 6 seconds
Lecture 7 – Algorithmic Approaches
Solving Equations 3x+7 –7 13 –7 =.
Clicker #1 Which of the three solutions is the most concentrated?
The COORDINATE PLANE The COORDINATE PLANE is a plane that is divided into four regions (called quadrants) by a horizontal line called the x-axis and a.
Clicker #1 You add mL of water to mL of a 2.00 M sugar solution. How many of the following will change? I. total volume of the solution II. moles.
Neighbor-joining tree of the 262 S
Clicker #1 You add mL of water to mL of a 2.00 M sugar solution. How many of the following will change? I. total volume of the solution II. moles.
Phylogenetic tree analysis of the Xenopus type III IFNs (A) with other known IFNs and the putative receptors (B) with relevant receptor families. Phylogenetic.
Presentation transcript:

Lecture 14 CS5661 Neighbor Joining Generates unrooted tree, allowing for unequal branches Given: Distance matrix for sequences Steps: Repeat 1-3 till all branches generated 1.Take any two sequences i, j 2.Find branch lengths between i and j by treating remaining sequences as composite (c) 1.Calculate average i-C and j-C distances 2.Calculate branch lengths i and j 3.Treat ij as composite sequence now and generate new distance table. 4.Generate multiple trees by starting with different pairs 5.Compare resulting trees in terms of best fit to original distance matrix

Lecture 14 CS5662

3 I and II are closest neighbors at distance 10. Question: Which of them is closer to the rest, and by how much? Average distance from I to III, IV = 40 Average distance from II to III, IV = 45 X + Z = 40 Y + Z = 45 => x = 2.5; y = 7.5 X + Y = 10 I II x y z

Lecture 14 CS5664 Established Now, consider (I,II) as composite sequence and find closest pairs I II

Lecture 14 CS5665 A + B = C + A = 30  C + B = 50 I II IV III C A B A = 2.5; B = 17.5; C = 25;

Lecture 14 CS5666 I II IV III Predicted Tree

Lecture 14 CS5667 Seq\SeqIIIIIIIV I (50) II 45 (40)45 (50) III 20 Predicted Distance matrix Error = ( ) 1/2 = 8.66 Repeat entire process starting with a different pair Tree with lowest error is the best one