Phylogenetic Trees: Assumptions All existing species have a common ancestor Each species is descended from a single ancestor Each speciation gives rise.

Slides:



Advertisements
Similar presentations
Computational Molecular Biology Biochem 218 – BioMedical Informatics Doug Brutlag Professor.
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
PHYLOGENETIC TREES Bulent Moller CSE March 2004.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
Lecture 13 CS5661 Phylogenetics Motivation Concepts Algorithms.
. Phylogenetic Trees (2) Lecture 13 Based on: Durbin et al 7.4, Gusfield , Setubal&Meidanis 6.1.
Phylogenetic Analysis
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic reconstruction
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
. Phylogeny II : Parsimony, ML, SEMPHY. Phylogenetic Tree u Topology: bifurcating Leaves - 1…N Internal nodes N+1…2N-2 leaf branch internal node.
CISC667, F05, Lec14, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Phylogenetic Trees (I) Maximum Parsimony.
The Evolution Trees From: Computational Biology by R. C. T. Lee S. J. Shyu Department of Computer Science Ming Chuan University.
Building phylogenetic trees Jurgen Mourik & Richard Vogelaars Utrecht University.
Branch lengths Branch lengths (3 characters): A C A A C C A A C A C C Sum of branch lengths = total number of changes.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
UNIVERSITY OF SOUTH CAROLINA College of Engineering & Information Technology Bioinformatics Algorithms and Data Structures Chapter : Strings and.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
. Comput. Genomics, Lecture 5b Character Based Methods for Reconstructing Phylogenetic Trees: Maximum Parsimony Based on presentations by Dan Geiger, Shlomo.
UNIVERSITY OF SOUTH CAROLINA College of Engineering & Information Technology Bioinformatics Algorithms and Data Structures Chapter : Strings and.
Distance-Based Phylogenetic Reconstruction Tutorial #8 © Ilan Gronau, edited by Itai Sharon.
Probabilistic methods for phylogenetic trees (Part 2)
Building Phylogenies Parsimony 1. Methods Distance-based Parsimony Maximum likelihood.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
Phylogenetic Tree Construction and Related Problems Bioinformatics.
Phylogenetic trees Sushmita Roy BMI/CS 576
Terminology of phylogenetic trees
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Parsimony-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Phylogenetics II.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Introduction to Phylogenetics
Benjamin Loyle 2004 Cse 397 Solving Phylogenetic Trees Benjamin Loyle March 16, 2004 Cse 397 : Intro to MBIO.
Using traveling salesman problem algorithms for evolutionary tree construction Chantal Korostensky and Gaston H. Gonnet Presentation by: Ben Snider.
Evolutionary tree reconstruction (Chapter 10). Early Evolutionary Studies Anatomical features were the dominant criteria used to derive evolutionary relationships.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
Evolutionary tree reconstruction
Algorithms in Computational Biology11Department of Mathematics & Computer Science Algorithms in Computational Biology Building Phylogenetic Trees.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Parsimony-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2015.
Maximum Likelihood Given competing explanations for a particular observation, which explanation should we choose? Maximum likelihood methodologies suggest.
Phylogeny Ch. 7 & 8.
1 Alignment Matrix vs. Distance Matrix Sequence a gene of length m nucleotides in n species to generate an… n x m alignment matrix n x n distance matrix.
Phylogenetics.
Phylogenetic Trees - Parsimony Tutorial #13
Part 9 Phylogenetic Trees
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Phylogeny.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
CSCE555 Bioinformatics Lecture 13 Phylogenetics II Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Building Phylogenies. Phylogenetic (evolutionary) trees Human Gorilla Chimp Gibbon Orangutan Describe evolutionary relationships between species Cannot.
394C: Algorithms for Computational Biology Tandy Warnow Jan 25, 2012.
Phylogenetic basis of systematics
Distance based phylogenetics
Inferring a phylogeny is an estimation procedure.
Goals of Phylogenetic Analysis
CS 581 Tandy Warnow.
Phylogeny.
Presentation transcript:

Phylogenetic Trees: Assumptions All existing species have a common ancestor Each species is descended from a single ancestor Each speciation gives rise to two derived species This leads to a ‘tree’ topology for geneology

Tree structure ‘Leaves’ are existing species Interior nodes are hypothetical common ancestors Tree may be rooted or unrooted Root (if it exists) is universal ancesestor gorillahumanbonobochimp

Limits of these assumptions Early life (probably wasn’t nicely ‘packaged’) Merger of symbiotes (origin of eukaryotes, plants) Lateral gene transfer

Characteristics Relatedness is inferred from features, or characters discrete character data –example: has feathers, number of fingers –data forms a n x m matrix distance data –example: sequence edit distance –data forms an n x n triangular matrix

Ordered or unordered characters Differences in characters are assumed to be the result of transitions from a previous uniform state –(unordered) Such transitions may occur between any two states –(ordered) They may occur only in a fixed sequence

Reversals Reversal - mutation into the primitive (ancestral) state Reversals are possible but unlikely Examples –nucleic acid sequence –toes of a mammal

Perfect phylogeny –Each edge in tree = one state transition –That is, the entire subtree must share this state Central problem of character state phylogeny: –input: set O of n objects. Set c of m characters. Set r of states. –Output (y/n): is there a perfect phylogeny for O?

Solve Perfect Phylogeny problem Problem 1 –acgac –accag –acgtt –tccag –acgat –aagtt Problem 2 –aagtt –atgtc –atgta –atgtt –aaatt –ttgta Problem 3 –aagtt –ttgtt –tcctt –tcaat –tcagt –acagt

Complexity of character problem Ordered states –the problem is polynomial Unordered states –If the number of states (r) is unbounded, PPP is NP-complete. –If the number of states is small (2, 3, 4) an algorithm which runs in reasonable time is known

Relaxing our assumptions Expecting perfect phylogeny is unrealistic, because –biological data contains errors –reversals to occur Two approaches to relaxation –minimize reversals (parsimony) –throw away minimum of offending characters (compatibility criterion)

Ideal distance matrix problem Input: a set O of objects and a (triangular) matrix M of pairwise distances between them Output: a tree in which the nodes are O and the paths have weights M

Ideal distance matrix problem Input: a set O of objects and a (triangular) matrix M of pairwise distances between them Output: a tree in which the nodes are O and the paths have weights M Nonideal considerations: reversals, convergence, error

Relaxed distance matrix problem Input: a set O of objects and a pair of (triangular) matrix M min and M max of pairwise distances between them Output: a tree in which the nodes are O and the path weights are bounded by M min and M max

Relaxed distance matrix problem Input: a set O of objects and a pair of (triangular) matrix M min and M max of pairwise distances between them Output: a tree in which the nodes are O and the path weights are bounded by M min and M max Tractable only for ultrametric trees (all leaves equidistant from root)

Neighbor joining A simple and commonly used heuristic while (|nodes| > 2) find nearest neighbors, A, B substitute with interior node at their midpoint