Goals of Phylogenetic Analysis

Slides:



Advertisements
Similar presentations
Introduction to Molecular Evolution
Advertisements

Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
An Introduction to Phylogenetic Methods
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Phylogenetic Trees Lecture 4
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
IE68 - Biological databases Phylogenetic analysis
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Phylogenetic Reconstruction: Distance Matrix Methods Anders Gorm Pedersen Molecular Evolution Group Center for.
. Phylogeny II : Parsimony, ML, SEMPHY. Phylogenetic Tree u Topology: bifurcating Leaves - 1…N Internal nodes N+1…2N-2 leaf branch internal node.
UPGMA and FM are distance based methods. UPGMA enforces the Molecular Clock Assumption. FM (Fitch-Margoliash) relieves that restriction, but still enforces.
Maximum Likelihood. Historically the newest method. Popularized by Joseph Felsenstein, Seattle, Washington. Its slow uptake by the scientific community.
Distance Methods. Distance Estimates attempt to estimate the mean number of changes per site since 2 species (sequences) split from each other Simply.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Distance Matrix Methods: Models of Evolution Anders Gorm Pedersen Molecular Evolution Group Center for Biological.
Probabilistic Approaches to Phylogeny Wouter Van Gool & Thomas Jellema.
Probabilistic methods for phylogenetic trees (Part 2)
Building Phylogenies Parsimony 1. Methods Distance-based Parsimony Maximum likelihood.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Phylogenetic trees Sushmita Roy BMI/CS 576
. Phylogenetic Trees Lecture 13 This class consists of parts of Prof Joe Felsenstein’s lectures 4 and 5 taken from:
1 Additive Distances Between DNA Sequences MPI, June 2012.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Terminology of phylogenetic trees
Molecular phylogenetics
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
Tree Inference Methods
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
COMPUTATIONAL MODELS FOR PHYLOGENETIC ANALYSIS K. R. PARDASANI DEPTT OF APPLIED MATHEMATICS MAULANA AZAD NATIONAL INSTITUTE OF TECHNOLOGY (MANIT) BHOPAL.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
BINF6201/8201 Molecular phylogenetic methods
Phylogenetics and Coalescence Lab 9 October 24, 2012.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Introduction to Phylogenetics
Calculating branch lengths from distances. ABC A B C----- a b c.
MAT 4830 Mathematical Modeling
Rooting Phylogenetic Trees with Non-reversible Substitution Models Von Bing Yap* and Terry Speed § *Statistics and Applied Probability, National University.
Why do trees?. Phylogeny 101 OTUsoperational taxonomic units: species, populations, individuals Nodes internal (often ancestors) Nodes external (terminal,
Phylogenetics.
Phylogenetic Trees - Parsimony Tutorial #13
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
Molecular Evolution Distance Methods Biol. Luis Delaye Facultad de Ciencias, UNAM.
Probabilistic methods for phylogenetic tree reconstruction BMI/CS 576 Colin Dewey Fall 2015.
CSCE555 Bioinformatics Lecture 13 Phylogenetics II Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Phylogenetics LLO9 Maximum Likelihood and Its Applications
Phylogenetic basis of systematics
Models for DNA substitution
Inferring phylogenetic trees: Distance methods
Distance based phylogenetics
Lecture 6B – Optimality Criteria: ML & ME
Inferring a phylogeny is an estimation procedure.
Maximum likelihood (ML) method
Models of Sequence Evolution
Inferring phylogenetic trees: Distance and maximum likelihood methods
Lecture 6B – Optimality Criteria: ML & ME
The Most General Markov Substitution Model on an Unrooted Tree
Phylogenetic tree based on 16S rRNA gene sequence comparisons over 1,260 aligned bases showing the relationship between species of the genus Actinomyces.
But what if there is a large amount of homoplasy in the data?
Phylogenetic analysis of AquK2P.
Phylogenetic tree based on predominant 16S rRNA gene sequences obtained by C4–V8 Sutterella PCR from AUT-GI patients, Sutterella species isolates, and.
Presentation transcript:

Goals of Phylogenetic Analysis Given a multiple sequence alignment, determine the ancestral relationships among the species. We assume that residues in a column are homologous, and that all columns have the same history. Time Hu Ch Go Gi

Types of Phylogenetic Methods Character-based Parsimony Likelihood Distance-based Neighbor joining (NJ) UPGMA Involve optimizing a criterion based on fit of the residues to the tree. Involve optimizing a criterion based on fit of a matrix of pairwise distances to the tree

Parsimony Select the tree that explains the data with the smallest number of substitutions. Likelihood Select the tree that has the highest probability of producing the observed data Distance Select the tree that best recreates the observed pairwise distances.

We usually estimate unrooted trees B C D A C B D A D C B For 4 sequences, there are only 3 unique unrooted trees.

Parsimony Methods A ACGA B ATGC C GTGC D GCAA A B C D Tree 1 A C B D 1 2 3 4 Tot Tree 1 1 2 1 2 6 Tree 2 2 2 1 2 7 Tree 3 2 1 1 1 5 A D B C Tree 3

is the collection of all data (sequences A, B, and G). Likelihood Methods A B G x is the collection of all parameters affecting the evolution of sequences A, B, and G. is the collection of all data (sequences A, B, and G).

Modeling sequence change Typically, the evolution of DNA sequences is modeled at the level of single sites. JC: Jukes-Cantor (1969) K2P: Kimura (1980) F81: Felsenstein (1981) HKY: Hasegawa et al. (1985) REV: Tavaré (1986)

General Time Reversible Model C G T A C G T

(Nucleotide in Species A, site k) Under this assumption, the likelihood simplifies to a product of individual site likelihoods: and (Nucleotide in Species A, site k) A B G x

Distance Methods A B C D A 0 A ACGA B ATGC B .5 0 C GTGC C .75 .25 0 D GCAA

A B C D a c e b d Many distance methods seek to find the tree that minimizes a score such as: