Why use phylogenetic networks?

Slides:



Advertisements
Similar presentations
Mosaic exploring reticulate protein family evolution UQ, COMBIO AU, Brisbane Maetschke/Kassahn.
Advertisements

An Algorithm for Constructing Parsimonious Hybridization Networks with Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
1 Constructing Splits Graphs Author: Andreas W.M. Dress Daniel H. Huson Presented by: Bakhtiyar Uddin.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Molecular Evolution Revised 29/12/06
Lecture 7 – Algorithmic Approaches Justification: Any estimate of a phylogenetic tree has a large variance. Therefore, any tree that we can demonstrate.
. Computational Genomics 5a Distance Based Trees Reconstruction (cont.) Modified by Benny Chor, from slides by Shlomo Moran and Ydo Wexler (IIT)
D. Gusfield, V. Bansal (Recomb 2005) A Fundamental Decomposition Theory for Phylogenetic Networks and Incompatible Characters.
T HE P ROBLEM OF R ECONSTRUCTING K - ARTICULATED P HYLOGENETIC N ETWORK Supervisor : Dr. Yiu Siu Ming Second Examiner : Professor Francis Y.L. Chin Student.
Distance methods. UPGMA: similar to hierarchical clustering but not additive Neighbor-joining: more sophisticated and additive What is additivity?
Close Lower and Upper Bounds for the Minimum Reticulate Network of Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
Protein Sequence Classification Using Neighbor-Joining Method
Phylogenetic Trees Tutorial 6. Measuring distance Bottom-up algorithm (Neighbor Joining) –Distance based algorithm –Relative distance based Phylogenetic.
. Comput. Genomics, Lecture 5b Character Based Methods for Reconstructing Phylogenetic Trees: Maximum Parsimony Based on presentations by Dan Geiger, Shlomo.
Using PQ Trees For Comparative Genomics - CPM Using PQ Trees For Comparative Genomics Gad M. Landau – Univ. of Haifa Laxmi Parida – IBM T.J. Watson.
Phylogenetic Networks of SNPs with Constrained Recombination D. Gusfield, S. Eddhu, C. Langley.
The concept of likelihood refers given some data D, a decision must be made about an adequate explanation of the data. In the phylogenetic framework, one.
1 Tricks for trees: Having reconstructed phylogenies what can we do with them? DIMACS, June 2006 Mike Steel Allan Wilson Centre for Molecular Ecology and.
Phylogenetic trees Sushmita Roy BMI/CS 576
Applied Discrete Mathematics Week 10: Equivalence Relations
Sequence Alignment and Phylogenetic Prediction using Map Reduce Programming Model in Hadoop DFS Presented by C. Geetha Jini (07MW03) D. Komagal Meenakshi.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
Improving the layout of splits networks 06/06/2005 Philippe Gambette & Daniel Huson Caution! Some.
Shortest Path. What is Shortest Path In a network, there are often many possible paths from one node in a network to another. It is often desirable (for.
1 Summary on similarity search or Why do we care about far homologies ? A protein from a new pathogenic bacteria. We have no idea what it does A protein.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
1 Dendroscope – An interactive viewer for large phylogenetic trees Daniel H. Huson Phylogenetics Programme, Newton Institute, September and networks.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Which of these can be drawn without taking your pencil off the paper and without going over the same line twice? If we can find a path that goes over all.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Calculating branch lengths from distances. ABC A B C----- a b c.
The bootstrap, consenus-trees, and super-trees Phylogenetics Workhop, August 2006 Barbara Holland.
394C, Spring 2013 Sept 4, 2013 Tandy Warnow. DNA Sequence Evolution AAGACTT TGGACTTAAGGCCT -3 mil yrs -2 mil yrs -1 mil yrs today AGGGCATTAGCCCTAGCACTT.
Inference rules for supernetwork construction Katharina Huber, School of Computing Sciences, University of East Anglia.
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
Maximum Likelihood Given competing explanations for a particular observation, which explanation should we choose? Maximum likelihood methodologies suggest.
Subtree Prune Regraft & Horizontal Gene Transfer or Recombination.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
SupreFine, a new supertree method Shel Swenson September 17th 2009.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Mining Complex Data COMP Seminar Spring 2011.
Fitch-Margoliash Algorithm 1.From the distance matrix find the closest pair, e.g., A & B 2.Treat the rest of the sequences as a single composite sequence.
Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson and David Bryant Presented by Peggy Wang.
Chapter AGB. Today’s material Maximum Parsimony Fixed tree versions (solvable in polynomial time using dynamic programming) Optimal tree search.
Phylogenetic trees. 2 Phylogeny is the inference of evolutionary relationships. Traditionally, phylogeny relied on the comparison of morphological features.
Quartet distance between general trees Chris Christiansen Thomas Mailund Christian N.S. Pedersen Martin Randers.
Algorithms and Networks
394C, Spring 2012 Jan 23, 2012 Tandy Warnow.
Phylogeny - based on whole genome data
Distance based phylogenetics
Clustering methods Tree building methods for distance-based trees
Multiple Alignment and Phylogenetic Trees
Cladistics (Ch. 22) Based on phylogenetics – an inferred reconstruction of evolutionary history.
Hierarchical clustering approaches for high-throughput data
BNFO 602 Phylogenetics Usman Roshan.
CS 581 Tandy Warnow.
Multiple Genome Rearrangement
Route Inspection Which of these can be drawn without taking your pencil off the paper and without going over the same line twice? If we introduce a vertex.
Lecture 19 – Non-tree Based Methods
Incorporating uncertainty in distance-matrix phylogenetics
Presentation transcript:

Why use phylogenetic networks? to visualize data when the evolutionary model is assumed to be bifurcating to visualize data when the evolutionary model may not be bifurcating to provide an analytical framework for studying processes that cause phylogenetic incongruence to build reticulate evolutionary models

- to identify phylogenetic relationships that are uncertain

-to ask whether data are suitable for tree building

another example: do Noppadon’s inversion distances give tree-like distances?

-to help us understand why some phylogenetic problems are hard NNET splits graph of angiosperm & gymnosperm sequences Qui et al. 1999 [Mt: matR, atpI, Cp: atpB, rbcL, Nuc. 18sRNA]

to study complex processes (where sequence evolution at an individual locus has not been tree like)

- to study complex processes (where phylogenetic information from different gene loci is incongruent)

- to study complex processes (where phylogenetic information from different gene loci is incongruent)

- to reconstruct reticulate evolutionary models R.nivicola origins of diploid and polyploid hybrids

Overview of phylogenetic network methods

Median, split decomposition, NeighborNet Distance matrix Median network splits graph Split decomposition & NeighborNet network splits graph Aligned sequences ACGACCTACGACTGCATCAGCATCGCATCAGCTACGCTCGCTCAGACTATCGGATTAAAAGCATCAGCATCGACATCAGCATCAGCGGCGCCATCGATCGCAATCAAGGGGGGGCCCTACCGCATTCAGCATCACGCTCGCCCAATCGCATCACGCATCGCATCGCATCGCATCGCATCGACTCGCAT We have recently published on biogeographic interpretation of median networks and split decomposition, and I no going to talk about these methods today. Median networks build splits graphs directly from sequences. Split decomposition and NeighborNet build splitsgraphs from distances. These distances can be derived from sequences.

Consensus Networks and Super-Networks Tree 1 Tree 2 Tree 3 I want to introduce to you two newer approaches that we have been working with. These methods build Consensus networks and Supernetworks. Unlike Median networks, splitdecomposition and Neighbor Net, these methods visualise incongruence between gene trees. I would like to introduce these methods to you in the context of New alpine Ranunculus.

site patterns, splits, splits graph site patterns observed splits splits graph

calculated splits NJ 8 site patterns SD, NNET, MEDIAN network

extra site pattern added

nodes in splits graphs

Different splits graphs – same splits

summary Different reasons why you might want to build a phylogenetic network Some network methods identify more splits in the data than other methods there may be more than one splits graph representation for a set of splits Nodes in splits graph are not equivalent to the nodes in trees

Splits graphs and reticulate evolutionary models explicit model of reticulate evolution splits graphs

Building a reticulate evolutionary model Z-closure Supernetwork Hybridisation network

Daniel Huson and David Bryant You can also undertake the anlayses I have carried out here yourself on your own data - Daniel and David have cowritten a new version of SplitsTree that carries out all of the methods that I have mentioned today.

Split decomposition Identify weakly compatible splits for all possible combinations of quartets Define split lengths for all splits in split system Build splits graph

An example of using distances to calculate the length of internal splits

distance matrix calculated from sequences B 3 C 6 5 D 5 6 9 AB|CD AC|BD AD|BC

An example of using distances to calculate the length of external splits

example A B 3 C 6 5 D 5 6 9 A|CD A|BD A|BC

NeighborNet (NNET) Use NeighborJoining like algorithms to determine the order in which sequences (nodes) can be joined to give a circular ordering. Once you have the circular ordering, use least squares to identify all splits with positive (non zero) lengths Build splits graph

All splits that have a circular ordering can be displayed in a plane

Median networks Perform the median operation on all combinations of 3 sequences Identify all the splits between median and extant sequences – built a splits graph

Consensus networks Extends idea of median networks to splits calculated from trees

106 random trees with 8 taxa

Combining gene trees for 106 loci

Supernetworks

More detail about building a splits graph…..

Adding the Trivial Splits The set O of all trivial splits on X is represented by a star: (Embedded graph: fixed circular ordering) x1 x4 x2 x3 x5 x7 x6

Adding a Circular Split Want to add split {x2,x3,x4} vs {x1,x5,x6,x7} Determine a path from x2 to x4 along the fontier of G Separate components Insert new split edges x4 x3 x2 x3 x2 x4 x1 x6 x5 x7 x1 x5 x6 x7

Adding a Circular Split x3 x2 Want to add split {x2,x3,x4} vs {x1,x5,x6,x7} Determine a path from x2 to x4 along the frontier of G Separate components Insert new split edges Done! x4 x1 x5 x6 x7

Adding a Non-Circular Split x6 x5 x3 x1 x7 x4 x2 x3 x2 Want to add split {x3,x5,x6} vs {x1,x2,x6,x7} Convex hull {x3,x5,x6} Convex hull {x1,x2,x6,x7} Determine intersection 3 2 1 1 x4 3 1 2 2 4 x1 3 4 2 4 x5 3 x6 x7

Adding a Non-Circular Split x3 x1 x7 x4 x2 x2 Want to add split {x3,x5,x6} vs {x1,x2,x6,x7} Convex hull {x3,x5,x6} Convex hull {x1,x2,x6,x7} Determine intersection Duplicate intersection Insert new split edges x4 x1 x5 x6 x7

Adding a Non-Circular Split x3 x1 x7 x4 x2 x2 Want to add split {x3,x5,x6} vs {x1,x2,x6,x7} Convex hull {x3,x5,x6} Convex hull {x1,x2,x6,x7} Determine intersection Duplicate intersection Insert new split edges Done! x4 x1 x5 x6 x7