T HE P ROBLEM OF R ECONSTRUCTING K - ARTICULATED P HYLOGENETIC N ETWORK Supervisor : Dr. Yiu Siu Ming Second Examiner : Professor Francis Y.L. Chin Student.

Slides:



Advertisements
Similar presentations
CS 336 March 19, 2012 Tandy Warnow.
Advertisements

An Algorithm for Constructing Parsimonious Hybridization Networks with Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
Triangle partition problem Jian Li Sep,2005.  Proposed by Redstar in Algorithm board in Fudan BBS.  Motivated by some network design strategy.
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Divide and Conquer. Subject Series-Parallel Digraphs Planarity testing.
PHYLOGENETIC TREES Bulent Moller CSE March 2004.
. Phylogenetic Trees (2) Lecture 13 Based on: Durbin et al 7.4, Gusfield , Setubal&Meidanis 6.1.
AVL Trees1 Part-F2 AVL Trees v z. AVL Trees2 AVL Tree Definition (§ 9.2) AVL trees are balanced. An AVL Tree is a binary search tree such that.
Greedy Algorithms Greed is good. (Some of the time)
Advanced Topics in Algorithms and Data Structures 1 Rooting a tree For doing any tree computation, we need to know the parent p ( v ) for each node v.
Topics: 1. Trees - properties 2. The master theorem 3. Decoders מבנה המחשב - אביב 2004 תרגול 4#
Close Lower and Upper Bounds for the Minimum Reticulate Network of Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
Course Review COMP171 Spring Hashing / Slide 2 Elementary Data Structures * Linked lists n Types: singular, doubly, circular n Operations: insert,
Fast Computation of the Exact Hybridization Number of Two Phylogenetic Trees Yufeng Wu and Jiayin Wang Department of Computer Science and Engineering University.
Phylogenetic Networks of SNPs with Constrained Recombination D. Gusfield, S. Eddhu, C. Langley.
Perfect Phylogeny MLE for Phylogeny Lecture 14
Algorithm Animation for Bioinformatics Algorithms.
Tractable Symmetry Breaking Using Restricted Search Trees Colva M. Roney-Dougal, Ian P. Gent, Tom Kelsey, Steve Linton Presented by: Shant Karakashian.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Dynamic Programming. Well known algorithm design techniques:. –Divide-and-conquer algorithms Another strategy for designing algorithms is dynamic programming.
“On an Algorithm of Zemlyachenko for Subtree Isomorphism” Yefim Dinitz, Alon Itai, Michael Rodeh (1998) Presented by: Masha Igra, Merav Bukra.
Lecture 10 Trees –Definiton of trees –Uses of trees –Operations on a tree.
MINATO ZDD Project Efficient Enumeration of the Directed Binary Perfect Phylogenies from Incomplete Data Toshiki Saitoh (ERATO) Joint work with Masashi.
Constructing evolutionary trees from rooted triples Bang Ye Wu Dept. of Computer Science and Information Engineering Shu-Te University.
Discrete Structures Lecture 12: Trees Ji Yanyan United International College Thanks to Professor Michael Hvidsten.
Phylogenetic networks: recent questions and results (or: constructing a level-2 phylogenetic network from a dense set of input triplets in polynomial time)
5.5.2 M inimum spanning trees  Definition 24: A minimum spanning tree in a connected weighted graph is a spanning tree that has the smallest possible.
394C, Spring 2013 Sept 4, 2013 Tandy Warnow. DNA Sequence Evolution AAGACTT TGGACTTAAGGCCT -3 mil yrs -2 mil yrs -1 mil yrs today AGGGCATTAGCCCTAGCACTT.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
5.5.3 Rooted tree and binary tree  Definition 25: A directed graph is a directed tree if the graph is a tree in the underlying undirected graph.  Definition.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Preview  Graph  Tree Binary Tree Binary Search Tree Binary Search Tree Property Binary Search Tree functions  In-order walk  Pre-order walk  Post-order.
5.5.2 M inimum spanning trees  Definition 24: A minimum spanning tree in a connected weighted graph is a spanning tree that has the smallest possible.
Dale Roberts Department of Computer and Information Science, School of Science, IUPUI CSCI 240 Recursion and Trees Dale Roberts, Lecturer
Speeding Up Enumeration Algorithms with Amortized Analysis Takeaki Uno (National Institute of Informatics, JAPAN)
Subtree Prune Regraft & Horizontal Gene Transfer or Recombination.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Chapter 10: Trees A tree is a connected simple undirected graph with no simple circuits. Properties: There is a unique simple path between any 2 of its.
Data Structures Lakshmish Ramaswamy. Tree Hierarchical data structure Several real-world systems have hierarchical concepts –Physical and biological systems.
598AGB Basics Tandy Warnow. DNA Sequence Evolution AAGACTT TGGACTTAAGGCCT -3 mil yrs -2 mil yrs -1 mil yrs today AGGGCATTAGCCCTAGCACTT AAGGCCTTGGACTT.
Constructing a level-2 phylogenetic network from a dense set of input triplets Leo van Iersel 1, Judith Keijsper 1, Steven Kelk 2, Leen Stougie 12 (1)
Why use phylogenetic networks?
CS 598 AGB Supertrees Tandy Warnow. Today’s Material Supertree construction: given set of trees on subsets of S (the full set of taxa), construct tree.
. Perfect Phylogeny MLE for Phylogeny Lecture 14 Based on: Setubal&Meidanis 6.2, Durbin et. Al. 8.1.
Hierarchical clustering approaches for high-throughput data Colin Dewey BMI/CS 576 Fall 2015.
Chapter 6 – Trees. Notice that in a tree, there is exactly one path from the root to each node.
Chapter 11. Chapter Summary  Introduction to trees (11.1)  Application of trees (11.2)  Tree traversal (11.3)  Spanning trees (11.4)
Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson and David Bryant Presented by Peggy Wang.
Lecture #13. Topics 1.The Graph Abstract Data Type. 2.Graph Representations. 3.Elementary Graph Operations.
Trees A non-linear implementation for collection classes.
D ESIGN & A NALYSIS OF A LGORITHM 13 – D YNAMIC P ROGRAMMING Informatics Department Parahyangan Catholic University.
by d. gusfield v. bansal v. bafna y. song presented by vikas taliwal
394C, Spring 2012 Jan 23, 2012 Tandy Warnow.
Chapter 5 : Trees.
DATA STRUCTURES AND OBJECT ORIENTED PROGRAMMING IN C++
Binary Trees "A tree may grow a thousand feet tall, but its leaves will return to its roots." -Chinese Proverb.
ITEC 2620M Introduction to Data Structures
Lecture 5 Dynamic Programming
Chapter 6 Transform and Conquer.
Hierarchical clustering approaches for high-throughput data
Data Structures & Algorithms
Topic 18 Binary Trees "A tree may grow a thousand feet tall, but its leaves will return to its roots." -Chinese Proverb.
Lectures on Graph Algorithms: searching, testing and sorting
Trees Definitions Implementation Traversals K-ary Trees
Subtree Isomorphism in O(n2.5)
Introduction Wireless Ad-Hoc Network
CS 581 Tandy Warnow.
Phylogeny.
Heaps Chapter 6 Section 6.9.
Presentation transcript:

T HE P ROBLEM OF R ECONSTRUCTING K - ARTICULATED P HYLOGENETIC N ETWORK Supervisor : Dr. Yiu Siu Ming Second Examiner : Professor Francis Y.L. Chin Student : Vu Thi Quynh Hoa

C ONTENTS 1. Introduction  Motivation  Related Work  Project Plan 2. Problem Definitions 3. Algorithms  1-articulated Network Algorithm  2-articulated Network Algorithm

I NTRODUCTION – M OTIVATION To model the evolutionary history of species, phylogenetic network is a powerful approach to represent the articulation events Level-x network : the time complexity of all existing algorithms increases exponentially when x gets higher k-articulated network is a more naturally biological model which can capture complex scenarios of articulation events with a smaller value of k E.g. level-4 network vs. 2-articulated network

R ELATED W ORK The problem of constructing phylogenetic networks has been worked under many approaches using different input types Nakhleh et al. proposed an algorithm constructing a level-1 network from two trees in polynomial time Huynh et al. with a polynomial -running-time algorithm building a galled network from a set of trees Bryant and Moulton developed NeighborNet method to construct a network from a distance matrix Jansson, Nguyen and Sung with O ( n 3 ) running time to construct a galled network given a set of triplets Extending to level-2 network, Van Iersel et al. provided an O ( n 8 ) algorithm

S CHEDULES – P ROJECT P LAN ObjectivesTime Round 1 1.Reconstructing restricted 1-articulated network from a set of binary phylogenetic trees 30 Sep Reconstructing restricted 2-articulated network from a set of binary phylogenetic trees 15 Nov 2011 Round 2 3.Reconstructing 1-articulated network from a Distance Matrix 15 Feb Implementation one of the three problems 31 Mar 2012

D EFINITIONS Phylogenetic Tree A rooted, unordered tree with distinctly labeled leaves representing each strain of the species Phylogenetic Network A rooted, directed acyclic graph in which: One node has indegree 0 (the root ), and all other nodes have indegree 1 or 2 All nodes with indegree 2 must have outdegree 1 ( hybrid nodes ) All other nodes with indegree 1 have outdegree 0 or 2 Nodes with outdegree 0 are leaves which are distinctly labeled Node s is called a split node of a hybrid node h if s can be reached using two disjoint paths from the children of s

P HYLOGENETIC N ETWORK

D EFINITIONS k-articulated network a phylogenetic network in which every split node corresponds to at most k hybrid nodes A level- k network is a k -articulated network A k- articulated network can model a level- x network (x > k) Level-2 network 1-articulated network

D EFINITIONS A network is non-skew if all paths from any split node to its hybrid node have a length ≥ 2 A network is safe if the siblings of all hybrid nodes are not hybrid nodes A network is restricted if it is non-skew and safe

D EFINITIONS Given a hybrid node h and its parents p and q, a cut on edge ( p, h ) means removing the edge ( p, h ) from the network, and then for every node with indegree 1 and outdegree less than 2, contracting its outgoing edge A network N is compatible with phylogenetic tree T if N can be converted to T by performing a series of cuts one by one. h p q h p q

P ROBLEM D EFINITION Reconstructing a restricted k-articulated network (where k = 1, 2 ) from a set of binary trees Given a set of phylogenetic binary trees T i, i = 1, 2, …, k, with the same leaf label set, construct a restricted k-articulated network N (where k = 1, 2) with minimum number of hybrid nodes compatible with each tree T i

A LGORITHM Divide and Conquer Technique Dividing BipartitionTripartitionQuadripartition Conquering ?

1- ARTICULATED N ETWORK A LGORITHM Case 1: Each input tree is a single node – Base case Case 2: Input tree set admits a leaf set bipartition Case 3: Input tree set admits a leaf set tripartition

1- ARTICULATED N ETWORK A LGORITHM Case 1: Each input tree is a single node – Base case – O (1) Return a network which is a single node with the same label

1- ARTICULATED N ETWORK A LGORITHM Case 2: Input tree set admits a leaf set bipartition – O ( kn ) T1T1 T2T2 TkTk N1N1 N2N2 r N Combination

r 1- ARTICULATED N ETWORK A LGORITHM Case 3: Input tree set admits a leaf set tripartition – O ( kn ) T1T1 T2T2 TkTk N1N1 N2N2 NhNh x y It takes O ( kn ) to find nodes x in N 1 and y in N 2

2- ARTICULATED N ETWORK A LGORITHM Case 1: Each input tree is a single node – Base case Case 2: Input tree set admit a leaf set bipartition Case 3: Input tree set admit a leaf set tripartition Case 4: Input tree set admit a leaf set quadripartition

r 4- ARTICULATED N ETWORK A LGORITHM Case 4: Input tree set admits a leaf set quadripartition – O ( kn ) T1T1 T2T2 TkTk N h1 x1x1 y1y1 It takes O ( kn ) to find nodes x 1 & x 2 in N 1 and y 1 & y 2 in N 2 N h1 x2x2 y2y2 N2N2 N1N1

r 4- ARTICULATED N ETWORK A LGORITHM Case 4: Input tree set admits a leaf set quadripartition – O ( kn ) T1T1 T2T2 TkTk N h1 x1x1 y1y1 It takes O ( kn ) to find nodes x 1 & x 2 in N 1 and y 1 & y 2 in N 2 N h1 x2x2 y2y2 N2N2 N1N1

T IME C OMPLEXITY Time complexity of the Algorithms in reconstructing a restricted k -articulated network, in both cases when k = 1, 2: Each recursive step takes O ( kn ) running time to check whether the input tree set admit a leaf set bipartition or tripartition, and then combine the subnetworks returned The number of nodes in the restricted 1-articulated network is O ( n ) Therefore, the total time complexity is O ( kn 2 )

T HANK Y OU !