7-1 Chapter 7 Genome Rearrangement. 7-2 Background In the late 1980‘s Jeffrey Palmer and colleagues discovered a remarkable and novel pattern of evolutionary.

Slides:



Advertisements
Similar presentations
A Simpler 1.5-Approximation Algorithm for Sorting by Transpositions Tzvika Hartman Weizmann Institute.
Advertisements

Sorting by reversals Bogdan Pasaniuc Dept. of Computer Science & Engineering.
School of CSE, Georgia Tech
Greedy Algorithms CS 466 Saurabh Sinha. A greedy approach to the motif finding problem Given t sequences of length n each, to find a profile matrix of.
Greedy Algorithms CS 6030 by Savitha Parur Venkitachalam.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture10.
DCJUC: A Maximum Parsimony Simulator for Constructing Phylogenetic Tree of Genomes with Unequal Contents Zhaoming Yin Bader-Polo Joint Group Meeting, Nov.
Gene an d genome duplication Nadia El-Mabrouk Université de Montréal Canada.
© The McGraw-Hill Companies, Inc., Chapter 2 The Complexity of Algorithms and the Lower Bounds of Problems.
Bioinformatics Chromosome rearrangements Chromosome and genome comparison versus gene comparison Permutations and breakpoint graphs Transforming Men into.
Introduction Sorting permutations with reversals in order to reconstruct evolutionary history of genome Reversal mutations occur often in chromosomes where.
Greedy Algorithms And Genome Rearrangements
Genome Rearrangements CIS 667 April 13, Genome Rearrangements We have seen how differences in genes at the sequence level can be used to infer evolutionary.
2 -1 Chapter 2 The Complexity of Algorithms and the Lower Bounds of Problems.
Sorting Signed Permutations By Reversals (The Hannenhalli – Pevzner Theory) Seminar in Bioinformatics – ©Shai Lubliner.
Introduction to Bioinformatics Algorithms Greedy Algorithms And Genome Rearrangements.
Of Mice and Men Learning from genome reversal findings Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes and Transforming.
Genome Rearrangements CSCI : Computational Genomics Debra Goldberg
Cleber V. G. Mira Analysis of Sorting by Transpositions based on Algebraic Formalism RECOMB 2004 João Meidanis.
Transforming Cabbage into Turnip: Polynomial Algorithm for Sorting Signed Permutations by Reversals Journal of the ACM, vol. 46, No. 1, Jan 1999, pp
5. Lecture WS 2003/04Bioinformatics III1 Genome Rearrangements Compare to other areas in bioinformatics we still know very little about the rearrangement.
Genome Rearrangement SORTING BY REVERSALS Ankur Jain Hoda Mokhtar CS290I – SPRING 2003.
1 Sorting by Transpositions Based on the First Increasing Substring Concept Advisor: Professor R.C.T. Lee Speaker: Ming-Chiang Chen.
1 Genome Rearrangements João Meidanis São Paulo, Brazil December, 2004.
A Linear-Time Algorithm for Computing Inversion Distance between signed Permutations with an experimental Study David Bader, Bernard Moret, Mi Yan Presented.
Efficient Data Structures and a New Randomized Approach for Sorting Signed Permutations by Reversals Haim Kaplan and Elad Verbin.
Phylogenetic Tree Construction and Related Problems Bioinformatics.
Approximation Algorithms
GRAPH Learning Outcomes Students should be able to:
Data Structures Using C++ 2E
CSE 6406: Bioinformatics Algorithms. Course Outline
Genome Rearrangements …and YOU!! Presented by: Kevin Gaittens.
Genome Rearrangements Tseng Chiu Ting Sept. 24, 2004.
1 A Simpler 1.5- Approximation Algorithm for Sorting by Transpositions Combinatorial Pattern Matching (CPM) 2003 Authors: T. Hartman & R. Shamir Speaker:
UNIVERSIDADE ESTADUAL DE CAMPINAS - UNICAMP INSTITUTO DE COMPUTAÇÃO Cleber V. G. Mira Analysis of Sorting by Transpositions based on Algebraic Formalism.
16. Lecture WS 2004/05Bioinformatics III1 V16 – genome rearrangement Important information – contained in the order in which genes occur on the genomes.
A Simpler 1.5-Approximation Algorithm for sorting by transposition Tzvika Hartman.
Genome Rearrangements Unoriented Blocks. Quick Review Looking at evolutionary change through reversals Find the shortest possible series of reversals.
Greedy Algorithms And Genome Rearrangements An Introduction to Bioinformatics Algorithms (Jones and Pevzner)
Semester Project: Greedy Algorithms and Genome Rearrangements August/17/2012 Name: Xuanyu Hu Professor: Elise de Doncker.
The Lower Bounds of Problems
Genome Rearrangements [1] Ch Types of Rearrangements Reversal Translocation
Greedy Algorithms And Genome Rearrangements
Chap. 7 Genome Rearrangements Introduction to Computational Molecular Biology Chap ~
CS 200 Algorithms and Data Structures
Andrew’s Leap 2011 Pancakes With A Problem Steven Rudich.
Chap. 7 Genome Rearrangements Introduction to Computational Molecular Biology Chapter 7.1~7.2.4.
Greedy Algorithms CS 498 SS Saurabh Sinha. A greedy approach to the motif finding problem Given t sequences of length n each, to find a profile matrix.
Gene: A sequence of nucleotides coding for protein Gene Prediction Problem: Determine the beginning and end positions of genes in a genome Gene Prediction:
Comp. Genomics Recitation 10 Clustering and analysis of microarrays.
Genome Rearrangement By Ghada Badr Part I.
Introduction to Bioinformatics Algorithms Chapter 5 Greedy Algorithms and Genome Rearrangements By: Hasnaa Imad.
Genome Rearrangements. Turnip vs Cabbage: Look and Taste Different Although cabbages and turnips share a recent common ancestor, they look and taste different.
Genome Rearrangements. Turnip vs Cabbage: Look and Taste Different Although cabbages and turnips share a recent common ancestor, they look and taste different.
Outline Today’s topic: greedy algorithms
1 Genome Rearrangements (Lecture for CS498-CXZ Algorithms in Bioinformatics) Dec. 6, 2005 ChengXiang Zhai Department of Computer Science University of.
Comp. Genomics Recitation 7 Clustering and analysis of microarrays.
Graphing Form of Sine and Cosine Functions
Tzvika Hartman Elad Verbin Bar Ilan University Tel Aviv University
Lecture 4: Genome Rearrangements. End Sequence Profiling (ESP) C. Collins and S. Volik (UCSF Cancer Center) 1)Pieces of tumor genome: clones ( kb).
Lecture 2: Genome Rearrangements. Outline Cancer Sequencing Transforming Cabbage into Turnip Genome Rearrangements Sorting By Reversals Pancake Flipping.
CSE 5290: Algorithms for Bioinformatics Fall 2009
Greedy (Approximation) Algorithms and Genome Rearrangements
Lecture 3: Genome Rearrangements and Duplications
CSCI2950-C Lecture 4 Genome Rearrangements
Greedy Algorithms And Genome Rearrangements
FanChang Hao, Melvin Zhang, and Hon Wai Leong Review for TCBB
Double Cut and Join with Insertions and Deletions
Greedy Algorithms And Genome Rearrangements
Presentation transcript:

7-1 Chapter 7 Genome Rearrangement

7-2 Background In the late 1980‘s Jeffrey Palmer and colleagues discovered a remarkable and novel pattern of evolutionary change in plant organelles. They mapped the mitochondrial genomes of Brassica oleracea (cabbage ,高麗菜 ) and Brassica campestris (turnip ,大頭菜 ), which are very closely related (many genes are 99% ~ 99.9% identical), differ dramatically in gene order.

7-3 Genome Rearrangement Input: Two genomes which contains the same set of genes, but the order of genes is different. Goal: Find the shortest sequence of rearrange operations transforming one genome to another. Since we are interested in the order of genes, we label each gene a unique number, 1, 2, 3, …, n. We may view the problem as a sorting problem, with some special operations (such as transposition and reversal).

7-4 Terminologies G=“ ” -g: the reverse of gene g Example: gene 5=“GCTGA”, -5=“AGTCG” Transposition: swap two adjacent substrings of any length without changing the order of the two substrings  Reversal: invert the order of a substring of any length 

7-5 Terminologies Transposition:ρ(i, j, k) e.g. π ={ }, π ·  (1,3,6)={ } Unsigned reversal:  Signed reversal: 

7-6 Sorting by Reversal

7-7 Sorting by Transposition Input: A permutation π=π 1 π 2... π n of 1, 2,..., n, with π 0 = 0, π n+1 = n+1. Goal: Sort π by the minimum number of transpositions. Example:

7-8 Break Points For all 0  i  n in a permutation, there is a breakpoint between π i and π i+1 if π i +1≠π i+1. π= { } has 6 breakpoints 0  3   2  1  4  8 9 We can eliminate at most three breakpoints in a single transposition. Example: 0 1  4  2 3  5 6  A trivial lower bound

7-9 Lower Bound and Cycle Graph gray edge: from i-1 to i black edge:from π i to π i-1 There are 4 alternating cycles (each pair of adjacent edges are of different colors). Notation: c(G)= 4

7-10 Cycle Graph of Identity Permutation The cycle graph of identity permutation {012…(n+1)} can be decomposed into n+1 cycles. The purpose of sorting π is increasing the number of cycles from c(π) to n+1.

7-11 c(G) Change in Transposition (1) Δc(G)=2 Δc(G)=0

7-12 c(G) Change in Transposition (2) Δc(G)  {-2, 0, 2} x-move: Δc(G)= x after a transposition Δc(G)=0 Δc(G)=-2

7-13 Identity permutation has n+1 cycles. Each transposition increases # of cycles by at most two. lower bound of transposition distance: Lower Bound of Transposition Distance

approximation Algorithm and Cycles A cycle can be represented by (i 1, i 2,..., i k ) according to the visiting black edges from i 1 to i k, where i 1 is the rightmost black edge in the cycle. Cycles: (6,1,3,4), (7,5) and (2) Non-oriented cycle: (7,5): decreasing sequence Oriented cycle: (6,1,3,4)

move on an Oriented Cycle C = (i 1,..., i k ): an oriented cycle, 3  t  k, i t > i t-1 ρ(i t-1, i t, i 1 ) is a 2-move transposition. After ρ(1,3,6): Δc(G)=2

move in a Non-oriented Cycle Δc(G) = 0 We can not perform 2-moves on a non-oriented cycle. A non-oriented cycle can be transformed into an oriented cycle with a special 0-move transposition. After ρ(2,3,7):

move on an Oriented Cycle Δc(G) = 2After ρ(2,5,6): When there is an oriented cycle, we can perform 2-move transposition on it again.

approximation Algorithm Summary If there is an oriented cycle, then perform a 2-move. If there is no oriented cycle, we can create one from a non-oriented cycle via a 0-move. So we can increase at least two cycles in two transpositions. It is a 2-approximation algorithm.

7-19 Definitions for 1.75 Approximation Short cycle: cycle with at most two black edges. Long cycle: cycle with three or more black edges.

7-20 Definitions for 1.75 Approximation Even cycle: cycle with even number of black edges. Odd cycle: cycle with odd number of black edges.

7-21 Mail Approach For a long cycle, we can increase four cycles in three consecutive transpositions. In the worst case, average Δf 1 =4/3 For a short cycle, we can increase four odd cycles and decrease two even cycles in two consecutive transpositions. On average Δf 2 =(4x-2)/2=2x-1 (See the definition of object function on the next page.)

7-22 Approximation Ratio Define an object function: f(π)=xC odd (π)+C even (π), where x > 1. For π I = identity permutation, f(π I )=x(n+1). Δc(G)  {-2, 0, 2}, so f(π) increases by at most 2x after a transposition (Δf  2x) The minimal value of Ratio: 2x-1=4/3 Ratio=1.75

7-23 An Example for Short Cycles Δf = 2x-2 Δf = 2x After ρ(2,3,4): C odd (π)=0 C even (π)=2 C odd (π)=2 C even (π)=0 C odd (π)=4 C even (π)=0 After ρ(1,2,4):

Move for Long Cycles (1) Δf = 00-move ρ(2,4,6): Cycles: (6,4,2), (5,3,1) C odd (π)=2 C even (π)=0 Cycles: (6,4,2), (5,1,3) C odd (π)=2 C even (π)=0

Move for Long Cycles (2) Δf = 2x Cycles: (6,4,2), (5,1,3) C odd (π)=2 C even (π)=0 2-move ρ(1,3,5): Cycles: (6,2,4) C odd (π)=4 C even (π)=0

Move for Long Cycles (2) Δf = 2x Cycles: (6,2,4) C odd (π)=4 C even (π)=0 2-move ρ(2,4,6): C odd (π)=6 C even (π)=0