Recombination:. Different recombinases have different topological mechanisms: Xer recombinase on psi. Unique product Uses topological filter to only perform.

Slides:



Advertisements
Similar presentations
Lecture 6: Creating a simplicial complex from data. in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
Advertisements

Preliminaries Computing Betti Numbers Computing Persistent Homology.
Lecture 5: Triangulations & simplicial complexes (and cell complexes). in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305)
Topological Data Analysis
Phylogenetic reconstruction
Molecular Evolution Revised 29/12/06
Bioinformatics Chromosome rearrangements Chromosome and genome comparison versus gene comparison Permutations and breakpoint graphs Transforming Men into.
Close Lower and Upper Bounds for the Minimum Reticulate Network of Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
Finding generators for H1.
Phylogenetic Tree Construction and Related Problems Bioinformatics.
Classification and Phylogenies Taxonomic categories and taxa Inferring phylogenies –The similarity vs. shared derived character states –Homoplasy –Maximum.

Sequencing a genome and Basic Sequence Alignment
Department of Biomedical Informatics Biomedical Data Visualization Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Recombination:. Different recombinases have different topological mechanisms: Xer recombinase on psi. Unique product Uses topological filter to only perform.
Recombination Homologous recombination Site-specific recombination Transposition Different sets of proteins are responsible for the different types of.
Lecture 4: Addition (and free vector spaces) of a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
DNA TOPOLOGY De Witt Sumners Department of Mathematics Florida State University Tallahassee, FL
Topological Data Analysis
DNA TOPOLOGY: EXPERIMENTS AND ANALYSIS
Sequencing a genome and Basic Sequence Alignment
Lecture 2: Addition (and free abelian groups) of a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology:
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Introduction to Phylogenetics
Isabel K. Darcy Mathematics Department University of Iowa
Isabel K. Darcy Mathematics Department University of Iowa ©2008 I.K. Darcy. All rights reserved.
Optional Lecture: A terse introduction to simplicial complexes in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305)
Isabel K. Darcy Mathematics Department Applied Mathematical and Computational Sciences (AMCS) University of Iowa ©2008.
Lecture 3: Modular Arithmetic of a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305) Topics in Topology: Scientific and.
Isabel K. Darcy Mathematics Department Applied Mathematical and Computational Sciences (AMCS) University of Iowa ©2008.
Cis-regulatory Modules and Module Discovery
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Sept 9, 2013: Create your own homology. Fall 2013.
A filtered complex is an increasing sequence of simplicial complexes: C0 C1 C2 …
A filtered complex is an increasing sequence of simplicial complexes: C 0 C 1 C 2 … UUU.
Welcome to MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Week 1: Introduction to Topological Data.
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Sept 16, 2013: Persistent homology III Fall 2013.
Recombination:. Different recombinases have different topological mechanisms: Xer recombinase on psi. Unique product Uses topological filter to only perform.
Sept 25, 2013: Applicable Triangulations.
From Natural Images to MRIs: Using TDA to Analyze Image Data
Nov 6, 2013: Stable Persistence and time series.
TDA is a form of Exploratory Data Analysis (EDA)
Zigzag Persistent Homology Survey
Difference topology experiments and skein relations
Oct 16, 2013: Zigzag Persistence and installing Dionysus part I.
Sept 23, 2013: Image data Application.
“DNA is a helical structure” with “two co-axial molecules.”
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 20, 2013: Intro to RNA & Topological Landscapes.
Application to Natural Image Statistics
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 22, 2013: Topological methods for exploring low-density.
Dec 4, 2013: Hippocampal spatial map formation
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 8, 2013: DNA Topology Fall 2013 course offered.
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 15, 2013: Brief intro to tangles & Phylogeny.
A Hybrid Algorithm for Multiple DNA Sequence Alignment
Defn: degree of si = deg si = time when si enters the filtration. .
Tutorial: Introduction to DNA topology
Multiple Alignment and Phylogenetic Trees

Clustering Via Persistent Homology
A filtered complex is an increasing sequence of simplicial complexes: C0 C1 C2 …

Rational 2-string tangles
Phylogeny and the Tree of Life
Tangle analysis of protein-DNA complexes.
Presentation transcript:

Recombination:

Different recombinases have different topological mechanisms: Xer recombinase on psi. Unique product Uses topological filter to only perform deletions, not inversions Ex: Cre recombinase can act on both directly and inversely repeated sites.

PNAS 2013

Tangle Analysis of Protein-DNA complexes

Mathematical Model Protein = DNA = = ==

Protein-DNA complex Heichman and Johnson C. Ernst, D. W. Sumners, A calculus for rational tangles: applications to DNA recombination, Math. Proc. Camb. Phil. Soc. 108 (1990), protein = three dimensional ball protein-bound DNA = strings. Slide (modified) from Soojeong Kim

Solving tangle equations Tangle equation from: Path of DNA within the Mu transpososome. Transposase interactions bridging two Mu ends and the enhancer trap five DNA supercoils. Pathania S, Jayaram M, Harshey RM. Cell May 17;109(4):

vol. 110 no. 46, 18566–18571, 2013

Background

Recombination:

Homologous recombination

books.com/MoBio/Free/Ch8D2. htm

Homologous recombination

Distances can be derived from Multiple Sequence Alignments (MSAs). The most basic distance is just a count of the number of sites which differ between two sequences divided by the sequence length. These are sometimes known as p-distances. Cat ATTTGCGGTA Dog ATCTGCGATA Rat ATTGCCGTTT Cow TTCGCTGTTT CatDogRatCow Cat Dog Rat Cow Where do we get distances from?

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

Perfectly “ tree-like ” distances CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow

CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow RatDogCat Dog3 Cat45 Cow676 Rat Dog Cat Cow

CatDogRat Dog3 Rat45 Cow676 Cat Dog Rat Cow RatDogCat Dog3 Cat45 Cow676 Rat Dog Cat Cow CatDogRat Dog4 Rat44 Cow676

Linking algebraic topology to evolution. Chan J M et al. PNAS 2013;110: ©2013 by National Academy of Sciences

Linking algebraic topology to evolution. Chan J M et al. PNAS 2013;110: ©2013 by National Academy of Sciences Reticulation

Multiple sequence alignment

Reassortment

Homologous recombination

Reconstructing phylogeny from persistent homology of avian influenza HA. (A) Barcode plot in dimension 0 of all avian HA subtypes. Chan J M et al. PNAS 2013;110: ©2013 by National Academy of Sciences Influenza: For a single segment, no H k for k > 0 no horizontal transfer (i.e., no homologous recombination)

Persistent homology of reassortment in avian influenza. Chan J M et al. PNAS 2013;110: ©2013 by National Academy of Sciences 009/06/29/reassor tment-of-the- influenza-virus- genome/ For multiple segments, non-trivial H k k = 1, 2. Thus horizontal transfer via reassortment but not homologous recombination

Barcoding plots of HIV-1 reveal evidence of recombination in (A) env, (B), gag, (C) pol, and (D) the concatenated sequences of all three genes. Chan J M et al. PNAS 2013;110: ©2013 by National Academy of Sciences HIV – single segment (so no reassortment) Non-trivial H k k = 1, 2. Thus horizontal transfer via homologous recombination.

TOP = Topological obstruction = maximum barcode length in non-zero dimensions TOP ≠ 0  no additive distance tree TOP is stable

ICR = irreducible cycle rate = average number of the one-dimensional irreducible cycles per unit of time Simulations show that ICR is proportional to and provides a lower bound for recombination/reassortment rate

Persistent homology Viral evolution Filtration value  Genetic distance (evolutionary scale)  0 at filtration value  Number of clusters at scale  Generators of H 0 A representative element of the cluster Hierarchical Hierarchical clustering relationship among H 0 generators  1 Number of reticulate events (recombination and reassortment)

Persistent homology Viral evolution Generators of H 1 Reticulate events Generators of H 2 Complex horizontal genomic exchange H k ≠ 0 for some k > 0 No phylogenetic tree representation Number of Lower bound on rate of higher-dimensional reticulate events generators over time (irreducible cycle rate)

MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Oct 16, 2013: Zigzag Persistence and installing Dionysus part I. Fall 2013 course offered through the University of Iowa Division of Continuing Education Isabel K. Darcy, Department of Mathematics Applied Mathematical and Computational Sciences, University of Iowa

/activities/Carlsson-Gunnar/lecture14.pdf

workshop_files/slides/deSilva_TGDA.pdf

Lee-Mumford-Pedersen [LMP] study only high contrast patches. Collection: 4.5 x 10 6 high contrast patches from a collection of images obtained by van Hateren and van der Schaaf Recall from Sept 20 lecture

M(100, 10) U Q where |Q| = 30 On the Local Behavior of Spaces of Natural Images, Gunnar Carlsson, Tigran Ishkhanov, Vin de Silva, Afra Zomorodian, International Journal of Computer Vision 2008, pp 1-12.

The Theory of Multidimensional Persistence, Gunnar Carlsson, Afra Zomorodian "Persistence and Point Clouds" Functoriality, diagrams, difficulties in classifying diagrams, multidimensional persistence, Gröbner bases, Gunnar Carlsson

The Theory of Multidimensional Persistence, Gunnar Carlsson, Afra Zomorodian "Persistence and Point Clouds" Functoriality, diagrams, difficulties in classifying diagrams, multidimensional persistence, Gröbner bases, Gunnar Carlsson

Computing Multidimensional Persistence, Gunnar Carlsson, Gurjeet Singh, and Afra Zomorodian

Time varying data X[t 0, t 1 ] = data points existing at time t for t in [t 0, t 1 ] X[t 1, t 2 ] X[t 2, t 3 ] X[t 0, t 2 ] X[t 1, t 3 ] X[t 2, t 4 ]

Time varying data X[t 0, t 1 ] = data points existing at time t for t in [t 0, t 1 ] X[t 1, t 2 ] X[t 2, t 3 ] X[t 0, t 2 ] X[t 1, t 3 ] X[t 2, t 4 ] VR(X[t 1, t 2 ], ε) VR(X[t 2, t 3 ], ε) VR(X[t 0, t 2 ], ε) VR(X[t 1, t 3 ], ε) VR(X[t 2, t 4 ], ε)

Time varying data X[t 0, t 1 ] = data points existing at time t for t in [t 0, t 1 ] X[t 1, t 2 ] X[t 2, t 3 ] X[t 0, t 2 ] X[t 1, t 3 ] X[t 2, t 4 ] VR(X[t 1, t 2 ], ε) VR(X[t 2, t 3 ], ε) VR(X[t 0, t 2 ], ε) VR(X[t 1, t 3 ], ε) VR(X[t 2, t 4 ], ε) C 0  C 1  C 2  C 3  C 4

C 1 C 3 C 0 C 2 C 4 H 1 H 3 H 0 H 2 H 4

C 0  C 1  C 2  C 3  C 4 H 0  H 1  H 2  H 3  H 4 H k i, p = Z k i /(B k i+p Z k i ) = L(i, i+p)( H k i ) U Persistent Homology: C 0  C 1  C 2  C 3  C 4 H 0  H 1  H 2  H 3  H 4 Zigzag Homology: