Some Mathematical Challenges from Life Sciences Part III Peter Schuster, Universität Wien Peter F.Stadler, Universität Leipzig Günter Wagner, Yale University,

Slides:



Advertisements
Similar presentations
RNA & NKS Erik A. Schultes Hedgehog Research hedgehogresearch.info June 16, 2006.
Advertisements

An introduction to maximum parsimony and compatibility
Milgram-Routing in Social Networks
Diversity and Plasticity of RNA Beyond the One-Sequence-One-Structure Paradigm Peter Schuster Institut für Theoretische Chemie und Molekulare Strukturbiologie.
Towards Life Joyce’s lab turns their efforts now to tweaking the ABT system even more The goal is a system that undergoes self- sustained replication.
. Phylogenetic Trees (2) Lecture 13 Based on: Durbin et al 7.4, Gusfield , Setubal&Meidanis 6.1.
Transfer FAS UAS SAINT-PETERSBURG STATE UNIVERSITY COMPUTATIONAL PHYSICS Introduction Physical basis Molecular dynamics Temperature and thermostat Numerical.
Probabilistic Modeling of Molecular Evolution Using Excel, AgentSheets, and R Jeff Krause (Shodor)
The Cat and The Mouse -- The Case of Mobile Sensors and Targets David K. Y. Yau Lab for Advanced Network Systems Dept of Computer Science Purdue University.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Evolvable by Design Panos Oikonomou James Franck Institute Institute of Biophysical Dynamics The University of Chicago Philippe Cluzel How topology affects.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Deployment of Surface Gateways for Underwater Wireless Sensor Networks Saleh Ibrahim Advising Committee Prof. Reda Ammar Prof. Jun-Hong Cui Prof. Sanguthevar.
Geometric Crossover for Biological Sequences Alberto Moraglio, Riccardo Poli & Rolv Seehuus EuroGP 2006.
Implementation of Planted Motif Search Algorithms PMS1 and PMS2 Clifford Locke BioGrid REU, Summer 2008 Department of Computer Science and Engineering.
RNA Folding Kinetics Bonnie Kirkpatrick Dr. Nancy Amato, Faculty Advisor Guang Song, Graduate Student Advisor.
Mario Čagalj supervised by prof. Jean-Pierre Hubaux (EPFL-DSC-ICA) and prof. Christian Enz (EPFL-DE-LEG, CSEM) Wireless Sensor Networks:
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Finding Mathematics in Genes and Diseases Ming-Ying Leung Department of Mathematical Sciences University of Texas at El Paso (UTEP)
Advanced methods of molecular dynamics Monte Carlo methods
Evolutionary Algorithms BIOL/CMSC 361: Emergence Lecture 4/03/08.
Introduction to Job Shop Scheduling Problem Qianjun Xu Oct. 30, 2001.
On realizing shapes in the theory of RNA neutral networks Speaker: Leszek Gąsieniec, U of Liverpool, UK Joint work with: Peter Clote, Boston College, USA.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Evolution of Populations Chapter 16. Gene and Variation Although Mendel and Darwin both worked in the 1800’s, they were not able to share information.
Function-Information Relationship in Nucleic Acids Andrej Luptak U NIVERSITY oU NIVERSITY of C ALIFORNIA ‧ I RVINE Andrej Luptak
A new Programming Paradigm inspired by Algorithmic Chemistries UPP 2004 September 2004 Wolfgang Banzhaf, Memorial University of Newfoundland, Canada and.
Functional Integrals for the Parallel and Eigen Models of Virus Evolution Jeong-Man Park The Catholic University of Korea.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
Finding Mathematics in Genes and Diseases Ming-Ying Leung Department of Mathematical Sciences University of Texas at El Paso (UTEP)
Calculating branch lengths from distances. ABC A B C----- a b c.
What is a QUASI-SPECIES By Ye Dan U062281A USC3002 Picturing the World through Mathematics.
Game-Theoretic Analysis of Mobile Network Coverage David K.Y. Yau.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
A New Evolutionary Approach for the Optimal Communication Spanning Tree Problem Sang-Moon Soak Speaker: 洪嘉涓、陳麗徽、李振宇、黃怡靜.
STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis.
Compositional Assemblies Behave Similarly to Quasispecies Model
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Causality, symmetry, brain, evolution, DNA, and a new theory of Physics Sergio Pissanetzky 1.
1 Use graphs and not pure logic Variables represented by nodes and dependencies by edges. Common in our language: “threads of thoughts”, “lines of reasoning”,
Project funded by the Future and Emerging Technologies arm of the IST Programme Are Proliferation Techniques more efficient than Random Walk with respect.
Sampling Theory and Some Important Sampling Distributions.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Motif Search and RNA Structure Prediction Lesson 9.
Design of a Robust Search Algorithm for P2P Networks
Project funded by the Future and Emerging Technologies arm of the IST Programme Search in Unstructured Networks Niloy Ganguly, Andreas Deutsch Center for.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
. Perfect Phylogeny MLE for Phylogeny Lecture 14 Based on: Setubal&Meidanis 6.2, Durbin et. Al. 8.1.
Probability and Statistics AME20213 – Measurements and Data Analysis.
5.6 Prefix codes and optimal tree Definition 31: Codes with this property which the bit string for a letter never occurs as the first part of the bit string.
Chemistry towards Biology
Some Mathematical Challenges from Life Sciences Part II Peter Schuster, Universität Wien Peter F.Stadler, Universität Leipzig Günter Wagner, Yale University,
Other confidence intervals
Figure 3. Definition of the outside distance, Dout, as the sum of the distances from each of the two protein binding sites to their corresponding.
Network analysis.
Character-Based Phylogeny Reconstruction
Design and Analysis of Algorithms (07 Credits / 4 hours per week)
DNA Library Design for Molecular Computation
Department of Computer Science University of York
Using Quotient Graphs to Model Neutrality in Evolutionary Search Dominic Wilson Devinder Kaur University of Toledo.
A Comparison of Genotype-Phenotype Maps for RNA and Proteins
Universality and diversity of the protein folding scenarios:a comprehensive analysis with the aid of a lattice model  Leonid A Mirny, Victor Abkevich,
Discrete Mathematics for Computer Science
第八届全国复杂网络学术会议 Spectra of transition matrix for networks: Computation and applications 章 忠 志 复旦大学计算机科学技术学院 Homepage:
MUTATIONS! Page 77.
Flow-Induced β-Hairpin Folding of the Glycoprotein Ibα β-Switch
Presentation transcript:

Some Mathematical Challenges from Life Sciences Part III Peter Schuster, Universität Wien Peter F.Stadler, Universität Leipzig Günter Wagner, Yale University, New Haven, CT Angela Stevens, Max-Planck-Institut für Mathematik in den Naturwissenschaften, Leipzig and Ivo L. Hofacker, Universität Wien Oberwolfach, GE,

1.Mathematics and the life sciences in the 21 st century 2.Selection dynamics 3.RNA evolution in silico and optimization of structure and properties

Definition of RNA structure

Sequence space of binary sequences of chain lenght n=5

Population and population support in sequence space: The master sequence

Population and population support in sequence space: The quasi-species

The increase in RNA production rate during a serial transfer experiment

Mapping from sequence space into structure space and into function

Folding of RNA sequences into secondary structures of minimal free energy,  G GGCGCGCCCGGCGCC GUAUCGAAAUACGUAGCGUAUGGGGAUGCUGGACGGUCCCAUCGGUACUCCA UGGUUACGCGUUGGGGUAACGAAGAUUCCGAGAGGAGUUUAGUGACUAGAGG

The Hamming distance between structures in parentheses notation forms a metric in structure space

Replication rate constant: f k =  / [  +  d S (k) ]  d S (k) = d H (S k,S  ) Evaluation of RNA secondary structures yields replication rate constants

The flowreactor as a device for studies of evolution in vitro and in silico Replication rate constant: f k =  / [  +  d S (k) ]  d S (k) = d H (S k,S  ) Selection constraint: # RNA molecules is controlled by the flow

The molecular quasispecies in sequence space

Evolutionary dynamics including molecular phenotypes

In silico optimization in the flow reactor: Trajectory (biologists‘ view)

In silico optimization in the flow reactor: Trajectory (physicists‘ view)

Movies of optimization trajectories over the AUGC and the GC alphabet AUGCGC

Statistics of the lengths of trajectories from initial structure to target ( AUGC -sequences)

Endconformation of optimization

Reconstruction of the last step 43  44

Reconstruction of last-but-one step 42  43 (  44)

Reconstruction of step 41  42 (  43  44)

Reconstruction of step 40  41 (  42  43  44)

Reconstruction of the relay series

Change in RNA sequences during the final five relay steps 39  44 Transition inducing point mutationsNeutral point mutations

In silico optimization in the flow reactor: Trajectory and relay steps

Birth-and-death process with immigration

Calculation of transition probabilities by means of a birth-and-death process with immigration

Neutral genotype evolution during phenotypic stasis Neutral point mutationsTransition inducing point mutations 28 neutral point mutations during a long quasi-stationary epoch

A random sequence of minor or continuous transitions in the relay series

Probability of occurrence of different structures in the mutational neighborhood of tRNA phe Rare neighbors Main transitions Frequent neighbors Minor transitions

In silico optimization in the flow reactor: Main transitions

Three important steps in the formation of the tRNA clover leaf from a randomly chosen initial structure corresponding to three main transitions.

Statistics of the numbers of transitions from initial structure to target ( AUGC -sequences)

Statistics of trajectories and relay series (mean values of log-normal distributions)

Transition probabilities determining the presence of phenotype S k (j) in the population

The number of main transitions or evolutionary innovations is constant.

Neutral genotype evolution during phenotypic stasis Neutral point mutationsTransition inducing point mutations 28 neutral point mutations during a long quasi-stationary epoch

Variation in genotype space during optimization of phenotypes Mean Hamming distance within the population and drift velocity of the population center in sequence space.

Spread of population in sequence space during a quasistationary epoch: t = 150

Spread of population in sequence space during a quasistationary epoch: t = 170

Spread of population in sequence space during a quasistationary epoch: t = 200

Spread of population in sequence space during a quasistationary epoch: t = 350

Spread of population in sequence space during a quasistationary epoch: t = 500

Spread of population in sequence space during a quasistationary epoch: t = 650

Spread of population in sequence space during a quasistationary epoch: t = 820

Spread of population in sequence space during a quasistationary epoch: t = 825

Spread of population in sequence space during a quasistationary epoch: t = 830

Spread of population in sequence space during a quasistationary epoch: t = 835

Spread of population in sequence space during a quasistationary epoch: t = 840

Spread of population in sequence space during a quasistationary epoch: t = 845

Spread of population in sequence space during a quasistationary epoch: t = 850

Spread of population in sequence space during a quasistationary epoch: t = 855

Mapping from sequence space into structure space and into function

The pre-image of the structure S k in sequence space is the neutral network G k

Neutral networks are sets of sequences forming the same structure. G k is the pre-image of the structure S k in sequence space: G k =  -1 (S k )  {I j |  (I j ) = S k } The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small RNA molecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4 n, becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence space. In this approach, nodes are inserted randomly into sequence space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

Mean degree of neutrality and connectivity of neutral networks GC,AU GUC,AUG AUGC

A connected neutral network

A multi-component neutral network

Degree of neutrality of cloverleaf RNA secondary structures over different alphabets

Reference for postulation and in silico verification of neutral networks

The compatible set C k of a structure S k consists of all sequences which form S k as its minimum free energy structure (the neutral network G k ) or one of its suboptimal structures.

The intersection of two compatible sets is always non empty: C 0  C 1  

Reference for the definition of the intersection and the proof of the intersection theorem

A sequence at the intersection of two neutral networks is compatible with both structures

Barrier tree for two long living structures

A ribozyme switch E.A.Schultes, D.B.Bartel, Science 289 (2000),

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis-  -virus (B)

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

Two neutral walks through sequence space with conservation of structure and catalytic activity

Examples of smooth landscapes on Earth Massif Central Mount Fuji

Examples of rugged landscapes on Earth Dolomites Bryce Canyon

Evolutionary optimization in absence of neutral paths in sequence space

Evolutionary optimization including neutral paths in sequence space

Example of a landscape on Earth with ‘neutral’ ridges and plateaus Grand Canyon

Neutral ridges and plateus