Topological Methods for RNA Pseudoknots

Slides:



Advertisements
Similar presentations
B. Knudsen and J. Hein Department of Genetics and Ecology
Advertisements

RNA Secondary Structure Prediction
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Towards RNA structure prediction: 3D motif prediction and knowledge-based potential functions Christian Laing Tamar Schlick’s lab Courant Institute of.
The Wales Group in Context: Exploring Energy Landscapes Research Review by Ryan Babbush Applied Computation 298r February 8, 2013.
Modern Monte Carlo Methods: (2) Histogram Reweighting (3) Transition Matrix Monte Carlo Jian-Sheng Wang National University of Singapore.
On the formulation of a functional theory for pairing with particle number restoration Guillaume Hupin GANIL, Caen FRANCE Collaborators : M. Bender (CENBG)
1 Folding RNA a confluence of biology, mathematics, and physics A. Zee Institute for Theoretical Physics University of California Santa Barbara, CA, USA.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Feynman diagrams, RNA folding, and the transition polynomial Yongwu Rong Department of Mathematics George Washington University RNA in Biology, Bioengineering.
RNA Folding Xinyu Tang Bonnie Kirkpatrick. Overview Introduction to RNA Previous Work Problem Hofacker ’ s Paper Chen and Dill ’ s Paper Modeling RNA.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Improving Free Energy Functions for RNA Folding RNA Secondary Structure Prediction.
RNA Secondary Structure Prediction
Sónia Martins Bruno Martins José Cruz IGC, February 20 th, 2008.
Discovery of RNA Structural Elements Using Evolutionary Computation Authors: G. Fogel, V. Porto, D. Weekes, D. Fogel, R. Griffey, J. McNeil, E. Lesnik,
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Elasticity and structural phase transitions in single biopolymer systems Haijun Zhou ( 周海军 ) Institute of Theoretical Physics, the Chinese Academy of Sciences,
Sunmin Ahn Journal Club Presentation October 23, 2006
Finding Common RNA Pseudoknot Structures in Polynomial Time Patricia Evans University of New Brunswick.
Structural Alignment of Pseudoknotted RNAs Banu Dost, Buhm Han, Shaojie Zhang, Vineet Bafna.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
School of Physics & Astronomy FACULTY OF MATHEMATICAL & PHYSICAL SCIENCE Parallel Transport & Entanglement Mark Williamson 1, Vlatko Vedral 1 and William.
Non-coding RNA gene finding problems. Outline Introduction RNA secondary structure prediction RNA sequence-structure alignment.
Relating computational and physical complexity Computational complexity: How the number of computational steps needed to solve a problem scales with problem.
Monte Carlo Simulation of Interacting Electron Models by a New Determinant Approach Mucheng Zhang (Under the direction of Robert W. Robinson and Heinz-Bernd.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
Materials Process Design and Control Laboratory ON THE DEVELOPMENT OF WEIGHTED MANY- BODY EXPANSIONS USING AB-INITIO CALCULATIONS FOR PREDICTING STABLE.
Strand Design for Biomolecular Computation
A Kinetic Monte Carlo Study Of Ordering in a Binary Alloy Group 3: Tim Drews (ChE) Dan Finkenstadt (Physics) Xuemin Gu (MSE) CSE 373/MatSE 385/Physics.
Computational Prediction of RNA and DNA Secondary Structure Anne Condon Bioinformatics, and Empirical and Theoretical Algorithmics (BETA) Laboratory The.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
The Ising Model Mathematical Biology Lecture 5 James A. Glazier (Partially Based on Koonin and Meredith, Computational Physics, Chapter 8)
KIAS July 2006 RNA secondary structure Ground state and the glass transition of the RNA secondary structure RNA folding: specific versus nonspecific pairing.
8. Selected Applications. Applications of Monte Carlo Method Structural and thermodynamic properties of matter [gas, liquid, solid, polymers, (bio)-macro-
© Wiley Publishing All Rights Reserved. RNA Analysis.
RNA Structure Prediction
Classifying Pseudoknots Kyle L. Spafford. Classifying Pseudoknots -- Kyle Spafford 2 Recap – What’s a pseudoknot again? Substructure with non- nested.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
RNA Structure Prediction Including Pseudoknots Based on Stochastic Multiple Context-Free Grammar PMSB2006, June 18, Tuusula, Finland Yuki Kato, Hiroyuki.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Kernel Properties 2012 Computer Science PhD Showcase 17 February 2012 Roberto Valerio Dr. Ricardo Vilalta Pattern Analysis Lab.
Approximation Algorithms For Protein Folding Prediction Giancarlo MAURI,Antonio PICCOLBONI and Giulio PAVESI Symposium on Discrete Algorithms, pp ,
The role of the sigma meson in thermal models W. Broniowski¹ ˑ ², F. Giacosa¹ ˑ ³, V. Begun¹ ¹ Institute of Physics, Jan Kochanowski University, PL
Diffusion in Disordered Media Nicholas Senno PHYS /12/2013.
Workshop on Optimization in Complex Networks, CNLS, LANL (19-22 June 2006) Application of replica method to scale-free networks: Spectral density and spin-glass.
Structural Alignment of Pseudo-knotted RNA
Study of chemical potential effects on hadron mass by lattice QCD Pushkina Irina* Hadron Physics & Lattice QCD, Japan 2004 Three main points What do we.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
The Study Of Statistical Thermodynamics In Melting of Atomic Cluster Pooja Shrestha.
Materials Process Design and Control Laboratory ON THE DEVELOPMENT OF WEIGHTED MANY- BODY EXPANSIONS USING AB-INITIO CALCULATIONS FOR PREDICTING STABLE.
1 Discovery of Structural and Functional Features in RNA Pseudoknots Qingfeng Chen and Yi-Ping Phoebe Chen, Senior Member, IEEE IEEE TRANSACTIONS ON KNOWLEDGE.
Motif Search and RNA Structure Prediction Lesson 9.
Collaborators: Bugra Borasoy – Bonn Univ. Thomas Schaefer – North Carolina State U. University of Kentucky CCS Seminar, March 2005 Neutron Matter on the.
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
Internal loops within the RNA secondary structure can be worked out in an almost quadratic time stRNAgology, Haifa, 2006.
4.2 - Algorithms Sébastien Lemieux Elitra Canada Ltd.
RNAs. RNA Basics transfer RNA (tRNA) transfer RNA (tRNA) messenger RNA (mRNA) messenger RNA (mRNA) ribosomal RNA (rRNA) ribosomal RNA (rRNA) small interfering.
Overview of Molecular Dynamics Simulation Theory
What is frameshifting? Frame-shifting used to synthesize multiple
Predicting RNA Structure and Function
RNA Secondary Structure Prediction
Permeability of gases in glassy polymers by computer simulation
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
RNA 2D and 3D Structure Craig L. Zirbel October 7, 2010.
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Biointelligence Laboratory, Seoul National University
Presentation transcript:

Topological Methods for RNA Pseudoknots Nicole A. Larsen Georgia Institute of Technology Department of Mathematics Math 4803 – 04/21/2008

Overview Introduction to Pseudoknots Topological Representation and Classification Thermodynamic Calculations Conclusions and Open Problems

Pseudoknots RNA secondary structures with “crossing” base pairs Prevalent in nature Telomerase Viruses such as Hepatitis C, SARS Coronavirus, and even several strains of HIV Coronavirus

The Trouble with Pseudoknots Cannot be represented as a plane tree Current energy calculation methods do not hold About the only thing we can do is use recursive methods Single Hairpin Pseudoknot

Representing Pseudoknots

Topological Genus For a surface in 3-space: g=0 for a sphere, g=1 for a single-holed torus, g=2 for a double-holed torus… g=n for an n-holed torus. The genus of an RNA structure is defined by Bon et al. to be the minimum g such that the disk diagram can be drawn on a surface of genus g with no crossing arcs.

Calculating Genus Where P is the number of arcs in the diagram and L is the number of loops.

Properties of Genus Pseudoknot-free structures have genus 0. Stacked base pairs do not contribute to genus. For concatenated structures, genus is the sum of the two substructures. For nested structures, genus is the sum of the two substructures.

RNA Structures with Genus 1

Classification Results There are 4 primitive pseudoknots of genus 1 Pseudobase: Contains 246 pseudoknots 238 were H-pseudoknots or nested H-pseudoknots Only 1 had genus >1 World Wide Protein Database (wwPDB) Even very long RNA structures (~2000 bases) have low genus (<18) Primitive pseudoknots have genus 1 or 2 Expected genus for random RNA sequences ~ length/4

Classification Results (Left) Genus as a function of length of the RNA structure. (Right) A histogram of the genus of primitive RNA structures found in the wwPDB (Bon et al.)

What good is it, anyway? Genus gives us a way to measure the “complexity” of a pseudoknot If we can determine a relationship between topological genus and energy then we can use a minimum free energy approach for prediction

Thermodynamics and Quantum Matrix Field Theory RNA disk diagrams --------- Feynman diagrams Feynman diagrams representing the Lamb shift – Nothing to do with RNA at all!

Partition Function Thermodynamic partition function: where the sum ranges over all possible Feynman diagrams D for a given RNA sequence and E(D) is the energy of diagram D where the sum ranges over all possible Feynman diagrams D for a given RNA sequence and E(D) is the energy of diagram D

Results Vernizzi and Orland use a Monte Carlo method to generate RNA structures weighed by the partition function: Where  is a “topological potential energy” and g is genus. By adjusting  you can allow RNA structures of any genus, or restrict to small genus structures. Useful for rapidly exploring energy regions to find minimum energy structures. When  goes to infinity (PKF) results agree with mfold predictions. g/L ~ 0.23 for random sequences

Modeling with a Cubic Lattice Infinitely flexible polymer sequence Given by a self-avoiding random walk on a cubic lattice Each base lies on a vertex of the lattice Bases only bond with neighboring bases, modeled by “spin vectors” where the sum ranges over all possible Feynman diagrams D for a given RNA sequence and E(D) is the energy of diagram D

Results Average genus per unit energy where the sum ranges over all possible Feynman diagrams D for a given RNA sequence and E(D) is the energy of diagram D Average genus per unit energy

Results Average genus per unit length for the low-energy phase (left) and the high-energy phase (right) <g/L> = 0.141 ± 0.003 for low energy and <g/L> = (585 ± 8) x 10-6 for high energy

Conclusions Topological genus provides a nice, relatively easy classification scheme for pseudoknots Thermodynamic predictions based on genus agree with observations and with predictions given by mfold Low-genus structures are more likely to be found in nature.

Open Questions Create an algorithm for predicting secondary structures that may have pseudoknots Pillsbury, Orland, and Zee: steepest-descent method that takes O(L6) just to calculate partition function, much less optimal structures! Experimental measurement and cataloging of low-genus structures How does genus depend on temperature? Can genus be used to predict asymptotic behavior of very long sequences? Incorporation of higher-order considerations such as entropy

References Key Sources Mathematics Sources (found in MathSciNet) Bon, Michael, Graziano Vernizzi, Henri Orland, & A. Zee. “Topological Classification of RNA Structures.” ArXiv Quantitative Biology e-prints (2006): arXiv:q-bio/0607032v1. Orland, Henri, & A. Zee. “RNA Folding and Large N Matrix Theory.” Nucl.Phys. B620 (2002): 456-476. Vernizzi, Graziano, and Henri Orland. “Large-N Random Matrices for RNA Folding.” Acta Physica Polonica B 36(2005): 2821-2827. Vernizzi, Graziano, Paulo Ribeca, Henri Orland, & A. Zee. “Topology of Pseudoknotted Homopolymers.” Physical Review E 73(2006). Mathematics Sources (found in MathSciNet) Karp, Richard M. “Mathematical Challenges from Genomics and Molecular Biology.” Notices of the AMS 49(2002): 544-553. Pillsbury, M., J. A. Taylor, H. Orland, & A. Zee. “An Algorithm for RNA Pseudoknots.” ArXiv Condensed Matter e-prints (2005): arXiv:cond-mat/0310505. Rivas, Elena, and Sean R. Eddy. “A Dynamic Programming Algorithm for RNA Structure Prediction Including Pseudoknots.” Journal of Molecular Biology, Vol. 285 No 5 (5 February 1999), pp 2053-2068. Vernizzi, Graziano, Henri Orland, & A. Zee. “Enumeration of RNA Structures by Matrix Models.” Phys Rev Lett. 94(2006). Zee, A. “Random Matrix Theory and RNA Folding.” Acta Physica Polonica B 36(2005): 2829-2836. Biology Sources (found in PubMed) Brierley, Ian, Simon Pennell, and Robert J. C. Gilbert. “Viral RNA Pseudoknots: Versatile Motifs in Gene Expression and Replication.” Nature Reviews Microbiology 5(2007): 598-610. Chen, Jiunn-Liang, and Carol W. Greider. “Functional Analysis of the Pseudoknot Structure in Human Telomerase RNA.” Proceedings of the National Academy of Sciences 102(2005): 8080-8085. Maugh, Thomas H. “RNA Viruses: The Age of Innocence Ends.” Science, New Series, Vol. 183, No. 4130. (Mar. 22, 1974), pp. 1181-1185. Tu, Chialing, Tzy-Hwa Tzeng, and Jeremy A. Bruenn. “Ribosomal Movement Impeded at a Pseudoknot Required for Frameshifting.” Proceedings of the National Academy of Sciences of the United States of America, Vol. 89, No. 18. (Sep. 15, 1992), pp. 8636-8640. Other Sources Rong, Yongwu. “Feynman diagrams, RNA folding, and the transition polynomial.” IMA Annual Program Year Workshop: RNA in Biology, Bioengineering and Nanotechnology. October 29-November 2, 2007. Staple DW, Butcher SE (2005) “Pseudoknots: RNA Structures with Diverse Functions.” PLoS Biol 3(6) (2005), e213 doi:10.1371/journal.pbio.0030213.

THE END