Lecture 9 CS5661 RNA – The “REAL nucleic acid” Motivation Concepts Structural prediction –Dot-matrix –Dynamic programming Simple cost model Energy cost.

Slides:



Advertisements
Similar presentations
B. Knudsen and J. Hein Department of Genetics and Ecology
Advertisements

RNA Secondary Structure Prediction
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Stochastic Context Free Grammars for RNA Modeling CS 838 Mark Craven May 2001.
Predicting the 3D Structure of RNA motifs Ali Mokdad – UCSF May 28, 2007.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Pattern Discovery in RNA Secondary Structure Using Affix Trees (when computer scientists meet real molecules) Giulio Pavesi& Giancarlo Mauri Dept. of Computer.
Improving Free Energy Functions for RNA Folding RNA Secondary Structure Prediction.
RNA Secondary Structure Prediction
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez May 20, 2003.
RNA structure analysis Jurgen Mourik & Richard Vogelaars Utrecht University.
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
. Class 5: RNA Structure Prediction. RNA types u Messenger RNA (mRNA) l Encodes protein sequences u Transfer RNA (tRNA) l Adaptor between mRNA molecules.
CISC667, F05, Lec19, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) RNA secondary structure.
Computational Biology, Part 2 Sequence Comparison with Dot Matrices Robert F. Murphy Copyright  1996, All rights reserved.
Predicting RNA Structure and Function
Bioinformatics Gene Introduction Oct NTUST.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
Dynamic Programming (cont’d) CS 466 Saurabh Sinha.
Motif finding: Lecture 1 CS 498 CXZ. From DNA to Protein: In words 1.DNA = nucleotide sequence Alphabet size = 4 (A,C,G,T) 2.DNA  mRNA (single stranded)
Information theoretic interpretation of PAM matrices Sorin Istrail and Derek Aguiar.
Case Study. DNA Deoxyribonucleic acid (DNA) is a nucleic acid that contains the genetic instructions used in the development and functioning of all known.
RNA-Seq and RNA Structure Prediction
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Pairwise & Multiple sequence alignments
Non-coding RNA gene finding problems. Outline Introduction RNA secondary structure prediction RNA sequence-structure alignment.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Computational Biology, Part 3 Sequence Alignment Robert F. Murphy Copyright  1996, All rights reserved.
Alineamiento Matricial (Harr Plot, Matrix Plot, Dot Plot, Dot Matrix)
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Lecture 6. Pairwise Local Alignment and Database Search Csc 487/687 Computing for bioinformatics.
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
RNA secondary structure RNA is (usually) single-stranded The nucleotides ‘want’ to pair with their Watson-Crick complements (AU, GC) They may ‘settle’
Improving the prediction of RNA secondary structure by detecting and assessing conserved stems Xiaoyong Fang, et al.
CS5263 Bioinformatics RNA Secondary Structure Prediction.
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
Prediction of Secondary Structure of RNA
Doug Raiford Lesson 7.  RNA World Hypothesis  RNA world evolved into the DNA and protein world  DNA advantage: greater chemical stability  Protein.
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
Sequence Alignment.
Motif Search and RNA Structure Prediction Lesson 9.
Tracking down ncRNAs in the genomes. How to find ncRNA gene The stability of ncRNA secondary structure is not sufficiently different from the predicted.
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
4.2 - Algorithms Sébastien Lemieux Elitra Canada Ltd.
RNAs. RNA Basics transfer RNA (tRNA) transfer RNA (tRNA) messenger RNA (mRNA) messenger RNA (mRNA) ribosomal RNA (rRNA) ribosomal RNA (rRNA) small interfering.
AAA AAAU AAUUC AUUC UUCCG UCCG CCGG G G Karen M. Pickard CISC889 Spring 2002 RNA Secondary Structure Prediction.
Multiple sequence alignment (msa)
Stochastic Context-Free Grammars for Modeling RNA
Vienna RNA web servers
Predicting RNA Structure and Function
RNA Secondary Structure Prediction
RNA Secondary Structure Prediction
Stochastic Context-Free Grammars for Modeling RNA
Structure Prediction dmitra 11/18/2018.
Dynamic Programming (cont’d)
RNA Secondary Structure Prediction
RNA 2D and 3D Structure Craig L. Zirbel October 7, 2010.
RNA is like a computer disk or flash drive
Paradigms for Computational Nucleic Acid Design
Multiple Sequence Alignment (I)
CISC 467/667 Intro to Bioinformatics (Spring 2007) RNA secondary structure CISC667, S07, Lec19, Liao.
Presentation transcript:

Lecture 9 CS5661 RNA – The “REAL nucleic acid” Motivation Concepts Structural prediction –Dot-matrix –Dynamic programming Simple cost model Energy cost model –Covariance

Lecture 9 CS5662 Motivation First molecule of life - “Jack of all trades, by necessity” –RNA genes produce numerous kinds of RNA (distinct from mRNA from protein genes) tRNA, rRNA, components in splicing, protein trafficking Basis of global classification of life –Phylogenetic studies of RNA shed light on relationship between various life forms

Lecture 9 CS5663 Concepts RNA has structure –Base pairing “No date? No problem, I complement myself, if imperfectly.” –Stem, Loop, Bulge, Junction Promiscuous base pairing –Not just GC, AU, but also GU 1:n mapping between structure and sequence –Distinction between structure and sequence as evolutionary constraints

Lecture 9 CS5664 Dot-matrix Analysis 3’ Flip sequence Generate complement Compare result with original sequence Limitation: Which is the best structure?

Lecture 9 CS5665 Maximum base-pairs algorithm Premise: Best structure is structure with maximum base-pairs Approach: Dynamic Programming Optimality: Best structure for RNA sequence must be based on best structure of subsequences Scoring function: Base-pair = 1; Non-base- pair = 0;

Lecture 9 CS5666 Maximum base-pairs algorithm Algorithm: S(i,j) = max[ {S(i+1, j-1) + m ij }, {S(i+1, j)}, {S(i, j-1)}, max k | (i<k<j) {S(i, k) + S(k+1, j)} ]

Lecture 9 CS5667 Maximum base-pairs algorithm

Lecture 9 CS5668 Case for better cost function Limitations of maximal base pair algorithm: –Fails to distinguish between contiguous versus scattered base-pairs –Ignores other energetic contributions Stacking interactions (“Affinity between adjacent ballroom dancing pairs”) RNA flexible, but not infinitely so (“India rubber man, but still can’t fold into box”). Need to penalize distorted structures, proportional to degree of distortion

Lecture 9 CS5669 Minimum Energy Algorithm (MFOLD) Key differences –Score n base-pair interactions not explicitly but indirectly as  n-1 stacking interactions –Stacking costs depend on exact sequence of base-pairs (Table 8.2a) –Also penalize non-paired bases, differentially (Table 8.2b) –More involved implementation –Suboptimal structures also found

Lecture 9 CS56610 Minimum Energy Algorithm (MFOLD) Steps (Fig 8.6, 8.7) –First, exhaustive enumeration of possible pairs, noting exact pairs –Identify local diagonals and score based on stacking costs –Use these values, instead of just number of base pairs, to carry out dynamic programming similar to base-pair counting algorithm

Lecture 9 CS56611 Covariance based analysis Based on –Base pairing is conserved, even if individual bases are not –Structure (pattern of base pairing, not base-pairs per se) is of key importance for RNA molecules Approach –Multiply align RNA sequences with similar function –Identify columns across which base-pairing is conserved by statistical analysis of joint probability versus random probability Benefits –Identify structural core elements –Rank search results of structure prediction methods

Lecture 9 CS56612 Miscellany Stochastic grammars for RNA structure prediction/analysis (Fig. 8.16) –Productions like “G. non-terminal. C” encode complementarity Searches for RNA genes –Sequence alignment per se not useful –Searches based on self-complementary regions in sliding windows useful