Predicting RNA Structure and Function

Slides:



Advertisements
Similar presentations
RNA Secondary Structure Prediction
Advertisements

Gene expression From Gene to Protein
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Two short pieces MicroRNA Alternative splicing.
6 - 1 Chapter 6 The Secondary Structure Prediction of RNA.
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
RNA Structure Prediction
Predicting the 3D Structure of RNA motifs Ali Mokdad – UCSF May 28, 2007.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Computational biology seminar
RNA Secondary Structure Prediction
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
MicroRNA genes Ka-Lok Ng Department of Bioinformatics Asia University.
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
[Bejerano Fall10/11] 1.
. Class 5: RNA Structure Prediction. RNA types u Messenger RNA (mRNA) l Encodes protein sequences u Transfer RNA (tRNA) l Adaptor between mRNA molecules.
1 Ref: Ch. 5 Mount: Bioinformatics i.Protein synthesis: ribosomal RNA transfer RNA messenger RNA ii.Catalysis e.g. ribozymes iii.Regulatory molecules 17.1.
Predicting RNA Structure and Function
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
RNA Secondary Structure Prediction Introduction RNA is a single-stranded chain of the nucleotides A, C, G, and U. The string of nucleotides specifies the.
RNA informatics Unit 12 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.
Non-coding RNA gene finding problems. Outline Introduction RNA secondary structure prediction RNA sequence-structure alignment.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Structure and function of nucleic acids.. Heat. Heat flows through the boundary of the system because there exists a temperature difference between the.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Gene Expression and Gene Regulation. The Link between Genes and Proteins At the beginning of the 20 th century, Garrod proposed: – Genetic disorders such.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
RNA folding & ncRNA discovery I519 Introduction to Bioinformatics, Fall, 2012.
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
© Wiley Publishing All Rights Reserved. RNA Analysis.
Lecture 9 CS5661 RNA – The “REAL nucleic acid” Motivation Concepts Structural prediction –Dot-matrix –Dynamic programming Simple cost model Energy cost.
RNA secondary structure RNA is (usually) single-stranded The nucleotides ‘want’ to pair with their Watson-Crick complements (AU, GC) They may ‘settle’
Do you know… What does the central dogma of modern biology say? What are the two main steps in Protein Synthesis?
RNA Structure Prediction
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:
Motif Search and RNA Structure Prediction Lesson 9.
RNA Structure Prediction
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
RNAs. RNA Basics transfer RNA (tRNA) transfer RNA (tRNA) messenger RNA (mRNA) messenger RNA (mRNA) ribosomal RNA (rRNA) ribosomal RNA (rRNA) small interfering.
AAA AAAU AAUUC AUUC UUCCG UCCG CCGG G G Karen M. Pickard CISC889 Spring 2002 RNA Secondary Structure Prediction.
Gene Expression - Transcription
Biochemistry Free For All
Fig Prokaryotes and Eukaryotes
Halfway Feedback (yours)
Lecture 8.2.
Protein Synthesis Part 3
Lab 8.3: RNA Secondary Structure
RNA Secondary Structure Prediction
Protein Synthesis Notes
RNA Secondary Structure Prediction
Protein Synthesis Part 3
Central Dogma of Molecular Biology From Genes to Protein
Protein Synthesis Part 3
Synthetic Biology: Protein Synthesis
Coordinately Controlled Genes in Eukaryotes
MicroRNAs: regulators of gene expression and cell differentiation
Identification and Characterization of pre-miRNA Candidates in the C
Protein synthesis: Overview
TRANSLATION AHL Topic 7.3 IB Biology Miss Werba
Transcription & Translation
RNA 2D and 3D Structure Craig L. Zirbel October 7, 2010.
credit: modification of work by NIH
DNA Deoxyribonucleic Acid.
Presentation transcript:

Predicting RNA Structure and Function

According to the central dogma of molecular biology the main role of DNA RNA protein According to the central dogma of molecular biology the main role of RNA is to transfer genetic information from DNA to protein

RNA has many other biological functions Protein synthesis (ribosome) Control of mRNA stability (UTR) Control of splicing (snRNP) Control of translation (microRNA) The function of the RNA molecule depends on its folded structure

Ribozyme Ribosome Nobel prize 1989 Nobel prize 2009

Protein structures RNA structures ~Total 80,000 Total ~800

RNA Structural levels Secondary Structure Tertiary Structure tRNA

RNA Secondary Structure RNA bases are G, C, A, U The RNA molecule folds on itself. The base pairing is as follows: G C A U G U hydrogen bond. U U C G U A A U G C 5’ 3’ 5’ G A U C U U G A U C 3’

RNA Secondary structure Short Range Interactions A U U G C C G G A U A G C A G C U U G HAIRPIN LOOP BULGE INTERNAL LOOP STEM DANGLING ENDS 5’ 3’

The function of the RNA molecule depends on its folded structure Example: mRNA structure involved in control of Iron levels Iron Responsive Element IRE G U A G C N N N’ C conserved Recognized by IRP1, IRP2 5’ 3’

F: Ferritin = iron storage TR: Transferin receptor = iron uptake IRP1/2 IRE 3’ 5’ F mRNA IRP1/2 3’ TR mRNA 5’ Low Iron IRE-IRP inhibits translation of ferritin IRE-IRP Inhibition of degradation of TR High Iron IRE-IRP off -> ferritin translated Transferin receptor degradated

Predicting RNA secondary Structure Most common approach: Zucker & Stiegler (1981) Search for a RNA structure with a Minimal Free Energy (MFE) U U C G U A A U G C G A U C U U G A U C

Free energy of a structure is the sum of all interactions energies Free energy model Free energy of a structure is the sum of all interactions energies exclude coaxial stacking, metal ions, nonstandard bonds, folding pathway, etc Free Energy(E) = E(CG)+E(CG)+….. Each interaction energy can be calculated thermodynamicly

Why is MFE secondary structure prediction hard? MFE structure can be found by calculating free energy of all possible structures BUT the number of potential structures grows exponentially with the number, n, of bases

RNA folding with Dynamic programming (Zucker and Steigler) W(i,j): MFE structure of substrand from i to j W(i,j) i j

RNA folding with dynamic programming Assume a function W(i,j) which is the MFE for the sequence starting at i and ending at j (i<j) Define scores, for example (CG) =-1 (CA)=1 (we want a negative score ) Consider 4 possibilities: i,j are a base pair, added to the structure for i+1..j-1 i is unpaired, added to the structure for i+1..j j is unpaired, added to the structure for i..j-1 i,j are paired, but not to each other; W(i,j) i (i+1) (j-1) j Choose the minimal energy possibility

Simplifying Assumptions for Structure Prediction RNA folds into one minimum free-energy structure. The energy of a particular base can be calculated independently Neighbors do not influence the energy.

Sequence dependent free-energy Nearest Neighbor Model U U C G G C A U A UCGAC 3’ U U C G U A A U G C A UCGAC 3’ 5’ 5’ Energy is influenced by the previous base pair (not by the base pairs further down).

Sequence dependent free-energy values of the base pairs (nearest neighbor model) U U C G G C A U A UCGAC 3’ U U C G U A A U G C A UCGAC 3’ 5’ 5’ These energies are estimated experimentally from small synthetic RNAs. Example values: GC GC GC GC AU GC CG UA -2.3 -2.9 -3.4 -2.1

Adding Complexity to Energy Calculations Positive energy - added for destabilizing regions such as bulges, loops, etc. More than one structure can be predicted

Free energy computation U U A A G C A U A A U C G A 3’ 5’ +5.9 4 nt loop -1.1 mismatch of hairpin -2.9 stacking +3.3 1nt bulge -2.9 stacking -1.8 stacking -0.9 stacking -1.8 stacking 5’ dangling -2.1 stacking -0.3 G= -4.6 KCAL/MOL -0.3

Mfold :Adding Complexity to Energy Calculations Positive energy - added for destabilizing regions such as bulges, loops, etc. More than one structure can be predicted

More than one structure can be predicted for the same RNA GNAS1 mRNA folding structures predicted by MFOLD. The mRNA sequence carrying the T393C polymorphism was used for secondary folding structure model building by the use of the computer program MFOLD (26). Frey U H et al. Clin Cancer Res 2005;11:5071-5077 ©2005 by American Association for Cancer Research

RNA fold prediction based on Multiple Alignment Information from multiple sequence alignment (MSA) can help to predict the probability of positions i,j to be base-paired. G C C U U C G G G C G A C U U C G G U C G G C U U C G G C C

Compensatory Substitutions Mutations that maintain the secondary structure can help predict the fold U U C G U A A U G C A UCGAC 3’ G C 5’

G C C U U C G G G C G A C U U C G G U C G G C U U C G G C C RNA secondary structure can be revealed by identification of compensatory mutations U C U G C G N N’ G C G C C U U C G G G C G A C U U C G G U C G G C U U C G G C C

Insight from Multiple Alignment Information from multiple sequence alignment (MSA) can help to predict the probability of positions i,j to be base-paired. Conservation – no additional information Consistent mutations (GC GU) – support stem Inconsistent mutations – does not support stem. Compensatory mutations – support stem.

RNA families Rfam : General non-coding RNA database (most of the data is taken from specific databases) http://www.sanger.ac.uk/Software/Rfam/ Includes many families of non coding RNAs and functional motifs, as well as their alignment and their secondary structures

An example of an RNA family miR-1 MicroRNAs mir-1 microRNA precursor family This family represents the microRNA (miRNA) mir-1 family. miRNAs are transcribed as ~70nt precursors (pre-mir) and subsequently processed to give a ~22nt product (miRNA=mir). The products are thought to have regulatory roles through complementarity to mRNA.

Seed alignment (based on 7 sequences)