Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P)

Slides:



Advertisements
Similar presentations
RNA Secondary Structure Prediction
Advertisements

Gene expression From Gene to Protein
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
6 - 1 Chapter 6 The Secondary Structure Prediction of RNA.
Nucleic Acids - RNA and DNA is a complex, high-molecular- weight biochemical macromolecule composed of chains that convey genetic information. The most.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Improving Free Energy Functions for RNA Folding RNA Secondary Structure Prediction.
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
LECTURE 5: DNA, RNA & PROTEINS
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
. Class 5: RNA Structure Prediction. RNA types u Messenger RNA (mRNA) l Encodes protein sequences u Transfer RNA (tRNA) l Adaptor between mRNA molecules.
CISC667, F05, Lec19, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) RNA secondary structure.
Predicting RNA Structure and Function
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
Dynamic Programming (cont’d) CS 466 Saurabh Sinha.
DNA Past Paper Questions. 1. Draw as simple diagram of the molecular structure of DNA. 5 marks.
Introducing DNA. Outline DNA nucleotide structure in terms of sugar, base, and phosphate.
DNA Replication.
GENE EXPRESSION.
RNA-Seq and RNA Structure Prediction
Non-coding RNA gene finding problems. Outline Introduction RNA secondary structure prediction RNA sequence-structure alignment.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Structure and function of nucleic acids.. Heat. Heat flows through the boundary of the system because there exists a temperature difference between the.
Posttranscriptional Modification
Strand Design for Biomolecular Computation
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
1  Walter Sutton discovered chromosomes were made of DNA and Protein  However, scientists were NOT sure which one (protein or DNA) was the actual genetic.
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
1 TRANSCRIPTION AND TRANSLATION. 2 Central Dogma of Gene Expression.
KIAS July 2006 RNA secondary structure Ground state and the glass transition of the RNA secondary structure RNA folding: specific versus nonspecific pairing.
Lecture 9 CS5661 RNA – The “REAL nucleic acid” Motivation Concepts Structural prediction –Dot-matrix –Dynamic programming Simple cost model Energy cost.
RNA secondary structure RNA is (usually) single-stranded The nucleotides ‘want’ to pair with their Watson-Crick complements (AU, GC) They may ‘settle’
Do you know… What does the central dogma of modern biology say? What are the two main steps in Protein Synthesis?
RNA Structure Prediction
PROTEIN SYNTHESIS The formation of new proteins using the code carried on DNA.
Molecular Biology of the Gene  1952—Hershey & Chase determine that DNA rather than protein carries genetic information  1953—Rosalind Franklin captures.
Transcription in Prokaryotic (Bacteria) The conversion of DNA into an RNA transcript requires an enzyme known as RNA polymerase RNA polymerase – Catalyzes.
NUCLEIC ACIDS (2).
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:
Bailee Ludwig Quality Management. Before we get started…. ….Let’s see what you know about Genomics.
This seems highly unlikely.
RNA and Gene Expression BIO 224 Intro to Molecular and Cell Biology.
Motif Search and RNA Structure Prediction Lesson 9.
RNA Structure Prediction
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
HW7: Evolutionarily conserved segments ENCODE region 009 (beta-globin locus) Multiple alignment of human, dog, and mouse 2 states: neutral (fast-evolving),
8.1Nucleotides Starter What is DNA ? Where will you find DNA in a cell? Draw the basic structure of a pentose sugar.
RNA MODIFICATION Eukaryotic mRNA molecules are modified before they exit the nucleus.
RNAs. RNA Basics transfer RNA (tRNA) transfer RNA (tRNA) messenger RNA (mRNA) messenger RNA (mRNA) ribosomal RNA (rRNA) ribosomal RNA (rRNA) small interfering.
Structure of Nucleic Acids
MCB 7200: Molecular Biology
Fig Prokaryotes and Eukaryotes
Gene Expression: From Gene to Protein
GENE EXPRESSION © 2016 Paul Billiet ODWS.
Predicting RNA Structure and Function
RNA Secondary Structure Prediction
Nucleic Acids and Protein Synthesis
Gene Expression: From Gene to Protein
Relationship between Genotype and Phenotype
Chapter 17 From Gene to Protein.
RNA 2D and 3D Structure Craig L. Zirbel October 7, 2010.
Gene Expression: From Gene to Protein
credit: modification of work by NIH
Comparison Of DNA And RNA Synthesis in Prokaryotes and Eukaryotes
Presentation transcript:

Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P) viral genomes artificial ribozymes

Typical transfer RNA structure

Bulges Internal loops Hairpin loop Multi-branched loop CG UA UA U C C U }  G = -2.1 kcal/mol }  G = -1.2 kcal/mol loop  G = kcal/mol Thermodynamics parameters are measured on real molecules. Helix formation = hydrogen bonds + stacking Entropic penalty for loop formation. Sum up contributions of helices and loops over the whole structure.

i k j l (a) lk ji (b) i k j l (c) Pairs i-j and k-l are compatible if (a) i < j < k < l, or (b) i < k < l < j. (c) is called a pseudoknot: i < k < j < l. Usually not counted as secondary structure. Bracket notation is used to represent structure: a: ((((....))))..((((....)))) b: ((.((((....)))).)) Basic problem: Want an algorithm that considers every allowed secondary structure for a given sequence and finds the lowest energy state.

ijij-1j ikj = or Simplest case: find structure which maximizes number of base pairs. Let  = -1 if bases can pair and +  if not. Ignore loop contributions. E(i,j) = energy of min energy structure for chain segment from i to j. We want E(1,N). Algorithms that work by recursion relations like this are called dynamic programming. The algorithm is O(N 3 ) although the number of structures increases exponentially with N. Also need to do backtracking to work out the minimum energy structure: Set B(i,j) = k if j is paired with k, or 0 if unpaired.

Partition Function Algorithm (for simplest energy rules) Real Energy Rules : Need to consider many special cases. What type of loop are you closing? Algorithm is more complex but still is O(N 3 ). ijij-1jikj = or where

Equilibrium probability that base i is paired with j Equilibrium probability that base i is unpaired 1ij N Example of pairing probabilities taken from Vienna package web-site

AB C D B E C D F B D G H i ii iii I Is folding kinetics important? RNA folding kinetics involves reorganisation of secondary structure Native structures may not be global minimum free energy states. Morgan & Higgs (1996) J. Chem. Phys.

QuantityFitting FunctionParameters Groundstate energy C 1 = 2.9 (  0.2)  = (  0.001) Total number of states C 2 = -5.6 (  0.4)  = (  0.001) Number of groundstates C 3 = 1.75 (  0.2)  = (  0.001) Energy Landscapes in RNA Folding Morgan & Higgs (1998) Groundstates are degenerate in this model because energies are integers. Generate many random groundstates. How far apart are these groundstates? How high are the barriers between groundstates?

We found Frozen pairs (present in every groundstate) This figure shows the frozen pairs only. The molecule is divided into independent unfrozen loops. Define N eff as the length of the longest loop. Two groundstates for the same sequence

Minimum Free Energy Prediction Deterministic. Always gets MFE structure for a given set of energy rules. If MFE structure is not the same as biological structure, this could be because (i) energy rules are inaccurate or insufficient (ii) kinetics is important and molecule is trapped in metastable state. Monte Carlo simulations of folding kinetics. Store a current structure. Estimate rates of removal of existing helices and rates of addition of other compatible helices. Choose one helix to be added or removed with probability proportional to its rate. Repeat this many times. Can simulate structure formation from an unfolded state.

Q  is a bacteriophage RNA virus with approx 4000 nucleotides Viral RNA has complex secondary structure. The replicase gene codes for the replicase protein. This is an RNA-dependent RNA polymerase. Synthesizes complementary strand. Viral replication needs two steps: plus to minus to plus.

In vitro RNA evolution in the Q  system cccc Begin with Replicase + nucleotides + viral RNA Replicase + nucleotides only sequence RNA after many transfers Transfer small quantity to each successive tube

Barrier heights between alternative groundstates Observation: Mean barrier height between groundstates scales as ~ N eff 0.5 N eff ~ 0.3 N Therefore barriers become significant for large enough sequences.

An example where kinetics is important to control biological function: the 5’ region of the MS2 phage Maturation protein

Time to formation of the 5’ structure influences expression of the maturation protein more than the stability of this structure. Simulations compare with experiments on mutant sequences.

RNA in comparison to Proteins Both have well defined 3d structures RNA folding problem is easier because secondary structure separates from tertiary structure more easily - But it is still a complex problem. RNA model has real parameters therefore you can say something about real molecules. RNA folding algorithm is simple enough to be able to do statistical physics. (cf. 27-mer lattice protein models).

Part of sequence alignment of Mitochondrial Small Sub-Unit rRNA Full gene is length ~ Primate species with mouse as outgroup

Murphy et al. Nature (2001) uses 15 nuclear plus 3 mitochondrial proteins

Afrotheria / Laurasiatheria Striking examples of convergent evolution

Cao et al. (2000) Gene uses 12 mitochondrial proteins

RNA pairs model (GR7) 53 complete Mammalian mitochondrial genomes Complete set of rRNAs + tRNAs from = 973 pairs. Jow et al. (2002)

AB E D C 3 B C D A E 4 A D E BC 1 A E D BC 2 MCMC searches the rugged landscape in tree space using the Metropolis algorithm. Obtains a set of possible trees weighted according to their likelihood. 1. Rate parameter changes = continuous 2. Branch length changes = continuous 3. Topology changes = discrete Nearest-neighbour interchange Long-range move

Models of Sequence Evolution P ij (t) = probability of being in state j at time t given that ancestor was in state i at time 0. States label bases A,C,G & T i t j r ij is the rate of substitution from state i to state j

The HKY model describes rate of evolution of single sites to from The frequencies of the four bases are  is the transition-transversion rate parameter * means minus the sum of elements on the row

((((((( ))))))) Bacillus subtilis GGCUCGG CCGAGCC Escherichia coli GCCCGGA UCCGGGC Saccharomyces cerevisiae GCGGAUU AAUUCGC Drosophila melanogaster GCCGAAA UUUCGGC Homo sapiens GCCGAAA UUUCGGC Compensatory Substitutions Two sides of the acceptor stem from a tRNA are shown. Due to structure conservation alignment is possible in widely different species.

Model 7A is a General Reversible 7-state Model 7 frequencies  i + 21 rate parameters  ij - 2 constraints = 26 free parameters

Probability of remaining in same state P ii SSU rRNA sequences from Eubacteria

Probability P ij of changes from CG to other pairs SSU rRNA from Eubacteria

AU GC GU CG UA UG slow fast Selection against GU and UG is weaker than against mismatches. Double transitions are faster than double transversions. Double transitions are faster than single transitions to GU and UG states. This is explained by the theory of compensatory substitutions. What is going on?

tRNA mitoch. tRNA general tRNA archaea Rnase PSSU rRNA G+C average G+C helical regions Frequencies GC CG AU UA GU UG MM Number of sequences Number of pairs Analysis of RNA sequence databases Selection for thermodynamically stable structures Higgs (2000) Quart. Rev. Biophysics

tRNA mitoch. tRNA general tRNA archaea Rnase PSSU rRNA Mutabilities GC CG AU UA GU UG MM Double transitions / Double transversions Double transitions / Transitions to GU or UG Analysis of RNA Substitution Rates Thermodynamic properties influence Evolutionary properties