Presentation is loading. Please wait.

Presentation is loading. Please wait.

Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P)

Similar presentations


Presentation on theme: "Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P)"— Presentation transcript:

1 Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P) viral genomes artificial ribozymes

2 Typical transfer RNA structure

3

4

5 Bulges Internal loops Hairpin loop Multi-branched loop CG UA UA U C C U }  G = -2.1 kcal/mol }  G = -1.2 kcal/mol loop  G = + 4.5 kcal/mol Thermodynamics parameters are measured on real molecules. Helix formation = hydrogen bonds + stacking Entropic penalty for loop formation. Sum up contributions of helices and loops over the whole structure.

6 i k j l (a) lk ji (b) i k j l (c) Pairs i-j and k-l are compatible if (a) i < j < k < l, or (b) i < k < l < j. (c) is called a pseudoknot: i < k < j < l. Usually not counted as secondary structure. Bracket notation is used to represent structure: a: ((((....))))..((((....)))) b: ((.((((....)))).)) Basic problem: Want an algorithm that considers every allowed secondary structure for a given sequence and finds the lowest energy state.

7 ijij-1j ikj = or Simplest case: find structure which maximizes number of base pairs. Let  = -1 if bases can pair and +  if not. Ignore loop contributions. E(i,j) = energy of min energy structure for chain segment from i to j. We want E(1,N). Algorithms that work by recursion relations like this are called dynamic programming. The algorithm is O(N 3 ) although the number of structures increases exponentially with N. Also need to do backtracking to work out the minimum energy structure: Set B(i,j) = k if j is paired with k, or 0 if unpaired.

8 Partition Function Algorithm (for simplest energy rules) Real Energy Rules : Need to consider many special cases. What type of loop are you closing? Algorithm is more complex but still is O(N 3 ). ijij-1jikj = or where

9 Equilibrium probability that base i is paired with j Equilibrium probability that base i is unpaired 1ij N Example of pairing probabilities taken from Vienna package web-site

10 AB C D B E C D F B D G H i ii iii I Is folding kinetics important? RNA folding kinetics involves reorganisation of secondary structure Native structures may not be global minimum free energy states. Morgan & Higgs (1996) J. Chem. Phys.

11 QuantityFitting FunctionParameters Groundstate energy C 1 = 2.9 (  0.2)  = -0.368 (  0.001) Total number of states C 2 = -5.6 (  0.4)  = 0.533 (  0.001) Number of groundstates C 3 = 1.75 (  0.2)  = 0.068 (  0.001) Energy Landscapes in RNA Folding Morgan & Higgs (1998) Groundstates are degenerate in this model because energies are integers. Generate many random groundstates. How far apart are these groundstates? How high are the barriers between groundstates?

12 We found Frozen pairs (present in every groundstate) This figure shows the frozen pairs only. The molecule is divided into independent unfrozen loops. Define N eff as the length of the longest loop. Two groundstates for the same sequence

13 Minimum Free Energy Prediction Deterministic. Always gets MFE structure for a given set of energy rules. If MFE structure is not the same as biological structure, this could be because (i) energy rules are inaccurate or insufficient (ii) kinetics is important and molecule is trapped in metastable state. Monte Carlo simulations of folding kinetics. Store a current structure. Estimate rates of removal of existing helices and rates of addition of other compatible helices. Choose one helix to be added or removed with probability proportional to its rate. Repeat this many times. Can simulate structure formation from an unfolded state.

14 Q  is a bacteriophage RNA virus with approx 4000 nucleotides Viral RNA has complex secondary structure. The replicase gene codes for the replicase protein. This is an RNA-dependent RNA polymerase. Synthesizes complementary strand. Viral replication needs two steps: plus to minus to plus.

15 In vitro RNA evolution in the Q  system cccc Begin with Replicase + nucleotides + viral RNA Replicase + nucleotides only sequence RNA after many transfers Transfer small quantity to each successive tube

16 Barrier heights between alternative groundstates Observation: Mean barrier height between groundstates scales as ~ N eff 0.5 N eff ~ 0.3 N Therefore barriers become significant for large enough sequences.

17

18 An example where kinetics is important to control biological function: the 5’ region of the MS2 phage. 3500 130 Maturation protein

19 Time to formation of the 5’ structure influences expression of the maturation protein more than the stability of this structure. Simulations compare with experiments on mutant sequences.

20 RNA in comparison to Proteins Both have well defined 3d structures RNA folding problem is easier because secondary structure separates from tertiary structure more easily - But it is still a complex problem. RNA model has real parameters therefore you can say something about real molecules. RNA folding algorithm is simple enough to be able to do statistical physics. (cf. 27-mer lattice protein models).

21 Part of sequence alignment of Mitochondrial Small Sub-Unit rRNA Full gene is length ~950 11 Primate species with mouse as outgroup

22 Murphy et al. Nature (2001) uses 15 nuclear plus 3 mitochondrial proteins

23 Afrotheria / Laurasiatheria Striking examples of convergent evolution

24 Cao et al. (2000) Gene uses 12 mitochondrial proteins

25 RNA pairs model (GR7) 53 complete Mammalian mitochondrial genomes Complete set of rRNAs + tRNAs from = 973 pairs. Jow et al. (2002) 86 97 100

26 AB E D C 3 B C D A E 4 A D E BC 1 A E D BC 2 MCMC searches the rugged landscape in tree space using the Metropolis algorithm. Obtains a set of possible trees weighted according to their likelihood. 1. Rate parameter changes = continuous 2. Branch length changes = continuous 3. Topology changes = discrete Nearest-neighbour interchange Long-range move

27 Models of Sequence Evolution P ij (t) = probability of being in state j at time t given that ancestor was in state i at time 0. States label bases A,C,G & T i t j r ij is the rate of substitution from state i to state j

28 The HKY model describes rate of evolution of single sites to from The frequencies of the four bases are  is the transition-transversion rate parameter * means minus the sum of elements on the row

29 1234567 7654321 ((((((( ))))))) Bacillus subtilis GGCUCGG CCGAGCC Escherichia coli GCCCGGA UCCGGGC Saccharomyces cerevisiae GCGGAUU AAUUCGC Drosophila melanogaster GCCGAAA UUUCGGC Homo sapiens GCCGAAA UUUCGGC Compensatory Substitutions Two sides of the acceptor stem from a tRNA are shown. Due to structure conservation alignment is possible in widely different species.

30 Model 7A is a General Reversible 7-state Model 7 frequencies  i + 21 rate parameters  ij - 2 constraints = 26 free parameters

31 Probability of remaining in same state P ii SSU rRNA sequences from Eubacteria

32 Probability P ij of changes from CG to other pairs SSU rRNA from Eubacteria

33 AU GC GU CG UA UG slow fast Selection against GU and UG is weaker than against mismatches. Double transitions are faster than double transversions. Double transitions are faster than single transitions to GU and UG states. This is explained by the theory of compensatory substitutions. What is going on?

34 tRNA mitoch. tRNA general tRNA archaea Rnase PSSU rRNA G+C average G+C helical regions 0.339 0.448 0.532 0.681 0.636 0.829 0.594 0.730 0.545 0.674 Frequencies GC CG AU UA GU UG MM 0.266 0.121 0.257 0.233 0.046 0.030 0.046 0.372 0.260 0.128 0.142 0.043 0.025 0.030 0.473 0.320 0.057 0.077 0.031 0.020 0.022 0.385 0.296 0.117 0.104 0.050 0.022 0.026 0.352 0.298 0.122 0.173 0.020 0.021 0.014 Number of sequences8847546484455 Number of pairs21 80296 Analysis of RNA sequence databases Selection for thermodynamically stable structures Higgs (2000) Quart. Rev. Biophysics

35 tRNA mitoch. tRNA general tRNA archaea Rnase PSSU rRNA Mutabilities GC CG AU UA GU UG MM 0.67 0.84 0.86 0.77 2.44 3.32 2.32 0.49 0.83 1.46 1.24 1.96 5.01 0.99 0.45 0.89 4.01 1.78 1.85 3.00 0.86 0.65 0.60 1.46 1.09 1.72 2.84 5.24 0.55 0.66 1.40 0.93 3.92 4.36 7.84 Double transitions / Double transversions 4.71.72.33.12.1 Double transitions / Transitions to GU or UG 1.62.08.93.62.8 Analysis of RNA Substitution Rates Thermodynamic properties influence Evolutionary properties


Download ppt "Roles of RNA mRNA (messenger) rRNA (ribosomal) tRNA (transfer) other ribonucleoproteins (e.g. spliceosome, signal recognition particle, ribonuclease P)"

Similar presentations


Ads by Google