[BejeranoFall15/16] 1 MW 1:30-2:50pm in Clark S361* (behind Peet’s) Profs: Serafim Batzoglou & Gill Bejerano CAs: Karthik Jagadeesh.

Slides:



Advertisements
Similar presentations
Gene expression From Gene to Protein
Advertisements

RNA structure prediction. RNA functions RNA functions as –mRNA –rRNA –tRNA –Nuclear export –Spliceosome –Regulatory molecules (RNAi) –Enzymes –Virus –Retrotransposons.
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
RNA and Protein Synthesis
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Zhi John Lu, Jason Gloor, and David H. Mathews University of Rochester Medical Center, Rochester, New York Improved RNA Secondary Structure Prediction.
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
Predicting RNA Structure and Function. Following the human genome sequencing there is a high interest in RNA “Just when scientists thought they had deciphered.
[Bejerano Fall10/11] 1.
. Class 5: RNA Structure Prediction. RNA types u Messenger RNA (mRNA) l Encodes protein sequences u Transfer RNA (tRNA) l Adaptor between mRNA molecules.
Structural Alignment of Pseudoknotted RNAs Banu Dost, Buhm Han, Shaojie Zhang, Vineet Bafna.
CISC667, F05, Lec19, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) RNA secondary structure.
Predicting RNA Structure and Function
Chapter 15 Noncoding RNAs. You Must Know The role of noncoding RNAs in control of cellular functions.
[Bejerano Spr06/07] 1 TTh 11:00-12:15 in Clark S361 Profs: Serafim Batzoglou, Gill Bejerano TAs: George Asimenos, Cory McLean.
[Bejerano Aut07/08] 1 MW 11:00-12:15 in Redwood G19 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
Dynamic Programming (cont’d) CS 466 Saurabh Sinha.
CS273A Lecture 5: Genes Enrichment, Gene Regulation I
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
RNA-Seq and RNA Structure Prediction
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
[BejeranoFall14/15] 1 MW 12:50-2:05pm in Beckman B100 Profs: Serafim Batzoglou & Gill Bejerano CAs: Jim Notwell & Sandeep Chinchali.
Introduction to RNA Bioinformatics Craig L. Zirbel October 5, 2010 Based on a talk originally given by Anton Petrov.
RNA informatics Unit 12 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.
Non-coding RNA gene finding problems. Outline Introduction RNA secondary structure prediction RNA sequence-structure alignment.
The Search for Small Regulatory RNA Central Dogma: DNA to RNA to Protein Replication Processing / Translocation hnRNA rRNAtRNA mRNA.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
RNA folding & ncRNA discovery I519 Introduction to Bioinformatics, Fall, 2012.
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
1 RNA Bioinformatics Genes and Secondary Structure Anne Haake Rhys Price Jones & Tex Thompson.
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
1 TRANSCRIPTION AND TRANSLATION. 2 Central Dogma of Gene Expression.
How Genes Work Ch. 12.
8.4 Transcription KEY CONCEPT Transcription converts a gene into a single-stranded RNA molecule.
Replication Transcription Translation
Lecture 9 CS5661 RNA – The “REAL nucleic acid” Motivation Concepts Structural prediction –Dot-matrix –Dynamic programming Simple cost model Energy cost.
[BejeranoWinter12/13] 1 MW 11:00-12:15 in Beckman B302 Prof: Gill Bejerano TAs: Jim Notwell & Harendra Guturu CS173 Lecture 6:
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Basics of RNA structure. RNA functions Storage/transfer of genetic information Genomes many viruses have RNA genomes single-stranded (ssRNA) e.g., retroviruses.
CS5263 Bioinformatics RNA Secondary Structure Prediction.
Progress toward Predicting Viral RNA Structure from Sequence: How Parallel Computing can Help Solve the RNA Folding Problem Susan J. Schroeder University.
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
Doug Raiford Lesson 7.  RNA World Hypothesis  RNA world evolved into the DNA and protein world  DNA advantage: greater chemical stability  Protein.
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:
Core Transcription and Translation
[BejeranoFall15/16] 1 MW 1:30-2:50pm in Clark S361* (behind Peet’s) Profs: Serafim Batzoglou & Gill Bejerano CAs: Karthik Jagadeesh.
Functions of RNA mRNA (messenger)- instructions protein
Motif Search and RNA Structure Prediction Lesson 9.
GENOME: an organism’s complete set of genetic material In humans, ~3 billion base pairs CHROMOSOME: Part of the genome; structure that holds tightly wound.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
Unit-II Synthetic Biology: Protein Synthesis Synthetic Biology is - A) the design and construction of new biological parts, devices, and systems, and B)
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
Gene structure and function
RNAs. RNA Basics transfer RNA (tRNA) transfer RNA (tRNA) messenger RNA (mRNA) messenger RNA (mRNA) ribosomal RNA (rRNA) ribosomal RNA (rRNA) small interfering.
Biochemistry Free For All
Halfway Feedback (yours)
Lecture 21 RNA Secondary Structure Prediction
Lab 8.3: RNA Secondary Structure
CS273A Lecture 3: Non Coding Genes MW 12:50-2:05pm in Beckman B100
Predicting RNA Structure and Function
Transcription and Gene Expression
RNA Secondary Structure Prediction
RNA Secondary Structure Prediction
Introduction to Bioinformatics II
Profs: Serafim Batzoglou, Gill Bejerano TAs: Cory McLean, Aaron Wenger
CISC 467/667 Intro to Bioinformatics (Spring 2007) RNA secondary structure CISC667, S07, Lec19, Liao.
Presentation transcript:

[BejeranoFall15/16] 1 MW 1:30-2:50pm in Clark S361* (behind Peet’s) Profs: Serafim Batzoglou & Gill Bejerano CAs: Karthik Jagadeesh & Johannes Birgmeier * Mostly: track on website/piazza CS273A Lecture 3: Non Coding Genes

– Lecture slides, problem sets, etc. Course communications via Piazza – Auditors please sign up too Second Tutorial this Fri 10/2. UCSC genome browser. We recommend bringing your laptop. We are starting to take attendance today. [BejeranoFall15/16] 2 Announcements

[BejeranoFall15/16] 3 “non coding” RNAs (ncRNA)

4 Central Dogma of Biology:

5 Active forms of “non coding” RNA reverse transcription (of eg retrogenes) long non-coding RNA microRNA rRNA, snRNA, snoRNA

6 What is ncRNA? Non-coding RNA (ncRNA) is an RNA that functions without being translated to a protein. Known roles for ncRNAs: –RNA catalyzes excision/ligation in introns. –RNA catalyzes the maturation of tRNA. –RNA catalyzes peptide bond formation. –RNA is a required subunit in telomerase. –RNA plays roles in immunity and development (RNAi). –RNA plays a role in dosage compensation. –RNA plays a role in carbon storage. –RNA is a major subunit in the SRP, which is important in protein trafficking. –RNA guides RNA modification. –RNA can do so many different functions, it is thought in the beginning there was an RNA World, where RNA was both the information carrier and active molecule.

[BejeranoFall15/16] 7 “non coding” RNAs (ncRNA) subtype: Small structural RNAs (ssRNA)

8 AAUUGCGGGAAAGGGGUCAA CAGCCGUUCAGUACCAAGUC UCAGGGGAAACUUUGAGAUG GCCUUGCAAAGGGUAUGGUA AUAAGCUGACGGACAUGGUC CUAACCACGCAGCCAAGUCC UAAGUCAACAGAUCUUCUGU UGAUAUGGAUGCAGUUCA ssRNA Folds into Secondary and 3D Structures Cate, et al. (Cech & Doudna). (1996) Science 273:1678. Waring & Davies. (1984) Gene 28: 277. Computational Challenge: predict 2D & 3D structure from sequence (1D).

For example, tRNA Activity Complementation is a “brilliant” way for the Genome to “get things done”. Replication, Transcription, Translation, … Compliment(Compliment(Identity)) = Identity

tRNA Structure

ssRNA structure rules Canonical basepairs: –Watson-Crick basepairs: G ≡ C A = U –Wobble basepair: G − U Stacks: continuous nested basepairs. (energetically favorable) Non-basepaired loops: –Hairpin loop –Bulge –Internal loop –Multiloop Pseudo-knots

Ab initio RNA structure prediction: lots of Dynamic Programming Objective: Maximizing # of base pairings (Nussinov et al, 1978) simple model:  (i, j) = 1 iff GC|AU|GU fancier model: GC > AU > GU

Dynamic programming  Compute S(i,j) recursively (dynamic programming) –Compares a sequence against itself in a dynamic programming matrix  Three steps

1. Initialization Example: GGGAAAUCC GGGAAAUCC G0 G00 G00 A00 A00 A00 U00 C00 C00 the main diagonal the diagonal below L: the length of input sequence

2. Recursion GGGAAAUCC G0000 G00000 G00000 A0000? A0001 A0011 U0000 C000 C00 Fill up the table (DP matrix) -- diagonal by diagonal j i

3. Traceback GGGAAAUCC G G G A A A00111 U0000 C000 C00 The structure is: What are the other “optimal” structures?

Pseudoknots drastically increase computational complexity

[BejeranoFall15/16] 18 Objective: Minimize Secondary Structure Free Energy at 37 O C: Mathews, Disney, Childs, Schroeder, Zuker, & Turner PNAS 101: Instead of  (i, j), measure and sum energies:

Zuker’s algorithm MFOLD: computing loop dependent energies

Bafna1 RNA structure Base-pairing defines a secondary structure. The base-pairing is usually non-crossing. In this example the pseudo-knot makes for an exception.

S  aSu S  cSg S  gSc S  uSa S  a S  c S  g S  u S  SS 1. A CFG S  aSu  acSgu  accSggu  accuSaggu  accuSSaggu  accugScSaggu  accuggSccSaggu  accuggaccSaggu  accuggacccSgaggu  accuggacccuSagaggu  accuggacccuuagaggu 2. A derivation of “accuggacccuuagaggu” 3. Corresponding structure Stochastic context-free grammar

ssRNA transcription ssRNAs like tRNAs are usually encoded by short “non coding” genes, that transcribe independently. Found in both the UCSC “known genes” track, and as a subtrack of the RepeatMasker track [BejeranoFall15/16] 22

[BejeranoFall15/16] 23 “non coding” RNAs (ncRNA) subtype: microRNAs (miRNA/miR)

[BejeranoFall15/16] 24 MicroRNA (miR) mRNA ~70nt~22nt miR match to target mRNA is quite loose.  a single miR can regulate the expression of hundreds of genes.

[BejeranoFall15/16] 25 MicroRNA Transcription mRNA

[BejeranoFall15/16] 26 MicroRNA Transcription mRNA

[BejeranoFall15/16] 27 MicroRNA (miR) mRNA ~70nt~22nt miR match to target mRNA is quite loose.  a single miR can regulate the expression of hundreds of genes. Computational challenges: Predict miRs. Predict miR targets.

[BejeranoFall15/16] 28 MicroRNA Therapeutics mRNA ~70nt~22nt miR match to target mRNA is quite loose.  a single miR can regulate the expression of hundreds of genes. Idea: bolster/inhibit miR production to broadly modulate protein production Hope: “right” the good guys and/or “wrong” the bad guys Challenge: and not vice versa.

[BejeranoFall15/16] 29 Other Non Coding Transcripts

lncRNAs (long non coding RNAs) [BejeranoFall15/16] 30 Don’t seem to fold into clear structures (or only a sub-region does). Diverse roles only now starting to be understood. Example: bind splice site, cause intron retention.  Challenges: 1) detect and 2) predict function computationally

X chromosome inactivation in mammals X XX Y X Dosage compensation

Xist – X inactive-specific transcript Avner and Heard, Nat. Rev. Genetics (1):59-67

System output measurements We can measure non/coding gene expression! (just remember – it is context dependent) 1. First generation mRNA (cDNA) and EST sequencing: [BejeranoFall15/16] 33 In UCSC Browser:

2. Gene Expression Microarrays (“chips”) [BejeranoFall15/16] 34

3. RNA-seq “Next” (2nd) generation sequencing. [BejeranoFall15/16] 35

[BejeranoFall15/16] 36 Transcripts, transcripts everywhere Human Genome Transcribed* (Tx) Tx from both strands* * True size of set unknown

Or are they? [BejeranoFall15/16] 37

[BejeranoFall15/16] 38 The million dollar question Human Genome Transcribed* (Tx) Tx from both strands* * True size of set unknown Leaky tx? Functional?

[BejeranoFall15/16] 39 Gene Finding II: technology dependence Challenge: “Find the genes, the whole genes, and nothing but the genes” We started out trying to predict genes directly from the genome. When you measure gene expression, the challenge changes: Now you want to build gene models from your observations. These are both technology dependent challenges. The hybrid: what we measure is a tiny fraction of the space-time state space for cells in our body. We want to generalize from measured states and improve our predictions for the full compendium of states.