Ribosomal Profiling Data Handling and Analysis

Slides:



Advertisements
Similar presentations
An Introduction to Studying Expression Data Through RNA-seq
Advertisements

12/04/2017 RNA seq (I) Edouard Severing.
RNAseq Library Preparation and ANAlysis basics
Processing of miRNA samples and primary data analysis
Molecular Genetics DNA RNA Protein Phenotype Genome Gene
Simon v2.3 RNA-Seq Analysis Simon v2.3.
Peter Tsai Bioinformatics Institute, University of Auckland
Ribosome Profiling Library Preparation with SOLID Nate Blewett MGL Users Group May 4 th 2015.
Transcriptomics Jim Noonan GENE 760.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
TRANSLATION The process of converting the information stored in mRNA into a protein is called translation mRNA carries information from a gene to a structure.
2.7 DNA Replication, transcription and translation
RNA and Protein Synthesis
Nucleic Acids 7.3 Translation.
Translation and Transcription
PROTEIN SYNTHESIS.
Express yourself That darn ribosome Mighty Mighty Proteins Mutants RNA to the Rescue
An Overview of Protein Synthesis. Genes A sequence of nucleotides in DNA that performs a specific function such as coding for a particular protein.
DNA, RNA & Proteins Transcription Translation Chapter 3, 15 & 16.
Ji-hye Choi August Introduction (2006) ABRF-NGS (the Association fo Biomolecular Resource Facilities next-generation sequencing study)
Biology 1060 Chapter 17 From Gene to Protein. Genetic Information Important: Fig Describe how genes control phenotype –E.g., explain dwarfism in.
RNAseq analyses -- methods
RNA and Protein Synthesis
RNA-Seq Analysis Simon V4.1.
Amino acid sequence of His protein DNA provides the instructions for how to build proteins Each gene dictates how to build a single protein in prokaryotes.
RNA and Protein Synthesis
Protein Synthesis Chapter Protein synthesis- the production of proteins The amount and kind of proteins produced in a cell determine the structure.
Transcriptomics Sequencing. over view The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non coding RNA produced.
No reference available
Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter menu Flow of Genetic Information The flow of genetic information can be.
GENOME: an organism’s complete set of genetic material In humans, ~3 billion base pairs CHROMOSOME: Part of the genome; structure that holds tightly wound.
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
RNA Makin’ Proteins DNAMutations Show off those Genes!
Translation Translation is the process of building a protein from the mRNA transcript. The protein is built as transfer RNA (tRNA) bring amino acids (AA),
The beginning of protein synthesis. OVERVIEW  Uses a strand of nuclear DNA to produce a single-stranded RNA molecule  Small section of DNA molecule.
Transcription and The Genetic Code From DNA to RNA.
Unit-II Synthetic Biology: Protein Synthesis Synthetic Biology is - A) the design and construction of new biological parts, devices, and systems, and B)
RNA and Protein Synthesis Chapter How are proteins made? In molecular terms, genes are coded DNA instructions that control the production of.
Protein Synthesis. RNA vs. DNA Both nucleic acids – Chains of nucleotides Different: – Sugar – Types of bases – Numbers of bases – Number of chains –
Canadian Bioinformatics Workshops
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop.
Simon v RNA-Seq Analysis Simon v
RNA Quantitation from RNAseq Data
The Transcriptional Landscape of the Mammalian Genome
Lesson Four Structure of a Gene.
Lesson Four Structure of a Gene.
Gene expression from RNA-Seq
RNA-Seq analysis in R (Bioconductor)
From DNA to Proteins Transcription.
From Gene to Protein Chapter 17.
Relationship between Genotype and Phenotype
RNA and Protein Synthesis
RNA and Protein Synthesis
CENTRAL DOGMA OF GENE EXPRESSION
Protein synthesis: Overview
From Genes to Proteins.
mRNA Degradation and Translation Control
Central Dogma
Volume 154, Issue 1, Pages (July 2013)
RNA and Protein Synthesis
Volume 14, Issue 7, Pages (February 2016)
Volume 26, Issue 1, Pages (April 2007)
Jong-Eun Park, Hyerim Yi, Yoosik Kim, Hyeshik Chang, V. Narry Kim 
Structure of the Genome
From Genes to Proteins.
Dom34 Rescues Ribosomes in 3′ Untranslated Regions
Protein Synthesis.
Volume 71, Issue 2, Pages e5 (July 2018)
Presentation transcript:

Ribosomal Profiling Data Handling and Analysis MGL Users Group James Iben 5/4/15

Goals of Ribosomal Profiling Basic RNAseq experiments measure relative quantities of transcripts in the cell. Not all transcripts are equally engaged with translating ribosomes. Ribosomal Profiling captures RNA bound by polysomes giving a snapshot of active translation in the cell. Tends to provide a better proxy for protein levels in the cell than just RNAseq. (Duncan & Mata, Nature Structural & Molecular Biology 21, 641–647 (2014)) Can also identify where ribosomes are engaged at the codon level if RNA footprints are well-trimmed to just the ribosome protected fragments. Identify pause sites, frame of translation, etc.

Sequencing Prepared Samples Following an involved library preparation, short DNA fragments are sequenced according to regular RNA-seq protocols with minor modifications: Since inserts are expected to be primarily ~28 nucleotides, only single end sequencing is performed Fragments are read to 50bp with the expectation of some adapter being read We performed sequencing of 14 samples 4 sets of triplicate conditions/controls (4x, Mod5, Tit1, Tyr) Two test mouse library preparations

Trimming of Sequenced Reads Reads are trimmed for adapter sequence ONLY No quality trimming is performed at this stage as size is critically important Read length distribution is observed in samples, expecting a tight distribution around the size of the ribosome protected fragment Some ‘Full length’ (50 nt) reads may also be obtained. These are primarily found to map to contaminants (rRNA, tRNA, other ncRNA, etc)

Length Distribution Obtained Suggests More Aggressive RNAse Needed in Yeast Samples Nothing 34bp+ other than 50bp fragments. Lacking a strong 28/29bp (expected yeast footprint) component, positioning of A/P sites cannot be reliably performed. Mouse Fraction of Reads Length of Fragment

Mapping of Reads Alignment is performed using Bowtie1 (short, non-permissive mapping) against the transcriptome and genome. Mapping is found primarily on coding RNA in direction of transcript, spanning introns. Ribosomal RNA contamination was ~40% in pombe and ~10% in mouse

Measure of Translational Efficiency of mRNAs A measure of how engaged ribosomes are with mRNA Assumption: more ribosomes = more active translation Translational Efficiency (TE) is defined by Ribosomal occupancy of the message normalized by the amount of message in the cell (as measured by RNAseq) TE = (RPKM in RiboProfiling) / (RPKM in RNAseq) Previously, RNAseq had been performed on these same conditions in triplicate

Quantitation and Comparisons (Approach 1) Read density on genes was measured as RPKM (Reads Per Kilobase of transcript per Million reads in the experiment). Used HTSeq with a GTF of gene definitions from the latest pombe build (ASM294v2.22) used for alignment. Tables were prepared for both RNAseq and RP experiments. TE calculated over 9 comparisons (3 RNAseq x 3 RP) TE compared across experimental conditions as a difference of means. Bonferroni correction for multiple testing to establish significance.

Cross Sample Comparison of TEs Per-gene mean TE is generally well correlated between conditions. Spearman correlation >0.8

ANOTA (Approach 2) ANalysis Of Translational Activity R Bioconductor package (Larsson O, Sonenberg N and Nadon R (2011). anota: ANalysis Of Translational Activity (ANOTA).. R package version 1.16.0.) Attempts to control for non-translation related changes (localization, etc) that may cause false positives in comparing raw TE. Uses regression analysis between the translationally active mRNA levels and the cytosolic mRNA levels. Dependent of several criteria for appropriate use Outlier samples cannot exist Consistency (polysome prep, etc) Residuals close to normal (no major bias)

Additional Types of Analysis With well trimmed fragments, reading frame may be analyzed Bioconductor package riboSeqR (Hardcastle TJ (2014). riboSeqR: Analysis of sequencing data from ribosome profiling experiments.. R package version 1.2.0.)

Other Considerations Transcript level view of pausing Codon occupancy Phenotype of these samples in particular is expected to alter decoding of codons somewhat Enrichment analysis of differentially expressed genes