Bo Li, Akshay Tambe, Sharon Aviran, Lior Pachter  Cell Systems 

Slides:



Advertisements
Similar presentations
Verna Vu & Timothy Abreo
Advertisements

Volume 50, Issue 1, Pages (April 2013)
Comparative Analysis of Single-Cell RNA Sequencing Methods
Low Dimensionality in Gene Expression Data Enables the Accurate Extraction of Transcriptional Programs from Shallow Sequencing  Graham Heimberg, Rajat.
Volume 4, Issue 4, Pages e5 (April 2017)
Volume 38, Issue 4, Pages (May 2010)
Volume 3, Issue 1, Pages (July 2016)
Volume 55, Issue 5, Pages (September 2014)
Global Mapping of Human RNA-RNA Interactions
Volume 33, Issue 2, Pages (January 2009)
High-Resolution Profiling of Histone Methylations in the Human Genome
Taichi Umeyama, Takashi Ito  Cell Reports 
Volume 44, Issue 3, Pages (November 2011)
A Phase Separation Model for Transcriptional Control
Volume 36, Issue 2, Pages (October 2009)
N6-Methyladenosines Modulate A-to-I RNA Editing
Volume 68, Issue 4, Pages e5 (November 2017)
Pejman Mohammadi, Niko Beerenwinkel, Yaakov Benenson  Cell Systems 
Volume 5, Issue 1, Pages e4 (July 2017)
Volume 61, Issue 1, Pages (January 2016)
Volume 23, Issue 5, Pages (May 2018)
A Massively Parallel Reporter Assay of 3′ UTR Sequences Identifies In Vivo Rules for mRNA Degradation  Michal Rabani, Lindsey Pieper, Guo-Liang Chew,
Inference of Environmental Factor-Microbe and Microbe-Microbe Associations from Metagenomic Data Using a Hierarchical Bayesian Statistical Model  Yuqing.
Yang Liu, Perry Palmedo, Qing Ye, Bonnie Berger, Jian Peng 
Gracjan Michlewski, Sonia Guil, Colin A. Semple, Javier F. Cáceres 
Volume 23, Issue 10, Pages (October 2016)
Edwards Allen, Zhixin Xie, Adam M. Gustafson, James C. Carrington  Cell 
Volume 18, Issue 5, Pages (January 2017)
RRNA Modifications in an Intersubunit Bridge of the Ribosome Strongly Affect Both Ribosome Biogenesis and Activity  Xue-hai Liang, Qing Liu, Maurille.
Mapping Gene Expression in Two Xenopus Species: Evolutionary Constraints and Developmental Flexibility  Itai Yanai, Leonid Peshkin, Paul Jorgensen, Marc W.
Volume 19, Issue 1, Pages (July 2005)
High-Resolution Profiling of Histone Methylations in the Human Genome
Fast, Quantitative and Variant Enabled Mapping of Peptides to Genomes
Volume 25, Issue 3, Pages (February 2007)
Volume 14, Issue 7, Pages (February 2016)
Volume 4, Issue 5, Pages e5 (May 2017)
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains
Michal Levin, Tamar Hashimshony, Florian Wagner, Itai Yanai 
Sai Zhang, Hailin Hu, Jingtian Zhou, Xuan He, Tao Jiang, Jianyang Zeng 
Volume 68, Issue 5, Pages e9 (December 2017)
Volume 6, Issue 2, Pages e5 (February 2018)
Volume 29, Issue 5, Pages (March 2008)
Volume 44, Issue 3, Pages (November 2011)
Volume 50, Issue 2, Pages (April 2013)
Volume 132, Issue 2, Pages (January 2008)
Volume 5, Issue 2, Pages e4 (August 2017)
Baekgyu Kim, Kyowon Jeong, V. Narry Kim  Molecular Cell 
Integral Nuclear Pore Proteins Bind to Pol III-Transcribed Genes and Are Required for Pol III Transcript Processing in C. elegans  Kohta Ikegami, Jason D.
Volume 6, Issue 2, Pages e5 (February 2018)
ADAR Regulates RNA Editing, Transcript Stability, and Gene Expression
Impact of Alternative Splicing on the Human Proteome
Nathan Pirakitikulr, Andrew Kohlway, Brett D. Lindenbach, Anna M. Pyle 
Predicting Gene Expression from Sequence
HOTAIR Forms an Intricate and Modular Secondary Structure
Figure 1. Summary of experimental conditions and data normalization
Volume 16, Issue 2, Pages (February 2015)
Volume 2, Issue 5, Pages (May 2016)
Volume 5, Issue 1, Pages e4 (July 2017)
Maria S. Robles, Sean J. Humphrey, Matthias Mann  Cell Metabolism 
Volume 14, Issue 4, Pages (May 2004)
Volume 52, Issue 1, Pages (October 2013)
A Role for Mammalian Sin3 in Permanent Gene Silencing
Taichi Umeyama, Takashi Ito  Cell Reports 
Volume 4, Issue 2, Pages e4 (February 2017)
Multiplex Enhancer Interference Reveals Collaborative Control of Gene Regulation by Estrogen Receptor α-Bound Enhancers  Julia B. Carleton, Kristofer.
Interactome Networks and Human Disease
Volume 3, Issue 6, Pages e3 (December 2016)
Integral Nuclear Pore Proteins Bind to Pol III-Transcribed Genes and Are Required for Pol III Transcript Processing in C. elegans  Kohta Ikegami, Jason D.
Volume 18, Issue 5, Pages (January 2017)
RNA Processing: Redrawing the Map of Charted Territory
Presentation transcript:

PROBer Provides a General Toolkit for Analyzing Sequencing-Based Toeprinting Assays  Bo Li, Akshay Tambe, Sharon Aviran, Lior Pachter  Cell Systems  Volume 4, Issue 5, Pages 568-574.e7 (May 2017) DOI: 10.1016/j.cels.2017.04.007 Copyright © 2017 Elsevier Inc. Terms and Conditions

Cell Systems 2017 4, 568-574.e7DOI: (10.1016/j.cels.2017.04.007) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 1 Cartoon Depictions of Sequencing-Based Toeprinting Experiments, the Associated Bioinformatics Challenges, and Our Solution (A) Cartoon depiction of an idealized toeprinting experiment. The genome is transcribed and RNAs are spliced and folded to form the structured transcriptome. This pool of RNAs is split into two and either treated with a chemical probe or mock-treated without the chemical probe. These chemical adducts are detected by reverse transcriptase (RT) drop-off, but the signal is convoluted by reverse transcriptase noise. Reverse transcription products are collected and sequenced. (B) Potential bioinformatics challenges. The structured transcriptome that gave rise to a given toeprinting dataset consists of known transcripts of unknown relative abundance. Reads from this dataset might align ambiguously to one or more transcripts, and might have been generated by either RT drop-off at a chemical modification or by RT noise. (C) Conceptual workflow of PROBer. Sequencing data (both treatment and control datasets) from a toeprinting experiment are used as the input. In the E step, reads are assigned to transcripts depending on an initial alignment, and the relative abundances and toeprinting parameters of the transcripts are estimated in the M step. In the M step, transcript abundances and toeprinting parameters are learned, using the read assignments calculated in the E step. Cell Systems 2017 4, 568-574.e7DOI: (10.1016/j.cels.2017.04.007) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 2 Performance of PROBer on RNA Structure Probing Assays (A) A simulated RNA structure-probing dataset was generated in a manner consistent with the protocol of Ding et al. (2014), and used as the input for a number of structure-probing quantification methods, which include PROBer, StructureFold (Tang et al., 2015), Mod-seeker (Talkish et al., 2014), and icSHAPE (Spitale et al., 2015). Accuracy was evaluated by comparing the results from these methods with the ground truth modification reactivity profiles (β values) using Pearson's correlation coefficients. PROBer consistently outperforms alternative approaches across a wide range of expression levels. See also Figures S2 and S3. (B) PROBer was compared with alternative methods for in vitro probing yeast 18S rRNA on a paired-end dataset (Hector et al., 2014). Methods were evaluated by precision-recall (PR) curves and area under curve (AUC) values using crystallographically informed solvent-accessible secondary structures as ground truth. PROBer outperforms alternative methods significantly. See also Figure S6 and Table S3. (C) PROBer was compared with alternative methods on two yeast structure-probing datasets (Hector et al., 2014; Talkish et al., 2014) for predicting 18S and 25S rRNA secondary structures. For each method, the estimated modification reactivity profiles were converted into SHAPE constraints (see STAR Methods), and then the SHAPE constraints were fed to RNAstructure (Mathews, 2014) to produce secondary structure predictions. We additionally compared with a baseline method, which ran RNAstructure with no SHAPE constraints. The resulting minimum free energy structures were evaluated on two commonly used metrics: sensitivity and positive predictive value (PPV). We highlight the best performer of each column in bold. PROBer outperforms alternative approaches in all four cases and outperforms the baseline in three out of four cases. Cell Systems 2017 4, 568-574.e7DOI: (10.1016/j.cels.2017.04.007) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 3 Performance of PROBer on Detecting Pseudouridine Modifications and Identifying Protein-RNA Binding Sites (A) PROBer was compared with alternative methods on data for predicting known pseudouridine (Ψ) sites in yeast rRNAs and snoRNA (Carlile et al., 2014). Methods were evaluated by precision-recall (PR) curves and area under curve (AUC) values. PROBer outperforms alternative approaches significantly. See also Figures S7 and S8. (B) PROBer was compared with the common practice (unique method) that uses only uniquely mapping reads on eCLIP (Van Nostrand et al., 2016) datasets for four distinct RNA-binding proteins. The first two columns in the table give the protein name and canonical binding motif. The binding motifs have been validated both in vitro and in vivo (Van Nostrand et al., 2016). The next four columns give the number of input-normalized peaks called at a false discovery rate of 0.05 (see STAR Methods) and the percentage of input-normalized peaks overlapping with canonical motifs for the unique method and PROBer, respectively. Peaks were called using CLIPper (Lovci et al., 2013). The last column gives the percentage of more peaks PROBer detected comparing with the unique method. PROBer enables us to extract significantly more information from the eCLIP datasets. See also Tables S5–S7. Cell Systems 2017 4, 568-574.e7DOI: (10.1016/j.cels.2017.04.007) Copyright © 2017 Elsevier Inc. Terms and Conditions