Sai Zhang, Hailin Hu, Jingtian Zhou, Xuan He, Tao Jiang, Jianyang Zeng 

Slides:



Advertisements
Similar presentations
Comprehensive Analysis of Tissue-wide Gene Expression and Phenotype Data Reveals Tissues Affected in Rare Genetic Disorders  Ariel Feiglin, Bryce K. Allen,
Advertisements

Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
Volume 44, Issue 1, Pages (January 2016)
Codon Bias as a Means to Fine-Tune Gene Expression
Gene Architectures that Minimize Cost of Gene Expression
Volume 4, Issue 4, Pages e5 (April 2017)
Ruth B. McCole, Jelena Erceg, Wren Saylor, Chao-ting Wu  Cell Reports 
Volume 95, Issue 3, Pages e4 (August 2017)
Volume 54, Issue 1, Pages (April 2014)
Transient N-6-Methyladenosine Transcriptome Sequencing Reveals a Regulatory Role of m6A in Splicing Efficiency  Annita Louloupi, Evgenia Ntini, Thomas.
How the Sequence of a Gene Can Tune Its Translation
A Role for Codon Order in Translation Dynamics
Volume 146, Issue 6, Pages (September 2011)
N6-Methyladenosines Modulate A-to-I RNA Editing
Volume 5, Issue 1, Pages e4 (July 2017)
Translation of Genotype to Phenotype by a Hierarchy of Cell Subsystems
Volume 19, Issue 7, Pages (July 2011)
Metabolic Network Prediction of Drug Side Effects
A Massively Parallel Reporter Assay of 3′ UTR Sequences Identifies In Vivo Rules for mRNA Degradation  Michal Rabani, Lindsey Pieper, Guo-Liang Chew,
Volume 34, Issue 4, Pages (May 2009)
Vincent B. McGinty, Antonio Rangel, William T. Newsome  Neuron 
The Translational Landscape of the Mammalian Cell Cycle
Yang Liu, Perry Palmedo, Qing Ye, Bonnie Berger, Jian Peng 
Volume 154, Issue 1, Pages (July 2013)
Dynamic Gene Regulatory Networks of Human Myeloid Differentiation
Volume 3, Issue 1, Pages (July 2016)
Volume 11, Issue 4, Pages (April 2018)
Volume 11, Issue 11, Pages (November 2003)
Volume 59, Issue 5, Pages (September 2015)
Volume 5, Issue 4, Pages e4 (October 2017)
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
Volume 14, Issue 7, Pages (February 2016)
Fine-Resolution Mapping of TF Binding and Chromatin Interactions
RNA Structural Determinants of Optimal Codons Revealed by MAGE-Seq
Folding Membrane Proteins by Deep Transfer Learning
Benjamin Scholl, Daniel E. Wilson, David Fitzpatrick  Neuron 
Fine-Resolution Mapping of TF Binding and Chromatin Interactions
Volume 26, Issue 1, Pages (April 2007)
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains
Srinivas Ramachandran, Kami Ahmad, Steven Henikoff  Molecular Cell 
Volume 5, Issue 4, Pages e4 (October 2017)
Volume 95, Issue 5, Pages e5 (August 2017)
Volume 23, Issue 5, Pages (May 2015)
Systems-wide Identification of cis-Regulatory Elements in Proteins
Volume 9, Pages (November 2018)
Volume 5, Issue 3, Pages e7 (September 2017)
Computational Modeling Reveals that Signaling Lipids Modulate the Orientation of K- Ras4A at the Membrane Reflecting Protein Topology  Zhen-Lu Li, Matthias.
Volume 133, Issue 7, Pages (June 2008)
Grid and Nongrid Cells in Medial Entorhinal Cortex Represent Spatial Location and Environmental Features with Complementary Coding Schemes  Geoffrey W.
Bo Li, Akshay Tambe, Sharon Aviran, Lior Pachter  Cell Systems 
Volume 132, Issue 2, Pages (January 2008)
Probing the “Dark Matter” of Protein Fold Space
Volume 10, Issue 10, Pages (October 2017)
Benjamin Scholl, Daniel E. Wilson, David Fitzpatrick  Neuron 
Volume 5, Issue 4, Pages e4 (October 2017)
Comprehensive Analysis of Tissue-wide Gene Expression and Phenotype Data Reveals Tissues Affected in Rare Genetic Disorders  Ariel Feiglin, Bryce K. Allen,
Volume 29, Issue 5, Pages (May 2016)
Volume 122, Issue 6, Pages (September 2005)
A Flip Turn for Membrane Protein Insertion
iRNA-PseU: Identifying RNA pseudouridine sites
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
Volume 5, Issue 1, Pages e4 (July 2017)
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Network-Based Coverage of Mutational Profiles Reveals Cancer Genes
Volume 7, Issue 6, Pages e3 (December 2018)
Multiplex Enhancer Interference Reveals Collaborative Control of Gene Regulation by Estrogen Receptor α-Bound Enhancers  Julia B. Carleton, Kristofer.
Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
Joana Pinto Vieira, Julien Racle, Vassily Hatzimanikatis 
Andrey V Kajava, Gilbert Vassart, Shoshana J Wodak  Structure 
eIF5A Functions Globally in Translation Elongation and Termination
Presentation transcript:

Analysis of Ribosome Stalling and Translation Elongation Dynamics by Deep Learning  Sai Zhang, Hailin Hu, Jingtian Zhou, Xuan He, Tao Jiang, Jianyang Zeng  Cell Systems  Volume 5, Issue 3, Pages 212-220.e6 (September 2017) DOI: 10.1016/j.cels.2017.08.004 Copyright © 2017 Elsevier Inc. Terms and Conditions

Cell Systems 2017 5, 212-220.e6DOI: (10.1016/j.cels.2017.08.004) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 1 The ROSE Pipeline and the Convolutional Neural Network (CNN) Model (A) Schematic overview of the ROSE pipeline. The codon sites with normalized ribosome footprint densities beyond two SDs are regarded as positive samples, which represent the ribosome stalling positions, to train a deep CNN model. Then the sequence profiles of individual codon sites along the genome are fed into the trained CNN to compute the distribution of ribosome stalling, which can be further used to study the potential factors affecting ribosome stalling and analyze the genome-wide landscape of translation elongation dynamics. (B) Schematic illustration of the CNN model used in the ROSE pipeline. More details can be found in the main text. Cell Systems 2017 5, 212-220.e6DOI: (10.1016/j.cels.2017.08.004) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 2 Performance Evaluation of ROSE (A and B) The receiver operating characteristic (ROC) curves and the area under the corresponding ROC curve (AUROC) scores on the human (Battle15) and yeast (Pop14) test datasets, respectively. (C) Comparison of AUROCs between parallel and sequential CNN architectures. “sROSE” and “eROSE” stand for the ROSE frameworks with one (single) and 64 (ensemble) CNNs, respectively. conv1, conv2, and conv3 represent the convolutional-pooling modules with kernel sizes of the convolutional layers corresponding to the short, medium, and long ranges used, respectively. × denotes the stacking operation in the sequential architecture. (D) The ROC curves and the corresponding AUROC scores of the cross-study tests on additional human (Stumpf13 G1, M, and S) and yeast (Brar12 and Young15) datasets. The “Brar12 non-overlapping dataset” includes 1,748 genes with sufficient (over 60%) ribosome profiling coverage in the Brar12 dataset but not in the Pop14 dataset. (E) The Venn diagram of the three yeast datasets regarding sufficiently covered genes. (F) The ROC curves and the corresponding AUROC scores on the ramp regions. Cell Systems 2017 5, 212-220.e6DOI: (10.1016/j.cels.2017.08.004) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 3 A Comprehensive Reexamination on the Relations between Diverse Putative Regulatory Factors and Ribosome Stalling Using ROSE (A and B) Comparisons of intraRSS between the codon sites enriched with individual factors and the background for human and yeast, respectively. Data labels: negative control (randomly selected 10,000 codon sites), cAI (codon adaptation index), tAI (tRNA adaptation index), cCI (codon co-occurrence index), and %MinMax score (the codon rareness measurement proposed in Clarke and Clark, 2008). (C) Comparisons of intraRSS between the single-peptide pattern of proline (i.e., XPX) and the multiple-peptide patterns of proline, including dipeptide (i.e., XPP and PPX) and tripeptide (i.e., PPP), where “P” and “X” stand for proline and any non-proline amino acid, respectively. (D) A schematic illustration of the m6A modification of a codon (e.g., AAA) to delay tRNA accommodation (denoted by the purple star) during translation elongation. ∗5 × 10−25 < p < 1 × 10−2; ∗∗5 × 10−50 < p ≤ 5 × 10−25; ∗∗∗5 × 10−100 < p ≤ 5 × 10−50; +p ≤ 5 × 10−100; one-sided Wilcoxon rank-sum test. Note that here it is not necessary that the median was zero for the “Background” column, as the intraRSS was normalized within individual genes rather than along the genome. Also, the extremely small p values were partially due to the large sample size (N = 10,000). Cell Systems 2017 5, 212-220.e6DOI: (10.1016/j.cels.2017.08.004) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 4 The Intragenic RSS Landscapes Reveal that Ribosome Stalling Associates with Protein Secondary Structure and the SRP Binding of Transmembrane (TM) Segments (A) The intraRSS landscapes of α helix, β strand, and random coil regions. (B) The intraRSS landscapes of the secondary structure element (SSE) transition regions. “H”, “E” and “C” stand for α helix, β strand, and random coil, respectively, while “X” represents any SSE type in the flanking regions on both sides. Polynomial curve fitting of degree four was used to show the general intraRSS tendency. The Spearman correlation coefficients between human and yeast intraRSS tendencies were calculated. (C) The overall comparisons of intraRSS between the structured (i.e., α helix and β strand) and random coil residues. ∗∗5 × 10−50 < p ≤5 × 10−25; ∗∗∗5 × 10−100 < p ≤ 5 × 10−50; one-sided Wilcoxon rank-sum test. (D) A schematic illustration of the SRP binding of a TM segment during translation elongation. (E) Comparison of intraRSS tendency between the TM segments with (SRP+) and without SRP binding (SRP−) in yeast, in which all the protein sequences were aligned with regard to the start of the TM segment whose position was indexed as zero. The yellow rectangle covers the TM segment, while the gray rectangles represent two intraRSS peaks downstream of the TM segment. The intraRSS peak marked with the blue star (i.e., positions from +50 to +70) was significantly diminished (p = 1.5 × 10−3 by one-sided Wilcoxon rank-sum test). Cell Systems 2017 5, 212-220.e6DOI: (10.1016/j.cels.2017.08.004) Copyright © 2017 Elsevier Inc. Terms and Conditions