Genome Pool Strategy for Structural Coverage of Protein Families

Slides:



Advertisements
Similar presentations
Scaffold Download free viewer:
Advertisements

Journal of Vision. 2008;8(11):18. doi: / Figure Legend:
The Functional Impact of Alternative Splicing in Cancer
Nir London, Ora Schueler-Furman  Structure 
Mark S. Blumberg, Cassandra M. Coleman, Ashlynn I. Gerth, Bob McMurray 
Volume 112, Issue 7, Pages (April 2017)
Iddo Friedberg, Adam Godzik  Structure 
Sebastian Meyer, Raimund Dutzler  Structure 
Volume 22, Issue 2, Pages (February 2014)
Value Representations in the Primate Striatum during Matching Behavior
A Statistical Description of Plant Shoot Architecture
Shaul Druckmann, Dmitri B. Chklovskii  Current Biology 
Volume 24, Issue 11, Pages (November 2016)
Volume 19, Issue 7, Pages (July 2011)
A Statistical Description of Plant Shoot Architecture
Vincent B. McGinty, Antonio Rangel, William T. Newsome  Neuron 
Allosteric Effects of the Oncogenic RasQ61L Mutant on Raf-RBD
Volume 111, Issue 2, Pages (July 2016)
A Switching Observer for Human Perceptual Estimation
The Functional Impact of Alternative Splicing in Cancer
Volume 154, Issue 1, Pages (July 2013)
Volume 101, Issue 2, Pages (July 2011)
Volume 24, Issue 11, Pages (November 2016)
Cultural Confusions Show that Facial Expressions Are Not Universal
Andrew D. Bosson, Jesse R. Zamudio, Phillip A. Sharp  Molecular Cell 
Publication metrics and success on the academic job market
Nicolas Catz, Peter W. Dicke, Peter Thier  Current Biology 
Volume 15, Issue 9, Pages (September 2007)
Hippocampal “Time Cells”: Time versus Path Integration
Volume 26, Issue 3, Pages e2 (March 2018)
Visually Cued Action Timing in the Primary Visual Cortex
Volume 88, Issue 3, Pages (November 2015)
Single-Unit Responses Selective for Whole Faces in the Human Amygdala
Volume 26, Issue 3, Pages e2 (March 2018)
A Switching Observer for Human Perceptual Estimation
Integration Trumps Selection in Object Recognition
Volume 65, Issue 4, Pages (February 2010)
BOLD fMRI Correlation Reflects Frequency-Specific Neuronal Correlation
Filip Lankaš, Richard Lavery, John H. Maddocks  Structure 
Multiple Conformations of F-actin
Volume 14, Issue 9, Pages (September 2006)
Between Order and Disorder in Protein Structures: Analysis of “Dual Personality” Fragments in Proteins  Ying Zhang, Boguslaw Stec, Adam Godzik  Structure 
A Scalable Population Code for Time in the Striatum
Volume 20, Issue 2, Pages (February 2012)
Volume 77, Issue 6, Pages (March 2013)
Volume 26, Issue 5, Pages (March 2016)
Volume 3, Issue 6, Pages (November 1998)
Nir London, Ora Schueler-Furman  Structure 
Mark S. Blumberg, Cassandra M. Coleman, Ashlynn I. Gerth, Bob McMurray 
Recognizing Protein Substructure Similarity Using Segmental Threading
Volume 16, Issue 8, Pages (August 2008)
Volume 16, Issue 2, Pages (February 2008)
The Normalization Model of Attention
Unfolding Barriers in Bacteriorhodopsin Probed from the Cytoplasmic and the Extracellular Side by AFM  Max Kessler, Hermann E. Gaub  Structure  Volume.
Atomic-Level Protein Structure Refinement Using Fragment-Guided Molecular Dynamics Conformation Sampling  Jian Zhang, Yu Liang, Yang Zhang  Structure 
Gene Density, Transcription, and Insulators Contribute to the Partition of the Drosophila Genome into Physical Domains  Chunhui Hou, Li Li, Zhaohui S.
Yan Xia, Axel W. Fischer, Pedro Teixeira, Brian Weiner, Jens Meiler 
Deciphering the Deformation Modes Associated with Function Retention and Specialization in Members of the Ras Superfamily  Francesco Raimondi, Modesto.
Dynamic Shape Synthesis in Posterior Inferotemporal Cortex
The Atomistic Mechanism of Conformational Transition in Adenylate Kinase: A TEE-REX Molecular Dynamics Study  Marcus B. Kubitzki, Bert L. de Groot  Structure 
Volume 24, Issue 1, Pages (January 2016)
Volume 22, Issue 4, Pages (April 2014)
Vilas Menon, Brinda K. Vallat, Joseph M. Dybas, Andras Fiser  Structure 
Volume 23, Issue 4, Pages (April 2015)
The Postsaccadic Unreliability of Gain Fields Renders It Unlikely that the Motor System Can Use Them to Calculate Target Position in Space  Benjamin Y.
Reliability of Assessment of Protein Structure Prediction Methods
Matthew R. Roesch, Adam R. Taylor, Geoffrey Schoenbaum  Neuron 
XLF Regulates Filament Architecture of the XRCC4·Ligase IV Complex
An Efficient Null Model for Conformational Fluctuations in Proteins
Unfolding Barriers in Bacteriorhodopsin Probed from the Cytoplasmic and the Extracellular Side by AFM  Max Kessler, Hermann E. Gaub  Structure  Volume.
Presentation transcript:

Genome Pool Strategy for Structural Coverage of Protein Families Lukasz Jaroszewski, Lukasz Slabinski, John Wooley, Ashley M. Deacon, Scott A. Lesley, Ian A. Wilson, Adam Godzik  Structure  Volume 16, Issue 11, Pages 1659-1667 (November 2008) DOI: 10.1016/j.str.2008.08.018 Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 1 Crystallization Probability (A) The probability of crystallization is shown as a function of sequence identity to the closest crystallized, homologous target (see Experimental Procedures). Each protein was assigned to the appropriate bin, according to its distance (as measured by sequence identity) to the closest crystallized homolog, and to another bin, according to its distance (again measured by sequence identity) to the closest homolog that failed to crystallize. The bins correspond to the following ranges of sequence identity: 99%–90%, 89%–60%, 59%–50%, 49%–40%, 39%–30%, and 29%–20% (the second bin is larger because smaller bins did not amass sufficient data). The crystallization successes and failures were then counted for each bin, and the success rate was calculated. In each bin, the number of crystallized proteins is shown as a white bar, and the number of proteins that failed to crystallize is shown as a gray bar. The success rate (right vertical axis) was calculated directly from the histograms as a percentage of crystallized targets per bin and is shown as a black line. (B) The probability of crystallization shown as a function of sequence identity to the closest homologous target that failed to crystallize. Prepared as in panel A (see Experimental Procedures). (C) The probability of crystallization shown as a function of two variables: sequence identity to the closest homologous target that crystallized and the sequence identity to the closest homolog that failed to crystallize. Panels A and B are one-dimensional projections of the figure shown here; see the detailed explanation in panel A and in Experimental Procedures. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 2 Distribution of Protein Crystallization Feasibility Classes in Known Microbial Genomes Genomes of thermophilic organisms, host-associated organisms, and others are shown separately. Inside each group, genomes are sorted by the percentage of proteins in the very difficult class (magenta graph). Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 3 Distribution of Solved Structures in Protein Families as a Function of a Percentage of Very Difficult Targets The families were first sorted by the percentage of the very difficult targets (crystallizability class 5) and then split into 6 bins of 500 families corresponding to different levels of difficulty. After sorting, the first bin contains families with the lowest percentage of very difficult targets, and the last bin contains families consisting almost exclusively of very difficult targets; that is, akin to the 5 relative scoring classes from optimal (green) to very difficult (magenta) as colored on the graph. The distributions of crystallizability classes (green to magenta) are shown for all protein families (left y axis). The normalized average number of structures per protein family has been calculated for each bin (right y axis). By using normalization, we are taking into account differences in family sizes—the average number of solved structures per family has been multiplied by the ratio of the average family size in all 5 bins to the average family size in a given bin. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 4 Increasing Coverage of Known PfamA Families from Sequencing of Microbial Genomes The color coding reflects the different crystallizability scoring classes. The green curve indicates the number of PfamA families with at least one target in the optimal crystallizability class; the light-green curve is the cumulative number of PfamA families with members in the two top crystallizability classes (optimal and suboptimal); the yellow curve represents the three top classes; and the red curve shows all but the fifth crystallizability class (very difficult). The magenta graph shows the number of PfamA families covered by proteins from all crystallization classes (optimal to very difficult). The differences and transitions for one color to the next then indicate the sequential additions of Pfam families covered from considering optimal through to very difficult in 5 steps of the classifications . The statistics are shown separately for all PfamA families (A) and for families that still do not contain any solved structures (B). As might be anticipated, there are fewer optimal targets and more very difficult targets in Pfam families with no solved structures. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 5 Distributions of Parameters of Protein Sequences Distributions of parameters describing various features (length, gravy index, pI, instability index, length of disordered fragment, and percentage of coil structure) of protein sequences calculated for full sequences of microbial members of PfamA families with at least one solved structure (green graphs), full sequences of all solved structures from PfamA families (blue graphs), sequences of actual constructs of solved structures from PfamA families (black graphs), and full sequences of microbial members of PfamA families without any solved structures (red graphs). For more details, see Results and Discussion. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions