Genome Pool Strategy for Structural Coverage of Protein Families

Slides:

Advertisements

Similar presentations

Scaffold Download free viewer:

Advertisements

Journal of Vision. 2008;8(11):18. doi: / Figure Legend:

The Functional Impact of Alternative Splicing in Cancer

Nir London, Ora Schueler-Furman Structure

Mark S. Blumberg, Cassandra M. Coleman, Ashlynn I. Gerth, Bob McMurray

Volume 112, Issue 7, Pages (April 2017)

Iddo Friedberg, Adam Godzik Structure

Sebastian Meyer, Raimund Dutzler Structure

Volume 22, Issue 2, Pages (February 2014)

Value Representations in the Primate Striatum during Matching Behavior

A Statistical Description of Plant Shoot Architecture

Shaul Druckmann, Dmitri B. Chklovskii Current Biology

Volume 24, Issue 11, Pages (November 2016)

Volume 19, Issue 7, Pages (July 2011)

A Statistical Description of Plant Shoot Architecture

Vincent B. McGinty, Antonio Rangel, William T. Newsome Neuron

Allosteric Effects of the Oncogenic RasQ61L Mutant on Raf-RBD

Volume 111, Issue 2, Pages (July 2016)

A Switching Observer for Human Perceptual Estimation

The Functional Impact of Alternative Splicing in Cancer

Volume 154, Issue 1, Pages (July 2013)

Volume 101, Issue 2, Pages (July 2011)

Volume 24, Issue 11, Pages (November 2016)

Cultural Confusions Show that Facial Expressions Are Not Universal

Andrew D. Bosson, Jesse R. Zamudio, Phillip A. Sharp Molecular Cell

Publication metrics and success on the academic job market

Nicolas Catz, Peter W. Dicke, Peter Thier Current Biology

Volume 15, Issue 9, Pages (September 2007)

Hippocampal “Time Cells”: Time versus Path Integration

Volume 26, Issue 3, Pages e2 (March 2018)

Visually Cued Action Timing in the Primary Visual Cortex

Volume 88, Issue 3, Pages (November 2015)

Single-Unit Responses Selective for Whole Faces in the Human Amygdala

Volume 26, Issue 3, Pages e2 (March 2018)

A Switching Observer for Human Perceptual Estimation

Integration Trumps Selection in Object Recognition

Volume 65, Issue 4, Pages (February 2010)

BOLD fMRI Correlation Reflects Frequency-Specific Neuronal Correlation

Filip Lankaš, Richard Lavery, John H. Maddocks Structure

Multiple Conformations of F-actin

Volume 14, Issue 9, Pages (September 2006)

Between Order and Disorder in Protein Structures: Analysis of “Dual Personality” Fragments in Proteins Ying Zhang, Boguslaw Stec, Adam Godzik Structure

A Scalable Population Code for Time in the Striatum

Volume 20, Issue 2, Pages (February 2012)

Volume 77, Issue 6, Pages (March 2013)

Volume 26, Issue 5, Pages (March 2016)

Volume 3, Issue 6, Pages (November 1998)

Nir London, Ora Schueler-Furman Structure

Mark S. Blumberg, Cassandra M. Coleman, Ashlynn I. Gerth, Bob McMurray

Recognizing Protein Substructure Similarity Using Segmental Threading

Volume 16, Issue 8, Pages (August 2008)

Volume 16, Issue 2, Pages (February 2008)

The Normalization Model of Attention

Unfolding Barriers in Bacteriorhodopsin Probed from the Cytoplasmic and the Extracellular Side by AFM Max Kessler, Hermann E. Gaub Structure Volume.

Atomic-Level Protein Structure Refinement Using Fragment-Guided Molecular Dynamics Conformation Sampling Jian Zhang, Yu Liang, Yang Zhang Structure

Gene Density, Transcription, and Insulators Contribute to the Partition of the Drosophila Genome into Physical Domains Chunhui Hou, Li Li, Zhaohui S.

Yan Xia, Axel W. Fischer, Pedro Teixeira, Brian Weiner, Jens Meiler

Deciphering the Deformation Modes Associated with Function Retention and Specialization in Members of the Ras Superfamily Francesco Raimondi, Modesto.

Dynamic Shape Synthesis in Posterior Inferotemporal Cortex

The Atomistic Mechanism of Conformational Transition in Adenylate Kinase: A TEE-REX Molecular Dynamics Study Marcus B. Kubitzki, Bert L. de Groot Structure

Volume 24, Issue 1, Pages (January 2016)

Volume 22, Issue 4, Pages (April 2014)

Vilas Menon, Brinda K. Vallat, Joseph M. Dybas, Andras Fiser Structure

Volume 23, Issue 4, Pages (April 2015)

The Postsaccadic Unreliability of Gain Fields Renders It Unlikely that the Motor System Can Use Them to Calculate Target Position in Space Benjamin Y.

Reliability of Assessment of Protein Structure Prediction Methods

Matthew R. Roesch, Adam R. Taylor, Geoffrey Schoenbaum Neuron

XLF Regulates Filament Architecture of the XRCC4·Ligase IV Complex

An Efficient Null Model for Conformational Fluctuations in Proteins

Unfolding Barriers in Bacteriorhodopsin Probed from the Cytoplasmic and the Extracellular Side by AFM Max Kessler, Hermann E. Gaub Structure Volume.

Presentation transcript:

Genome Pool Strategy for Structural Coverage of Protein Families Lukasz Jaroszewski, Lukasz Slabinski, John Wooley, Ashley M. Deacon, Scott A. Lesley, Ian A. Wilson, Adam Godzik Structure Volume 16, Issue 11, Pages 1659-1667 (November 2008) DOI: 10.1016/j.str.2008.08.018 Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 1 Crystallization Probability (A) The probability of crystallization is shown as a function of sequence identity to the closest crystallized, homologous target (see Experimental Procedures). Each protein was assigned to the appropriate bin, according to its distance (as measured by sequence identity) to the closest crystallized homolog, and to another bin, according to its distance (again measured by sequence identity) to the closest homolog that failed to crystallize. The bins correspond to the following ranges of sequence identity: 99%–90%, 89%–60%, 59%–50%, 49%–40%, 39%–30%, and 29%–20% (the second bin is larger because smaller bins did not amass sufficient data). The crystallization successes and failures were then counted for each bin, and the success rate was calculated. In each bin, the number of crystallized proteins is shown as a white bar, and the number of proteins that failed to crystallize is shown as a gray bar. The success rate (right vertical axis) was calculated directly from the histograms as a percentage of crystallized targets per bin and is shown as a black line. (B) The probability of crystallization shown as a function of sequence identity to the closest homologous target that failed to crystallize. Prepared as in panel A (see Experimental Procedures). (C) The probability of crystallization shown as a function of two variables: sequence identity to the closest homologous target that crystallized and the sequence identity to the closest homolog that failed to crystallize. Panels A and B are one-dimensional projections of the figure shown here; see the detailed explanation in panel A and in Experimental Procedures. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 2 Distribution of Protein Crystallization Feasibility Classes in Known Microbial Genomes Genomes of thermophilic organisms, host-associated organisms, and others are shown separately. Inside each group, genomes are sorted by the percentage of proteins in the very difficult class (magenta graph). Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 3 Distribution of Solved Structures in Protein Families as a Function of a Percentage of Very Difficult Targets The families were first sorted by the percentage of the very difficult targets (crystallizability class 5) and then split into 6 bins of 500 families corresponding to different levels of difficulty. After sorting, the first bin contains families with the lowest percentage of very difficult targets, and the last bin contains families consisting almost exclusively of very difficult targets; that is, akin to the 5 relative scoring classes from optimal (green) to very difficult (magenta) as colored on the graph. The distributions of crystallizability classes (green to magenta) are shown for all protein families (left y axis). The normalized average number of structures per protein family has been calculated for each bin (right y axis). By using normalization, we are taking into account differences in family sizes—the average number of solved structures per family has been multiplied by the ratio of the average family size in all 5 bins to the average family size in a given bin. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 4 Increasing Coverage of Known PfamA Families from Sequencing of Microbial Genomes The color coding reflects the different crystallizability scoring classes. The green curve indicates the number of PfamA families with at least one target in the optimal crystallizability class; the light-green curve is the cumulative number of PfamA families with members in the two top crystallizability classes (optimal and suboptimal); the yellow curve represents the three top classes; and the red curve shows all but the fifth crystallizability class (very difficult). The magenta graph shows the number of PfamA families covered by proteins from all crystallization classes (optimal to very difficult). The differences and transitions for one color to the next then indicate the sequential additions of Pfam families covered from considering optimal through to very difficult in 5 steps of the classifications . The statistics are shown separately for all PfamA families (A) and for families that still do not contain any solved structures (B). As might be anticipated, there are fewer optimal targets and more very difficult targets in Pfam families with no solved structures. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions

Figure 5 Distributions of Parameters of Protein Sequences Distributions of parameters describing various features (length, gravy index, pI, instability index, length of disordered fragment, and percentage of coil structure) of protein sequences calculated for full sequences of microbial members of PfamA families with at least one solved structure (green graphs), full sequences of all solved structures from PfamA families (blue graphs), sequences of actual constructs of solved structures from PfamA families (black graphs), and full sequences of microbial members of PfamA families without any solved structures (red graphs). For more details, see Results and Discussion. Structure 2008 16, 1659-1667DOI: (10.1016/j.str.2008.08.018) Copyright © 2008 Elsevier Ltd Terms and Conditions