NMRQ: A Web Server for the Validation, Comparison and Analysis of Protein Structures Solved by NMR Gary Van Domselaar †, Paul Stothard, Trent Bjorndahl,

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Solving NMR Structures II: Calculation and evaluation What NMR-based (solution) structures look like the NMR ensemble inclusion of hydrogen coordinates.
RECOORD REcalculated COORdinates Database Aart Nederveen Bijvoet Center for Biomolecular Research Utrecht University Jurgen Doreleijers.
Lab Meeting 06/05/20051 NMRQ: Quality Assessment and Validation for Protein Structures Generated by NMR Spectroscopy Gary Van Domselaar
Solving NMR structures II: Calculation and evaluation The NMR ensemble Methods for calculating structures distance geometry, restrained molecular dynamics,
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Modelling Workshop - Some Relevant Questions Prof. David Jones University College London Where are we now? Where are we going? Where should.
Modeling of protein turns and derivation of NMR parameters related to turn structure Megan Chawner BRITE REU Program Advisor: Dr. Dimitrios Morikis Department.
Separate multivariate observations
Protein Structure Prediction and Analysis
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Evaluation of Structure Quality Using RCSB PDB Tools Kyle Burkhardt, Lead Data Annotator The RCSB PDB at Rutgers University.
Representations of Molecular Structure: Bonds Only.
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
X-ray Validation Package Present status Swanand Gore PDBe D&A meeting : 21-Oct-2010.
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Jürgen Sühnel Supplementary Material: 3D Structures of Biological Macromolecules Exercise 1:
Lab Meeting 10/08/20041 SuperPose: A Web Server for Automated Protein Structure Superposition Gary Van Domselaar October.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
Computational Structure Prediction
Volume 6, Issue 5, Pages (May 1998)
Elementary Statistics
Lecture Slides Elementary Statistics Thirteenth Edition
Prediction of Protein Structure and Function on a Proteomic Scale
The Contribution of Entropy, Enthalpy, and Hydrophobic Desolvation to Cooperativity in Repeat-Protein Folding  Tural Aksel, Ananya Majumdar, Doug Barrick 
Anatomy of an Amyloidogenic Intermediate
Volume 14, Issue 9, Pages (September 2006)
Solution Structure of a Cyanovirin-N:Manα1-2Manα Complex
Backbone Dynamics of the 18
Giovanni Settanni, Antonino Cattaneo, Paolo Carloni 
Volume 18, Issue 11, Pages (November 2010)
Barley lipid-transfer protein complexed with palmitoyl CoA: the structure reveals a hydrophobic binding site that can expand to fit both large and small.
Ab Initio Structure Determination from Electron Microscopic Images of Single Molecules Coexisting in Different Functional States  Dominika Elmlund, Ralph.
Richard C. Page, Sanguk Kim, Timothy A. Cross  Structure 
Clare-Louise Towse, Steven J. Rysavy, Ivan M. Vulovic, Valerie Daggett 
Volume 7, Issue 3, Pages (September 2016)
Volume 19, Issue 10, Pages (October 2011)
Volume 14, Issue 7, Pages (February 2016)
Increased Reliability of Nuclear Magnetic Resonance Protein Structures by Consensus Structure Bundles  Lena Buchner, Peter Güntert  Structure  Volume.
Volume 21, Issue 10, Pages (October 2013)
Volume 16, Issue 5, Pages (May 2008)
Volume 14, Issue 9, Pages (September 2006)
A Conformational Switch in the CRIB-PDZ Module of Par-6
by T. M. Jacobs, B. Williams, T. Williams, X. Xu, A. Eletsky, J. F
Richard C. Page, Sanguk Kim, Timothy A. Cross  Structure 
Volume 20, Issue 3, Pages (March 2012)
Volume 20, Issue 2, Pages (February 2012)
Volume 21, Issue 6, Pages (June 2013)
Zheng Liu, Fei Guo, Feng Wang, Tian-Cheng Li, Wen Jiang  Structure 
Volume 85, Issue 4, Pages (October 2003)
Volume 19, Issue 1, Pages (January 2011)
Rita Pancsa, Daniele Raimondi, Elisa Cilia, Wim F. Vranken 
Structural Flexibility of CaV1. 2 and CaV2
Volume 10, Issue 2, Pages (February 2002)
Unmasking the Annexin I Interaction from the Structure of Apo-S100A11
Volume 19, Issue 7, Pages (July 2011)
Volume 6, Issue 5, Pages (May 1998)
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Vilas Menon, Brinda K. Vallat, Joseph M. Dybas, Andras Fiser  Structure 
Volume 1, Issue 5, Pages R95-R106 (October 1996)
Solution Structure of the Proapoptotic Molecule BID
Backbone Dynamics of the 18
Volume 15, Issue 6, Pages (June 2007)
Presentation transcript:

NMRQ: A Web Server for the Validation, Comparison and Analysis of Protein Structures Solved by NMR Gary Van Domselaar †, Paul Stothard, Trent Bjorndahl, Steve Neal, Mark Berjanskii, and David S. Wishart ‡ Department of Computing Science and Biological Sciences University of Alberta Edmonton AB T6E 2E9 † ‡ Chemical Shift Report N MRQ uses the model coordinates in the ensemble to predict the backbone chemical shifts, and compares the predictions to the supplied experimental chemical shift data. NMRQ presents the predicted-vs-observed data as a scatterplot for each model. (a) Cumulative and average chemical shift scatterplots are reported for the ensemble. Outliers indicate possible misassignments. NMRQ compares the correlation coefficient to a database of high-quality NMR structures to provide the chemical shift 'R-factor': a measure of the quality of the fit between model and experimental data. (b) A plot comparing Chemical Shift Index with secondary structure. Chemical Shift Report N MRQ uses the model coordinates in the ensemble to predict the backbone chemical shifts, and compares the predictions to the supplied experimental chemical shift data. NMRQ presents the predicted-vs-observed data as a scatterplot for each model. (a) Cumulative and average chemical shift scatterplots are reported for the ensemble. Outliers indicate possible misassignments. NMRQ compares the correlation coefficient to a database of high-quality NMR structures to provide the chemical shift 'R-factor': a measure of the quality of the fit between model and experimental data. (b) A plot comparing Chemical Shift Index with secondary structure. NOE Report (Under Development) NMRQ uses Aqua [4] to report NOE violations, NOEs/Residue, and NOE completeness. NMRQ also identifies the long-range NOEs in the well-ordered regions of the ensemble. These NOEs are considered to be the most important for folding the protein structure. The 'expected important NOEs' are compared to the observed NOEs and a quality score derived from the degree of their agreement. NOE Report (Under Development) NMRQ uses Aqua [4] to report NOE violations, NOEs/Residue, and NOE completeness. NMRQ also identifies the long-range NOEs in the well-ordered regions of the ensemble. These NOEs are considered to be the most important for folding the protein structure. The 'expected important NOEs' are compared to the observed NOEs and a quality score derived from the degree of their agreement. References 1.Lovell SC, Davis IW, Arendall WB 3rd, de Bakker PI, Word JM, Prisant MG, Richardson JS, and Richardson DC (2003) Proteins 50: Nederveen AJ, Doreleijers JF, Vranken W, Miller Z, Spronk CA, Nabuurs SB, Guntert P, Livny M, Markley JL, Nilges M, Ulrich EL, Kaptein R, and Bonvin AM (2005) Proteins 59: Morris AL, MacArthur MW, Hutchinson EG and Thornton JM. (1992) Proteins. 12: Laskowski RA, Rullmann JAC., MacArthur MW, Kaptein R and Thornton JM (1996) J. Biomol. NMR 8: References 1.Lovell SC, Davis IW, Arendall WB 3rd, de Bakker PI, Word JM, Prisant MG, Richardson JS, and Richardson DC (2003) Proteins 50: Nederveen AJ, Doreleijers JF, Vranken W, Miller Z, Spronk CA, Nabuurs SB, Guntert P, Livny M, Markley JL, Nilges M, Ulrich EL, Kaptein R, and Bonvin AM (2005) Proteins 59: Morris AL, MacArthur MW, Hutchinson EG and Thornton JM. (1992) Proteins. 12: Laskowski RA, Rullmann JAC., MacArthur MW, Kaptein R and Thornton JM (1996) J. Biomol. NMR 8: Volume/Area Report (a) NMRQ uses Voronoi polyhedra to calculate residue volumes. Unusual volumes suggest possible packing defects. (b) Fractional accessible surface areas reveal energetically unfavorable regions. Volume/Area Report (a) NMRQ uses Voronoi polyhedra to calculate residue volumes. Unusual volumes suggest possible packing defects. (b) Fractional accessible surface areas reveal energetically unfavorable regions. Abstract NMRQ is a generalized web server designed to assess and visualize the quality of NMR-derived protein structures and NMR-derived chemical shift assignments. It performs five key functions including (i) assessment of the quality/correctness and disposition of chemical shift assignments, (ii) assessment of the structure ensemble quality (via superposition and RMSD analysis), (iii) assessment of the correctness of NOE restraints, (iv) assessment of overall structure quality, and (v) the identification of key structural features (via multiple geometric and heuristic measures). It accepts chemical shifts, PDB coordinates and NOE restraint tables as input. From these NMRQ performs extensive, unbiased and fully automated analyses and generates high quality tables, graphs and color images of both collective and individual structure/chemical shift properties and quality scores. Additionally we introduce the concept of the chemical shift R factor and demonstrate how chemical shifts can serve as independent, orthogonal measures of structure quality relative to standard NOE assessments. The NMRQ server is freely available at Abstract NMRQ is a generalized web server designed to assess and visualize the quality of NMR-derived protein structures and NMR-derived chemical shift assignments. It performs five key functions including (i) assessment of the quality/correctness and disposition of chemical shift assignments, (ii) assessment of the structure ensemble quality (via superposition and RMSD analysis), (iii) assessment of the correctness of NOE restraints, (iv) assessment of overall structure quality, and (v) the identification of key structural features (via multiple geometric and heuristic measures). It accepts chemical shifts, PDB coordinates and NOE restraint tables as input. From these NMRQ performs extensive, unbiased and fully automated analyses and generates high quality tables, graphs and color images of both collective and individual structure/chemical shift properties and quality scores. Additionally we introduce the concept of the chemical shift R factor and demonstrate how chemical shifts can serve as independent, orthogonal measures of structure quality relative to standard NOE assessments. The NMRQ server is freely available at Summary Reports NMRQ identifies anomalies, outliers, and other peculiar properties at the residue, model, and ensemble levels. These possible problems are presented as Summary Reports. (a) Residue Summaries: Possible problems are tabulated for each residue. Residue reports also include per-residue plots (per-residue Ramachandran plots, chi-distribution, chi1-chi2 normality plots, etc) and tabular summaries of the various analyses. (b) Model Summaries: The problematic residues for each model are shaded red. The darkness of the shading corresponds to the number of identified problems per residue. The identified anomalies for each problematic residue are tabulated below the model. (c) Ensemble Summaries: Residues with persistent problems, i.e. that appear in a majority of the models, are mapped to a representative model for the ensemble. Model and Ensemble quality are scored by comparing to a database of high quality X-ray structures (Richardson top 500 [1]) and NMR structures (RECOORD [2]). The scores are mapped to equivalent resolution and plotted as scatterplots (in development). Summary Reports NMRQ identifies anomalies, outliers, and other peculiar properties at the residue, model, and ensemble levels. These possible problems are presented as Summary Reports. (a) Residue Summaries: Possible problems are tabulated for each residue. Residue reports also include per-residue plots (per-residue Ramachandran plots, chi-distribution, chi1-chi2 normality plots, etc) and tabular summaries of the various analyses. (b) Model Summaries: The problematic residues for each model are shaded red. The darkness of the shading corresponds to the number of identified problems per residue. The identified anomalies for each problematic residue are tabulated below the model. (c) Ensemble Summaries: Residues with persistent problems, i.e. that appear in a majority of the models, are mapped to a representative model for the ensemble. Model and Ensemble quality are scored by comparing to a database of high quality X-ray structures (Richardson top 500 [1]) and NMR structures (RECOORD [2]). The scores are mapped to equivalent resolution and plotted as scatterplots (in development). Superposition Report NMRQ features a superpositioning algorithm that can automatically identify the well-ordered regions of the ensemble and restrict the superposition to these regions. A table (not shown) reports the C-alpha, backbone, heavy atom, and all-atom RMSDs (to average) for the well- ordered regions and for all residues. (a) The superpositioned ensemble is rendered in stereo-view and coloured by model. A hyperlinked legend facilitates drill-down to the individual models. (b) The ensemble is presented with shading to indicate the degree of local RMSD. (c) A per- residue RMSD plot shows local RMSD variability. The secondary structure overlay can be used to examine the correspondence of secondary structure elements and RMSD. Superposition Report NMRQ features a superpositioning algorithm that can automatically identify the well-ordered regions of the ensemble and restrict the superposition to these regions. A table (not shown) reports the C-alpha, backbone, heavy atom, and all-atom RMSDs (to average) for the well- ordered regions and for all residues. (a) The superpositioned ensemble is rendered in stereo-view and coloured by model. A hyperlinked legend facilitates drill-down to the individual models. (b) The ensemble is presented with shading to indicate the degree of local RMSD. (c) A per- residue RMSD plot shows local RMSD variability. The secondary structure overlay can be used to examine the correspondence of secondary structure elements and RMSD. a a b b c c a a b b c c a a Statistics Report C ombinations of torsion angle, volume, and accessible surface area values are combined to create (a) Stereo/Packing quality plots, and (b) 3D Profile Quality plots. Many other statistics (not shown) are also reported here. Statistics Report C ombinations of torsion angle, volume, and accessible surface area values are combined to create (a) Stereo/Packing quality plots, and (b) 3D Profile Quality plots. Many other statistics (not shown) are also reported here. Statistics a a Torsion Angle Report NMRQ reports on the quality of φ, ψ, χ and ω torsion angles. (a) The phi+psi variability plot highlights the ordered and disordered regions of the ensemble. (b) The Ramachandran plot quality can be assessed using the φ, ψ distributions derived from the Richardson Top 500 [1], RECOORD [2], or Morris et al. [3]. Tables display the residues in disallowed region of the plot. Not shown but included are χ 1 dispersion plots, χ 1 -χ 2 normality plots χ 1 angle favorability plots, and χ 1 -Accessible Surface Area covariance plots. Torsion Angle Report NMRQ reports on the quality of φ, ψ, χ and ω torsion angles. (a) The phi+psi variability plot highlights the ordered and disordered regions of the ensemble. (b) The Ramachandran plot quality can be assessed using the φ, ψ distributions derived from the Richardson Top 500 [1], RECOORD [2], or Morris et al. [3]. Tables display the residues in disallowed region of the plot. Not shown but included are χ 1 dispersion plots, χ 1 -χ 2 normality plots χ 1 angle favorability plots, and χ 1 -Accessible Surface Area covariance plots. b b a a a a b b b b b b