CommonCoot Common Coot (Fulica atra) (Fulica atra)

Slides:



Advertisements
Similar presentations
CCP4 Molecular Graphics (CCP4MG)
Advertisements

Automated phase improvement and model building with Parrot and Buccaneer Kevin Cowtan
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine.
Overview of Key ICM Features NIBR - Emeryville July
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Structure Validation using Coot Paul Emsley Mar 2007 York University of York.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
2. Modeling of small systems Building the model What is the optimal conformation of a molecule? What is the relative energy of a given conformation? What.
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
eHiTS Score Darryl Reid, Zsolt Zsoldos, Bashir S. Sadjad, Aniko Simon, The next stage in scoring function evolution: a new statistically.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Recommendations and Questions wwPDB/CCDC/D3R Ligand Validation Workshop Center for Integrative Proteomics Research, Rutgers 7/30-31/2015 Group D, Academic.
Module 2: Structure Based Ph4 Design
Bringing Structure to Biology: Small Molecules and the PDBe
CCP4mg Liz Potterton, Stuart McNicholas, Martin Noble, Jan Gruber.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
Kevin Cowtan, DevMeet CCP4 Wiki ccp4wiki.org Maintainer: YOU.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Coot Tools for Model Building and Validation
X-ray Validation Package Present status Swanand Gore PDBe D&A meeting : 21-Oct-2010.
Applied common sense The why, what and how of validation (and what EM can learn of X-ray) Gerard J. Kleywegt Protein Data Bank in Europe EMBL-EBI, Cambridge,
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
Ligand fitting and Validation with Coot Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
SR Users Meeting 10-11th September 2003 CCP4 Release 5.0 Peter Briggs CCP4/CCLRC Daresbury Laboratory.
Refinement is the process of adjusting an atomic model to:
Forward and inverse kinematics in RNA backbone conformations By Xueyi Wang and Jack Snoeyink Department of Computer Science UNC-Chapel Hill.
RAPPER Nick Furnham Blundell Group – Department of Biochemistry Cambridge University UK
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Structure Visualization
Common Coot (Fulica atra) (Fulica atra) Common Coot (Fulica atra) (Fulica atra)
Common Coot (Fulica atra) (Fulica atra). More building with Coot クート オオバン.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Introduction to Coot Paul McEwan Ph.D. Evotec.
PDBe Protein Interfaces, Surfaces and Assemblies
Computational Structure Prediction
Common Coot (Fulica atra).
Refinement procedure for native structure
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Complete automation in CCP4 What do we need and how to achieve it?
Protein Structure Prediction and Protein Homology modeling
Reduce the need for human intervention in protein model building
CCP4 from a user perspective
Crystal structure of vancomycin
1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?
Conformation Dependence of Backbone Geometry in Proteins
Goals for Today Introduce automated refinement and validation.
Volume 25, Issue 11, Pages e3 (November 2017)
Progress Report in REFMAC
Goals for Today Introduce automated refinement and validation.
Yvonne Groemping, Karine Lapouge, Stephen J. Smerdon, Katrin Rittinger 
Crystal structure of vancomycin
Version 5.3 From SMILE string to dictionary (LIBCHECK): Now coot uses it Segment id is now used Automatic adjustment for weights Improved bond order extraction.
Volume 23, Issue 3, Pages (March 2016)
Supertertiary Structure of the MAGUK Core from PSD-95
The basis for K-Ras4B binding specificity to protein farnesyl-transferase revealed by 2 Å resolution ternary complex structures  Stephen B Long, Patrick.
Human glucose-6-phosphate dehydrogenase: the crystal structure reveals a structural NADP+ molecule and provides insights into enzyme deficiency  Shannon.
Note for 2019 If you get positive peaks on the sulfurs after phenix.refine, try setting all the B-factors to a constants, such as 5.00 Å2 or Å2.
Peter König, Rafael Giraldo, Lynda Chapman, Daniela Rhodes  Cell 
Presentation transcript:

CommonCoot Common Coot (Fulica atra) (Fulica atra)

Ligands and validation クート オオバン

Ligands and validation with Coot Bernhard Lohkamp Karolinska Institutet December 2011 OIST

Ligands in Coot Fitting ligand Fitting ligand Ligand validation Ligand validation Ligand representation Ligand representation Ligand builder Ligand builder

Ligand Overlay Algorithm and Code by Eugene Krissinel Algorithm and Code by Eugene Krissinel Tries to overlay different ligands/monomers by graph matching Tries to overlay different ligands/monomers by graph matching Useful for “database” ligands where atom names are not selected by hand Useful for “database” ligands where atom names are not selected by hand Has been used as the basis of the function which “mutates” residues to alternative monomer types Has been used as the basis of the function which “mutates” residues to alternative monomer types e.g. phosphorylation e.g. phosphorylation

Restraint Editor

Displaying LINKs

Ligand validation

Ligand validation Why validate? structures in PDB structures in PDB 'ligands' in structures 'ligands' in structures Sulphate ions in 1DW9 Sulphate ions in 1DW A resolution 1.65 A resolution R/Rfree 0.15/0.19 R/Rfree 0.15/0.19

Ligand validation How to validate ligand geometry? How to validate ligand geometry? Compare observed structure to restraints Coot: Validate->Geometry analysis Compare observed structure to restraints Coot: Validate->Geometry analysis But what if parametrization (dictionary) is wrong? But what if parametrization (dictionary) is wrong?

How to validate ligand geometry? QM QM CPU hungry CPU hungry In vaccuo In vaccuo Low energy ≠ (?) bound ligand Low energy ≠ (?) bound ligand PDB (e.g. ValLigURL) PDB (e.g. ValLigURL) Good cofactor structures Good cofactor structures Less useful for novel ligands Less useful for novel ligands (occ.) questionable quality (occ.) questionable quality Cambridge Structural Database (CSD) Cambridge Structural Database (CSD) Using small molecule geometries Using small molecule geometries e.g. Mogul e.g. Mogul

Ligand representation

Bond order representation from dictionary files Bond order representation from dictionary files

Surfaces Partial charges Partial charges

Surfaces Transparent surfaces – surface complementarity Transparent surfaces – surface complementarity

Binding mode analysis Binding site highlighting Binding site highlighting Isolated molprobity dots Isolated molprobity dots

Binding mode analysis Binding site highlighting Binding site highlighting Isolated molprobity dots Isolated molprobity dots

Ligand environment 2D ligand pocket layout (ligplot, poseview) 2D ligand pocket layout (ligplot, poseview) Can we do better?

Ligand environment layout Binding pocket residues Binding pocket residues Interactions Interactions Substitution contour Substitution contour Solvent accessible halos Solvent accessible halos Work in progress!!!!!

Ligand environment layout Considerations Considerations 2D placement and distances should reflect 3D metrics 2D placement and distances should reflect 3D metrics As much as possible As much as possible Residues should not overlap the ligand Residues should not overlap the ligand Residues should not overlap each other Residues should not overlap each other H-bonded residues should be close to atoms to which they are bonded H-bonded residues should be close to atoms to which they are bonded Etc. Etc. c.f. Clark & Labute (2007); poseview c.f. Clark & Labute (2007); poseview Work in progress

Ligand environment layout Initial residue placement Initial residue placement

Ligand environment layout Residue position minimization Residue position minimization

Solvent exposure calculation Identification of solvent accessible atoms Identification of solvent accessible atoms Different from substitution contour Different from substitution contour

Ligand futures Make ligand builder (and environment layout) available Make ligand builder (and environment layout) available Interface to JLigand Interface to JLigand Ligand validation with CSD (Mogul or similar) Ligand validation with CSD (Mogul or similar)

Validation

Refinement Validation External e.g. REFMAC Internal Internal External e.g. MolProbity Feature Integration

What is Validation? Comparison of various aspects of the model with pre-conceived notions of “good quality” Comparison of various aspects of the model with pre-conceived notions of “good quality” Includes unrestrained and restrained criteria Includes unrestrained and restrained criteria Many aspects of validation overlap with refinement and model-building Many aspects of validation overlap with refinement and model-building Global and local indicators Global and local indicators

Why Validate? Model-building is error-prone Model-building is error-prone (although automated methods seem to do better) (although automated methods seem to do better) Map interpretation is subjective Map interpretation is subjective Someone else did the model-building Someone else did the model-building The model was built several years ago The model was built several years ago and the notion of “good quality” has changed and the notion of “good quality” has changed Deposition requires validation Deposition requires validation

A “good” model Makes statistical sense Makes statistical sense The reciprocal space representation agrees tolerably well with the observations (R-factor) The reciprocal space representation agrees tolerably well with the observations (R-factor) No meaningful difference map peaks (no under- or over-fitting) No meaningful difference map peaks (no under- or over-fitting)

A “good” model Makes statistical sense Makes statistical sense The reciprocal space representation agrees tolerably well with the observations (R-factor) The reciprocal space representation agrees tolerably well with the observations (R-factor) No meaningful difference map peaks (no under- or over-fitting) No meaningful difference map peaks (no under- or over-fitting) Makes chemical sense Makes chemical sense Model geometry is consistent with the restraints Model geometry is consistent with the restraints Ramachandran Plot has less than 1% outliers Ramachandran Plot has less than 1% outliers A good clashscore A good clashscore

A “good” model Makes statistical sense Makes statistical sense The reciprocal space representation agrees tolerably well with the observations (R-factor) The reciprocal space representation agrees tolerably well with the observations (R-factor) No meaningful difference map peaks (no under- or over-fitting) No meaningful difference map peaks (no under- or over-fitting) Makes chemical sense Makes chemical sense Model geometry is consistent with the restraints Model geometry is consistent with the restraints Ramachandran Plot has less than 1% outliers Ramachandran Plot has less than 1% outliers A good clashscore A good clashscore Make physical sense Make physical sense Crystal packing and contacts Crystal packing and contacts

A “good” model Makes statistical sense Makes statistical sense The reciprocal space representation agrees tolerably well with the observations (R-factor) The reciprocal space representation agrees tolerably well with the observations (R-factor) No meaningful difference map peaks (no under- or over-fitting) No meaningful difference map peaks (no under- or over-fitting) Makes chemical sense Makes chemical sense Model geometry is consistent with the restraints Model geometry is consistent with the restraints Ramachandran Plot has less than 1% outliers Ramachandran Plot has less than 1% outliers A good clashscore A good clashscore Make physical sense Make physical sense Crystal packing and contacts Crystal packing and contacts Makes biological sense Makes biological sense Residues in chemically sensible environment Residues in chemically sensible environment Is consistent (on the whole) with external biochemistry observations (active site residues) Is consistent (on the whole) with external biochemistry observations (active site residues)

Quick Bayes Bayes Eq: Bayes Eq: Pr( model | data )  Pr( data | model ) * Pr( model ) Pr( model | data )  Pr( data | model ) * Pr( model ) Pr( data | model ) is also called the model likelihood, L M ( model | data ) Pr( data | model ) is also called the model likelihood, L M ( model | data )

Validation Tools - Pr(model) Ramachandran Plot Ramachandran Plot Kleywegt Plot (NCS differences) Kleywegt Plot (NCS differences) Geometry Analysis Geometry Analysis Peptide  Analysis Peptide  Analysis Temperature Factor Analysis Temperature Factor Analysis Rotamer Analysis Rotamer Analysis Clashes Clashes

Validation Tools - Pr(data|model) Density Fit Analysis Density Fit Analysis Difference Map Peaks Difference Map Peaks Variance analysis at water positions Variance analysis at water positions Unmodelled blobs Unmodelled blobs

Find difference map peaks

Check/Delete Waters

Difference Map Sampling Detects “Anomalous” Waters

Find unmodelled blobs

Pr(model)... Ramachandran Plot Ramachandran Plot Kleywegt Plot (NCS differences) Kleywegt Plot (NCS differences) Geometry Analysis Geometry Analysis Peptide  Analysis Peptide  Analysis Temperature Factor Analysis Temperature Factor Analysis Rotamer Analysis Rotamer Analysis Sequence analysis Sequence analysis Clashes Clashes

Ramachandran Plot for residues with CB

Ramachandran Plot for GLY

Ramachandran Plot for PRO

Top500-based distribution

Not all outliers are bad

Kleywegt Plots [*] [*] Named by George Sheldrick

Peptide  Needed to check the planarity of the peptide link Needed to check the planarity of the peptide link At low resolutions it is possible to give the protein lots of (too much) freedom to optimize the fit to the density At low resolutions it is possible to give the protein lots of (too much) freedom to optimize the fit to the density Can accidentally create CIS peptides Can accidentally create CIS peptides When discovered they are easily reconverted using the CIS TRANS peptide tool When discovered they are easily reconverted using the CIS TRANS peptide tool Less accidents happen when peptide plane restraints are applied Less accidents happen when peptide plane restraints are applied

B-factor variance

Rotamers Side-chains have certain preferred combinations of torsions round their rotatable bonds Side-chains have certain preferred combinations of torsions round their rotatable bonds An analysis (batched around the staggered conformations) will give rotamer occurrence An analysis (batched around the staggered conformations) will give rotamer occurrence

Torsion-based Validation

Not all outliers are bad Correct rotamer(?!)

Sequence validation Assign sequence (Extensions  Dock sequence  Associate sequence) Assign sequence (Extensions  Dock sequence  Associate sequence) Sequence alignment of model to given sequence (Validate  Alignment vs PIR) Sequence alignment of model to given sequence (Validate  Alignment vs PIR)

More Validation Pr(model) Coot has interface to Molprobity Coot has interface to Molprobity (Molprobity is the widely regarded as the best model validation suite) (Molprobity is the widely regarded as the best model validation suite) Uses identical Ramachandran plot Uses identical Ramachandran plot Uses identical Rotamer library Uses identical Rotamer library Coot reads probe dots directly Coot reads probe dots directly

Molprobity’s Reduce & Probe

Other Programs Molprobity Suite Molprobity Suite molprobity.biochem.duke.edu molprobity.biochem.duke.edu WHATCHECK WHATCHECK VERIFY-3D VERIFY-3D

Acknowledgements Paul Emsley Paul Emsley Kevin Cowtan Kevin Cowtan Eleanor Dodson Eleanor Dodson Keith Wilson Keith Wilson Libraries, dictionaries Libraries, dictionaries Alexei Vagin, Eugene Krissinel, Stuart McNicholas Alexei Vagin, Eugene Krissinel, Stuart McNicholas Dunbrack, Richardsons Dunbrack, Richardsons Coot Builders and Testers Coot Builders and Testers William Scott, Ezra Peisach William Scott, Ezra Peisach York YSBL, Dundee, Glasgow (early adopters) York YSBL, Dundee, Glasgow (early adopters) Coot Mailing List subscribers Coot Mailing List subscribers Google: Coot or for WinCoot