Ligand fitting and Validation with Coot Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)

Slides:



Advertisements
Similar presentations
Kevin Cowtan, CCP4 March Pirate applications... Pirate:phase improvement software Brigantine:bias removal.
Advertisements

CCP4 Molecular Graphics (CCP4MG)
Quality of Protein Crystal Structures in the PDB Eric. N Brown, Lokesh Gakhar and S. Ramaswamy.
Automated phase improvement and model building with Parrot and Buccaneer Kevin Cowtan
Crystals are made from very large numbers of small units: Unit cells. The unit cell may contain more than one protein. The packing of the unit cells gives.
Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Short fast history of protein design Site-directed mutagenesis -- protein engineering (J. Wells, 1980's) Coiled coils, helix bundles (W. DeGrado, 1980's-90's)
Structure Validation using Coot Paul Emsley Mar 2007 York University of York.
An overview of amino acid structure Topic 2. Biomacromolecule A naturally occurring substance of large molecular weight e.g. Protein, DNA, lipids etc.
Structure Outline Solve Structure Refine Structure and add all atoms
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
The TEXTAL System for Automated Model Building Thomas R. Ioerger Texas A&M University.
Seminar series 2 Protein structure validation. In 't verleden ligt het heden; in 't nu, wat worden zal. The past: Linus Pauling ‘Inventor’ of helix and.
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
3. Crystals What defines a crystal? Atoms, lattice points, symmetry, space groups Diffraction B-factors R-factors Resolution Refinement Modeling!
Automated Crystal Structure Validation Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Platon Workshop Chicago,
Ligands, dictionary and refinement Garib N Murshudov York Structural Biology Laboratory University of York.
Don't fffear the buccaneer Kevin Cowtan, York. ● Map simulation ⇨ A tool for building robust statistical methods ● 'Pirate' ⇨ A new statistical phase improvement.
Structure validation Everything that can go wrong, will go wrong. Everything that could go wrong has gone wrong. Especially with something as complicated.
Basics of protein structure and stability III: Anatomy of protein structure Biochem 565, Fall /29/08 Cordes.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Proteins: Levels of Protein Structure Conformation of Peptide Group
Two parts to successful model building BUILDING TOOLS –how to use Coot –Initiate trace of protein chain (“Place helix here”) –Test sidechain assignments.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Recommendations and Questions wwPDB/CCDC/D3R Ligand Validation Workshop Center for Integrative Proteomics Research, Rutgers 7/30-31/2015 Group D, Academic.
CCP4mg Liz Potterton, Stuart McNicholas, Martin Noble, Jan Gruber.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
COMPARATIVE or HOMOLOGY MODELING
Kevin Cowtan, DevMeet CCP4 Wiki ccp4wiki.org Maintainer: YOU.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Data quality and model parameterisation Martyn Winn CCP4, Daresbury Laboratory, U.K. Prague, April 2009.
Coot Tools for Model Building and Validation
X-ray Validation Package Present status Swanand Gore PDBe D&A meeting : 21-Oct-2010.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
Molecular visualization
Overview of MR in CCP4 II. Roadmap
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
SR Users Meeting 10-11th September 2003 CCP4 Release 5.0 Peter Briggs CCP4/CCLRC Daresbury Laboratory.
Crystallography -- Lecture 22 Refinement and Validation.
Bioinformatics 2 -- lecture 9
Refinement is the process of adjusting an atomic model to:
Forward and inverse kinematics in RNA backbone conformations By Xueyi Wang and Jack Snoeyink Department of Computer Science UNC-Chapel Hill.
RAPPER Nick Furnham Blundell Group – Department of Biochemistry Cambridge University UK
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
Structure Visualization
Common Coot (Fulica atra) (Fulica atra) Common Coot (Fulica atra) (Fulica atra)
CommonCoot Common Coot (Fulica atra) (Fulica atra)
Common Coot (Fulica atra) (Fulica atra). More building with Coot クート オオバン.
Introduction to Coot Paul McEwan Ph.D. Evotec.
Computational Structure Prediction
Common Coot (Fulica atra).
Refinement procedure for native structure
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Complete automation in CCP4 What do we need and how to achieve it?
Reduce the need for human intervention in protein model building
CCP4 from a user perspective
1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?
A Smoothed Backbone-Dependent Rotamer Library for Proteins Derived from Adaptive Kernel Density Estimates and Regressions  Maxim V. Shapovalov, Roland L.
Conformation Dependence of Backbone Geometry in Proteins
Goals for Today Introduce automated refinement and validation.
Volume 14, Issue 2, Pages (February 2006)
Goals for Today Introduce automated refinement and validation.
Volume 19, Issue 10, Pages (October 2011)
Note for 2019 If you get positive peaks on the sulfurs after phenix.refine, try setting all the B-factors to a constants, such as 5.00 Å2 or Å2.
Presentation transcript:

Ligand fitting and Validation with Coot Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)

Fitting Ligands

REFMAC Monomer Library chem_comp_bond loop_ _chem_comp_bond.comp_id _chem_comp_bond.atom_id_1 _chem_comp_bond.atom_id_2 _chem_comp_bond.type _chem_comp_bond.value_dist _chem_comp_bond.value_dist_esd ALA N H single ALA N CA single ALA CA HA single ALA CA CB single ALA CB HB1 single ALA CB HB2 single ALA CB HB3 single ALA CA C single ALA C O double

REFMAC Monomer Library chem_comp_tor loop_ _chem_comp_tor.comp_id _chem_comp_tor.id _chem_comp_tor.atom_id_1 _chem_comp_tor.atom_id_2 _chem_comp_tor.atom_id_3 _chem_comp_tor.atom_id_4 _chem_comp_tor.value_angle _chem_comp_tor.value_angle_esd _chem_comp_tor.period TRP chi1 N CA CB CG TRP chi2 CA CB CG CD chi1 chi2

Ligand Fitting c.f. Oldfield (2001) Acta Cryst. D X-LIGAND c.f. Oldfield (2001) Acta Cryst. D X-LIGAND Somewhat different torsion search algorithm Somewhat different torsion search algorithm Build in crystal-space Build in crystal-space

Ligand Torsionable Angle Probability from CIF file

Crystal Space Build in “crystal space” Like real-space, but wrapped by crystal symmetry Like “Asteroids” Assures only one real-space representation of map features Build everything only once, No symmetry clashing However, more difficult to calculate real space geometries …such as bonds, torsions

Crystal Space Build in “crystal space” Like real-space, but wrapped by crystal symmetry Like “Asteroids” Assures only one real-space representation of map features Build everything only once No symmetry clashing However, more difficult to calculate real space geometries …such as bonds, torsions

Clipper Map Mapping Clipper maps Clipper maps Appear to be “infinite” Appear to be “infinite” Density value can be queried anywhere in space Density value can be queried anywhere in space

Conformation Idealization Each conformer is passed through the “Regularization” function of Coot Each conformer is passed through the “Regularization” function of Coot Non-bonded terms included Non-bonded terms included Better to have hydrogen atoms on the model Better to have hydrogen atoms on the model Slows things down a good deal… Slows things down a good deal… May not be the best method to explore conformational variability for many rotatable bonds May not be the best method to explore conformational variability for many rotatable bonds

Ligand Overlay Algorithm and Code by Eugene Krissinel Algorithm and Code by Eugene Krissinel Tries to overlay different ligands/monomers by graph matching Tries to overlay different ligands/monomers by graph matching Useful for “database” ligands where atom names are not selected by hand Useful for “database” ligands where atom names are not selected by hand Has been used as the basis of the function which “mutates” residues to alternative monomer types Has been used as the basis of the function which “mutates” residues to alternative monomer types e.g. phosphorylation e.g. phosphorylation

Refinement Validation External e.g. REFMAC Internal Internal External [e.g. MolProbity] Feature Integration

Validation...

What is Validation? Comparison of Various aspects of the model with pre-conceived notions of “good quality” Comparison of Various aspects of the model with pre-conceived notions of “good quality” Includes unrestrained and restrained criteria Includes unrestrained and restrained criteria Many aspects of validation overlap with refinement and model-building Many aspects of validation overlap with refinement and model-building

Why Validate? Model-building is error-prone Model-building is error-prone (although automated methods seem to do better) (although automated methods seem to do better) Someone else did the model-building Someone else did the model-building The model was built several years ago The model was built several years ago and the notion of “good quality” has changed and the notion of “good quality” has changed Deposition requires validation Deposition requires validation

Observations to Parameters Ratio Some typical numbers Some typical numbers to 2Å, reflections to 2Å, reflections 200 residues x 10 (atoms/residue) x 4 params/atom 200 residues x 10 (atoms/residue) x 4 params/atom -> about 2.6 -> about 2.6 To 3Å: To 3Å: Ratio is about 1:1 Ratio is about 1:1 As statisticians, we prefer our models to be parsimonious As statisticians, we prefer our models to be parsimonious Depending on solvent content and the manner in which NCS is handled

A “good” model Makes statistical sense Makes statistical sense The reciprocal space representation agrees tolerably well with the observations (R-factor) The reciprocal space representation agrees tolerably well with the observations (R-factor) No meaningful difference map peaks No meaningful difference map peaks Makes Chemical sense Makes Chemical sense Model Geometry is consistent with the restraints Model Geometry is consistent with the restraints Ramachandran Plot has less than 1% outliers Ramachandran Plot has less than 1% outliers A good clashscore A good clashscore Makes Biological sense Makes Biological sense Residues in chemically sensible environment Residues in chemically sensible environment Is consistent (on the whole) with external biochemistry observations (active site residues) Is consistent (on the whole) with external biochemistry observations (active site residues)

Quick Bayes Bayes Eq: Bayes Eq: Pr( model | data )  Pr( data | model ) * Pr( model ) Pr( model | data )  Pr( data | model ) * Pr( model ) Pr( data | model ) is also called the Likelihood, L( model | data ) Pr( data | model ) is also called the Likelihood, L( model | data )

Validation Tools - Pr(model) Ramachandran Plot Ramachandran Plot Kleywegt Plot (NCS differences) Kleywegt Plot (NCS differences) Geometry Analysis Geometry Analysis Peptide  Analysis Peptide  Analysis Temperature Factor Analysis Temperature Factor Analysis Rotamer Analysis Rotamer Analysis [Clashes] [Clashes]

Rotamers Side-chains have certain preferred combinations of torsions round their rotatable bonds Side-chains have certain preferred combinations of torsions round their rotatable bonds An analysis (batched around the staggered conformations) will give rotamer occurrence An analysis (batched around the staggered conformations) will give rotamer occurrence

Validation Tools - Pr(data|model) Density Fit Analysis Density Fit Analysis Difference Map Peaks Difference Map Peaks Variance analysis at Water Positions Variance analysis at Water Positions Unmodelled blobs Unmodelled blobs

B-factor variance

Chiral Volume Analysis Based on data in the Refmac dictionary Based on data in the Refmac dictionary …was needed because it was possible with Coot to accidentally invert Chiral centres …was needed because it was possible with Coot to accidentally invert Chiral centres e.g. C  s, C  (THR) e.g. C  s, C  (THR) (Easily corrected with the Mutate & Autofit tool) (Easily corrected with the Mutate & Autofit tool) These days we have chiral volume restraints These days we have chiral volume restraints

Check/Delete Waters

Difference Map Sampling Detects “Anomalous” Waters

Torsion-based Validation

Peptide  Needed to check the planarity of the peptide link Needed to check the planarity of the peptide link At low resolutions it is possible to give the protein lots of (too much) freedom to optimize the fit to the density At low resolutions it is possible to give the protein lots of (too much) freedom to optimize the fit to the density Can accidentally create CIS peptides Can accidentally create CIS peptides When discovered they are easily reconverted using the CIS TRANS peptide tool When discovered they are easily reconverted using the CIS TRANS peptide tool Less accidents happen when peptide plane restraints are applied Less accidents happen when peptide plane restraints are applied

Ramachandran Plot for residues with CB

Ramachandran Plot for GLY

Ramachandran Plot for PRO

Top500-based distribution

Kleywegt Plots [*] [*] Named by George Sheldrick

More Validation Pr(model) Coot has interface to Molprobity Coot has interface to Molprobity (Molprobity is the widely regarded as the best model validation suite) (Molprobity is the widely regarded as the best model validation suite) Uses identical Ramachandran plot Uses identical Ramachandran plot Uses identical Rotamer library Uses identical Rotamer library Coot reads probe dots directly Coot reads probe dots directly

Molprobity’s Reduce & Probe

Other Programs Moprobity Suite Moprobity Suite molprobity.biochem.duke.edu molprobity.biochem.duke.edu WHATCHECK WHATCHECK VERIFY-3D VERIFY-3D

Acknowledgements Paul Emsley Paul Emsley Kevin Cowtan Kevin Cowtan Eleanor Dodson Eleanor Dodson Keith Wilson Keith Wilson Libraries, dictionaries Libraries, dictionaries Alexei Vagin, Eugene Krissinel, Stuart McNicholas Alexei Vagin, Eugene Krissinel, Stuart McNicholas Dunbrack, Richardsons Dunbrack, Richardsons Coot Builders and Testers Coot Builders and Testers William Scott, Ezra Peisach William Scott, Ezra Peisach York YSBL, Dundee, Glasgow (early adopters) York YSBL, Dundee, Glasgow (early adopters) Coot Mailing List subscribers Coot Mailing List subscribers or Google: Coot or for WinCoot