Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.

Slides:



Advertisements
Similar presentations
CCP4 Molecular Graphics (CCP4MG)
Advertisements

Twinning etc Andrey Lebedev YSBL. Data prcessing Twinning test: 1) There is twinning 2) The true spacegroup is one of … 3) Find the true spacegroup at.
Alexander J. Blake, School of Chemistry The University of Nottingham, Nottingham UK Refinement on weak or problematic small molecule data using SHELXL-97.
An introduction to the Rietveld method Angus P. Wilkinson School of Chemistry and Biochemistry Georgia Institute of Technology.
Automated phase improvement and model building with Parrot and Buccaneer Kevin Cowtan
Towards Low Resolution Refinement Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Disorder.
Effects of TLS parameters in Macromolecular Refinement Martyn Winn Daresbury Laboratory, U.K. IUCr99 08/08/99.
Refinement Garib N Murshudov MRC-LMB Cambridge 1.
Refinement of Macromolecular structures using REFMAC5
Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine.
Twinning. Like disorder but of unit cell orientation… –In a perfect single crystal, unit cells are all found in the same orientation. We can consider.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Small Molecule Example – YLID Unit Cell Contents and Z Value
CCP4 workshop: Diamond – 2014 ___________________________________________ Refinement Garib N Murshudov MRC-LMB Cambridge 1.
Structure Outline Solve Structure Refine Structure and add all atoms
A Brief Description of the Crystallographic Experiment
Twinning in protein crystals NCI, Macromolecular Crystallography Laboratory, Synchrotron Radiation Research ANL Title Zbigniew Dauter.
Chem Once an apparently reasonable initial solution has been found, as indicated by a chemically reasonable electron density distribution, it can.
Chem Thermal Ellipsoids Remember that thermal ellipsoids can indicate problems with a refinement even when the R factors seem to indicate a reasonable.
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Don't fffear the buccaneer Kevin Cowtan, York. ● Map simulation ⇨ A tool for building robust statistical methods ● 'Pirate' ⇨ A new statistical phase improvement.
Automated protein structure solution for weak SAD data Pavol Skubak and Navraj Pannu Automated protein structure solution for weak SAD data Pavol Skubak.
Chemometrics Method comparison
Refinement with REFMAC
28 Mar 06Automation1 Overview of developments within CCP4 Generation 1 ccp4i tasks Generation 2 isolated scripts / web service Generation 3 integrated.
Progress report on Crank: Experimental phasing Biophysical Structural Chemistry Leiden University, The Netherlands.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
A Molecular Replacement Pipeline Garib Murshudov Chemistry Department, University of York 
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Data quality and model parameterisation Martyn Winn CCP4, Daresbury Laboratory, U.K. Prague, April 2009.
Coot Tools for Model Building and Validation
Ionic Conductors: Characterisation of Defect Structure Lecture 15 Total scattering analysis Dr. I. Abrahams Queen Mary University of London Lectures co-financed.
TLS MODELLING OF ANISOTROPY IN MACROMOLECULAR REFINEMENT Martyn Winn CCP4, Daresbury Laboratory, U.K. York, April 11th 2002.
Chem Structure Factors Until now, we have only typically considered reflections arising from planes in a hypothetical lattice containing one atom.
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
1. Diffraction intensity 2. Patterson map Lecture
Doug Raiford Lesson 17.  Framework model  Secondary structure first  Assemble secondary structure segments  Hydrophobic collapse  Molten: compact.
Lesson 8 Diffraction by an atom Atomic Displacement Parameters.
X RAY CRYSTALLOGRAPHY. WHY X-RAY? IN ORDER TO BE OBSERVED THE DIMENTIONS OF AN OBJECT MUST BE HALF OF THE LIGHT WAVELENGHT USED TO OBSERVE IT.
Erice CCP4 (refmac) wokshop Garib N Murshudov York Structural Laboratory Chemistry Department University of York 1.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Direct Use of Phase Information in Refmac Abingdon, University of Leiden P. Skubák.
Progress report on Crank: Model building Biophysical Structural Chemistry Leiden University, The Netherlands.
Atomic structure model
Crystallography -- Lecture 22 Refinement and Validation.
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
Refinement is the process of adjusting an atomic model to:
CCP4 Version The most recent version of the CCP4 suite is 4.1, which was released at the end of January 2001, with a minor patch release shortly.
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Complete automation in CCP4 What do we need and how to achieve it?
Introduction to Isomorphous Replacement and Anomalous Scattering Methods Measure native intensities Prepare isomorphous heavy atom derivatives Measure.
Nobel Laureates of X Ray Crystallography
Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands
Goals for Today Introduce automated refinement and validation.
Volume 24, Issue 8, Pages (August 2016)
Progress Report in REFMAC
Goals for Today Introduce automated refinement and validation.
Axel T Brünger, Paul D Adams, Luke M Rice  Structure 
Version 5.3 From SMILE string to dictionary (LIBCHECK): Now coot uses it Segment id is now used Automatic adjustment for weights Improved bond order extraction.
Volume 15, Issue 9, Pages (September 2007)
Garib Murshudov YSBL, Chemistry Department, University of York
Crystal structure description
Presentation transcript:

Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York

Contents 1)Introduction 2)Considerations for refinement 3)Refinement against all data 4)TLS 5)Dictionary and alternative conformations 6)Conclusions

Available refinement programs SHELXL CNS REFMAC5 TNT BUSTER/TNT Phenix.refine RESTRAINT MOPRO

Considerations in refinement Function to optimise (link between data and model) –Should use experimental data –Should be able to handle chemical (e.g bonds) and other (e.g. NCS, structural) information Parameters –Depends on the stage of analysis –Depends on amount and quality of the experimental data Methods to optimise –Depends on stage of analysis: simulated annealing, conjugate gradient, second order (normal matrix, information matrix, second derivatives) –Some methods can give error estimate as a by-product. E.g second order.

Two components of target function Crystallographic target functions have two components: one of them describes the fit of the model parameters into the experimental data and the second describes chemical integrity (restraints). Currently used restraints are: bond lengths, angles, chirals, planes, ncs if available, some torsion angles

Various form of functions SAD function uses observed F+ and F- directly without any preprocessing by a phasing program (It is not available in the current version but will be available soon) MLHL - explicit use of phases with Hendrickson Lattman coefficients Rice - Maximum likelihood refinement without phase information

Shortcomings of using ABCD directly Dependent on where you obtained your Hendrickson- Lattman coefficients Assumes that your prior phase information is independent from your model phases!

Differences between SAD and RICE in wARP * + Refmac Resol. (Å) Anom. atoms Experiment Residues RICE/SAD/FINAL MutS3.046 SeSAD (peak)493/1093/1600 subtilisin1.773 Ca, SSAD6/259/275 thioesterase2.58 SeSAD (infl)300/542/572 gere SeMAD(p/i)43/110/444 cyanase SeMAD (p/i)71/669/1560 thioesterase I BrSAD(peak)35/431/462 * 10 wARP cycles. These results are from Raj Pannu and Pavol Skubak from Leiden

Twin refinement Twin refinement in the new version of refmac is automatic. Only thing you need to do is to add one keyword: TWIN Then the program identifies twin operators and refines twin fractions as well as all other usual parameters. It is better to give intensities for twin refinement. NB: It is not available in the standard version yet.

Map calculation After refinement programs usually give coefficients for two type of maps: 1) 2Fo-Fc type maps. They try to represent the content of the crystal. 2) Fo-Fc type of maps. They try to represent difference between contents of the crystal and current atomic model. Both these maps should be inspected and model should be corrected if necessary. Refmac gives coefficients: 2 m F o - D F c – to represent contents of the crystal m F o –D F c - to represent differences m is the figure of merit (reliability) of the phase of the current reflection and D is related with model error. m depends on each reflection and D depends on resolution If phase information is available then map coefficients correspond to the combined phases.

Parameters Usual parameters (if programs allow it) 1)Positions x,y,z 2)B values – isotropic or anisotropic 3)Occupancy Derived parameters 1)Rigid body positional After molecular replacement Isomorphous crystal (liganded, unliganded, different data) 2)Rigid body of B values – TLS –Useful at the medium and final stages –At low resolution when full anisotropy is impossible 3)Torsion angles

Overall parameters: Scaling There are several options for scaling: 1)Babinet’s bulk solvent assumes that at low resolution solvent and protein contributors are very similar and only difference is overall density and B value. It has the form: k b = 1-k b e(-B b s 2 /4) 2)Mask bulk solvent: Part of the asymmetric unit not occupied by atoms are asigned constant value and Fourier transformation from this part is calculated. Then this contribution is added with scale value to “protein” structure factors. Total structure factor has a form: F tot = F p +s s exp(-B s s 2 /4). 3)The final total structure factor that is scaled has a form: s aniso s protein k b F tot

TLS

TLS groups Rigid groups should be defined as TLS groups. As starting point they could be: subunits or domains. If you use script then default rigid groups are subunits or segments if defined. In ccp4i you should define rigid groups (in the next version default will be subunits). Rigid group could be defined using TLSMD webserver:

Give your pdb file with refined isotrpopic B values

Ideally this plot should have an elbow indicating the number of TLS groups

Alternative conformations and links

Alternative conformations Example from 0.88Å catalase structure:Two conformations of Tyrosine. Ring is clearly in two conformation. To refine it properly CB also needs to be split. It helps adding hydrogen atom on CB and improves restraints in anisotropic U values

Alternative conformation: Example in pdb file ATOM 977 N GLU A N ATOM 978 CA GLU A C ATOM 980 CB AGLU A C ATOM 981 CB BGLU A C ATOM 986 CG AGLU A C ATOM 987 CG BGLU A C ATOM 992 CD AGLU A C ATOM 993 CD BGLU A C ATOM 994 OE1AGLU A O ATOM 995 OE1BGLU A O ATOM 996 OE2BGLU A O ATOM 997 OE2AGLU A O ATOM 998 C GLU A C ATOM 999 O GLU A O Note that pdb is strictly formatted. Every element has its position

Link between residues in double conformation Fluro-modified sugar MAF is in two conformation. One of them is bound to GLU and another one is bound to ligand BEN

Alternative conformation of links: how to handle Description Description of link(s) should be added to the library. When residues make link then each component is usually modified. Description of Link should contain it also PDB LINK C6 BBEN B 1 O1 BMAF S 2 BEN-MAF LINK OE2 AGLU A 320 C1 AMAF S 2 GLU-MAF

Things to look at R factor/Rfree: They should go down during refinement Geometric parameters: rms bond and other. They should be reasonable. For example rms bond should be around 0.02 Map and coordinates using coot Logggraph outputs. That is available on the cpp4i interface

Behaviour of R/Rfree, average Fobs vs resolution should be reasonable. If there is a bump or it has an irregular behaviour then either something is wrong with your data or refinement.

What and when Rigid body: At early stages - after molecular replacement or when refining against data from isomorphous crystals TLS - at medium and end stages of refinement at resolutions up to A (roughly) Anisotropic - At higher resolution towards the end of refinement Adding hydrogens - Higher than 2A but they could be added always Phased refinement - at early and medium stages of refinement SAD - at all stages(?) Twin - always Ligands - as soon as you see them What else?

Conclusions If phases are available they should be used at least at the early and medium stages of refinement Unless there is very good reason not to all resolution should be used in refinement TLS describes overall motion and works well in practice Ligand and link description should be considered very carefully Although there is information about motion of molecule in the TLS parameters they should be used with care