Phasing in Macromolecular Crystallography

Slides:

Advertisements

Similar presentations

Intensities Learning Outcomes By the end of this section you should: understand the factors that contribute to diffraction know and be able to use the.

Advertisements

Phasing Goal is to calculate phases using isomorphous and anomalous differences from PCMBS and GdCl3 derivatives --MIRAS. How many phasing triangles will.

Protein x-ray crystallography

Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,

Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.

Methods: X-ray Crystallography

Overview of the Phase Problem

Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.

M.I.R.(A.S.) S.M. Prince U.M.I.S.T.. The only generally applicable way of solving macromolecular crystal structure No reliance on homologous structure.

M.I.R.(A.S.) S.M. Prince U.M.I.S.T.. The only generally applicable way of solving macromolecular crystal structure No reliance on homologous structure.

X-Ray Crystallography

Bob Sweet Bill Furey Considerations in Collection of Anomalous Data.

Internal – External Order We described symmetry of crystal habit (32 point groups) We also looked at internal ordering of atoms in 3-D structure (230 space.

Chem Single Crystals For single crystals, we see the individual reciprocal lattice points projected onto the detector and we can determine the values.

Experimental Phasing stuff. Centric reflections |F P | |F PH | FHFH Isomorphous replacement F P + F H = F PH FPFP F PH FHFH.

A Brief Description of the Crystallographic Experiment

Twinning in protein crystals NCI, Macromolecular Crystallography Laboratory, Synchrotron Radiation Research ANL Title Zbigniew Dauter.

Hanging Drop Sitting Drop Microdialysis Crystallization Screening.

X-ray Crystallography Kalyan Das. Electromagnetic Spectrum to 10 nm 400 to 700 nm to nm 10 to 400 nm 700 to 10 4 nm X-ray radiation.

Experimental Phasing Andrew Howard ACA Summer School 22 July 2005.

Anomalous Scattering: Theory and Practice Andrew Howard ACA Summer School 29 July 2005 Andrew Howard ACA Summer School 29 July 2005.

Fourier transform. Fourier transform Fourier transform.

19 Feb 2008 Biology 555: Crystallographic Phasing II p. 1 of 38 ProteinDataCrystalStructurePhases Overview of the Phase Problem John Rose ACA Summer School.

In Macromolecular Crystallography Use of anomalous signal in phasing

Analysis of crystal structure x-rays, neutrons and electrons

Automated protein structure solution for weak SAD data Pavol Skubak and Navraj Pannu Automated protein structure solution for weak SAD data Pavol Skubak.

Overview of the Phase Problem

Phasing based on anomalous diffraction Zbigniew Dauter.

CHE (Structural Inorganic Chemistry) X-ray Diffraction & Crystallography lecture 2 Dr Rob Jackson LJ1.16,

Analysis of crystal structure x-rays, neutrons and electrons

Lesson 5 Conditioning the x-ray beam

Patterson Space and Heavy Atom Isomorphous Replacement

In serial femtosecond crystallography (SFX) with hard X-ray free-electron laser as light source, a set of three-dimensional single-crystal diffraction.

The ‘phase problem’ in X-ray crystallography What is ‘the problem’? How can we overcome ‘the problem’?

Chem Patterson Methods In 1935, Patterson showed that the unknown phase information in the equation for electron density:  (xyz) = 1/V ∑ h ∑ k.

Considerations for Protein Crystallography (BT Chapter 18) 1.Growing crystals Usually require 0.5mm in shortest dimension, except if using Synchrotron.

Chem Structure Factors Until now, we have only typically considered reflections arising from planes in a hypothetical lattice containing one atom.

Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.

1. Diffraction intensity 2. Patterson map Lecture

THE PHASE PROBLEM Electron Density

Macromolecular Crystallography and Structural Genomics – Recent Trends Prof. D. Velmurugan Department of Crystallography and Biophysics University of Madras.

Page 1 X-ray crystallography: "molecular photography" Object Irradiate Scattering lens Combination Image Need wavelengths smaller than or on the order.

Lesson 13 How the reciprocal cell appears in reciprocal space. How the non-translational symmetry elements appear in real space How translational symmetry.

X-ray diffraction X-rays discovered in 1895 – 1 week later first image of hand. X-rays have ~ 0.1 – few A No lenses yet developed for x-rays – so no possibility.

What is the problem? How was the problem solved?

Protein Structure Determination Lecture 4 -- Bragg’s Law and the Fourier Transform.

Pattersons The “third space” of crystallography. The “phase problem”

Atomic structure model

X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.

Anomalous Differences Bijvoet differences (hkl) vs (-h-k-l) Dispersive Differences 1 (hkl) vs 2 (hkl) From merged (hkl)’s.

Electron Density Structure factor amplitude defined as: F unit cell (S) = ∫ r  (r) · exp (2  i r · S) dr Using the inverse Fourier Transform  (r) =

Calculation of Structure Factors

Absolute Configuration Types of space groups Non-centrosymmetric Determining Absolute Configuration.

Before Beginning – Must copy over the p4p file – Enter../xl.p4p. – Enter../xl.hkl. – Do ls to see the files are there – Since the.p4p file has been created.

Interpreting difference Patterson Maps in Lab this week! Calculate an isomorphous difference Patterson Map (native-heavy atom) for each derivative data.

X-ray Crystallography Kalyan Das. Electromagnetic Spectrum to 10 nM 400 to 700 nM to nM 10 to 400 nM 700 to 10 4 nM X-ray was discovered.

Fourier transform from r to k: Ã(k) =  A(r) e  i k r d 3 r Inverse FT from k to r: A(k) = (2  )  3  Ã(k) e +i k r d 3 k X-rays scatter off the charge.

Today: compute the experimental electron density map of proteinase K Fourier synthesis  (xyz)=  |F hkl | cos2  (hx+ky+lz -  hkl ) hkl.

Lecture 3 Patterson functions. Patterson functions The Patterson function is the auto-correlation function of the electron density ρ(x) of the structure.

Crystallography : How do you do? From Diffraction to structure…. Normally one would use a microscope to view very small objects. If we use a light microscope.

Lecture 53: X-ray crystallography. Electrons deflect x-rays We try to recreate electron density from the x-ray diffraction pattern Each point in space.

Amyloid Precursor Protein (APP)

Phasing Today’s goal is to calculate phases (ap) for proteinase K using MIRAS method (PCMBS and GdCl3). What experimental data do we need? 1) from native.

An In-Depth Look at the GES Family of Enzymes

Introduction to Isomorphous Replacement and Anomalous Scattering Methods Measure native intensities Prepare isomorphous heavy atom derivatives Measure.

Nobel Laureates of X Ray Crystallography

Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands

S. Takeda, A. Yamashita, K. Maeda, Y. Maeda

r(xyz)=S |Fhkl| cos2p(hx+ky+lz -ahkl)

Presentation transcript:

Phasing in Macromolecular Crystallography What do we do with it now?

Phasing in Macromolecular Crystallography How do we get from spots on a screen to a pretty picture of our protein? BY CALCULATING AMPLITUDES AND PHASES!! Heidi’s Gun4 Data and Stucture

Why Do We Care So Much About Phases?

Diffraction by X-rays h = 1 k = 2 l = 0 Bragg’s Law nλ = 2dsinθ k0 SOURCE DETECTOR Θ k h = 1 k = 2 l = 0 y x z dhkl Bragg’s Law nλ = 2dsinθ

Diffraction by X-rays (h,k,l) = (2,1,0) k k0 SOURCE Crystal

The Phase Problem A PHASE DIFFERENCE!! k0 Φ k

Addition of Waves f1 + f2 + f3 Fs

The Phase Problem imaginary axis The resulting wave that reaches the detector has a particular phase and amplitude that results from the addition of individual scattering factors from all the atoms in the unit cell, which each have their own phase and amplitude. Fhkl f3 Φhkl real axis f2 f1 Fs = V ∑ fj[cos2π(hx + ky + lz) + isin2π(hx + ky + lz)] N j=1

The Phase Problem The Phase Problem: Each reflection we measure during the diffraction experiment tells us the amplitude of a particular Fs, but not its phase.

Addition of Waves f1 + f2 + f3 Fs

Solving the Phase Problem How do we figure out the phase of Fs? Combine Fs (or FP here) with a wave of known phase (fH) to get a new resultant wave (FPH). FP (unknown phase) + fH (known phase) FPH (unknown phase) FPH = FP + fH

Solving the Phase Problem: Harker Constructions imaginary axis Hurray!! We’ve solved the phase problem!! fH real axis FPH FP Well, sort of… Now we actually have two phases to choose from. FPH = FP + fH

Solving the Phase Problem imaginary axis FPH = FP + fH How do we actually figure out the amplitude and phase of fH? fH real axis FPH FP Amplitude: determined during the diffraction experiment Phase: determine the real space (x,y,z) location of “H” through the use of the Patterson Function

The Patterson Method The Patterson function is similar to the electron density equation in that they both use amplitudes measured during the diffraction experiment. However, Patterson functions do not require phases to be computed, just the amplitudes of the (h,k,l) reflections P(u,v,w) = 1/V ∑ |Fhkl|2cos2π(hu + kv + lw) h,k,l

The Patterson Method The Patterson map provides a map of interatomic vectors within the unit cell and the peaks in a Patterson map are proportional to the electron density at a particular position. 1-2 1-3 2-1 2-3 3-1 3-2 PATTERSON SPACE 1 2 3 REAL SPACE

The Patterson Method The Patterson function is especially useful because of two important features: The magnitude of P(u,v,w) is proportional to the product of the atomic numbers of the atoms at the ends of the vector u = (u,v,w). The symmetry within a unit cell is imposed on the peaks in a Patterson map. This means that symmetry related atoms will also have a peak in the Patterson map.

Harker Sections (-x,y,-z) (x,y,z) 180° (-x,y,-z) (x,y,z) x z Should have a peak on the v=0 Harker section of the Patterson map. (x,y,z) - (-x,y,-z) = (2x,0,2z) = (u,v,w) x = u/2, y = 0, z = w/2

Harker Sections v = 0 Harker Section (2x,0,2z) = (u,v,w) x = u/2, y = 0, z = w/2

Solving the Phase Problem imaginary axis FPH = FP + fH How do we actually figure out the amplitude and phase of fH? fH real axis FPH FP Amplitude: determined during the diffraction experiment Phase: determine the real space (x,y,z) location of “H” through the use of the Patterson Function

Methods for Determining Phases Isomorphous Replacement Single Isomorphous Replacement (SIR) Multiple Isomorphous Replacement (MIR) Anomalous Dispersion Single Wave-Length Anomalous Dispersion (SAD) Multiple Wave-Length Anomalous Dispersion (MAD) Molecular Replacement Direct Methods

Isomorphous Replacement The Goal: Modify our crystal by having it bind a heavy atom. Why? A heavy-atom derivative of our crystal will create a change in the intensity of observed reflections relative to our native crystal.

Isomorphous Replacement Why do we use heavy atoms? Because they have lots of electrons and scatter x-rays more strongly. NH = number of heavy atoms fH = heavy atom scattering power NN = number of native atoms fN = avg. scattering power for native atoms

Isomorphous Replacement imaginary axis imaginary axis FP FP f3 Φhkl Φhkl real axis real axis f2 f1 FPH f1 f2 f3 fH NATIVE HEAVY-ATOM DERIVATIVE

Isomorphous Replacement imaginary axis FP Φhkl real axis FPH f1 f2 f3 fH HEAVY-ATOM DERIVATIVE

Isomorphous Replacement Why do we care about intensity differences in the observed diffraction pattern? The differences in intensity (ΔF = |FPH| – |FP|) can be used as coefficients for the Patterson Function. This is useful because our Patterson Maps and Harker Sections are now giving us information about the locations of our heavy-atom derivatives. THIS IS REALLY IMPORTANT!! P(u,v,w) = 1/V ∑ |ΔF|2cos2π(hu + kv + lw) h,k,l

Heavy Atoms in Isomorphous Replacement Common Heavy Metals Platinum Potassium Thiocyanate Gold Cyanide Potassium Tetrachloro Platinate Thimerasol Amino Acid Ligands His (pH>7), Lys (pH>9) Cys, His (pH>7), Lys (pH>9) Cystines, His (pH>7) Cys, His (pH>7) And many more… How do we make derivatives of our crystal? Trial and error: add a small amount of the metal reagent (0.1-10 mM) to the crystallization condition and soak the crystal (seconds, minutes, hours, days)

Detecting Derivatives Unfortunately, like most things in crystallography, just about everything is a variable when trying to get a derivative. So how do we know when we’ve gotten a derivative? Answer: Look for differences in spot intensity between native and potential derivative crystals, especially at low resolution where heavy atom differences will be strongest

Detecting Derivatives It must take a long time to find a derivative!!! Fortunately, we don’t have to collect a full dataset for each derivative Scaling a few images of a derivative dataset against a native dataset is enough to detect differences in intensity

Detecting Derivatives First things first: A derivative crystal should be different than the native crystal.

Detecting Derivatives Use Scalepack to scale a native dataset against a few images (~3°) of a potential derivative When derivative and native crystals are scaled together, the Χ2 value should be >1. Assuming the cell parameters look good, If Χ2 is… ~50, probably not a derivative (non-isomorphous, wrong indexing, or way too many substitutions) ~10, good chance that you have a derivative ~2-5, well…I would keep looking for derivatives, but keep this one in mind 1, scales well with native so you’ve probably got a native crystal (no heavy-atom substitution)

Isomorphous Replacement: The Good and the Bad Can be quick since data collection can be done at home Works well for proteins purified from sources other than E. coli (e.g. yeast) Don’t need high quality data (low resolution is okay) The Bad Getting a derivative in the first place is not trivial Getting a derivative crystal that’s isomorphous can be difficult Data quality can be poor

The Reality of Isomorphous Replacement imaginary axis FP Φhkl fH real axis FPH FPH = FP + fH

The Reality of Isomorphous Replacement imaginary axis In reality, there’s some error in our measurement of FPH and the location of the heavy atoms FP Φhkl fH real axis FPH,obs ε FPH,calc ε = FPH,obs – FPH,calc ε = lack of closure FPH ≈ FP + fH

The Reality of Isomorphous Replacment Harker Construction of a single reflection from an SIR experiment Phase Probability P(α) = exp{-ε2(α)/2E2} Best Phase αbest = ∫αP(α)dα Figure of Merit m = ∫P(α)exp(iα)dα/∫P(α)dα m = 1, no phase error m = 0.5, ~60° phase error m = 0, all phases equally probably The phases have a minimum error when the best phase (alpha_best), i.e. the centroid of the phase distribution, is used instead of the most probably phase. The quality of the phases is indicated by the figure of merit, m. A value of 1 for m indicates no phase error, while a value of 0 means that all phases are equally probable. Minimize the phase error by using the centroid of the phase distribution (Best Phase).

The Reality of Isomorphous Replacement By using multiple heavy atom derivatives, we can get a better estimate of the correct phase This is Multiple Isomorphous Replacement (MIR)

The Reality of Isomorphous Replacment SIR MIR EMPHASIZE THAT THESE ARE STILL VERY IDEALIZED AND THAT IN REALITY THE PHASES THAT WE WOULD NORMALLY GET ARE GOING TO BE PRETTY POOR BECAUSE THE CIRCLES WILL STILL NOT OVERLAP PERFECTLY. MODERN METHODS USE MAXIMUM LIKLIEHOOD METHODS OF GETTING THE BEST PHASES.

Anomalous Dispersion Techniques What is Anomalous Dispersion? A phenomenon which occurs when electrons absorb and reemit X-rays having an energy close to that of the electron’s nuclear binding energy. What’s the result of this absorption? The x-ray’s reemitted have the same wavelength as the incident radiation, but now are phase shifted by 90°.

Effects of Anomalous Dispersion Radiation scattered by an atom is actually composed of two components: “Normal” Thompson scattering (no phase change relative to incident radiation) A minor anomalous component phase shifted by π/2 f(λ) = f0 + Δf'(λ) + if''(λ) = f'(λ) + if''(λ) fH+ fH- FP+ FP- FPH+ FPH- f'' FPH+ FPH- fH+ fH- FP+ FP- Bijvoet Differences (That’s Bi-foot) |F+| = |F-| Friedel’s Law |F+| = |F-| Breakdown in Friedel’s Law

Wavelength Dependence of Anomalous Dispersion

Anomalous Signal Increases With Scattering Angle

Using Anomalous Dispersion to Solve the Phase Problem: MAD P(u,v,w) = 1/V ∑ (|F+| - |F-|)2cos2π(hu + kv + lw) h,k,l

Choosing Appropriate Wavelengths Peak (f'') Remote Inflection (f')

MAD Requirements Strong anomalous signal (at least 1 Se per 17 kDa of protein) Tunable x-ray source (synchrotron) Preferably the ability to measure the absorbance spectrum of your protein High solvent content Best data possible- high resolution and low Rmerge

Single Wavelength Anamalous Dispersion (SAD) Solve the phase ambiguity by evaluating the quality of the maps for both solutions using density modification programs. Basic Requirements: Strong anomalous signal (usually at least 1 Se per 17 kDa of protein) As usual, best data possible (resolution and error) High solvent content (>50%) Accurate measurement of phasing errors

Sulfur Anomalous Phasing

Sulfur Anomalous Phasing

Sulfur Anomalous Phasing Ramagopal et al. Acta Cryst. (2003) D59, 1020-1027

Detecting Anomalous Signals Using Scalepack, scale data as normal, except turn on the ANOMALOUS flag (writes out F+ and F- separately in a .sca file). Rescale this .sca file, but this time with the anomalous flag turned off. This compares F+ and F-. Examine the X2 values. Presence of an anomalous signal should give X2 > 1 (or could indicate absorption or detector problems). Useful for also examining the resolution cut-off of the anomalous signal.

The Good and the Bad of MAD/SAD Structure Determination Essentially eliminate the isomorphism problems of SIR/MIR. Generally better phases. Ability to use molecular biology to derivatize your protein (SeMet). May also be able to use naturally bound anomalous scatters (Zn, Fe, Ca, etc.) Usually need to go to synchrotron However, there’s often an anomalous signal with Cu-Kα; could be useful in SIRAS/MIRAS methods. The potential of sulfur anomalous phasing essentially eliminates the need to derivatize your protein Method can be limited to especially good data Due to availability of synchrotron resources, anomalous phasing is the primary method of choice for phasing

Calculation of Protein Phases After solving the real space location of the heavy atom through isomorphous or anomalous difference Patterson’s, determine the protein phases: Refine the xyz coordinates of the heavy atom through cycles of refining the occupancy and B-factor Determine the protein phases from the refined heavy atom coordinates and refine these phases Look at the initial map and see how you did Then go and talk to Devin to see where to go from here Programs that can do these steps include PHASES and MLPHARE (available in the CCP4 package)

Automated Methods: SOLVE Fortunately, there are automated processes available to do everything from scaling data, solving the location of heavy atoms, and determining protein phases. One such program is SOLVE. Works by converting each decision making step into an optimization problem through scoring and ranking of possible solutions Locate and refine heavy atom sites through difference Patterson’s or direct methods and generate phases. Converts MAD data to pseudo-SIRAS. Score potential heavy atom sites by four criteria: Agreement between calculated and observed Patterson maps Cross-validation of the heavy-atom sites through difference Fourier analysis- delete a site in a solution and recalculate phases Figure of Merit (m) Non-randomness of the electron density map- identify solvent and protein regions and score based on connectivity in solvent and protein region. SOLVE is pretty cool because it actually only finds 1 or 2 sites in the Patterson function, scores solutions, and then analyses each of those solutions based on the four criteria above. Additional sites are added based on difference fourier analysis and solutions are scored. The initial set of solutions are called seeds and sometimes a particular seed will emerge that is especially promising. SOLVE will focus on this seed and try to build it up as quickly as possible. Cycles are repeated until no new sites are found and no significant improvement of the phases are achieved. At the conclusion of the algorithm, electron density maps are generated

Tutorial Use Heidi’s Gun4 MAD data to calculate and analyze Patterson Maps and Harker Sections to determine the real space coordinates of the Selenium within the structure.