Overview of the Phase Problem

Slides:



Advertisements
Similar presentations
Reciprocal Space Learning outcomes
Advertisements

Intensities Learning Outcomes By the end of this section you should: understand the factors that contribute to diffraction know and be able to use the.
Phasing Goal is to calculate phases using isomorphous and anomalous differences from PCMBS and GdCl3 derivatives --MIRAS. How many phasing triangles will.
Diffraction Basics Cora Lind-Kovacs Department of Chemistry & Biochemistry The University of Toledo Toledo, OH 43606
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Methods: X-ray Crystallography
Overview of the Phase Problem
Planes in Lattices and Miller Indices
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
X-Ray Crystallography
X-ray Crystallography-2 (plus extra slides)
Bob Sweet Bill Furey Considerations in Collection of Anomalous Data.
Reciprocal lattice How to construct reciprocal lattice
Solid State Physics 2. X-ray Diffraction 4/15/2017.
Chem Single Crystals For single crystals, we see the individual reciprocal lattice points projected onto the detector and we can determine the values.
Expression of d-dpacing in lattice parameters
HCI 530 : Seminar (HCI) Damian Schofield. HCI 530: Seminar (HCI) Transforms –Two Dimensional –Three Dimensional The Graphics Pipeline.
A Brief Description of the Crystallographic Experiment
Hanging Drop Sitting Drop Microdialysis Crystallization Screening.
Experimental Phasing Andrew Howard ACA Summer School 22 July 2005.
Anomalous Scattering: Theory and Practice Andrew Howard ACA Summer School 29 July 2005 Andrew Howard ACA Summer School 29 July 2005.
Fourier transform. Fourier transform Fourier transform.
19 Feb 2008 Biology 555: Crystallographic Phasing II p. 1 of 38 ProteinDataCrystalStructurePhases Overview of the Phase Problem John Rose ACA Summer School.
PHY 102: Waves & Quanta Topic 8 Diffraction II John Cockburn Room E15)
The Effects of Symmetry in Real and Reciprocal Space Sven Hovmöller, Stockholm Univertsity Mirror symmetry 4-fold symmetry.
1. Crystals Principles of crystal growth 2. Symmetry Unit cells, Symmetry elements, point groups and space groups 3. Diffraction Introduction to diffraction.
CHE (Structural Inorganic Chemistry) X-ray Diffraction & Crystallography lecture 2 Dr Rob Jackson LJ1.16,
Miller Indices And X-ray diffraction
Analysis of crystal structure x-rays, neutrons and electrons
UNIVERSITI MALAYSIA PERLIS
Patterson Space and Heavy Atom Isomorphous Replacement
The ‘phase problem’ in X-ray crystallography What is ‘the problem’? How can we overcome ‘the problem’?
Diffraction Basics Coherent scattering around atomic scattering centers occurs when x-rays interact with material In materials with a crystalline structure,
Chem Patterson Methods In 1935, Patterson showed that the unknown phase information in the equation for electron density:  (xyz) = 1/V ∑ h ∑ k.
Chem Structure Factors Until now, we have only typically considered reflections arising from planes in a hypothetical lattice containing one atom.
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
1. Diffraction intensity 2. Patterson map Lecture
THE PHASE PROBLEM Electron Density
Molecular Crystals. Molecular Crystals: Consist of repeating arrays of molecules and/or ions.
Page 1 X-ray crystallography: "molecular photography" Object Irradiate Scattering lens Combination Image Need wavelengths smaller than or on the order.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 8 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
Lesson 13 How the reciprocal cell appears in reciprocal space. How the non-translational symmetry elements appear in real space How translational symmetry.
Lesson 13 How the reciprocal cell appears in reciprocal space. How the non-translational symmetry elements appear in real space How translational symmetry.
X-ray diffraction X-rays discovered in 1895 – 1 week later first image of hand. X-rays have ~ 0.1 – few A No lenses yet developed for x-rays – so no possibility.
What is the problem? How was the problem solved?
Protein Structure Determination Lecture 4 -- Bragg’s Law and the Fourier Transform.
Pattersons The “third space” of crystallography. The “phase problem”
Atomic structure model
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Electron Density Structure factor amplitude defined as: F unit cell (S) = ∫ r  (r) · exp (2  i r · S) dr Using the inverse Fourier Transform  (r) =
Calculation of Structure Factors
Electromagnetism Around 1800 classical physics knew: - 1/r 2 Force law of attraction between positive & negative charges. - v ×B Force law for a moving.
Absolute Configuration Types of space groups Non-centrosymmetric Determining Absolute Configuration.
Before Beginning – Must copy over the p4p file – Enter../xl.p4p. – Enter../xl.hkl. – Do ls to see the files are there – Since the.p4p file has been created.
Interpreting difference Patterson Maps in Lab this week! Calculate an isomorphous difference Patterson Map (native-heavy atom) for each derivative data.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 9 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
X-ray Crystallography-2 (plus extra slides)
Phasing in Macromolecular Crystallography
Fourier transform from r to k: Ã(k) =  A(r) e  i k r d 3 r Inverse FT from k to r: A(k) = (2  )  3  Ã(k) e +i k r d 3 k X-rays scatter off the charge.
Today: compute the experimental electron density map of proteinase K Fourier synthesis  (xyz)=  |F hkl | cos2  (hx+ky+lz -  hkl ) hkl.
Lecture 3 Patterson functions. Patterson functions The Patterson function is the auto-correlation function of the electron density ρ(x) of the structure.
Crystallography : How do you do? From Diffraction to structure…. Normally one would use a microscope to view very small objects. If we use a light microscope.
CHARACTERIZATION OF THE STRUCTURE OF SOLIDS
Phasing Today’s goal is to calculate phases (ap) for proteinase K using MIRAS method (PCMBS and GdCl3). What experimental data do we need? 1) from native.
Introduction to Isomorphous Replacement and Anomalous Scattering Methods Measure native intensities Prepare isomorphous heavy atom derivatives Measure.
X-ray Neutron Electron
Nobel Laureates of X Ray Crystallography
r(xyz)=S |Fhkl| cos2p(hx+ky+lz -ahkl)
A. The Solid State Classification of Solid Structures
Presentation transcript:

Overview of the Phase Problem Protein Crystal Data Phases Structure John Rose ACA Summer School 2006 Reorganized by Andy Howard, Biology 555, Spring 2008 Remember We can measure reflection intensities We can calculate structure factors from the intensities We can calculate the structure factors from atomic positions We need phase information to generate the image Biology 555 Crystallographic Phasing I 14 Feb 2008

What is the Phase Problem? X-ray Diffraction Experiment All phase information is lost x,y.z Fhkl [Real Space] [Reciprocal Space] In the X-ray diffraction experiment photons are reflected from the crystal lattice (planes) in different directions giving rise to the diffraction pattern. Using a variety of detectors (film, image plates, CCD area detectors) we can estimate intensities but we lose any information about the relative phase for different reflections.

Biology 555 Crystallographic Phasing I Phases Let’s define a phase fj associated with a specific plane [hkl] for an individual atom: fj = 2p(hxj + kyj + lzj) Atom at xj=0.40, yj=0.05, zj=0.10 for plane [213]: fj = 2p(2*0.40 + 1*0.05 + 3*0.10) = 2p(1.35) If we examine a 2-dimensional case like k=0, then fj = 2p(hxj + lzj) Thus for [201] (a two-dimensional case): fj = 2p(2*0.40 + 0*0.05 + 1*0.10) = 2p(0.90) Now, to understand what this means: Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I 201 Phases A B G C H D F I E 0° 720° c a 201 planes 4p 360° 2p 1080° 6p 0.4, y, 0.1 fD = 2p[ 2•(0.40) + 1•(0.10)] = 2p(0.) Biology 555 Crystallographic Phasing I 14 Feb 2008

In General for Any Atom (x, y, z) dhkl 6π dhkl 4π Atom (j) at x,y,z dhkl 2π φ c Remember: We express any position in the cell as (1) fractional coordinates: pxyz = xja+yjb+zjc (2) the sum of integral multiples of the reciprocal axes hkl = ha* + kb* + lc* Plane hkl Biology 555 Crystallographic Phasing I 14 Feb 2008

Diffraction vector for a Bragg spot We set up the diffraction vector shkl associated with a specific diffraction direction hkl: shkl = ha* + kb* + lc* The magnitude of this diffraction vector is the reciprocal of our Bragg-law plane spacing dhkl: |shkl| = 1/ dhkl Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Phase angle for a spot The phase angle fj associated with our atom is 2p times the projection of the displacement vector pj onto shkl: fj = 2p shkl• pj But that displacement vector pj is related to the real-space coordinates of the atom at position j: pj = xja + yjb + zjc where the fractional coordinates of our atom within the unit cell are (xj, yj, zj) Thus fj = 2p (ha* + kb* + lc*) • (xja + yjb + zjc) Biology 555 Crystallographic Phasing I 14 Feb 2008

Real-space and reciprocal space But these real-space and reciprocal-space unit cell vectors (a,b,c) and (a*,b*,c*) are duals of one another; that is, they obey: a•a* = 1, a•b* = 0, a•c* =0 b•a* = 0, b•b* = 1, b•c* =0 c•a* = 0, c•b* = 0, c•c* = 1 … even when the unit cell isn’t all full of 90-degree angles! Biology 555 Crystallographic Phasing I 14 Feb 2008

Matrix formulation of this duality If we construct the 3x3 reciprocal-space unit cell matrix A = (a* b* c*) And the 3x3 real-space unit cell matrix R = (a b c) for a specific position of the sample, then A and R obey the simple relationship A = R-1, i.e. AR = I Where I is a 3x3 identity matrix Biology 555 Crystallographic Phasing I 14 Feb 2008

How to use this in getting phases fj = 2p (ha* + kb* + lc*) • (xja + yjb + zjc) But using those dual relationships, e.g. a*•a = 1, b*•c = 0, we get fj = 2p (hxj + kyj + lzj) Note that this is true even if our unit cell angles aren’t 90º! Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Why Do We Need the Phase? Fourier transform Inverse Fourier transform Structure Factor Electron Density In order to reconstruct the molecular image (electron density) from its diffraction pattern both the intensity and phase, which can assume any value from 0 to 2, of each of the thousands of measured reflections must be known. Biology 555 Crystallographic Phasing I 14 Feb 2008

Importance of Phases Phases dominate the image! Hauptman amplitudes with Hauptman phases Karle amplitudes with Karle phases Karle amplitudes with Hauptman phases Hauptman amplitudes with Karle phases Phases dominate the image! Phase estimates need to be accurate Biology 555 Crystallographic Phasing I 14 Feb 2008

Understanding the Phase Problem The phase problem can be best understood from a simple mathematical construct. The structure factors (Fhkl) are treated in diffraction theory as complex quantities, i.e., they consist of a real part (Ahkl) and an imaginary part (Bhkl). If the phases, hkl, were available, the values of Ahkl and Bhkl could be calculated from very simple trigonometry: Ahkl = |Fhkl| cos (hkl) Bhkl = |Fhkl| sin (hkl) This leads to the relationship: (Ahkl)2 + (Bhkl)2 = |Fhkl|2 = Ihkl Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Argand Diagram (Ahkl)2 + (Bhkl)2 = |Fhkl|2 = Ihkl The above relationships are often illustrated using an Argand diagram (right). From the Argand diagram, it is obvious that Ahkl and Bhkl may be either positive or negative, depending on the value of the phase angle, hkl. Note: the units of Ahkl, Bhkl and Fhkl are in electrons. Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I The Structure Factor sinq/l f0 Atomic scattering factors Here fj is the atomic scattering factor The scattering factor for each atom type in the structure is evaluated at the correct sinq/l. That value is the scattering ability for that atom. Remember sinq/l = 1/(2dhkl) We now have an atomic scattering factor with magnitude f0 and direction fj Biology 555 Crystallographic Phasing I 14 Feb 2008

The Structure Factor Sum of all individual atom contributions real imaginary Individual atom fjs Resultant Fhkl Ahkl Bhkl Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Electron Density Remember the electron density (image of the molecule) is the Fourier transform of the structure factor Fhkl. Thus Here V is the volume of the unit cell Biology 555 Crystallographic Phasing I 14 Feb 2008

How to calculate r(x,y,z) In practice, the electron density for one three-dimensional unit cell is calculated by starting at x, y, z = (0, 0, 0) and stepping incrementally along each axis, summing the terms as shown in the equation above for all hkl (as limited by the resolution of the data) at each point in space. Biology 555 Crystallographic Phasing I 14 Feb 2008

Solving the Phase Problem Small molecules Direct Methods Patterson Methods Molecular Replacement Macromolecules Multiple Isomorphous Replacement (MIR) Multi Wavelength Anomalous Dispersion (MAD) Single Isomorphous Replacement (SIR) Single Wavelength Anomalous Scattering (SAS) Direct Methods (special cases) Biology 555 Crystallographic Phasing I 14 Feb 2008

Solving the Phase Problem SMALL MOLECULES: The use of Direct Methods has essentially solved the phase problem for well diffracting small molecule crystals. MACROMOLECULES: Today, anomalous scattering techniques such as MAD or SAS are the most common techniques used for de novo structure determination of macromolecules. Both techniques require the presence of one or more anomalous scatterers in the crystal. Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Direct methods Karle, Hauptman, David Sayre, and others determined algebraic relationships among phase angles of groups of reflections. The simplest are triplet relationships: For three reflections h1=(h1,k1,l1), h2=(h2,k2,l2), h3=(h3,k3,l3), they showed that if h3= -h1- h2, then F1 + F2 + F3 ≈ 0 Thus if F1 and F2 are known then we can estimate that F3 ≈ -F1 - F2 David Sayre Biology 555 Crystallographic Phasing I 14 Feb 2008

When do triplet relations hold? Note the approximately zero value in that relationship F1 + F2 + F3 ≈ 0. The stronger the Bragg reflections are, the closer this condition is to being exact. For very strong Bragg reflections that sum will be very close to zero For weaker ones it may differ significantly from zero Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Phase probabilities This notion of relationships among phases obliges us to think of phases probabilistically rather than deterministically. This is a key to the direct-methods approach and has a huge influence on how we think about phase determination. I’m introducing all of this mostly to get you accustomed to the notion of phase probability distributions! Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Phase probabilities Any phase has a value between 0 and 2p (or 0 and 360, if we’re using degrees) If we know it’s close to 2p*0.42, then: If it’s 2p*(0.42 0.01), it’s a sharp phase probability distribution If it’s 2p*(0.42 0.32), it’s a much broader phase probability distribution Biology 555 Crystallographic Phasing I 14 Feb 2008

Plots of phase probability Integral of probability must be 1, since every phase has to have some value. Sharp distribution Broad distribution  2π Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I How can we use this? Obviously if we don’t know f1+f2, we can’t use this to calculate f3, even if the intensities of all three are large. But we could guess what f1 and f2 are and use this to compute f3. Then we guess f4 and use the triplet relationship to compute f5 and f6, where h5 = -h1 - h4 and h6 = -h1 - h4 … assuming that reflections 5 and 6 are strong, too! Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Can we make this work? We start with guessed phases for a 10-100 strong reflections and use the triplet relationships to determine the phases for another 1000 reflections Any particular calculated phase can be determined by several different triplet relationships, so if they’re self-consistent, the initial guessed 10-100 are correct; if they aren’t self-consistent, the guess was wrong! In the latter case, we try a different set of guesses for our 10-100 starting phases and keep going Biology 555 Crystallographic Phasing I 14 Feb 2008

This actually works, provided: The data are correctly measured The data are strong enough that we can pick 1000 strong reflections to use in this process The data extend to high enough resolution that atomicity (separable atoms) is really found There are ways to do direct methods without assuming atomicity, but they’re more complicated Biology 555 Crystallographic Phasing I 14 Feb 2008

Is this relevant to macromolecules? Not directly: Atomicity rarely present Systematic errors in data Indirectly yes, because it can be used in conjunction with other methods for locating heavy atoms in the SIR, MIR, and SAS methods It also helps introduce the notion of phase probability distributions (sneaky!) Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I SIR and SAS Methods Need a heavy atom (lots of electrons) or a anomalous scatterer (large anomalous scattering signal) in the crystal. SIR - heavy atoms usually soaked in. SAS - anomalous scatterers usually engineered in as selenomethional labels. Can also be soaked. SIR collect a native and a derivative data set (2 sets total). SAS collect one highly redundant data set and keep anomalous pairs separate during processing. SAS - may want to choose a scatterer or wavelength that enhances the anomalous signal. Must find the heavy atoms or anomalous scatterers can use Patterson analysis or direct methods. Must resolve the bimodal ambiguity. use solvent flattening or similar technique Biology 555 Crystallographic Phasing I 14 Feb 2008

What’s the bimodal ambiguity? As we’ll show next time, a single isomorphous derivative or anomalous scatterer enables us to measure each phase apart from an ambiguity That is, for each phase we get two answers (e.g. 2π*0.12 and 2π*0.55), and we can’t pick one out A second scatterer will resolve that Biology 555 Crystallographic Phasing I 14 Feb 2008

Phase probabilities with no error A single derivative with no error gives a phase probability like this:  2π Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I 2 derivatives, no error P() Wrong estimate derived from derivative 2 The two distributions overlap at the correct answer, not at the wrong answer Wrong estimate derived from derivative 1 Correct phase  2π Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Errors spread this out Each phase estimate is not really that sharp Lack of isomorphism (see below) makes each distribution spread out Joint probability distribution from 2 or more experiments is the product of the probability distributions of the individual experiments Biology 555 Crystallographic Phasing I 14 Feb 2008

Realistic probability distributions Joint probability distribution = product of individual ones  2π Biology 555 Crystallographic Phasing I 14 Feb 2008

Joint probability distribution Biology 555 Crystallographic Phasing I 14 Feb 2008

Heavy Atom Derivatives Heavy atom derivatives MUST be isomorphous Heavy atom derivatives are generally prepared by soaking crystals in dilute (2 - 20 mM) solutions of heavy atom salts (see Table II below for some examples). Crystal cracking is generally a good indication that that heavy atom is interacting with the crystal lattice, and suggests that a good derivative can be obtained by soaking the crystal in a more dilute solution. Biology 555 Crystallographic Phasing I 14 Feb 2008

Is the derivative worth using? Once derivative data has been collected, the merging R factor (Rmerge) between the native and derivative data sets can be used to check for heavy atom incorporation and isomorphism. Rmerge values for isomorphous derivatives range from 0.05 to 0.15. Values below 0.05 indicate that there is little heavy atom incorporation. Values above 0.15 indicate a lack of isomorphism between the two crystals. Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I What is isomorphism? Isomorphism for derivatives means that the structure of the derivatized macromolecule is identical to the structure of the underivatized molecule except at the site where the derivative compound has been introduced. Biology 555 Crystallographic Phasing I 14 Feb 2008

What is lack of isomorphism? A derivative may be nonisomorphous if: It alters the unit cell lengths or angles significantly (>0.2%?) It rotates or translates the entire macromolecule within the unit cell It alters significantly the conformation of a large segment (> 8 amino acids or 4 nucleotides?) of the mcromolecule Biology 555 Crystallographic Phasing I 14 Feb 2008

Biology 555 Crystallographic Phasing I Derivative compounds Biology 555 Crystallographic Phasing I 14 Feb 2008

Finding the Heavy Atoms or Anomalous Scatterers The Patterson function - a F2 Fourier transform with f = 0 - vector map (u,v,w instead of x,y,z) - maps all inter-atomic vectors - get N2 vectors!! (where N= number of atoms) From Glusker, Lewis and Rossi Biology 555 Crystallographic Phasing I 14 Feb 2008