Re-stating the obvious: To solve structures from powder diffraction, first they must be indexed! Lachlan M. D. Cranswick CCP14 (Collaborative Computation Project No 14 for Single Crystal and Powder Diffraction) Department of Crystallography; Birkbeck College, University of London, Malet Street, Bloomsbury, London, WC1E 7HX, UK. WWW: Robin Shirley School of Human Sciences University of Surrey Guildford, Surrey, GU2 7XH, U.K. Tel: (+44) Fax: (+44)
Lachlan M. D. Cranswick Slide 2 Notes Free Zone - they are on the web
Lachlan M. D. Cranswick Slide 3 Talk Aims Some history of Indexing of Powder Diffraction Data Peak finding and peak profiling software Fundamental Parameters Peak Fitting Available powder indexing software and indexing suites Crysfire in action Chekcell in action Mmap with Crysfire 2002 (NEW!!!!) Possibilities for the future and conclusion
Lachlan M. D. Cranswick Slide 4 Some History of Powder Indexing First published manual powder indexing method: –Runge, C. (1917), "Die Bestimmung eines Kristallsystems durch Rontgenstrahlen", Physik. Z., 18, Candidates for first published computer based powder indexing –Haendler, H. M. & Cooney, W. A. (1963), "Computer Determination of Unit-Cell from Powder-Diffraction Data", Acta Cryst., 16, –Lefker, R. (1964), "Indexing of tetragonal and hexagonal X-ray powder photographs with the aid of a small computer", Anal. Chem., 36,
Lachlan M. D. Cranswick Slide 5 Some History of Powder Indexing - 2 Candidates for first published publicly available powder-indexing program –Haendler, H. M. & Cooney, W. A. (1963), "Computer Determination of Unit-Cell from Powder-Diffraction Data", Acta Cryst., 16, –Lefker, R. (1964), "Indexing of tetragonal and hexagonal X-ray powder photographs with the aid of a small computer", Anal. Chem., 36, –Werner, P.-E. (1964), "Trial and error computer methods for the indexing of unknown powder patterns", Z. Krist., 120, –Hoff, W. D. & Kitchingman, W. J. (1966), "Computer indexing of x-ray powder patterns from crystals of unknown structures", J. Sci. Instrum., 43,
Lachlan M. D. Cranswick Slide 6 Some History of Powder Indexing - 3 Candidates for first publically available general indexing program –Ishida, T & Watanabe, Y. (1967), "Probability Computer Method of Determining the Lattice Parameters from Powder Diffraction Data", J. Phys. Soc. Japan, 23, –Roof, R. B. (1968), "INDX: A Computer Program to Aid in the Indexing of X-Ray Powder Patterns of Crystal Structures of Unknown Symmetry", Los Alamos Laboratory, University of California, Report LA –Taupin, D. (1968), "Une Methode Generale pour l'Indexation des Diagrammes de Poudres", J. Appl. Cryst., 1, –Visser, J. W. (1969), "A Fully Automatic Program for Finding the Unit Cell from Powder Data", J. Appl. Cryst., 2,
Lachlan M. D. Cranswick Slide 7 Checking Sample Purity Powder indexing requires line-position data of the highest available quality An important aspect of this is purity, since indexing programs don’t like to struggle with impurity lines, and several will not accept them at all. Some suggestions from Robin for checking sample purity: –Optical (polarising) microscope –Density analysis in a gradient column –Electron microscopy and diffraction of selected crystallites –And, if time permits, see whether properties are changing with time
Lachlan M. D. Cranswick Slide 8 Peak Profiling Software For Overall Summary of available peak profiling software refer to: Some of the available freeware includes: –CMPR –DRXWin –EFLECH –GPLSFT –pearson.xls –Rawplot (with GSAS) –SHADOW –Powder v2.00 –PowderX –Winfit –Winplotr (with Fullprof) –XFIT
Lachlan M. D. Cranswick Slide 9 Empirical peak fitting : Some tricks may be required for stability and reasonable results Peaks over small ranges may have to be linked to have the same shape and width Usually fit small ranges of data - groups of peaks Might have to fit large and medium peaks first, followed by trace peaks
Lachlan M. D. Cranswick Slide 10 Peak Profiling: high accuracy peak positions using Fundamental Parameters peak profiling Example of Fundamental parameters (convoluting in the geometry elements of the diffractometer) that can provide accurate peak positions as though your sample was being run on an “ideal” diffractometer. Tutorial at: – Available Fundamental Parameters Peak Profiling and Rietveld software: XFIT (no longer maintained) – Topas (Commercial - sequel to XFIT) – BGMN (Commercial - academic demonstration version is freely downloadable) – EFLECH/Index freeware from BGMN website –
Lachlan M. D. Cranswick Slide 11 XFIT Fundamental Parameters Peak Profiling mode : fit to a low angle peak on a Bragg-Brentano diffractometer
Lachlan M. D. Cranswick Slide 12 Powder Indexing FOM Results XFIT Fundamental Parameters peak profiling compared to empirical Peak Fitting
Lachlan M. D. Cranswick Slide 13 Powder Indexing - indexing programs Overall Summary of available powder indexing software refer to: Powder Indexing: –Classic (widely used): Ito, Dicvol, Treor –High symmetry (exhaustive): Taup (=Powder), Dicvol –Other powerful (general symmetry): Lzon, Kohl (=TMO), Fjzn –Dominant-zone cases: Lzon, Losh, Mmap –Semi-automatic (via SIW basis set): Losh, Mmap –Fundamental parameters & covariance matrix: EFLECH/Index –Incommensurate, etc: Supercell –Others (better with some user guidance): Scanix, Autox
Lachlan M. D. Cranswick Slide 14 Powder Indexing - indexing suites Overall Summary of available powder indexing software refer to: Integrated Suites: Crysfire Powder v2.00 PowderX PROSZKI WinPlotr and after primary indexing: Chekcell
Lachlan M. D. Cranswick Slide 15 Powder Indexing - specialist programs Supercel is a specialised indexing program by Juan Rodriguez- Carvajal for tackling incommensurate cells and super/sub-cell relationships. (available within Winplotr/Fullprof) Web: FTP: ftp://bali.saclay.cea.fr/pub/divers/fullprof.2k/
Lachlan M. D. Cranswick Slide 16 Powder Indexing - the Crysfire suite (1) –by Robin Shirley (+ many contributing authors) – Intelligent defaults for control-parameters when launching each indexing program Data rescaling for tackling high-volume cells from macromolecular samples and zeolites Integrated under a common interface for use by non- specialists
Lachlan M. D. Cranswick Slide 17 Powder Indexing - the Crysfire suite (2) Crysfire 2000: 8 different indexing programs (Ito, Dicvol, Treor, Taup, Kohl, Lzon, Fjzn, Losh) It’s good to have a wide variety of methods available: to cater for different types of problem to give a feel for the range of solutions out there and see which ones turn up repeatedly using different methods, either identically or as derivative cells
Lachlan M. D. Cranswick Slide 18 New Release - Crysfire 2002 June 2002 Now 9 indexing programs: Ito, Dicvol, Treor, Taup, Kohl Lzon, Fjzn, Losh, Mmap New features: Mmap for exploring solution-space and evaluating trial cells Greater ease of use, especially for first-time users Better resilience, especially under the Windows/NT family (master program totally rewritten)
Lachlan M. D. Cranswick Slide 19 Why bother running more than one powder indexing program? Maximize the possibility of indexing unknowns by taking advantage of different algorithms and different program implementations.
Lachlan M. D. Cranswick Slide 20 Powder Indexing - Routine Crysfire Example Importing an XFIT peak file with xf2crys, to give a Crysfire CDT file
Lachlan M. D. Cranswick Slide 21 Powder Indexing - Routine Crysfire Example Run crysfire (giving the following screen)
Lachlan M. D. Cranswick Slide 22 Powder Indexing - Routine Crysfire Example “lo” to load a CDT file
Lachlan M. D. Cranswick Slide 23 Powder Indexing - Routine Crysfire Example “sa” to target an indexing program
Lachlan M. D. Cranswick Slide 24 Powder Indexing - Routine Crysfire Example target ITO with defaults
Lachlan M. D. Cranswick Slide 25 Powder Indexing - Routine Crysfire Example “in” to start the targetted program running ITO
Lachlan M. D. Cranswick Slide 26 Powder Indexing - Routine Crysfire Example ITO runs
Lachlan M. D. Cranswick Slide 27 Powder Indexing - Routine Crysfire Example After Ito completes, press enter to see output file
Lachlan M. D. Cranswick Slide 28 Powder Indexing - Routine Crysfire Example Then “enter” again to see “one solution per line” summary
Lachlan M. D. Cranswick Slide 29 Powder Indexing - Routine Crysfire Example “Enter” again to see Le Page summary of reduced cells
Lachlan M. D. Cranswick Slide 30 Powder Indexing - Routine Crysfire Example Repeat the sequence to run the remaining relevant indexing programs (dicvol, treor, taup, kolh, lzon) –lo to load –sa to save (to Q) –in to launch indexing –look at the output file –look at the summary file –look at the Le Page summary file of reduced cells –repeat
Lachlan M. D. Cranswick Slide 31 Powder Indexing - Routine Crysfire Example Finally: the complete summary file trial solutions
Lachlan M. D. Cranswick Slide 32 Powder Indexing - Routine Crysfire Example Complete Le Page summary file of reduced cells trial solutions
Lachlan M. D. Cranswick Slide 33 Crysfire Self Calibration Bragg-Bretano Data Sometimes it’s hard to be sure how far the beam is penetrating into the sample, giving an unknown sample- displacement error This can be addressed by trying self-calibration (SC) –If indexing the uncorrected data has failed, try SC –Select T (for specimen-displacement correction ratio) –A list of candidates for T will be displayed, based on 1st/2nd-order line pairs –If a particular ratio comes up several times and looks plausible, adopt it –Save it (SA) in Crysfire format under a new name, so that it can be reloaded –Do another cycle of indexing runs, this time with the “corrected” data
Lachlan M. D. Cranswick Slide 34 Crysfire Self Calibration Example Self calibration output of an inorganic
Lachlan M. D. Cranswick Slide 35 Crysfire Re-scaling Protein Data (from Bob von Dreele) Input for RAWPLOT: ( Å) Lysozyme – single peak fits for 24 reflections : Rescale factor = 0.1 Rescaled cell = a of ~7.9Å and c of ~3.8Å Unscaled cell = a of ~79Å and c of ~38Å
Lachlan M. D. Cranswick Slide 36 Chekcell: Interpreting Crysfire Summary Files: Powder Indexing and Spacegroup Assignment Crysfire interlinks with Chekcell for Windows (part of the LMGP suite for Windows by Jean Laugier and Bernard Bochu). Chekcell provides a graphical interface for manually and automatically suggesting a best cell/spacegroup combination using both FOM and algorithms relating to parsimony of superfluous HKLs.
Lachlan M. D. Cranswick Slide 37 Chekcell : easy to see non-matching or impurity peaks
Lachlan M. D. Cranswick Slide 38 Chekcell : automatic cell and spacegroup searching can trudge through a single selected unitcell; or over 1000s of trial cells looking for the best cell and spacegroup combination based on parsimony of extra reflections criteria.
Lachlan M. D. Cranswick Slide 39 Chekcell: “integration” of Ton Spek and A. Meetsma’s Le Page (Crysfire 2001 now also provides a reduced-cell summary) Obtaining the Reduced Cell –which in the past many powder indexing programs have not determined either reliably or at all –Refer: "'Reduced Cells', M.J. Buerger, (Zeitschift fur Kristallographie, BD 109, S (1957)” –Crysfire 2001 gives a LePage overview Chekcell displays a fuller analysis Efficient Sub-cell and super-cell searching, then easy reviewing of newly derived cells within the Chekcell interface
Lachlan M. D. Cranswick Slide 40 Chekcell: GUI Cell transformation Easily transform cells and test them withing Chekcell Knows about common transformations Can manually look at sub-cells and super-cells
Lachlan M. D. Cranswick Slide 41 Chekcell: Density / Z/ Mol. Vol explorer Easily explore values of Z, density and estimated molecular volume - comparing with your found trial cells.
Lachlan M. D. Cranswick Slide 42 Chekcell: example of it running Load the raw diffraction data
Lachlan M. D. Cranswick Slide 43 Chekcell: example of it running Load the peak position file
Lachlan M. D. Cranswick Slide 44 Chekcell: example of it running Load the Crysfire summary file
Lachlan M. D. Cranswick Slide 45 Chekcell: example of it running Use the tools to evaluate the trial cells
Lachlan M. D. Cranswick Slide 46 Crysfire / Chekcell: indexing powder Protein data Using the “Lepage” feature in Chekcell Can find the correct rhombohedral cell as published in: R. B. Von Dreele, P. W. Stephens, G. D. Smith and R. H. Blessing, "The first protein crystal structure determined from high-resolution X-ray powder diffraction data: a variant of T3R3 human insulin- zinc complex produced by grinding", Acta Cryst. (2000). D56,
Lachlan M. D. Cranswick Slide 47 Crysfire / Chekcell: indexing powder Protein data Even on rescaled data, if Crysfire and the various indexing programs do not find the “true cell”, Chekcell and possibly can find a derivative cell which LePage (combined with Chekcell’s “parsimony and superflous reflections” criteria), can find the true cell. In the case of reindexing of the protein data: – Crysfire and Dicvol finds Orthorhomic –P222 or PMM2 or PMMM : 157 hkls to match 41 reflections –LePage followed by Chekcell “parsimony check” then finds: Rhombohedral Rhombohedral –R-3 or R3 etc : 60 hkls to match 41 reflections
Lachlan M. D. Cranswick Slide 48 MMAP Mmap explores sections of solution-space (e.g. sections with varying alpha* and beta*, holding the other 4 parameters constant) The results are displayed as a map, with the high-ground colored It lists all the peaks in the map and refines them - each is a trial solution So it can act as an indexing program in its own right (taking the basis set containing the first 4 constants from Lzon’s output) This offers a particularly robust and flexible way to search for cells Use the new “LC” (Load a trial cell from the Crysfire summary file) to then run MMAP (using the MM command) on it.
Lachlan M. D. Cranswick Slide 49 MMAP - what does the output look like? Example MMap output from a high FOM trial cell
Lachlan M. D. Cranswick Slide 50 MMAP But another use is for checking out trial solutions taken from the summary lists generated by other indexing programs The usual guides to which solutions are the most promising are ones like figures of merit, numbers of lines indexed, etc., but only at the position of the refined cell in the map That’s like having a list of high points in a landscape, and trying to tell tall buildings from natural features with only their heights to go on Knowing the surroundings of the high-point tells us which are man-made, and similarly which Mmap peak corresponds to the physical solution
Lachlan M. D. Cranswick Slide 51 MMAP - which of these objects is man made? Using peak height as the selection criteria. Height of 80 Height of 78 Height of 65
Lachlan M. D. Cranswick Slide 52 MMAP - which of these peaks is man made? 2 It’s obvious once you can see the landscape
Lachlan M. D. Cranswick Slide 53 Analogous problem with powder indexing Traditional indexing only gives you the numbers (and possibly a Chekcell style fit) where a large number of trial cells can give reasonable matches Mmap shows you the actual “landscape”
Lachlan M. D. Cranswick Slide 54 An Mmap landscape Quick scan of an alpha*/beta* section for some real data for a triclinic cell (a finely-divided triclinic biological sample showing some line broadening) There are a number of high points scattered across the map Each of these peaks is a potential solution (+others others out of section)
Lachlan M. D. Cranswick Slide 55 Mmap Let’s take some real data and see how figure- of-merit lists can be illuminated by looking at the landscapes that they come from.
Lachlan M. D. Cranswick Slide 56 Mmap After refinement, though one peak has reached a figure of merit of 21, it still has a long list of competitors - no one solution really stands out There are 48 peaks in the whole list, all non-equivalent 17 of these have M>10 and index all of the first 20 lines
Lachlan M. D. Cranswick Slide 57 Mmap : Is the following a good solution? Here is the traditional set of information that we have for solution 1 from that list: a = Aalpha = deg b = Abeta = deg c = Agamma = deg V = A 3 M20 = 21.02
Lachlan M. D. Cranswick Slide 58 Mmap : Yes looks a good solution Here is a magnification of the region around map-peak 1 (M=21.0), reloaded into Crysfire from the Mmap summary file using LC and then selecting a cell. It’s relatively compact (though sitting on a lowish diagonal ridge)
Lachlan M. D. Cranswick Slide 59 Mmap : Is the following a good solution? Similarly for the solution that corresponds to map-peak 2: a = Aalpha = deg b = Abeta = deg c = Agamma = deg V = A 3 M20 = 16.05
Lachlan M. D. Cranswick Slide 60 Mmap A similar magnified map around map-peak 2 (M=16.0) This is revealed to be less well defined as well as lower, with its central region extending as a broad ridge right off the top of the map
Lachlan M. D. Cranswick Slide 61 Mmap: Is the following a good solution? And for map-peak 3: a = Aalpha = deg b = Abeta = deg c = Agamma = deg V = A 3 M20 = 14.57
Lachlan M. D. Cranswick Slide 62 Mmap Map-peak 3 (M=14.6) is broader still, and also drifts off the top of the map
Lachlan M. D. Cranswick Slide 63 Mmap We can also avoid wasting time on landscapes that are broad and featureless, indicating data problems or wrong initial assumptions, and so not worth pursuing
Lachlan M. D. Cranswick Slide 64 Conclusion Don’t give up if your lab’s favorite indexing program doesn’t work on a particular dataset Suites like Crysfire offer a wide range of indexing tools Follow on with Chekcell to help pin down the correct physical cell They’re both under continuing development (Crysfire 2002 to be released this June) Relevant web links: –Crysfire: –Chekcell: