Structure Validation Challenges in Chemical Crystallography Ton Spek Utrecht University, The Netherlands. Madrid, Aug. 26, 2011.

Slides:



Advertisements
Similar presentations
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Advertisements

Data Curation in Crystallography: Publisher Perspectives JISC Data Cluster Consultation Workshop CCLRC, Didcot, Oxon 10 October 2006.
Marshing: Past, Present and Future Ton Spek National Single Crystal Service Facility Utrecht University The Netherlands.
Structure Validation : How to distinguish GOOD and reliable single crystal structures from BAD and UGLY reports A.L.Spek Utrecht University The Netherlands.
Crystal Structure Validation : The IUCr tool to distinguish GOOD and trustable single crystal structures from BAD and UGLY reports Ton Spek Bijvoet Center.
The MISSYM Family: Software for the detection of Missed or Pseudo Symmetry A.L.Spek Utrecht University The Netherlands.
Crystal Structure Validation A Tool to distinguish GOOD and reliable single crystal studies from BAD and UGLY reports. Ton Spek National Single Crystal.
PLATON, New Options Ton Spek, National Single Crystal Structure Facility, Utrecht, The Netherlands. Delft, Sept. 18, 2006.
PLATON TUTORIAL A.L.Spek, National Single Crystal Service Facility,
Structure Comparison, Analysis and Validation Ton Spek National Single Crystal Facility Utrecht University.
CIF, PLATON-2014, SHELXL-2014, VALIDATION & SQUEEZE
SOFTWARE TESTING. INTRODUCTION  Software Testing is the process of executing a program or system with the intent of finding errors.  It involves any.
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
An Update on Current and New Structure Analysis Tools in PLATON Ton Spek, Bijvoet Center for Biomolecular Research, Utrecht University, The Netherlands.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
PLATON Validation and Analysis Tools Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Sevilla, 14-Dec-2010.
Data activities of the International Union of Crystallography Brian McMahon IUCr 5 Abbey Square Chester CH1 2HU
The Crystallographic Information File (CIF) Description and Usage Ton Spek, Bijvoet Center for Biomolecular Research Utrecht University Leiden, 27-Jan
PLATON/CheckCIF Issues Ton Spek Utrecht University The Netherlands Bruker User Meeting UCSD, La Jolla, March 22-24, 2012.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
PLATON/SQUEEZE Ton Spek Bijvoet Center Utrecht University, The Netherlands. PLATON Workshop Chicago, 24-July-2010.
T.G. Fawcett, S. N. Kabbekodu, F. Needham, J. R. Blanton, D. M. Crane, J. Faber International Centre for Diffraction Data Using PDF-4+/Organics to discover.
The PLATON Toolbox Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Kyoto, 20-Aug-2008.
Automatic Detection of Poor or Incorrect Single Crystal Structures A.L.Spek Utrecht University The Netherlands.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
Chem Thermal Ellipsoids Remember that thermal ellipsoids can indicate problems with a refinement even when the R factors seem to indicate a reasonable.
Software Tools for the Analysis of Z’ > 1 Structures A.L.Spek, Utrecht University, National Single Crystal Service Facility The Netherlands. BCA-Meeting,
CheckCIF/PLATON Crystal Structure Validation
The System-S Approach to Automated Structure Determination: Problems and Solutions Ton Spek National Single Crystal Service Utrecht University, The Netherlands.
Automated Crystal Structure Validation Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Platon Workshop Chicago,
Why Crystal Structure Validation ? Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Slovenia, 17-june-2010.
Why Small Molecule Crystal Structure Validation ? Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Sevilla, 14-Dec-2010.
Journals.iucr.org/f/ Acta Crystallographica Section F Structural Biology and Crystallization Communications An electronic journal for macromolecular structure.
The Crystallographic Information File (CIF) Description and Usage Ton Spek, Bijvoet Center for Biomolecular Research Utrecht University Sevilla, 14-Dec
SYSTEM-S The Challenge of Automated Structure Determination Ton Spek National Single Crystal Service Utrecht University, The Netherlands.
Structure Validation in Chemical Crystallography with CheckCIF/PLATON Ton Spek, National Single Crystal Service Facility, Utrecht University The Netherlands.
Structure Validation in Chemical Crystallography Ton Spek, Bijvoet Centre for Biomolecular Research, Utrecht University, The Netherlands. CCP4-Leeds, 5-Jan
Structure Validation in Chemical Crystallography Principles and Application Ton Spek, National Single Crystal Service Facility, Utrecht University SAB-Delft,
From Papertape Input to ‘Forensic Crystallography’ A History of the Program PLATON Ton Spek, Bijvoet Center Utrecht University The Netherlands K.N.Trueblood.
PLATON and STRUCTURE VALIDATION Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Goettingen, 13-Oct-2007.
New Structures for Old: A Cautionary Tale of Fraud in Small Molecule Crystallography Jim Simpson Department of Chemistry University of Otago.
On the Proper Reporting and Archival of Crystal Structure Data Ton Spek Utrecht University, NL (ACA2015-Philadelphia)
PLATON, A set of Tools for the Interpretation of Structural Results Ton Spek National Single Crystal Service Facility, Utrecht University,The Netherlands.
Structure Validation Ton Spek, Bijvoet Centre Utrecht University The Netherlands PLATON Course, Utrecht, April 18, 2012.
PLATON TUTORIAL A.L.Spek, National Single Crystal Service Facility, Utrecht, The Netherlands.
Ton Spek Utrecht University The Netherlands IUCr-Montreal Aug 11, 2014
Information for (New) Co-editors This presentation was used at the new co-editors induction meeting held during the IUCR Osaka Congress, August It.
The PLATON/TwinRotMat Tool for Twinning Detection Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Delft, 29-Sept-2008.
PLATON, A Multipurpose Crystallographic Tool Ton Spek, National Single Crystal Service Facility, Utrecht, The Netherlands.
The PLATON Toolbox History and Applications Ton Spek Utrecht University, The Netherlands. Bruker User Meeting, UCSD La Jolla, March 22-24, 2012.
MANUAL TESTING KS SESSION PRESENTED BY 26/11/015 VISHAL KUMAR.
PLATON/SQUEEZE Ton Spek Bijvoet Center Utrecht University, The Netherlands. PLATON Course Utrecht, April 18, 2012.
SOFTWARE TESTING. Introduction Software Testing is the process of executing a program or system with the intent of finding errors. It involves any activity.
Updates on Validation and SQUEEZE Ton Spek Utrecht University Bruker User Meeting Jacksonville (FL), Jan 19, 2016.
CIF's CIFs from SHELX CIFs from MOLEN Working with CIFS ENCIFER PUBLCIF PLATON Validation.
What is Needed for Proper Structure Validation and How to Act upon Validation ALERTS Ton Spek Utrecht University The Netherlands ACA-Denver, july 26, 2016.
Dr.V.Jaiganesh Professor
The PLATON checkCIF and SQUEEZE Tools
(check)CIF, SHELXL-2014, SQUEEZE
What Makes a Crystal Structure Report Valid?
Software for Crystallographic and Rietveld Analysis
Ton Spek Utrecht University (NL) ECS4-School, Warsaw, July 2-7, 2017
Software Testing.
Crystal Structure Validation with PLATON
Crystal structure determination
Why Crystal Structure Validation ?
Topic Title Refinement
The SQUEEZE Tool in PLATON and its use with SHELXL2013
Ton Spek Utrecht University The Netherlands Vienna –ECM
The PLATON/TwinRotMat Tool for Twinning Detection
Presentation transcript:

Structure Validation Challenges in Chemical Crystallography Ton Spek Utrecht University, The Netherlands. Madrid, Aug. 26, 2011

Validation History Structure Validation of data supplied in computer readable CIF format was pioneered by Acta Cryst. C (Syd Hall et al., 1990ies). Initially the numerical checking of papers submitted to Acta C in CIF format was done by the Chester staff. Subsequently automated checking of the CIF for data consistency, data completeness and validity was introduced (checkCIF) PLATON facilities to check for Missed Symmetry and VOIDS were added later on. Soon followed by the inclusion of numerous other PLATON based tests (PLATxxx) of the reported structure (currently more than 400). checkcif/PLATON

FCF Validation Fo/Fc reflection file deposition and archival in CIF format (FCF) was made mandatory early on for Acta Cryst. papers. Useful for subsequent analysis of possibly unique data. CIF + FCF checking was added in 2010 into the IUCr CheckCIF/PLATON suite. Major chemical journals now require CIF deposition and validation reports but (not yet) the deposition of reflection data. The CCDC now accepts FCF's for deposition.

Why Automated Structure Validation The large volume of new and routine structure reports submitted for publication. The limited number experienced and available crystallographic referees for validation. Detection of errors due to the black box use of crystallography by non-crystallographers. Setting standards of quality and reliability. Automated detection of unusual though not necessarily erroneous issues that need special attention (ALERTS A,B,C,G). Sadly: The need to Detect Frauded structure reports.

Systematic Fraud A massive fraud was detected in late 2009 of structures mainly published around 2007 in Acta Cryst. E. (Soon 200 retractions !) Nobody was prepared for serious and systematic fraud in this not competitive field of routine structures before Many deviations from the expected results can often be explained as errors, inexperience or due to poor data. Several retractions before 2010 might in hindsight concern frauded structures and not errors. Ongoing testing of our validation software on the archived data for structures published in Acta E often indicated suspect structures needing a more detailed investigation. It was only by following up on one of such a strange structure report with an analysis of all structures published by the authors of that paper that a fraud pattern emerged. It was discovered that the same data set was used to publish a series if invented isomorphous structures. Full story: Acta Cryst. E (2010) editorial and a Powerpoint Presentation of the E-section editor Jim Simpson (IUCr Website).

BogusVariations (with Hirshfeld ALERTS) on the Published Structure 2-hydroxy-3,5-nitrobenzoic acid (ZAJGUM) OH => F H2O => NH3 OH=>NH2 NO2=>COOH

Fraud Detection Tools Generalized Hirshfeld Rigid Bond Test. CIF versus FCF data checking. Scatter Plots of the reflection data of the same or related structure(s). Look in Difference Maps for unusual features. SHELXL re-refinement using the supplied CIF & FCF data. Check in the CSD for related structures. Two case studies that illustrate the use of the above validation and analysis tools follow.

Example 1: Error or Fraud ? Submitted to Acta Cryst. (2011) Structure I

PLATON Report Part 1

PLATON Report Part 2

RELATED STRUCTURE FROM THE CSD Structure II

Structure Report for II

Scatter Plots I(obs) versus I(calc) (I) (II)

Analysis Structure (II) has no validation issues. C-CH3 distance in (II) of 1.50 Ang. as expected. ‘C-F’ distance in (I) is 1.50 Ang. and not the expected 1.35 Ang. Conclusion: Structure (I) is the CH3 variety and not F. Data sets of (I) & (II) are not identical (see next). Data set (I) likely based on CH3 compound. Fraud or Error ? DIFABS file Error ? Authors of (I) confirmed Error believing external chemists proposal. Paper was retracted.

Scatter Plots of 2 Data Sets Two Unrelated Data Sets Two Identical Data sets

CIF versus FCF data Check The R & S values in the three lines # R= should be identical within rounding error. The reported and calculated residual density ranges should also be closely identical This is the case in the first example but not in the second where the CIF & FCF data do not match.

Example 2: Iron(III) Complex

Fe(III) Validation Part 1

Fe(III) Validation Part 2

Example 2: Difference Density Map

Fe Structure Re-refined

Conclusion ? Structure now O.K. after an erratum ? Search for similar (isomorphous) structures in the CSD Yes, there is an isomorphous Mn complex published by a different set of authors from a different university. Let us compare both structures.

Isomorphous Mn(III) Complex

Mn Structure Validation Part 1

Mn Validation Part 2

Scatter Plot Fe versus Mn I(obs) Fe and Mn Data Sets Identical !

Analysis on Fe/Mn Structures The Displacement parameters in the CIF for the H2O molecule in the Fe complex are different from those used in the final refinement. Reflection sets identical for papers from two different sets of authors and location. CSD: Unusual coordination distances Fraud or Error ? Withdraw/Retract one or both ?

Validation Challenges Avoid False Positive and Negative ALERTS Disordered structures (true or artifact) Handling of Twinning (data names missing) Powder structure validation (experts needed) Incommensurate structure validation (experts) Fabricated reflection data – Can we detect them Education – What is the meaning of an ALERT Should validation criteria be different for structures published in chemical journals ?

Concluding Remarks PLATON includes a standalone Validation Tool. It is part of the WEB-based IUCr CheckCIF/PLATON Tool that is capably managed by Mike Hoyland (IUCr) Validation is still a learning process. Chemical insight might be very helpful and often decisive as a validation tool. Deposition of structure factors should be a requirement for all journals (The CCDC now accepts those along with the CIF)

Thanks To Martin Lutz and many others for taking the time to bring various unresolved issues to my attention with actual data. Send to