The PCOD and P2D2 databases (P for Predicted) Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen,

Slides:



Advertisements
Similar presentations
Publisher perspective eBank/R4L/SPECTRa Joint Consultation Workshop London Metropole Hotel 20 October 2006.
Advertisements

Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Data activities of the International Union of Crystallography Brian McMahon IUCr 5 Abbey Square Chester CH1 2HU
INTRODUCTION Massive inorganic crystal structure predictions were recently Performed, justifying the creation of new databases. Among them, the PCOD [1]
Electron Diffraction Applications Using the PDF-4+ Relational Database.
Practice of analysis and interpretation of X-ray diffraction data
Chem Single Crystals For single crystals, we see the individual reciprocal lattice points projected onto the detector and we can determine the values.
Small Molecule Example – YLID Unit Cell Contents and Z Value
Introduction to the Powder Diffraction File: The PDF-4 Family of Relational Databases International Centre for Diffraction Data.
What is e-Science? e-Science refers to large scale science that will increasingly be carried out through distributed global collaborations enabled by the.
T.G. Fawcett, S. N. Kabbekodu, F. Needham, J. R. Blanton, D. M. Crane, J. Faber International Centre for Diffraction Data Using PDF-4+/Organics to discover.
Frontiers Between Crystal Structure Prediction and Determination by Powder Diffractometry Armel Le Bail Université du Maine, Laboratoire des Oxydes et.
Timothy G. Fawcett, Soorya N. Kabbekodu, Fangling Needham and Cyrus E. Crowder International Centre for Diffraction Data, Newtown Square, PA, USA Experimental.
Inorganic Structure Prediction with GRINSP Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen,
Chem Thermal Ellipsoids Remember that thermal ellipsoids can indicate problems with a refinement even when the R factors seem to indicate a reasonable.
Crystallographic Data Publication at Source International Union of Crystallography Peter R. Strickland and Brian McMahon IUCr 5 Abbey Square Chester CH1.
Automated Crystal Structure Validation Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Platon Workshop Chicago,
Inorganic structure prediction : too much and not enough Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue.
Structure Validation in Chemical Crystallography Principles and Application Ton Spek, National Single Crystal Service Facility, Utrecht University SAB-Delft,
Microporous Titanium Silicates Predicted by GRINSP Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen,
Advanced Identification Tools. This tutorial will demonstrate how a user can increase both the speed and efficiency of the material identification process.
NIST and other spectral databases John C. Huffman IUMSC.
Putting 3D-print files of crystallographic models into open access International Advisory Board of the Crystallography Open Database & T. J. Snyder solid.
The Need for Speed. The PDF-4+ database is designed to handle very large amounts of data and provide the user with an ability to perform extensive data.
Molecular Graphics. Molecular Graphics What? PDF-4 products contain data sets with atomic coordinates. A molecular graphic package embedded in the product.
Construction of efficient PDP scheme for Distributed Cloud Storage. By Manognya Reddy Kondam.
Linux Operations and Administration
Crystallography Open Database (COD), Predicted Crystallography Open Database (PCOD) and Material Properties Open Database (MPOD)
Process Flowsheet Generation & Design Through a Group Contribution Approach Lo ï c d ’ Anterroches CAPEC Friday Morning Seminar, Spring 2005.
INTRODUCTION The COD was created in March 2003 and was built on the PDB model of open access on the Internet. It is intended that this database [1] consists.
Information Sources in Crystallography Your Logo Here Gregory K. Youngen Physics/Astronomy Librarian University of Illinois at Urbana-Champaign Gregory.
SIeve+ Introduction SIeve+ is a Plug-In module to the DDView+ software which is integrated in the PDF-4 products. SIeve+ is licensed separately at an additional.
CONCLUSIONS COD server is technically in position to store and serve all structures that are currently solved. COD deposition procedure ensures syntactic.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
McMaille – Sous le Capot (Under the Bonnet) A.Le Bail Université du Maine Laboratoire des Oxydes et Fluorures CNRS – UMR 6010 FRANCE
High Throughput Screening of Materials (CCP9) Friday 20 th April 2012 CXD Workshop.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
1 SIeve+ Introduction SIeve+ is a Plug-In module to the DDView+ software which is integrated in the PDF-4 products. SIeve+ is licensed separately at an.
Training and Evaluation Tool Milan Jovic Dusan Jevtic Dr Dragan Jankovic Public Reporting on Project Results TEMPUS project.
INTRODUCTION The results from a third structure determination by powder diffractometry (SDPD) round-robin are summarized. From the 175 potential participants.
COD (CRYSTALLOGRAPHY OPEN DATABASE) and PCOD (PREDICTED) COD Advisory Board : Daniel Chateigner (France), XiaoLong Chen (China), Marco E. Ciriotti (Italy),
News in Open Databases COD, PCOD, TCOD, MPOD, FPSM … Association Française de Cristallographie 2013, Bordeaux D. Chateigner, S. Grazulis, J. Butkus, A.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Zach Miller Computer Sciences Department University of Wisconsin-Madison Supporting the Computation Needs.
X-ray powder diffraction
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Lecture 53: X-ray crystallography. Electrons deflect x-rays We try to recreate electron density from the x-ray diffraction pattern Each point in space.
Presentation Outline ANAELU: 2-D XRD texture analysis Experimental 2D XRD patterns Representation of structure Simulation of single-crystal XRD Polycrystal.
Structure Prediction (especially with GRINSP)
CHARACTERIZATION OF THE STRUCTURE OF SOLIDS
Recent activities around the crystallography open databases COD, PCOD and P2D2 Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures,
PDBe Protein Interfaces, Surfaces and Assemblies
Software for Crystallographic and Rietveld Analysis
Crystallography images
The Rietveld Method Armel Le Bail
Database Requirements for CCP4 17th October 2005
Institute of Biotechnology
COD (CRYSTALLOGRAPHY OPEN DATABASE) and PCOD (PREDICTED)
PREDICTED CORNER SHARING TITANIUM SILICATES
ICDD Release 2008 New Features
Solid state (Calculations & Doping)
A similarity index for comparing diffraction patterns
Solid state (Calculations & Doping)
Solid Crystal Structures. (based on Chap
Solid Crystal Structures. (based on Chap
The site to download BALBES:
Solid Crystal Structures. (based on Chap
Inorganic Structure Prediction with GRINSP
Instructors Tim Fawcett Suri Kabekkodu Diane Sagnella
Getting Cell Parameters from Powder Diffraction Data
Presentation transcript:

The PCOD and P2D2 databases (P for Predicted) Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen, Le Mans Cedex 9, France. Energy Landscape of Solids: from (Hypothetical) Topologies to Material Properties Lausanne - July,

OUTLINE - Foundations of the COD, PCOD, P2D2 databases - Current state of these open databases - Some applications - Future - Conclusion Aim of the talk : try to decide the crystal structure prediction experts to deposit their best models in an open access database (PCOD)

FOUNDATIONS COD = Crystallography Open Database Foundation March 2003 = Actual crystal structures PCOD = Predicted Crystallography Open Database Foundation December 2003 P2D2 = Predicted Powder Diffraction Database Foundation February 2007

OPEN DATA and Crystallography Databases ——— Open access on the Web, before COD : PDB (proteins) NDB (nucleic acids) AMCSD (minerals) Toll databases : CSD (organic, organometallic) ICSD (inorganic, minerals) CRYSTMET (metals, intermetallics) ICDD (powder patterns)

COD Built on the PDB model of open access on the Internet. Consists of any small or medium crystal structure (inorganic, organic, organometallic). Total entry number ~70000, (~10000 from the American Mineralogist Crystal Structure Database, ~30000 from IUCr, and CIF files donations from a few laboratories in Europe or from individuals). Distribution through an Apache/MYSQL/PHP system taking queries on chemistry, range of cell parameters, volumes, etc, as well as combinations of fields, and allows to download or upload CIF files.

= Recent addition of the IUCr logo, after permission to download their CIFs in September 2007 New COD coordinator since January 2008 : Dr. Saulius Gražulis, Institute of Biotechnology, Graiciuno 8, LT Vilnius, Lietuva (Lithuania) entries – July 2008

SEARCH OPTIONS Search page Results

SEARCH the COD The COD wishes to offer minimal and simple search possibilities, allowing you : 1- to verify if the structure you intend to solve is not already solved, 2- to find models or fragments for solving your current problem, 3- to make a correct job if an editor asks you to review a manuscript. The problem being now that the COD is not completed yet…

GET the COD TOOLS EasyPHP (Apache server, MySQL, PHP scripts) You can download the complete database and make it run on your PC. You can reuse the complete system and create your lab CIF repository.

PCOD (P = Predicted) PCOD contains > entries (« structure candidates ») Most are predictions by the GRINSP software CIF files : hypothetical zeolites and other binary or ternary compounds corresponding to N and N/N’-connected 3D networks (N = 3, 4, 5, 6). PCOD is open for search, download and upload of predicted crystal structures (coming from any prediction computer program, inorganic or small and medium organic molecules).

entries in March 2008, updated once a year, up to now.

SEARCHING PCOD Search page Results

VIRTUAL MODELS in PCOD Zeolites B 2 O 3 nanotubes [Ca 3 Al 4 F 21 ] 3-

Structure candidates in PCOD Not ranked by energy (exception for the AlF 3 series studied by WIEN2K), not all electrically neutral… More work needed on the GRINSP software.

For each series, classifications by quality (R), framework density (FD), coordination sequence (CS) are available 2154 Titanosilicates

GRINSP is an Open Source software

Applications from COD and PCOD data 1 - Identification from calculated powder patterns : Actual structures : COD : Match ! Crystal Impact Virtual structures : PCOD : P2D2 -> EVA- Bruker 2 - Structural fingerprints for nanocrystals by means of TEM, HRTEM 3 - Interface with COD and PCOD for visualization, importing, exporting data to other applications like GULP to calculate energies, phonon properties, molecular dynamics, free energies and so on…

Identification from calculated powder patterns (from the COD) : Match! sofware from Crystal Impact

Identification from calculated powder patterns

Predicted crystal structures (from the PCOD) provide predicted fingerprints: powder patterns

Calculated powder patterns in the P2D2 allow for identification by search-match (EVA - Bruker and Highscore - Panalytical) List of d(Å) and intensities from the Bragg law, providing a way for « immediate structure solution » We « simply » need for a complete database of predicted structures ;-)

Example 1 – The actual and virtual structures have the same chemical formula, PAD = 0.52% (percentage of absolute difference on cell parameters, averaged) :  -AlF 3, tetragonal, a = Å, c = Å. Predicted : Å, Å. A global search (no chemical restraint) is resulting in the actual compound (PDF-2) in first position and the virtual one (PPDF-1) in 2 nd (green mark in the toolbox).

Example 2 – Model showing uncomplete chemistry, PAD = Actual compound : K 2 TiSi 3 O 9  H 2 O, orthorhombic, a = Å, b = Å, c = Å. Predicted framework : TiSi 3 O 9, a = 7.22 Å, b = 9.97 Å, c =12.93 Å. Without chemical restraint, the correct PDF-2 entry is coming at the head of the list, but no virtual model. By using the chemical restraint (Ti + Si + O), the correct PPDF-1 entry comes in second position in spite of large intensity disagreements with the experimental powder pattern (K and H 2 O are lacking in the PCOD model) : Virtual Actual

Example 3 – Model showing uncomplete chemistry, PAD = Predicted framework : Ca 4 Al 7 F 33, cubic, a = Å. Actual compound : Na 4 Ca 4 Al 7 F 33, a = Å. By a search with chemical restraints (Ca + Al + F) the virtual model comes in fifth position, after 4 PDF-2 correct entries, if the maximum angle is limited to 30°(2  ) : Virtual

Example 4 : heulandite

Example 5 : Mordenite

Two main problems in identification by search-match process from the P2D2 : - Inaccuracies in the predicted cell parameters, introducing discrepancies in the peak positions. - Uncomplete chemistry of the models, influencing the peak intensities. However, identification may succeed satisfyingly if the chemistry is restrained adequately during the search and if the averaged difference in cell parameters is smaller than 1%.

« New similarity index for crystal structure determination from X-ray powder diagrams, » D.W.M. Hofmann and L. Kuleshova, J. Appl. Cryst. 38 (2005) A similarity index less sensitive to cell parameter discrepancies

δ-Zn 2 P 2 O 7 Bataille et al., J. Solid State Chem. 140 (1998) Typical case to be solved by prediction α β γ δ Uncertain indexing, line profiles broadened by size/microstrain effects (Powder pattern not better from synchrotron radiation than from conventional X-rays) But the fingerprint is there…

Other fingerprints than powder patterns may be calculated from structural data : building fingerprints for nanocrystal identification by transmission electron microscopy. P. Moeck and P. Fraundorf, Z. Kristallogr. 222 (2007)

J. Appl. Crys. 41 (2008)

Examples of search with that COD/PCOD User Interface

From that COD/PCOD graphical user interface, you may decide to study more seriously some series of structures predicted by the GRINSP software. This was done already for the predicted AlF 3, using WIEN2K : A. Le Bail, F. Calvayrac, J. Solid State Chem. 179 (2006)

Expected GRINSP improvements : Edge, face, corner-sharing, mixed. Hole detection, filling them automatically, appropriately, for electrical neutrality. Using bond valence rules or/and energy calculations to define a new cost function. Extension to quaternary compounds, combining more than two different polyhedra. Etc, etc. Do it yourself, the GRINSP software is open source… Nothing planned about hybrids…

Two things that don’t work well enough up to now… Validation of the Predictions - Ab initio calculations (WIEN2K, etc) : not fast enough for the validation of > structure candidates (was 2 months for 12 AlF 3 models) Identification (is this predicted structure already known?) - There is no efficient tool for the fast comparison of these thousands of inorganic predicted structures to the known structures (inside of ICSD)

People do not think to look in databases if their new compound was already predicted… Rb 2 Zn 3 (P 2 O 7 ) 2 Averbuch-Pouchot, M.T. (1985). Z. Kristallogr. 171, K 2 Zn 3 (P 2 O 7 ) 2 Ji, L.N. et al. (2008). Powder Diffraction, in press. Both are corresponding to 3D 4-connected nets of ZnO 4 and PO 4 tetrahedra sharing corners (7 nodes). Calculating their coordination sequence and searching in two databases (hypothetical zeolites and PCOD) : none of these « simple » networks is predicted yet… Such a verification is not easy to realize… Tools have to be built…

Suggestion to the structure predictors Send your data (CIFs) to the PCOD, thanks…

CIF = Crystallographic Information File The IUCr standard data exchange file format (International Union for Crystallography) Description of the format : data_PCOD _publ_section_title ; Acta Crystallographica B61 (2005) Hypothetical binodal zeolitic frameworks, A. Simperler, M. D. Foster, O. Delgado Friedrichs, R. G. Bell, F. A. Almeida Paz and J. Klinowski ; _chemical_name_common '2_106‘ _chemical_formula_sum 'O2 Si‘ _symmetry_space_group_name_H-M 'P 63/m m c‘ _cell_length_a _cell_length_b _cell_length_c _cell_angle_alpha _cell_angle_beta _cell_angle_gamma loop_ _atom_site_label _atom_site_fract_x _atom_site_fract_y _atom_site_fract_z _atom_site_type_symbol Si Si Si Si O O O O O O O O O O O O

Future for the COD, PCOD, P2D2 COD : need to attain > actual structures entries… Need to convince the ACS and RSC to give permission to download systematically their CIFs Need to decide more search-match software producers to incorporate powder patterns calculated from the COD PCOD and P2D2 : virtual structures Need to improve the quality of the predicted crystal structures by bond valence and energy calculations, etc The number of entries may grow fast, and also decrease times to times, as our material theories progress, allowing to suppress wrong predictions…

CONCLUSION To you to see what you can do with or for the COD, PCOD, P2D2 database… Knowing that : Structure and properties full prediction is THE challenge of this XXIth century in crystallography

COD/PCOD International Advisory Board Chateigner, D. (France) Chen, X.L. (China) Ciriotti, M. (Italy) Downs, R.T. (USA) Gražulis, S. (Lithuania) Le Bail, A. (France) Lutterotti, L. (Italy) Matsushita, Y. (Japan) Moeck, P. (USA) Quirós Olozábal, M. (Spain) Rajan, H. (India) Yokochi, A.F.T. (USA)