Inorganic structure prediction : too much and not enough Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen, Le Mans Cedex 9, France. XX Conference on Applied Crystallography, Wisla, Poland, September 2006
CONTENTS - Introduction - Prediction software and examples - More examples from the GRINSP software (especially AlF 3 polymorphs and titanosilicates) - Opened doors, limitations, problems - Conclusion XX Conference on Applied Crystallography, Wisla, Poland, September 2006
INTRODUCTION Personnal views about crystal structure prediction : “Exact” description before synthesis or discovery in nature. These “exact” descriptions should be used for the calculation of powder patterns included in a database for automatic identification of real compounds not yet characterized crystallographycally. XX Conference on Applied Crystallography, Wisla, Poland, September 2006
If the state of the art had dramatically evolved in the past ten years, we should have huge databases of predicted compounds, and not any new crystal structure would surprise us since it would corespond already to an entry in that database. Moreover, we would have obtained in advance the physical properties and we would have preferably synthesized those interesting compounds. Of course, this is absolutely not the case. Where are we with inorganic crystal structure prediction? XX Conference on Applied Crystallography, Wisla, Poland, September 2006
But things are changing, maybe : Two databases of hypothetical compounds were built in One is exclusively devoted to zeolites : M.D. Foster & M.M.J. Treacy - Hypothetical Zeolites – The other includes zeolites as well as other predicted oxides (phosphates, borosilicates, etc) and fluorides : the PCOD (Predicted Crystallography Open Database) XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Prediction software Especially recommended lectures (review papers) : 1- S.M. Woodley, in: Application of Evolutionary Computation in Chemistry, R. L. Johnston (ed), Structure and bonding series, Springer- Verlag 110 (2004) J.C. Schön & M. Jansen, Z. Krist. 216 (2001) ; Software : CASTEP, program for Zeolites, GULP, G42, Spuds, AASBU, GRINSP XX Conference on Applied Crystallography, Wisla, Poland, September 2006
CASTEP Uses the density functional theory (DFT) for ab initio modeling, applying a pseudopotential plane-wave code. M.C Payne et al., Rev. Mod. Phys. 64 (1992) Example : carbon polymorphs XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Hypothetical Carbon Polymorph Suggested By CASTEP XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Another CASTEP prediction XX Conference on Applied Crystallography, Wisla, Poland, September 2006
ZEOLITES The structures gathered in the database of hypothetical zeolites are produced from a 64-processor computer cluster grinding away non-stop, generating graphs and annealing them, the selected frameworks being then re-optimized using the General Utility Lattice Program (GULP, written by Julian Gale) using atomic potentials. M.D. Foster & M.M.J. Treacy - Hypothetical Zeolites – XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Zeolite predictions are probably too much… Less than 200 zeotypes are known Less than 10 new zeotypes are discovered every year Less than half of them are listed in that > database So that zeolite predictions will continue up to attain several millions more… Quantum chemistry validation of these prediction is required, not only empirical energy calculations, for elimination of a large number of models that will certainly never be confirmed.
GULP Appears to be able to predict crystal structures (one can find in the manual the data for the prediction of TiO 2 polymorphs). Recently, a genetic algorithm was implemented in GULP in order to generate crystal framework structures from the knowledge of only the unit cell dimensions and constituent atoms (so, this is not prediction...), the structures of the better candidates produced are relaxed by minimizing the lattice energy, which is based on the Born model of a solid. S.M. Woodley, in: Application of Evolutionary Computation in Chemistry, R. L. Johnston (ed), Structure and bonding series, Springer-Verlag 110 (2004) GULP : J. D. Gale, J. Chem. Soc., Faraday Trans., 93 (1997) XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Part of the command list of GULP : XX Conference on Applied Crystallography, Wisla, Poland, September 2006
G42 A concept of 'energy landscape' of chemical systems is used by Schön and Jansen for structure prediction with their program named G42. J.C. Schön & M. Jansen, Z. Krist. 216 (2001) ; XX Conference on Applied Crystallography, Wisla, Poland, September 2006
SPuDS Dedicated especially to the prediction of perovskites. M.W. Lufaso & P.M. Woodward, Acta Cryst. B57 (2001) XX Conference on Applied Crystallography, Wisla, Poland, September 2006
AASBU method (Automated Assembly of Secondary Building Units) Developed by Mellot-Draznieks et al., C. Mellot-Drazniek, J.M. Newsam, A.M. Gorman, C.M. Freeman & G. Férey, Angew. Chem. Int. Ed. 39 (2000) ; C. Mellot-Drazniek, S. Girard, G. Férey, C. Schön, Z. Cancarevic, M. Jansen, Chem. Eur. J. 8 (2002) Using Cerius2 and GULP in a sequence of simulated annealing plus minimization steps for the aggregation of large structural motifs. Cerius2, Version 4.2, Molecular Simulations Inc., Cambridge, UK, XX Conference on Applied Crystallography, Wisla, Poland, September 2006
Not enough If zeolites are excluded, the productions of these prediction software are a few dozen… not enough, not available in any database. A recent (2005) prediction program is able to extend the investigations to larger series of inorganic compounds characterized by corner-sharing polyhedra.
GRINSP Geometrically Restrained INorganic Structure Prediction Applies the knowledge about the geometrical characteristics of a particular group of inorganic crystal structures (N-connected 3D networks with N = 3, 4, 5, 6, for one or two N values). Explores that limited and special space (exclusive corner-sharing polyhedra) by a Monte Carlo approach. The cost function is very basic, depending on weighted differences between ideal and calculated interatomic distances for first neighbours M-X, X-X and M-M for binary M a X b or ternary M a M' b X c compounds. J. Appl. Cryst. 38, 2005, J. Solid State Chem. 179, 2006,
Observed and predicted cell parameters comparison Predicted by GRINSP (Å)Observed or idealized (Å) Dense SiO 2 abcRabc (%) Quartz Tridymite Cristobalite Zeolites ABW EAB EDI GIS GME Aluminum fluorides -AlF Na 4 Ca 4 Al 7 F AlF 3 -pyrochl Titanosilicates Batisite Pabstite Penkvilskite
Predictions produced by GRINSP Binary compounds Formulations M 2 X 3, MX 2, M 2 X 5 et MX 3 were examined. Zeolites MX 2 (= 4-connected 3D nets) More than 1000 zeolites (not ) are proposed with cell parameters < 16 Å, placed into the PCOD database : GRINSP recognizes a zeotype by comparing the coordination sequences (CS) of a model with a previously established list of CS and with the CS of the models already proposed during the current calculation).
Hypothetical zeolite PCOD SG : P432, a = Å, FD = 11.51
Other GRINSP predictions : > 3000 B 2 O 3 polymorphs Hypothetical B 2 O 3 - PCOD Triangles BO 3 sharing corners. = 3-connected 3D nets
> 500 V 2 O 5 polymorphs square-based pyramids = 5-connected 3D nets
12 AlF 3 polymorphs Corner-sharing octahedra. = 6-connected 3D nets
Do these AlF 3 polymorphs can really exist ? Ab initio energy calculations by WIEN2K « Full Potential (Linearized) Augmented Plane Wave code » A. Le Bail & F. Calvayrac, J. Solid State Chem. 179 (2006)
Ternary compounds M a M’ b X c in 3D networks of polyhedra connected by corners Either M/M’ with same coordination but different ionic radii or with different coordinations (mixed N-N’-connected 3D frameworks) These ternary compounds are not always electrically neutral.
Borosilicates PCOD , Si 5 B 2 O 13, R = > 3000 models SiO 4 tetrahedra and BO 3 triangles
Aluminoborates > 2000 models Example : [AlB 4 O 9 ] -2, cubic, SG : Pn-3, a = Å, R = : AlO 6 octahedra and BO 3 triangles
Fluoroaluminates Known Na 4 Ca 4 Al 7 F 33 : PCOD [Ca 4 Al 7 F 33 ] 4-. Two-sizes octahedra AlF 6 and CaF 6
Unknown : PCOD [Ca 3 Al 4 F 21 ] 3-
Results for titanosilicates > 1000 models TiO 6 octahedra and SiO 4 tetrahedra
Numbers of compounds in ICSD version 1-4-1, (89369 entries) potentially fitting structurally with the [TiSi n O (3+2n) ] 2- series of GRINSP predictions, adding either C, C 2 or CD cations for electrical neutrality. n+C+C 2 +CDTotalGRINSP ABX AB 2 X AB 3 X AB 4 X AB 5 X AB 6 X Total More than 70% of the predicted titanosilicates have the general formula [TiSi n O (3+2n) ] 2- Not all these 2581 ICSD structures are built up from corner sharing octahedra and tetrahedra. Many isostructural compounds inside.
Models with real counterparts
Example in PCOD Not too bad if one considers that K et H 2 O are not taken into account in the model prediction... Model PCOD (Si 3 TiO 9 ) 2- : a = 7.22 Å; b = 9.97 Å; c =12.93 Å, SG P Known as K 2 TiSi 3 O 9.H 2 O (isostructural to mineral umbite): a = Å; b = Å; c = Å, SG P (Eur. J. Solid State Inorg. Chem. 34, 1997, )
Highest quality (?) models
Models with the largest porosity
PCOD : P = 70.2%, FD = 10.6, D P = 3 (dimensionality of the pore/channels system) [Si 6 TiO 15 ] 2-, cubic, SG = P4 1 32, a = Å Ring apertures 9 x 9 x 9
PCOD , P = 61.7%, FD = 12.0, D P = 3 [Si 2 TiO 7 ] 2-, orthorhombic, SG = Imma Ring apertures 10 x 8 x 8
PCOD , P = 61.8%, FD = 13.0, D P = 3 [Si 6 TiO 15 ] 2-, cubic, SG = Pn-3 Ring apertures 12 x 12 x
PCOD , P = 59.6%, FD = 13.0, D P = 3 [Si 4 TiO 11 ] 2-, tetragonal, SG = P4 2 /mcm Ring apertures 12 x 10 x 10
Opened doors, Limitations, Problems GRINSP limitation : exclusively corner-sharing polyhedra. Opening the door potentially to > hypothetical compounds. The predicted titanosilicates can be extrapolated to phosphates, sulfates, and/or replacing Ti by Nb, V, Zr, Ga, etc. More than should be included into PCOD before the end of Then, their powder patterns will be calculated and possibly used for search-match identification.
Expected improvements : Edge, face, corner-sharing, mixed. Hole detection, filling them automatically, appropriately, for electrical neutrality. Using bond valence rules or/and energy calculations to define a new cost function. Extension to quaternary compounds, combining more than two different polyhedra. Etc, etc. Do it yourself, the GRINSP software is open source…
Two things that don’t work well enough up to now… Validation - Ab initio calculations (WIEN2K, etc) : not fast enough for the validation of > structure candidates (was 2 months for 12 AlF 3 models) Identification - There is no efficient tool for the identification of the known structures (from the ICSD) among >10000 hypothetical compounds
One advice, if you become a structure predictor Send your data (CIFs) to the PCOD, thanks… (no proteins, no nucleic acid, not zeolites)
CONCLUSIONS Structure and properties prediction is THE challenge of this XXIth century in crystallography. Advantages are obvious (less serendipity and fishing-type syntheses). We have to establish databases of predicted compounds, preferably open access on the Internet, finding some equilibrium between too much and not enough. If we are unable to do that, we have to stop pretending to understand and master the crystallography laws.