Data Mining with DDView+ and the PDF-4 Databases Carbamazepine Polymorphs Some slides of this tutorial have sequentially-layered information that is best.

Slides:



Advertisements
Similar presentations
AESuniversity Ad hoc Reporting. Ad hoc Reports What are ad hoc reports? Why would you use ad hoc reports? Creating an ad hoc report from a query Building.
Advertisements

Microsoft® Office Access® 2007 Training
DATA WAREHOUSE QueryLink TUTORIAL QueryLink is used to retrieve Campus data for Departmental use. This tutorial presents an overview of the point and click.
DATABASE BASICS: INSERTING AND FORMATTING DATA EXCEL 07 SESSION II.
Data Mining Tools Sorted Displays Histograms SIeve.
Synchrotron Diffraction. Synchrotron Applications What? Diffraction data are collected on diffractometer beam lines at the world’s synchrotron sources.
Quantitative Analysis Reference Intensity Ratio (RIR)
Electron Diffraction Applications Using the PDF-4+ Relational Database.
2008 Physiological Measurements Focusing on measurements that assess the function of the major body systems 1.
Sorting Pharmaceutical Polymorphic Perplexities with DDView+ and the PDF-4/Organics 2011 Database Carbamazepine Polymorphs.
T.G. Fawcett, S. N. Kabbekodu, F. Needham, J. R. Blanton, D. M. Crane, J. Faber International Centre for Diffraction Data Using PDF-4+/Organics to discover.
Chemical and Structural Classifications. Chemical and Structural Classification What? Materials can be classified by their chemistry and structure. There.
Microsoft Office 2007 Access Chapter 2 Querying a Database.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 2 1 Microsoft Office Access 2003 Tutorial 2 – Creating And Maintaining A.
Creating And Maintaining A Database. 2 Learn the guidelines for designing databases When designing a database, first try to think of all the fields of.
Advanced Identification Tools. This tutorial will demonstrate how a user can increase both the speed and efficiency of the material identification process.
Inventory Throughout this slide show there will be hyperlinks (highlighted in blue) follow the hyperlinks to navigate to the specified Topic or Figure.
Microsoft Access 2010 Chapter 7 Using SQL.
Access Tutorial 3 Maintaining and Querying a Database
Chapter 2 Querying a Database
Creating a MagicInfo Pro Screen Template
Electron Diffraction Search and Identification Strategies.
Molecular Graphics. Molecular Graphics What? PDF-4 products contain data sets with atomic coordinates. A molecular graphic package embedded in the product.
PDF-2 Tools and Searches. PDF-2 Release 2009 Using DDView The PDF-2 Release 2009 database requires retrieval software, such as ICDD’s DDView or developers’
Data Mining with DDView+ and the PDF-4 Databases Solid Solution Crystal Cell Parameters Some slides of this tutorial have sequentially-layered information.
Introduction to Access By Mary Ann Chaney and Alicia Harkleroad.
Chapter 2 Querying a Database MICROSOFT ACCESS 2010.
With Microsoft Office 2007 Introductory© 2008 Pearson Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Office 2007 Introductory.
Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 1 Copyright © 2008 Prentice-Hall. All rights reserved. Committed to Shaping the Next Generation.
11 Exploring Microsoft Office Excel 2007 Chapter 4: Working with Large Worksheets and Tables Chapter 04 - Lecture Notes (CSIT 104)
OST Introduction to Excel Excel 2004 The Project Gallery is a window that links you to every application in Microsoft Office. It allows.
SIeve+ Introduction SIeve+ is a Plug-In module to the DDView+ software which is integrated in the PDF-4 products. SIeve+ is licensed separately at an additional.
Search Preferences. What? – Search Preferences allow the customization of the DDView+ Results table display fields. Why? – To view the relevant data.
Advanced Searches Using History Advanced Searches What? For a given session, a list of Standard Format Past Searches is automatically saved each time.
Microsoft ® Office Access ® 2007 Training Datasheets II: Sum, sort, filter, and find your data ICT Staff Development presents:
Data Presentation & Graphing Introduction to Mechanical Engineering The University of Texas-Pan American College of Science and Engineering.
Evaluating Data Quality. Quality Mark What is a Quality Mark? A Quality Mark is a reliability index used in Powder Diffraction File (PDF).A Quality Mark.
XP New Perspectives on Microsoft Access 2002 Tutorial 21 Microsoft Access Tutorial 2 – Creating And Maintaining A Database.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 2 1 Microsoft Office Access 2003 Tutorial 2 – Creating And Maintaining A.
The Subfile Search In this tutorial, the following Topics are covered: 1. What is a Subfile? 2. How is a Subfile defined? 3. Learn more about a Subfile.
Page 1 Non-Payroll Cost Transfer Enhancements Last update January 24, 2008 What are the some of the new enhancements of the Non-Payroll Cost Transfer?
1 SIeve+ Introduction SIeve+ is a Plug-In module to the DDView+ software which is integrated in the PDF-4 products. SIeve+ is licensed separately at an.
For additional assistance, please call the Help Desk Searching 1. If a Search window does not appear after logging into the system, click the Search icon.
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
1 CHAPTER 3 “Status/Repository”. 2 1.Quotes – Policies Look-up – From the “Community Home Page”, click on this option to view an existing quote or a specific.
Sorting Options. Search Preferences What? – Search Preferences allow the customization of the DDView+ Results table display fields. Why? – To view the.
1. 2 Word Processing Word Processing is writing words and sentences on the computer. It is easy to change or move text in a word document. People use.
Digital Pattern Simulations. Pattern Simulations.
Fall 2003Sylnovie Merchant, Ph.D. ACCESS Tutorial Note: The purpose of this tutorial is to provide an introduction to some of the functions of ACCESS in.
Slide 1 of 35 Welcome to GSA’s Vendor and Customer Self Service (VCSS) course Section 3: Basic Navigation This presentation is compliant with section 508.
Online Catalog Tutorial. Introduction Welcome to the Online Catalog Tutorial. This is the place to find answers to all of your online shopping questions.
Pasewark & Pasewark Microsoft Office 2003: Introductory 1 INTRODUCTORY MICROSOFT ACCESS Lesson 4 – Finding and Ordering Data.
PDF-4 Tools and Searches. PDF The PDF database is powered by our integrated search display software, DDView+. PDF boasts 53 search.
Excel part 5 Working with Excel Tables, PivotTables, and PivotCharts.
Key Applications Module Lesson 22 — Managing and Reporting Database Information Computer Literacy BASICS.
Extracting Information from an Excel List The purpose of creating a database, or list in Excel, is to be able to manipulate the data elements in ways that.
Microsoft Office 2013 Try It! Chapter 4 Storing Data in Access.
This tutorial will describe how to navigate the section of Gramene that allows you to view various types of maps (e.g., genetic, physical, or sequence-based)
Microsoft Excel Prepared by the Academic Faculty Members of IT.
Integrating Components and Dynamic Text Boxes with the Animated Map– Lesson 101 Integrating Components and Dynamic Text Boxes with the Animated Map Lesson.
ACCESS PROJECT TWO. PROJECT ONE In project one you: -Created Tables -Created Forms -Created Reports In this project you will learn about queries. Databases.
Access Chapter 2 Querying a Database.
2008 Physiological Measurements
ICDD Release 2008 New Features
MATHEMATICS SAMPLE ASSESSMENT QUESTIONS
A similarity index for comparing diffraction patterns
Approving Time in Kronos Manager/Supervisor Reference Guide
Sample Question Styles
Data Mining – Minor Phase Analysis
Presentation transcript:

Data Mining with DDView+ and the PDF-4 Databases Carbamazepine Polymorphs Some slides of this tutorial have sequentially-layered information that is best viewed in ‘Slide Show’ mode

This is one of three example-based tutorials for using the data mining capabilities of DDView+ with the PDF-4+ database and it covers the following topic: Two other similar tutorials for data mining exist and cover the following topics: CIGS Photovoltaics – solid solution / cell parameter relationship FeO Non-stoichiometric Oxides – sorting out temperature and stoichiometric effects on cell parameters Carbamazepine Polymorphs – a PDF-4/Organics application – investigating polymorphic forms of an active pharmaceutical ingredient (API)

Carbamazepine Polymorphs An example for the PDF-4/Organics 2008 database that answers the following questions: – How many polymorphs are known to exist? – How do I distinguish these polymorphs by XRPD?

Carbamazepine Polymorphs Search for PDF entries: – Empirical Formula: C 15 H 12 N 2 O Note that this is not unique to carbamazepine – Name: carbamazepine By itself, this name search will also include hydrates, solvates, derivatives – Elements: Only {C, H, N, O} This could be performed, but really is a redundancy if the empirical formula search given above is performed

Results Table Preferences Useful information for sorting out polymorphic forms should be in the results table for retrieved entries A plot of space group vs. reduced cell volume can often provide relevant groupings for polymorph differentiation Another useful plot would be any of the reduced cell edge lengths vs. reduced cell volume Suggested fields for results table: -PDF # -Empirical Formula -Quality Mark -Compound Name -Common Name -International Space Group -Space Group Number -Reduced cell a -Reduced cell b -Reduced cell c -Reduced cell volume

Specifying Results Table Preferences for Carbamazepine Polymorphs Search Selected Fields: Use these buttons to move a selected item up or down in the listed order for the results table. Prepare the ‘Selected Fields’ list for this exercise as illustrated here. Available Fields: Use these buttons to move selected items between the ‘Available Fields’ list of 60 items and the ‘Selected Fields’ list of items that will be displayed in the results table.

Entering the Empirical Formula Search Criterion Empirical Formula criteria is entered on the ‘Elements’ tab of the ‘Search’ window. There should be no space between each element and its ‘subscript’, and one space before entering each subsequent element. These elements can be entered in any order using the ‘Contains Elements’ option, but must match the order of the formula on the PDF card if the ‘Contains Phrase’ option is used.

Entering the ‘Carbamazepine’ Name Search Criterion The name criterion is entered on the ‘Names’ tab Since ‘carbamazepine’ may appear as either a ‘Compound Name’ or ‘Common Name’, the criterion is best entered in the ‘All Names’ category. Because it is a single word, it does not matter if ‘Contains Words’ or ‘Contains Phrase’ is selected. The search can now be performed.

Carbamazepine Polymorphs 17 hits in PDF-4/Organics 2008 database – 12 have cell parameter and space group information – 5 different space group designations (1,2,14,15,148)

Carbamazepine Polymorphs To illustrate groupings related to space groups and reduced cell volumes given in the results table, the following is performed to create the appropriate graph: Enter ‘RedCellVol’ for the X-axis field, and SG # for the Y-axis field

Space Group / Reduced Cell Volume Groupings for 12 Carbamazepine PDF Entries P2 1 /n C2/c R-3 P1 P-1 Note the groupings have an approximate 2:3:4 ‘Reduced Cell Volume’ ratio. The Z values for the unit cells in each group will almost assuredly have the same ratio. The groupings suggest that the likely number of known polymorphs for carbamazepine is 3, 4, or 5.

Carbamazepine – Another View: a-axis Length vs. Reduced Cell Volume Gamma (4?) Alpha (1) Beta (6) Form IV (1) Here, the groupings have been labeled based on information from the ‘Compound Name’ or ‘Comments’ field from at least one member of the group.

Carbamazepine – Comparing XRD Data To sort out all the entries, including those without unit cell information, the X-ray powder diffraction (XRPD) patterns themselves can be used With DDView+, the user can overlay simulated XRPD patterns based on individual PDF entry data Example – overlay simulated XRPD patterns for entry (reported as alpha form but without cell parameters) and entry (reported as alpha form with cell parameters included) Double click anywhere on these ‘Results’ table rows to open the corresponding PDF entries (or right-click and select ‘Open PDF Card’)

Overlaying Simulated XRPD Patterns for Comparison This icon, when clicked, displays an XRPD pattern simulated from the peak list of the current PDF entry. To overlay an XRPD pattern from another entry, use the ‘Plots’ drop- down menu and select ‘Add Full Trace …’. Additional XRPD patterns can be overlaid using the ‘Plots’/’Add Full Trace …’ option To overlay an XRPD pattern from another entry, use the ‘Plots’ drop- down menu and select ‘Add Full Trace …’. Enter the PDF entry number for the pattern to be overlaid: ‘ ’. To overlay an XRPD pattern from another entry, use the ‘Plots’ drop down menu and select ‘Add Full Trace …’. Enter the PDF entry number for the pattern to be overlaid: ‘ ’. Once the PDF # has been entered, click ‘OK’ to overlay the requested XRPD pattern.

α-Carbamazepine Pattern Comparison These two patterns show a similar set of peaks, at roughly similar positions, indicating polymorphic similarity. Intensities are somewhat dissimilar but could be explained by sample thickness: red pattern from thin layer, blue pattern from thick sample. The comments section for entry indicates that this is a deleted pattern, having been replaced by

Mis-assigned Polymorphic Form Entry (red pattern) was also labeled as the α-form by its author. However, one can see from the comparison with a known α-form pattern (top graph – blue) and a known γ-form pattern (bottom graph – blue), that this entry is actually for a γ-carbamazepine.

Overlaying Many XRPD Patterns Using ‘Ctrl-Click’, one can select multiple entries from the ‘Results’ window and overlay their XRPD patterns on one graph. Here, all the reported α-form entries have been selected. A ‘Right-Click’ on one of these selected entries brings up the menu shown here. The ‘Open Diffraction Pattern’ choice will prepare the desired graph of overlaid patterns.

Overlay of Reported α-Carbamazepine Patterns on One Graph The result clearly shows the obvious difference between the XRPD pattern (blue) and the other two. One can use this technique to determine which XRPD patterns among the 17 entries are similar, including those without polymorphic form or space group designations. Results of such a determination are shown on the next slide.

Grouping of Similar XRPD Patterns for 17 Carbamazepine PDF Entries One can overlay the simulated patterns for all 17 PDF entries to establish four groups of similar patterns. Note: Several entries with no space group or polymorphic form information have been reasonably assigned to one of these four groups. 2 entries 2 reported as α-form 1 reported SG as R-3 9 entries 7 reported as β-form 5 reported SG as P2 1 /n 1 reported SG as P2 1 /c 5 entries 1 reported as γ-form 1 misreported as α-form 3 reported SG as P-1 1 reported SG as P1 1 entry 1 reported as Form IV 1 reported SG as C2/c

Carbamazepine - Cell vs. Space Group 17 PDF Entries (12 Experimental, 5 Calculated from Single Crystal data) Beta Gamma Form IV Alpha Revisiting the ‘Cell vs. Space Group’ graph, all 17 entries can be assigned to one of these four groupings, based on similarity of their XRPD patterns.

Carbamazepine Polymorphs Crystallographic and powder diffraction data suggest 4 known polymorphic forms Data mining and display capabilities of DDView+ give users the ability to categorize the database entries The user’s own pattern can be compared with the known polymorph patterns to ascertain polymorphic form

International Centre for Diffraction Data 12 Campus Boulevard Newtown Square, PA Phone: Fax: Thank you for viewing our tutorial. Additional tutorials are available at the ICDD web site (