Institute of Biotechnology

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Configuration management
Configuration management
ServiceDesk Plus Product Overview Presented by ManageEngine 1.
ServiceDesk Plus MSP Product Overview. Why ServiceDesk Plus - MSP? Capability of Managing Multiple Client’s in one Help Desk Stop Juggling with multiple.
INTRODUCTION Massive inorganic crystal structure predictions were recently Performed, justifying the creation of new databases. Among them, the PCOD [1]
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
Requirements Specification
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Putting 3D-print files of crystallographic models into open access International Advisory Board of the Crystallography Open Database & T. J. Snyder solid.
DB2 (Express C Edition) Installation and Using a Database
Computer Concepts – Illustrated 8 th edition Unit C: Computer Software.
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
 EPrints & Preservation David Tarrant University of Southampton (UK) Preserv Repository Preservation and Interoperability.org.uk.
Crystallography Open Database (COD), Predicted Crystallography Open Database (PCOD) and Material Properties Open Database (MPOD)
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
INTRODUCTION The COD was created in March 2003 and was built on the PDB model of open access on the Internet. It is intended that this database [1] consists.
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
CONCLUSIONS COD server is technically in position to store and serve all structures that are currently solved. COD deposition procedure ensures syntactic.
Section 2 Software.
Configuration Management (CM)
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
The PCOD and P2D2 databases (P for Predicted) Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen,
Samba – Good Just Keeps Getting Better The new and not so new features available in Samba, and how they benefit your organization. Copyright 2002 © Dustin.
VIGNAN'S NIRULA INSTITUTE OF TECHNOLOGY & SCIENCE FOR WOMEN TOOLS LINKS PRESENTED BY 1.P.NAVEENA09NN1A A.SOUJANYA09NN1A R.PRASANNA09NN1A1251.
COD (CRYSTALLOGRAPHY OPEN DATABASE) and PCOD (PREDICTED) COD Advisory Board : Daniel Chateigner (France), XiaoLong Chen (China), Marco E. Ciriotti (Italy),
Proposal for an Open Source Flash Failure Analysis Platform (FLAP) By Michael Tomer, Cory Shirts, SzeHsiang Harper, Jake Johns
News in Open Databases COD, PCOD, TCOD, MPOD, FPSM … Association Française de Cristallographie 2013, Bordeaux D. Chateigner, S. Grazulis, J. Butkus, A.
Introduction to bibliographic software and Mendeley Anyone can use this - just put your name here.
Document Name CONFIDENTIAL Version Control Version No.DateType of ChangesOwner/ Author Date of Review/Expiry The information contained in this document.
The platform for EU Interparliamentary Exchange IPEX XML feed and XML starter kit IPEX Correspondents meeting The Hague, november (Jerry.
Structure Prediction (especially with GRINSP)
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Recent activities around the crystallography open databases COD, PCOD and P2D2 Armel Le Bail Université du Maine, Laboratoire des oxydes et Fluorures,
Development Environment
SOFTWARE TESTING Date: 29-Dec-2016 By: Ram Karthick.
Software Project Configuration Management
How to Contribute to System Testing and Extract Results
What are they? The Package Repository Client is a set of Tcl scripts that are capable of locating, downloading, and installing packages for both Tcl and.
The Client-Server Model
Project Objectives Publish to a remote server
GO! with Microsoft Office 2016
The Rietveld Method Armel Le Bail
Pipeline Execution Environment
Report Writing.
GO! with Microsoft Access 2016
LCGAA nightlies infrastructure
Building A Web-based University Archive
Department of Computer Science Homepage
R Programming.
Introduction to Cloud Computing
COD (CRYSTALLOGRAPHY OPEN DATABASE) and PCOD (PREDICTED)
PREDICTED CORNER SHARING TITANIUM SILICATES
Running Large distributed projects
What's New in eCognition 9
Analysis models and design models
Enterprise Program Management Office
Microsoft Office Not in Textbook.
SDLC Phases Systems Design.
Case Study 1 By : Shweta Agarwal Nikhil Walecha Amit Goyal
Automation of Control System Configuration TAC 18
In-house Developed Library Solutions
What's New in eCognition 9
What is a System? A system is a collection of interrelated components that work together to perform a specific task.
What's New in eCognition 9
Web Application Development Using PHP
Presentation transcript:

Institute of Biotechnology EU Centre of Excellence Software for maintaining and expanding the (Predicted) Crystallography Open Database Saulius Gražulisa, Justas Butkusa, Robert Downsb, Miguel Quirós Olozábalc, Armel Le Baild a Institute of Biotechnology, Graiciuno 8, LT-02241 Vilnius, Lithuania b Department of Geosciences, University of Arizona, Tucson, Arizona 85721-0077, USA c Departamento de Química Inorgánica, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain d Université du Maine, Laboratoire des Oxydes et Fluorures, CNRS UMR 6010, Avenue O. Messiaen, 72085 Le Mans Cedex 9, France Introduction Data are arriving in ever increasing rates. Thus, automated software tools are needed to cope with the growing amount of data, to provide consistent, uniform and accurate information. The following goals are set by COD team: build automated structure deposition tools; build a collaboration platform for structure validation and curation; ensure data quality – uniformity, integrity, and trustworthiness; make scientific data freely accessible to anyone. Deposit your data to COD for publication! http://www.crystallography.net/ Automatic data deposition cif-tools available at svn://www.crystallography.net/cif-tools A set of tools based on CIFParser was developed and deployed: All basic programs a *x style command programs, developed for Linux; agile development – fast deployment, automatic unit testing ensures high quality and predictability new tools can be added as needed by anybody; integrable into GUIs and Web applications. Example programs: cif_filter – extract uncopywritable data from CIFs, fix errors, compute missing data values; cif_cod_check – check whether a CIF satisfies COD requirements CIFParser A new CIF parser was developed, to meet the following demands: identify, locate and clearly show errors in the CIF source (existing parsers were found to be not adequate for this task); callable from Perl for ultimate flexibility; capable to correct most common CIF errors in a predictable and documented way (via extension a documented and switchable extension of the IUCr CIF grammar); easy to extend and check against official CIF grammar for conformance – written in yapp notation, close to a Backus-Naur form. free software – distributable under an OSI – compatible license Automatic error detection/correction Automated CIF download Remember: atomic coordinates are not copyrightable. The scripts introduce delays so that no noticeable charge will be put in the servers. COD identifies itself as the downloader. The bibliographic information is downloaded from Openurl or Pubmed and appended to the CIF's. Search by SMILES data_whatever .... _cell_length_a ... _atom_site_fract_x _geom_bond_distance [Pd]([P](c1ccccc1)(C)C)(ON(=O)=O)... from CIF to SMILES Use of available free tools: Openbabel (http://www.openbabel.org). Very compact ASCII format Human easily readable, understandable and editable. Stores ONLY chemical connectivity, discard any other information. SMILES-building helper scripts Classification of structures: Is it easy to define what is the “molecule” ? Choosing between openbabel-generated connectivity and author-supplied bonds. Automatic fixing of some common mistakes. Removing of chirality markers in non-chiral groups. Human inspection of the results and fixing if necessary. Expanding the PCOD 2010 update : 898.707 SiO2 entries were added from ZEFSAII zeolite predictions and the contributions from GRINSP increased to 163.520 (silicates, phosphates, sulfates of Al, Ti, V, Ga, Nb, Zr, or zeolites, fluorides, etc). The PCOD is the first database to attain and offer more than one million of CIF entries. Software : a new GRINSP version is now available [3] for parallel computing (for instance using fully the 8 processors of an INTEL core i7). Other data from other prediction computer programs (CASTEP, CERIUS2, CRYSTAL, G42, GULP, USPEX…) are expected, just send them, please. PCOD Powder P2D2 All powder patterns (> 1 million) were calculated and gathered in the P2D2 (Predicted Powder Diffraction Database [4]), they can be used for search-match purposes with EVA (Bruker), Highscore (Panalytical) and more soon. Trying to solve the zeolite LZ-200 unknown crystal structure by using PCOD/P2D2 and search-match identification techniques [x] VIRTUAL MODELS in PCOD Zeolites B2O3 nanotubes [Ca3Al4F21]3- Titanosilicates CONCLUSIONS COD server is technically in position to store and serve all structures that are currently solved. COD deposition procedure ensures syntactic correctness and presence of the most essential CIF data. COD and PCOD are constantly expanded to meet more needs of crystallographic community We rely on the help of crystallographic community to add more data and ensure the data correctness! Use and Add more structures to: http://www.crystallography.net/ THANK YOU ! Acknowledgments The Vilnius COD development group is financed by the Research Council of Lithuania, contract No. MIP-124/2010 We thank Adriana Daškevič and Andrius Merkys for help with software development and data curation. Authors thank to all CIF donators, listed on our Web page, and to numerous anonymous volunteers who help to collect data and keep COD running. The COD Advisory Board thanks comercial supporters for donated hardware and financial support. REFERENCES [1] http://www.crystallography.net/ [2] S. Grazulis et al., (2009). J. Appl. Cryst. 42, 726-729. [3] A. Le Bail (2010). Phys. Chem. Chem. Phys. 12, 8521-8530. [4] A. Le Bail (2008). Powder Diffr. Suppl. 23, S5-S12. COD/PCOD : cod@crystallography.net ———————