Chemistry Research Data Interest Group

Slides:



Advertisements
Similar presentations
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Advertisements

© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
Linking Data and Publications: the Chemistry Way Simon Coles School of Chemistry, University of Southampton, U.K. CLADDIER workshop.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
The SPECTRa Project : A wider chemistry picture Alan Tonge & Jim Downing A Digital Repository for the Chemical Community.
© S.J. Coles 2006 Institutional Data Repositories for Chemistry Simon Coles School of Chemistry, University of Southampton, U.K.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Welcome to the Conference !! Juan Bicarregui Chair, APA Executive.
Global Alignment and Collaboration Jo
THE ODIN PROJECT Sergio Ruiz – DataCite Laura Paglione – ORCID ORCID and DataCite Interoperability Network: Connecting Identifiers This project has received.
Natalia Wehler: Dublin Core Requirements on Metadata  multiple softwares to use metadata  management of changing standards  needs to be functional,
University of Southampton, U.K.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K.
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
ICSU World Data System - trusted data services for global science Michael Diepenbroek, Vice-Chair WDS-SC.
Royal Society of Chemistry activities to develop a data repository for chemistry-specific data Aileen Day, Alexey Pshenichnov, Ken Karapetyan, Colin Batchelor,
VIVO and Scholarly Repositories: Synergistic Opportunities.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Vendor Session: ChemSpider, from Royal Society of Chemistry.
Discussion of Data Fabric Terms & Preparation for RDA P7 Virtual Meeting Monday, January 25, 2016 Organized by Gary Berg-Cross (DFT-IG) and Peter Wittenburg.
Chemistry and Materials Science break-out group (Friday morning)
Preservation e-Infrastructure IG Description: help ensure preservation of needed data succeeds Goals: foster worldwide collaboration; ensure consistency.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Herbadrop.
Data Foundations And Terminology (DFT) IG Virtual Meeting July 6 th 2016 Co-Chairs DFT IG :Gary Berg-Cross & Raphael Ritz P8 Sessions DFT IG Breakout Session.
General & Background InformationPractical & Useful DataDetailed, Original Research Encyclopedias Dictionaries Reference Texts Books Safety Information.
Chemistry Research Data Interest Group
Data Foundations And Terminology (DFT) IG
NRF Open Access Statement
Stuart J. Chalk, Department of Chemistry University of North Florida
Overview of WGs, IGs and BoFs
WG/IG Collaboration Meeting 6 Dec 12-13, NIST, Gaithersburg 'Assembling the Pieces: Connecting Outputs with Each Other and with Domain Adoption‘
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Materials Resource Registries Working Group Co-chairs: Laura M
Paolo Budroni, University of Vienna
Research Data Alliance - Research Data Sharing without barriers Terena Networking Conference 22 May 2014.
ORCID ID: Driving needs for analytical data exchange standards and the potential impacts on the chemical sciences Antony Williams.
ACS 2016 Moving research forward with persistent identifiers
knowledge organization for a food secure world
Collaborating to engage chemists in good data management
Who knew I would get here from there: How I became the ChemConnector
The Research Data Alliance - What’s going on in Europe?
Chemistry Research Data Interest Group
Chemistry, University of Southampton, UK
Florian Gräf Software Developer of the McEntyre group at EMBL-EBI
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
The JISC IE Metadata Schema Registry
WG/IG Collaboration Meeting June Göteborg METADATA GROUPS PERSPECTIVE Keith G Jeffery & Rebecca Koskela.
An ecosystem of contributions
From Observational Data to Information (OD2I IG )
Archives and Records Professionals for Research Data IG
Data types and persistent identifiers in
Research Data Alliance (RDA) 9th WG/IG Collaboration Meeting: Repository Platforms for Research Data (RPRD) Interest Group 13nd June 2018 Co-Chairs:
Repository Platforms for Research Data Interest Group: Requirements, Gaps, Capabilities, and Progress Robert R. Downs1, 1 NASA.
Common Solutions to Common Problems
Interoperability – GO FAIR - RDA
Bird of Feather Session
Developing Institutional Data Repositories
Chemistry Metadata Initiatives
Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek
Co-Chairs: Keith Jeffery, Rebecca Koskela, Alex Ball
Leveraging PIDs for object management in data infrastructures RDA UK Node Workshop, July Tobias Weigel (DKRZ)
STFC case study: PhD research graph
Persistent identifiers for instruments (PIDINST) working group
Supporting Open Research
The Research Data Alliance: a (data) window to the world
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

Chemistry Research Data Interest Group WG/IG chairs Meeting, NIST, 11 Jan 2018 David Martinsen Disclaimer: These views are mine, and not necessarily those of my Co-Chairs: Ian Bruno, Stuart Chalk, Richard Kidd, Leah McEwen Chemistry Research Data Interest Group (bit.ly/digchem)

Brief recap of purpose of the IG and planned outcomes/aims Digital Chemistry… “a consistent global framework for Human AND Machine-readable (and “understandable”) chemical information in collaboration with other science communities, industry, and governments” How best to disseminate and deploy chemical data standards and related assets to support this digital framework?

Vision for chemical data standards Cheminformatics Standards Instruments Experiments Devices Internet of Things Data Repositories Human Reader Machine Reader Visualization Metadata Formats Tools Semantics Curation Reviewer

Chemistry Research Data Interest Group (bit.ly/digchem) Chemical Data Publication Workflow Spectrometer Chemical Sample Raw Data Analysis Software Processed Data Supplemental Information Community Discussion Community Discussion Publisher Figshare Spectra Data Package Spectra Files FIDs JCAMP-DX DOI InChIs PIDs Structure Files CTABs Identifiers SI Expt. Images Peer Review Data Analyst Standard Standard Standard Human Readers Chemistry Research Data Interest Group (bit.ly/digchem)

Standard Identifiers and Interoperability ORCID iDs for Researchers 30% of current CSD depositors provide an ORCID iD DOIs for Digital Objects Other persistent identifiers are available (ARKs, Handles, etc.) IDs for Institutions See activities of the Organization Identifier Working Group InChIs for Chemical Structures Identifiers for antibodies, organisms, cell lines, tools https://www.force11.org/group/resource-identification-initiative Identifiers for earths science samples and specimens http://www.igsn.org/ Ack: I. Bruno Chemistry Research Data Interest Group (bit.ly/digchem)

What has been accomplished to date? *Prehistoric Times 1965: The Cambridge Structure Database 1971: Protein Databank 1974: Wiley Registry of Mass Spectral Data 1978: EPA/NIH Mass Spectra Database 1980s: IR 1980s: NMR Communication from Steve Heller: In 1980 there were about 500 computer readable databases available in all fields of science, technology, business, and other areas, with some 75 companies making these databases available online in a computer system which was available for access by telephone and computer terminal connection.

What has been accomplished to date? *Prehistoric Times JCAMP-DX – spectra data file format (SCDS, several extensions) InChI – chemical identifier (InChI Trust, several extensions) RInChI – reaction identifier ThermoML – thermo-property data markup (NIST, current project revision) Gold Book – compendium of IUPAC terminology (SCDS, current project revision) In principle: 2013 Blue Book, Nomenclature for Organic Compounds Hierarchical criteria for preferred IUPAC name (PIN) allows for more systematic encoding of rule-sets in computer algorithms quadrant visual

What has been accomplished to date? Symposia and Open Meetings at ACS National Meetings, 2016, 2017 Symposium and Open Meeting at IUPAC General Assembly and Conference, 2017 IUPAC/RDA-US Workshop, 2016 CODATA Symposium and Workshop, 2017 EMBL-EBI Industry Programme Workshop, 2017 Beilstein Symposium – Open Science and the Chemistry Lab of the Future, 2017 DC VoCamp, 2016 & 2017 RDA Plenaries, 2015, 2016, 2017

What issues, challenges, problems, have been encountered? Finding the right people Chemists with domain knowledge don’t ordinarily attend RDA Ontology experts, repository experts, metadata experts don’t ordinarily frequent chemistry meetings (unless they are reformed chemists) Many groups are finding their own solutions (e.g., Allotrope Foundation, Pistoia Alliance, software vendors, instrument vendors) Getting relevant use cases from non-chemists that really allow us to understand inter- disciplinary needs that the chemistry community should be focussing on.

What is the plan for completion/progress for the coming 6-12 months? Creation of DIGChem website, ready for launch: https://sites.google.com/view/digchem

What is the plan for completion/progress for the coming 6-12 months? Symposia and Open Meetings at ACS National Meetings Presence at RDA/Berlin Cheminformatics Workshop in Amsterdam, July 16-17, 2018, cosponsored with CODATA, focus on GO FAIR, interoperability across disciplines, standards for spectral data SciDataCon/Botswana, planning underway for an Inter-Union Workshop, Symposium: “Data Interoperability in in chemistry, biology, and crystallography”

What is the plan for completion/progress for the coming 6-12 months, and beyond? On 28 July 1919, the International Union of Pure and Applied Chemistry was formally registered, setting in place the foundation of the organization that we serve today. In 2019, IUPAC will celebrate 100 years. The International Year of the Periodic Table of Chemical Elements in 2019 will coincide with the 150th anniversary of the discovery of the Periodic System by Dmitry Mendeleev in 1869

Is your work related to/coordinated with other WG/IGs? Agriculture Materials BioSharing/FAIRSharing Photon and Neutron Data Citation Structural Biology Data Usage Metrics Weather/Climate/Air Quality Persistent Identifiers of Instruments And more… Data Publishing Workflows Publishing Data Scholix ELIXIR Long Tail of Research Data

Many of these initiatives rely on volunteer effort Global data initiatives provide high level guiding principles and motivation Chemistry community initiatives provide domain-specific implementations Many of these initiatives rely on volunteer effort If you want to go far, go together