A metadata system for the global observatory community

Slides:



Advertisements
Similar presentations
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Advertisements

Data citation at Geoscience Australia Policy Amanda Steen (Systems and Data Librarian) Infrastructure to support data citation Dr Sue Fyfe (Director, Data.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
WMO Core Profile of the ISO Metadata Standard Steve Foreman Chair IPET-Metadata Implementation.
WorldWideEnergy: A paradigm shift in advancing energy information access Ms. Deborah Cutler International Program Manager Office of Scientific and Technical.
EuroCRIS Best Practices & Solutions Members Helping Members Move Forward.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
1 Workshop on Metadata Interoperability for Electronic Records Management November 15, 2001 Archives II, College Park, MD.
Data Management: Documentation & Metadata Types of Documentation.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
12 December, 2012 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: European Filing Rules CWA1Page 1.
METADATA Research Data Management. What is metadata? Metadata is additional information that is required to make sense of your files – it’s data about.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
SWIS Digital Inspections Project (SWIS DIP) Chris Allen, Information Management Branch California Integrated Waste Management Board November 5, 2008 The.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
U.S. Department of the Interior U.S. Geological Survey NWIS, STORET, and XML National Water Quality Monitoring Council August 20, 2003.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Tech Terminology for non-technical people Tim Bornholtz 2006 Annual Conference.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Database Basics BCIS 3680 Enterprise Programming.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
An introduction to the MEDIN Discovery Metadata Standard.
An introduction to the MEDIN Discovery Metadata Standard.
Understanding SEED Headers. SEED is an international standard for the exchange of digital seismological data SEED was designed for use by the earthquake.
An Introduction to the MEDIN Discovery Metadata Standard MEDIN Workshop BGS, Edinburgh, June 2015.
An Introduction to the MEDIN Discovery Metadata Standard MEDIN Workshop NOC, Liverpool, Sept 2015.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
CS 501: Software Engineering Fall 1999 Lecture 23 Design for Usability I.
Intro to HTML CS 1150 Spring 2017.
NASA Earth Science Data Stewardship
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
Russian Academy of Sciences
Intro to HTML CS 1150 Fall 2016.
GRDC Metadata Profile – the way forward
aspects of archive system design
Peter Shepherd COUNTER March 2012
Chris Menegay Sr. Consultant TECHSYS Business Solutions
An Overview of Data-PASS Shared Catalog
DOI Overview to Support its Use in GSICS
Karen Dennison Collections Development Manager
A Metadata System for Geomagnetism
Data collection methodology and NM paradigms
GLOBAL BIODIVERSITY INFORMATION FACILITY
SDMX FROM NEEDS Kamel ABDELLAOUI Rafik Mahjoubi
GSICS Data Management and Availability to Users
SDMX Information Model
Richard Waller NOF Technical Advisor UKOLN is supported by:
A SPEAKER’S GUIDEBOOK 4TH EDITION CHAPTER 9
Data Management: Documentation & Metadata
HOW TO WRITE A SYSTEMATIC/NARRATIVE REVIEW
SDMX: A brief introduction
Prepared by: Jennifer Saleem Arrigo, Program Manager
Citation Map Visualizing citation data in the Web of Science
Identifying An Article and Getting Scientist Participation
Research Data Management
Tech·Ed North America /22/2019 3:15 AM
Database Management Systems
Prepared by Peter Boško, Luxembourg June 2012
The ultimate in data organization
School of Information Studies, Syracuse University, Syracuse, NY, USA
MSDI training courses feedback MSDIWG10 March 2019 Busan
Capitalising on Metadata
The Attribute and the ecosystem
Web Service Requirements for WISE Data Exchange
Australian and New Zealand Metadata Working Group
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

A metadata system for the global observatory community Geomagnetic Metadata A metadata system for the global observatory community

What is Metadata? Information about data, EG: Kelly, G.S.; Viljanen, A.; Beggan, C.D.; Thomson, A.W.P.. 2017 Understanding GIC in the UK and French high-voltage transmission systems during severe magnetic storms. Space Weather, 15 (1). 99-114. https://doi.org/10.1002/2016SW001469 Several pieces of metadata appear in this citation (metadata) to a journal publication (data): A list of authors A title for an article A title for a publication Some references There’s nothing conceptually novel about metadata. It’s been around of a long time.

Why do we need it Data on its own

Why do we need it Data with metadata

Why do we need it We won’t always be around to explain our work in person. Are we doing as good a job to record our work in the digital age as was done by observatories creating yearbooks?

Why do we need international standards Format IAGA-2002 | Source of Data Danish Meteorological Institute | Station Name Narsarsuaq | IAGA CODE NAQ | Geodetic Latitude 61.160 | Geodetic Longitude 314.560 | Elevation 4 | Reported XYZF | Sensor Orientation DIF | Digital Sampling 0.01 seconds | Data Interval Type Filtered 1-minute (00:30 - 01:29) | Data Type Definitive | DATE TIME DOY NAQX NAQY NAQZ NAQF | 2001-03-13 00:00:00.000 072 10800.11 -6100.23 53381.51 54801.12 2001-03-13 00:01:00.000 072 10800.31 -6100.20 53381.51 54801.12 2001-03-13 00:02:00.000 072 10801.11 -6101.23 99999.00 54801.12 2001-03-13 00:03:00.000 072 10803.12 -6100.23 99999.00 54801.12 Standards (like IAGA-2002) allow us to exchange data. It’s no different with metadata. International metadata standards (like ISO19115) allow us to exchange metadata outside our community

Why do we need international standards Then we can do things like this (OneGeology) Without agreed standards for metadata, collaborations like OneGeology would not be possible. INTERMAGNET and IAGA have provided good standards, well accepted by the community, for data. We have not had any standards for metadata in our community.

What does our community want to record for the future? This is just a start – the bare minimum – but still complex to implement across the global community. A more ambitious goal would be to recreate digitally the level of metadata recorded in a yearbook.

Oversight of the work Funded by the European Horizon 2020 project EPOS (European Plate Observing System) Peer reviewed by the community Peer reviewed by database experts We have been using money provided by EPOS to fund this work. A community review panel was initiated at the previous IAGA workshop in 2016. Those signing onto this panel (around 30 people) were asked for their comments on the design of the metadata. These comments were worked into the design. BGS database experts reviewed the database design to ensure it meets the required standards for a relational database (data only represented in one place in the schema, suitable primary keys in each table, field naming conventions adhered to, suitable dictionaries in place, field contents validated against dictionaries where possible, …)

Implementation A set of tables in a relational database (About 30 tables and 15 dictionaries) Hosted on BGS ORACLE servers All text can be held in multiple languages (“internationalised”) All entities have validity dates The first step (completed in 2017) was to design a schema to hold the metadata. This slide describes how we implemented the database schema. Example validity dates: An instrument (e.g. flxugate) can be shown to be active at an observatory between two dates A person can be shown to be the contact for an observatory between two dates The location of an observatory can be shown to be valid between two dates

Implementation – Key entities Address Data repository Data set Institute Instrument INTERMAGNET GIN INTERMAGNET Membership Location Observatory Observatory Photo Person

Progress First version of the database schema designed and reviewed http://geomag.bgs.ac.uk/im_mdata/imag_reports/contacts/ {"Stewart_Duff": {"addresses": [{"lang": "en", "address1": "U.S. Geological Survey", "address2": "Box 25046 MS 966", "address3": "Denver Federal Center", "city": "Denver", "state": "Colorado 80225", "country": "United States of America"}], "name": "Duff Stewart", "tel": "1-303-273-8485", "fax": "1-303-273-8506", "institute": "USGS", "emails": ["dcstewart@usgs.gov"]}, "Hitchman_Adrian": {"addresses": [{"lang": "en", "address1": "Geomagnetism", "address2": "Geoscience Australia", "address3": "Cnr Jerrabomberra Av and Hindmarsh Dr", "city": "Symonston", "country": "Australia", "postcode": "ACT 2609"}], "name": "Adrian Hitchman", "tel": "61-2-6249 9111", "fax": "61-2-6249 9986", "institute": "GA", "emails": ["geomag@ga.gov.au"]}, "Shevtsov_Boris": {"addresses": [{"lang": "en", "address1": "Mirnaya str. 7", "address2": "Paratunka, Kamchatka reg, Elizovskiy dist", "country": "Russia", "postcode": "684034"}], "name": "Boris Shevtsov", "tel": "+7-41531-33193", "fax": "+7-41531-33718", "institute": "IKIR", "emails": ["ikir@ikir.ru"]}, "Schwarz_Gerhard": {"addresses": [{"lang": "en", "address1": "Geological Survey of Sweden", "address2": "Documentation of Geomagnetism", "address3": "Box 670", "city": "Uppsala", "country": "Sweden", "postcode": "75128"}], "name": "Gerhard Schwarz", "tel": "+46 18 179344", "fax": "+46 18 179210", "institute": "SGU", "emails": ["gerhard.schwarz@sgu.se"]}, "Crosthwaite_Peter": {"addresses": [{"lang": "en", "address1": "Geomagnetism", "address2": "Geoscience Australia", "address3": "GPO Box 378", "city": "Canberra", "country": "Australia", "postcode": "ACT 2601"}], "name": "Peter Crosthwaite", "tel": "61-2-6249 9321", First version of the database schema designed and reviewed Database implemented on BGS corporate servers Data made available on public facing servers First application nearly complete (real-time feed for INTERMAGNET website metadata) The example shown is the first application written that uses the metadata. This application will allow a simple web request to retrieve metadata needed by the INTERMAGNET web site: Contact details List of definitive data (CD/DVD) contributions List of institutes List of observatories, locations and instruments Currently only the contact details work, but this proves the concept: Web request received in Edinburgh DMZ Application sends request to ORACLE server in Keyworth DMZ Data from main ORACLE server in Keyworth SAN is forwarded to Keyworth DMZ This was complex to get working (technically and because of security policy). Having proved the concept we can quite quickly move forward with the remaining metadata reports needed by the INTERMAGNET web site and with other applications that will use the metadata. Database “views” allow us to create complex combinations of the individual tables that comprise the metadata database (the example shown uses several tables). There is a great deal of flexibility in the combinations of data that we can produce from the database.

Working securely Web service request DMZ DMZ Database Web server BGS Nottingham BGS Edinburgh Database

Future Plans Finish the work to supply INTERMAGNET website metadata Creation of Digital Object Identifiers for INTERMAGNET Filling of metadata from existing sources (hard work because much of the metadata is not structured): INTERMAGNET CD-ROMs and DVD World Data Centres The creation of digital object identifiers requires metadata to fully describe the dataset that the identifier identifies. For the INTERMAGNET DVDs this includes information such as the lists of observatories and institutes contributing to each CD/DVD. This is currently available in human readable forms (such as READNE files), but not in machine readable form (ie XML). A large part of the work in creating the DVDs will be to collect this metadata. Storing it in the metadata database makes it easily re-usable for other purposes if people have a need for it. The picture shows a README file from the INTERMAGNET DVD (Belsk, 2010). These files are now relatively well structured (although still requiring programming effort to read). Earlier files were less structured and more varied in their contents.