Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.

Slides:



Advertisements
Similar presentations
Metadata and the UK Data Archive CESSDA Expert Seminar Odense September 2008 Margaret Ward Lenin Ageer.
Advertisements

A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
XML DOCUMENTS AND DATABASES
Disseminating Service Registry Records Ann Apps MIMAS, The University of Manchester, UK.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
1 An introduction to the NSDL William Y. Arms Cornell University.
ELearning Frameworks: What, Why and Who, Where & When Daniel Rehak, Learning Systems Architecture Lab, USA Kerry Blinco, Dept. Education Science and Training,
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Strategy Directorate Web Services Technologies Diane McDonald, Strathclyde University Institutional Web Managers.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
OBAA STANDARD Where are we? Tiago Primo GIA – Grupo de Pesquisa em Inteligência Artificial UFRGS.
Astrogrid Resource Registry Querying the Registry 1.Mullard Space Science Laboratory, University College London, Holmbury St. Mary, Dorking, Surrey RH5.
Using IESR Ann Apps MIMAS, The University of Manchester, UK.
Enabling E Research ANU Data Commons. What is it ? Building a repository for data sets o data can be deposited o updated o published to Research Data.
PEER (Public End-Entity Registry) (MLS -> SPIT -> BEER -> PEER)
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Materials Science Registry Will propose RDA Materials Science WG Define minimum/modest metadata extensions to Dublin Core to enable resource discovery.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
CLARIN Metadata Infrastructure Component Metadata and intermediate solutions Daan Broeder Claus Zinn Dieter van Uytvanck - Max-Planck Institute for Psycholinguistics.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
1 Hans Pfeiffenberger, Ana Macario, Alfred Wegener Institut, Helmholtz Association OAI4 CERN Text, Data and People – How to Represent Earth.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Marshall Breeding Director for Innovative Technology and Research Vanderbilt University
AUKEGGS Architecturally Significant Issues (that we need to solve)
Roy Tennant Life After MARC A Metadata Infrastructure for the 21st Century.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
1 Meeting on the Management of Statistical Information Systems (MSIS 2010) SDMX architecture for data sharing and interoperability Francesco Rizzo, ISTAT,
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
JISC Information Environment Service Registry (IESR) Ann Apps MIMAS, The University of Manchester, UK.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Research Grants and Projects Discovery Service ANDS Webinar 12th August 2015 Monica Omodei, ANDS.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
UCL DEPARTMENT OF SPACE AND CLIMATE PHYSICS MULLARD SPACE SCIENCE LABORATORY Taverna Plugin VAMDC and HELIO (part of the ‘taverna-astronomy’ edition) Kevin.
A centre of expertise in digital information management Shaping the e-future? Grids, Web Services and Digital Libraries Professor Tony.
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools SDMX.
Discussion of Data Fabric Terms & Preparation for RDA P7 Virtual Meeting Monday, January 25, 2016 Organized by Gary Berg-Cross (DFT-IG) and Peter Wittenburg.
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
IESR, A Registry of Collections and Services: Using the DCMI Collection Description Profile in Practice Ann Apps MIMAS, The University of Manchester, UK.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
Dynamic/Deferred Document Sharing (D3S) Profile for 2010 presented to the IT Infrastructure Technical Committee Karen Witting February 1, 2010.
International Planetary Data Alliance Registry Project Update September 16, 2011.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
JOINT SESSION IG Domain Repositories, IG Agriculture Data, WG BioSharing Registry, IG Materials Data, WG Wheat Data RDA P6, Paris,
NIST Office of Data and Informatics (ODI) of the Material Measurement Laboratory Robert Hanisch, director Ray Plante, interoperability expert ODI has responsibility.
Software & Technologies: an overview
Loading Records Through the Registry’s REST Interface
WP3 – SA1 Service activities in support of deployment of IVOA protocols and standards Christophe ARVISET, ESA.
The Re3gistry software and the INSPIRE Registry
SDMX: A brief introduction
An ecosystem of contributions
1/18/2019 Transforming the Way the DoD Manages Data Implementing the Net Centric Data Strategy using Communities of Interest Introduction
2/15/2019 Transforming the Way the DoD Manages Data Implementing the Net Centric Data Strategy using Communities of Interest Introduction
Disseminating Service Registry Records
JISC Information Environment Service Registry (IESR)
IVOA Interoperability Meeting - Boston
Metadata supported full-text search in a web archive
Presentation transcript:

Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate on concepts

Creating & Curating Records Descriptions of an NMI’s data assets will be stored in a registry An NMI will be able to create and update their own records Can operate own registry or use a remote one NIST can provide a registry application, or NMI can create or adapt their own e.g. to connect their local infrastructure Options will be described tomorrow 4/14/16 NMI Registry Workshop BIPM, Paris 2

Collecting Records for Searching Propose using OAI-PMH as the protocol for exchanging metadata Community standard Widely used (including in the Virtual Observatory) Well supported by open software Searchable Registry Wants to collect records for all data resources from all NMIs Uses OAI-PMH to pull the records from the NMIs Provide a means to search Web page GUI Scriptable (REST) interface 4/14/16 NMI Registry Workshop BIPM, Paris 3

NMI Registry Federation 4/14/16 NMI Registry Workshop BIPM, Paris 4 Publishing Registry Portal Dataset Full Searchable Registry Dataset Database Dataset Data Repository Portal Database Publishing Registry Dataset Portal Database Dataset Full Searchable Registry harvest (pull) manual entry (push) NMI Registry Of Registries harvest (pull) harvest (pull)

Record Format We will eventually decide the record encoding format Leading choices: XML, JSON, JSON-LD Choice is not critical At NIST, we have been developing conventions for defining schemas in all forms with mechanisms to convert between them as needed. Is there an opportunity to leverage local infrastructure, tools by picking a particular format? Today, we want to concentrate on… What kinds of data resources we want to discover What concepts are needed to describe them Which concepts are important for discovering resources through a query What information we need in order to access and use them 4/14/16 NMI Registry Workshop BIPM, Paris 5

Defining a Schema Schema = the set and organization of the terms (representing concepts) that we will use to describe our data resources Schema framework = the techniques and patterns we use to define our schema Key requirement: Extensibility Allows us to evolve schema with extensions add new terms as needed Don’t need to solve the entire metadata problem today (or ever)! Introducing extensions must not break existing systems Successful strategy for extension demonstrated in the Virtual Observatory 4/14/16 NMI Registry Workshop BIPM, Paris 6

What do we want to find? Each record will describe something that we want to be able to discover Datasets Standard Reference Data, Reference Data, Data associated with publications, … Databases Portals and web sites Other tools and services Our Member Institutes Participating Registries Our discussion of sample queries will help tease this out 4/14/16 NMI Registry Workshop BIPM, Paris 7

Different types of resources Resource = something we want to find We expect to have a set of metadata attributes that are common to all resources We can add additional metadata to describe specific kinds of resources 4/14/16 NMI Registry Workshop BIPM, Paris 8 A model being developed for the materials science community

Kinds of metadata Identity -- how we recognize it Curation -- who is responsible Content -- what it is about Access -- how to get at it Applicability -- how it applies to different domains Examples: Physics, Chemistry, Biology, Materials Science Can have multiple entries, each containing metadata specific to a different domain 4/14/16 NMI Registry Workshop BIPM, Paris 9

Strategy Collaborate on a demonstration NMIs can participate at whatever level they are able Refining the metadata schema: conceptually or technically Software implementations Leverage on-going registry development at NIST What do we want to find and how Sample queries Issues 4/14/16 NMI Registry Workshop BIPM, Paris 10