Persistent Identifiers Implementation in EOSDIS


Persistent Identifiers Implementation in EOSDIS
H. K. “Rama” Ramapriyan
Science Systems and Applications, Inc. & ESDIS Project, NASA Goddard Space Flight Center
ESIP Summer Meeting, July 19-22, 2016

Acknowledgments
Thanks to Nate James (ESDIS Project) and Amanda Leon (NSIDC DAAC) for their comments on a draft of this presentation.
H. K. Ramapriyan's work was supported by NASA’s contract NNG15HQ01C with SSAI.

Topics
Ideal State
Preservation Challenge
Preservation Content
DOI Implementation

Ideal State
Traceability of everything related to a dataset, so that any question a user may raise can be answered, e.g.:
What were the inputs?
How was the dataset generated (software, algorithm, computer, operating system, etc.)?
Who were the authors of the algorithm?
What instrument(s) did the data come from?
What satellite did it (they) fly on?
Who funded the development?
What is the quality of the data?
What are the limitations?
What can the dataset be used for?
What publications have used the dataset?
Unambiguous references are important for scientific understanding and reproducibility.
Long-term assumption: the authors of datasets and related items will not be available to answer questions.
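The traceability questions above can be read as queries over links between identified objects. The following is only a minimal sketch of that idea, not an EOSDIS implementation: the identifiers (`dataset:L2-product`, `instrument:sensor-a`, and so on) and the relation names (`derived_from`, `generated_by`, `acquired_by`, `flown_on`) are hypothetical and chosen purely for illustration.

```python
from collections import deque

# Hypothetical identifiers and relation names, for illustration only.
LINKS = {
    "dataset:L2-product": {
        "derived_from": ["dataset:L1-product", "dataset:ancillary-met"],
        "generated_by": ["software:pge-v3"],
    },
    "dataset:L1-product": {
        "derived_from": ["dataset:L0-data"],
        "generated_by": ["software:l1-processor"],
    },
    "dataset:L0-data": {"acquired_by": ["instrument:sensor-a"]},
    "instrument:sensor-a": {"flown_on": ["platform:sat-1"]},
}


def inputs_of(dataset, links=LINKS):
    """Answer 'What were the inputs?' for one dataset (direct inputs only)."""
    return links.get(dataset, {}).get("derived_from", [])


def trace(start, links=LINKS):
    """Breadth-first walk: every (source, relation, target) reachable from start."""
    seen = {start}
    queue = deque([start])
    edges = []
    while queue:
        node = queue.popleft()
        for relation, targets in links.get(node, {}).items():
            for target in targets:
                edges.append((node, relation, target))
                if target not in seen:  # visit each object once
                    seen.add(target)
                    queue.append(target)
    return edges
```

Calling `trace("dataset:L2-product")` walks from a product back through its inputs, software, instrument, and platform. The walk only works if every object along the way has an unambiguous identifier, which is the point the slide makes.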

Preservation Challenge
[Diagram: flow of preservation content among the entities that hold it. Entities shown: Instrument Developer/Manufacturer; data gathering project (e.g., flight project); Instrument Teams / PIs; Calibration Team; Validation Team; Mission Operations; Product Generation Support Teams (SIPSs); ancillary data sources (e.g., NOAA); DAACs. Content shown: Preflight/Pre-Operations; Mission logs; Mission Data Calibration; Level 0 Data; Science Data Products; Science Data Product Documentation; Science Data Product Software; Science Data Product Algorithm Input; Science Data Product Validation; Science Data Software Tools. Other labels: Major production data; Other artifacts.]
Different entities hold preservation content during the life of a project, but it needs to be gathered for long-term preservation. Some items are part of the regular flow during the active operational phase; others need extra effort to collect. As items are collected, they need to be identified thoroughly and unambiguously, and appropriate pointers need to be established to help with traceability.

Preservation Content Categories
Preflight/Pre-Operations: Instrument/sensor characteristics, including pre-flight/pre-operations performance measurements; calibration method; radiometric and spectral response; noise characteristics; detector offsets
Science Data Products: Raw instrument data, Level 0 through Level 4 data products and associated metadata
Science Data Product Documentation: Structure and format with definitions of all parameters and metadata fields; algorithm theoretical basis; processing history and product version history; quality assessment information
Mission Data Calibration: Instrument/sensor calibration method (in operation) and data; calibration software used to generate lookup tables; instrument and platform events and maneuvers
Science Data Product Software: Product generation software and software documentation
Science Data Product Algorithm Input: Any ancillary data or other data sets used in generation or calibration of the data or derived product; ancillary data description and documentation
Science Data Product Validation: Records, publications and data sets
Science Data Software Tools: Product access (reader) tools
Checklist: “Metadata” about the above 8 categories, showing how and where items in each category are preserved
All these items are called for and explained in NASA’s Preservation Content Specification, which is being used as a requirement for new missions. It serves as a checklist for gathering artifacts and archiving them, either at a DAAC or at another long-term archive, with appropriate pointers. The artifacts are archived to permit traceability, but at present there is no systematic way of assigning formal persistent identifiers to all of them.
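As an illustration of the checklist idea, the sketch below records where items in each of the eight categories are preserved and reports which categories still have nothing recorded. It is an assumption-laden sketch, not the format used by the Preservation Content Specification: the `Artifact` and `Checklist` classes, their field names, and the example archive and pointer values are all hypothetical.

```python
from dataclasses import dataclass, field

# The eight categories named on the slide.
CATEGORIES = (
    "Preflight/Pre-Operations",
    "Science Data Products",
    "Science Data Product Documentation",
    "Mission Data Calibration",
    "Science Data Product Software",
    "Science Data Product Algorithm Input",
    "Science Data Product Validation",
    "Science Data Software Tools",
)


@dataclass
class Artifact:
    """One preserved item (hypothetical fields, for illustration)."""
    name: str
    archive: str  # where the item is preserved, e.g. a DAAC
    pointer: str  # URL or other locator for the preserved item


@dataclass
class Checklist:
    """'Metadata' about how and where items in each category are preserved."""
    items: dict = field(default_factory=lambda: {c: [] for c in CATEGORIES})

    def add(self, category: str, artifact: Artifact) -> None:
        if category not in self.items:
            raise KeyError(f"unknown category: {category}")
        self.items[category].append(artifact)

    def missing(self) -> list:
        """Categories with no preserved artifact recorded yet."""
        return [c for c in CATEGORIES if not self.items[c]]
```

For example, after `checklist.add("Science Data Products", Artifact("Level 0 data", "example archive", "https://example.org/level0"))`, a call to `checklist.missing()` lists the seven categories that still need attention.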

DOI for datasets - Implementation
Duerr et al. (2011), “On the utility of identification schemes for digital Earth science data: an assessment and recommendations”, DOI: 10.1007/s12145-011-0083-6
ESDIS Project started implementing DOIs in 2011 for datasets held in EOSDIS; the goal is to assign DOIs to all datasets
ESDIS is the DOI issuing authority for datasets at most DAACs (prefix 10.5067)
Exceptions are ORNL and SEDAC, which preceded ESDIS in DOI implementation
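A DOI name consists of a prefix and a suffix separated by a slash, so the ESDIS prefix 10.5067 can be checked mechanically. The sketch below splits a DOI, tests for the ESDIS prefix, and builds a resolver link through https://doi.org/. The function names are hypothetical, and the ESDIS-style DOI shown in the usage note is a made-up suffix, not a real dataset identifier.

```python
from urllib.parse import quote

ESDIS_PREFIX = "10.5067"  # prefix given on the slide for ESDIS-issued DOIs

# Common ways a DOI is written in text; stripped before parsing.
_LEADS = ("https://doi.org/", "http://doi.org/", "doi:")


def split_doi(doi: str) -> tuple:
    """Split a DOI name into (prefix, suffix) at the first slash."""
    name = doi.strip()
    for lead in _LEADS:
        if name.lower().startswith(lead):
            name = name[len(lead):].strip()
            break
    prefix, sep, suffix = name.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI name: {doi!r}")
    return prefix, suffix


def is_esdis_issued(doi: str) -> bool:
    """True if the DOI carries the ESDIS prefix."""
    prefix, _ = split_doi(doi)
    return prefix == ESDIS_PREFIX


def resolver_url(doi: str) -> str:
    """Build a link that resolves the DOI through doi.org."""
    prefix, suffix = split_doi(doi)
    # Percent-encode characters that are unsafe in a URL path.
    return f"https://doi.org/{prefix}/{quote(suffix, safe='/')}"
```

With the citation from the slide, `resolver_url("DOI: 10.1007/s12145-011-0083-6")` gives `https://doi.org/10.1007/s12145-011-0083-6`, and `is_esdis_issued` returns `False` for it because the prefix is 10.1007. A made-up name such as `10.5067/EXAMPLE/DATASET.001` would return `True`.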

DOI for datasets - Implementation
Step 1: Visit the ESDIS DOI wiki website (https://wiki.earthdata.nasa.gov/display/DOIsforEOSDIS/)
Step 2: Download the DOI Submission Form (https://wiki.earthdata.nasa.gov/display/DOIsforEOSDIS/DOI+Submission+Form/)
Step 3: Fill in the New/Update Form (check examples on the wiki)
Step 4: Submit the form to ESDIS (ESDIS Contact Team)
Step 5: Review of DOI information
Step 6: Process DOI information
Step 7: Reserve/Register/Update DOI information
Step 8: Post information on the ESDIS wiki website
Other diagram labels: DAACs*; ESDIS DOI Team; Communicate Information to the Provider
*A few requests for DOI assignments come from non-DAAC entities
Credit: Lalit Wanchoo, ADNet/ESDIS

DOI for Datasets - Status
Number of datasets in EOSDIS: ~10,000

Earth Science Data System Working Groups Related to Identifiers
Under the umbrella of NASA’s ESDSWG, identifiers and citations have been themes of WGs since 2012:
Data Stewardship WG
DOI WG
Citations and Identifiers WG
Topics considered:
Past: DOI Syntax, Assignment Process, Landing Page Contents, DOI Field Formatting, Citations Reformatting, Citation Policy
Current: Identifiers for Non-Dataset Objects (focus on software right now)

Citations and Acknowledgements
Open Data and the Importance of Data Citations: The NASA EOSDIS Perspective
Data Citations and Acknowledgements
ASDC - https://eosweb.larc.nasa.gov/citing-asdc-data
ASF DAAC - https://www.asf.alaska.edu/about/how-to-cite-data/
CDDIS - http://cddis.gsfc.nasa.gov/About/Citing_our_data.html
GES DAAC - http://daac.gsfc.nasa.gov/additional/citing-our-data/data_citation.shtml
GHRC DAAC - https://ghrc.nsstc.nasa.gov/home/about-ghrc/citing-ghrc-daac-data
LAADS - http://modaps.nascom.nasa.gov/services/faq/LAADS_Data-Use_Citation_Policies.pdf
LP DAAC - https://lpdaac.usgs.gov/citing_our_data
NSIDC DAAC - http://nsidc.org/data/citations.html
OB.DAAC - http://oceancolor.gsfc.nasa.gov/cms/citations
ORNL DAAC - http://daac.ornl.gov/citation_policy.html
PO.DAAC - http://podaac.jpl.nasa.gov/CitingPODAAC
SEDAC - http://sedac.ciesin.columbia.edu/citations

Challenges
It takes time to implement DOIs for all datasets
Identifiers for non-dataset objects are still being considered
Provenance implementation (establishing links among different objects) is still at an experimental stage