NASA Earth Science Data and Information System (ESDIS) Project Data Preservation Activities – Update Andrew Mitchell (NASA Goddard Space Flight Center)

Slides:



Advertisements
Similar presentations
Future Directions and Initiatives in the Use of Remote Sensing for Water Quality.
Advertisements

Product Quality and Documentation – Recent Developments H. K. Ramapriyan Assistant Project Manager ESDIS Project, Code 423, NASA GFSC
NASA Earth Science Data Preservation Content Specification H. K. (Rama) Ramapriyan John Moses 10 th ESDSWG Meeting – November 2, 2011 Newport News, VA.
Metrics Planning Group (MPG) Report to Plenary Clyde Brown ESDSWG Nov 3, 2011.
Provenance and Context Content Standard (Emerging) – Status of Activities H. K. Ramapriyan Assistant Project Manager ESDIS Project, Code 423, NASA GFSC.
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
 Explore methods to connect Mission developers to standards experts. Active feedback to new missions to encourage them to use the existing standards.
Data Stewardship Interest Group WGISS-39 Meeting
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
Provenance for Earth Science (PROV-ES) Started as an ESDSWG 2013 Working Group – Developed early OPM/PROV approaches from ACCESS 2009 project An interoperable.
05 December, 2002HDF & HDF-EOS Workshop VI1 SEEDS Standards Process Richard Ullman SEEDS Standards Formulation Team Lead
Emerging Provenance/Context Content Standard Discussion at Data Stewardship Committee Session at ESIP Federation Meeting January 5, 2012 H. K. “Rama” Ramapriyan.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
CEOS Disaster Risk Management Implementation Phase Status Ivan Petiteville (ESA) on behalf of CEOS DRM Team CEOS SIT-28 Meeting Hampton, Virginia, USA.
1561 INITIATIVE Lessons Learned and Future Considerations
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
MPARWG Business & Disposition of Action Items from MPARWG October 2009 H. K. (Rama) Ramapriyan NASA/GSFC Metrics Planning and Reporting (MPAR) WG 9 th.
WGISS Working Group on Information Systems and Services Richard MORENO CNES WGISS report, Agenda Item 14 Tromsø, Norway October 2014.
The ISO TC211 Standard Project 19130: Sensor and Data Models for Imagery and Gridded Data Liping Di George Mason University Fairfax, VA, USA
Preservation of NASA’s Earth Observation Data EOSDIS Science Operations, ESDIS Project Code 423 Goddard Space Flight Center, Greenbelt, MD ESIP Federation.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
NASA Earth Science Data and Information System (ESDIS) Project Preservation Activities – Software & Documentation H. K. “Rama” Ramapriyan Science Systems.
WGISS Richard Moreno CNES SIT Workshop Agenda Item #7 WGISS working group CEOS SIT Technical Workshop EUMETSAT, Darmstadt, Germany 17 th -18 th September.
NASA Perspectives on Data Quality July Overall Goal To answer the common user question, “Which product is better for me?”
Jianchun Qin, Liguang Wu, Michael Theobald, A. K. Sharma, George Serafino, Sunmi Cho, Carrie Phelps NASA Goddard Space Flight Center, Code 902 Greenbelt,
1 U.S. Department of the Interior U.S. Geological Survey LP DAAC Stacie Doman Bennett, LP DAAC Scientist Dave Meyer, LP DAAC Project Scientist.
PoDAG XXI: SEEDS SEED: NSIDC Potential Interactions NSIDC DAAC should prepare an evaluation of their desired future roles in "core activities" and in mission.
Report to Plenary H. K. (Rama) Ramapriyan NASA/GSFC Clyde Brown SSAI - NASA/LaRC Metrics Planning and Reporting (MPAR) WG 9 th Earth Science Data Systems.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
1 NSIDC DAAC Product Workshop Overview Martha Maiden Program Executive for Data Systems NASA Headquarters NSIDC DAAC Product Workshop January 11-12, 2006.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Provenance and Context Content Standard (Emerging) – Status of Activities H. K. Ramapriyan Science Systems and Applications, Inc. & ESDIS Project, Code.
Improving Information Quality for Earth Science Data and Products – An Overview H. K. (Rama) Ramapriyan Science Systems and Applications, Inc. & NASA Goddard.
ECS Metadata Considerations for Preservation SiriJodha S. Khalsa National Snow and Ice Data Center.
EO Dataset Preservation Workflow Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
More Information Working Group Composition End Users Data Modelers Data Analysts Airborne Measurement Scientists Airborne Instrument Scientists Data Management.
Data Systems Integration Committee of the Earth Science Data System Working Group (ESDSWG) on Data Quality Robert R. Downs 1 Yaxing Wei 2, and David F.
Provenance and Context Content Standard (Emerging) – Status of Activities H. K. Ramapriyan Science Systems and Applications, Inc. & ESDIS Project, Code.
Information Quality Cluster - Introduction H. K. (Rama) Ramapriyan Science Systems and Applications, Inc. & NASA Goddard Space Flight Center David Moroni.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
ESO and the CMR Life Cycle Process Winter ESIP, Jan 2015 ESDIS Standards Office (ESO) Yonsook Enloe Allan Doyle Helen Conover.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
PDS4 Project Report PDS MC F2F UCLA Dan Crichton November 28,
PDS4 Project Report PDS MC F2F University of Maryland Dan Crichton March 27,
1 Current Plans for Long Term Archiving of MODIS Data Martha Maiden Program Executive Earth Science Data Systems NASA Headquarters MODIS Meeting November.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
A Shared Commitment to Digital Preservation and Access.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Committee on Earth Observation Satellites John Bates, NOAA Plenary Agenda Item 8 29 th CEOS Plenary Kyoto International Conference Center Kyoto, Japan.
The National Digital Stewardship Alliance: Community, Content, Commitment.
NASA Earth Science Data Stewardship
A Liaison Report from ISO TC211 to CEOS WGISS-44
Persistent Identifiers Implementation in EOSDIS
NASA’s EOSDIS – Long Term Archive Infrastructure and Processes
AGU Paper Number: IN43B-1697 Evolving a NASA Digital Object Identifiers System with Community Engagement Lalit Wanchoo1 and Nathan.
Active Data Management in Space 20m DG
WGISS-WGCV Joint Session
Data Stewardship Interest Group WGISS-45 Meeting
Presented to the CEOS WGISS October 22, 2018
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
Bird of Feather Session
CEOS Working Group on Climate (WGClimate)
Presented to the CEOS WGISS October 10, 2019
Presentation transcript:

NASA Earth Science Data and Information System (ESDIS) Project Data Preservation Activities – Update Andrew Mitchell (NASA Goddard Space Flight Center) and H. K. “Rama” Ramapriyan (Science Systems and Applications, Inc. & NASA GSFC) WGISS Data Stewardship Interest Group Meeting May 2015

Topics NASA Earth Science Data Preservation Content Specification – application to missions NASA Earth Science Data System Working Groups – 5 WG’s in Data Stewardship Interest Area

Preservation NASA not a “preservation agency”, but… – it is essential for NASA to preserve all the data and associated content beyond the lives of NASA’s missions to meet NASA’s near- term objective of providing access to data and services for active scientific research. Also NASA has to ensure that the data and associated content are preserved for transition to permanent archival agencies. Preservation involves ensuring long-term protection of:  Bits  Discoverability and accessibility  Readability  Understandability  Usability  Reproducibility of results

Preservation Content Specification (PCS) Has been in effect since November 2011; latest version dated January 2013 Covers eight categories of content plus a checklist (see next page) Of necessity, rigor of application varies among completed, on-going and future missions

Preservation Content Categories 1.Preflight/Pre-Operations: Instrument/Sensor characteristics including pre-flight/pre- operations performance measurements; calibration method; radiometric and spectral response; noise characteristics; detector offsets 2.Science Data Products: Raw instrument data, Level 0 through Level 4 data products and associated metadata 3.Science Data Product Documentation: Structure and format with definitions of all parameters and metadata fields; algorithm theoretical basis; processing history and product version history; quality assessment information 4.Mission Data Calibration: Instrument/sensor calibration method (in operation) and data; calibration software used to generate lookup tables; instrument and platform events and maneuvers 5.Science Data Product Software: Product generation software and software documentation 6.Science Data Product Algorithm Input: Any ancillary data or other data sets used in generation or calibration of the data or derived product; ancillary data description and documentation 7.Science Data Product Validation: Records, publications and data sets 8.Science Data Software Tools: product access (reader) tools. 9.Checklist: “metadata” about the above 8 categories showing how and where items in each category are preserved

Use of PCS in NASA to-date Distributed Active Archive Centers (DAACs) work with instrument teams, with higher priority to instruments at or near end-of-life – Using PCS as checklist – UARS (Sept. 1991), Earth Probe/TOMS (July 1996), AIRS, AMSR-E (EOS Aqua – May 2002), ICESat-1 (Jan. 2003), HIRDLS, MLS (EOS Aura – July 2004), LIS (TRMM – Nov. 1997) – Artifacts called for in PCS have been gathered for several of the above, organized by categories and archived (e.g., see documents) documents New missions are required to plan to preserve and deliver to DAACs items listed in PCS – Included as a “Level 1” requirement for new missions since 2102 – SMAP mission (launched Jan. 2015) has started preparing list of ancillary data and documentation to be preserved

Software Missions are required to deliver product generation software (source code) Purpose of preservation of software is primarily for users to understand exactly how products were generated – Algorithm Theoretical Basis Documents are generally not a precise description – PCS states “The final version of a derived product should be the version archived. If results reported in peer reviewed publications were based on earlier versions of the product, those versions or at least representative subsets of those versions should also be archived. At a minimum, the algorithm and software that generated such earlier versions should be archived.” – “Versions of science data product software should be archived for each major product release. A major product release is characterized by the appearance of peer reviewed publications where reported results are based on the product version.” It is not expected that “heritage software” will necessarily be executable; it may take significant effort to regenerate products from preserved software

Documentation PCS calls for several types of documentation covering project/data life cycles DAACs archive and maintain checklists of specific documentation delivered by instrument teams and flight projects Goddard DAAC uses Fedora Commons, an open-source repository management system – Simple web-based Graphical User Interface (GUI). – Allows entry of objects or data-streams (these can be of any type document, image, source code, binary data, etc.) – The DAAC has developed a command line script to allow batch ingest of objects into the Fedora Repository. Public access documents are kept separate from restricted (sensitive or proprietary) documents Heritage missions require extensive work for gathering and processing documents for preservation

Standard for Preservation Content NASA would like to see a broad international standard identifying preservation content – NASA’s PCS is a good starting point  NASA has drafted a TC 211 New Work Item Proposal (NWIP) for this ISO/TC 211 has recently approved a NWIP for ISO  “Geographic Information - Preservation of digital data and metadata” initiated by Prof. Wolfgang Kresse, Chair of International Society for Photogrammetry and Remote Sensing (ISPRS) Ad-hoc Group on Standards Some overlap in interests between NASA’s draft NWIP and ISO  ISO mainly driven by the interests of National Mapping and Cadastral Agencies (vector data) Options  Include content similar to NASA’s PCS as a part of ISO  Wait for ISO to be completed and initiate an extension (say ) H. K. “Rama” Ramapriyan has been named an expert for participation in the group on ISO NASA made presentation about this at OGC meeting (Barcelona, March 2015); recommendation from OGC was that ISO and OGC should work together 9

ESDSWG – Data Stewardship Interest Area – Working Groups (April 2014 – March 2015) (1 of 3) Working Groups (April 2014 – March 2015) – Data Preservation Practices WG Mission: Collaborate with stakeholders to define and document an archive process, spanning all types of projects, that can be used to encourage the timely delivery of science data products and related documentation, as defined in the PCS document Key Accomplishments: “Data Preservation Guidelines” document is in final draft form – provides relationship between different project lifecycles and archive lifecycle and recommendations on when various artifacts should be collected for archival – Data Quality WG Mission: Assess the existing data quality standards and practices in the inter- agency and international arena to determine a working solution relevant to ESDIS, DAACs, and NASA Data Providers Key Accomplishments: Analyzed 16 use cases through four subgroups focused on: 1. Accuracy, Precision and Uncertainty; 2. Distinguishability, 3. Applicability and 4. Usability. Formulated over 90 recommendations. Integrated recommendation document in progress

ESDSWG – Data Stewardship Interest Area – Working Groups (April 2014 – March 2015) (2 of 3) Dataset Interoperability WG – Mission: Identify best practices to bridge or reduce gaps between NASA- stewarded data and data from outside NASA, and to ensure NASA data discoverability, maintainability and extensibility using CF, ISO, and Attribute Conventions for Data Discovery (ACDD) conventions – Key Accomplishments: 1. Seven recommendations for Grid Structures in Earth science datasets; 2. Continued improvement of metadata compliance checking; 3. Continued engagement with CF community to exploit group hierarchies Digital Object Identifiers WG – Mission: Develop a method to promote consistency, discoverability, and usefulness across NASA DOI landing pages – Key Accomplishments: Developed list of minimal metadata elements needed to meet the needs of a DOI landing page, reviewed with all DAACs and made final recommendation to ESDIS Project. Made recommendations for improvements in ESDIS Project’s DOI registration process

ESDSWG – Data Stewardship Interest Area – Working Groups (April 2014 – March 2015) (3 of 3) Working Groups (April 2014 – March 2015) – PROV-ES WG Mission: assess and determine an interoperable provenance standard for use in Earth Science Data Systems to enable the following: – Ensure capturing the increasing amount of contextual processing information of Earth Science Data Records (ESDRs). – Improve the understanding of the lineage and dependencies of ESDRs. – Provide an interoperable representation of provenance for NASA EOS missions that adheres to the NASA Preservation Information Architecture. Key Accomplishments: – Defined extensions to W3C PROV to accommodate Earth science- specific processes (during ) – Infused Automatic PROV-ES generation into initial NASA data systems – Implemented faceted search interface to explore PROV-ES records