SOFTWARE ARCHIVE WORKING GROUP (SAWG) REPORT TODD KING PDS MANAGEMENT COUNCIL MEETING FEB. 4-5, 2016.

Slides:



Advertisements
Similar presentations
Current State of Play in Digital Preservation Peter B. Hirtle Cornell University Library Society of American Archivists.
Advertisements

Research Data Access and Preservation Summit Panel 2 - Promoting Re-Use of Scientific Collections Some responses to the questions posed... John Harrison.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
Planning and management of digital projects. Overview Assessing the feasibility to Assessing the feasibility to become digital Building selection criteria.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
CHORUS Implementation Webinar May 16, 2014 Mark Martin Assistant Director, Office of Scientific and Technical Information Office of Science U.S. Department.
PDS MC April 2-3, Wash. D. C.1 PDS4 Data Model Working Group Status Report to the PDS Management Council April 2-3, 2008.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Placing Movies in the PDS Archive Engineering Node Pasadena, California Nov 30, 2006
PDS4 vs. Level 1-3 Requirements Mitch Gordon PDSMC, UCLA August 27, 2014.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
INTER-UNIVERSITY CONSORTIUM FOR POLITICAL AND SOCIAL RESEARCH Social Science Data and Resources for Researchers 1 DIGITAL PRESERVATION: MAINTAINING THE.
Welcome!. Looking at Student Notebooks, Goals: Deepen understanding of the nature and purpose of science notebook entry types Deepen understanding of.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
Corporation For National Research Initiatives NSF SMETE Library Building the SMETE Library: Getting Started William Y. Arms.
Data Management: Documentation & Metadata Types of Documentation.
Long-term Archive Service Requirements draft-ietf-ltans-reqs-00.txt.
Employers’ Expectation for Entry-Level Catalog Librarians: What Position Announcement Data Indicate.
Software Construction and Evolution - CSSE 375 Software Documentation 1 Shawn & Steve Right – For programmers, it’s a cultural perspective. He’d feel almost.
March 2010 PDS Imaging Node 1 NASA PDS Imaging Node: NASA PDS Imaging Node: Digital Data Archives and Distribution Archiving and distributing data and.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Elements of a Data Management Plan Alison Boyer Environmental Sciences Division Oak Ridge National Laboratory.
Management, marketing and population of repositories Morag Greig, University of Glasgow.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
The importance of DART for funding agencies Dr. Ingrid Kissling-Näf.
1 LSST dark energy science collaboration meeting Penn June 11-13, 2012 LSST dark energy science collaboration meeting Penn June 2012 Governance Document.
Archiving 40+ years of Planetary Mission Data - Lessons Learned and Recommendations K. E. Simmons LASP, University of Colorado, Boulder, CO
Elements of a Data Management Plan: Identifying the materials to be created Ruth Duerr National Snow and Ice Data Center Version Review Date Section:
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Data Sharing and Communication Across Partnerships GROUP TECHNICAL ASSISTANCE WEBINAR SEPTEMBER 21, CFPHE.
VO Sandpit, November 2009 Environmental Data Archival: Practices and Benefits crib sheet Graham Parton With many thanks to Dr.
Scientific Record Keeping Alan L. Goldin, M.D./Ph.D.
CCSM DATA MANGEMENT POLICY The Community Climate System Model (CCSM) Data Management Policy documents the procedures for the management of model data produced.
United States Department of Agriculture Food Safety and Inspection Service 1 National Advisory Committee on Meat and Poultry Inspection August 8-9, 2007.
Update from the Data Integrity & Tracking WG Management Council F2F UCLA Los Angles, CA August 13-14, 2007
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
1 - A View from the Field - The Next Generation Data Standards For the PDS - PDS4 - ESIP Federation Meeting July 8, 2009 J. Steven Hughes JPL Copyright.
PDS Geosciences Node Page 1 Archiving Mars Mission Data Sets with the Planetary Data System Report to MEPAG Edward A. Guinness Dept. of Earth and Planetary.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Is the project funded by the EPSRC? University policy covering “significant” research data will still apply Will you publish results based on this data?
Introduction ESDS Qualidata John Southall ESDS Creating and delivering re-usable qualitative data 24 June 2004.
Label Design Tool Management Council F2F Washington, D.C. November 29-30, 2006
Data Standards and Build 3b Plans Steve Hughes MC Face-to-Face UCLA, Los Angeles, CA November 28-29, 2012.
SSC SI Data Processing Pipeline Plans Tom Stephens USRA Information Systems Development Manager SSSC Meeting – Sept 29, 2009.
U.S. Department of the Interior U.S. Geological Survey U.S. Geological Survey Scientific Records Appraisal Tool Presented at the Ensuring the Long-Term.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
ESA UNCLASSIFIED – For Official Use Data Stewardship Interest Group WGISS-40 Meeting Preservation of SW & Documents at CEOS Agencies Approaches and Lessons.
PDS Geosciences Node Page 1 Archiving LCROSS Ground Observation Data in the Planetary Data System Edward Guinness and Susan Slavney PDS Geosciences Node.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
DOE Data Management Plan Requirements
Portico’s “d-collections” preservation service Stephanie Orphan Positive trends in sustainability? Emerging approaches to archiving commercial databases.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Planetary Data System (PDS) Tom Morgan November 24, 2014.
Tools Report Engineering Node March 2007
Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.
C OLLEGE OF A GRICULTURE D ATA C OHORT D ATA M ANAGEMENT P LANNING J ANUARY 27, 2014 Jake Carlson Associate Professor of Library Science / Data Services.
Technology Services – National Institute of Standards and Technology Conformity Assessment ANSI-HSSP Workshop Emergency Communications December 2, 2004.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
SIMULATION AND MODELING WORKING GROUP REPORT PDS MANAGEMENT MEETING – 2016 FEB 4-5.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
NASA Earth Science Data Stewardship
Active Data Management in Space 20m DG
Introduction to Research Data Management
Helping Active Missions Convert to PDS4
Research data lifecycle²
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

SOFTWARE ARCHIVE WORKING GROUP (SAWG) REPORT TODD KING PDS MANAGEMENT COUNCIL MEETING FEB. 4-5, 2016

A LITTLE HISTORY Management did consider archiving software in The results were: MC voted that PDS has no requirement to archive software. Minutes from the meeting indicate that the intent was to postpone those branches of the PDS 2010 development related to software archiving. PDS M/C Meeting Feb

FORMATION OF SAWG The SAWG was formed to take a new look at software archiving and what it means to PDS. Working group kick-off: Jan. 20, 2016 (telecon) “Lots of enthusiastic ideas flying around.” (T. Stein) PDS M/C Meeting Feb

AND DIFFERENT PERSPECTIVES Some concerns About potential cost. How to create an effective software archive. Provenance Reproducibility Impact on other PDS tasks PDS M/C Meeting Feb

PRELIMINARY QUESTIONS 1. What is the driver for archiving software in PDS? 2. What does “archiving software” mean? 3. What support commitment do we expect PDS to give? PDS M/C Meeting Feb

WHAT IS THE DRIVER? Publically funded software exists and should be preserved. “there are some wonderful software tools out there that many people have dedicated a huge portion of their lives to. There is a strong desire to have that work preserved. The work cost the public lots of money and no one wants to see their work lost.” (Eric Palmer) NASA and OMB consider some software to be research data. PDS M/C Meeting Feb

OMB AND NASA VIEW SOME SOFTWARE AS DATA According to OMB Circular a-100 (revise 11/19/93) Research data is defined as the recorded factual material commonly accepted in the scientific community as necessary to validate research findings, but not any of the following: preliminary analyses, drafts of scientific papers, plans for future research, peer reviews, or communications with colleagues. This "recorded" material excludes physical objects (e.g., laboratory samples). According to NASA Plan: Increasing Access to the Results of Scientific Research (November 21, 2014) Data are understood to include not only the recorded technical information, but also metadata (describing the data), descriptions of the software required to read and use the data, associated software documentation, and associated data (e.g. calibrations) NASA PDART 2015 Announcement - Requires preserving software in NASA’s Github. PDS M/C Meeting Feb

DOES THIS CHANGE THE REQUIREMENTS ON PDS? NASA and PDART have set new expectations. Depends on how we define “data” Does "data" in "Planetary Data System" refer only to "observational data" or to all "research data" (per the OMB and NASA definition)? Or is software a form of documentation? Let’s look at existing requirements PDS M/C Meeting Feb

PDS4 LEVEL 1-3 REQUIREMENTS 1.4 Archiving Standards: PDS will have archiving standards for planetary science data. There are also support requirements that are software related: PDS will provide a capability for opening and inspecting the contents (e.g.label, objects, groups) of any PDS compliant archival product PDS will provide tools for translating archival products between selected formats PDS will provide tools for translating archival products between selected coordinate systems PDS will provide tools for visualizing selected archival products PDS will define and maintain a set of usability requirements to ensure on-going utility of the data in the archive. PDS M/C Meeting Feb

HOWEVER…. There is a large variety of possible software i.e., processing, visualization, conversion and analysis Early PDS volumes included software: imaging 15 out of 90 datasets. Half is decompression source and/or executables. A few have display software and a few have processing software. It is all pretty ad-hoc. PPI 269 out of 779 volumes Mixture of source code, references to “software” volumes, references to web sites, applications. How might this be handled in the PDS4 world? PDS M/C Meeting Feb

THE ABILITY TO DESCRIBE A SOFTWARE PRODUCT DOES EXIST IN PDS4 PDS M/C Meeting Feb PDS4 has Product_Software

… BUT ITS USE IS UNCLEAR There is no Software collection type. What types of software products do policies allow? The “PDS4 Data Formats” policy could be interpreted to mean only source code (flat* UTF-8 text) is allowed. Software_Binary doesn’t fit without changes in the allowed supplemental formats. PDS M/C Meeting Feb

Too much is left to interpretation PDS M/C Meeting Feb

DISCUSSION What is the PDS position on software today? and how do we align with NASA’s 2014 policy We need to state that position as a formal policy. If software is allowed we need to: Add collection type of “software” Define what must be in a software collection Define allowable binary software “formats” Address reproducibility requirements PDS M/C Meeting Feb

BACKUP MATERIAL PDS M/C Meeting Feb

WHAT DOES “ARCHIVING SOFTWARE” MEAN? “Assuming that the context is the PDS, then "archiving software" could mean that software is classified and archived as provenance information for science digital objects.” (Steve Hughes) Should the provenance scope be 1. Describe how the digital object (software) was produced. (We have Product_Software in PDS4 that can do this) 2. Software as documentation for science digital objects (data) 3. Sufficient information for the digital object (software) to be reproduced (run and used). Reproducibility is a big question. PDS M/C Meeting Feb