Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical.

Slides:



Advertisements
Similar presentations
Archive Requirements Working Group A NOAA clearinghouse for requirement planning in support of the science objectives related to archive, access, and reprocessing.
Advertisements

NASA Earth Science Data Preservation Content Specification H. K. (Rama) Ramapriyan John Moses 10 th ESDSWG Meeting – November 2, 2011 Newport News, VA.
PREMIS Implementation Fair San Francisco, CA, October Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Preservation Strategies: What do long-term archives do with my data? Jeff Arnfield NOAA’s National Climatic Data Center Version 1.0 Review Date.
Agency Requirements: NASA Data Management Plans Ronald Weaver National Snow and Ice Data Center W. Christopher Lenhardt Renaissance Computing Institute.
"Keeping alert: issues to know today for long-term digital preservation with repositories" Neil Beagrie Fedora Users Group Open Repositories Southampton.
Promoting Digital Preservation Partnerships at the U.S. Library of Congress April 2004.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
Data Management Plans Bill Michener University Libraries and Biology Dept. University of New Mexico.
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
World Data Center for Human Interactions in the Environment Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date Section: Data Management Plans.
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
Advertising your data: Using data portals and metadata registries Nancy Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright.
Elements of a Data Management Plan: Identifying the materials to be created Ruth Duerr National Snow and Ice Data Center Version Review Date Section:
Preservation Strategies: Sponsor or Institutional Requirements Ronald Weaver National Snow and Ice Data Center Version 1.0 Review Date.
ESIP 2009 Summer Meeting, UC Santa Barbara, CA, July 7 – 10, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich InfoAnalytics.
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management ServicesSALT DCAPE.
November 2004 NDIIPP: Future Directions and Relevance to Other Countries Beth Dulabahn Office of Strategic Initiatives Library of Congress November 7,
ESIP Federation Air Quality Cluster Partner Agencies.
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
NOAA Administrative Order : Management of Environmental and Geospatial Data and Information Jeff Arnfield NOAA’s National Climatic Data Center Version.
Life Cycle Models & Principles Jake Carlson Associate Professor of Library Science Data Services Specialist Purdue University Libraries.
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Copyright 2012 Matthew Mayernik. Version 1.0 October 2012 Section:
1 NOAA Use of the Open Archival Information System Reference Model (OAIS-RM) Ken McDonald NOAA NESDIS ESIP Federation Meeting July 9, 2009.
Introduction GeoData 2014 Workshop #geodata2014 June 17-19, 2014,NCAR, Boulder, CO Peter Fox (RPI)
Small steps and lasting impact: making a start with preservation or It’s not all NASA Patricia Sleeman Digital Archives and Repositories University of.
National Geospatial Digital Archive Greg Janée University of California at Santa Barbara.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Data Matthew Mayernik National Center for Atmospheric Research Version.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Why Create a Data Management Plan? Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
Fedora and the Preservation of University Electronic Records Project NHPRC Electronic Records Research Grant Kevin L. Glick Manuscripts and Archives, Yale.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
ESIP Data Management Training (DMT) Survey & Clearinghouse Report ESIP Winter 2016 Wednesday, 2016 January 6 ESIP Data Stewardship Committee Nancy Hoebelheinrich.
Data Management Plans: Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
8 January 2016 ESIP Winter Meeting
June 21, 2011EDMC Workshop in Silver Spring, MDDan Kowal Submission Agreements: The role they play in supporting the relationship between the Data Producer/Provider.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
The OAIS model SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Nancy J. Hoebelheinrich, Metadata Coordinator, Stanford University 1 Metadata for the NGDA: Developing a Shared Approach Joint UCSB / Stanford meeting.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Section: The Case for Data Stewardship.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
NASA Earth Science Data Stewardship
Ingest and Dissemination with DAITSS
Developing Criteria to Establish Trusted Digital Repositories
The Case for Data Management: Agency Requirements
Persistent Identifiers Implementation in EOSDIS
Agency Requirements: NOAA Administrative Order Management of environmental and geospatial data and information This training module is part of.
Identifiers Answer Questions
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
The Case for Data Management: Agency Requirements
Presentation transcript:

Preservation Strategies: Framing The Approach Nancy Hoebelheinrich Knowledge Motifs LLC Data Management Workshop American Geophysical Union San Francisco, CA Tuesday, December 6, 2011

Overview Preservation strategies to pursue once the argument for data stewardship & data preservation is won Background of previous issues & discussions re: data stewardship & data management Provides a framework of questions that a scientist can answer to facilitate the preservation of his/her data for the long term

Relevance to Data Management Why is this important???? As a metaphorical example, consider the following situation:

Relevance to Data Management Documentation for My Latest Research Project To Data Manager: Don’t worry, the connections are all there [– in my head!]

Relevance to Data Management Documentation for My Latest Research Project To Data Manager: See, here’s the primary algorithm I used…

Relevance to Data Management Documentation for My Latest Research Project To Data Manager: Here’s the schedule we used to gather the data – although some months it was a little different…

Relevance to Data Management Documentation for My Latest Research Project To Data Manager: Oh, and here’s the team – our PI wasn’t available for the photo, so we put a placeholder for him – see the guy with the mustache below on the stick? And the project manager – she’s the one with the long ears…what was her name?

Relevance to Data Management So, what’s the Data Manager gonna do with all this stuff?? Ensure long term integrity & viability of your data incl. Various levels of processed data / data products, if desired Metadata (MD) you have (in your head or in documentation) Context & Provenance – “audit” trail of sources, processing, products By ingesting, identifying, storing, locating & providing access, if desired, to all of the above Deploy preservation strategies such as: Assigning checksums and/or identifiers to each “item” of a data set Migrating to non-proprietary and/or new formats over time Migrating to new storage media over time Refreshing the data over time

How can I (the scientist) help? Besides me, who’s going to care? Sponsor mandates to archive Specific requirements from sponsor e.g., NASA, NOAA, USGS Data archive requirements & desirements Negotiated & documented in Submission Information Package (OAIS SIP) Future scientists who want to use/re-use your data!! What kind of data should be kept? Formulae for decisionmaking, e.g., NOAA National Climatic Data Center’s Climate Data Record Maturity Matrix; factors include software readiness, existence / state of metadata & (other) documentation, utility of data, validity of product (based on certainty estimates), desire for / restrictions upon public access Documentation of specific disciplinary requirements, e.g., CDRs from Satellite Passive Microwave Sounders Allowing for serendipity & cyclical nature of scientific data Framework Questions:

Example Data Maturity Index

How can I (the scientist) help? Key Framework Question for future scientists who want to use/re-use my data: what will they need to know? (= MD that I probably know best) Documentation including restrictions on access & use Assumptions, hypotheses, algorithms about data (who, what, when, where, why & how) = “provenance & context” Sequence of time, date, technical details of data creation / acquisition and relationships among data units or how to figure out = “preservation MD” Key people, roles & their organizations = “citation MD”

What if I don’t have an existing archive for my data? Some disciplines may not have a data center or archive set up for them – what resources are available? Institutions with experience: governmental agencies (UK Data Centers, UK Digital Curation Center, in US: NASA, NOAA, USGS, NARA, Research Libraries, national & international libraries, archives and data centers Comprehensive information resources about preservation and archiving, e.g., CIESIN’s Geospatial Clearinghouse, at 9gzJWYlQJJ690! US Library of Congress, etc., and Duraspace, at 9gzJWYlQJJ690! US Library of Congress, etc. DataOne – NSF funded consortium, focused on preservation and access to multi-scale, multi-discipline, and multi-national science datahttps:// DataConservancy – an NSF funded consortium focused upon scientific data curation is a means to collect, organize, validate and preserve data,

References and Resources NASA Earth Science Data Preservation Content Specification (Nov 2011), NASA, 2011: Metadata Requirements – Base Reference for NASA Earth Science Data Products, (Nov 2011), Requirements_V1_ _0.pdf Requirements_V1_ _0.pdf Preliminary Principles and Guidelines for Archiving Environmental and Geospatial Data at NOAA: Interim Report, Archiving Strategy for USGS EROS Center & Our Future Direction, March 29, 2010, Example disciplinary requirements: NOAA Workshop on Climate Data Records from Satellite Passive Microwave Sounders Report.pdf Report.pdf NOAA NCDC Climate Data Record ( CDR) Maturity Matrix, ESIP Data Stewardship & Preservation Cluster, wiki found at

Other Relevant Modules The case for data stewardship Managing your data Creating documentation and metadata Working with your archive organization