U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 2: Data Management Planning CC image by Joe Hall on Flickr.

Slides:



Advertisements
Similar presentations
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Advertisements

Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
Data Management Planning Kerry Miller Digital Curation Centre University of Edinburgh DIY Research Data Management Training Kit for.
NOAA Data Sharing Policy for Grants and Cooperative Agreements Ingrid Guch NOAA Environmental Data Management Committee.
Data Management Plans PAUL H. BERN, PH.D. APRIL 3, 2014.
Data Management What? Why? How?. 2 What do we mean by … Managing your Research (aka Data) … Ensuring physical integrity of files and helping to preserve.
NSF Data Management Plan Requirements Alex Kanous
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
Elements of a Data Management Plan Alison Boyer Environmental Sciences Division Oak Ridge National Laboratory.
Elements of a Data Management Plan
GRAD 521, Research Data Management Winter 2014 – Lecture 2 Amanda L. Whitmire, Asst. Professor.
Guidance on Preparing a Data Management Plan
DMPTool Expert Resources and Support for Data Management Planning Tao Zhang Michael Witt Purdue University Libraries 1.
+ Sarah Jones Digital Curation Centre Supporting researchers with Data Management Plans.
Africa RISING West Africa Mega Site M&E Activities Summary Africa RISING Project Steering Committee Meeting February 4, 2014; Bamako, Mali Beliyou Haile,
Data Management Overview for Instruction Librarians
Open for ^ Business Research Data Services & Data Management Planning Ryan Schryver Wendt Commons is our.
Data Management Plans Bill Michener University Libraries and Biology Dept. University of New Mexico.
U.S. Department of the Interior U.S. Geological Survey Planning for Data Management Creating data management plans for your project.
GRAD 521, Research Data Management Winter 2014 – Lecture 5 Amanda L. Whitmire, Asst. Professor.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
LTER Information Management Training Materials LTER Information Managers Committee Data Management Planning (adapted from DataONE training materials)
UVa Library Research Data Services
Data Management Planning
Data Management Planning Lesson 2: Data Management Planning CC image by Joe Hall on Flickr.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Data Archiving and Networked Services Introduction to Data Management Planning.
Data Management: Documentation & Metadata Sherry Lake, Senior Data Consultant Bill Corey, Data Consultant Jeremy Bartczak, Intellectual Access & Metadata.
Data Management 101 for Earth Scientists Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory.
Agency Requirements: NSF Data Management Plans Ruth Duerr National Snow and Ice Data Center Version 1.0 October 2012 Section: The Case for Data Stewardship.
University Libraries/ITS Content Stewardship Program Mairéad Martin, Sr. Director, ITS Digital Library Technologies Presentation to FACAC March 1, 2011.
Changing Implementation of NSF Data Policy Dr. Jennifer M. Schopf, NSF OD/OIA/EPSCoR On behalf of the NSF Data Working Group March 17, 2011 CASC Spring.
U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 3.1: How to Write Quality Metadata CC image by Sara Bjork on.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Data Management at the National Climate Change and Wildlife Science Center.
DMPTool and Data Management Basics Hannah Norton July 29, 2014 Image modified from :
Data Management Plans Module 2. Data Management Plans Data Management Plans  What is a data management plan (DMP)?  Why prepare a DMP?  Components.
Data Management & the Library. FACT #1 Research is increasingly digital and produces digital data.
Elements of a Data Management Plan Bill Michener University of New Mexico
Module 1 Overview Of Research Data Management Andrew Creamer, UMass Medical School Donna Kafel, UMass Medical School Elaine Martin, UMass Medical School.
Primer on Data Management Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory American Meteorological Society.
DOE Data Management Plan Requirements
Data Management Lesley A. Brown Director of Proposal Development.
11 Researcher practice in data management Margaret Henty.
Data Management Plans PAUL H. BERN, PH.D. APRIL 3, 2014.
Office of Science Statement on Digital Data Management Laura Biven, PhD Senior Science and Technology Advisor Office of the Deputy Director for Science.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Preserving your research data for future use This work is licensed under a Creative Commons Attribution 3.0 Unported License.Creative Commons Attribution.
Funded by GFBio – Education module Propose Lesson in Data Management Planning.
C OLLEGE OF A GRICULTURE D ATA C OHORT D ATA M ANAGEMENT P LANNING J ANUARY 27, 2014 Jake Carlson Associate Professor of Library Science / Data Services.
Using the DMPTool for data management plans Kathleen Fear February 27, 2014.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Writing a successful data management plan Kathleen Fear October 17, 2013.
Jeff Moon Data Librarian &
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
Lesson Topics What is a data management plan (DMP)? Why prepare a DMP?
Data Management What? Why? How?.
Data Management 101 for Earth Scientists Data Management Plans
Data Management 101 for Earth Scientists Data Management Plans
General Finnish DMP Guidance
Getting Started with Data Management
Open Access to your Research Papers and Data
Data Management Planning
Digital Stewardship Curriculum
Getting Started with Data Management & DMPTool
Research Data Dr Aoife Coffey, Research Data Coordinator
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 2: Data Management Planning CC image by Joe Hall on Flickr

Plan Provided by DataONE Lesson Topics  What is a data management plan (DMP)?  Why prepare a DMP?  Components of a DMP  NSF requirements for DMPs  Example of NSF DMP

Plan Provided by DataONE Learning Objectives  After completing this lesson, the participant will be able to:  Define a DMP  Understand the importance of preparing a DMP  Identify the key components of a DMP  Recognize the DMP elements required for an NSF proposal CC image by cybrarian77 on Flickr

Plan Provided by DataONE The Data Lifecycle

Plan Provided by DataONE Plan  Prior to any data acquisition or system development, the business process needs to be understood, and data needs clearly identified. Once the business data needs are determined, a system to store and manipulate the data can be identified and developed. Provided by Tom Chatfield, BLM

Plan Provided by DataONE Plan For Data Needs  Questions that need answers during planning:  Why collect this data? What mission or business requirement does the data address?  Does the data already exist and, if so, where? And in what format?  What data needs to be collected (needed versus “nice to have”) To meet all known business needs  What are the likely uses for this data? Business uses- is it going to need to be alpha-numeric or geospatial?  Are there other interested parties that could use this data for other business areas?  How will the data be collected- analog? digital? Spatially enabled?  What is the schedule and budget?  How will the data be checked and certified?  What are the required outputs for this data? Provided by Tom Chatfield, BLM

Plan Provided by DataONE Plan for Data Needs  In order to answer these questions, prior to any data acquisition the business process needs to be understood and the data needs clearly identified.  Collecting data just for the sake of collecting data is not cost effective. Provided by Tom Chatfield, BLM

Plan Provided by DataONE Plan for Data Needs  Does data already exist?  How will data be collected?  What is the schedule and budget for the collection?  How will the data be checked and certified?  What are all the likely uses for the data, who will use it, what kinds of outputs will be needed?  How do these impact the needs to store, access, and protect the data? Provided by Tom Chatfield, BLM

Plan Provided by DataONE What is a Data Management Plan?  Formal document  Outlines what you will do with your data during and after you complete your research  Ensures your data are safe for the present and the future From University of Virginia Library

Plan Provided by DataONE Why Prepare a DMP? (1)  Save time  Less reorganization later  Increase research efficiency  Ensures you and others will be able to understand and use data in the future CC image by Cathdew on Flickr

Plan Provided by DataONE Why Prepare a DMP? (2)  Easier to preserve your data  Prevents duplication of effort  Can lead to new, unanticipated discoveries  Increases visibility of research  Makes research and data more relevant  Funding agency requirement

Plan Provided by DataONE Components of a General DMP 1. Information about data & data format 2. Metadata content and format 3. Policies for access, sharing and re-use 4. Long-term storage and data management 5. Budget

Plan Provided by DataONE 1. Information About Data & Data Format 1.2 How data will be acquired  When?  Where? 1.3 How data will be processed  Software used  Algorithms  Workflows CC image by Ryan Sandridge on Flickr

Plan Provided by DataONE 1. Information About Data & Data Format 1.4 File formats  Justification  Naming conventions 1.5 Quality assurance & control during sample collection, analysis, and processing CC image by Artform Canada on Flickr

Plan Provided by DataONE 1. Information About Data & Data Format 1.6 Existing data  If existing data are used, what are their origins?  Will your data be combined with existing data?  What is the relationship between your data and existing data? 1.7 How data will be managed in short-term  Version control  Backing up  Security & protection  Who will be responsible

Plan Provided by DataONE 2. Metadata Content & Format Metadata defined:  Documentation and reporting of data  Contextual details: Critical information about the data set  Information important for using the data  Descriptions of temporal and spatial details, instruments, parameters, units, files, etc.

Plan Provided by DataONE 2. Metadata Content & Format 2.1 What metadata are needed  Any details that make data meaningful 2.2 How metadata will be created and/or captured  Lab notebooks? GPS units?  Auto-saved on instrument? 2.3 What format will be used for the metadata  Standards for community  Justification for format chosen

Plan Provided by DataONE 3. Policies for Access, Sharing, Reuse 3.1 Obligations for sharing  Funding agency  Institution  Other organization  Legal 3.2 Details of data sharing  How long?  When?  How access can be gained?  Data collector rights 3.3 Ethical/privacy issues with data sharing CC image by Jim Sher on Flickr

Plan Provided by DataONE 3. Policies for Access, Sharing, Reuse 3.4 Intellectual property & copyright issues  Who owns the copyright?  Institutional policies  Funding agency policies  Embargos for political/commercial reasons 3.5 Intended future uses/users for data 3.6 Citation  How should data be cited when used?  Persistent citation?

Plan Provided by DataONE 4. Long-term Storage & Data Management 4.1 What data will be preserved 4.2 Where will it be archived  Most appropriate archive for data  Community standards 4.3 Data transformations/formats needed  Consider archive policies 4.4 Who will be responsible  Contact person for archive

Plan Provided by DataONE 5. Budget 5.1 Anticipated costs  Time for data preparation & documentation  Hardware/software for data preparation & documentation  Personnel  Archive costs 5.2 How costs will be paid CC image by Adria Richards on Flickr

Plan Provided by DataONE Tools for Creating Data Management Plans dmp.cdlib.org dmponline.dcc.ac.uk

Plan Provided by DataONE NSF DMP Requirements From Grant Proposal Guidelines: Plans for data management and sharing of the products of research. Proposals must include a supplementary document of no more than two pages labeled “Data Management Plan”. This supplement should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results (in AAG), and may include: 1. the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project 2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies) 3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements 4. policies and provisions for re-use, re-distribution, and the production of derivatives 5. plans for archiving data, samples, and other research products, and for preservation of access to them

Plan Provided by DataONE NSF DMP Requirements Summarized from Award & Administration Guide: 4. Dissemination and Sharing of Research Results a) Promptly publish with appropriate authorship b) Share data, samples, physical collections, and supporting materials with others, within a reasonable timeframe c) Share software and inventions d) Investigators can keep their legal rights over their intellectual property, but they still have to make their results, data, and collections available to others e) Policies will be implemented via f) Proposal review g) Award negotiations and conditions h) Support/incentives

Plan Provided by DataONE Data in Real Life: A DMP Example Project name: Effects of temperature and salinity on population growth of the estuarine copepod, Eurytemora affinis Project participants and affiliations: Carly Strasser (University of Alberta and Dalhousie University) Mark Lewis (University of Alberta) Claudio DiBacco (Dalhousie University and Bedford Institute of Oceanography) Funding agency: CAISN (Canadian Aquatic Invasive Species Network) Description of project aims and purpose: We will rear populations of E. affinis in the laboratory at three temperatures and three salinities (9 treatments total). We will document the population from hatching to death, noting the proportion of individuals in each stage over time. The data collected will be used to parameterize population models of E. affinis. We will build a model of population growth as a function of temperature and salinity. This will be useful for studies of invasive copepod populations in the Northeast Pacific. Video Source: Plankton Copepods. Video. Encyclopædia Britannica Online. Web. 13 Jun Photo by C. Strasser; all rights reserved

Plan Provided by DataONE 1. Information about data Every two days, we will subsample E. affinis populations growing at our treatment conditions. We will use a microscope to identify the stage and sex of the subsampled individuals. We will document the information first in a laboratory notebook, then copy the data into an Excel spreadsheet. For quality control, values will be entered separately by two different people to ensure accuracy. The Excel spreadsheet will be saved as a comma- separated value (.csv) file daily and backed up to a server. After all data are collected, the Excel spreadsheet will be saved as a.csv file and imported into the program R for statistical analysis. Strasser will be responsible for all data management during and after data collection. Our short-term data storage plan, which will be used during the experiment, will be to save copies of 1) the.txt metadata file and 2) the Excel spreadsheet as.csv files to an external drive, and to take the external drive off site nightly. We will use the Subversion version control system to update our data and metadata files daily on the University of Alberta Mathematics Department server. We will also have the laboratory notebook as a hard copy backup. Data in Real Life: A DMP Example

Plan Provided by DataONE 2. Metadata format & content We will first document our metadata by taking careful notes in the laboratory notebook that refer to specific data files and describe all columns, units, abbreviations, and missing value identifiers. These notes will be transcribed into a.txt document that will be stored with the data file. After all of the data are collected, we will then use EML (Ecological Metadata Language) to digitize our metadata. EML is on of the accepted formats used in Ecology, and works well for the type of data we will be producing. We will create these metadata using Morpho software, available through the Knowledge Network for Biocomplexity (KNB). The documentation and metadata will describe the data files and the context of the measurements. Data in Real Life: A DMP Example

Plan Provided by DataONE 3. Policies for access, sharing & reuse We are required to share our data with the CAISN network after all data have been collected and metadata have been generated. This should be no more than 6 months after the experiments are completed. In order to gain access to CAISN data, interested parties must contact the CAISN data manager or the authors and explain their intended use. Data requests will be approved by the authors after review of the proposed use. The authors will retain rights to the data until the resulting publication is produced, within two years of data production. After publication (or after two years, whichever is first), the authors will open data to public use. After publication, we will submit our data to the KNB, allowing discovery and use by the wider scientific community. Interested parties will be able to download the data directly from KNB without contacting the authors, but will still be required to give credit to the authors for the data used by citing a KNB accession number either in the publication text or in the references list. Data in Real Life: A DMP Example

Plan Provided by DataONE 4. Long-term storage and data management The data set will be submitted to KNB for long-term preservation and storage. The authors will submit metadata in EML format along with the data to facilitate its reuse. Strasser will be responsible for updating metadata and data author contact information in the KNB. 5. Budget A tablet computer will be used for data collection in the field, which will cost approximately $500. Data documentation and preparation for reuse and storage will require approximately one month of salary for one technician. The technician will be responsible for data entry, quality control and assurance, and metadata generation. These costs are included in the budget in lines Data in Real Life: A DMP Example

Plan Provided by DataONE Summary DMPs are an important part of the data lifecycle. They save time and effort in the long run, and ensure that data are relevant and useful for others. Funding agencies are beginning to require DMPs Major components of a DMP: 1. Information about data & data format 2. Metadata content and format 3. Policies for access, sharing and re-use 4. Long-term storage and data management 5. Budget

Plan Provided by DataONE Resources 1. University of Virginia Library 2. DataONE education modules Accessed August, 2012 at 3. Digital Curation Centre management-plans 4. Chatfield, T., Selbach, R. February, Data Management for Data Stewards. Data Management Training Workshop. Bureau of Land Management (BLM). 5. University of Michigan Library publishing-support/nsf-data-management- plans#directorate_guide

Plan Provided by DataONE Resources 6. NSF Grant Proposal Guidelines 2.jsp#dmp 7. Inter-University Consortium for Political and Social Research 8. DataONE