CC&E Best Data Management Practices, April 19, 2015
Please take the Workshop Survey: https://www.surveymonkey.com/s/update

Presentation transcript:


Data Management Practices for Early Career Scientists: Closing
Robert Cook
ORNL Distributed Active Archive Center
Environmental Sciences Division
Oak Ridge National Laboratory, Oak Ridge, TN
CC&E Joint Science Workshop, College Park, MD
April 19, 2015

Plan for archiving data
“Begin with the end in mind”
- Identified the Data Center
- Collaborated with data center during project
- Communicated:
  – Volume and Number of Files
  – Special needs
  – Delivery dates

Followed Fundamental Data Practices
- Define the contents of your data files
- Define the variables
- Use consistent data organization
- Use stable file formats
- Assign descriptive file names
- Preserve processing information
- Perform basic quality assurance
- Provide documentation
- Protect your data
- Preserve your data
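As an illustration of "Perform basic quality assurance", the following is a minimal sketch of a range and fill-value check on one column of a CSV file. The fill value of -9999, the column name `canopy_height_m`, and the valid range are assumptions made for this example; they are not taken from the workshop material, so substitute whatever your own data set documents.

```python
import csv
import io

FILL_VALUE = -9999.0  # assumed fill value; use the one your data set documents


def basic_qa(csv_text, column, valid_min, valid_max):
    """Count fill values, out-of-range values, and non-numeric entries in one column."""
    report = {"rows": 0, "fill": 0, "out_of_range": 0, "not_numeric": 0}
    for row in csv.DictReader(io.StringIO(csv_text)):
        report["rows"] += 1
        try:
            value = float(row[column])
        except (KeyError, TypeError, ValueError):
            # Missing column, short row, or text such as "n/a"
            report["not_numeric"] += 1
            continue
        if value == FILL_VALUE:
            report["fill"] += 1
        elif not (valid_min <= value <= valid_max):
            report["out_of_range"] += 1
    return report


# Hypothetical sample data for illustration only
sample = """site,date,canopy_height_m
A1,2014-06-01,12.4
A2,2014-06-01,-9999
A3,2014-06-02,250.0
A4,2014-06-02,n/a
"""

print(basic_qa(sample, "canopy_height_m", 0.0, 120.0))
```

A report like this does not fix anything by itself; it only points to rows worth inspecting before the files go to the data center.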

What to submit to the archive?
- Well-structured data files, with variables, units, and fill values well-defined
- Document that describes the data set
- Additional information
  – Article written with the data set
  – Files that describe project, protocols, or field sites (photographs)
  – Material from Project Web site or Wiki
- Basic description of the data (15 questions)

Issues with data sets received
- Descriptive information about data files and content is incomplete
  – Data description and collection method
  – Field sites
  – Quality / uncertainty of data
- Inconsistencies with publication
- Files uploaded are not identified / described
- Variable names are not defined or vague
  – “Height” unclear, change to “canopy_height”
  – Perhaps append the method/sensor for added clarity
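The point about vague variable names can be checked before submission. The sketch below flags header names in a CSV file that are empty or too generic to interpret on their own. The list of vague names is hypothetical and chosen only for illustration; the slide itself gives just the “Height” example.

```python
import csv
import io

# Hypothetical list of names that are too generic on their own
VAGUE_NAMES = {"height", "temp", "value", "data", "depth", "flux"}


def flag_vague_columns(csv_text):
    """Return header names that are empty or match the vague-name list."""
    header = next(csv.reader(io.StringIO(csv_text)))
    return [name for name in header
            if not name.strip() or name.strip().lower() in VAGUE_NAMES]


# "Height" is flagged; "canopy_height_lidar" names both the quantity and the method
print(flag_vague_columns("site,date,Height,canopy_height_lidar\n"))
```

A flagged name is a prompt to rename the column and define it in the documentation, not proof that the data are wrong.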

Information about Data (15 questions)

Information About Your Data Set
1. Have you looked at our Best Data Management Practices?
2. Who produced this data set?
3. What agency and program funded the project?
   What awards funded this project? (comma separate multiple awards)

Data Set Description
4. Provide a title for your data set. (maximum 84 characters)
   What type of data does your data set contain?
   What does the data set describe? (2-3 sentences)
5. What parameters did you measure, derive, or generate? (comma separated, limit to ten)
6. Have you analyzed the uncertainty in your data?
   Briefly describe your uncertainty analysis. (2-3 sentences)
   Will the uncertainty estimates be included with your data set?
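Two of the questions above carry explicit limits: the title (maximum 84 characters) and the parameter list (comma separated, limit of ten). A minimal sketch of checking draft answers against those limits follows. The function name and the example answers are hypothetical; only the two limits come from the questionnaire.

```python
MAX_TITLE_CHARS = 84   # from question 4
MAX_PARAMETERS = 10    # from question 5


def check_description(title, parameters):
    """Return a list of problems found in the title and parameter answers."""
    problems = []
    if not title.strip():
        problems.append("title is empty")
    elif len(title) > MAX_TITLE_CHARS:
        problems.append(f"title has {len(title)} characters (limit {MAX_TITLE_CHARS})")

    # Split the comma-separated answer and ignore empty entries
    names = [p.strip() for p in parameters.split(",") if p.strip()]
    if not names:
        problems.append("no parameters listed")
    elif len(names) > MAX_PARAMETERS:
        problems.append(f"{len(names)} parameters listed (limit {MAX_PARAMETERS})")
    return problems


# Hypothetical answers for illustration only
print(check_description("Canopy Height Measurements, Example Site, 2014",
                        "canopy_height, leaf_area_index"))
```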

Information about Data (cont)

Temporal and Spatial Characteristics
7. What date range does the data cover? (YYYY-MM-DD)
   What is a representative sampling frequency or temporal resolution for your data?
8. Where were the data collected/generated?
9. Which of the following best describes the spatial nature of your data? (single point, multiple points, transect, grid, polygon, n/a)
10. What is a representative spatial resolution for these data?
11. Provide a bounding box around your data.

Data Preparation and Delivery
12. What are the formats of your data files?
    How many data files does your product contain?
    What is the total disk volume of your data set? (MB)
13. Is this data set final, unrestricted, and available for release?
    What are the reasons to restrict access to the data set?
14. Has this data set been described and used in a published paper?
    If so, provide a DOI or upload a digital copy of the manuscript with the data set.
15. Are the data and documentation posted on a public server?
    If so, provide the URL.
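Questions 7, 11, and 12 ask for values that can be derived from the data rather than typed by hand. The sketch below shows one way to do that, under stated assumptions: dates are YYYY-MM-DD strings, coordinates are (longitude, latitude) pairs in decimal degrees that do not cross the antimeridian, and one MB is taken as 10**6 bytes. All function names are hypothetical.

```python
import re
from datetime import date
from pathlib import Path


def parse_date_range(start, end):
    """Parse two YYYY-MM-DD strings and check that start is not after end."""
    for text in (start, end):
        if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", text):
            raise ValueError(f"not in YYYY-MM-DD form: {text!r}")
    first, last = date.fromisoformat(start), date.fromisoformat(end)
    if first > last:
        raise ValueError("start date is after end date")
    return first, last


def bounding_box(points):
    """Return (west, south, east, north) for (lon, lat) pairs in decimal degrees."""
    lons = [lon for lon, _ in points]
    lats = [lat for _, lat in points]
    return min(lons), min(lats), max(lons), max(lats)


def inventory(directory):
    """Count files, list their extensions, and total their size in MB."""
    files = [p for p in Path(directory).rglob("*") if p.is_file()]
    total_mb = sum(p.stat().st_size for p in files) / 1e6
    formats = sorted({p.suffix.lower() or "(none)" for p in files})
    return {"files": len(files), "formats": formats, "total_mb": total_mb}


# Hypothetical sampling locations for illustration only
print(bounding_box([(-84.3, 35.9), (-84.1, 36.0), (-84.2, 35.8)]))
print(parse_date_range("2014-06-01", "2014-09-30"))
```

Calling `inventory` on the folder you plan to deliver gives the file count, formats, and volume to report to the data center.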

Data Center: Stewardship and Archive Functions
- Ingest
  – perform QA checks
  – compile project-provided metadata
  – generate additional metadata
  – convert to archival file formats
- Metadata / Documentation
  – prepare final metadata record and documentation
- Archive / Release
  – generate citation and DOI (digital object identifier)
- Exploration and Distribution
  – provide tools to explore, access, and extract data
- Post-Project Data Support
  – provide long-term secure archiving
  – serve as a buffer between end users and PIs
  – provide usage statistics
- Stewardship
  – security, disaster recovery
  – migration to new computer systems

Workshop Goal
Provide fundamental data management practices that investigators should perform during the course of data collection.

To improve the usability of data sets for:
- You
- Collaborators
- People outside your project

By following the practices taught in this workshop, your data will be less prone to error, more efficiently structured for analysis, and more readily understandable for any future research.

Please take the Workshop Survey

Workshop Sponsors