Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Data Provenance and Attribution for Published Datasets The Challenge and the reality check April 9-10, 2009 National Academy of Sciences, Woods Hole, MA.
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
Colour of Ocean Data – panel discussion conclusions1 Colour of Ocean Data The Palais des Congrès, Brussels, Belgium November 2002 Summary of the.
Complexity must become Linear or Decrease Smart data infrastructure: The sixth generation of mediation for data science Peter Fox 1
NSF Data Management Plan Requirements Alex Kanous
Course Materials Overview.  Module 1: History and Organization of the Industry  Module 2: Safety  Module 3: Electric Power Generation  Module 4: Electric.
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
Klassificering af Inf. Systemer Baseret på: Luis M. Camarinha-Matos & Hamideh Afsarmanesh: Collaborative networks: a new scientific discipline.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
Research Data Management Philip Tarrant Global Institute of Sustainability.
U.S. Department of the Interior U.S. Geological Survey USGS Data Management Training Modules: Value of Data Management “Data is a precious thing and will.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
DR. AHMAD SHAHRUL NIZAM ISHA
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Metadata Guides for Smarties Marine Metadata Initiative URL:
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
CC 2007, 2011 atribution - R.B. Allen Scholarship, Science, Data, and Domain Informatics.
Designing the Microbial Research Commons: An International Symposium Overview National Academy of Sciences Washington, DC October 8-9, 2009 Cathy H. Wu.
The Natural Inquirer Science Education Journal & Climate Change Education.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
UVa Library Research Data Services
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
Managing the Impacts of Programmatic Scale and Enhancing Incentives for Data Archiving A Presentation for “International Workshop on Strategies for Preservation.
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
The Case for Data Management Ruth Duerr National Snow and Ice Data Center.
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Copyright 2012 Matthew Mayernik. Version 1.0 October 2012 Section:
Judith E. Skog Biological Sciences Directorate Emerging Frontiers Division H. Richard Lane Geological Sciences Directorate Earth Systems Science.
Introduction GeoData 2014 Workshop #geodata2014 June 17-19, 2014,NCAR, Boulder, CO Peter Fox (RPI)
VERTIGO data OCB database status update Cyndy Chandler Ocean Carbon and Biogeochemistry Data Management Office Cyndy Chandler Ocean Carbon and Biogeochemistry.
The Case for Data Stewardship: Enhancing Your Reputation Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review Date]
Reconstituting the Ocean: a tale from U.S. JGOFS Cyndy Chandler (MCG, WHOI) U.S. JGOFS Data Management Office and Ocean Carbon and Biogeochemistry Coordination.
Biological and Chemical Oceanography Data Management Office slide 1 of 19 CAMEO Data Management Bob Groman Biological and Chemical Oceanography Data Management.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Marine Metadata Interoperability - Web Services Marine scientists face an opportunity and a challenge in the volume of data available from various ocean.
DRAFT EDMC Procedural Directives NOAA Environmental Data Management Committee 12/3/2015 1
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Advertising your data: Agency requirements for submitting metadata Nancy J. Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright.
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
CSD 5100 Introduction to Research Methods in CSD Where To Begin?? Selecting the Research Problem Identification of a topic Framing a research problem Research.
5 July 2012Ganesha Associates1 Basic Skills for Scientific Research and Publishing. Segment 1. Introduction to the course.
Challenges of Coping with Funding and Data Management in a Changing World Rick Lyons Director Infectious Disease Research Center.
1Mobile Computing Systems © 2001 Carnegie Mellon University Writing a Successful NSF Proposal November 4, 2003 Website: nsf.gov.
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Marine Metadata Interoperability Acknowledgements Ongoing funding for this project is provided by the National Science Foundation.
NATIONAL TREASURES DATA PRESERVATION WITH METADATA Sharon Shin Metadata Coordinator Federal Geographic Data Committee Secretariat ASPRS-Reno 2006.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
DOE Data Management Plan Requirements
Data Management Lesley A. Brown Director of Proposal Development.
The Case for Data Stewardship: Enhancing Your Reputation Matthew Mayernik National Center for Atmospheric Research Version 1.0 September 2012 Section:
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
SOFTWARE ARCHIVE WORKING GROUP (SAWG) REPORT TODD KING PDS MANAGEMENT COUNCIL MEETING FEB. 4-5, 2016.
Sociology. Sociology is a science because it uses the same techniques as other sciences Explaining social phenomena is what sociological theory is all.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
Agency Requirements: NOAA Administrative Order Management of environmental and geospatial data and information This training module is part of.
Data Management: Documentation & Metadata
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
The Data Management Plan (DMP) and your NSF proposal
Presentation transcript:

Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological and Chemical Oceanography Data Management Office 12 November 2009 Ocean Acidification Short Course Woods Hole, MA USA

slide 2 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Discussion Topics Part 1 of 2: Introduction Why data management matters New funding agency requirements New research paradigms New expectations for data access Part 2: data management specifics

slide 3 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Why data management matters good data management practices have always been integral to the scientific method 1949 – recording BT CTD

slide 4 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office It’s important to science careful and deliberate record keeping results reported and made publicly available enabling reproducibility of results from the pre-course survey 57% of students reported having ‘minimal experience’ with “Metadata production and data archiving” Why data management matters

slide 5 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Some definitions … what do I mean by … Data Management end-to-end data management proposal to preservation having a plan from the beginning to ensure that data and metadata are recorded accurately, are preserved securely (backups) and will be made accessible to others and ‘dataset’ ? a logical grouping of related measurements (often from the same sampling device or sensor)

slide 6 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Metadata metadata ~ “about the data” information required to interpret the data Metadata records capture the information required to answer the who, what, where, why, how and when questions that are asked about a data set. It is important to know who collected, analyzed and contributed the data and where, when and how those data were acquired and subsequently analyzed and processed.

slide 7 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Changes and Challenges data sets used to be smaller and were often published on paper (in a journal article or a data report, and they fit in Table 1) data were published as a tangible thing as data acquisition becomes automated, rate of acquisition and volume increases but metadata acquisition (data documentation) is not being automated at the same rate

slide 8 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office What else has changed? shift from ‘local’ to ‘global’  research themes  collaborative teams of researchers are trending toward being more distributed ~~ thematically and geographically technological advances are enabling these changes cultural changes lag behind technological changes  no direct relationship between career advancement and publication of data

slide 9 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Why data management matters Cultural Changes – a work in progress: goal: scientific data should be freely accessible to all achievement of that goal relies on agreement that: anyone using the data must properly acknowledge the data originators (proper citation of all source data used)

slide 10 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Cultural issues …  little incentive for researchers to publish their data  exacerbated by the perception that the data are the ‘property’ of the originating investigator, and might be ‘stolen’ Conventional wisdom is still that ‘publish or perish’ applies predominantly to journal publications, not data publication. In the US, funding agency program managers are beginning to effect change in this area. NSF, NASA and NOAA all require publication of data generated by federally funded research. Publication of Data

slide 11 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New funding agency requirements Division of Ocean Sciences Data and Sample Policy. National Science Foundation. NSF General Data Policy Principal Investigators are required to submit all environmental data collected to the designated National Data Centers as soon as possible, but no later than two (2) years after the data are collected. Inventories (metadata) of all marine environmental data collected should be submitted to the designated National Data Centers within sixty (60) days after the observational period/cruise.

slide 12 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New funding agency requirements Proposal Requirements The NSF Grant Proposal Guide requires that proposal Project Descriptions outline plans for preservation, documentation, and sharing of data, samples, physical collections, curriculum materials and other related research and education products. Plans for the handling of data and other products will be considered in the review process. Reporting Requirements Annual reports, required for all projects, should address progress on data and research product sharing. The Division of Ocean Sciences requires that final reports document compliance or explain why it did not occur.

slide 13 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Publication of Data call me and I might share freely available Each approach has associated pros and cons, but as more data are published and are made freely available, it will become more of an accepted practice, and community expectations will change as well. my data community data

slide 14 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office Paradigm Shift Updating the ‘red phone paradigm’... developing new and better ways to locate and retrieve data. familiar easy to learn it works convenient effective yields better results The grand challenge facing data managers today is to design a data access system that can replace the telephone.

slide 15 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New research paradigms... science themes are trending toward  interdisciplinary  basin-wide studies involving coupling of complex models  atmospheric and hydrologic  end-to-end food web... require access to data from many disciplines

slide 16 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New expectations for data access complex research themes (ocean biogeochemistry, ocean acidification research) require access to data collected by other researchers access to research designed to enable science-based decision support for legislative policies  social science  economics  history  broad range of disciplines

slide 17 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office What does ‘access to data’ mean? ability to locate data of interest determine ‘fitness for purpose’ accurately use the data “Scientists are confronted with significant data management problems due to the large volume and high complexity of scientific data. In particular, the latter makes data integration a significant technical challenge.” (A.K. Sinha, Geoinformatics, 2006)

slide 18 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New expectations for data access New tools based on emerging technologies are being developed to address the challenge of integration of distributed heterogeneous data informatics semantic mediation registered ontologies

slide 19 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office New expectations for data access all of the new technologies assume that data resources will be accompanied by machine-readable metadata while we wait for the new informatics tools, and semantic e-science resources to come online … … ocean science data accompanied by human readable metadata are of great value

slide 20 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office these data are incomplete and of little use to colleagues The dataset lacks sufficient metadata to enable efficient and accurate reuse. Presumably the data originator would decode Sample ‘DIL 10’ because they know it to be a proxy for where, when and how the data were collected.

slide 21 of 22 C.Chandler ~ Biological and Chemical Oceanography Data Management Office local... to global oldnew Atlantis, 1958 Challenges and Opportunities

Biological and Chemical Oceanography Data Management Office slide 22 of 22 end of part 1 “ You can’t play with the data without the metadata. Well, you can, but it’s much less fun. “ (Peter Wiebe, WHOI, 2009)