Report for Wed. Question 1 In your view/experience what parts of data life cycle, data citation and data integration implementations/applications or frameworks.

Slides:



Advertisements
Similar presentations
Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
Advertisements

Integrating Data and Publication Researchers Perspective Max Wilkinson APA 9th Nov 2011.
SCORM-NSDL Workshop May 18, Educational Materials are Scattered across the Internet NASA Math Forum State standards Scientific American Ask.
Spruce Group Notes by Julia Collins Facilitated by Erin Robinson
Open Dialogue on Digital Data management
Research Data Service at the IT Pro Forum HEIDI IMKER, DIRECTOR.
Sara Bowman Center for Open Science Open Science Framework: Facilitating Transparency and Reproducibility.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Field Project Planning, Operations and Data Services Jim Moore, EOL Field Project Services (FPS) Mike Daniels, EOL Computing, Data and Software (CDS) Facility.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
Metadata Guides for Smarties Marine Metadata Initiative URL:
Challenges & opportunities in the preservation of (digital) information: the case of European research libraries Museo de las Ciencias Teatro de UNIVERSUM.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Ensemble Computing in the National Science Digital Library (NSDL)
VO Sandpit, November 2009 Environmental Data Archival: Practices and Benefits crib sheet Graham Parton With many thanks to Dr.
CCSM DATA MANGEMENT POLICY The Community Climate System Model (CCSM) Data Management Policy documents the procedures for the management of model data produced.
Data Providers Dissemination – Access, cost, formats, size, metadata, service, support, findability, Policies – Copyright, fees, confidentiality, preservation,
An Environmental Scan for Data Services Trends that are shaping today’s environment for data services.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
Dr. Fran Berman, RPI Feedback from BRDI Sponsor Forum 11/11 January 29, 2012 Fran Berman.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
Lifecycle Metadata for Digital Objects September 11, 2002 Major archival and digital library metadata schemes.
Introduction GeoData 2014 Workshop #geodata2014 June 17-19, 2014,NCAR, Boulder, CO Peter Fox (RPI)
2004 Annual Report Summary. 2 Summary of Responses.
The Role of Academic Libraries in the Digital Data Universe Break-Out Session: New Partnership Models Bob Hanisch and Brian Schottlaender Co-Leaders ARL.
What is CDR? – A Few Examples Water Resources in a Changing Climate – Idaho Climate Change Large CD consortia — not the case that everyone works on everything.
Secure Epidemiology Research Platform (SERPent) Kick Start Meeting - April 15 th, 2010 Pascal Heus
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Portable Infrastructure for the Metafor Metadata System Charlotte Pascoe 1, Gerry Devine 2 1 NCAS-BADC, 2 NCAS-CMS University of Reading PIMMS provides.
The CF Conventions: Options for Sustained Support Involving Unidata Russ Rew Unidata Policy Committee May 12, 2008.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
Exporting WaterML from the Earth System Modeling Framework Xinqi Wang Louisiana State University NCAR SIParCS Program August 4, 2009.
Warwick Cathro Assistant Director-General Resource Sharing and Innovation National Library of Australia Trove – a service built on collaboration OCLC Asia.
CESD 1 SAGES Scottish Alliance for Geoscience, Environment & Society The challenges of geo-simulation data Centre For Earth System Dynamics
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Day 3 Agenda 1.Brief review of where we are 2.Break out reports on recommendations, then distillation incl. identifying any major gaps 3.Proposed outline.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Proposal for a new RDA/TDWG WG Attribution Standards for Data Object Curation.
1 Interactions between the Marine Data Harmonization IG and Data Citation WG.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
Library of Congress Partnerships for Managing Geospatial Data North Carolina Geographic Information Coordinating Council Raleigh, NC November 7, 2007 William.
CI.III.1 Wider Adoption, Deployment, Utilization of a Cyberinfrastructure David De Roure.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
End-to-End Data Services A Few Personal Thoughts Unidata Staff Meeting 2 September 2009.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
@ulccwww.ulcc.ac.uk IRMS Cymru October 2015 From EDRMS to digital archive: a wish-list for ways to preserve digital records.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Lidar Radar Open Software Environment LROSE Mike Dixon Earth Observing Laboratory (EOL) National Center for Atmospheric Research (NCAR) Boulder, Colorado.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech 5 th GO-ESSP Community Meeting.
BG 5+6 How do we get to the Ideal World? Tuesday afternoon What gaps, challenges, obstacles prevent us from attaining the vision now? What new research.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EPOS and EUDAT.
Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
COST Action and European GBIF Nodes Anne-Sophie Archambeau.
Federation of Earth Science Information Partners EGIDA Workshop May 9-11, 2011, Bonn, Germany.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
DP Knowhow: Open Archival Information Systems (OAIS) in ISO APA/C-DAC International Conference on Digital Preservation and the Development of Trusted.
Moving on : Repository Services after the RAE
Common Framework for Earth Observation Data
CNI Spring 2010 Membership Meeting
Bird of Feather Session
Presentation transcript:

Report for Wed. Question 1 In your view/experience what parts of data life cycle, data citation and data integration implementations/applications or frameworks are well established (or not) in your disciplines and what are the common gaps?

Understanding Value Streams Single, continuous stream? – Collect, Process, Archive, Discover, Access, Use Two distinct streams? – “Original Intent”: Collect, Process, Archive, Use – Secondary Use: Discover, Access, Use – Key differences: funding org, different community – BUT: can anything be done in the original intent stream to facilitate the secondary use?? Spiral Model? – Collect, Process, Archive, Discover, Access, Use (=Process further) – Process further, Archive, Discover, Access, Use, …

What Part of Framework is Working: Standards WORKING – Some standards are useful and widely used “Self-describing” formats: SEED, HDF, netCDF Climate-Forecast (CF) convention

Gaps in Standards Some standards are underused – E.g., ISO metadata: cost, learning curve Need to consider the human factor – Tool availability Interdisciplinary standards are problematic – Discontinuities between disciplines in standards use – Observations and MeasurementsModel may help here for some disciplines Need standards to support the scientific workflow – E.g., when to add metadata and what kind of metadata Standards Churn (changing too fast/often)

The Human Factor in Data Lifecycle Management Incentives – Sticks: funding, publication requirements – Carrots: wider use of data, citations

The problem with citations… Human and Process problems – Citations are not being used where they should be – Digital data citations are not accepted in some citation indices – Data are not often peer-reviewed, therefore of uncertain quality and citability. Technical problems – Agreement and widespread use of data identifiers – Citation granularity (dataset vs files vs columns in files)

Metadata Capture We need to capture more metadata at the point of data origin – Ideally, built into the collection mechanism – Also, following standards – Exemplars: EXIF standard for cameras ArcCatalog SEED format from seismometers EarthChem We need to capture more metadata at later processing steps (beyond basic provenance) – Gap: handling provenance granularity

Where/how to implement robust data management practices Federal data centers – NOAA data centers, NASA DAACs Federally Funded Research Centers – NCAR University Consortia – IRIS DMC Libraries (Could collaborate more with data centers) Collaborations between scientists and data managers – Argonne “catalysts” example of helping scientists leverage computing facilities: apply to data mgmt Professional Societies Individual Universities – U. of Oklahoma Climate Services Center(?) Key Gap: Robust Business Model for Long-term Persistence of Data Archive

Some Proposals to Involve More People in the Data Lifecycle… Teach students about data management and require them to make data and metadata available as part of their thesis – Partnership with university libraries would be key Involve 4-yr colleges more (not just graduate programs) Provide a mechanism for people other than the data provider to add annotations to data Provide more education on data management to practicing scientists

Unresolved Questions Model Output: treat like data or something else? What to do about identifiers and locators for data? Discussion assumed the web to be an integral part of the lifecycle. Is this Good or Bad, considering the overall low reliability of info on the web? – Establishing trust for data is clearly important

Comments/Questions Ted: – Need to stop talking about hard metadata is, or people will believe it – Hard to make generalized tools Maybe make more domain specific tools? Did you discuss metrics? – JG, maybe use SEI CMM model