Working Group 4 Data and metadata lifecycle management  1. Policies and infrastructure for data and metadata changes  2. Supporting file and data formats.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

New Century, New Metadata Thomas Krichel University of Surrey, Hitotsubashi University and Long Island University.
Theme 3: Architecture. Q1: Who houses stuff, both records and identifiers All useful services and repositories are centralized (latency, etc.) … but centralizing.
Building Repositories of eprints in UK Research Universities Bill Hubbard SHERPA Project Manager University of Nottingham.
Long-term Digital Metadata Curation Arif Shaon University of Reading 16 April 2014.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
Breakout 1 Socio-legal etc. Every discipline will be different & each data centre will have different answers to questions. Use a questionnaire and send.
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
A centre of expertise in data curation and preservation London :: ARK Group Workshop: Archiving the Web :: 28 Sept 2006 Funded by: This work is licensed.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Meeting Disciplinary Challenges in Research Data Management Planning – March 23 rd 2012 Data Management Planning for Secure Services (DMP-SS) † Tito Castillo,
Health Ingenuity Exchange (HingX) Best Practices for User Groups and Resource Registration.
Advanced Metadata Usage Daan Broeder TLA - MPI for Psycholinguistics / CLARIN Metadata in Context, APA/CLARIN Workshop, September 2010 Nijmegen.
Introduction to Research Data Management Services, January 2013 The Analysis Stage Analyzing the data from the 4 exercises.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Versioning Requirements and Proposed Solutions CM Jones, JE Brace, PL Cave & DR Puplett OR nd April
December 2008 MRC Data Support Services (DSS) Chris Morris 13 th February 2009 Sharing Research Data: Pioneers, Policies and Protocols The seventh cat.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
Figures for ADMIRAL Project grant application These figures are copyright © David Shotton, University of Oxford, They are made available for reuse.
University of Southampton, U.K.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
Agenda: DMWG SM policy status ESIP meeting recap Reminder - DM Webinar Series New and updated web pages on DM website Metadata Training Sessions CDI meeting.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
THROUGH OR AROUND? SCIENTIFIC RESEARCH DATA AND THE INSTITUTIONAL REPOSITORY Panel Presentation for the International Conference on University Libraries.
UVa Library Research Data Services
VO Sandpit, November 2009 Environmental Data Archival: Practices and Benefits crib sheet Graham Parton With many thanks to Dr.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
What’s the use?: Searching for catalog user tasks beyond finding, identifying, selecting, and obtaining Marty Kurth Heads of Cataloging Interest Group.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
Dataset Metadata Joan Starr California Digital Library January, Tools and Approaches for Access and Preservation.
BUILDING ON COMMON GROUND: EXPLORING THE INTERSECTION OF ARCHIVES AND DATA CURATION Lizzy Rolando & Wendy Hagenmaier 6/3/2015IASSIST 2015.
The Physiome Model Repository – PMR David Nickerson Auckland Bioengineering Institute The University.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
ScholarSpace & Open UH Mānoa March 2013 Beth Tillinghast Web Support Librarian ScholarSpace & eVols Project Manager UHM Library.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution 3.0 License.
TopCAT Use Cases Priorities User Interface 1 ICAT developer workshop, August 2009 Laurent Lerusse – STFC
FRErator – the Bridge between FRE and Curator DB.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Digital Repositories: Concepts and Issues By Devendra. S. Gobbur (Sr) Assistant Librarian, Gulbarga University, Gulbarga. 10 NOV, NOV, 2009.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Digital Preservation Initiatives in the United States A Summary Deanna B. Marcum.
Democratization of ‘Omics Data Availability and Review Robert Chalkley UCSF Data Management Editor - MCP.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Current as of April/May 2013
Open access as a means to produce high quality data Anja Gassner Head Research Method Group Sentinel Landscape Coordinator FTA World Agroforestry Centre.
VI-SEEM Data Repository
Institutional role in supporting open access, open science, open data
Changing Practices… Changing Values
VI-SEEM Data Repository
Health Ingenuity Exchange - HingX
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
ENVRI Reference Model (RM) Information Viewpoint components
Research data lifecycle²
Presentation transcript:

Working Group 4 Data and metadata lifecycle management  1. Policies and infrastructure for data and metadata changes  2. Supporting file and data formats  3. Policies for change control for data and metadata  4. Provenance and approval of data and metadata versioning

Priorities for Small Science data repositories  Preserve the Data (Urgent) Descriptions of data, experimental designs, use cases, policies about inclusion and exclusion of data points… Can be thought of as a conversation between humans that supports the need to re-create the experiment and independent analysis. Unambiguous linkage between data sets and publications  Accessibility of Data When you have it preserved, support access appropriately (search support, etc.)  Formulation of public policies pertaining to curation of data  ? Formalization of a vocabulary to describe relations among datasets, change policies….  ? Automation Support for automated data exchange and use Schemas to capture machine-processable data

Data and Applications where does the complexity belong?  Simple data Complex Applications  Complex data Simple applications  Data Schemas Applications Where the intelligence is invested will have an impact on managing the lifecycle of data sets Applications require curation as well as the data. Schemas do as well

How can data standards be motivated and managed?  Data standards stakeholders Researchers Publishers Data Curators Funding agencies  What are the motivations for adopting and enforcing standards? Publication attribution – data sets must be recognized publicly as creditable publications (Carrot) Requirement of publishing (paper not accepted unless data is recorded in a suitable repository (Stick) Publishers may find that their journals are more widely used to the extent that the data is openly accessible Publishers may actively oppose open publication of data Funding agencies may require open publication of data as a condition of funding

Curation issues  Classes of metadata? Author-generated Machine-generated Third-party (repository curators) User annotations (Web 2.0 style?)?  What sort of metadata is important for a data set independent of the publications that reference it?

‘Ownership’ of data and metadata  Who can edit datasets? Assertion: datasets should never change, but rather should be versioned with changes clearly journaled Is there a need for conventions concerning post- publication status changes? (fraud/errors/augmentation…)  Who can edit metadata? Assertion: metadata is a curatable object independent of that which is described: the creator or curator is responsible for change policy (not the data set creator)

Additional Questions  What are the natural institutional homes for repositories?  Is there support in the data repository world for retrieval of known-item data sets (this is the canonical identifier problem)  Life-cycle of data sets imply that death may be an event in the cycle. How are policies concerning life and death of publications, datasets, and the relationships between them assured?