Maggie, Carlo, Peter, Rebecca (GEDE discussions)

PID Usage Issues

Some basics
- A PID has the form <prefix><delimiter><suffix>.
  - The prefix is assigned to a registration authority; all prefixes are different.
  - The suffix is locally unique.
  - For Handles and DOIs the delimiter is "/".
- The full PID is actionable, for example https://hdl.handle.net/11304/a3d012ca-4e23-425e-9e2a-1e6a195b966f
- Handle is a widely used technology; DOI is a community of Handle users.
- PIDs have now been discussed for 20 years, which gives 20 years of experience.
- 1976 US Cross-Industry Working Team: a Digital Object
  - has some digital material (data, software, ...)
  - has a PID
  - has some metadata
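The PID structure above can be illustrated with a minimal Python sketch. It assumes the first "/" is the delimiter (as stated for Handles and DOIs) and uses the hdl.handle.net proxy from the slide's example; the names `Pid`, `parse_pid` and `to_url` are made up for illustration and are not part of any Handle library.

```python
from typing import NamedTuple
from urllib.parse import quote

# Resolver proxy taken from the example URL on the slide.
RESOLVER = "https://hdl.handle.net/"


class Pid(NamedTuple):
    prefix: str  # assigned to a registration authority
    suffix: str  # locally unique under that prefix


def parse_pid(pid: str) -> Pid:
    """Split a Handle/DOI string at the first "/" delimiter."""
    prefix, sep, suffix = pid.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a <prefix>/<suffix> PID: {pid!r}")
    return Pid(prefix, suffix)


def to_url(pid: Pid) -> str:
    """Build the actionable form of the PID via the resolver proxy."""
    # Percent-encode the suffix, keeping any further "/" characters intact.
    return RESOLVER + pid.prefix + "/" + quote(pid.suffix, safe="/")


example = parse_pid("11304/a3d012ca-4e23-425e-9e2a-1e6a195b966f")
print(example.prefix)
print(to_url(example))
```

Splitting only at the first delimiter keeps suffixes that themselves contain "/" in one piece.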

Granularity and collection building
- Digital objects (DOs) will be re-used and re-combined by others, and we cannot predict how they will be used in a few years. Each scientifically meaningful object therefore needs its own identifier.
- DOs are not just referenced within publications; increasingly we will also need stable references in our data processing (workflows, etc.) to guarantee reproducibility.
- Strategies will differ by discipline; the repositories storing data need to make their strategy clear.
- There seems to be a trend towards assigning Handles at high granularity and DOIs to citable collections (climate modelling, linguistics, etc.).
- In some labs it is already common practice to create virtual collections, which consist only of some metadata and a set of PIDs pointing to DOs; the collections themselves are assigned a PID.
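A virtual collection as described above (some metadata, a set of member PIDs, and a PID of its own) might be modelled roughly as follows. This is only a sketch: the class name, field names and the PID strings are hypothetical and do not reflect any particular lab's implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class VirtualCollection:
    pid: str                   # the collection's own PID
    metadata: Dict[str, str]   # descriptive metadata about the collection
    members: List[str] = field(default_factory=list)  # PIDs of member DOs

    def add(self, member_pid: str) -> None:
        """Add a DO by reference; the DO itself is not copied."""
        if member_pid not in self.members:
            self.members.append(member_pid)


# Hypothetical PIDs, for illustration only.
collection = VirtualCollection(
    pid="prefix/collection-1",
    metadata={"title": "Example virtual collection"},
)
collection.add("prefix/object-a")
collection.add("prefix/object-b")
print(collection.members)
```

Because the collection holds only references, the same DO can belong to any number of collections without duplication.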

When to assign PIDs
- Some digital content is obviously subject to change, which raises the question of when (small versus major changes) a changed object should be assigned a new PID.
- In some communities people work on such DOs and make many changes without "registering" a new version so that it can be accessed etc.
- Possibly the use of versionable databases in conjunction with assigning PIDs to queries, as already suggested by an RDA working group, can address this issue, but not all communities feel this is practical or implementable.
- In this case too, the repositories and/or communities need to indicate which policies they follow.
- In some cases it may even be useful to assign PIDs before uploading content into a repository; however, problems may then occur (what about relevance and accessibility of data on notebooks etc.).
- It may help to define the term "repository" as something "simple": a "repository" is an entity whose primary tasks are to provide services to access digital object content and essential state information, given an object's PID, and to enable reliable and trusted data management.

Versioning and PID binding role
- Some repositories use an attribute in the PID record to refer to the previous and/or subsequent version; if these attributes are typed, machines can use the information too.
- Other repositories include this information in metadata records, which is probably not as efficient as using the PID record.
- We are obviously increasingly dependent on PIDs, so we need to work towards a stable system that is well maintained, redundant, etc.
- If we have such a system, we can use PIDs to bind various types of information (bit sequences, metadata of different types, landing pages, etc.).
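To illustrate how typed version attributes in a PID record could be used by machines, here is a small Python sketch that follows "next version" links to the latest version. The attribute names `previousVersion` and `nextVersion`, the in-memory `records` dictionary (standing in for a resolver) and the PID strings are all assumptions made for the example.

```python
from typing import Dict

# Hypothetical attribute type names; real repositories define their own.
PREV = "previousVersion"
NEXT = "nextVersion"

PidRecord = Dict[str, str]


def latest_version(pid: str, records: Dict[str, PidRecord]) -> str:
    """Follow NEXT links from `pid` until a record has no successor."""
    seen = set()
    while True:
        if pid in seen:
            raise ValueError(f"version chain loops at {pid!r}")
        seen.add(pid)
        successor = records[pid].get(NEXT)
        if successor is None:
            return pid
        pid = successor


# In-memory stand-in for resolving PID records; PIDs are made up.
records = {
    "prefix/v1": {NEXT: "prefix/v2"},
    "prefix/v2": {PREV: "prefix/v1", NEXT: "prefix/v3"},
    "prefix/v3": {PREV: "prefix/v2"},
}
print(latest_version("prefix/v1", records))
```

Keeping the links in the PID record means the chain can be walked without fetching or parsing each version's metadata record.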

PID Attributes and Semantic Categories
- There is an urgent need to discuss this; a session should be organised at the Barcelona plenary.
- It is about defining a set of types, but there is no obligation to use them all.
- It is generally agreed that one should not overload the PID record.
- Some use fragment indicators; they are not part of the PID.
- There is a need for Persistent Identifiers that refer to concepts and/or categories used in specific disciplines.
- It is not obvious which kind of references should be used to refer to semantic categories.
- The semantic web community suggests using cool URIs.
- There are existing practices in the communities which need to be respected; in biodiversity quite a number of schemes are in use, but not yet in a systematic fashion. They are looking for an overarching schema to overcome fragmentation.
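The point that fragment indicators are not part of the PID can be shown with a short Python sketch that separates the two before resolution. It assumes the fragment is introduced by "#", as in URIs; the slide does not name the separator, and the function name and PID strings are hypothetical.

```python
from typing import Optional, Tuple


def split_fragment(reference: str) -> Tuple[str, Optional[str]]:
    """Separate the PID from an optional trailing fragment indicator.

    Assumes "#" introduces the fragment, as in URIs.
    """
    pid, _, fragment = reference.partition("#")
    return pid, (fragment or None)


# Made-up reference, for illustration only.
pid, fragment = split_fragment("prefix/object-a#section-2")
print(pid)       # the part that is resolved
print(fragment)  # interpreted by the client or repository, not the resolver
```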