1 Data Description Registry Interoperability (DDRI) Working Group Dimitris Gavrilis, Amir Aryani
2 Enabling cross-platform discovery between research data registries. Problem & Context
3 ▪ANDS. Australian National Data Service ▪Data-PASS. Data Preservation Alliance for the Social Sciences ▪Dryad. Digital Repository ▪Thomson Reuters DCI. Data Citation Index ▪VIVO Cornell. Research-focused multidisciplinary discovery tool ▪CERN. European Organization for Nuclear Research ▪DANS. Data Archiving & Networked Services ▪da|ra. The DOI registration service in Germany ▪DCU. Digital Curation Unit – IMIS, Athena R.C. Partners
4 Example from Dryad
5 Author Information
6
7 Research Data Switchboard
8 Modelling Connections as a Graph
9 Duplication of Content ● Why ? ○ Same information submitted to different repositories ○ Harvested from multiple sources ● How can we deal with duplication ? ○ Unique identifiers ■DOI ■Handle.net ■ORCID ■ISNI ■... ● What happens if no unique identifier is present ?
10 A De-Duplication Service for the Humanities ● Motivation o Huge amount of humanities related content in Europe o Many aggregation projects about cultural heritage ● Simple examples o Same dataset is aggregated from different sources o Same dataset is submitted multiple times o Same author deposits data to different repositories o Same author registers twice o Two co-authors submit the same dataset twice from two different locations
11 Why De-Duplication ● Cleaner, more accurate data ● Save time in submission validation ● Save space and resources when aggregating content from multiple sources ● Improved interoperability
12 Service Design
13 Matching Algorithm ● Proposed Elements to use o Keywords o Issued Date o Subject Terms o Spatial Information o Temporal Information ● Matching algorithms o Accuracy Exact Approximate o Element Titles Authors/Creators Spatial Temporal
14 Prototype Implementation
15 Operational Workflow Harvest Service OAI-PMH File Upload REST Validation De-Duplication Report RDF Store Publish results IngestEnrichment
16 Thank you