CLARINO WP2 National Registry and Long- Term Archiving Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway Bergen, 12. September 2013
National Registry of metadata Goal – Joint metadata registry of resources in all Clarino centres Harvest data from all CLARINO centres Exchange data with other national CLARIN centres Status – current situation On-going and planned activities
National Registry of metadata Status (1) Metadata registry version 1 is running – Search/browse, editing and management, but no harvesting facilities – Infrastructure: META-SHARE infrastructure 3.0 – proxied by the managing node – Metadata complying META-SHARE metadata format 3.0 – No harvesting facilities – Metadata content: 71 resources – Usage: : 37 of the resources downloaded 1-17 times – Norwegian Wordnet (Bokmål) at the top – Topmost downloading locations: Norway, Germany, Greece, Sweden
National Registry of metadata Status (2) Decision made: Migrate to CMDI (CLARIN platform) – Uncertain future for META-SHARE 2 ys guaranteed life span – Need for more adaptability and expressivity in metadata model – Increased involvement with the CLARIN community
National Registry of metadata Planned activities Build a basic CMDI infrastructure – Repository, editor, search service, PID scheme, harvesting Convert metadata from META-SHARE to CMDI – Use META-SHARE profile as specified in Component Registry Extend/adapt metadata model according to need – In collaboration with the other CLARINO centres
CMDI Metadata framework Search Service Joint Metadata Repository TextLab EDD Relation Registry ISOcat Concept Registry Other trusted concept Registries CLARIN Component Registry Bergen Centre LAP META-SHARE components, a.o Other centre… Component editor Metadata editor Adaptation of Broeder, D. A Data Category Registry- and Component-based Metadata Framework. LREC «My profile» Definitions of concepts used in metadata components Metadata modeler Metadata creator Språk- banken User Infrastructure provided by CLARIN centrally
National Registry of metadata; Services Repository CMDI Metadata Editor (Arbil..?) Metadata creator OAI/PMH harvesting Search Services Weblicht VLO FCS? «Our profiles» Clarin common infrastructure
Data Repository Metadata editor -Resoures Data Delivery client Processing and adaptation for long term storage (Checksum,pid, metadata etc.) NB long term storage (preservation) Long term archiving
Time perspective Metadata registry version 2 : Primo 2014 – Basic CMDI infrastructure existing metadata converted from META-SHARE OAI/PMH endpoint, but no harvesting from other centres Metadata registry version 3: Mid 2015 – Extended/adapted metadata model – Harvesting from other CLARINO centres Long term archiving: Mid 2014 with both data and metadata.
CLARINO WP2 National Registry and Long- Term Archiving Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway Bergen, 12. September 2013