Do MORe with your data LoCloud Final Conference 5th February 2016 Dr. Dimitris Gavrilis Digital Curation Unit - IMIS, Athena Research Center LoCloud is funded by the European Commission's ICT Policy Support Programme
Do MORe with your data
MORe Architecture Key characteristics: Fault-tolerance High-availability Elasticity Scalability Key components Storage layer Decentralized & scalable services Pluggable services
Micro-service architecture Language identification Thesauri collections Vocabulary matching File-Upload Omeka Background links OAI-PMH Structure Geo normalization Archive MINT mapping tool Schema Geo coding Elastic Search Wikimedia Linking Reverse geo-coding RDF Store LoCloud collections Schematron rules Historic place names OAI-PMH Input sources Validation micro-services Enrichment micro-services Publish services Input service mgmt Validation service mgmt Enrichment service mgmt Publish serv. mgmt Core services layer Data access layer Storage nodes
Enrichment micro-services 14 enrichment services so far Thematic Spatial Temporal Other
Distributed Enrichment services run on: Austria Spain Greece Lithuania Slovenia Norway
Validation Validation schemes Schematron Rule based validation Flexibility Schematron Rule based validation No more rejected packages
Metadata Quality Get completeness graphs for every package and schema element Per mandatory/recommended set
Metadata Quality On the fly indexing, analysis and intuitive presentation of Thematic information Spatial information Temporal information
Preview
Publication Publish your enriched data to Europeana An RDF Store as LOD To Elastic Search Download them in a zip archive Publish to multiple targets simultaneously
Enrichment micro-services
Place names We have our own Geo-names server
Periods We have our own PeriodO database
We have access to over 30 thesauri AIT (Angewandte Informationstechnik Forschungsgesellschaft mbH
Thesauri mappings Map your subject terms to standardized concepts from SKOSified vocabularies AAT Perio.do …
Subject collections Subject collections showcase Publically available subject collections Seamless integration with MoRe Autocomplete search of terms within thesaurus Targeted enrichment based on item level subject terms
Metadata Enrichment Automatically enrichment of content with entries from: Wikipedia DBPedia SKOSified thesauri UPV/EHU – Universidad del País Vasco
Developers & Creative Industries API Integration MORe API allows to run the entire aggregation engine through REST Developers area API key generation API documentation with examples Example Java projects for NetBeans & Eclipse IDEs
Developers & Creative Industries Plugins Allows developers to create their own enrichment micro-services on their own servers and integrate them into the enrichment process of MoRe. Developers have to implement a REST based interface and declare it as an enrichment micro-service in MoRe
MORe success stories 10 more projects are using/evaluating MORe ARIADNE chose MORe as it’s primary aggregator Over 1 million records have been aggregated and published to the ARIADNE portal RDA DDRI WG uses MORe Zero downtime Zero data loss New metadata schemas have been integrated New enrichment services have been developed / integrated
Thank you d.gavrilis@dcu.gr
Funding LoCloud is funded by the European Commission's ICT Policy Support Programme The views and opinions expressed in this presentation are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission.
Native record (OAI_DC)
EDM Record Missing language attributes Place label is a concat string of coordinates
Language identification Enriched EDM Record Enrichment Plan Language identification Vocabulary matching Geo-normalization Geo-coding