RDF Generation and Linking using Open Refine Daniel Vila-Suero
Introduction Learn how to use Open Refine and its RDF extension Map tabular data to RDF Link entities to a external dataset
Overview 3
Installation Download Open Refine at: Download the RDF Extension at:
Installing the RDF Extension Make sure "extensions" folder exists in your Google Refine workspace Download the extension Extract the downloaded zip file to the "extensions" folder Start/Restart Google Refine
Data Download available at: museum/phm-collection-cleaned.zip
Task 1 Inspect the data Pre-process if neccesary
Task 2 Using the RDF extension map the data to RDF using a selection of vocabularies
Task 3 Using a reconciliation service try to find entities within your data that have URIs in other dataset. For example the LCSH (Library of Congress Subject Heading) with the following configuration: Name: LCSH (preprocessed) Endpoint URL: Graph URI: processed/ Type: Virtuoso Label properties: check only skos:prefLabel
Publish Load the data in a SPARQL endpoint (Virtuoso, 4store, Fuseki, Sesame, etc.) Install and configure a LD front-end: Recommended: Puelia (PHP), Elda (Java) Configurable using a config file in turtle Documentation available at: api/wiki/Specification 10