A Perspective on Preservation of Linked Data Richard Cyganiak DERI, NUI Galway.

Slides:



Advertisements
Similar presentations
Geoscience Information Network Stephen M Richard Arizona Geological Survey National Geothermal Data System.
Advertisements

Museums and Digital Repositories October, The punch line… In the digital realm, museums: * are very much like libraries * tend to share the same.
(1) Standardizing for Open Data Ivan Herman, W3C Open Data Week Marseille, France, June Slides at:
VIVO and Linked Open Data December 13, 2010 Dean B. Krafft Chief Technology Strategist and Director of IT Cornell University Library.
Do not use fonts other than Arial for your presentations ‘From A2A to Web 3.0’: local authority archives and the challenges in working across sectors in.
Actual Trends Semantic Web Lecture WS 2010/2011. What‘s next? W3C view: Look at Semantic Web activity:
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.
WORKING WITH DATA ABOUT DATA: Introduction to Metadata | Ms Dot Porter| slide 1 A project of the Introduction to Metadata Working with Data.
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
The Data Cube Vocabulary: Statistics in the Web of Linked Data Arofan Gregory Open Data Foundation WICS, Geneva, 5-7 May 2015.
Data Sets, Vocabularies and Tools Pablo N. Mendes Freie Universität Berlin 1st year review Luxembourg, December /02/11.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
DAISY AND DEVELOPING COUNTRIES PERSPECTIVE BY DIPENDRA MANOCHA.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
Data on the Web Life Cycle Bernadette Farias Lóscio March, 2014.
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
CF Conventions Support at BADC Alison Pamment Roy Lowry (BODC)
Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences DC Thomas Bosch GESIS – Leibniz.
Resource Curation and Automated Resource Discovery.
A Journey in Data Discovery Wendy Watkins TSES October, 2007.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
In Dublin’s fair city, where the metadata are so pretty… John Roberts Archives New Zealand.
Challenges for Academic Libraries in the Networked World Christine L. Borgman Professor & Presidential Chair in Information Studies UCLA & Visiting Professor.
Study Discovery in Support of the Data Without Boundaries Initiative, the NIH Data Documentation Index and Infonomics Jay Greenfield Booz Allen Hamilton.
Antoine Isaac 1 st PRELIDA Workshop Pisa, June 26, 2013.
Access and Query Task Force Status at F2F1 Simon Miles.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Secure Epidemiology Research Platform (SERPent) Kick Start Meeting - April 15 th, 2010 Pascal Heus
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
DDI Discovery: An Overview of Current RDF Vocabularies Arofan Gregory Metadata Technologies NA Joachim Wackerow GESIS.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Access and Query Task Force Status at F2F1 Simon Miles.
EDM Europeana Data Model Guus Schreiber with input from Carlo Meghini, Antoine Isaac, Stefan Gradmann, Maxx Dekkers et al. from Europeana V1.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
Metadata Standards Directory Alex Ball, Jane Greenberg, Keith Jeffery, Rebecca Koskela.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
A practitioner’s guide to linked data for cultural heritage Jacco van Ossenbruggen 10m for 10y lessons learned from MultimediaN, PrestoPrime, EuropeanaConnect,
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
FAIR Metadata RDA 10 Luiz Olavo Bonino – - September 21, 2017.
Presented at Archives Records 2016, session 510
Wheat Data Interoperability Esther DZALE YEUMO KABORE Richard FULSS
knowledge organization for a food secure world
ALA Practical Linked Data With Open Source
OzNome 5-star Tool: A Rating System for making data FAIR and Trustable
Introduction to Metadata
Lifting Data Portals to the Web of Data
Identifiers Answer Questions
Linked Data Technologies in e-Government and e-Medicine
How can DDI make the most of RDF?
DDI-RDF Discovery Vocabulary _ Use Cases and Vocabularies
LOD reference architecture
W3C Recommendation 17 December 2013 徐江
OpenSearch and JSON-LD for enhanced Earth observation data and service discovery Dr. Ingo Simonis Workshop on making spatial data discoverable through.
Linked Data Ryan McAlister.
Australian and New Zealand Metadata Working Group
IGARSS 2019 Dr. Ingo Simonis July 2019
Classifications and Linked Open Data Formalizing the structure and content of statistical classifications Item 9.1 Standards Working Group Luxembourg,
Presentation transcript:

A Perspective on Preservation of Linked Data Richard Cyganiak DERI, NUI Galway

How is Linked Data preservation different? Easier because RDF is (sometimes) self- describing – Representation information and context tends to be explicit and machine-processable Harder because it is tied to a particular technology infrastructure – If the domain name is lost, a dataset can no longer be LD (cf. TimBL's four principles) – Doesn't mean the data is no longer useful

Why think about preservation of LD? 1.Can the preservation community teach us how to make data more self-describing? 2.Preservation requires packaging. LD needs better data packaging 3.Preservation requires versioning. LD needs better versioning 4.LD datasets do go offline. How can we deal with it? Preserving the bits is not necessarily the hardest problem!

Access and formats Multiple methods of publishing/accessing LD – Dereferenceable URIs – SPARQL endpoints – RDF dumps (triple/quad) – Embedding into web pages (RDFa, microdata) Focus on RDF dumps to keep things tractable and to maximise usefulness for non-RDF data

Vocabularies Meaning of an LD dataset depends on used vocabularies (a.k.a. ontologies) – Most important representation information – Vocabularies can change and disappear too – Need to be preserved alongside the data Vocabularies would be good starting point for LD preservation – Note: LOV already archives versions of 100s of vocabularies (

Versioning How to package individual versions of a dataset in an explicit, machine-readable way? There is no strong notion of versioning in the RDF community. – Books have editions. Software products have releases. This is important for data too. What version of Dataset X are you using? “Dependencies” between datasets and vocabularies, incl. versions? See also: Memento

Cataloging and packaging How can the various parts of a dataset and its surrounding information be packaged and held together in an explicit, machine-readable way? What metadata needs to be recorded about these packages to preserve context and make them findable? Potential benefit: Tooling for setting up a local copy of a published/archived dataset including all its dependencies See also: OKFN's data packages –

Existing relevant (?) standards VoID – Metadata standard for RDF vocabularies DCAT – Upcoming W3C standard for data catalogs PROV – W3C standard for provenance DDI Discovery Vocabulary – Used by data archives to document statistical microdata, survey data, etc.

Summary 1.The most important repository for LD preservation will be one that versions vocabularies 2.Focus on bulk RDF (dumps, not SPARQL endpoints or deref URI crawling) 3.Work towards good practices for making data self- describing and for metadata? 4.Work towards standards and good practices for packaging, versioning, dependencies? 5.Use existing standards: VoID, DCAT, PROV, Disco 6.Preservation across time… 7.But also preservation across space and communities