Publishing the British National Bibliography as Linked Open Data

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

Catherine Worrall Slide Library Co-ordinator, University College Falmouth.
UKOLN, University of Bath
From content standards to RDF Gordon Dunsire Presented at AKM 15, Porec, 2011.
Introduction to linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar Linked data and the Semantic Web: what have.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Semantic Web Introduction
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Forging Links & Breaking Shackles The Linked Open Data BNB Brenda Young Metadata Systems Manager.
Introducing Linked Data ISD Spotlight Presented by Alison Hitchens 2013.
VIVO and Linked Open Data December 13, 2010 Dean B. Krafft Chief Technology Strategist and Director of IT Cornell University Library.
Corey A Harper DC2006 October 4, 2006 Authority Control for the Semantic Web Encoding Library of Congress Subject Headings (LCSH) in SKOS.
Highs and Lows of Library Linked Data Adrian Stevenson UKOLN, University of Bath, UK (until end Dec 2011) Mimas, Libraries and Archives Team, University.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
A Registry for controlled vocabularies at the Library of Congress
MINT – METADATA INTEROPERABILITY SERVICES Nikolaos Simou – National Technical University of Athens.
National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.
Ontologies aka: your metadata elements. “ontology” / “vocabulary” / “term” / “element” “…vocabularies define the concepts and relationships (also referred.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
ISO as the metadata standard for Statistics South Africa
Evangelia Mitsopoulou, St George’s University of London Panagiotis Bamidis, Aristotle University of Thessaloniki Daniela Giordano, University of Catania,
Publishing the British National Bibliography as Linked Open Data Corine Deliot Metadata Standards Analyst British Library CIG Event Birmingham, 25 November.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Archival description and linked data: Opportunities and implementation challenges Karen F. Gracy, Ph.D., Kent State University The Metadata Vocabulary.
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Data on the Web Life Cycle Bernadette Farias Lóscio March, 2014.
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
The British Library and SUNCAT Brenda Young The British Library Bibliographic Development.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Creating an Application Profile Tutorial 3 DC2004, Shanghai Library 13 October 2004 Thomas Baker, Fraunhofer Society Robina Clayphan, British Library Pete.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Evidence from Metadata INST 734 Doug Oard Module 8.
RELATORS, ROLES AND DATA… … similarities and differences.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Metadata Registries Registry: authoritative, centrally controlled store of information – W3C Web Services Glossary, 2004
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Resource Description and Access (RDA) information session Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Getting triples from records: the role of ISBD Gordon Dunsire Presented at Centar zu Stalno Stručno Usavršavanje (CSSU), Zagreb 21 Nov 2011.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
RDA and Linked Data Gordon Dunsire Presented at Cita BNE - RDA and Linked Data, 15 April 2016, Madrid, Spain.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Making Connections Creating Linked Open Data Neil Wilson Head, Collection Metadata UKSG Webinar June
Linked Data and Libraries
Repository Software - Standards
Geospatial Knowledge Base (GKB) Training Platform
BIBFRAME at the Library of Congress
Recording RDA data as linked data
Applications of IFLA Namespaces
Eurostat activities update
PREMIS Tools and Services
Accommodating local cataloguing traditions in a global context
Session 2: Metadata and Catalogues
LOD reference architecture
RDA in a non-MARC environment
RDA Community and linked data
Antoine Isaac SEMIC conference
Presentation transcript:

Publishing the British National Bibliography as Linked Open Data Corine Deliot Metadata Standards Analyst British Library Linked Data: what cataloguers need to know London, 20 February 2015 © The British Library Board 2014

Overview Motivations and approach The modelling process and the data model Technical process: from MARC 21 to RDF, linking to external datasets Outcomes and dissemination Plans for future developments Use of the BNB data Challenges Benefits

Motivations Publishing our data for others to re-use Looking beyond library audiences Taking part in the Linked Data conversation

How? Pragmatic, bottom-up approach Using existing staff Building on existing skills Using existing tools as much as possible But training and mentoring from external provider

Why BNB? General bibliography - not a unique institutional catalogue Consistent format - over 60 years Size & range of content - 3 million records on all subjects in many languages Control of metadata – publishable as CC0. © Waldir/ Wikimedia Commons/ CC BY-SA-3.0 Usage terms: http://creativecommons.org/licenses/by-sa/3.0/

The modelling process (I) identify our objects of interest, i.e. what does the MARC record says about “things in the world” e.g. Bibliographic resources, people, organizations, places, subjects, etc. Assign URIs to identify these objects of interests

URIs: Things to think about Create our own URIs or use existing ones? e.g. http://viaf.org/viaf/96994048 http://id.loc.gov/authorities/names/n78095332 Create opaque or transparent URIs? e.g. http://viaf.org/viaf/96994048 or http://dbpedia.org/resource/William_Shakespeare What pattern? URI pattern guidance from the UK Cabinet Office “Designing URI Sets for the UK Public Sector” Create valid, i.e. syntax conformant URIs

URI patterns http://bnb.data.bl.uk/id/resource/{control-number} http://bnb.data.bl.uk/id/resource/{BNB-number} http://bnb.data.bl.uk/id/person/{person-name} http://bnb.data.bl.uk/id/organization/{organization-name} http://bnb.data.bl.uk/id/concept/lcsh/{topic} http://bnb.data.bl.uk/id/concept/ddc/{edition- number}/{dewey-number}

URI patterns http://bnb.data.bl.uk/id/resource/008043929 http://bnb.data.bl.uk/doc/resource/008043929 http://bnb.data.bl.uk/doc/resource/008043929.rdf http://bnb.data.bl.uk/doc/resource/008043929.ttl http://bnb.data.bl.uk/doc/resource/008043929.json http://bnb.data.bl.uk/doc/resource/008043929.html http://bnb.data.bl.uk/doc/resource/008043929.xml

The modelling process (II) Describe these objects of interest, i.e. use classes and how they relate to each other, i.e. use properties Use classes and properties from existing RDF vocabularies Define our own classes and properties when required; documented in the British Library Terms RDF schema

RDF Vocabularies Bibliographic Resource Person/Organization Event Dublin Core Bibliographic Ontology ISBD British Library Terms Event Event Ontology Person/Organization FOAF: Friend of a Friend Bio: a Vocabulary for Biographical Information Org: an Organisation Ontology RDA MADS/RDF Place WGS84 Geo Positioning Concept SKOS British Library Terms RDF RDF Schema OWL

The British Library Terms RDF Schema @prefix blt:<http://www.bl.uk/schemas/bibliographic/blterms#> . Existing property “not quite right” (e.g. not granular enough) e.g. dcterms:identifier vs blt:bnb

The British Library Terms RDF Schema @prefix blt:<http://www.bl.uk/schemas/bibliographic/blterms#> . Property or class required by specific feature of the model e.g. blt:publication and blt:PublicationEvent (rdfs:subClassOf event:Event)

The British Library Terms RDF Schema @prefix blt:<http://www.bl.uk/schemas/bibliographic/blterms#> . For pragmatic reasons, e.g. facilitate searching and navigating through the graph e.g. blt:TopicLCSH and blt:TopicDDC e.g. blt:hasCreated owl:inverseOf dcterms:creator

The BNB data model - Books http://www.bl.uk/bibliographic/pdfs/bldatamodelbook.pdf

Data Model Features (I): the Bibliographic Resource

Data Model Features (II): Publication as an event @prefix dc:<http://purl.org/dc/elements/1.1/> . @prefix dcterms:<http://purl.org/dc/terms> . <BibResource> dc:publisher “Publisher” ; dcterms:issued “Date” ; ?:placeOfPublication “Place” . @prefix blt:<http://www.bl.uk/schemas/bibliographic/blterms#> . @prefix event:<http://purl.org/NET/c4dm/event.owl#> . <BibResource> blt:publication <PublicationEvent> . <PublicationEvent> event:place <Place> ; event:agent <Publisher> ; event:time <Year> . Usual approach Event-based approach

Data model features (III) Birth and death are modelled as biographical events extensive use of foaf:focus to relate “things in the world” (e.g. people, organizations, places) to their SKOS concepts. e.g. “London”, the capital of England and the UK as a single “thing in the world” may be the “focus” of multiple concepts belonging to different concept schemes, e.g. thesauri (LCSH, Rameau, etc.) <Thing-as-Concept> foaf:focus <Thing in the World> . http://efoundations.typepad.com/efoundations/2011/09/things-their-conceptualisations-skos-foaffocus-modelling-choices.html by Pete Johnston

MARC to RDF Conversion Workflow Process Selection Character set conversion Pre-processing URI generation Data transformation Create & load triples Produce VoiD descriptions Tools Catalogue Bridge Utilities MARC Global/MARC Report http://www.marcofquality.com/ Jena Eyeball http://jena.sourceforge.net/Eyeball/

Linking to external sources (I) To give our data broader context we linked to: General resources: GeoNames Lexvo ISNI RDF Book Mashup Library resources: LCSH VIAF Dewey.info MARC language and country codes Page 20

Linking to external sources (II) Techniques included: Automatic generation from record data Auto text match with linked data dumps Crosswalk matching for coded data © Silverspoon/ Wikimedia Commons/ CC BY-SA-3.0 Usage terms: http://creativecommons.org/licenses/by-sa/3.0/

Serializations available: RDF/XML, N-Triples Outcomes Two datasets – Books and Serials - and their VoID descriptions, accessible at: BNB Linked data platform: http://bnb.data.bl.uk SPARQL endpoint: http://bnb.data.bl.uk/sparql SPARQL editor: http://bnb.data.bl.uk/flint-sparql Bulk downloads: http://www.bl.uk/bibliographic/download.html Updated monthly Serializations available: RDF/XML, N-Triples “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” Usage terms: http://creativecommons.org/licenses/by-sa/3.0/

http://bnb.data.bl.uk

Platform change 2011 - initial Talis platform 2013 – data migration to TSO platform http://www.tso.co.uk/our-expertise/technology/openup-platform Tendering process Migration of data and services over a couple of months

http://bnb.data.bl.uk/flint-sparql

http://www.bl.uk/bibliographic/download.html

Dissemination British Library Terms RDF schema declared in LOV (Linked Open Vocabularies) http://lov.okfn.org Linked Open BNB on data.gov.uk 5 * Openness rating included in the National Information Infrastructure http://data.gov.uk/dataset/the-linked-open-british-national- bibliography Open Data Institute certification Pilot level 92% Data Quality indicator as part of Heritage & Culture Challenge evaluation

Plans for Future Developments Refine and extend the model Investigate frbr-ization Link to other external sources e.g. DBPedia/Wikidata Collaborate with other national libraries Expand scope beyond current BNB, e.g. the British Catalogue of Music. Improve developer support

Use of the BNB data Statistics e.g. Number of hits on the SPARQL endpoint e.g. Number of downloads on the BL webpage e.g. Web logs analysis reports BNB data used in pilot projects e.g. Linked Open BNB data used as test data for a semantic search demonstrator. e.g. data provided to Microsoft to assist in their research into linking structured data. BNB data used in tutorials e.g. http://www.meanboyfriend.com/overdue_ideas/2014/10/using-an-api-hands-on-exercise/ - Owen Stephens

Use of the BNB data Anecdotal evidence However, use is quite difficult to assess; part and parcel of the data being open and available for all to use

Challenges Legacy data issues Converting MARC data into RDF! Publication event approach: transforming transcribed text into data URI creation from string may result in duplication changes over time may also produce duplication. Legacy data issues e.g. inconsistency of the data e.g. cataloguers using inadequate input tools for diacritics This was (relatively) new, nobody had all the answers

Benefits of Linked Open Data We have learnt a lot about the practical aspects of working with linked data. The data model is influencing other implementations Re-used by Danish Bibliography Centre LOD raised the profile of Collection Metadata internally and the Library’s profile externally LOD helped us focus our legacy data enhancement activities

For further information http://bnb.data.bl.uk http://www.bl.uk/bibliographic/datafree.html Thank you. Questions? metadata@bl.uk http://twitter.com/#!/BLMetadata