Practical Application of Linked Data Music Library Association Annual Meeting 2016.

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
Yes, we can! Some observations on library linked data.
LEVERAGING THE DEEPER GRAPH (VIA QUERIES OR PATTERNS) STEVEN FOLSOM PAOLO CICCARESE LD4L USE CASE 4.
RDF Tutorial.
Semantic Web Introduction
RDA AND LINKED DATA: MOVING BEYOND THE RULES Jenn Riley Head, Carolina Digital Library and Archives The University of North Carolina at Chapel Hill.
RDA: A New Standard Supporting Resource Discovery Presentation given at the CLA conference session The Future of Resource Discovery: Promoting Resource.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
U of R eXtensible Catalog Team MetaCat. Problem Domain.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Linked Data for Libraries (LD4L) CUL Metadata Working Group May 15, 2015.
Leveraging Names with Linked Data Karen Smith-Yoshimura Ralph LeVan 2010 RLG Partnership Annual Meeting Chicago, IL 9 June 2010.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
RDA and Linked Data Steve Henry University of Maryland March 2, 2013.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
RDA data and applications Gordon Dunsire Presented to staff of the British Library, Boston Spa, 20 Mar 2014.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Linked data the next network?. The Web of documents is for people The Web of data is for computers The Web of documents is difficult for computers to.
Not Just For Data Geeks! A Practical Approach to Linked Data for Digital Library Managers Cory Lampert and Silvia Southwick Salt Lake City October 9, 2013.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Open Data Protocol * Han Wang 11/30/2012 *
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
Taking Action: Linked Data for Digital Library Managers Silvia Southwick and Cory Lampert UNLV Digital Collections American Library Association Annual.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
LINKED DATA AND RDA: LOOKING TOWARD NEXT GENERATION CATALOGING Jenn Riley Head, Carolina Digital Library and Archives Digital Discussions series Twitter:
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
All the Reasons to be a Fan of PCC's Strategic Directions Shifting from Authorities to People, Places, Events, Awards… Steven Folsom | Metadata.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
RELATORS, ROLES AND DATA… … similarities and differences.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Linked Data: Emblematic applications on Legacy Data in Libraries.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
BI Practice March-2006 COGNOS 8BI TOOLS COGNOS 8 Framework Manager TATA CONSULTANCY SERVICES SEEPZ, Mumbai.
Linked Data Best Practices and BibFrame December 15 th, 2015 Rob Sanderson (google doc) CNI 2015 F ALL F ORUM.
DBpedia - A Crystallization Point
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
Presenting Semantic Data Through “Instance Hubs” Using Authoritative URI Design Schemes Alexei Bulazel 1 ( ), Dominic Difranzo 1 (
LINKED DATA PILOT PROJECT AT SYRACUSE UNIVERSITY LIBRARIES Sarah Theimer & Brian Dobreski Acquisitions and Cataloging Syracuse University Libraries.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
MARC Tags to BIBFRAME Vocabulary: a new view of metadata Sally McCallum Library of Congress ALA - January 2014.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Xiaoli Li Co-head of Content Support Services
Linked Data Web that can be processed by machines
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Presented at Archives Records 2016, session 510
BIBFRAME at the Library of Congress
Digging into Linked Data: Perspectives from the Long Tail
An ecosystem of contributions
PREMIS Tools and Services
LOD reference architecture
Linked Data Ryan McAlister.
Presentation transcript:

Practical Application of Linked Data Music Library Association Annual Meeting 2016

Today’s Speakers Kimmy Karen Steven James Soe

Music Library Association Annual Meeting March 4, 2016 Practical Applications of Linked Data Are We There Yet? Kimmy Szeto

Metadata Building Blocks Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Data Model Relating Resources and Metadata key-value, RDF, etc. Content Rules Extracting information AACR2, RDA, CCO, DACS, etc. Schema Organizing the Information MARC, ONIX, DC, EAD, etc. Exchange Query, Retrieval, Transmission Z39.50, SRU, SQL, SPARQL, etc. Serialization Notating the Structured Information ISO 2709, XML, JSON, Turtle, etc.

Tim Berners-Lee Linked Open Data Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Use URI as identifiers Use HTTP URIs for look up Use standards (RDF/SPARQL) Link to other URIs Link statements to form trees and networks b.info/roles/ authorWork orities/names/n b.info/Elemen tsGr2/nameO fThePerson “1913” b.info/Eleme ntsGr2/date OfBirth “The holy sonnets of John Donne” info/Elements/ preferredTitleF orTheWork “Britten, Benjamin” subject object predicate resource value property

Linked Open Data Building Blocks Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Data Model Schema Records RDF Statements Content Rules Free Text Data + URI Exchange Open Standard Serialization Open Standard Bibframe

Mix and Match Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 schema.org Bibframe foaf LD4L MusicOntology Music Event DBpedia

Linked Jazz: The Data Sessions

Backgroun d Art Kane, A Great Day in Harlem, 1958

Backgroun d Emmett BerryLawrence Brown Marian McPartland Sonny Rollins Thelonious Monk Mary Lou Williams Art Kane, A Great Day in Harlem, 1958

Backgroun d The constellation of people represented in the Linked Jazz network

Leveraging linked data combine and extend relationships Backgroun d Representing relationships in jazz Analyzing linked data extracted RDF to express Sharing our data new ways

Process & Tools What resources do we use to define relationships? How is our linked data generated?

Process & Tools Interview with Vi Redd by Monk Rowe, 1999 Hamilton College Jazz Archive Transcripts of oral histories from jazz collections around the country

Process & Tools Transcript Analyzer (TA) for machine-assisted identification and reconciliation of name entities Interview with Mary Lou Williams by John S. Wilson, 1973 Rutgers, Institute of Jazz Studies

Process & Tools Name control: LOD resources

Process & Tools Name mapping allows the relationship to be automatically established in RDF Mary Lou Williamstalks about a Buck. talks about < < <

Process & Tools Relationships derived from all transcripts visualized in an interactive network graph A new way to explore resources…

Process & Tools Linked Jazz triple data: Under the Hood

Process & Tools Dynamic Ego Networks

Process & Tools Interview with Buster Williams by Monk Rowe, 2002 Hamilton College Jazz Archive

Process & Tools “…Betty Carter who always sang out of tune but you still better play in tune… It was sort of like an art form for her.” “…she was the consummate canned heat. She was like a can of Sterno. You open it up and you put a flame to it and you get this beautiful blue flame.” Interview with Buster Williams by Monk Rowe, 2002 Hamilton College Jazz Archive

Service s Ways we provide access to our data: API NetworkGraphs SPARQL

Service s Dereferencing Pages for Name Entities

Service s Dereferencing Pages for Name Entities assigned URI in the Linked Jazz namespace two musicians mentioned him: “is rel:knowsOf of 2 resources”

LOD Experiments What does our Linked Open Data enable us and other usersto do?

LOD Experiments Interlink our data with other LOD datasets to build custom datasets

LOD Experiments Experimental interlinking examples

LOD Experiments Experimental interlinking examples Exploratory network visualization of musicians in Linked Jazz transcript data and Carnegie Hall performance data For more information: by Molly Reese-Lerner and Hannah Sistrunk

LOD Experiments Enrich our entities with attributes from other LOD resources to create new ways to understand the data

LOD Experiments Using Linked Data to write loops to query gender data from other LOD resources Toshiko Akiyoshi owl:sameAs (‘zitgist’) dcterms:subject dbpedia-owl:viafId F F “a” = F For more information:

LOD Experiments Storing the queried data for evaluation and use

LOD Experiments New view of network through a gender lens

LOD Experiments Roy Haynes’ transcript visualized with gender encoding

LOD Experiments Mary Lou Williams’ transcript visualized with gender encoding

Crowdsourcing semantic refinement of relationships on 52nd Street Publishing the Linked Jazz ontology Interlinking with other resource types: Tulane University jazz photo collections William P. Gottlieb collection, Library of Congress Adding new attributes to our dataset, e.g. “instrument”, “date of birth”, “place of death” Ongoing and Future Projects Direction s

Find us at: linkedjazz.orglinkedjazz.org

STEVEN FOLSOM HARVARD LIBRARY CORNELL UNIVERSITY LIBRARY) MUSIC LIBRARY ASSOCIATION, 2016 ANNUAL MEETING LINKING HIP HOP PARTY AND EVENT FLYERS TO THE SEMANTIC WEB

DISCLAIMER Image credit:

ABOUT THE HIP HOP FLYERS All images of Hip Hop Flyers in this presentation are courtesy of the Cornell Hip Hop Collection

LD4L

USE CASE 4 The essence of this use case is making use of complex graph relationships via queries or patterns (rather than direct connections) to allow discovery that would not be possible without the semantics of different relationships between items and types of items included in the graph. User stories and demonstrations will be somewhat tied to available data because detailed information and relationships will not be available for all resources.

PILOT: LINKING HIP HOP FLYER METADATA TO MUSICBRAINZ/LINKEDBRAINZ DATA Model non-MARC metadata from Cornell Hip Hop Flyer Collection to RDF Test BIBFRAME for describing the flyers Test the use of other ontologies for describing other entities, e.g. events, venues (more on this in a moment) Use of LinkedBrainz URIs for performers to discover relationships to other entities to discover relationships to other entities… (On and on to da break of dawn)

HIP HOP FLYER METADATA

ONTOLOGY DECISIONS Describe the flyer in BIBFRAME, extend where needed Used Getty AAT W orktypes to create bf:Work sub-classes Describe events and related entities using MusicOntology, Event Ontology and Schema.org Use foaf:Person’s to reflect RWO persons, with bf:Person as an associated authority Same pattern for other bf:Authority subclasses

ONTOLOGY DECISIONS: BIBFRAME FOR FLYERS

ONTOLOGY DECISIONS: FOAF FOR PERSONS

ONTOLOGY DECISIONS: EVENTS AND PERFORMERS

MUSICBRAINZMUSICBRAINZ AND LINKED BRAINZ

TYING THIS TO EXTERNAL GRAPHS When we have a MusicBrainz URI for instances of mo:MusicArtist we can query for relationships to other entities and properties of these new entities.

BRUTE FORCE RECONCILIATION Normalized Labels using Open Refine Manually searched for MusicBrainz for entries for a subset of literals (many of these were derivations for the same performer) Found roughly 250 URL’s for entries in MusicBrainz Ultimately surfacing 115 unique corresponding LinkedBrainz URIs for the proof of concept

PULLING DATA FROM LINKEDBRAINZ.ORG CONSTRUCT { ?s ?p1 ?o1. ?o1 ?p2 ?o2. } WHERE { ?s ?p1 ?o1. ?o1 ?p2 ?o2. FILTER ( ?s = ) # Eliminate guid property FILTER ( ?p1 != ) FILTER ( ?p2 != ) # Eliminate Tracks FILTER ( NOT EXISTS { ?o1 a.} ) FILTER (NOT EXISTS { ?o2 a.} ) }

LINKEDBRAINZ.ORG CONTINUED { [ { " " : [ " b68554c98#_" ], " : [ " ], " : [ " a4ac-3fb8-9c29-fdaf1c429212#_", " " " ] }, { " : [ " ], " : [ { : "United States" } ], " : [ " " ], " : [ { : "United States" } ] }, { " bfd56daad006#_", : [ " ], " : [ { : "Def Jam / Cold Chillin\u2019 in the Spot", : " } ], “

MAPPING METADATA TO RDF USING ISI’S KARMA

RECONCILING MO:RELEASE WITH BF:AUDIO, ETC.

REMAINING WORK IN FEBRUARY 2015 Continue Metadata clean up and RDF conversion Post Processing More Reconciliation Add data to a visualization/ discovery layer

2015 TAKEAWAYS FROM FLYERS PILOT Able to map large parts of our metadata to RDF using multiple ontologies to discover more relationships to more entities Largely predicated on manual workflows for preprocessing, URI lookups, and unstable software for RDF creation Need more URI’s, for both linking to and linking from in order to take advantage of queries and patterns Yes it is possible to describe flyers and related entities using BIBFRAME 1.0, but do we want to…

ONE YEAR LATER... Image credit:

A YEAR LATER: STATUS UPDATE Largely dormant because focus turned to: LD4L Ontology/BIBFRAME 2.0 BIBFRAME to LD4L Ontology Post- processing LD4L has made some decisions on local URIs and Infrastructure **New Metadata Librarian** focusing on batch remediation, interoperability, reconciliation at Cornell

A YEAR LATER: REVISITING THE PROCESS LinkedBrainz- Efforts are being made to improve performance, but… A possible Side B: LOD Laundromat Laundry Basket Wardrobe Analytics LOTUS

A YEAR LATER: KARMA Karma is still great if you can get it installed ISI has implemented a Virtual Box option for work with Karma Side B: Considering how to make a business case for “best of breed” Converters to RDF Infrastructures that go beyond pilot

PARAPHRASING JIM HENDLER (WHO MIGHT BEEN PARAPHRASING SOMEONE ELSE) Saying that we can do the same things, only now it’s more difficult… Isn’t much of a sales pitch. With core ontology decisions decide we can now build/adapt tools that make it easier. Better RDF Converters Better RDF Reconciliation Better RDF Native Cataloging Tools *Actually meet uses cases previously unmet!*

Image credit: Discogs