Presentation is loading. Please wait.

Presentation is loading. Please wait.

Practical Application of Linked Data Music Library Association Annual Meeting 2016.

Similar presentations


Presentation on theme: "Practical Application of Linked Data Music Library Association Annual Meeting 2016."— Presentation transcript:

1 Practical Application of Linked Data Music Library Association Annual Meeting 2016

2 Today’s Speakers Kimmy Szetoks287@columbia.eduks287@columbia.edu Karen Hwangkhwang3@pratt.edukhwang3@pratt.edu Steven Folsomsteven_folsom@harvard.edusteven_folsom@harvard.edu James Soe Nyunjsoenyun@ucsd.edujsoenyun@ucsd.edu

3 Music Library Association Annual Meeting March 4, 2016 Practical Applications of Linked Data Are We There Yet? Kimmy Szeto

4 Metadata Building Blocks Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Data Model Relating Resources and Metadata key-value, RDF, etc. Content Rules Extracting information AACR2, RDA, CCO, DACS, etc. Schema Organizing the Information MARC, ONIX, DC, EAD, etc. Exchange Query, Retrieval, Transmission Z39.50, SRU, SQL, SPARQL, etc. Serialization Notating the Structured Information ISO 2709, XML, JSON, Turtle, etc.

5 Tim Berners-Lee Linked Open Data Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Use URI as identifiers Use HTTP URIs for look up Use standards (RDF/SPARQL) Link to other URIs Link statements to form trees and networks http://lccn.loc.gov/200 4268202 http://rdvoca b.info/roles/ authorWork http://id.loc.gov/auth orities/names/n96042 036 http://rdvoca b.info/Elemen tsGr2/nameO fThePerson “1913” http://rdvoca b.info/Eleme ntsGr2/date OfBirth “The holy sonnets of John Donne” http://rdvocab. info/Elements/ preferredTitleF orTheWork “Britten, Benjamin” subject object predicate resource value property

6 Linked Open Data Building Blocks Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 Data Model Schema Records RDF Statements Content Rules Free Text Data + URI Exchange Open Standard Serialization Open Standard Bibframe

7 Mix and Match Kimmy Szeto Practical Applications of Linked Data: Are We There Yet? MLA Annual Meeting March 4, 2016 schema.org Bibframe foaf LD4L MusicOntology Music Event DBpedia

8 Linked Jazz: The Data Sessions

9 Backgroun d Art Kane, A Great Day in Harlem, 1958

10 Backgroun d Emmett BerryLawrence Brown Marian McPartland Sonny Rollins Thelonious Monk Mary Lou Williams Art Kane, A Great Day in Harlem, 1958

11 Backgroun d The constellation of people represented in the Linked Jazz network

12 Leveraging linked data combine and extend relationships Backgroun d Representing relationships in jazz Analyzing linked data extracted RDF to express Sharing our data new ways

13 Process & Tools What resources do we use to define relationships? How is our linked data generated?

14 Process & Tools Interview with Vi Redd by Monk Rowe, 1999 Hamilton College Jazz Archive Transcripts of oral histories from jazz collections around the country

15 Process & Tools Transcript Analyzer (TA) for machine-assisted identification and reconciliation of name entities Interview with Mary Lou Williams by John S. Wilson, 1973 Rutgers, Institute of Jazz Studies

16 Process & Tools Name control: LOD resources

17 Process & Tools Name mapping allows the relationship to be automatically established in RDF Mary Lou Williamstalks about a Buck. talks about <http://dbpedia.org/ <http://dbpedia.org/resource/Mary_Lou_Williams> <http://purl.org/vocab/relationship/

18 Process & Tools Relationships derived from all transcripts visualized in an interactive network graph https://linkedjazz.org/network/ A new way to explore resources…

19 Process & Tools Linked Jazz triple data: https://linkedjazz.org/api/people/all/nthttps://linkedjazz.org/api/people/all/nt Under the Hood

20 Process & Tools Dynamic Ego Networks

21 Process & Tools Interview with Buster Williams by Monk Rowe, 2002 Hamilton College Jazz Archive

22 Process & Tools “…Betty Carter who always sang out of tune but you still better play in tune… It was sort of like an art form for her.” “…she was the consummate canned heat. She was like a can of Sterno. You open it up and you put a flame to it and you get this beautiful blue flame.” Interview with Buster Williams by Monk Rowe, 2002 Hamilton College Jazz Archive

23 Service s Ways we provide access to our data: API https://linkedjazz.org/api/ NetworkGraphs SPARQL https://linkedjazz.org/sparql/

24 Service s Dereferencing Pages for Name Entities

25 Service s Dereferencing Pages for Name Entities assigned URI in the Linked Jazz namespace two musicians mentioned him: “is rel:knowsOf of 2 resources”

26 LOD Experiments What does our Linked Open Data enable us and other usersto do?

27 LOD Experiments Interlink our data with other LOD datasets to build custom datasets

28 LOD Experiments Experimental interlinking examples

29 LOD Experiments Experimental interlinking examples Exploratory network visualization of musicians in Linked Jazz transcript data and Carnegie Hall performance data For more information: http://pfch.nyc/linked_jazz_meets_carnegie_hall/http://pfch.nyc/linked_jazz_meets_carnegie_hall/ by Molly Reese-Lerner and Hannah Sistrunk

30 LOD Experiments Enrich our entities with attributes from other LOD resources to create new ways to understand the data

31 LOD Experiments Using Linked Data to write loops to query gender data from other LOD resources Toshiko Akiyoshi http://dbpedia.org/resource/Toshiko_Akiyoshi owl:sameAs (‘zitgist’) dcterms:subject dbpedia-owl:viafId F F “a” = F For more information: https://linkedjazz.org/enriching-the-linked-jazz-name-list-with-gender-information/

32 LOD Experiments Storing the queried data for evaluation and use

33 LOD Experiments New view of network through a gender lens

34 LOD Experiments Roy Haynes’ transcript visualized with gender encoding

35 LOD Experiments Mary Lou Williams’ transcript visualized with gender encoding

36 Crowdsourcing semantic refinement of relationships on 52nd Street https://linkedjazz.org/52ndStreet/ https://linkedjaz.org/52ndStreet/ Publishing the Linked Jazz ontology Interlinking with other resource types: Tulane University jazz photo collections William P. Gottlieb collection, Library of Congress Adding new attributes to our dataset, e.g. “instrument”, “date of birth”, “place of death” Ongoing and Future Projects Direction s

37 Find us at: linkedjazz.orglinkedjazz.org

38 STEVEN FOLSOM HARVARD LIBRARY (PREVIOUSLY @ CORNELL UNIVERSITY LIBRARY) MUSIC LIBRARY ASSOCIATION, 2016 ANNUAL MEETING LINKING HIP HOP PARTY AND EVENT FLYERS TO THE SEMANTIC WEB

39 DISCLAIMER Image credit: https://systemcenterguru.wordpress.com/disclaimer/

40 ABOUT THE HIP HOP FLYERS All images of Hip Hop Flyers in this presentation are courtesy of the Cornell Hip Hop Collection

41 LD4L

42 USE CASE 4 The essence of this use case is making use of complex graph relationships via queries or patterns (rather than direct connections) to allow discovery that would not be possible without the semantics of different relationships between items and types of items included in the graph. User stories and demonstrations will be somewhat tied to available data because detailed information and relationships will not be available for all resources.

43 PILOT: LINKING HIP HOP FLYER METADATA TO MUSICBRAINZ/LINKEDBRAINZ DATA Model non-MARC metadata from Cornell Hip Hop Flyer Collection to RDF Test BIBFRAME for describing the flyers Test the use of other ontologies for describing other entities, e.g. events, venues (more on this in a moment) Use of LinkedBrainz URIs for performers to discover relationships to other entities to discover relationships to other entities… (On and on to da break of dawn)

44 HIP HOP FLYER METADATA

45 ONTOLOGY DECISIONS Describe the flyer in BIBFRAME, extend where needed Used Getty AAT W orktypes to create bf:Work sub-classes Describe events and related entities using MusicOntology, Event Ontology and Schema.org Use foaf:Person’s to reflect RWO persons, with bf:Person as an associated authority Same pattern for other bf:Authority subclasses

46 ONTOLOGY DECISIONS: BIBFRAME FOR FLYERS

47 ONTOLOGY DECISIONS: FOAF FOR PERSONS

48 ONTOLOGY DECISIONS: EVENTS AND PERFORMERS

49 MUSICBRAINZMUSICBRAINZ AND LINKED BRAINZ

50 TYING THIS TO EXTERNAL GRAPHS When we have a MusicBrainz URI for instances of mo:MusicArtist we can query for relationships to other entities and properties of these new entities.

51 BRUTE FORCE RECONCILIATION Normalized Labels using Open Refine Manually searched for MusicBrainz for entries for a subset of literals (many of these were derivations for the same performer) Found roughly 250 URL’s for entries in MusicBrainz Ultimately surfacing 115 unique corresponding LinkedBrainz URIs for the proof of concept

52 PULLING DATA FROM LINKEDBRAINZ.ORG CONSTRUCT { ?s ?p1 ?o1. ?o1 ?p2 ?o2. } WHERE { ?s ?p1 ?o1. ?o1 ?p2 ?o2. FILTER ( ?s = ) # Eliminate guid property FILTER ( ?p1 != ) FILTER ( ?p2 != ) # Eliminate Tracks FILTER ( NOT EXISTS { ?o1 a.} ) FILTER (NOT EXISTS { ?o2 a.} ) }

53 LINKEDBRAINZ.ORG CONTINUED { "@graph": [ { "@id": "http://musicbrainz.org/artist/c9378ced-9e63-4edc-ab37-35bde1062a32#_", "http://xmlns.com/foaf/0.1/based_near" : [ "http://musicbrainz.org/area/489ce91b- 6658-3307-9877-795b68554c98#_" ], "http://purl.org/ontology/mo/member_of" : [ "http://musicbrainz.org/artist/73046026-6228-41a3-aa12-b3b796b491fa#_" ], "http://xmlns.com/foaf/0.1/made" : [ "http://musicbrainz.org/signal-group/3cfacddf- a4ac-3fb8-9c29-fdaf1c429212#_", "http://musicbrainz.org/release/396c4bd1-50d2-43d7-9149-bfd56daad006#_", "http://musicbrainz.org/release/9a019ed2-3daf-4024-94dc-fd8f24ed6a59#_", "http://musicbrainz.org/signal-group/40e49ce8-c8ee-3c2b-9861-6b521a092ba1#_" ] }, { "@id": "http://musicbrainz.org/area/489ce91b-6658-3307-9877-795b68554c98#_", "@type" : [ "http://www.w3.org/2003/01/geo/wgs84_pos#SpatialThing" ], "http://www.w3.org/2000/01/rdf-schema#label" : [ { "@value" : "United States" } ], "http://www.w3.org/2002/07/owl#sameAs" : [ "http://ontologi.es/place/US", "http://dbpedia.org/resource/United_States" ], "http://open.vocab.org/terms/sortLabel" : [ { "@value" : "United States" } ] }, { "@id": "http://musicbrainz.org/release/396c4bd1-50d2-43d7-9149- bfd56daad006#_", "@type" : [ "http://purl.org/ontology/mo/Release" ], "http://purl.org/dc/elements/1.1/title" : [ { "@value" : "Def Jam / Cold Chillin\u2019 in the Spot", "@type" : "http://www.w3.org/2001/XMLSchema#string" } ], “

54 MAPPING METADATA TO RDF USING ISI’S KARMA

55 RECONCILING MO:RELEASE WITH BF:AUDIO, ETC.

56 REMAINING WORK IN FEBRUARY 2015 Continue Metadata clean up and RDF conversion Post Processing More Reconciliation Add data to a visualization/ discovery layer

57 2015 TAKEAWAYS FROM FLYERS PILOT Able to map large parts of our metadata to RDF using multiple ontologies to discover more relationships to more entities Largely predicated on manual workflows for preprocessing, URI lookups, and unstable software for RDF creation Need more URI’s, for both linking to and linking from in order to take advantage of queries and patterns Yes it is possible to describe flyers and related entities using BIBFRAME 1.0, but do we want to…

58 ONE YEAR LATER... Image credit: www.homestudiocorner.com

59 A YEAR LATER: STATUS UPDATE Largely dormant because focus turned to: LD4L Ontology/BIBFRAME 2.0 BIBFRAME to LD4L Ontology Post- processing LD4L has made some decisions on local URIs and Infrastructure **New Metadata Librarian** focusing on batch remediation, interoperability, reconciliation at Cornell

60 A YEAR LATER: REVISITING THE PROCESS LinkedBrainz- Efforts are being made to improve performance, but… A possible Side B: LOD Laundromat Laundry Basket Wardrobe Analytics LOTUS

61 A YEAR LATER: KARMA Karma is still great if you can get it installed ISI has implemented a Virtual Box option for work with Karma Side B: Considering how to make a business case for “best of breed” Converters to RDF Infrastructures that go beyond pilot

62 PARAPHRASING JIM HENDLER (WHO MIGHT BEEN PARAPHRASING SOMEONE ELSE) Saying that we can do the same things, only now it’s more difficult… Isn’t much of a sales pitch. With core ontology decisions decide we can now build/adapt tools that make it easier. Better RDF Converters Better RDF Reconciliation Better RDF Native Cataloging Tools *Actually meet uses cases previously unmet!*

63 Image credit: Discogs


Download ppt "Practical Application of Linked Data Music Library Association Annual Meeting 2016."

Similar presentations


Ads by Google