CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

Aligning BIBFRAME with The Schema/Bib Extend model
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
RDA and the semantic Web Lectio magistralis in Library Science by Gordon Dunsire Florence University, Florence, Italy 4th March, 2014.
Linked Data, Discovery and Discoverability John McCullough Senior Product Manager, OCLC December 3, 2014 UCL Discovery and Discoverability.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
The Virtual International Authority File Thomas Hickey ACIG 2009 July 12 ALA, Chicago IL.
Forging Links & Breaking Shackles The Linked Open Data BNB Brenda Young Metadata Systems Manager.
Corey A Harper DC2006 October 4, 2006 Authority Control for the Semantic Web Encoding Library of Congress Subject Headings (LCSH) in SKOS.
The Web of data with meaning... By Michael Griffiths.
Information and Business Work
Linked Open Data: Opportunities & Barriers for Archives Adrian Stevenson LOCAH Project Manager UKOLN, University of Bath, UK Archives 360, Society of American.
Highs and Lows of Library Linked Data Adrian Stevenson UKOLN, University of Bath, UK (until end Dec 2011) Mimas, Libraries and Archives Team, University.
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
A Registry for controlled vocabularies at the Library of Congress
Leveraging Names with Linked Data Karen Smith-Yoshimura Ralph LeVan 2010 RLG Partnership Annual Meeting Chicago, IL 9 June 2010.
OCLC Research Library Partners, Works in Progress Series, 12 August 2015 Looking inside the Library Knowledge Vault Bruce Washburn Consulting Software.
Publishing the British National Bibliography as Linked Open Data Corine Deliot Metadata Standards Analyst British Library CIG Event Birmingham, 25 November.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Multilingual Issues in the Representation of International Bibliographic Standards for the Semantic Web Gordon Dunsire Independent Consultant; Chair of.
Is Semantic Web Our Future? Computers in Libraries Conference 2012 March 21-23, 2012 Hilton Washington Washington, DC Sharon Q. Yang, Rider University,
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Use cases Gordon Dunsire. UC: Bibliographic network +Identification and deduplication of library records +Regional catalogue +Data BNF +*Community Information.
TOOLS FOR LLD Vocabularies, linking, and application programming.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
RDA and Linked Data by Gordon Dunsire National Seminar, National Library of Finland, Helsinki, Finland, 25 March 2014.
1 Bibliothèque nationale de France use case Pauline Chougnet ISNI AGM October 2014.
VIAF (Virtual International Authority File) Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web Dr. Barbara B.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Virtual International Authority File – introduction & implications Basil Dewhurst Project Manager, ARDC Party Infrastructure Project | National Library.
OCLC Research: Selected projects Eric Childress Larry Olszewski Presentation for Dpto. Biblioteconomía y Documentación Universidad Carlos III de Madrid.
Moving from a locally-developed data model to a standard conceptual model Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Jennifer Bowen, University of Rochester ALA Annual Conference, 2009, Chicago, Illinois 1 Defining Linked Data for the eXtensible Catalog (XC): Metadata.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Publishing the British National Bibliography as Linked Open Data
Linked Data: Emblematic applications on Legacy Data in Libraries.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
UC San Diego Library: Where we are with Linked Data Arwen Hutt.
Dewey.info Update Dewey Breakfast/Update ALA Midwinter January 16, 2010 Michael Panzer.
San Juan, Puerto Rico (21 October 2015) RDA, Linked Data, BIBFRAME Eric Childress Consulting Project Manager OCLC Membership & Research.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
San Juan, Puerto Rico (21 October 2015) VIAF Hispánica Eric Childress Consulting Project Manager OCLC Membership & Research.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair Vienna,
ADLUG Roma (Italy) What is known must be shared Building on the insights from OCLC Research.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Thomas Hickey Chief Scientist, OCLC Research 2015 August VIAF Council State of VIAF VI AF.
| Barbara Pfeifer | VIAF workshop Strasbourg | VIAF partners: Deutsche Nationalbibliothek (DNB) Barbara Pfeifer.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
PREPARING FOR LINKED DATA IN DIGITAL REPOSITORIES Sai Deng, University of Central Florida Libraries ACRL Technical Services Interest Group ALA.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
The world’s libraries. Connected. Linked Data A View of OCLC’s Strategy Ted Fons Executive Director, Data Services,& WorldCat Quality ALA Annual Conference,
CASEY A. MULLIN WITH: LALA HAJIBAYOVA SCOTT MCCAULAY DECEMBER 8, 2008 FRBR in RDF: a proof-of-concept model 1 ©2008 Casey A. Mullin.
Making Connections Creating Linked Open Data Neil Wilson Head, Collection Metadata UKSG Webinar June
Shrinking the silo boundary: data and schema in the Semantic Web Gordon Dunsire Presented at AKM 16, Poreč, 2012.
Applications of IFLA Namespaces
What’s changed in linked data implementations in the last three years?
PREMIS Tools and Services
Linked data implementations—
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
JISC Information Environment Service Registry (IESR)
Presentation transcript:

CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research

International Linked Data Surveys for Implementers

Number of institutional responses Both 29

Geographic breakdown of 90 responding institutions 20 countries represented

Responding institutions by type

2015 responding institutions by type

Not yet in production3727 Less than one year1913 More than one year, less than two years 1012 More than two years4624 How long linked data project or service in production Total112 76

Consume linked data 3825 Publish linked data 104 Both consume & publish 6447 How linked data is used

Reasons for publishing linked data Expose to larger audience on the Web6745 Demonstrate what could be done with datasets as linked data5941 Heard about linked data and wanted to try it out by exposing our data as linked data.4321 See if publishing linked data would improve our Search Engine Optimization (SEO.)299

Types of data published as linked data

SOME EXAMPLES IN PRODUCTION

North Rhine-Westphalian Library Service Center

Barriers to publishing linked data2015 Steep learning curve for staff40 Inconsistency in legacy data33 Selecting appropriate ontologies to represent our data31 Establishing the links27 Little documentation or advice on how to build the systems21

Reasons for consuming linked data Provide our users with a richer experience.5135 Enhance our own data by consuming linked data from other sources.5037 More effective internal metadata management.3216 Greater accuracy and scope in our search results2712 See if consuming linked data would improve our Search Engine Optimization (SEO).1912 Experiment with combining different types of data into a single triple store.1715 Heard about linked data and wanted to try it out by using linked data sources.1713

2015 linked data sources most consumed2015 VIAF (Virtual International Authority File)41 DBpedia36 GeoNames35 id.loc.gov35 Resources we convert to linked data ourselves17 Getty's AAT16 FAST (Faceted Application of Subject Terminology)15 WorldCat.org15 data.bnf.fr 12 Deutsche National Bib Linked Data Service12

PROFILES OF MOST CONSUMED SOURCES CITED

VIAF Combines multiple name authority files into a single OCLC-hosted name authority service. More than 100,000 requests/day Size: 500 million – 1 billion triples Consumes: GeoNames id.loc.gov ISNI Wikidata WorldCat.org WorldCat.org Works RDF Vocabularies/Ontologies: Bibliographic Ontology Dublin Core & DC Terms FOAF Owl 2 Web ontology RDF schema Schema.org SKOS

id.loc.gov Enables developers to interact with vocabularies found in data & standards promulgated by LC as linked data. More than 100,000 requests/day Size: 100 million – 500 million triples Consumes: AGROVAC data.bnf.fr DNB’s Linked Data Service id.loc.gov VIAF Wikidata WorldCat.org Works Resources we convert to linked data ourselves RDF Vocabularies/Ontologies: BibFrame FOAF MADS/RDF RDF schema SKOS

Getty’s AAT A structured vocabulary for generic concepts related to art and architecture. More than 100,000 requests/day Size: 10 million – 50 million triples Consumes: None RDF Vocabularies/Ontologies: Bibliographic Ontology Dublin Core & DC Terms FOAF Local vocabulary Owl 2 Web ontology language RDF schema SKOS

FAST Adapts LC Subject Headings with a simplified syntax to retain LCSH’s rich vocabulary while making the schema easier to understand, control, apply and use. 10,000 – 50,000 requests/day Size: 10 million – 50 million triples Consumes: DBpedia GeoNames id.loc.gov VIAF RDF Vocabularies/Ontologies: Dublin Core & DC Terms FOAF Schema.org SKOS WSGS84 Geo Positioning

WorldCat.org OCLC has made WorldCat.org bibliographic metadata experimentally available in linked data form. More than 100,000 requests/day Size: 15 billion triples Consumes: DBpedia FAST VIAF WorldCat.org RDF Vocabularies/Ontologies: Dublin Core FOAF Schema.org SKOS

data.bnf.fr Make the data produced by the Bibliothèque nationale de France more useful on the Web. 10,000 – 50,000 requests/day Size: 100 million – 500 million triples Consumes: AGROVAC data.bnf.fr DBpedia DNB’s Linked Data Service GeoNames id.loc.gov ISNI VIAF (+ others) RDF Vocabularies/Ontologies: Bibliographic Ontology Biographical Ontology Dublin Core & DC Terms FOAF FRBR ISNI Music Ontology OAI ORE Terms Owl 2 Web ontology RDA RDF schema SKOS WSGS84 Geo Positioning …

DNB’s Linked Data Service Publishes authority and bibliographic data in RDF to make the data accessible to the semantic Web community with no need to know library-specific metadata schemes. Size: 100 million – 500 million triples Consumes: None RDF Vocabularies/Ontologies: Bibliographic Ontology Dublin Core & DC Terms FOAF ISBD Owl 2 Web ontology language RDA RDF schema SKOS

Barriers to consuming linked data2015 Matching, disambiguating and aligning source data and linked data resources23 Mapping of vocabulary17 What's published as linked data is not always reusable or lacks URIs16 Lack of authority control15 Datasets not being updated14 Size of RDF dumps12 Understanding how data is structured before using it12

What would you do differently?2015 Have more time allocated for its development38 Would do nothing differently30 Get more staff28 Get wider organizational support23 Have more realistic expectations12

Focus on what you want to achieve, not technical stuff. Build on what you have that others don’t. Pick a problem you can solve. Model data that solves your use cases. Consider legal issues from the beginning. Read as widely as possible, consult community experts. Have a good understanding of linked data structure, available ontologies and your own data. Strive for long-term data reconciliation & consolidation. Involve your institution/community. Experiment and start small. Start now! Just do it! Advice from the implementers

Full details of responses linked-data-implementers-survey-2014.xlsx

SM Together we make breakthroughs possible. Thank you! Contact: Karen Smith-Yoshimura CNI Spring 2016 Membership Meeting, San Antonio TX 4 April ©2016 OCLC. This work is licensed under a Creative Commons Attribution 4.0 International License. Suggested attribution: “This work uses content from Linked Data Implementations—Who, What and Why? © OCLC, used under a Creative Commons Attribution 4.0 International License: