Linked Library Data Modeling Metadata for the [Semantic] Web Presented 2010-11-19 Columbia University Digital Library Seminar Series Corey A Harper.

Slides:



Advertisements
Similar presentations
Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011
Advertisements

Initiatives to make standard library metadata models and structures available to the Semantic Web Gordon Dunsire, UK Mirna Willer,
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Linked Library Data Tuning Library Metadata for the [Semantic] Web Presented ALCTS RDA Webinar Series Corey A Harper.
From content standards to RDF Gordon Dunsire Presented at AKM 15, Porec, 2011.
Introduction to linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar Linked data and the Semantic Web: what have.
How to publish local metadata as linked data Gordon Dunsire Presented at Linked Open Data: current practice in libraries and archives (3rd Linked Open.
An introduction to RDF and library linked data Gordon Dunsire Presented at the Dewey Decimal Classification Executive Briefing 15 Sep 2011, London.
RDA and the semantic Web Lectio magistralis in Library Science by Gordon Dunsire Florence University, Florence, Italy 4th March, 2014.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
RDF AND LINKED DATA Jenn Riley Head, Carolina Digital Library and Archives The University of North Carolina at Chapel Hill.
RDA AND LINKED DATA: MOVING BEYOND THE RULES Jenn Riley Head, Carolina Digital Library and Archives The University of North Carolina at Chapel Hill.
Linked Library Data Miiya Holmes October 6-7, 2012.
IFLA Namespaces Gordon Dunsire Chair, IFLA Namespaces Technical Group Session 204 — IFLA library standards and the IFLA Committee on Standards – how can.
Corey A Harper DC2006 October 4, 2006 Authority Control for the Semantic Web Encoding Library of Congress Subject Headings (LCSH) in SKOS.
LODLAM Presented at ELUNA 2014 by Corey A Harper Current Trends, Tools & Techniques, and the Role of Vendors.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Structures and Standards for Our Bibliographic Future Diane I. Hillmann Research Librarian Cornell University Library.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
© 2006 DCMI DC-2006 – International Conference on Dublin Core and Metadata Applications 3-6 October 2006 Thomas Baker Dublin Core Metadata Initiative.
National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.
Exposing the University of Economics‘ academic bibliography database as linked data Jitka Hladká, University of Economics, Prague Jindřich Mynarz,
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
Context and Prosopography: Putting the 'Archives' Into LOD-LAM Corey A Harper SAA MDOR
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Is Semantic Web Our Future? Computers in Libraries Conference 2012 March 21-23, 2012 Hilton Washington Washington, DC Sharon Q. Yang, Rider University,
RDA data and applications Gordon Dunsire Presented to staff of the British Library, Boston Spa, 20 Mar 2014.
RDA and Linked Data by Gordon Dunsire National Seminar, National Library of Finland, Helsinki, Finland, 25 March 2014.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Linked data the next network?. The Web of documents is for people The Web of data is for computers The Web of documents is difficult for computers to.
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
LINKED DATA AND RDA: LOOKING TOWARD NEXT GENERATION CATALOGING Jenn Riley Head, Carolina Digital Library and Archives Digital Discussions series Twitter:
Creating an Application Profile Tutorial 3 DC2004, Shanghai Library 13 October 2004 Thomas Baker, Fraunhofer Society Robina Clayphan, British Library Pete.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
The Semantic Web and expert metadata: pull apart then bring together Presented at 12.seminar Arhivi, Knjižnice, Muzeji Nov 2008, Pore č, Croatia.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
RELATORS, ROLES AND DATA… … similarities and differences.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
RDA and Linked Data Gordon Dunsire Presented at Cita BNE - RDA and Linked Data, 15 April 2016, Madrid, Spain.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Review of the DCMI Abstract Model Thomas Baker, DCMI Joint Meeting of the DCMI Architecture Forum and W3C Library Linked Data Incubator Group 22 October.
RDA and Linked Data Gordon Dunsire Presented at Selmathon 1, 9 May 2016, Stockholm, Sweden.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Dublin Core Metadata Initiative Abstract Model
Authority Control for the Semantic Web
Introduction to Metadata
Lifecycle Metadata for Digital Objects
Applications of IFLA Namespaces
Cataloging the Internet
PREMIS Tools and Services
Some Options for Non-MARC Descriptive Metadata
RDA in a non-MARC environment
Presentation transcript:

Linked Library Data Modeling Metadata for the [Semantic] Web Presented Columbia University Digital Library Seminar Series Corey A Harper

Harper - Linked Library Data - Columbia University2 Topical Overview Semantic Web Intro Linked Open Data –Graphs: Entity – Attribute – Value –A Few Examples Library Data

Harper - Linked Library Data - Columbia University3 Topical Overview (cont) Linked Library Data –SKOS and Authority Control –FRBR and Bibliographic Data –National Libraries Resource Description and Access (RDA) Dublin Core Metadata Initiative

Harper - Linked Library Data - Columbia University4 Semantic Web TBL’s original vision –“Weaving the Web” – 1999 Then: Focus on Machine Reasoning –Scientific American Article Now: Focus on things & links –Reasoning becoming lower level

Harper - Linked Library Data - Columbia University5 Semantic Web Originally: –Metadata standard built on XML –Metadata about “Web” things Eventually: –Metadata about all things –Metadata about relationships between things

Harper - Linked Library Data - Columbia University6 Semantic Web Terminology Resource: Any thing Class: Abstraction of a type of thing Individual: An instance of a class Property: An attribute of an individual Ontology: A domain specific collection of classes and properties Statement/Triple: –A Resource (subject) - Nodes –A Property (predicate) - Arcs –A Value (object) - Nodes

Harper - Linked Library Data - Columbia University7 Semantic Web Terminology Graphs: Representations of statements about resources Nodes: The Subjects and Objects in a Graph Arcs: The Predicates in a Graph Literals: “Objects” represented as strings (constant values) rather than things (URI References) Domains and Ranges: Constraints on Nodes For Example…

Harper - Linked Library Data - Columbia University8

Harper - Linked Library Data - Columbia University9 RDF Resource Description Framework Formally Begun in 1999 Ideas from 1995 Finalized in 2004 Frighteningly complex at times… –“Directed Labeled Graphs”

Harper - Linked Library Data - Columbia University10 SemWeb Value Proposition Formally Modeled (Meta) Data Formal Semantics Declaration Increased Granularity compared to record-based Metadata Improved Interoperability

Harper - Linked Library Data - Columbia University11 “The vast bulk of data to be on the Semantic Web is already sitting in databases … all that is needed [is] to write an adapter to convert a particular format into RDF and all the content in that format is available.” - Tim Berners-Lee in an interview with the Consortium Standards Bulletin

Harper - Linked Library Data - Columbia University12 Linked Open Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things.

Harper - Linked Library Data - Columbia University13

Harper - Linked Library Data - Columbia University14

Harper - Linked Library Data - Columbia University15

Harper - Linked Library Data - Columbia University16 Linked Data Cloud Automated generation –Comprehensive Knowledge Archive Network (CKAN)Comprehensive Knowledge Archive Network (CKAN) –Vocabulary of Interlinked Datasets (voiD)Vocabulary of Interlinked Datasets (voiD) –Basically, catalog your metadata! Recent criticism: data quality

Harper - Linked Library Data - Columbia University17 Data in the Cloud Hubs in the May 2008 Version: –FOAF –DBPedia Myriad Sources coming online: –Thompson Reuters –New York Times –British Broadcasting Corporation –Google and Facebook –More and More Library Data –Geonames –MusicBrains

Harper - Linked Library Data - Columbia University18 DBpedia Structured Wikipedia Data Genres, Influences, External Links Multi-lingual / Multi-script labels Rich Semantics Many linkages to other datasets

Harper - Linked Library Data - Columbia University19 DBpedia 3.4 Million “things” described Ontology based on “infoboxes”Ontology –1.5 million things classified Approx. 50,000 “Properties” –Approx. 1,200 defined in ontology Brief Example

Harper - Linked Library Data - Columbia University20 Domain Modeling Starting from application / goal / function “To guide and evaluate our designs, we need objective criteria that are founded on the purpose of the resulting artifact, rather than based on a priori notions of naturalness or Truth.” – Gruber, 1993 Does this apply to Libraries? FRBRer?

Harper - Linked Library Data - Columbia University21 DBPedia Model Partial basis in data entry conventions InfoBox’s, and InfoBox Templates Metadata Entry Format Partial source of Ontology –Class Structure –Vocabulary Design

Harper - Linked Library Data - Columbia University22 DBpedia 3.4 Million “things” described Ontology based on “infoboxes” –1.5 million things classified – Approx. 50,000 “Properties” –Approx. 1,200 defined in ontology

Harper - Linked Library Data - Columbia University23

Harper - Linked Library Data - Columbia University24

Harper - Linked Library Data - Columbia University26 More Examples British Broadcasting Corporation –Programmes, Music, Wildlife Google Refine Data.gov and data.gov.uk NY Times

Harper - Linked Library Data - Columbia University27 What *things* are in our data???

Harper - Linked Library Data - Columbia University28 …Library data is extremely complicated

Harper - Linked Library Data - Columbia University29 Bibliographic Data Rich stores of MARC, MODS, &c. Robust Controlled Vocabularies –Subject Heading lists –Code lists –Thesauri Emerging data model in FR*

Harper - Linked Library Data - Columbia University30 Bibliographic Vocabs Bibliographic Ontology –Zotero, Omeka, EPrints and Others FRBR – unofficial –And now Official (Thank you IFLA!) ISBD

Harper - Linked Library Data - Columbia University31 Library Authority Data “Include links to other URIs. so that they can discover more things.” Short of providing and linking to URIs, this *is* authority data. This is what our authority files are for.

Harper - Linked Library Data - Columbia University32 Library Controlled Vocabularies: Benefits Reputation - Trusted Tradition Mature - Time tested and carefully developed General & Comprehensive - Cover large knowledge spaces

Harper - Linked Library Data - Columbia University33 SKOS Simple Knowledge Organization System Properties and Classes for describing Controlled Vocabulary RDF Page skos:primaryTopic skos:person

Harper - Linked Library Data - Columbia University34 LCSH in Dublin Core Encoding Scheme for DC Subject No easy way to draw on equivelent terms and cross-references Abstract Model, RDF and SKOS could enable applications to make use of the whole vocabulary

Harper - Linked Library Data - Columbia University35 LCSH as a Web Service! Uses principles of linked data lcsh.info -> id.loc.gov People noticed when taken down Links to French Subject Headings URIs for Literal String lookup Wide Web

Harper - Linked Library Data - Columbia University36

Harper - Linked Library Data - Columbia University37 Other Vocabularies Thesaurus for Economics French Subject Headings Swedish Subject Headings IconClass (not on web yet) OCLC Terminology Services Dewey Decimal Classification Virtual International Authority File

Harper - Linked Library Data - Columbia University38 Linked Library Data VIAF, LCSH, MARC Codes Open Library, XC, Kualli OLE Library of Congress, OCLC Hungarian, German, British, Swedish National Libraries Formalized Efforts: W3C, IFLA & RDA

Harper - Linked Library Data - Columbia University39 Kungliga Biblioteket Image courtesy of Martin Malmstemhttp://blog.libris.kb.se/semweb/?p=7

Harper - Linked Library Data - Columbia University40 National Széchényi Library “ Our RDFDC, FAOF and SKOS statements are linked together. Our name authority is matched with the DBPedia name files and URI aliases are handled as owl:sameAs statements.” - Adam Horvath

Harper - Linked Library Data - Columbia University41 W3C LLD XG “Incubator Group” Membership: –Researchers, Consultants, Librarians –National Libraries: Germany, France, LoC, Sweden –OCLC & IFLA

Harper - Linked Library Data - Columbia University42

Harper - Linked Library Data - Columbia University43 W3C LLD XG Goals Collecting, Curating and Clustering over 50 Use Cases Mining use cases for functional requirements and design patterns Recommendations to W3C –Should lead to Working Groups

Harper - Linked Library Data - Columbia University44 RDA Development RDA elements, roles and vocabularies have been provisionally registered IFLA FRBRer and ISBD elements and vocabularies have been officially registered Discussions about long term maintenance of both RDA and the vocabularies Effort to create multi-language RDA Vocabularies RDA Slides Adapted from Diane Hillmann

Harper - Linked Library Data - Columbia University45 RDA Elements Listing 334!

Harper - Linked Library Data - Columbia University46 RDA Elements Listing 334! Base material

Harper - Linked Library Data - Columbia University47 Detail: Base Material

Harper - Linked Library Data - Columbia University48 Detail: Base Material URI

Harper - Linked Library Data - Columbia University49 RDA Base Material Vocabulary

Harper - Linked Library Data - Columbia University50 RDA WEMI Relationships

Harper - Linked Library Data - Columbia University51 Detail: RDA WEMI Relationship

Harper - Linked Library Data - Columbia University52 Metadata Registries Formerly NSDL Registry –Now “Open Metadata Registry” –Managing Vocabularies –Providing Vocabulary Services DCMI Registry Community DCMI Architecture Forum

Harper - Linked Library Data - Columbia University53 DCMI and the Semantic Web Collaboration from the start Libraries (esp. OCLC) were at the table Perception of DCMI as DCMES –DCMI = Metedata Vocab / Framework –DCMES = Metadata Record Format

Harper - Linked Library Data - Columbia University54 DCMI and the Semantic Web Every example above had dcterms DCMI as Research Institute and Metadata Think Tank –Modeling Work –Metadata Registries –Application Profiles –Description Set Profiles –Singapore Framework

Harper - Linked Library Data - Columbia University55 Changing Role of DCMI Mike Bergman at DC2010: –Reference Metadata –Reference Concepts –Mapping Predicates “Mappings should be approximate” –Usage Guidelines Compliment to W3C Standards

Harper - Linked Library Data - Columbia University56 Why Does This Matter? Our descriptions no longer stand alone! Connect our data with the rest of the WEB Allow others to reuse more easily –FOAF –DBPedia –Geonames –MusicBrains –New York Times –Thompson Reuters –Government Data - data.gov –British Broadcasting Corporation

Harper - Linked Library Data - Columbia University57 Conclusions Distributed bibliographic control environment –Linking Data –Focus on identification over description “In short, by treating values as non- literal resources and assigning URIs to them we give ourselves (and others) the hooks on which to hang further descriptions.” - Andy Powell

Harper - Linked Library Data - Columbia University58 Endless possibilities This barely scratches the surface The Giant Global Graph!! With more soundly modeled bibliographic and authority data… –Terminology Services –Context sensitive interfaces –Customized Exhibits –Mashups –Web Services –User Profiling –Collaboration tools

Harper - Linked Library Data - Columbia University59 Continuing Challenges Emerging Technology Design Patterns Complexity (http-range14) Existing Technical Infrastructure Bootstrapping Business Cases

Harper - Linked Library Data - Columbia University60 More Information W3C LLD XG: ALA LLD Interest Group: – IFLA Semantic Web SIG –

Harper - Linked Library Data - Columbia University61 Thanks! Questions?