LINKED DATA DEMYSTIFIED PRACTICAL EFFORTS TO TRANSFORM CONTENTDM METADATA INTO LINKED DATA.

Slides:



Advertisements
Similar presentations
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
RDF AND LINKED DATA Jenn Riley Head, Carolina Digital Library and Archives The University of North Carolina at Chapel Hill.
RDF Tutorial.
Semantic Web Introduction
© Copyright IBM Corporation 2014 Getting started with Rational Engineering Lifecycle Manager queries Andy Lapping – Technical sales and solutions Joanne.
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
Linked Library Data Miiya Holmes October 6-7, 2012.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
JOINING UP GOVERNMENTS EUROPEAN COMMISSION ADMS-enabled exploration of GS1 Dox 20 February 2013.
Behshid Behkamal Ferdowsi University of Mashhad Web Technology Lab.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
A Registry for controlled vocabularies at the Library of Congress
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Context and Prosopography: Putting the 'Archives' Into LOD-LAM Corey A Harper SAA MDOR
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Using Vocabulary Services in Validation of Water Data May 2010 Simon Cox, JRC Jonathan Yu & David Ratcliffe, CSIRO.
Not Just For Data Geeks! A Practical Approach to Linked Data for Digital Library Managers Cory Lampert and Silvia Southwick Salt Lake City October 9, 2013.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Taking Action: Linked Data for Digital Library Managers Silvia Southwick and Cory Lampert UNLV Digital Collections American Library Association Annual.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
LINKED DATA AND RDA: LOOKING TOWARD NEXT GENERATION CATALOGING Jenn Riley Head, Carolina Digital Library and Archives Digital Discussions series Twitter:
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Antoine Isaac 1 st PRELIDA Workshop Pisa, June 26, 2013.
Evidence from Metadata INST 734 Doug Oard Module 8.
Common Terminology Services 2 CTS 2 Submission Team Status Update HL7 Vocabulary Working Group May 17, 2011.
Linked Data: Emblematic applications on Legacy Data in Libraries.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Introduction to Archon for CARLI Members Jen Masciadrelli, Library Systems Coordinator, CARLI Office Sarah Horowitz, Special Collections Librarian, Augustana.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
LINKED DATA PILOT PROJECT AT SYRACUSE UNIVERSITY LIBRARIES Sarah Theimer & Brian Dobreski Acquisitions and Cataloging Syracuse University Libraries.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
LINKED DATA what you need to know to understand, produce, and work with Linked Data Robert Chavez, PhD. Senior Content Solutions Architect, NEJMGroup NETSL.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
PREPARING FOR LINKED DATA IN DIGITAL REPOSITORIES Sai Deng, University of Central Florida Libraries ACRL Technical Services Interest Group ALA.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Linked Data Web that can be processed by machines
Presented at Archives Records 2016, session 510
ALA Practical Linked Data With Open Source
Jenn Riley Metadata Librarian Digital Library Program
Introduction to Metadata
Lifting Data Portals to the Web of Data
Digging into Linked Data: Perspectives from the Long Tail
Getting started With Linked Data.
PREMIS Tools and Services
LOD reference architecture
Jenn Riley Metadata Librarian Digital Library Program
Linked Data Ryan McAlister.
Presentation transcript:

LINKED DATA DEMYSTIFIED PRACTICAL EFFORTS TO TRANSFORM CONTENTDM METADATA INTO LINKED DATA

PRESENTERS Silvia Southwick Digital Collections Metadata Librarian UNLV Libraries Cory Lampert Head of the Digital Collections Department UNLV Libraries

OUTLINE Why should I care? What is it? Defining Linked Data / Introduction to Linked Data Concepts / Linked Data Principles Technologies & Standards for Linked Data The Linked Data Cloud How? Applying these concepts to digital collection records Anticipated challenges working with CONTENTdm The UNLV Libraries Linked Data Project How could you start working with Linked Data?

LINKED DATA MYTHS My collections are already visible through Google; so who cares This is a topic for catalogers It’s too technical / complicated / boring Actually... Linked data is the future of the Web Data will no longer be in silos (catalog, CONTENTdm) Relationships are powerful and worth the effort

HOW DO WE CURRENTLY CREATE OUR DIGITAL COLLECTIONS? Data (or metadata) are encapsulated in records Records are contained in collections Very few links are created within and/or across collections Links have to be manually created Existing links do not specify the nature of the relationships among records This structure hides potential links within and across collections – DATA IS TRAPPED!

UNIQUE LOCAL COLLECTIONS, HIDDEN RELATIONSHIPS Example: A search on “water” in the OCLC collection of collections resulted in 26 collections that are not cross- linked Digital Collections containing records on “water” California Water Documents Western Waters Digital Library Bear River Watershed Historical Collection The Historic Landscape of Nevada: Development, Water, and Natural Environment Seattle Power Water Supply Collection Western Waters Digital Library: The Columbia River Basin in Oregon ……………

EXPOSED DATA RELATIONSHIPS POWERFUL, RELATED DATA Example: Google Knowledge Graph

A LEGO METAPHOR FOR CREATING LINKED DATA This is the Data Model

Transforming records into data Publishing data Linking data as you search or browse

DEFINING LINKED DATA Linked Data refers to a set of best practices for publishing and interlinking data on the Web Data needs to be machine-readable Linked data (Web of Data) is an expansion of the Web we know (Web of documents)

WEB IN TRANSITION 1.Two types of data: 1.Human-readable documents ( , brochure, report) 2.Machine-readable data (calendar, playlist, spreadsheet) 2.Shopping example 1.A web page ad (document) says “dress”, “color”, “price”, “designer” 2.But machines cannot extract data to re-use in another application (e.g., spreadsheet to compare prices) 3.RDF – new way to specify relationships and transfer context with data across applications: reusable data 4. The time is now to start to evolve our documents into data

TECHNOLOGIES FOR LINKED DATA Linked data is built in the Web architecture (HTTP, URIs) RDF is a data model (not a format) Most common serializations: RDF/XML RDFa RDF is based on triples/statements SPARQL - SPARQL Protocol and RDF Query Language -- is an query language able to retrieve and manipulate data stored in RDF.

WHAT ARE TRIPLES? Triples are expressed as: subject – predicate – object Examples: Frank Sinatra -- is an – entertainer Frank Sinatra – knows – Jack Entratter

EXAMPLE TRIPLE  RDF Introduction to RDF at

PRINCIPLES OF LINKED DATA 1.Use URIs as names for things (people, organizations, artifacts, abstract concepts, etc.) 2.Use HTTP URIs so that people can look up those names 3.When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL) 4.Include links to other URI so that they users discover other related items (note: RDF Links have types)

THE LINKED DATA CLOUD

CREATING LINKED DATA FROM ORIGINAL RECORD VS. HARVESTED RECORD

ORIGINAL RECORD Title: Café Monico menu, February 19, 1903Category: regular services Restaurant Name: Café Monico (London, England) Additional Information: Advertisement on back and around edges if the menu. Insert lists Indian curries as special on Mondays and Thursdays Graphic Elements: Borders(Ornament areas); Buildings; Photographs Enclosures: daily specials; advertisements Type of restaurant: Non-specialized restaurant Type of menu: `a la carte Meals served: dinner; lunch City: London …..

OCLC WORLDCAT LINKED DATA SAME RECORD (HARVESTED)

HOW CAN WE ADDRESS THIS PROBLEM? Create a complementary data structure that would allow dynamic interlinking among data How? Export records from the collections Deconstruct these records by extracting data from them Apply vocabularies Adopt a common model to express data Publish data in a data space (Linked Data Cloud) where links among data are created automatically

EXAMPLES OF RECORDS Showgirls Menus Dreaming the Skyline

TRANSFORMING RECORDS INTO DATA What are possible triples for this photo?

GRAPHICAL REPRESENTATION OF THE PHOTO TRIPLES

ADDING TRIPLES FROM THE OTHER RECORDS What are the URIs for subjects, predicates and objects?

VOCABULARIES ALERT: Finally a place in the presentation we feel at home! Vocabularies are specific terms used in RDF statements to describe specific resources Vocabulary examples in linked data (Linked Open Vocabulary): DCMI Type Vocabulary Friend of a Friend Vocabulary Geonames MARC Code List for Relators Creative Commons Rights Expression vocabulary Schema.org Many more at:

UNLV LINKED DATA PROJECT Goals: Study the feasibility of developing a single process that would allow the conversion of our collection records into linked data preserving their original expressivity and richness Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web.

PHASES OF THE PROJECT Literature Review Evaluating Technologies Research existing technologies and best practices Develop small experiments with technologies Make decisions of which technologies to adopt, adapt or develop Data preparation Select and prepare records from digital collections to participate in the project Run process for data transformation Publish on the Linked Data Cloud Assess results

DATA PREPARATION Defining vocabularies that will be adopted for predicates Defining types of triples to be created (literal, outgoing links, incoming links, triples that describe related resources, triples that link to descriptions, triples that indicate provenance of the data, etc.) Specifying URIs for new “things” Identifying potential URIs for external links (e.g., things that already have URIs) Describing data sets that will be published in the linked data cloud

TECHNOLOGY OPTIONS FOR DATA TRANSFORMATION

Type of Data Data Preparation Data Storage Data Publication Structured Data (CONTENTdm) RDF-izers for Excel or XML RDF Store Linked Data Wrapper Linked Data on the Web Linked Data Interface RDF Files Web Server Data Source API Drupal DB Drupal RDFa Adapted from Linked Data Evolving the Web into a Global Data Space by Heath and Bizer

ANTICIPATED CHALLENGES Developing of a single process for transforming records into data because digital collections adopt different metadata schema Creating URIs for all our unique materials Finding ways to associate URIs to “things” in CONTENTdm Adopting linked data while it is in early stage of development

TIPS TO CONSIDER WHEN CREATING DIGITAL COLLECTIONS METADATA Avoid mixing different types of data in metadata fields Avoid creating aggregated data fields Record URIs whenever available Reinforce use of controlled vocabularies Monitor how CMS are adopting linked data technologies

HOW WE STARTED Created a study group in the Library (members from various areas of the library) Watched webinars on the topic and have discussions after the webinars Created an internal wiki with linked data resources Participated in linked data interest groups Follow the literature on this topic

QUESTIONS? Contact Information: Silvia Southwick Cory Lampert Department of Digital Collections UNLV Libraries