Project Report Presentation and Update October 10, 2014 Jeff Mixter - OCLC Research Patrick OBrien - Montana State University Kenning Arlitsch - Montana.

Presentation transcript:

Project Report Presentation and Update
October 10, 2014
Jeff Mixter - OCLC Research
Patrick OBrien - Montana State University
Kenning Arlitsch - Montana State University
Describing Theses and Dissertations Using Schema.org
DCMI 2014, Austin, Texas

Background
This project is based on an IMLS grant awarded to Kenning and Patrick in 2010
The initial scope was to improve the indexing and visibility of digital collections in search engines
Since the release of Schema.org in 2011, the scope has expanded to include modeling institutional repository (IR) material in a way that makes it more visible to traditional search engines

Schema.org
Released in 2011 by Bing, Google, Yahoo and Yandex
Lingua franca for describing things on the web
The W3C Schema Bib Extend Community Group was created to make bibliographic recommendations and suggestions to Schema.org

Data Sample
1,909 DC records from the Montana State University ScholarWorks IR
They had already undergone extensive metadata clean-up

Data Model
Started with Schema.org as the base
We created an extension vocabulary using the same mechanics and conventions used in Schema.org
–RDFS vocabulary
–It is published as RDFa

[Figure not reproduced. Namespace prefixes shown: schema:, dcterms:, mont:]

Classes
There was a need to add more specificity to the existing Creative Work branch classes
–mont:Thesis
–mont:Concept
There was also a need to describe entities unique to IRs and universities that are not covered in Schema.org’s current vocabulary
–mont:InstitutionalRepository
–mont:AcademicDepartment

Properties
Create more granular relationships between classes
–mont:committeeMember
Describe important attributes of theses and dissertations that were not included in Schema.org*
–mont:firstPage**
Highlight and model unique relationships that were otherwise locked in the metadata records
–mont:advisor
* Schema.org underwent an update following the publication of the project report
** This property has since been replaced by schema:pageStart
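The classes and properties above can be sketched as a single JSON-LD description of one thesis. This is a minimal illustration, not the project's actual data: the mont: namespace URI is not given in the slides, so MONT_NS below is a hypothetical placeholder, and the helper name describe_thesis, the example URI and the personal names are all made up for the example.

```python
import json

# ASSUMPTION: the slides do not give the real "mont:" namespace URI,
# so this example.org value is a hypothetical placeholder.
MONT_NS = "http://example.org/mont#"


def describe_thesis(uri, title, author, advisor, committee, page_start):
    """Build a JSON-LD dict for one thesis using schema: and mont: terms."""

    def person(name):
        # Every person is described as a plain schema:Person with a name.
        return {"@type": "schema:Person", "schema:name": name}

    return {
        "@context": {"schema": "http://schema.org/", "mont": MONT_NS},
        "@id": uri,
        # Typed with both the base class and the more specific extension class.
        "@type": ["schema:CreativeWork", "mont:Thesis"],
        "schema:name": title,
        "schema:author": person(author),
        "mont:advisor": person(advisor),
        "mont:committeeMember": [person(name) for name in committee],
        # schema:pageStart replaced the earlier mont:firstPage property.
        "schema:pageStart": page_start,
    }


# Illustrative values only.
thesis = describe_thesis(
    uri="http://example.org/thesis/1",
    title="An Example Thesis",
    author="A. Student",
    advisor="B. Advisor",
    committee=["C. Member", "D. Member"],
    page_start="1",
)
print(json.dumps(thesis, indent=2))
```

Typing the record as both schema:CreativeWork and mont:Thesis means a consumer that only understands Schema.org can still read it, while the extension terms carry the thesis-specific relationships.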

Inferring additional information from the record
This could allow universities to aggregate a large amount of data about academic output and use it for reviews and marketing
This highlights the idea of developing a graph of university entities

Process Model
Data was loaded into OpenRefine
Data was reconciled against DBpedia, LCSH and VIAF
–Matching was made easier by the specific metadata fields that the records used
–dc:subjects.lcsh matched 78%
Generated our own internal URIs***
*** The URI pattern for the current production data differs from that used in the example data presented in the project report
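The reconcile-then-mint step can be sketched in a few lines. This is not OpenRefine's reconciliation service and not the project's code: the authority lookup is a plain dictionary standing in for LCSH, all URIs are example.org placeholders, and the internal URI pattern (INTERNAL_BASE plus a slug) is hypothetical.

```python
from urllib.parse import quote

# ASSUMPTION: hypothetical base for internally minted URIs; the slides
# do not state the real pattern.
INTERNAL_BASE = "http://example.org/entity/"


def reconcile(subjects, authority):
    """Map each subject label to an authority URI, or mint an internal URI.

    `authority` is a dict of normalized label -> URI, standing in for a
    real reconciliation service. Returns (label -> URI mapping, match rate).
    """
    resolved = {}
    matched = 0
    for label in subjects:
        key = label.strip().lower()  # normalize before lookup
        if key in authority:
            resolved[label] = authority[key]
            matched += 1
        else:
            # No authority match: mint an internal URI from a URL-safe slug.
            resolved[label] = INTERNAL_BASE + quote(key.replace(" ", "-"))
    rate = matched / len(subjects) if subjects else 0.0
    return resolved, rate


# Toy data; the URIs are placeholders, not real LCSH identifiers.
toy_authority = {
    "hydrology": "http://example.org/lcsh/1",
    "snow": "http://example.org/lcsh/2",
}
resolved, rate = reconcile(["Hydrology", "Snow ", "Elk migration"], toy_authority)
print(f"matched {rate:.0%} of subjects")
```

The match rate computed here is the same kind of figure as the 78% reported for dc:subjects.lcsh, although the real number came from reconciliation in OpenRefine rather than an exact-label lookup.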

Syndication of RDF data
Data from three records was published online along with an HTML page that described all of the entities referenced in the concise bounded descriptions (CBDs)
–Serialized as RDFa
Since then we have loaded all 1,909 RDF descriptions back into the ScholarWorks repository and tweaked the DSpace instance to pull over and display JSON-LD data
All newly created entities are loaded into a triple store with a Pubby front end
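One common way to expose JSON-LD on a record page is to embed it in a script element of type application/ld+json. The sketch below shows that embedding only; it is not the DSpace modification described above, and the function name jsonld_script_tag and the sample record are hypothetical.

```python
import json


def jsonld_script_tag(description):
    """Serialize a JSON-LD dict into an HTML <script> element."""
    payload = json.dumps(description, indent=2)
    # Escape "</" so a value containing "</script>" cannot close the element
    # early; "\/" is a valid JSON escape for "/", so the data is unchanged.
    payload = payload.replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + payload + "\n</script>"


# Illustrative record only.
record = {
    "@context": "http://schema.org/",
    "@type": "CreativeWork",
    "name": "An Example Thesis",
}
tag = jsonld_script_tag(record)
print(tag)
```

Because the data sits in its own element instead of being woven into the visible markup as RDFa attributes, the page template can change without touching the structured data.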

Google Webmaster Tools

Next Steps
Set up a more production-ready Pubby interface
Make modifications to the ScholarWorks structured data
Make libraries visible on the Web
–Build the presence of the library and its sub-organizations on the Semantic Web
–Arlitsch, K., OBrien, P., Clark, J. A., Young, S. W. H. & Rossmann, D. (2014). Demonstrating Library Value at Network Scale: Leveraging the Semantic Web With New Knowledge Work. Journal of Library Administration, 54(5),

Questions?

Thank You! ©2014 OCLC. This work is licensed under a Creative Commons Attribution 3.0 Unported License. Suggested attribution: “This work uses content from [presentation title] © OCLC, used under a Creative Commons Attribution license: Jeff