Authority Control for the Semantic Web


Authority Control for the Semantic Web: Encoding Library of Congress Subject Headings (LCSH) in SKOS
Corey A Harper, DC2006, October 4, 2006

Outline
- Library Controlled Vocabularies and the Semantic Web
- Library of Congress Subject Headings
- Encoding: MARC, MADS, SKOS
- XML & XSLT: Intentions and Problems
- Alternate Approaches
- Conclusion - Benefits, Related & Future Work

“The vast bulk of data to be on the Semantic Web is already sitting in databases … all that is needed [is] to write an adapter to convert a particular format into RDF and all the content in that format is available.” -Tim Berners-Lee in an interview with the Consortium Standards Bulletin

Library Controlled Vocabularies: Benefits
- Reputation - Trusted Tradition
- Mature - Time tested and carefully developed
- General & Comprehensive - Cover large knowledge spaces

Library Controlled Vocabularies: Drawbacks
- Overly Complicated - extraneous information
- Archaic Syntax - MARC Records
- Slow to evolve - authorities control the authority control

LCSH
Both the benefits and drawbacks are at their strongest when dealing with Library of Congress Controlled Vocabularies. LCSH is a prime example of the best and worst of Library Authority Land.
- Syndetic Structure - Relationships between concepts
- Relationships to other Controlled Vocabularies (LC Classification)

LCSH in Dublin Core
- Encoding Scheme for DC Subject
- No easy way to draw on equivalent terms and cross-references
- Abstract Model, RDF and SKOS could enable applications to make use of the whole vocabulary

Vocabulary Encodings
- MARC - Great for Library Applications
- MARC-XML
- MADS
- SKOS - Designed for use with RDF
Helping Get Library Apps Online

LCSH in SKOS

    <skos:Concept rdf:about="http://example.com/lcsh#95000541">
      <skos:prefLabel>World Wide Web</skos:prefLabel>
      <skos:altLabel>W3 (World Wide Web)</skos:altLabel>
      <skos:altLabel>Web (World Wide Web)</skos:altLabel>
      <skos:altLabel>World Wide Web (Information Retrieval System)</skos:altLabel>
      <skos:broader rdf:resource="http://example.com/lcsh#88002671"/>
      <skos:broader rdf:resource="http://example.com/lcsh#92002381"/>
      <skos:related rdf:resource="http://example.com/lcsh#92002816"/>
      <skos:narrower rdf:resource="http://example.com/lcsh#2002000569"/>
      <skos:narrower rdf:resource="http://example.com/lcsh#2003001415"/>
      <skos:narrower rdf:resource="http://example.com/lcsh#97003254"/>
    </skos:Concept>

Talk a bit about the benefits, merging data stores and all that jazz. As Tom mentioned Tuesday in the opening, SKOS and RDF are like building blocks: bricks that fit together nicely with the Dublin Core data model to support interoperability and Semantic Web development (and to enable more interesting and robust applications).
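
As a rough illustration of how an application could draw on the equivalent terms and cross-references in a record like this one, the Python sketch below reads a SKOS concept and builds a lookup from every preferred and alternate label to the concept URI. It is a sketch under assumptions, not something from the talk: the function names `read_concepts` and `label_index` are hypothetical, the sample is wrapped in an `rdf:RDF` element with namespace declarations so that it parses on its own, and links to other concepts use `rdf:resource`, as RDF/XML expects on property elements.

```python
import xml.etree.ElementTree as ET

SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Shortened copy of the slide's example, wrapped so it is a complete document.
SAMPLE = f"""<rdf:RDF xmlns:rdf="{RDF}" xmlns:skos="{SKOS}">
  <skos:Concept rdf:about="http://example.com/lcsh#95000541">
    <skos:prefLabel>World Wide Web</skos:prefLabel>
    <skos:altLabel>W3 (World Wide Web)</skos:altLabel>
    <skos:altLabel>Web (World Wide Web)</skos:altLabel>
    <skos:broader rdf:resource="http://example.com/lcsh#88002671"/>
    <skos:narrower rdf:resource="http://example.com/lcsh#97003254"/>
  </skos:Concept>
</rdf:RDF>"""


def read_concepts(rdf_xml):
    """Return a dict keyed by concept URI with labels and related URIs."""
    concepts = {}
    root = ET.fromstring(rdf_xml)
    for node in root.iter(f"{{{SKOS}}}Concept"):
        uri = node.get(f"{{{RDF}}}about")
        entry = {
            "pref": node.findtext(f"{{{SKOS}}}prefLabel"),
            "alt": [e.text for e in node.findall(f"{{{SKOS}}}altLabel")],
        }
        # Collect the URIs each relationship points at.
        for rel in ("broader", "narrower", "related"):
            entry[rel] = [
                e.get(f"{{{RDF}}}resource")
                for e in node.findall(f"{{{SKOS}}}{rel}")
            ]
        concepts[uri] = entry
    return concepts


def label_index(concepts):
    """Map every preferred and alternate label (lower-cased) to its URI."""
    index = {}
    for uri, entry in concepts.items():
        for label in [entry["pref"], *entry["alt"]]:
            index[label.lower()] = uri
    return index


if __name__ == "__main__":
    idx = label_index(read_concepts(SAMPLE))
    print(idx["w3 (world wide web)"])
```

With an index like this, a search on an alternate label such as "W3 (World Wide Web)" resolves to the same concept as the preferred label.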

XML to XML
- MARC can be represented as XML
- SKOS can be represented as XML
- XSLT is easy and effective
- MARC-XML to MADS exists (in Beta)
- Should be easy, right…

Many Challenges
- Records only include broader terms
- References identified by Label, not ID
- Pre-coordinated subject strings
- What to keep, what to exclude?
- Inconsistent identifier format
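
Two of these challenges lend themselves to a small illustration: inconsistent identifiers and pre-coordinated subject strings. The Python sketch below is only one possible treatment and rests on assumptions that the slides do not state: that source identifiers may carry an alphabetic prefix and spacing (for example `sh 95000541`), and that subdivisions in a heading string are separated by `--`. The function names and the sample heading are illustrative.

```python
import re


def normalize_id(raw):
    """Reduce an identifier to its digits so variant spellings compare equal.

    Assumed convention for this sketch: any prefix letters and whitespace
    are dropped, and the digits are kept unchanged.
    """
    return re.sub(r"\D", "", raw)


def split_heading(heading):
    """Split a pre-coordinated string into (main heading, subdivisions)."""
    parts = [part.strip() for part in heading.split("--")]
    return parts[0], parts[1:]


if __name__ == "__main__":
    print(normalize_id("sh 95000541"))
    print(split_heading("World Wide Web--Security measures"))
```

Whether subdivisions should become separate concepts, labels on one concept, or be dropped is exactly the "what to keep, what to exclude" question, which this sketch leaves open.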

Alternate Approaches
- XQuery - Allows parsing of XML in chunks rather than tree-based XPath
- Intermediary structures:
  - Internal to a scripting language like Perl
  - Using a relational database
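
The relational-database variant of an intermediary structure might look like the Python sketch below, which uses the standard-library `sqlite3` module. It assumes a simplified input in which each record carries an identifier, a label, and its broader terms as labels only; the table names, function names, and placeholder headings are hypothetical. A join on the label recovers the broader term's identifier, and reading the same table in the other direction yields the narrower terms that the source records lack.

```python
import sqlite3

# Hypothetical simplified records: (id, label, broader terms given by label).
SAMPLE_RECORDS = [
    ("88002671", "Placeholder broader heading", []),
    ("95000541", "World Wide Web", ["Placeholder broader heading"]),
    ("2002000569", "Placeholder narrower heading A", ["World Wide Web"]),
    ("97003254", "Placeholder narrower heading B", ["World Wide Web"]),
]


def load(records):
    """Load records into an in-memory database used as the intermediary."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE concept (id TEXT PRIMARY KEY, label TEXT UNIQUE)")
    db.execute("CREATE TABLE broader_ref (concept_id TEXT, broader_label TEXT)")
    for concept_id, label, broader_labels in records:
        db.execute("INSERT INTO concept VALUES (?, ?)", (concept_id, label))
        db.executemany(
            "INSERT INTO broader_ref VALUES (?, ?)",
            [(concept_id, b) for b in broader_labels],
        )
    return db


def broader_of(db, concept_id):
    """Resolve a concept's broader labels to identifiers."""
    rows = db.execute(
        """SELECT c.id FROM broader_ref r
           JOIN concept c ON c.label = r.broader_label
           WHERE r.concept_id = ? ORDER BY c.id""",
        (concept_id,),
    )
    return [row[0] for row in rows]


def narrower_of(db, concept_id):
    """Derive narrower terms by inverting the broader references."""
    rows = db.execute(
        """SELECT r.concept_id FROM broader_ref r
           JOIN concept c ON c.label = r.broader_label
           WHERE c.id = ? ORDER BY r.concept_id""",
        (concept_id,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    db = load(SAMPLE_RECORDS)
    print(broader_of(db, "95000541"), narrower_of(db, "95000541"))
```

The same two lookups could be kept in plain dictionaries for the scripting-language variant; the database mainly helps when the vocabulary is too large to hold comfortably in memory.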

Expected Benefits
- Common RDF Semantics
- Many Possible Web Services
- Publish Vocabulary in Multiple Formats
- Ease of re-use
- Entertainment

Related Work
- OCLC’s Terminology Services Project
- NSDL Registry Project

Next Steps
- Finish parsing using an intermediary
- Discuss publishing options with LC
- Publish LCSH-SKOS as a test case
- Experiment with FAST SKOS extensions to represent additional data
- Experiment with other Library Vocabs
- Test web-services and tools

Tools and Web Services
- SRU/SRW
- Use to enhance metadata creation and search
- Facilitate Controlled Vocabularies in Social Tagging Environments
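
For a sense of what an SRU lookup against a vocabulary service could involve, the Python sketch below assembles a `searchRetrieve` request URL from the standard SRU 1.1 parameters (`operation`, `version`, `query`, `maximumRecords`). The base URL and the function name are hypothetical, no such endpoint is described in the talk, and the query is a bare quoted CQL term because the indexes a real server exposes would be server-specific.

```python
from urllib.parse import urlencode


def sru_search_url(base_url, term, max_records=10):
    """Build an SRU 1.1 searchRetrieve URL for a quoted CQL term."""
    # Escape backslashes and double quotes inside the quoted CQL term.
    escaped = term.replace("\\", "\\\\").replace('"', '\\"')
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": f'"{escaped}"',
        "maximumRecords": str(max_records),
    }
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    # "http://example.org/sru" is a placeholder, not a real service.
    print(sru_search_url("http://example.org/sru", "World Wide Web", 5))
```

A metadata-creation tool or tagging interface could call such a URL as the user types and offer the returned headings as suggestions.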

Thank You. Any Questions?
Corey A Harper, DC2006, October 4, 2006