Metadata Modularization Concepts and Tools Carl Lagoze CS502 2001-03-14.

Presentation transcript:


Metadata
Structured data about data…

Why is Metadata important?
Key to organizing, managing, preserving, and locating content and services in digital libraries

Why is Metadata difficult?
Cost
Interoperability
  – Syntax
  – Semantics
Customizability
Extensibility
Distribution
Integrity, Authenticity, Quality
Human and Machine Factors
Naming

Metadata Thoughts
Metadata takes a variety of forms
  – descriptive cataloging
  – specialized
      terms and conditions
      administrative
      content ratings
      provenance
      linkage

More Metadata Thoughts
New metadata sets will continually evolve
Many metadata sets are “community-specific”
  – administration
  – use
Human and machine use

Dublin Core Metadata Set for Simple Resource Discovery
15 elements allowing simple descriptive sentences about document-like objects:
  – “Document has title Hamlet”
  – “Document has creator William Shakespeare”
  – “Document has subject love and anguish”

The Dublin Core 15
Title
Creator
Subject/Keywords
Description
Publisher
Other Contributor
Date
Resource Type
Format
Resource Identifier
Source
Language
Relation
Coverage
Rights Management
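The "simple descriptive sentences" idea can be sketched in a few lines of Python. This is only an illustration, assuming a record is a list of (element, value) pairs; the function name `describe` and the record layout are hypothetical and not part of Dublin Core itself.

```python
# The fifteen element names as listed on the slide.
DC_ELEMENTS = (
    "Title", "Creator", "Subject/Keywords", "Description", "Publisher",
    "Other Contributor", "Date", "Resource Type", "Format",
    "Resource Identifier", "Source", "Language", "Relation", "Coverage",
    "Rights Management",
)


def describe(record):
    """Turn (element, value) pairs into simple descriptive sentences.

    Hypothetical helper: rejects any element outside the fifteen above.
    """
    sentences = []
    for element, value in record:
        if element not in DC_ELEMENTS:
            raise ValueError(f"not a Dublin Core element: {element}")
        sentences.append(f"Document has {element.lower()} {value}")
    return sentences


# Illustrative record built from the slide's Hamlet example.
hamlet = [
    ("Title", "Hamlet"),
    ("Creator", "William Shakespeare"),
    ("Description", "love and anguish"),
]
sentences = describe(hamlet)
for sentence in sentences:
    print(sentence)
```

Every statement is flat: one resource, one element, one literal value. That flatness is what keeps the set simple, and it is also the limitation the later slides return to.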

A Scope for the Dublin Core
Increase or decrease number of elements?
Structured or unstructured value syntax?
Accommodate community extensions?

Warwick Framework
Provide context for Dublin Core effort
Integrate multiple sets of metadata addressing issues of:
  – individual integrity
  – distinct audiences
  – separate realms of responsibility and management

Warwick Framework Design
Containers for aggregating …
Packages of typed metadata sets
General principles - information hiding:
  – only operation defined at container level returns sequence of contained packages
  – packages are opaque at the container level
  – access to package contents subject to terms and conditions

Package Types
Simple metadata set
  – segregating distinct metadata into separate packages
Recursive container
  – nesting semantically related metadata sets
Indirect reference
  – allowing distribution and sharing of metadata sets

Metadata Container
[Diagram: a Container holding four Packages labeled Dublin Core, MARC record, Indirect Reference, and Terms and Conditions, plus a URI label.]
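A minimal Python sketch of the container and package types described above may help. It assumes each package type can be modeled as a small class; all class names, field names, and the example URI are hypothetical. The container exposes a single operation that returns its sequence of packages, following the information-hiding principle on the design slide.

```python
from dataclasses import dataclass


@dataclass
class MetadataSet:
    """Simple metadata set: one typed package whose content is opaque to the container."""
    type_name: str
    content: object


@dataclass
class IndirectReference:
    """Package whose metadata lives elsewhere and is identified by a URI."""
    uri: str


class Container:
    """Aggregates packages; a container can itself be a package (recursive container)."""

    def __init__(self, *packages):
        self._packages = tuple(packages)

    def packages(self):
        # The only container-level operation: return the contained packages in order.
        return self._packages


def type_names(container):
    """Walk a container recursively and list what a client can see of each package."""
    names = []
    for package in container.packages():
        if isinstance(package, Container):
            names.extend(type_names(package))
        elif isinstance(package, MetadataSet):
            names.append(package.type_name)
        else:
            names.append("indirect:" + package.uri)
    return names


# Illustrative data only; the URI is a placeholder.
record = Container(
    MetadataSet("Dublin Core", {"Title": "Hamlet"}),
    MetadataSet("MARC record", "opaque MARC data would go here"),
    IndirectReference("http://example.org/metadata/shared-set"),
    Container(MetadataSet("Terms and Conditions", "access rules")),
)
names = type_names(record)
print(names)
```

The container never inspects `content`; only a client that understands a given package type would open it, which is where a type registry would come in.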

Open Implementation Issues
Data encoding
Semantic interaction of overlapping sets
  – between semantically related packages
  – between semantically distinct packages
Type registry

Modeling & Encoding Metadata Components: XML Namespaces
Prevent term clash:
  – record?, creator?
Establish concept spaces through URIs
xmlns:dc=“
xmlns:abc=“
Herbert Van de Sompel Cornell University
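How namespaces prevent a term clash can be shown with Python's standard `xml.etree.ElementTree`. This is a sketch under assumptions: the namespace URIs on the slide are truncated, so the Dublin Core URI below is the commonly used element-set namespace and the `abc` URI is a made-up placeholder. The assignment of values to the two `creator` elements is also purely illustrative.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URIs (the slide's own values are truncated).
DC = "http://purl.org/dc/elements/1.1/"
ABC = "http://example.org/abc#"  # hypothetical placeholder

doc = f"""
<record xmlns:dc="{DC}" xmlns:abc="{ABC}">
  <dc:creator>Herbert Van de Sompel</dc:creator>
  <abc:creator>Cornell University</abc:creator>
</record>
""".strip()

root = ET.fromstring(doc)
ns = {"dc": DC, "abc": ABC}

# Both elements have the local name "creator", but each resolves to a
# different URI-qualified name, so the two vocabularies cannot collide.
dc_creator = root.find("dc:creator", ns).text
abc_creator = root.find("abc:creator", ns).text
tags = [child.tag for child in root]

print(dc_creator, "|", abc_creator)
print(tags)
```

The prefix (`dc`, `abc`) is only a local shorthand; the URI is what identifies the concept space.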

Modeling & Encoding Metadata Components: RDF
RDF (Resource Description Framework)
The instantiation of the Warwick Framework on the Web
Provides enabling technology for richly structured metadata
Rich data model supporting notions of distinct entities and properties
Syntax expressed in XML

RDF Components
Formal data model
Syntax for interchange of data
Schema type system (schema model)

RDF Data Model
Directed labeled graphs
Model elements
  – Resource
  – Property
  – Value
  – Statement
  – Containers

RDF Model Primitives
[Diagram: Resource, Property, Value, Resource, Statement]

RDF Syntax Example
[Diagram: URI:R, dc:Title “CIMI Presentation”, dc:Creator “Eric Miller”]
<RDF xmlns = “ xmlns:dc = “ CIMI Presentation Eric Miller
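The data model behind this example is a set of statements, each a (resource, property, value) triple forming one labeled edge of the graph. A minimal Python sketch, assuming plain strings stand in for URIs and literals; the `Statement` type and helper names are hypothetical:

```python
from typing import NamedTuple


class Statement(NamedTuple):
    """One edge of the directed labeled graph."""
    resource: str  # the node being described
    prop: str      # the edge label
    value: str     # a literal, or the identifier of another resource


# The two statements from the example above.
graph = [
    Statement("URI:R", "dc:Title", "CIMI Presentation"),
    Statement("URI:R", "dc:Creator", "Eric Miller"),
]


def values(graph, resource, prop):
    """All values of one property of one resource."""
    return [s.value for s in graph if s.resource == resource and s.prop == prop]


def properties(graph, resource):
    """The distinct properties asserted about a resource."""
    return sorted({s.prop for s in graph if s.resource == resource})


print(values(graph, "URI:R", "dc:Creator"))
print(properties(graph, "URI:R"))
```

A richer graph needs no new machinery: a statement whose value is another resource's identifier lets further statements hang off that resource.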

RDF Model Example #2
[Diagram labels: URI:R, URI:ERIC, URI:OCLC; dc: Title, Creator; bib:Name, bib:Aff, bib:; oa:; “CIMI Presentation”, “Eric Miller”, “OCLC”, oclc.org”]

RDF Syntax Example #2
<RDF xmlns = “ xmlns:dc = “ xmlns:bib = “ CIMI Presentation Eric Miller

RDF Containers
Permit the aggregation of several values for a property
Express multiple aggregation semantics
  – unordered
  – sequential or priority order
  – alternative
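The three aggregation semantics can be sketched as three small Python classes. The names Bag, Seq, and Alt follow the RDF container vocabulary; the classes themselves, the equality rules, and the `preferred` helper are illustrative assumptions rather than an RDF API.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(eq=False)
class Bag:
    """Unordered: member order carries no meaning."""
    members: tuple

    def __eq__(self, other):
        # Compare as multisets, so ordering is ignored.
        return isinstance(other, Bag) and Counter(self.members) == Counter(other.members)


@dataclass
class Seq:
    """Sequential or priority order: member order is significant."""
    members: tuple


@dataclass
class Alt:
    """Alternatives: any one member may be used."""
    members: tuple

    def preferred(self):
        # Assumption in this sketch: the first alternative is the default choice.
        return self.members[0]


# Illustrative values only.
same_bag = Bag(("Author A", "Author B")) == Bag(("Author B", "Author A"))
same_seq = Seq(("Author A", "Author B")) == Seq(("Author B", "Author A"))
default_format = Alt(("text/html", "text/plain")).preferred()
print(same_bag, same_seq, default_format)
```

Whatever the semantics, the container is the single value of the property, so "several values for a property" still fits the one-statement, one-value model.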

RDF Schemas
Declaration of vocabularies
  – properties defined by a particular community
  – characteristics of properties and/or constraints on corresponding values
Schema Type System - Basic Types
  – Property, Class, SubClassOf, Domain, Range
  – Minimal (but extensible) at this time
  – minimize significant clashes with typing system designed for XML Schema WG
Expressible in the RDF model and syntax
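To make the basic types concrete, here is a small Python sketch that treats Domain and Range as constraints on the two ends of a statement and uses SubClassOf to decide whether a resource's class satisfies them. The class hierarchy, the `creator` declaration, and the resource typings are all hypothetical; the sketch follows the slide's "constraints on corresponding values" wording and is not a schema processor.

```python
# Hypothetical vocabulary: class -> its direct superclass.
SUBCLASS_OF = {"Playwright": "Person", "Person": "Agent"}

# Hypothetical property declaration with its domain and range.
PROPERTIES = {"creator": {"domain": "Document", "range": "Agent"}}

# Hypothetical typing of two resources.
TYPES = {"R1": "Document", "R2": "Playwright"}


def is_a(cls, ancestor):
    """True if cls equals ancestor or reaches it through SubClassOf links."""
    while cls is not None:
        if cls == ancestor:
            return True
        cls = SUBCLASS_OF.get(cls)
    return False


def conforms(resource, prop, value):
    """Check one statement against the declared domain and range of its property."""
    declaration = PROPERTIES[prop]
    return (is_a(TYPES[resource], declaration["domain"])
            and is_a(TYPES[value], declaration["range"]))


ok = conforms("R1", "creator", "R2")        # Document --creator--> Playwright
reversed_ok = conforms("R2", "creator", "R1")
print(ok, reversed_ok)
```

Because the declarations are themselves just data, they can be written down as statements too, which is the point of "expressible in the RDF model and syntax".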

Relationships among vocabularies
[Diagram: dc:Creator, ms:director, marc:100, bib:Author]

Bringing it together
RDF Data Model
  – Support consistent encoding, exchange and processing of metadata… critical when aggregating data from multiple sources
RDF Schema
  – Declare, define, reuse vocabularies
RDF Metadata transmission
  – XML encoding

Interoperability among Metadata Vocabularies
[Diagram: core classes; Dublin Core, MARC, INDECS, IMS]

Attribute/Value approaches to metadata…
“Hamlet has a creator Shakespeare”
  – subject: Hamlet
  – implied verb: has
  – metadata noun: creator
  – literal: Shakespeare
“The playwright of Hamlet was Shakespeare”
  – metadata adjective: Playwright
[Diagram: R1 with dc:title “Hamlet” and dc:creator.playwright “Shakespeare”]

…run into problems for richer descriptions…
“Hamlet has a creator Shakespeare”
“Hamlet has a creator Stratford birthplace”
“The playwright of Hamlet was Shakespeare, who was born in Stratford”
[Diagram: R1 with dc:creator.playwright “Shakespeare” and dc:creator.birthplace “Stratford”]

…because of their failure to model entity distinctions
[Diagram: R1 with title “Hamlet” and creator R2; R2 with name “Shakespeare” and birthplace “Stratford”]
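The contrast in the Hamlet example can be sketched in Python. The flat record attaches every attribute to the document, while the entity model gives the creator its own resource. The dictionary layouts and the helper `creator_details` are hypothetical illustrations; the identifiers R1 and R2 come from the diagrams.

```python
# Flat attribute/value record: everything hangs off the document R1, so
# "birthplace" reads as a property of Hamlet rather than of its creator.
flat = {
    "dc:title": "Hamlet",
    "dc:creator.playwright": "Shakespeare",
    "dc:creator.birthplace": "Stratford",
}

# Entity model: the creator is a distinct resource with its own properties.
resources = {
    "R1": {"title": "Hamlet", "creator": "R2"},
    "R2": {"name": "Shakespeare", "birthplace": "Stratford"},
}


def creator_details(resources, document_id):
    """Follow the creator link from a document and read the creator's own properties."""
    creator = resources[resources[document_id]["creator"]]
    return creator["name"], creator["birthplace"]


details = creator_details(resources, "R1")
print(details)
```

In the entity model the birthplace belongs unambiguously to R2. With the flat record, nothing ties "Stratford" to "Shakespeare" beyond a naming convention in the keys.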

Understanding Metadata based on Query Capabilities
Simple boolean tags?
Agent, time, place questions?
  – Who was responsible for what and when

Applying a Model-Centric Approach
Formally define common entities and relationships underlying multiple metadata vocabularies
Describe them (and their inter-relationships) in a simple logical model
Provide the framework for extending these common semantics to domain- and application-specific metadata vocabularies

Conceptual Basis: Evolution of Content over Time
IFLA Entity Model
From Bearman et al., D-Lib Magazine, January 1999.

Events are key to understanding metadata relationships?
Recognizing inherent lifecycle aspects of digital content - transformation of “input” resources to “output” resources and of their descriptions (e.g., IFLA model)
Modeling implied events as first-class objects provides attachment points for common entities – e.g., agents, contexts (times & places), roles
Clarifying attachment points facilitates mapping across common entities in different vocabularies
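A rough Python sketch of an event as a first-class object follows. It assumes an event records its input and output resources, a time, a place, and the agents involved keyed by role. The `Event` class, its fields, and the sample data are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    """An event that transforms input resources into output resources."""
    kind: str
    inputs: list
    outputs: list
    time: str
    place: str
    agents: dict = field(default_factory=dict)  # role -> agent


# Illustrative lifecycle: a manuscript is created, then translated.
events = [
    Event("creation", [], ["manuscript"], "time-1", "place-1",
          {"playwright": "agent-A"}),
    Event("translation", ["manuscript"], ["translated-text"], "time-2", "place-2",
          {"translator": "agent-B"}),
]


def responsibility(events, resource):
    """Answer "who was responsible for what and when" for one resource."""
    return [
        (agent, role, event.kind, event.time)
        for event in events
        if resource in event.outputs
        for role, agent in event.agents.items()
    ]


answer = responsibility(events, "translated-text")
print(answer)
```

Agent, role, time, and place all attach to the event rather than to the resource, so the same questions can be asked of any vocabulary that is mapped onto events.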

Content, Events, & Descriptions

Museum Data