A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath Running a Public Library Website A workshop organised by UKOLN in association with EARL University of Bath, November 1999
Running a Public Library Website, University of Bath, November Presentation Outline Some definitions Metadata and the Web –RDF Resource discovery –Dublin Core –Information Gateways Other metadata implementations –Digital preservation
Running a Public Library Website, University of Bath, November Metadata: definitions (1) Metadata = data about data “… the Internet-age term for structured data about data” - Joint NSF-EU Working Group on Metadata (1998) “… structured data about data that imposes order on a disordered information universe” - Carl Lagoze (Cornell University)
Running a Public Library Website, University of Bath, November Metadata: definitions (2) “… machine understandable information about web resources or other things” - Tim Berners-Lee (World Wide Web Consortium) Roles: Provides information about resources Supports operations carried out on information objects
Running a Public Library Website, University of Bath, November Metadata: uses Metadata can support many potential applications: Resource discovery Content ratings E-commerce Authentication Data management Intellectual property rights management Digital preservation
Running a Public Library Website, University of Bath, November Metadata and the Web Metadata - the missing architectural component from the initial implementation of the Web Metadata - RDF PICS, TCN, MCF, DSig, DC,... Addressing URL Data format HTML Transport HTTP
Running a Public Library Website, University of Bath, November RDF The Resource Description Framework: Part of the W3C (World Wide Web Consortium) Metadata Activity Developing a common syntax for expressing assertions about information on the web –RDF Syntax Working Group –RDF data model and RDF/XML syntax –RDF Schema Working Group
Running a Public Library Website, University of Bath, November Resource discovery Main approaches: –Robot-based Web index services (AltaVista, Lycos, etc.) –Utilising human intelligence to identify and evaluate Internet resources. –Links pages –Information gateways –The library cataloguing method, creating bibliographic records for Internet resources in library catalogues (InterCat)
Running a Public Library Website, University of Bath, November A metadata typology Simple Rich Adapted from: L. Dempsey and R. Heery, “Metadata: a current view of practice and issues”, Journal of Documentation, vol. 54, no.2, March 1998, pp
Running a Public Library Website, University of Bath, November The Dublin Core Dublin Core Metadata Initiative (DCMI): An initiative to define a core set of metadata elements for resource discovery on the Internet 7 DC workshops –“... the broadest international, interdisciplinary effort in resource description on the Internet... the leading initiative for improving resource discovery on the Web” - Stu Weibel (OCLC)
Running a Public Library Website, University of Bath, November DC elements 15 Elements: Title Subject Description Creator Publisher Contributor Date Type Semantics defined in Internet RFC 2413 (1998); now superseded by DC version 1.1 Format Identifier Source Language Relation Coverage Rights
Running a Public Library Website, University of Bath, November DC qualifiers DC-4 Workshop (Canberra): TYPE, SCHEME and LANGUAGE DC Data Model working group: Element Qualifiers - refine the semantics of a DC element Value Qualifiers - gives context to the element value by –indicating how to parse the value, e.g. an ISO 8601 date –indicating the use of controlled vocabularies, e.g. LCSH, DDC or LCNAF Value Components
Running a Public Library Website, University of Bath, November DC syntax Guidelines and tools developed: “Encoding DC Metadata in HTML” (Internet-Draft) Data Model working group - “Guidance on expressing DC within the RDF” (working draft) Creation tools - e.g., DC-dot: Some examples...
Running a Public Library Website, University of Bath, November
Running a Public Library Website, University of Bath, November DC in HTML (1) Dorset Library Service
Running a Public Library Website, University of Bath, November
Running a Public Library Website, University of Bath, November DC in HTML (2) Bath and North East Somerset Library and Archives
Running a Public Library Website, University of Bath, November
Running a Public Library Website, University of Bath, November DC in RDF/XML <rdf:RDF xmlns:rdf=" xmlns:dc=" EARL, the Consortium for Public Library Networking EARL Consortium Text 4699 bytes en
Running a Public Library Website, University of Bath, November In abbreviated syntax <rdf:RDF xmlns:rdf=" xmlns:dc=" <rdf:Description about=" dc:Title="EARL, the Consortium for Public Library Networking" dc:Creator="EARL Consortium dc:Subject="earl, public libraries, uk, networking, consortium" dc:Publisher="EARL Consortium dc:Date=" " dc:Type="Text" > <rdf:Bag rdf:_1="text/html" rdf:_2="4699 bytes" />
Running a Public Library Website, University of Bath, November DC Implementations DC creation tools –DC-dot –Nordic Metadata Project - Template Metadata-aware indexing tools –DESIRE - Combine Conversion tools –Metadata Cross-walks –Nordic Metadata Project - d2m –Project BIBLINK Interoperability –AHDS Gateway
Running a Public Library Website, University of Bath, November Information Gateways Roles of gateways: Selection –Gateways select resources according to some pre-defined criteria (e.g. subject area, some measure of quality) Creation of metadata –Gateways create simple resource descriptions that can be both searched and browsed
Running a Public Library Website, University of Bath, November The eLib programme JISC funded: Selected gateways (SOSIG, EEVL, OMNI, Biz/ed, History, etc.) ROADS Resource Organisation and Discovery in Subject-based services –Developing Web-based tools for information gateways –Cross-searching (Whois++) –Content creation rules (cataloguing guidelines)
Running a Public Library Website, University of Bath, November
Running a Public Library Website, University of Bath, November
Running a Public Library Website, University of Bath, November The RDN Resource Discovery Network Funded by JISC, ESRC and AHRB Co-operative network: –Independent service providers (hubs) –Resource Discovery Network Centre (RDNC) –Set service standards –Collection management policy –Develop strategic partnerships –Cross-searching across multiple hubs
Running a Public Library Website, University of Bath, November Digital preservation A variety of preservation strategies are available - all are dependent upon the creation, capture and storage of metadata Recent initiatives include: –Reference Model for an Open Archival Information System (OAIS) –Research Libraries Group (RLG) Working Group on Preservation Issues of Metadata –Cedars project - funded by JISC under eLib, managed by Consortium of University Research Libraries (CURL) –Digital Services Project - National Library of Australia
Running a Public Library Website, University of Bath, November UKOLN UKOLN is funded by the Library and Information Commission, the Joint Information Systems Committee (JISC) of the higher education funding councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath, where it is based.