OAI-PMH The Open Archives Initiative Protocol for Metadata Harvesting Presenter: Knud Möller Friday,
OAI-PMH - Knud Möller, DERI Galway Content Basic idea behind OAI-PMH Architectural Overview – Repositories and Harvesters – Resources, Items and Records Internal Record Format Sets Selective Harvesting Response Format Command Overview
OAI-PMH - Knud Möller, DERI Galway Basic idea behind OAI-PMH provide a standard protocol for the harvesting/ querying of metadata about any kind of resource - “What kind of resources can you provide and what are their properties?“ OAI-PMH is only the protocol, needs to be implemented some implementations exist: – Emblem Project Utrecht – Virginia Tech (VTOAI)
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Repositories and Harvesters Repository Harvester
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Repositories and Harvesters Repository Harvester Request Harvesters issue OAI-PMH requests for metadata via HTTP.
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Repositories and Harvesters Repository Harvester Request Response Harvesters issue OAI-PMH requests for metadata via HTTP. A Repository processes the OAI-PMH requests and has to implement the protocol.
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Resources, Items and Records Resource Anything - physical artifact, a digital resource, a concept, etc. Whatever the metadata is about.
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Resources, Items and Records Resource Item Representation of resource in repository. Can disseminate metadata in various formats. Must always provide Dublin Core. Has unique identifier. Anything - physical artifact, a digital resource, a concept, etc. Whatever the metadata is about. oai:arXiv.org:cs/
OAI-PMH - Knud Möller, DERI Galway Architectural Overview Resources, Items and Records Resource Item Record (oai_dc) Record (lom) Record (id3) XML-encoded byte stream of actual metadata. Anything - physical artifact, a digital resource, a concept, etc. Whatever the metadata is about. Representation of resource in repository. Can disseminate metadata in various formats. Must always provide Dublin Core. Has unique identifier. oai:arXiv.org:cs/
OAI-PMH - Knud Möller, DERI Galway Internal Record Format I
OAI-PMH - Knud Möller, DERI Galway Internal Record Format I oai:arXiv.org:cs/ cs math
OAI-PMH - Knud Möller, DERI Galway Internal Record Format II <oai_dc:dc xmlns:oai_dc=" xmlns:dc=" xmlns:xsi=" xsi:schemaLocation= " Using Structural Metadata to Localize Experience of Digital Content Dushay, Naomi Digital Libraries With the increasing [..bla..] to particular communities of users e-print
OAI-PMH - Knud Möller, DERI Galway Internal Record Format III <provenance xmlns=" xmlns:xsi=" xsi:schemaLocation= " <originDescription harvestDate=" T14:10:02Z" altered="true"> oai:r2.org:klik
OAI-PMH - Knud Möller, DERI Galway Sets Items can be organized into sets. Sets can either be organized flat or hierarchically. setNamesetSpec Institutions institution Oceanside University of Nebraska institution:nebraska Valley View University of Florida institution:florida Subject subject Existential Kenesiology subject:kenesiology Quantum Psychology subject:quantum
OAI-PMH - Knud Möller, DERI Galway Selective Harvesting Harvesters can specify some constraints on which items they are interested in Regarding datestamps: – only items that where created, modified or deleted (optional) in a certain time period Regarding sets: – only items that belong to a specific set (or any of its subsets)
OAI-PMH - Knud Möller, DERI Galway Response Format <OAI-PMH xmlns=" xmlns:xsi=" xsi:schemaLocation= " T19:20:30Z <request verb="GetRecord" identifier="oai:arXiv.org:hep-th/ “ metadataPrefix="oai_dc">
OAI-PMH - Knud Möller, DERI Galway Command Overview I GetRecord: get a specific record, must specify item‘s URI and metadata prefix Identify: retrieve information about a repository (name, protocol version, supports deletion,...) ListRecords: get either all records or a subset, must specify metadata prefix ListIdentifiers: like ListRecords, but retrieves only headers
OAI-PMH - Knud Möller, DERI Galway Command Overview II ListMetadataFormats: lists the available metadata formats of a repository ListSets: returns the set structure of a repository
OAI-PMH - Knud Möller, DERI Galway References OAI-PMH specification: chivesprotocol.htm chivesprotocol.htm
OAI-PMH - Knud Möller, DERI Galway Thanks and goodbye!