OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley

Slides:



Advertisements
Similar presentations
Geospatial One-Stop A Federal Gateway to Federal, State & Local Geographic Data
Advertisements

Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
Y.T. a brief history of the OAI 0 Kaynak: Herbert van de Sompel.
OAI in DigiTool DigiTool Version 3.0.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
National Science Digital Library (NSDL) Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
OAI-PMH at Yale Report on the DLF OAI Training Session November 10, 2005 Charlottesville, VA.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Introduction to the OAI Metadata Harvesting Protocol Hussein Suleman, Digital Library Research Laboratory Virginia Tech.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
Open Archives Iniative – Protocol for Metadata Harvesting Iztok Kavkler, University of Ljubljana Some slides by Stefaan Ternier, KUL Bram Vandenputte,
Metadata Harvesting Interoperable digital collections.
ALA 2002 LITA Open Source Software Open Archives Initiative Hussein Suleman AmericanSouth.org 14 June 2002.
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
LIS 654 BUILDING DIGITAL LIBRARIES FALL 2011 NOVEMBER 03, 2011 The OAI-PMH Harvester Plugin for The Omeka Content Management System JAMES R. GRIFFIN III.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 DLESE-IMS Metadata, ADN Metadata and the DLESE Catalog System.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Introduction to Digital Libraries hussein suleman uct cs honours 2004.
The OAI Protocol for Metadata Harvesting Van de Sompel, Herbert Los Alamos National Laboratory – Research Library.
Metadata harvesting in regional digital libraries in PIONIER Network Cezary Mazurek, Maciej Stroiński, Marcin Werla, Jan Węglarz.
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
Digital Library Interoperability Architecture CS 502 – Carl Lagoze – Cornell University.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Protocol for Metadata Harvesting hussein suleman uct cs honours 2006.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
NSDL October 12-15, 2003Eisenhower National Clearinghouse Slide 1 NSDL and the Open Archives Initiative NSDL – OAI – and the Eisenhower National Clearinghouse.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Enforcing Interoperability with the Open Archives Initiative Repository Explorer Hussein Suleman, Digital Library Research.
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Metadata and OAI DLESE OAI Workshop June 29 to July 2, 2002 Katy Ginger Presentation available at:
Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives Hussein Suleman, Digital Library Research.
Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives Hussein Suleman, Digital.
The OAI: technical overview OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University -- Computer Science.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Open Archives Initiative Protocol for Metadata Harvesting.
Metadata Harvesting Interoperable digital collections.
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
DLESE Metadata Frameworks March Talk Organizer Terminology DLESE metadata history (DC/IMS to DLESE- IMS to ADN) ADN Collection News-opps Object.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
NDLTD Toward Universal Accessibility of ETDs: Building the NDLTD Union Archive Hussein Suleman, Edward A. Fox,
OAI and ODL Building Digital Libraries from Components Ryan Richardson Virginia Tech DLRL 18 September 2003.
OAI and ODL Building Digital Libraries from Components Hussein Suleman Virginia Tech DLRL 12 September 2002.
NDLTD Standards, Metadata and the OAI-PMH Hussein Suleman University of Cape Town October 2003.
Harvesting and Exporting Metadata 714: Metadata Margaret E.I. Kipp -
1 XML and XML in DLESE Katy Ginger November 2003.
Getting a Leg Up on OAI for the NSDL
Georges Arnaout Chaitanya Krishna
OAI and Metadata Harvesting
OAI 11/20/07.
Open Archive Initiative
IVOA Interoperability Meeting - Boston
Presentation transcript:

OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley

DLESE OAI April 29-30, Workshop Schedule Day 1 Morning Overview of OAI Look at OAI tools and resources Afternoon DLESE OAI software installation, configuration and setup Day 2 Morning Overview of NDSL and DLESE interoperability architecture NSDL metadata overview Metadata and OAI

DLESE OAI April 29-30, Resources Workshop presentation slides, links to tools and other OAI resources are located at:

DLESE OAI April 29-30, What is DLESE and NSDL? DLESE: Digital Library for Earth System Education: provides access to digitally accessible resources for learning about the Earth system NSDL: National Science (STEM) Digital Library: network of scholarly and educational digital libraries related to science (DLESE will be part of this network)

DLESE OAI April 29-30, What is the OAI? What is the Open Archive Initiative (OAI)? Organization dedicated to solving problems of digital library interoperability by defining simple protocols and standards Grew out of the e-prints (arXiv) community at Los Alamos What is the OAI Protocol for Metadata Harvesting (OAI-PMH)? Protocol to transfer metadata from a source archive to a destination archive How is the OAI-PMH Being Used by the NSDL and DLESE? The OAI-PMH has been adopted as a primary means of gathering and sharing metadata among contributors Also used to facilitate internal management of metadata stores

DLESE OAI April 29-30, What is Metadata? Data refers to digital objects e.g. the resources themselves Metadata is data about data e.g. a description about a resource, not the resource itself OAI is used to transmit metadata

DLESE OAI April 29-30, Definitions / Concepts Basic Principles Harvesting vs. Federation Data Providers vs. Service Providers Underlying Technology HTTP and XML XML Namespaces and Schema Protocol Policies and Conventions Basic Policies Sets

DLESE OAI April 29-30, Harvesting vs. Federation Competing approaches to interoperability Federation is when services such as searching are run remotely Harvesting is when metadata is transferred from remote sources to the destination where the services are located Federation requires more effort at the remote site but is easier for the local system Harvesting requires less effort at the remote site; Services are provided by the local system OAI uses the harvesting model

DLESE OAI April 29-30, Data Providers vs. Service Providers Data Providers refer to entities who possess metadata and are willing to share this with others (e.g. collection builders) Service Providers are entities who harvest data from Data Providers in order to provide higher-level services to users (e.g. searching, browsing, recommender systems, etc.). The NSDL and DLESE are examples.

DLESE OAI April 29-30, Features of the OAI Approach Lightweight: Low overhead for Data Providers Protocol is relatively simple to implement Many plug-and-play tools publicly available Transports any metadata framework that can be made available in XML form (details to come) Details of searching, browsing, annotation and other advanced services are handled by the Service Provider

DLESE OAI April 29-30, Data Providers: (collection builders) … … … Service Provider (DLESE, NSDL) Harvested Records 3. Provide searching, browsing, and other services over the data. OAI protocol (over http) 1. Service Provider polls periodically for new records 2. New records downloaded and cached by the Service Provider Metadata Harvesting Framework Library User

DLESE OAI April 29-30, HTTP and XML The OAI-PMH is an almost stateless request/response protocol Requests and responses are sent via the HTTP protocol Requests are encoded as GET/POST operations Responses are well-formed XML documents

DLESE OAI April 29-30, Well-formed and Valid XML Correct Dodge Spirit 1994 you CO Incorrect Dodge Spirit 1994 CO you

DLESE OAI April 29-30, DTD, Schemas & Namespace DTD’s: Document Type Definition Describe the elements of XML instance documents Not well-formed XML Some data-typing Namespaces harder to deal with Schemas Describe the elements of XML instance documents Well-formed XML Strong data-typing Namespaces are easier to deal with Namespace: Collection of related element names identified by a name label (e.g. dc)

DLESE OAI April 29-30, XML Namespaces and Schema Consistency and data quality is ensured by using XML Schema descriptions for each possible response XML Namespaces are used where necessary to clearly define which parts of the responses are actual metadata and which support the OAI-PMH. Example: bin/OAI/CSTC.pl?verb=GetRecord&identifier=oai%3ACSTC%3A103&metad ataPrefix=oai_dc

DLESE OAI April 29-30, Basic OAI Policies and Conventions Each metadata record from a given Data Provider must have a unique ID (OAI ID is not necessarily the same as the record ID) Each metadata record must be persistent so that Service Providers can always refer back to the source Each record must have a date stamp indicating creation / modification date Dates provide a mechanism for incremental and continuous transfer of metadata by only requesting records that have changed since the previous harvest Flow Control - Resumption Tokens can be used to return partial results – the client is issued a token which may be presented to the server to receive more results Multiple formats of metadata are allowed Examples: Dublin Core, DLESE IMS

DLESE OAI April 29-30, Sets OAI-PMH mechanism to allow for harvesting of sub- collections Semantics for sets are defined outside of the protocol Sets are defined by conventions established between data and service providers Example sets within DLESE might be: DWEL, COMET, LDEO, etc. Example sets within the NDSL might be: DLESE, DLESE:DWEL, DLESE:COMET, DLESE:LDEO, etc. Sets can be established that enable querying (e.g. by topic, author name, subject area, etc.) Example: The Open Digital Library (Suleman, 2001)

DLESE OAI April 29-30, Requirements to be a Data Provider Source of metadata Human or automated resource catalogers Metadata mappings Crosswalks from native formats to DC or other formats Server technology Handled by the OAI software Datestamps Deletions Unique identifiers

DLESE OAI April 29-30, The OAI-PMH Service Requests Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords Date Ranges Resumption Tokens

DLESE OAI April 29-30, Identify Purpose Return general information about the archive and its policies Parameters None Sample URL

DLESE OAI April 29-30, ListMetadataFormats Purpose List metadata formats supported by the archive as well as their schema locations and namespaces Parameters Identifier – for a specific record ( O ) Sample URL

DLESE OAI April 29-30, ListSets Purpose Provide a hierarchical listing of sets in which records may be organized Parameters None Sample URL

DLESE OAI April 29-30, GetRecord Purpose Returns the metadata for a single identifier in the form on an OAI record Parameters identifier – id for the record ( R ) metadataPrefix – metadata format ( R ) Sample URL SE &metadataPrefix=dlese_ims SE &metadataPrefix=dlese_ims

DLESE OAI April 29-30, ListIdentifiers Purpose List all unique identifiers corresponding to the record in the repository Parameters from – start date ( O ) until – end date ( O ) resumptionToken – flow control mechanism ( X ) Sample URL

DLESE OAI April 29-30, ListRecords Purpose Retrieves metadata for multiple records Parameters from – start date ( O ) until – end date ( O ) resumptionToken – flow control mechanism ( X ) set – set to harvest from ( O ) metadataPrefix – metadata format ( R ) Sample URL

DLESE OAI April 29-30, DLESE Architecture Metadata Repository Collections DLESE Portal Search & Discovery Direct Entry OAI Resources Services: (e.g. What’s New) NSDL OAI Library Users

DLESE OAI April 29-30, References 1.“Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives,” Hussein Suleman JCDL “A Framework for Building Open Digital Libraries,” Hussein Suleman and Edward A. Fox, in D-Lib Magazine, December, The Open Archives Initiative