OAI Tools By Thomas G. Habing Grainger Engineering Library Information Center University.

Presentation transcript:

OAI Tools By Thomas G. Habing Grainger Engineering Library Information Center University of Illinois at Urbana-Champaign

Outline
Emory’s Metadata Migrator
OAI Static Repositories
UIUC’s OAI FileMakerPro Gateway
Other Tools
Validating

Emory’s Metadata Migrator
Turning it over to Martin, “conquering the digital library world”
(A Google Images search for “Martin Halbert” really did turn up this image of Caesar)

OAI Static Repositories
Back to me, “wrangling metadata”
(This is the closest I could come in Google Images to an unusual picture for “Habing”; I don’t know who this person really is)

OAI Static Repositories: The Problem
OAI-PMH is simple, but not simple enough for:
  – Technically challenged organizations
      Limited resources
      No control over their web server
  – With small collections of records (10-20 MB XML file)
  – That do not change often
      Less frequently than monthly

OAI Static Repositories: The Solution
Static Repository
  – A single XML file containing all metadata, identifiers, and datestamps
  – Accessible from a web server via an HTTP URL
  – May be created manually with an XML or simple text editor, or programmatically
Static Repository Gateway
  – Provides intermediation for one or more Static Repositories

OAI Static Repositories: Official Specification
guidelines-static-repository.htm

OAI Static Repositories: Illustration
[Diagram: Static Repositories, the Static Repository Gateway, and OAI harvesters (OAIster, reap)]

OAI Static Repositories: Static Repository Limitations
Must be a single XML file (MIME type: text/xml)
  – No resumptionTokens
Must be UTF-8 encoded Unicode
Must validate against the Static Repository XML Schema
The baseURL element must be the concatenation of the Static Gateway URL and the Static Repository URL
ListRecords elements must conform to the OAI-PMH record format
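The baseURL rule above can be sketched in a few lines of Python. This is a minimal illustration, not the gateway's actual code: the function name and both example URLs are hypothetical, and it assumes (following one reading of the Static Repository guidelines) that the leading http:// of the Static Repository URL is dropped and a slash is used as the separator.

```python
def gateway_base_url(gateway_url: str, static_repo_url: str) -> str:
    """Join a gateway URL and a Static Repository URL into a baseURL."""
    prefix = "http://"
    if not static_repo_url.startswith(prefix):
        raise ValueError("expected an http:// Static Repository URL")
    # Assumption: the scheme is removed before the two URLs are joined.
    return gateway_url.rstrip("/") + "/" + static_repo_url[len(prefix):]


# Hypothetical gateway and repository locations, for illustration only.
base_url = gateway_base_url(
    "http://gateway.example.org/oai",
    "http://library.example.edu/metadata/static.xml",
)
print(base_url)
```

The value printed here is what would go into the baseURL element of the Static Repository file, and it is also the URL a harvester would use.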

OAI Static Repositories: Additional Limitations
The URL of the Static Repository XML file cannot include a fragment or query string
Sets are not supported
Deleted records are not supported
Response compression is not supported
Only YYYY-MM-DD datestamp granularity is supported
The guidelines for OAI identifiers should be followed:
  – identifier.htm
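Two of these limitations are easy to check mechanically before registering a file with a gateway. The sketch below is only an illustration: the helper names and sample URLs are hypothetical, and it covers just the URL rule and the day-granularity rule.

```python
from datetime import datetime
from urllib.parse import urlsplit


def url_problems(static_repo_url: str) -> list:
    """Report URL features that a Static Repository URL may not have."""
    parts = urlsplit(static_repo_url)
    problems = []
    if parts.query:
        problems.append("URL contains a query string")
    if parts.fragment:
        problems.append("URL contains a fragment")
    return problems


def is_day_granularity(datestamp: str) -> bool:
    """True only for datestamps written exactly as YYYY-MM-DD."""
    try:
        parsed = datetime.strptime(datestamp, "%Y-%m-%d")
    except ValueError:
        return False
    # Round-trip to reject unpadded forms such as 2004-1-5.
    return parsed.strftime("%Y-%m-%d") == datestamp


# Hypothetical inputs, for illustration only.
ok_url = url_problems("http://library.example.edu/metadata/static.xml")
bad_url = url_problems("http://library.example.edu/export?id=7#top")
```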

OAI Static Repositories Static Repository XML Sections … … … … …

OAI Static Repositories Demo no YYYY-MM-DD

OAI Static Repositories oai_dc …

OAI Static Repositories oai:this.edu: Some Title … … …
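The XML shown on the preceding slides did not survive in this transcript. As a rough stand-in, here is a hedged Python sketch that embeds a small Static Repository file and reads its records back with the standard library. The overall shape (Identify, ListMetadataFormats, ListRecords inside a Repository element) follows the Static Repository guidelines as best recalled; the base URL, e-mail address, dates, and the record's local identifier "0001" are made up for illustration and are not the values from the original slides.

```python
import xml.etree.ElementTree as ET

SR = "http://www.openarchives.org/OAI/2.0/static-repository"
OAI = "http://www.openarchives.org/OAI/2.0/"
OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC = "http://purl.org/dc/elements/1.1/"

# Illustrative file; all concrete values below are placeholders.
SAMPLE = f"""<Repository xmlns="{SR}" xmlns:oai="{OAI}">
  <Identify>
    <oai:repositoryName>Demo</oai:repositoryName>
    <oai:baseURL>http://gateway.example.org/oai/this.edu/static.xml</oai:baseURL>
    <oai:protocolVersion>2.0</oai:protocolVersion>
    <oai:adminEmail>admin@this.edu</oai:adminEmail>
    <oai:earliestDatestamp>2004-01-01</oai:earliestDatestamp>
    <oai:deletedRecord>no</oai:deletedRecord>
    <oai:granularity>YYYY-MM-DD</oai:granularity>
  </Identify>
  <ListMetadataFormats>
    <oai:metadataFormat>
      <oai:metadataPrefix>oai_dc</oai:metadataPrefix>
      <oai:schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</oai:schema>
      <oai:metadataNamespace>{OAI_DC}</oai:metadataNamespace>
    </oai:metadataFormat>
  </ListMetadataFormats>
  <ListRecords metadataPrefix="oai_dc">
    <oai:record>
      <oai:header>
        <oai:identifier>oai:this.edu:0001</oai:identifier>
        <oai:datestamp>2004-01-01</oai:datestamp>
      </oai:header>
      <oai:metadata>
        <oai_dc:dc xmlns:oai_dc="{OAI_DC}" xmlns:dc="{DC}">
          <dc:title>Some Title</dc:title>
        </oai_dc:dc>
      </oai:metadata>
    </oai:record>
  </ListRecords>
</Repository>"""


def list_records(xml_bytes: bytes) -> list:
    """Return (identifier, datestamp, title) for each record in the file."""
    ns = {"sr": SR, "oai": OAI, "dc": DC}
    root = ET.fromstring(xml_bytes)
    rows = []
    for record in root.findall("sr:ListRecords/oai:record", ns):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=ns)
        datestamp = record.findtext("oai:header/oai:datestamp", namespaces=ns)
        title = record.findtext(".//dc:title", namespaces=ns)
        rows.append((identifier, datestamp, title))
    return rows


# The file must be UTF-8, so parse the encoded bytes.
records = list_records(SAMPLE.encode("utf-8"))
```

Parsing the file this way only shows that it is well formed; it does not replace validation against the Static Repository XML Schema.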

UIUC’s OAI FileMakerPro Gateway
[Diagram: FileMakerPro Databases, the OAI FileMaker Gateway, and OAI harvesters (OAIster, reap)]

OAI FileMakerPro Gateway: The Problem
FMP is widely used in the museum community and is often used for special collections in libraries
Until recently there were no easy or convenient tools for making FMP databases OAI accessible
Could use Emory’s Metadata Migrator (or similar tools), but there could be latency problems if the database is active

OAI FileMakerPro Gateway: Solution
Out of the box, FMP has a built-in web server and can export XML
This facilitates a solution similar to OAI Static Repositories
Except it is not static; data is fed directly from the database and not from a static copy
  – This is a slight fib: because of how datestamps are derived, they only have a granularity of one day, so an incremental harvest might be up to 24 hours out of date

OAI FileMakerPro Gateway: Some Technical Details
How to Get XML From FMP
?-db=database&-lay=layout&-format=format&-max=max_records&-skip=skip-records&-recid=record_id&-command
  – -lay: short layout for ListIdentifiers, full layout for ListRecords
  – -format: -fmp_xml or -dso_xml (easier to transform)
  – -command: -find, -dbnames, -layoutnames, etc.
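To make the query string above concrete, here is a small Python sketch that assembles such a request. The parameter names come from the slide; the function name, host, path, database name, and layout name are hypothetical placeholders, since the actual URL did not survive in the transcript.

```python
from urllib.parse import urlencode


def fmp_xml_url(base, database, layout, xml_format="-dso_xml",
                max_records=50, skip=0, command="-find"):
    """Build an FMP XML request from the parameters listed on the slide."""
    params = [
        ("-db", database),
        ("-lay", layout),
        ("-format", xml_format),
        ("-max", str(max_records)),
        ("-skip", str(skip)),
    ]
    # The command (for example -find) is a bare flag with no value.
    return f"{base}?{urlencode(params)}&{command}"


# Hypothetical host, path, database, and layout names.
url = fmp_xml_url("http://fmp.example.org/FMPro", "artifacts", "short")
print(url)
```

Paging through a large database would then be a matter of increasing the skip value by max_records on each request.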

OAI FileMakerPro Gateway: More Technical Details
FMP XML Formats
  – The -dso_xml format:
      Easier to transform with XSLT
      But may be malformed in some cases (the gateway can accommodate this)
      The XML Schema varies by database
      Same as XML export format used by MS SQL Server
  – The -fmp_xml format:
      Always the same XML Schema regardless of the database
      Difficult to transform

OAI FileMakerPro Gateway: More Technical Details
Datestamps
  – All FMP records have a RECORDID and a MODID
  – The MODID increments each time the record is changed, so it can be used as a surrogate for the datestamp
  – When a new FMP database is added to the Gateway, every RECORDID and MODID is recorded locally, and each record is assigned the current date as its datestamp
  – Once a day, the MODID of each record is compared against the locally stored value, and the datestamp of the record is set to the current date if the MODID has changed
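The datestamp bookkeeping described above can be sketched as follows. This is an illustration of the idea rather than the gateway's code: the function names and the in-memory dictionary are hypothetical, and treating a previously unseen RECORDID as "changed" is an assumption the slide does not spell out.

```python
from datetime import date


def register(store: dict, records: dict, today: date) -> None:
    """Record every RECORDID/MODID pair and stamp it with today's date."""
    for record_id, modid in records.items():
        store[record_id] = {"modid": modid, "datestamp": today}


def daily_update(store: dict, current: dict, today: date) -> list:
    """Compare current MODIDs with stored ones; restamp changed records."""
    changed = []
    for record_id, modid in current.items():
        known = store.get(record_id)
        # Assumption: a record not seen before is treated as changed.
        if known is None or known["modid"] != modid:
            store[record_id] = {"modid": modid, "datestamp": today}
            changed.append(record_id)
    return changed


# Illustrative data: record "2" is edited and record "3" is new on day two.
store = {}
register(store, {"1": 3, "2": 7}, date(2005, 11, 1))
changed = daily_update(store, {"1": 3, "2": 8, "3": 0}, date(2005, 11, 2))
```

Because the comparison runs once a day, an edit made just after a run is not visible to an incremental harvest until the next run, which is the one-day granularity mentioned on the earlier slide.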

OAI FileMakerPro Gateway Configuring the Gateway

OAI FileMakerPro Gateway: Covert Implementations
It is relatively easy to identify and intermediate FMP databases using the Gateway
Use Google to find them
Gather configuration details like layouts, etc.
Write an XSLT to transform -dso_xml into oai_dc
Most FMP database owners probably don’t even realize how easy it is for someone to perform a wholesale download of their entire database
  – Good for OAI implementers
  – But FMP database owners, be careful of sensitive data!
  – Make sure the web-based edit features are secured!

OAI FileMakerPro Gateway: An Invitation
We are looking for FMP collections we can test with the Gateway
We do plan to maintain the Gateway, similar to our OAI Static Gateway

Other OAI Gateways
Z39.50 / OAI-PMH
  – ZMARCO
SRU/W / OAI-PMH
  – 02sanderson.html

Open Source OAI Toolkits
OCLC
UIUC Grainger Engineering Library
Virginia Tech DLRL Projects
Lots of other Open Source tools

OAI Turnkey Solutions
Adlib, CWIS, ContentDM, Digitool, DLESE, DLXS, DSpace, EPrints, Encompass, Fedora, Greenstone, Ockham, others…

How to Test Your OAI Provider
Repository Explorer
  – Good start, but does not do a complete harvest, nor does it check non-oai_dc metadata formats, so it can’t find all problems
W3C Validator for XML Schema
  – Great for pinpointing obscure XML Schema validation errors or character encoding problems
  – Only one request at a time though
Character Encoding Problems
Try to harvest your OAI provider yourself
  – Use REAP, the Windows command line OAI harvester from UIUC
  – Use the U. Michigan Harvester (Kat can provide more detail)
Ask one of us to do it
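As a complement to the tools listed above, a quick local check for the character encoding problems mentioned here can be sketched in Python. This is a hypothetical helper, not part of any of the named tools: it only tests whether a saved response is valid UTF-8 and well-formed XML, and it does no schema validation.

```python
import xml.etree.ElementTree as ET


def check_response(data: bytes) -> list:
    """Return a list of basic encoding and well-formedness problems."""
    problems = []
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        problems.append(f"invalid UTF-8 at byte {err.start}")
    try:
        ET.fromstring(data)
    except ET.ParseError as err:
        problems.append(f"not well-formed XML: {err}")
    return problems


# Illustrative inputs: a clean fragment, and one saved as Latin-1 by mistake.
good = b'<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/"></OAI-PMH>'
bad = "<title>Caf\u00e9</title>".encode("latin-1")
good_problems = check_response(good)
bad_problems = check_response(bad)
```

Running a check like this on each response of a full harvest tends to surface the stray non-UTF-8 byte that a single-request validator would miss.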