OAI-PMH for Resource Harvesting Tutorial, OAI4, October 20th 2005, CERN, Geneva, Switzerland. The American Physical Society Project: Standards-based Mirroring.

Presentation transcript:

OAI-PMH for Resource Harvesting Tutorial, OAI4, October 20th 2005, CERN, Geneva, Switzerland
The American Physical Society Project: Standards-based Mirroring of Digital Library Content
Jeroen Bekaert and Herbert Van de Sompel
Digital Library Research & Prototyping Team, Research Library, Los Alamos National Laboratory
This work supported in part by the Library of Congress

Context
Add APS collection to locally hosted LANL collection:
o Remain permanently synced
o Ensure correctness of locally stored APS data
Bigger picture:
o Archive APS content
o Create efficient content transfer/mirroring approach between information providers & LANL
o NDIIPP: Create efficient content transfer/mirroring approach between heterogeneous content repositories
  - Efficient mechanisms are largely non-existent
  - Devise a standards-based approach:
    - MPEG-21 DIDL
    - OAI-PMH
    - W3C XML Signatures

Bigger picture: OAIS perspective

APS / LANL mirroring process
[Diagram labels: APS repository; OAI-PMH repository; LANL pre-ingest & ingest; OAI-PMH harvester; OAI-PMH request; OAI-PMH response; aDORe repository]
o APS Digital Object represented as application-neutral MPEG-21 DIDL document & exposed through OAI-PMH front-end
o Each datastream provided via a DIDL document is accorded a digest; digests delivered in the DIDL document via W3C XML Signatures
o A complete DIDL document is accorded a digest; delivered in the OAI-PMH «about» container via W3C XML Signature

APS / LANL mirroring process
Remain synced via OAI-PMH datestamp-based harvesting of DIDL documents:
o New APS Digital Objects
o Updated APS Digital Objects
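Datestamp-based harvesting can be illustrated with a minimal Python sketch that builds an OAI-PMH ListRecords request restricted to a datestamp window. The verb and the from, until and metadataPrefix arguments are part of OAI-PMH 2.0; the base URL and the "didl" prefix value are assumptions made for illustration, not the values used in the APS / LANL project.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix, from_date=None, until_date=None):
    """Build an OAI-PMH ListRecords request limited to a datestamp window."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    # 'from' and 'until' are optional; omitting both requests a full harvest.
    if from_date:
        params["from"] = from_date
    if until_date:
        params["until"] = until_date
    return base_url + "?" + urlencode(params)


# Hypothetical repository URL and metadata prefix, for illustration only.
url = list_records_url("https://example.org/oai", "didl", from_date="2005-10-01")
print(url)
```

A harvester that records the date of its last successful run and passes it as from_date on the next run would receive only new and updated records.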

APS / LANL mirroring process
Datastreams delivered By-Value and/or By-Reference:
o By-Reference requires dereferencing of datastream post harvest
Storage in pre-ingest area:
o Harvested DIDL documents in XMLtape
o Dereferenced content in ARC files
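The By-Value / By-Reference distinction can be sketched in Python as follows. The sample document is invented for illustration, and the namespace URI and the ref and encoding attributes on Resource are assumptions based on the MPEG-21 DIDL schema; the actual APS documents may be structured differently.

```python
import base64
import xml.etree.ElementTree as ET

# Assumed DIDL namespace; check it against the documents actually harvested.
DIDL_NS = "urn:mpeg:mpeg21:2002:02-DIDL-NS"

# Invented sample: one inline (By-Value) and one linked (By-Reference) datastream.
SAMPLE = """<didl:DIDL xmlns:didl="urn:mpeg:mpeg21:2002:02-DIDL-NS">
  <didl:Item>
    <didl:Component>
      <didl:Resource mimeType="text/plain" encoding="base64">aGVsbG8=</didl:Resource>
    </didl:Component>
    <didl:Component>
      <didl:Resource mimeType="application/pdf" ref="https://example.org/paper.pdf"/>
    </didl:Component>
  </didl:Item>
</didl:DIDL>"""


def classify_resources(didl_xml):
    """Split Resource elements into inline payloads and URLs still to be fetched."""
    root = ET.fromstring(didl_xml)
    by_value, by_reference = [], []
    for res in root.iter("{%s}Resource" % DIDL_NS):
        ref = res.get("ref")
        if ref is not None:
            # By-Reference: the datastream must be dereferenced after the harvest.
            by_reference.append(ref)
        else:
            text = (res.text or "").strip()
            if res.get("encoding") == "base64":
                by_value.append(base64.b64decode(text))
            else:
                by_value.append(text.encode("utf-8"))
    return by_value, by_reference


by_value, by_reference = classify_resources(SAMPLE)
print(by_value, by_reference)
```

In this sketch the URLs collected in by_reference are the ones a harvester would still need to fetch and store alongside the harvested DIDL document.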

APS / LANL mirroring process
Verification of digests:
o DIDL document
o Datastreams
Digest correct: continue
Digest incorrect: reharvest
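The verify-or-reharvest rule can be sketched in Python as below. This is a simplification under stated assumptions: it hashes raw bytes with SHA-1 and compares base64-encoded values, whereas a real W3C XML Signature check also involves canonicalization and the algorithm declared in the signature. The function names, the fetch callable and the retry limit are hypothetical.

```python
import base64
import hashlib


def digest_b64(payload):
    """Return the base64-encoded SHA-1 digest of a byte string."""
    return base64.b64encode(hashlib.sha1(payload).digest()).decode("ascii")


def verify_digest(payload, expected_b64):
    """True when the payload's digest matches the value delivered with it."""
    return digest_b64(payload) == expected_b64.strip()


def harvest_with_verification(fetch, expected_b64, max_attempts=3):
    """Call fetch() until the digest verifies; give up after max_attempts."""
    for attempt in range(1, max_attempts + 1):
        payload = fetch()
        if verify_digest(payload, expected_b64):
            return payload, attempt  # digest correct: continue
        # digest incorrect: fall through and reharvest
    raise ValueError("digest mismatch after %d attempts" % max_attempts)


# Simulated transfer: the first attempt is corrupted, the second is intact.
good = b"datastream bytes"
expected = digest_b64(good)
responses = iter([b"corrupted", good])
payload, attempts = harvest_with_verification(lambda: next(responses), expected)
print(attempts)
```

The same check applies at both levels named on the slide: once to the complete DIDL document and once to each constituent datastream.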

APS / LANL mirroring process
Ingest Digital Objects:
o Map application-neutral DIDL documents to aDORe-profile DIDL documents
o Insert digests per constituent datastream (W3C XML Signatures)
o Store in aDORe XMLtape/ARCfile environment

APS / LANL mirroring process
o Recurrent introspection in both repositories
o Ability to harvest in both directions in case of problems with stored Digital Objects

Software
OAIResource: generic Java-based OAI-PMH resource harvesting software package:
o Goal: gather resources by OAI-PMH harvesting first
o Can deal with OAI-PMH repositories irrespective of their supported metadata formats
o Plug-in structure makes the process of dereferencing datastreams configurable per OAI-PMH repository
o Results of harvesting/gathering stored as follows:
  - OAI-PMH records concatenated into XMLtapes
  - Datastreams concatenated into Internet Archive ARC files
o Log files:
  - List successful and unsuccessful harvesting/gathering
  - List relationship between OAI-PMH records in XMLtapes and datastreams in ARC files

Papers
o Jeroen Bekaert and Herbert Van de Sompel. A Standards-based Solution for the Accurate Transfer of Digital Assets. D-Lib Magazine, June. http://dx.doi.org/ /june2005-bekaert
o Jeroen Bekaert, Herbert Van de Sompel. Access Interfaces for Open Archival Information Systems based on the OAI-PMH and the OpenURL Framework for Context-Sensitive Services. Preprint: draft of an accepted submission for PV 2005 "Ensuring Long-term Preservation and Adding Value to Scientific and Technical data".
o Herbert Van de Sompel, Jeroen Bekaert, Xiaoming Liu, Lyudmila Balakireva, Thorsten Schwander. aDORe: a modular, standards-based Digital Object Repository. The Computer Journal. Preprint at arXiv:cs.DL/ ; Computer Journal paper at doi: /comjnl/bxh114