Creating an Open Archives Metadata Harvesting Protocol Compliant Repository for the American Memory Online Collections OAI Open Meeting, Washington, DC.

Slides:



Advertisements
Similar presentations
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Advertisements

OSI and Bibliographic Access: opening a conversation Caroline Arms Kevin Novak Michelle Rago.
Harvesting Metadata Using OAI-PMH Roy Tennant California Digital Library.
National Science Digital Library (NSDL) Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Repositories.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
NOBLE Digital Library. How does it work? The NOBLE Digital Library uses the DSpace platform. Image files and metadata are imported into DSpace using.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
Sai Deng, Metadata Catalog Librarian, Wichita State University Libraries Tse-Min Wang, Graduate Student in CS, Wichita State University Digital Imaging.
A Digital Library Repository Utilizing the Open Archives Initiative Developed to meet the needs of UTK Library Special Collections.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
The TARO Project Texas Archival Resources Online Fred Gilmore Sr Operating Systems Specialist UT Austin General Libraries April.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Getting Started with CONTENTdm Corey Harper, University of Oregon Terry Reese, Oregon State University OLA - April 8, 2005.
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
Enhancing Linkages Between Projects and Datasets: Examples from LBA-ECO for NACP Lisa Wilcox, Amy L. Morrell,
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Open Virginia Tech DLRL Hussein Suleman
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Open Archives Work at Virginia Tech Hussein Suleman Digital Libraries Research Laboratory Virginia Tech.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
SCIELO AS AN OPEN ARCHIVE: the development of SciELO / OpenArchives data provider interface Prof. Carlos H. Marcondes Federal Fluminense University/ Information.
The Resource Discovery Network and OAI Andy Powell UKOLN, University of Bath UKOLN is funded by Resource: The Council.
Roy Tennant Life After MARC A Metadata Infrastructure for the 21st Century.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
NSDL October 12-15, 2003Eisenhower National Clearinghouse Slide 1 NSDL and the Open Archives Initiative NSDL – OAI – and the Eisenhower National Clearinghouse.
1 Overview Finding and importing data sets –Searching for data –Importing data_.
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
National Coastal Data Development Center Status of OPeNDAP at NCDDC 11 September 2003 Susan Starke, Chief of IT Operations
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Standards OAI-Protocol Metadata: DC - Agris - MODS Marc Goovaerts Hasselt University Library ODIN-PI TRAINING OSTENDE, May 2008.
The Open Archives Initiative and the Sheet Music Consortium Jon Dunn, Jenn Riley IU Digital Library Program October 10, 2003.
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
Do Real Archivists Use OAI? Mid-Atlantic Regional Archives Conference Gettysburg, PA October 31, 2003 Chris Prom Assistant University Archivist University.
Metayogi Increasing the Accessibility of the Semantic Web Karim Tharani Doug Macdonald Rachel Heidecker.
Sharing Your Finding Aids in CONTENTdm Encoded Archival Description (EAD) Files in Mountain West Digital Library June 3, 2009 Sandra McIntyre, Mountain.
Mod_oai: Metadata Harvesting for Everyone Michael L. Nelson, Herbert Van de Sompel, Xiaoming Liu, Aravind Elango
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
The Open Archives Initiative: Perspectives on Metadata Harvesting OAI Provider & Harvesting Services at the University of Illinois Timothy W. Cole Mathematics.
Building A Repository for Digital Objects
Information modeling and infrastructures for metadata
Lifecycle …of OAI …of DPs and SPs
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Introduction, Features & Technology
CS 501: Software Engineering Fall 1999
Outline Pursue Interoperability: Digital Libraries
OAI and Metadata Harvesting
OAI 11/20/07.
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Oya Y. Rieger Cornell University Library May 2004
Why IIIF? Shane Huddleston Jeff Mixter Dave Collins Product Manager
Presentation transcript:

Creating an Open Archives Metadata Harvesting Protocol Compliant Repository for the American Memory Online Collections OAI Open Meeting, Washington, DC January 23, 2001 Dave Woodward, Library of Congress

Why is this important for the AM Online Collections? With LC as data provider: –American Memory content could be made more accessible through new value-added services that utilize harvested metadata, e.g., search services or specially targeted services –metadata harvesting would be enabled for other groups such as researchers and educators

Why is this important for the AM Online Collections? Perhaps with LC as a service provider: –integration of content into American Memory could be simplified and standardized –collaborative collection development would be less labor intensive

Why is this important for the AM Online Collections? Participation in the development and testing of OAI has provided valuable practical experience with wide applicability, i.e., –conversion of MARC8 character encoding to UTF-8 w/ Unicode character references –MARC -> dc mapping –MARC -> xml conversion

What collections were used? From American Memory (alpha test): –Map Collections: The focus of Map Collections is Americana and Cartographic Treasures of the Library of Congress. These images were created from maps and atlases... –Dance Instruction Manuals: ca An American Ballroom Companion presents a collection of over two hundred social dance manuals at the Library of Congress

What collections were used? Data characteristics: –same data stores used for AM web site and OAI –item-level MARC descriptive records Why this data? –common data characteristics –various MARC fields used –interesting character set challenges

What is the technical infrastructure? Hardware, OS and systems software: –IBM rs6000, AIX, Apache Application software: –one perl script for handling requests via CGI CPAN stuff: cgi.pm, sgmls.pm existing MARC parsing tools upgrade of existing MARC8 character set tools –another perl script to create index files

How was it implemented? separate from the American Memory online application, but built on same data stores a simple index of identifiers, status, and dates was built –ListIdentifiers becomes an “index only” verb –enables dynamic access to MARC records in flat files

How was it implemented? dynamic translation from MARC –MARC is a popular storage format at LC –flexible for updating/adding metadata formats –flexible for adjusting mappings Resumption token & “retry after” responses –thresholds, time-to-live Verified results in a variety of ways: –Hussein Suleman’s excellent repository explorer! –XSV from w3c.org, some XML Spy

How was it implemented?

What was the level of effort? Very simple: –handling the protocol requests and responses A little more difficult: –selecting and organizing collections Most challenging: –mapping (format crosswalks) –preparing data for transport

Where do we go from here? Expand to include more sets Offer non-MARC collections Experiment with other metadata formats, i.e., LC’s MARC21 xml format Continue to refine MARC mappings and character reference encoding Tune flow controls