A Digital Library Repository Utilizing the Open Archives Initiative Developed to meet the needs of UTK Library Special Collections.

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

A centre of expertise in digital information management The OAI Protocol for Metadata Harvesting Andy Powell UKOLN,
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
OAI in DigiTool DigiTool Version 3.0.
Harvesting Metadata Using OAI-PMH Roy Tennant California Digital Library.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Europeana: Europe's Digital Library, Museum and Archive Ashley Carter and Dana Sagona.
Digital Encoding What’s behind E-text Resources?.
DUBLIN CORE: BEYOND THE LIBRARY David Hirsch LIS Knowledge Organization Dr. Selenay Aytac Spring 2013.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
Introduction to the OAI Metadata Harvesting Protocol Hussein Suleman, Digital Library Research Laboratory Virginia Tech.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Open Archives Iniative – Protocol for Metadata Harvesting Iztok Kavkler, University of Ljubljana Some slides by Stefaan Ternier, KUL Bram Vandenputte,
Metadata Harvesting Interoperable digital collections.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Metadata Helen Aristar Dry Eastern Michigan University LINGUIST List.
Creating an Open Archives Metadata Harvesting Protocol Compliant Repository for the American Memory Online Collections OAI Open Meeting, Washington, DC.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
Introduction to Digital Libraries hussein suleman uct cs honours 2004.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
The OAI Protocol for Metadata Harvesting Van de Sompel, Herbert Los Alamos National Laboratory – Research Library.
Metadata harvesting in regional digital libraries in PIONIER Network Cezary Mazurek, Maciej Stroiński, Marcin Werla, Jan Węglarz.
Digital Library Interoperability Architecture CS 502 – Carl Lagoze – Cornell University.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Bitter Harvest Metadata Harvesting Issues, Problems, and Possible Solutions Roy Tennant California Digital Library.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
The OAI: technical overview OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University -- Computer Science.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Open Archives Initiative Protocol for Metadata Harvesting.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata Harvesting Interoperable digital collections.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Standards OAI-Protocol Metadata: DC - Agris - MODS Marc Goovaerts Hasselt University Library ODIN-PI TRAINING OSTENDE, May 2008.
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
OAI and ODL Building Digital Libraries from Components Hussein Suleman Virginia Tech DLRL 12 September 2002.
The Open Archives Initiative: Perspectives on Metadata Harvesting OAI Provider & Harvesting Services at the University of Illinois Timothy W. Cole Mathematics.
Open your Alfresco Data
Getting a Leg Up on OAI for the NSDL
University of Illinois at Urbana-Champaign OAI Alpha Experiences
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Georges Arnaout Chaitanya Krishna
OAI and Metadata Harvesting
OAI 11/20/07.
Open Archive Initiative
Presentation transcript:

A Digital Library Repository Utilizing the Open Archives Initiative Developed to meet the needs of UTK Library Special Collections

Tremendous quantities of valuable information exist in Museums, Libraries, and Research Centers which are not available in a standardized format via centralized search engines Photos and videosScientific recordsMathematical findings Musical scores and sound tracks Historical Documents Theses and Dissertations The Problem : How to make the connection???

 Translation of records: Into a Common Format and Language: XML & Unqualified Dublin Core  Storage: of these translations  Response: to a standardized set of queries The Open Archives Solution:  Gather document descriptions from Repositories into large databases, using OAI Harvesters  Set up search engines to offer up information in these databases

Photos and videosScientific recordsMathematical findings Musical scores and sound tracks Historical Documents Theses and Dissertations Required For Translation:  Understanding of XML and XML schemas  Determining correct mapping of information to Unqualified Dublin Core Elements, in order to translate legacy files into a metadata format supported by the Open Archives Initiative  Scripts to reduce the labor of translation

The 15 elements of Dublin Core Unqualified: A Common Language…. Dublin Core Content: Title Description Coverage Relation Source Subject Type Intellectual Property: Contributor Creator Publisher Rights Instantiation: Date Format Identifier Language

A Common Framework: XML schemas The XML schema constrains each element of the document, providing rules and framework for parsing:

SCHEME="LCSH"> Letters Cherokee Indians—Claims against Tennessee From a TEI Lite SGML file segment: To an Unqualified Dublin Core XML file segment: Letters Cherokee Indians Claims against Tennessee A Common Format…. XML

[Letter] July 8, 1839, Washington City DC, [to] HP King, Qualla Town / William Holland Thomas: a machine-readable transcription of an image … Thomas, William Holland … The University of Tennessee Libraries wt025 … This work is the property of the Special Collections Library, University of Tennessee, Knoxville, TN. It may be used freely by individuals for research, teaching, and personal use as long as this statement of availability is included in the text. … July 8, 1839 … This document is a letter dated July 8, 1839 to H.P. King from William Holland Thomas with instructions for running the Indian Store. … KEYWORDS SCHEME="LCSH"> Cherokee Indians Government relations /KEYWORDS> … … Selected Portions of a TEI-Lite SGML record

… Translated to XML Unqualified Dublin Core [Letter] July 8, 1839, Washington City DC, [to] HP King, QuallaTown The University of Tennessee Libraries, Knoxville Southeastern Native American Documents Collection (GALILEO (Georgia statewide project)) GAGAL Thomas, William Holland The University of Tennessee Libraries July 8, 1839 This document is a letter dated July 8, 1839 toH.P. King from William Holland Thomas with instructions for running the Indian Store. Document ID: wt025 Cherokee Indians Government relations This work is the property of the Special Collections Library, University of Tennessee, Knoxville, TN. It may be used freely by individuals for research, teaching, and personal use as long as this statement of availability is included in the text. letter computer file

Crosswalks available: MARC to DC: Shown in action at: OTHERS: _crosswalks/index.html Translation Tools:

The Open Archives Solution:  Gather document descriptions from Repositories into large databases, using OAI Harvesters  Set up search engines to offer up information in these databases  Translation of records: Into a Common Format and Language: XML & Unqualified Dublin Core  Storage: of these translations  Response: to a standardized set of queries

 Storage of OAI Records mysql> create table gsm( -> id char(10) not null, -> primary key (id), -> date char(10), -> path char (80), -> listit text); $sth = $dbh->prepare("select listit from $set where date <= '$until' and date >= '$from' order by id"); MySQL: small, fast, and free: Use scripts to load database and retrieve information Store entire records, already marked up in Unqualified Dublin Core, for quick response; …or Store fields untagged, multiple values for a field separated by tags, and retag upon request: flexibility. This structure allows for a record to be entered once and retrieved in various formats upon request. For local search engines, also store hardcoded xml files in a directory.

The Open Archives Solution:  Gather document descriptions from Repositories into large databases, using OAI Harvesters  Set up search engines to offer up information in these databases  Translation of records: Into a Common Format and Language: XML & Unqualified Dublin Core  Storage: of these translations  Response: to a standardized set of queries

 Response: Offer up document descriptions via a standardized set of queries & responses: the Open Archives Initiative Protocol 1)6 Verbs, with 5 required and/or optional arguments 2) Unique Identifiers, Optional Sets, and Metadata Prefixes 3) Flow control & Resumption Tokens 4) Error Codes

 Verbs and arguments: The Open Archives Protocol 1)Identify 2)ListSets 3)ListMetadataFormats: optional: identifier 4)ListIdentifiers: required: metadata prefix (oai_dc); optional: from, until, set, resumption token 5)ListRecords: required: metadata prefix (oai_dc); optional: from, until, set, resumption token 6)GetRecord: required: identifier and metadata prefix

 Identifiers, Sets, and Metadata Prefixes oai:tkn:har/har0001 oai:tkn:che/che0003 oai:tkn:civ/civ0001 oai:tkn:etd/etd0002 oai:tkn:emn/emn0001 oai:tkn:ead/ead0003 oai:tkn:gsm/gsm0045 oai:tkn:ldr/ldr0002 oai:tkn:rth/rth0034 oai:tkn:tdh/tdh0005 oai:tkn:vid/vid0001 har che civ etd emn ead gsm ldr rth tdh vid Bessie Harvey Collection Cherokee Civil War Collection Electronic Theses and Dissertations Emancipator Encoded Archival Description Great Smoky Mountains Library Development Review Roth Photography Collection Tennessee Documentary History Videos Sample Identifiers: Input as "Set": Current Sets: Supported Metadata prefix: oai_dc

 Flow Control and ResumptionTokens For ListIdentifiers, ListSets and ListRecords LRrtdc20f u LR or LI for ListRecord or ListIdentifier rt: Number or letter combination: which set next dc: Metadata format 20: Which record number to start with this time f = From date U = Until date Specifies the call to the database when this Resumption token is returned!!

badResumptionToken badVerb badArgument idDoesNotExist cannotDisseminateFormat noMetadataFormats noRecordsMatch noSetHierarchy  Error Codes: version 2.0

OAI 1.1 Test interface and Local Search Engine: Search by: word or phrase Searching by all or any field and set, Sorting by date or set Returning: Lists of identifiers or short file descriptions, each with links to full file in HTML, XML, and online document Scientific recordsMathematical findings Musical scores and sound tracks Historical Documents Theses and Dissertations Videos and Photos

The Open Archives Solution:  Gather document descriptions from Repositories into large databases, using OAI Harvesters  Set up search engines to offer up information in these databases  Translation of records: Into a Common Format and Language: XML & Unqualified Dublin Core  Storage: of these translations  Response: to a standardized set of queries

CrossWalks: _crosswalks/index.html More Information: Pre-developed repositories, harvesters, search engines, and more: Current Service Providers, who can offer searches of your records from your repository responses;