Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

A brief overview of the Open Archives Initiative Steve Hitchcock Open Citation Project (OpCit) Southampton University Prepared for Z39.50/OAI/OpenURL plenary.
Creating Institutional Repositories Stephen Pinfield.
OAI and Publishers metadata Using the static repositories approach to disclose small journals.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
Y.T. a brief history of the OAI 0 Kaynak: Herbert van de Sompel.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Fun with Geospatial Metadata, CUGIR, CORC, MARC, and OAI: The CSDGM to MARC Grant Project Adam Chandler, Olin Library Elaine Westbrooks, Mann Library Vivek.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
The Open Archives Initiative and OAIster: Past, Present and Future Kat Hagedorn University of Michigan Libraries April 6, 2006.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Digital Library Architecture and Technology
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
Global & Regional Initiatives on Information Management Eero Mikkola(IUFRO) Joris Siermann (CIFOR) Global Forest.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Rapid Visual OAI Tool S. Kothamasa, K. Maly, M. Zubair (Old Dominion University) X. Liu (Los Alamos National Laboratory) RCDL 2003, St. Petersburg.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Herbert van de sompel Workshop on OAI and peer review journals in Europe Geneva, Switserland – March 22nd to 24th 2001 Herbert Van de Sompel Cornell University.
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
SCIELO AS AN OPEN ARCHIVE: the development of SciELO / OpenArchives data provider interface Prof. Carlos H. Marcondes Federal Fluminense University/ Information.
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
The OAI: overview and historical context OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University --
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
The OAI Protocol for Metadata Harvesting Van de Sompel, Herbert Los Alamos National Laboratory – Research Library.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
The OAI: overview and historical context OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University --
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
Digital Collections: Making it Happen Hema Ramachandran Ed Sponsler Jim O’Donnell, Caltech Library System SCELC, September , Caltech.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
May 26-28ICNEE 2003 ARCHON: BUILDING LEARNING ENVIRONMENTS THROUGH EXTENDED DIGITAL LIBRARY SERVICES Hesham Anan, Kurt Maly, Mohammad Zubair,et al. Digital.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Open Archives Initiative Gail McMillan Digital Library and Archives, Virginia Tech Society for Scholarly Publishing: June 1, 2000.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Arc – Federated Searching Service Kurt Maly, Xiaoming Liu, M.Zubair, Michael L.Nelson Old Dominion University January 23, 2001.
Open Archives Initiative CNI Phoenix December 13, 1999 Dale Flecker, Harvard Carl Lagoze, Cornell John Ober, CDL Don Waters, Mellon.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
OAI and Metadata Harvesting
Open Archives Initiative
Open Archive Initiative
Institutional Repositories
Presentation transcript:

Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software for Education and Science 5 th National Conference Computer Application Federation of China Instrument & Control Society Yinchuan, Ningxia Province,PRC September 22-24, 2003

Sept 24, 20035th National CACIS Conference2 Outline Digital Libraries The Open Archives Initiative Free Software Systems Arc DP9 Kepler RVOT Conclusions Important URLs

Sept 24, 20035th National CACIS Conference3 Digital Libraries DL = library whose content is stored digitally and can be accessed over the Internet Key difference between DLs and the general Web is that the content is structured and has metadata associated with it allowing for more precise results to queries

Sept 24, 20035th National CACIS Conference4 Digital Libraries Development of software to support DLs has proceeded along proprietary software lines It is extremely difficult for the average user to find information that is in different DLs Need for interoperability between DLs

Sept 24, 20035th National CACIS Conference5 Digital Libraries DL interoperability can be achieved at three levels technical:protocol, format, etc. should be consistent so that messages can be exchanged content: agreements cover the data and metadata, agreements on the interpretation of messages organizational: includes rules for access, for changing collections and services, payment, and authentication Need to federate, filter and provide value- added services on remote content

Sept 24, 20035th National CACIS Conference6 Open Archives Initiative address technical interoperability among distributed archives facilitate the discovery of content in distributed archives The OAI framework defines two functional roles: data providers (archives) and service providers

Sept 24, 20035th National CACIS Conference7 Open Archives Initiative Data providers: expose the metadata of their objects for harvesting Service providers: extract metadata from data providers via the OAI metadata harvesting protocol Service provider develop value-added services that are based on the metadata collected from data providers such as: cross-archive search engines, linking systems, and peer-review systems

Sept 24, 20035th National CACIS Conference8 herbert van de sompel The Open Archives Iinitiative has been set up to create a forum to discuss and solve matters of interoperability between preprint solutions, as a way to promote their global acceptance. Paul Ginsparg, Rick Luce & Herbert Van de Sompel OAI origin herbert van de sompel

Sept 24, 20035th National CACIS Conference9 Core concepts of Santa Fe convention herbert van de sompel low-barrier interoperability data-provider & service-provider model metadata harvesting model shared metadata format and parallel, community- specific metadata formats acceptable use Dienst subset OAMS XML reply HTTP based Gentelmen’s agreement

Sept 24, 20035th National CACIS Conference10 core concepts in OAI 1.0 herbert van de sompel low-barrier interoperability data-provider & service-provider model metadata harvesting model shared metadata format and parallel, community- specific metadata formats acceptable use flexibility OAI 1.0 protocol Dublin Core HTTP based Community specific Reply XML Schema Self contained

Sept 24, 20035th National CACIS Conference11 The Open Archives Initiative develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content. new OAI mission statement herbert van de sompel

Sept 24, 20035th National CACIS Conference12 The Open Archives Initiative has its roots in an effort to enhance access to e-print archives as a means of increasing the availability of scholarly communication. Continued support of this work remains a cornerstone of the Open Archives program. new OAI mission statement herbert van de sompel

Sept 24, 20035th National CACIS Conference13 The fundamental technological framework and standards that are developing to support this work are, however, independent of the both the type of content offered and the economic mechanisms surrounding that content, and promise to have much broader relevance in opening up access to a range of digital materials. [...] new OAI mission statement herbert van de sompel

Sept 24, 20035th National CACIS Conference14 Free software - Arc Arc harvests metadata currently from about 150 OAI compliant archives normalizes them, and stores them in a search service based on a relational database (MySQL or Oracle) over 6 Million metadata records from various subject domains Arc also provides OAI layer, thus making hierarchical harvesting possible

Sept 24, 20035th National CACIS Conference15

Sept 24, 20035th National CACIS Conference16

Sept 24, 20035th National CACIS Conference17 Free Software – DP9 “deep web" or "invisible web" a vast repository of content, such as documents in online databases, that general-purpose web crawlers cannot reach 500 times that of the surface web Internet search engines can not index OAI collections, as they are not aware of the OAI protocol

Sept 24, 20035th National CACIS Conference18 Free Software – DP9 A Web crawler indexes a Web site by starting with a base HTML page and following the links on this page to go deeper to retrieve other pages on the Web site DP9 computes and presents an HTML page presented to a Web crawler as a result of an OAI request, and the links on the Web page leads to other OAI requests

Sept 24, 20035th National CACIS Conference19 Free Software – DP9 DP9 provides an entry page and if a web crawler finds this entry page, it may follow the links on this page and send requests to DP9. DP9 will then forward the request to corresponding OAI Data Providers and process the returned XML records Depending on the depth a crawler follows, it can index all records in an OAI Data Provider

Sept 24, 20035th National CACIS Conference20 Free Software – DP9

Sept 24, 20035th National CACIS Conference21

Sept 24, 20035th National CACIS Conference22 Free Software - Kepler The objective of the Kepler framework is to satisfy the need for the average researchers at an average university to publish results and disseminate them to a wide audience quickly and conveniently The Kepler framework is based on OAI to support what is called "personal data providers" or "archivelets"

Sept 24, 20035th National CACIS Conference23 Free Software - Kepler Kepler framework - a digital library of many ‘little’ publishers. an easy-to-use archivelet that is downloadable and self-installing an automated registration service to support tens of thousands of publishers a simple service provider to harvest metadata from archivelets.

Sept 24, 20035th National CACIS Conference24

Sept 24, 20035th National CACIS Conference25

Sept 24, 20035th National CACIS Conference26

Sept 24, 20035th National CACIS Conference27 Free Software - RVOT Rapid Visual OAI Tool (RVOT) is a tool that can help small organizations in making their collections OAI-PMH compliant construct an OAI-PMH repository from a collection of files metadata translation tool records in the original collection can be in any of the supported formats including RFC1807, MARC subset, and COSATI formats lightweight HTTP server including an OAI-PMH request handler

Sept 24, 20035th National CACIS Conference28 Free Software - RVOT

Sept 24, 20035th National CACIS Conference29 Free Software – RVOT

Sept 24, 20035th National CACIS Conference30

Sept 24, 20035th National CACIS Conference31

Sept 24, 20035th National CACIS Conference32

Sept 24, 20035th National CACIS Conference33 Conclusions OAI makes the many digital libraries available today interoperate in such a way that users can discover information across a wide variety of domains without having to be aware of the many different user interfaces of the individual libraries OAI was founded by researchers who were interested not only in free distribution of information but also in free distribution of software

Sept 24, 20035th National CACIS Conference34 Conclusions All the software systems described in this paper are freely available either in OpenSource or directly from the research group that created it one caveat: free software does not necessarily mean no cost running of services. One still has to account for the need for technical support and hardware to set up services

Sept 24, 20035th National CACIS Conference35 Important URLs - ODU digital library research group