Service Providers: Future Perspectives Michael L. Nelson Old Dominion University Norfolk Virginia, USA 2003.

Slides:



Advertisements
Similar presentations
Search, access and impact: Web citation services Tim Brody Intelligence, Agents, Multimedia Group University of Southampton.
Advertisements

28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
Tim Brody University of Southampton CiteBase Services 13/07/2001.
IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
Open Archives Initiative Where we are, Where we are going Carl Lagoze 4 th OAF Workshop September, 2003.
Heinrich Stamerjohanns Institute for Science Networking Distributed Open Archives Dr. Heinrich Stamerjohanns Institute for Science Networking at the University.
White Paper on Establishing an Infrastructure for Open Language Archiving Steven Bird and Gary Simons.
Using OAI-PMH for Resource Exchange OAI Metadata Harvesting Workshop, JCDL 03 Michael L. Nelson, Terry L. Harrison Old Dominion University Norfolk VA
Service Providers: Future Perspectives Michael L. Nelson Old Dominion University Norfolk Virginia, USA 2nd Workshop.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
1 Repository Synchronization in the OAI Framework Xiaoming Liu DL Research and Prototyping Los Alamos National Laboratory.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
Fun with Geospatial Metadata, CUGIR, CORC, MARC, and OAI: The CSDGM to MARC Grant Project Adam Chandler, Olin Library Elaine Westbrooks, Mann Library Vivek.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
The Open Archives Initiative Simeon Warner Cornell University, Ithaca, NY, USA CREPUQ 2002, Montréal, Canada 14:00, 24 October 2002.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Digital Library Architecture and Technology
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
Implementation of Digital Libraries Michael L. Nelson Old Dominion University Congreso Internacional de Información.
Introduction to Digital Libraries hussein suleman uct cs honours 2004.
How to participate in the Union Catalogue Project Hussein Suleman Sivulile – Open Access South Africa Advanced Information Management.
Serenate1 Non-standard users: The Library Raf Dekeyser K.U.Leuven.
A Review of Institutional Repository Projects and Technologies Michael L. Nelson Old Dominion University Texas.
‘The Universal Catalogue’ a cultural sector viewpoint David Dawson Senior Policy Adviser (Digital Futures) Museums, Libraries and archives Council.
Dec 9-11, 2003ICADL Challenges in Building Federation Services over Harvested Metadata Hesham Anan, Jianfeng Tang, Kurt Maly, Michael Nelson, Mohammad.
OAI-PMH Tools Open Source or other linsences den Haag
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
University of Illinois at Urbana-Champaign A Unified Platform for Archival Description and Access Christopher J. Prom, Christopher A. Rishel, Scott W.
The Open Archives Initiative Movement Kurt Maly Old Dominion University Norfolk Virginia, USA Brazilian DL.
BMC Open Access Colloquium, 8 February Morgan: "Open Access Repositories"
OAI-PMH for Resource Harvesting Tutorial OAI4, October 20 th 2005, CERN, Geneva, Switzerland OAI-PMH for Resource Harvesting Herbert Van de Sompel Digital.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
Institutional Archives Technology Overview Michael L. Nelson Old Dominion University Institutional Archives.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
OAI Overview Michael L. Nelson Old Dominion University Norfolk Virginia, USA Bioinformatics Seminar ODU CS 791/891.
Van de Sompel, Herbert Los Alamos National Laboratory – Research Library OAI-PMH for Resource Harvesting.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
DSpace vs Fedora Ralph LeVan OCLC Research. What Do You Want From a Repository? How do you create your metadata? How do you assemble your objects? How.
Bitter Harvest Metadata Harvesting Issues, Problems, and Possible Solutions Roy Tennant California Digital Library.
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
Metadata and OAI DLESE OAI Workshop June 29 to July 2, 2002 Katy Ginger Presentation available at:
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
May 26-28ICNEE 2003 ARCHON: BUILDING LEARNING ENVIRONMENTS THROUGH EXTENDED DIGITAL LIBRARY SERVICES Hesham Anan, Kurt Maly, Mohammad Zubair,et al. Digital.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
DSpace - Digital Library Software
Serenate1 The librarian’s view Raf Dekeyser K.U.Leuven.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
U.S. Government Use of the OAI-PMH Michael L. Nelson Old Dominion University Norfolk Virginia, USA ISTEC / NSF.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Mod_oai: Metadata Harvesting for Everyone Michael L. Nelson, Herbert Van de Sompel, Xiaoming Liu, Aravind Elango
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
Web Services Overview Thomas Hickey. 2 What are Web Services? Machine-to-machine communication Run over standard Web protocols –XML syntax, HTTP packaging.
OAI and Metadata Harvesting
Digitometric Services for Open Archives Environments
The New Face of Information Retrieval: The Ankara University Open Access Platform Prof. Dr. Sekine Karakaş Prof. Dr. Doğan.
Institutional Repositories
Presentation transcript:

Service Providers: Future Perspectives Michael L. Nelson Old Dominion University Norfolk Virginia, USA PSP Annual Conference Washington DC February 5, 2003

Outline The flavor of OAI-PMH talks Some traditional public service providers Why the OAI-PMH is not important Defining the OAI-PMH data model Abusing the OAI-PMH data model Current and nearly-current interesting services

OAI-PMH Meeting History OAI Open Day, Washington DC 1/2001 2nd OAI Workshop CERN 10/2002 Protocol definition, development tools DPs, retrofitting existing DLs SPs, new services Socio-Economic- Political Issues

Shift of Topics From the protocol itself, supporting & debugging tools and how to retrofit (existing) DLs… …to building (new) services that use the OAI-PMH as a core technology and reporting on their impact to the institution/community

NTRS metadata harvesting replacement for nasa.gov/cgi-bin/NTRS –previous NTRS was based on distributed searching –hierarchical harvesting (nigh) publicly available

Arc harvests all known archives first end-user service provider source available through SourceForge hierarchical harvesting

NCSTRL metadata harvesting replacement for Dienst-based NCSTRL based on Arc computer science metadata

Archon physics metadata based on Arc features: –citation indexing –equation-based searching

Torii physics metadata features –personalization –recommendations –WAP access

iCite physics metadata features –citation based access to arXiv metadata

my.OAI covers all registered metadata features –result sets –personalization –many other advanced features

Cyclades scientific metadata features –personalization –recommendations –collaboration status?

citebase arXiv metadata citation based indexing, reporting

OAIster harvests all known archives

Public Knowledge Project domain-specific filtering of harvested metadata (?)

Perseus they claim to harvest all DPs, but only humanities related DPs appear in the pull down menu

Others… Commercial publishers –American Physical Society (APS) –Institute of Physics –Elsevier / Scirus ( Department of Energy –OSTI –LANL Institutional servers –DSpace (MIT; –Eprints ( –DARE (All Dutch universities)

Service Providers It is clear that SPs are proliferating, despite (because of?) the inherent bias toward DPs in the protocol –easy to be a DP -> many DPs -> SPs eventually emerge –hard to be a DP -> SPs starve –currently 5x DPs more than SPs SPs are beginning to offer increasingly sophisticated services –competitive market originally envisioned for SPs is emerging

Why The OAI-PMH is NOT Important Users don’t care OAI-PMH is middleware –if done right, the uninterested user should never have to know OAI Inside Using OAI-PMH does not insure a good SP OAI-PMH is (or is becoming) HTTP for DLs –few people get excited about http now http & OAI-PMH are core technologies whose presence is now assumed

Other Uses For the OAI-PMH Assumptions: –Traditional DLs / SPs will continue on their present path of increasing sophistication citation indexing, search results viz, personalization, recommendations, subject-based filtering, etc. –growth rates remain the same (5x DPs as SPs) Premise: OAI-PMH is applicable to any scenario that needs to update / synchronize distributed state –Future opportunities are possible by creatively interpreting the OAI-PMH data model

resource all available metadata about David item Dublin Core metadata MARC metadata SPECTRUM metadata records item = identifier record = identifier + metadata format + datestamp set-membership is item-level property OAI-PMH Data Model

Typical Values repository –collection of publications resource –scholarly publication item –all metadata (DC + MARC) record –a single metadata format datestamp –last update / addition of a record metadata format –bibliographic metadata format set –originating institution or subject categories

Interesting Services DP9 –gateway to expose repository contents in HTML suitable for web crawlers Celestial –OAI “cache”, also 1.1 -> 2.0 converter Static (mini-) repositories –XML files, based on OLAC work OpenURL metadata format registries –record = metadata format

DP9 Architecture see Liu et al., JCDL 2002; Slide from Liu

Celestial Developed by Southampton – –designed to complement DP9 –see Liu, Brody, et al., D-Lib Magazine 8(11) Where DP9 is a non-caching proxy, Celestial caches the metadata records –can off-load work from individual archives, higher availability –can harvest 1.1, 2.0; exports in 2.0

“Static” Repositories Premise: a repository does not wish to have an executing program on its site, so it has a “static” XML file with some of the OAI- PMH responses in place accessed through a proxy could be a low functionality node, or the XML file could be produced by a process and moved outside a firewall Based on OLAC work by Bird & Simons –

Registry of metadata formats for OpenURL – –Van de Sompel & Bergmark, DCADL02, OpenURL Metadata Registry

Conclusions DPs continue to proliferate –and spawn SPs! SPs are / are becoming a competitive market –e.g., at least 10 different interfaces to arXiv metadata –growing sophistication of services –differentiation of SPs will be on features that have little to nothing to do with OAI-PMH

Conclusions Protocol / transport gateways –Dienst OAI DOG, –Z39.50 ZMARCO (UIUC) –SOAP VT (Suleman) & ODU (Zubair) –WebDAV/DASL resurrect DASL?

OAI-PMH Will Have Arrived When: general web robots issue OAI-PMH verbs –…DP9 will no longer be needed –requires shift in “control”: harvester or repository? mod_oai is developed and is included in the default Apache configuration OAI-PMH fades into the background –similar to TCP/IP, http, XML, etc.

Backup Slides

Repositories… Stretching the idea of a repository a bit: –contextually sensitive repositories “personalization for harvesters” communication between strangers, or communication between friends? –OAI-PMH for individual complex objects? OAI-PMH without MySQL?! –Fedora, Multi-valent documents, buckets –tar, jar, zip, etc. files

Resource What if resource were: –computer system status uptime, who, w, df, ps, etc. –or generalized “system” status e.g., sports league standings –people personnel databases authority files for authors

Item What if item were: –software union of versions + formats –all forms of metadata administrative + structural citations, annotations, reviews, etc. –data e.g., newsfeeds and other XML expressible content –metadataPrefixes or sets could be defined to be different versions

Record What if record were: –specific software instantiations / updates –access / retrieval logs for DLs (or computer systems) –push / pull model inversion put a harvester on the client behind a firewall, the client contacts a DP and receives “instructions” on how to submit the desired document (e.g., send to a specified address)

Datestamp semantics of datestamp are strongly influenced by the choice of resource / item / record / metadataPrefix, but it could be used to: –signify change of set membership (e.g., workflow: item moves from “submitted” to “approved”) –change datestamp to reflect access to the DP e.g., in conjunction with metadataPrefixes of “accessed” or “mirrored”

metadataPrefix what if metadataPrefix were: –instructions for extracting / archiving / scraping the resource verb=ListRecords&metadataPrefix=extract_TIFFs –code fragments to run locally (harvested from a trusted source!) –XSLT for other metadataPrefixes branding container is at the repository-level, this could be record- or item-level

Set sets are already used for tunneling OAI- PMH extensions (see Suleman & Fox, D-Lib 7(12)) other uses: –in aggregators, automatically create 1 set per baseURL –have “hidden” sets (or metadataPrefix) that have administrative or community-specific values (or triggers) set=accessed>1000&from= set=harvestMeWithTheseARGS&until= &metadataPrefix=oai_marc