Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.

Presentation transcript:

Overview Texas Digital Library Use Case for OAI-ORE Mapping ORE model to DSpace architecture Implementation Results and Implications

Texas Digital Library State-wide initiative Eighteen members Public/Private Small/Medium/Large

Electronic Theses and Dissertations Federated Collection Built on top of DSpace/Manakin

Current Federation Method Performed via scripted ingest process New batch every semester Manual corrections to existing content

Replacement Requirements Perform maintenance automatically Detect changes in existing content Support interchange of metadata and content

Harvesting Solution Use the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Member institutions as data providers TDL Federated Repository as a service provider

OAI-PMH, advantages Ubiquitous Supports selective harvesting Tracks changes Can be automated
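
Selective harvesting and change tracking both come down to the arguments of an OAI-PMH request. The sketch below builds a `ListRecords` request URL using the protocol's standard `verb`, `metadataPrefix`, `set` and `from` arguments. The base URL, the set name `etd` and the helper name `list_records_url` are assumptions made for illustration; they are not taken from the TDL implementation.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix, set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        # Selective harvesting: restrict the response to one OAI set.
        params["set"] = set_spec
    if from_date:
        # Change tracking: only records added or modified since this date.
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


# Hypothetical provider URL and set name, for illustration only.
url = list_records_url(
    "http://repository.example.edu/oai/request",
    "oai_dc",
    set_spec="etd",
    from_date="2009-01-01",
)
print(url)
```

Because the request is a plain URL, a scheduled job can issue it without human intervention, which is what makes the protocol straightforward to automate.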

OAI-PMH, obstacles No existing harvesting solution for DSpace Supports harvesting of metadata only, not content

Disseminating content How do you disseminate content through a metadata harvesting protocol? – Wrap it in a packaging format – Include the metadata – Encode the references to the files – Harvest the package

METS, advantages Metadata Encoding and Transmission Standard Maintained by the Library of Congress Mature standard Widely adopted

Packaging, disadvantages Complete packaging format Open to interpretation Ambiguities at the OAI-PMH layer

OAI-ORE Open Archives Initiative Object Reuse and Exchange (OAI-ORE) defines standards for the description and exchange of aggregations of Web resources. Specialized Simple

Mapping DSpace to OAI-ORE ORE Abstract Data Model DSpace architecture The Mapping

ORE Data Model Aggregations Aggregated Resources Resource Maps

Aggregation (A) Describes a set of resources Conceptual construct

Aggregated Resource (AR) Object of interest Part of an aggregation Can itself be an aggregation

Resource Map (ReM) Describes an aggregation Enumerates its aggregated resources Can be serialized in RDF or Atom XML
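
The three ORE entities can be sketched as a small set of Python data classes. This is an illustrative model only: the class and field names are assumptions, and the example URIs are made up. Making `Aggregation` a subclass of `AggregatedResource` is one simple way to express that an aggregated resource can itself be an aggregation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AggregatedResource:
    """An object of interest that is part of an aggregation."""
    uri: str


@dataclass
class Aggregation(AggregatedResource):
    """A conceptual construct describing a set of resources.

    Subclassing AggregatedResource lets an aggregation be nested
    inside another aggregation.
    """
    resources: List[AggregatedResource] = field(default_factory=list)


@dataclass
class ResourceMap:
    """Describes one aggregation and enumerates its aggregated resources."""
    uri: str
    describes: Aggregation

    def enumerate_resources(self):
        return [ar.uri for ar in self.describes.resources]


# Hypothetical URIs, for illustration only.
thesis = Aggregation(
    uri="http://repository.example.edu/item/1#aggregation",
    resources=[
        AggregatedResource("http://repository.example.edu/item/1/thesis.pdf"),
        AggregatedResource("http://repository.example.edu/item/1/data.zip"),
    ],
)
rem = ResourceMap(uri="http://repository.example.edu/item/1/ore.xml", describes=thesis)
print(rem.enumerate_resources())
```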

DSpace Model v1.x Communities Collections Items Bundles Bitstreams

ORE DSpace

Mapping

Bundles?

Bundles, Potential Options Bundles as Aggregations of Bitstreams Bundles as filters for Aggregated Resources Bundles as DSpace-specific metadata

Bundles, Observations By default, specialized for internal tasks Extendible for any use Obscured from the end user

DSpace Bundles

Serialization in Atom

Implementation ORE Dissemination ORE Harvesting Automation

Interfacing with DSpace Web UI LNI and SWORD Ingest and export scripts Crosswalks – Ingestion – Dissemination

ORE Dissemination Crosswalk Requires: – A DSpace Item Produces: – Atom-serialized ORE ReM
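
A rough sketch of what such a crosswalk produces, using only the Python standard library. It is not the DSpace crosswalk code: the function name, its arguments and the URIs are assumptions, and the entry is simplified (a conforming Atom entry needs further elements such as `updated` and `author`). The link relations are the ORE vocabulary terms `describes` and `aggregates`.

```python
import xml.etree.ElementTree as ET

ATOM = "http://www.w3.org/2005/Atom"
ORE_DESCRIBES = "http://www.openarchives.org/ore/terms/describes"
ORE_AGGREGATES = "http://www.openarchives.org/ore/terms/aggregates"

ET.register_namespace("atom", ATOM)


def disseminate(item_uri, title, bitstream_uris):
    """Serialize a simplified Atom ReM for one item (hypothetical helper)."""
    entry = ET.Element(f"{{{ATOM}}}entry")
    ET.SubElement(entry, f"{{{ATOM}}}id").text = item_uri
    ET.SubElement(entry, f"{{{ATOM}}}title").text = title
    # The item plays the role of the aggregation described by this ReM.
    ET.SubElement(entry, f"{{{ATOM}}}link", rel=ORE_DESCRIBES, href=item_uri)
    # Each bitstream becomes one aggregated resource.
    for uri in bitstream_uris:
        ET.SubElement(entry, f"{{{ATOM}}}link", rel=ORE_AGGREGATES, href=uri)
    return ET.tostring(entry, encoding="unicode")


# Hypothetical item and bitstream URIs, for illustration only.
rem_xml = disseminate(
    "http://repository.example.edu/handle/123/456",
    "A Sample Thesis",
    [
        "http://repository.example.edu/bitstream/123/456/thesis.pdf",
        "http://repository.example.edu/bitstream/123/456/license.txt",
    ],
)
print(rem_xml)
```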

ORE Dissemination via OAI-PMH Dissemination crosswalk produces ORE ReMs from DSpace Items OAI-PMH data provider disseminates them

ORE Harvesting Item-level ORE ReM interpreter Collection-level OAI-PMH harvester Repository-level harvest scheduler

ORE Ingestion Crosswalk Requires: – A DSpace Item – Atom-serialized ORE ReM Produces: – A DSpace Item with Bitstreams created from ARs
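
The first step of ingestion is to read the aggregated resources out of the ReM. The sketch below does only that step, assuming a simplified Atom entry in which each aggregated resource appears as a link with the ORE `aggregates` relation. The sample document, the URIs and the function name are illustrative assumptions; creating DSpace Bitstreams from the returned URIs is left out.

```python
import xml.etree.ElementTree as ET

ATOM_LINK = "{http://www.w3.org/2005/Atom}link"
ORE_AGGREGATES = "http://www.openarchives.org/ore/terms/aggregates"

# Hypothetical, simplified ReM used as input for the example.
SAMPLE_REM = """<entry xmlns="http://www.w3.org/2005/Atom">
  <id>http://repository.example.edu/handle/123/456</id>
  <title>A Sample Thesis</title>
  <link rel="alternate" href="http://repository.example.edu/handle/123/456"/>
  <link rel="http://www.openarchives.org/ore/terms/aggregates"
        href="http://repository.example.edu/bitstream/123/456/thesis.pdf"/>
  <link rel="http://www.openarchives.org/ore/terms/aggregates"
        href="http://repository.example.edu/bitstream/123/456/data.zip"/>
</entry>"""


def aggregated_resource_uris(rem_xml):
    """Return the URIs of all aggregated resources listed in an Atom ReM."""
    root = ET.fromstring(rem_xml)
    return [
        link.get("href")
        for link in root.iter(ATOM_LINK)
        if link.get("rel") == ORE_AGGREGATES
    ]


print(aggregated_resource_uris(SAMPLE_REM))
```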

OAI-PMH Harvester Queries remote OAI-PMH providers Processes responses as individual records Implemented at Collection level

Collection Settings Source of collection's content OAI-PMH provider information Harvesting Level

Collection Source

OAI-PMH Settings OAI-PMH Provider OAI Set Id DMD Format

Harvest Level

Harvesting a Collection Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Harvest Metadata Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Metadata Replicated Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 1: Metadata Only Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Harvest ORE ReMs Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 2: Metadata + Content Refs Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 3: Metadata + Content Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)
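
The three harvest levels can be thought of as a per-collection setting that decides what the harvester does with each record. The sketch below is one reading of the case names above, not the DSpace implementation: the enum, the function and the step descriptions are assumptions for illustration.

```python
from enum import Enum


class HarvestLevel(Enum):
    METADATA_ONLY = 1         # Case 1
    METADATA_AND_REFS = 2     # Case 2
    METADATA_AND_CONTENT = 3  # Case 3


def plan_record(level, ar_uris):
    """List the steps a harvester might take for one record (hypothetical)."""
    steps = ["store descriptive metadata"]
    if level is HarvestLevel.METADATA_AND_REFS:
        # Keep pointers to the remote files; do not copy them.
        steps += [f"record reference to {uri}" for uri in ar_uris]
    elif level is HarvestLevel.METADATA_AND_CONTENT:
        # Copy each remote file into the local repository.
        steps += [f"download {uri} as a local bitstream" for uri in ar_uris]
    return steps


uris = ["http://repository.example.edu/bitstream/1/thesis.pdf"]
for level in HarvestLevel:
    print(level.name, plan_record(level, uris))
```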

Harvest Scheduling System Monitors harvested collections Starts harvests at regular intervals Alerts administrators of errors
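
A minimal sketch of one scheduler pass, assuming each harvested collection records the time of its last successful harvest. The data layout, the function name and the `harvest` callback are hypothetical. Errors are collected rather than raised so that they can be reported to administrators afterwards.

```python
from datetime import datetime, timedelta


def run_due_harvests(collections, now, interval, harvest):
    """Harvest every collection whose last harvest is older than `interval`.

    Returns a list of (collection name, error message) pairs to report.
    """
    errors = []
    for coll in collections:
        last = coll["last_harvest"]
        if last is not None and now - last < interval:
            continue  # harvested recently enough, skip this pass
        try:
            harvest(coll)
            coll["last_harvest"] = now
        except Exception as exc:
            errors.append((coll["name"], str(exc)))
    return errors


def fake_harvest(coll):
    # Stand-in for a real OAI-PMH harvest; fails for one collection.
    if coll["name"] == "broken":
        raise RuntimeError("provider unreachable")


now = datetime(2009, 5, 1, 12, 0)
collections = [
    {"name": "fresh", "last_harvest": now - timedelta(hours=1)},
    {"name": "stale", "last_harvest": now - timedelta(days=2)},
    {"name": "broken", "last_harvest": None},
]
errors = run_due_harvests(collections, now, timedelta(days=1), fake_harvest)
print(errors)
```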

Results The Primary Use Case TDL in General The Greater Web Community

Harvesting using PMH+ORE Federated ETD collection currently in pre-production at TDL Addresses primary requirements – Performs maintenance automatically – Detects changes in existing content – Supports interchange of metadata and content

Other Possibilities Specialized DSpace instances Flexible repository architecture Interoperability with other repository systems Large-scale ETD repositories: A case study of a digital library application, Adam Mikeal, James Creel, Alexey Maslov, Scott Phillips, John Leggett, Mark McFarland. JCDL 2009

Current Priorities Live deployment at TDL Release to the open source community Integration into DSpace 1.6

National Leadership Grant #LG

Questions?