A hybrid approach of digital long term preservation to institutional repositories - A case study of DSpace/SRB Integration Ya-ning Arthur Chen, Feng-chien.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
INFSO-RI Enabling Grids for E-sciencE Grid & Data Preservation Boon Low System Development, EGEE Training National.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
ISO & OAI-PMH By Neal Harmeyer, Amy Hatfield, and Brandon Beatty PURDUE UNIVERSITY RESEARCH REPOSITORY.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
MIT’s DSpace A good fit for ETDs Margret Branschofsky Keith Glavash MIT LIBRARIES.
MacKenzie Smith Associate Director for Technology MIT Libraries.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of Pretoria.
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional research repository for the University of Pretoria.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
© WRLC November 2005 Research Commons Supporting Scholarship in the 21st Century.
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
The Global Video Grid: DigitalWell Update & Plan For SRB Integration Myke Smith, Manager Streaming Media Technologies University of Washington / ResearchChannel.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
The NLW Digital Asset Management System Paul Bevan DAMS Implementation Manager
DSpace - Digital Library Software
Vicki Tobias Introduction to and Institutional Repositories.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
A Basic Introduction By Scott Phillips 2005/8/7. Agenda What is DSpace and what does it do? The DSpace Information Model Components & Features of DSpace.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Joint Meeting of CSUL Committees,
Building A Repository for Digital Objects
DAITSS: Dark Archive in the Sunshine State
Statewide Digitization and the FCLA Digital Archive
Introduction, Features & Technology
VI-SEEM Data Repository
VI-SEEM Data Repository
Introduction to DSpace
Institutional Repositories
Technical Issues in Sustainability
Robin Dale RLG OAIS Functionality Robin Dale RLG
Presentation transcript:

A hybrid approach of digital long term preservation to institutional repositories - A case study of DSpace/SRB Integration Ya-ning Arthur Chen, Feng-chien Chung Computing Centre, Academia Sinica 11 April, ISGC 2008

Outline Background of MAAT From Website to Institutional Repository Long Term Preservation & OAIS The Hybrid Approach Future

MAAT – Background The Metadata Architecture & Application Team (MAAT) was established in 2002 to engage in metadata research and service supportive for the National Digital Archives Program (NDAP) in Taiwan To date, the MAAT has been supporting over 80 digital library projects of Taiwan E-Learning & Digital Archive Program (TELDAP, former: NDAP)

MAAT – Motivation A number of documents have been created and can be categorized into –questionnaires, –work sheets, –meeting records, –metadata mapping tables, –system specifications, –best practices of metadata standards, –technical reports, –research papers, –briefings, and –tutorial materials. Most documents of the MAAT website are arranged in a static manner.

MAAT Website Academia Sinica

MAAT - Consideration 1 Document management and repository –over 1,000 documents and URL links have been arranged and served at the MAAT website. –the MAAT website needs an effective system of document management. Access control –The MAAT website still lacks access control for document access.

MAAT - Consideration 2 Workflow reengineering –the MAAT website adopts a centralized model to maintain documents and website arrangement. –This model is very complicated and labor- intensive, and the overhead cost is very high. Usage Statistics Report

MAAT - Challenge Too many publications, Too much change (that is various document versions), Too many contributors, and Too many institutions.

Implementation Level Static Website Institution Repository Phase1: from website to IR

DSpace - feature Captures –Digital research material in any format –Directly from creators (e.g. faculty)‏ –Large-scale, stable, managed long-term storage Describes –Descriptive metadata (Dublin Core) –Technical metadata (file size, format…) –Rights metadata (licenses, creative commons…) Distributes –Via WWW, with necessary access control Preserves –Persistent ID and Handle –Bitstream format registry

DSpace - Data Model

MAAT – Content 1 Content Type – 支援計畫 (Documents from the Projects we support) – 出版與活動 (Documents of Publication and Activity) – 計畫管理 (Project Management related – restricted documents) – 研究發展 (Research & Development - restricted documents) –48 Communities, 110 collections, 783 items Document Format –User upload: 794 pdf files, 446 ms word files, 59 ms powerpoint slides, 27 xml files, 17 jpeg images, 16 html files, 7 ms excel files…and the others –System generate: Over 1900 Plain Text files (mainly DSpace License files)…

MAAT – Content 2 Access Method –DSpace user browse and search interface –Search engines (google, yahoo…etc.) –OAI-PMH harvesting

MAAT DSpace

DSpace - Consideration The Need for Extending DSpace Storage Capabilities –The amount of documents grows so fast that an enormous size storage solution is required The Lack of Risk Management Mechanism –The Reliable Backup and Disaster Recovery Systems are not included in the default DSpace Installation

Implementation Level Statis Website Institution Repository Phase1: from website to IR Institution Repository + Grid Phase2: from IR to Long Term Preservation

DSpace/SRB Approach 1 In 2004, NARA (with NSF/NPACI) has funded a project aimed at integrating DSpace and SRB to –allow DSpace to use the data grid as a storage layer –permit the exchange of authentic documents between them NARA Proposal & Participants –San Diego Super Computer Center (SDSC)‏ Member of National Partnership for Advanced Computational Infrastructure (NPACI) an NSF sponsored program –MIT Libraries –UC San Diego Libraries (UCSD)‏ –Hewlett Packard Laboratories (HP)‏ –National Archives and Records Administration (NARA)‏

DSpace/SRB Approach 2 In DSpace, there can be multiple bitstream stores, each of these bitstream stores can be traditional storage or SRB storage. Both traditional and SRB storage are specified by configuration parameters. Both traditional and SRB bitstream stores are configured in dspace.cfg

Examination of DSpace/SRB An Open Archive Information System (OAIS) intends to preserve information for access and use by a Designated Community

OAIS Functional Model

Workflow

OAIS Functional Model…Again DSpace & SRB Administration DSpace RDBMS & SRB MCAT DSpace Submit Interface DSpace User Interface SRB Mass Storage DSpace Ingest DSpace Batch Import

Producer, Management and Consumer Producer –DSpace may play the role of ingest SIP from producer, and generate AIP for Management & Storage Management –SRB May play the role of receive AIP then Store & Manage data, and generate AIP for Access Consumer –DSpace May Play the role of process the access request and generate the proper DIP for dissemination DSpace RDBMS & SRB MCAT DSpace Submit Interface DSpace User Interface SRB Mass Storage DSpace Ingest DSpace Batch Import SIP AIP DIP

Archives arrangement Logical Archives structure: –DSpace allow multi-level communities and one level collection –Archive’s principle Principle of provenance Principle of respect des fonds Physical Files Arrangement: –SRB Mass Storage Technology

Future 1 Best Practice & SOP for DSpace/SRB integration Deeper Check Against Activities of OAIS Preservation Planning and policy –Monitor Producer/Management/Consumer’s service requirements and emerging technology, develop archival strategy & migration plan

Future 2 Feasibility Evaluation –Migrate from SRB to others advanced technology, such as SRM, iRODS… –Adopt metadata approach to enhance digital preservation, such as PREMIS and METS (ex: structural map, behavior section…)

Thank You