Report on Preservation of ETDs: The LOCKSS Prototype The work of Kamini Santhanagopalan Virginia Tech Graduate Student in Computer Science Reported at.

Slides:



Advertisements
Similar presentations
ETD Preservation Survey Results Gail McMillan Digital Library and Archives, Virginia Tech 11th International ETD Symposium Robert Gordon University.
Advertisements

Ensuring Long-term Access to ETDs through Distributed Digital Preservation Gail McMillan Director, Digital Library and Archives Virginia Tech Newcomers.
ETD Preservation Workshop Session Four: Collection Management for Preservation Gail McMillan, Virginia Tech.
ETD Preservation Workshop Session One: ETDs and Preservation Needs Gail McMillan, Virginia Tech.
LABT-ETD 2004 ETDs Submission Software Kaunas, Arūnas Franckevičius
Overview of LOCKSS. Session Learning Objectives  Provide an overview of the LOCKSS architecture.  Describe the LOCKSS polling process  Describe how.
Lawrence Webley, Hussein Suleman, Tatenda Chipeperekwa University of Cape Town Department of Computer.
ETD-db: Original ETD-db 2.0: Enhanced Gail McMillan Director, Digital Library and Archives, Virginia Tech and Edward A. Fox, Executive Director, NDLTD.
1 ETD-db Providing Access and Managing Your ETD Workflow Gail McMillan Digital Library and Archives Virginia Polytechnic Institute.
AN OPEN-SOURCE SYSTEM FOR AUTOMATIC POLICY-BASED COLLABORATIVE ARCHIVAL REPLICATION Using the SafeArchive System The SafeArchive System coordinates six.
A Community Approach to Preservation: “Experiences with Social Science Data” Community Approaches to Digital Preservation 2009 Jonathan Crabtree February.
DCAPE Project Update Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management.
ETDs: An American Sampler Gail McMillan Director, Digital Library and Archives Virginia Polytechnic Institute &State University JISC/CNI: YorkJuly 6, 2006.
MetaArchive of Southern Digital Cultural Partners in the dispersed redundant dark archive University Libraries at Emory Auburn Florida State Georgia Tech.
ETD-db: Today ETD-db 2.0: Tomorrow Gail McMillan Director, Digital Library and Archives, Virginia Tech Recorded by Edward A. Fox, Virginia Tech Newcomers’
Collaborative Digital Preservation with LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
A Practical, Working and Replicable Approach to ETD Preservation Catherine M. Jannik, Georgia Institute of Technology Robert H. McDonald, Florida State.
South Carolina Information Technology Directors Association September 8, 2008 Bill Henry, Matt Guzzi SC Department of Archives and History.
Collaborative Preservation of ETDs: The MetaArchive Cooperative and LOCKSS Gail McMillan Digital Library and Archives, Virginia Tech 1 st Canadian ETD.
Preservation Collaboration: NDLTD & MetaArchive Cooperative Gail McMillan Digital Library and Archives, Virginia Tech Newcomers’ ETDs 2010 University.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.
MetaArchive Distributed Digital Preservation Workshop Session 3: Costs and Operational Considerations Wednesday, May 30, 2007 Robert W. Woodruff Library.
MetaArchive of Southern Digital Cultural Partners in a dispersed redundant dark archive University Libraries at Emory Auburn Florida State Georgia Tech.
Chapter-4 Windows 2000 Professional Win2K Professional provides a very usable interface and was designed for use in the desktop PC. Microsoft server system.
Keeping your Archive Safe (and on TRAC) with SafeArchive and LOCKSS Thu-Mai Christian [Slides] Micah Altman Jonathan Crabtree [Project Directors]
Wrangling DigiTool Data For LOCKSS Brian Meuse - Digital Collections Systems Analyst University Libraries Boston College MetaArchive Cooperative Annual.
How to participate in the Union Catalogue Project Hussein Suleman Sivulile – Open Access South Africa Advanced Information Management.
Hussein Suleman University of Cape Town Department of Computer Science Advanced Information Management Laboratory High Performance.
The ASERL LOCKSS-ETD INITIATIVE: Developing Preservation Strategies for Libraries that Publish E-Scholarship Robert H. McDonald – Florida State University.
Persistent Digital Archives and Library System (PeDALS) SC Department of Archives and History.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Growing the MetaArchive Cooperative: ETDs (electronic theses and dissertations) Gail McMillan Digital Library and Archives, Virginia Tech July 2008 NDIIPP.
Technology Choices for the JSTOR Online Archive Presented by Chang Feng Department of Computer Engineering and Computer Science, University of Missouri-Columbia,
Katherine Skinner Educopia Institute and MetaArchive Cooperative Matt Schultz Educopia Institute and MetaArchive Cooperative NDIIPP Partners Meeting Arlington,
Preserving ETDs: NDLTD & MetaArchive Collaboration Gail McMillan Digital Library and Archives, Virginia Tech Newcomers’ USETDA 2012.
ETD Software: Toward the Future with Retrospective Hindsight Gail McMillan Digital Library and Archives, Virginia Tech ETD 2008: 10th International Symposium.
Click to edit Master subtitle style 12/16/09 MetaArchive Architecture Monika Mevenkamp MetaArchive Annual Membership Meeting Houston, Texas Friday October.
Preserving eScholarship and Digitized Special Collections Distributed Digital Preservation Bill Donovan
1 Data Curation Workshop Some Reflections on Students’ Roles ETD 2011: 14 th Int. Symp. on ETDs Cape Town, South Africa Edward A. Fox Executive Director,
Katherine Skinner, Executive Director, Educopia Institute ESOPI 2013 Chapel Hill, NC April 19, 2013.
Growing the MetaArchive Cooperative ETDs Gail McMillan Digital Library and Archives, Virginia Tech July 2008 NDIIPP Partners Meeting.
The Story of at the Alaska State Library Presented by Sheri Somerville Alaska State Library March 14, 2009.
Persistent Digital Archives and Library System (PeDALS)
ETD-db: Workflow, the Short Story Edward A. Fox and Gail McMillan Virginia Tech Newcomers’ ETD 2009 University.
Collaborative Preservation of ETDs: The MetaArchive Cooperative and LOCKSS Gail McMillan Digital Library and Archives, Virginia Tech Canadian.
Digital Library of the Caribbean. Shared History & Collections Need for multiple copies for access and preservation led to cooperative agreements for.
Chronopolis – MetaArchive Improving and Strengthening Inter-Institutional Preservation.
Catherine Fournier ICOLC October LOCKSS: FEEDBACK FROM INIST’s EXPERIENCE Foreword Preservation-Why? LOCKSS overview LOCKSS at INIST Conclusion.
Open Access Conference, Pretoria, July 2004 Wouter Klapwijk, Univ. of Stellenbosch The LOCKSS Project: an overview Open Access Conference, Pretoria, July.
Distributed Digital Preservation Networks Across a Region, Across a State: Stretching LOCKSS Gail McMillan, Virginia Tech Martin Halbert, Emory Aaron Trehub,
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Digital preservation of CBUC theses with MetaArchive 11th SELL Meeting Porto, June 4th 2011.
3/17/2005 CS 791/891 Digital Preservation 1 LOCKSS: A Permanent Web Publishing and Access System V. Reich & D. S. H. Rosenthal Presented By Roopa D. Vegesna.
Libraries in the digital age Collection & preservation for generational access part two The LOCKSS Program.
CMU Libraries’ Digital Assets Preservation Strategy Presenter Gabrielle V. Michalek Principal Archivist and Head, Archives/Digital Library Initiatives.
KEEPS – a system for UELMA preservation and security
KEEPS – a system for UELMA preservation and security

Implementing Metaarchive At Robert E. Kennedy Library
An Overview of Data-PASS Shared Catalog
The Hosted Model Charl Roberts Good morning again,
Cloud based Open Source Backup/Restore Tool
File Manager for Microsoft Office 365, SharePoint, and OneDrive: Extensible Via Custom Connectors in Enterprise Deployments, Ideal for End Users OFFICE.
SCALABLE OPEN ACCESS Hussein Suleman
Gail McMillan Digital Library and Archives, Virginia Tech
Gail McMillan Digital Library and Archives, University Libraries
The MetaArchive Model: Distributed Digital Preservation Networks
How to Implement an Institutional Repository: Part II
Presentation transcript:

Report on Preservation of ETDs: The LOCKSS Prototype The work of Kamini Santhanagopalan Virginia Tech Graduate Student in Computer Science Reported at the 9 th International Symposium on ETDs, Quebec City Presented By: Gail McMillan, Director Digital Library and Archives Virginia Tech

Agenda  Goals  What is LOCKSS?  Participating Universities  International ETD Preservation  Analysis and Results  Conclusion

Digital Preservation  Goal: Information should be  Readable  Usable in the future  Preservation – NOT just backup  Existing preservation techniques Floppy, CD and hard disk drives Central and distributed database servers

Technical Infrastructure Goals Build on successful LOCKSS open- source model Create dark archive for locally produced digital content Use off-the-shelf hardware Use open-source software Easy replication Demonstrate LOCKSS scalability

LOCKSS  Lots of Copies Keep Stuff Safe Peer-to-peer digital preservation system Open source software Turns an inexpensive desktop computer into a digital preservation appliance Easy, inexpensive way to  Collect  Store  Preserve  Provide access to the contents--or, not.

Functions of LOCKSS (1)  Collect Via a web crawler  Appropriate crawl rules are specified  Preserve and Audit Every institution preserves  Its own contents  Contents of partner universities  Contents are polled to determine authenticity and reinstate bad files

Functions of LOCKSS (2)  Provide access By running web proxies Open or restricted access  Dark Archives for partners’ ETDs Levels of access controlled at originating institutions  Administration Via a web user interface  Controlling access to cached contents and other functions

LOCKSS Preservation  Contents of each university (nodes M1 through M5) preserved at every other university Multiple, dispersed copies  Not a backup-- nothing is overwritten  All versions retained M1 M3 M2 M5 M4

ASERL-LOCKSS-ETD Initiative Florida State University Georgia Institute of Technology University of Kentucky University of Tennessee Vanderbilt University Virginia Polytechnic Institute and State University

Preservation using LOCKSS  Prerequisites Minimum hardware configuration LOCKSS software installed on all participating partners’ systems Permissions for the LOCKSS system to collect, preserve, periodically validate, repair ETDs

Example Hardware Configuration  Enterprise (3TB) Dell PowerEdge Server 1850 LOCKSS - $3500 Dell PowerEdge Server 1850 Firewall - $2500 Dell/EMC AX100 SAN (3TB) - $10,000 RedHat Enterprise AS – = $100 UPS - $700 Server Rack - $1200  Grand Total - $16, w/ Rack - $18,  Desktop (200Gb) Intel Based Desktop LOCKSS (200Gb) - $500 Intel Based Desktop Firewall - $350 CentOS Linux - $0 UPS - $50  Grand Total - $900.00

Participating Universities  International universities Pontifícia Universidade Católica do Rio de Janeiro, Brazil Humboldt-Universität, Germany University of Cape Town, South Africa  US universities Florida State University Georgia Tech Virginia Tech

International ETDs Preservation (1)  For international universities KS wrote plug-ins to collect contents (ETDs) from the 3 universities  For US universities Verified and reused OAI plug-ins for the 3 universities

International ETD Preservation (2)  Example ETD collection University of Cape Town ETD collection Manifest (i.e., permissions) page: html html Screen shots of UCT plug-in and the crawl results of contents follow

University of Cape Town Plug-in (1)

UCT plug- in: Crawl Results with Level (depth) =4 Fetch delay = 6 seconds

Harvested International ETD Collections

Harvested American ETD Collection [source: ]

Tutorial on how to write plug-ins  KS developed mini-tutorial  10 screens  This tutorial can be Generalized for ETD plug-ins Extended to write OAI plug-ins

Conclusion and Future Work  International ETDs can be harvested and preserved using LOCKSS and OAI-PMH  It requires cooperation and collaboration from participating universities  Future Work An online portal open for the public to view certain details Brazil expressed interest in formalizing ETD preservation for the NDLTD using LOCKSS

Acknowledgements  Special thanks to LOCKSS (Stanford University) Thomas Robertson Seth Morabito  Thanks to all participating universities Florida State Georgia Tech Humboldt-Universität, Germany Pontifícia Universidade Católica do Rio de Janeiro, Brazil University of Cape Town, South Africa Virginia Tech

Send Questions/Comments to