Collaborative Digital Preservation with LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.


Collaborative Digital Preservation with LOCKSS
Gail McMillan
Digital Library and Archives, University Libraries
Virginia Polytechnic Institute and State University
TRLN LOCKSS Symposium
University of North Carolina, Chapel Hill
November 1, 2005

The MetaArchive of Southern Digital Culture is a collaborative venture of Emory University, Georgia Tech, Virginia Tech, Florida State University, Auburn University, University of Louisville, and the Library of Congress. The project is part of the National Digital Information Infrastructure and Preservation Program (NDIIPP) supported by the Library of Congress.

The ASERL-LOCKSS-ETD Initiative is a collaborative venture between the LOCKSS Program at Stanford University and the University Libraries of the Florida State University, Georgia Institute of Technology, University of Kentucky, University of Tennessee, Vanderbilt University, and the Virginia Polytechnic Institute and State University, to preserve electronic theses and dissertations.

Commonalities
Belief in digital preservation strategy
Trusted partners
Standards
Open source software
Off-the-shelf hardware

NDIIPP: National Digital Information Infrastructure and Preservation Program
Created by federal legislation in December 2000
Support preservation of significant “born-digital” content at risk
Three areas of focus
  – Network of preservation partners
  – Architectural framework for preservation
  – Digital preservation research
Primary outcomes for partnerships:
  – Identify and preserve significant content
  – Leverage resources, experience via collaboration
  – Promote standards and best practices

MetaArchive NDIIPP Network via Internet2
[Network diagram. Partner sites: Auburn University, Emory University, Ga Tech, Va Tech, University of Louisville, Florida State University. Location labels: DC, NYC, CH, IN, ATL, FL. Networks: Lambda Rail, Abilene Network, SOX Network, MAX Network. Annotation: MAX Connection to Va Tech.]

Key Features of the MetaArchive of Southern Digital Culture
Distributed preservation strategy
Flexible organizational model
Formal content selection process
Capability for migrating archives
Dark archiving strategy
Low cost of deployment
Self-sustaining incentives
Simple preservation exchange mechanisms with the Library of Congress

MetaArchive Rights Issues
Use of protected works generally will need to:
  – Fit within an exception to the exclusive rights of owners, such as the “fair-use” doctrine or other provisions relating specifically to library copying and other activities
  – Undergo an investigation to determine whether the work enjoys protection or lapsed into the public domain due to notice or renewal defects
  – Occur as a result of permission from copyright owner(s)
  – Constitute acceptable risk for the institution in potential absence of “clear” resolution

MetaArchive Collaboration
CLOCKSS: Collecting Lots of Copies Keeps Stuff Safe
Diversifying LOCKSS
  – Software, hardware, collections, communities
Study issues
  – Dynamic content
  – Format migration (next grant)
Cooperative agreement model
  – Effective preservation network of broad digital content
Communication
  – Telephone conference, video conference I2, iVocalize Chat/VOIP Room, Wiki, PhpCollab

ASERL Project Using LOCKSS to Preserve ETDs
Suggested March 2005, call for volunteers May, first virtual meeting June 10; first harvest ____________
Test the use of LOCKSS to preserve electronic theses and dissertations
  – Available, restricted, and withheld from public access
  – Dark archive
Volunteer institutions committed
  – Personnel
  – Time
  – Finances
  – Hardware

ASERL ETD LOCKSS Project
Virginia Tech
NC State
Georgia Tech
Florida State
University of Miami
University of Tennessee
Vanderbilt
University of Kentucky

Technical Infrastructure: Goals
Build on successful LOCKSS open-source model
Create dark archive for locally produced digital content
Use off-the-shelf hardware
Use open-source software
Create ease of replication
Demonstrate LOCKSS scalability
Enable benefits of Internet2 network

MetaArchive Software
Operating System
  – RedHat Linux Enterprise AS v. 3/4
      Ease of update management and experience w/OS
  – Could easily work on other versions of Linux
      JAVA SDK
  – Also tested with CentOS Linux Distribution
LOCKSS Content Ingestion/Replication
  – LOCKSS Daemon – 6-8 week updates w/RPM
Conspectus Database
  – MySQL/PHP Interface
      Integrated with LOCKSS Plugin Registry
      MetaArchive Collection Description Metadata Schema

MetaArchive Hardware
Off-the-Shelf Strategy
  – Dell/Intel Based Hardware
      Could easily be HP or SUN Intel Based Hardware etc.
      Could be old desktops w/large hard drives.
  – New Low Cost SATA SAN EMC AX100
  – $4.00 per GB (already dropping in price)

Sample CLOCKSS Setups
Enterprise (3TB)
  – Dell PowerEdge Server 1850 LOCKSS - $3500
  – Dell PowerEdge Server 1850 Firewall - $2500
  – Dell/EMC AX100 SAN (3TB) - $10,000
  – RedHat Enterprise AS - $100
  – UPS - $700
  – Server Rack - $1200
Grand Total - $16,
  – w/ Rack - $18,
Desktop (200Gb)
  – Intel Based Desktop LOCKSS (200Gb) - $500
  – Intel Based Desktop Firewall - $350
  – CentOS Linux - $0
  – UPS - $50
Grand Total - $900.00
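
The itemized prices on this slide can be added up directly. The sketch below does only that arithmetic in Python. The prices are the ones listed on the slide; the variable names are illustrative, and treating the server rack as excluded from the first enterprise total (and included in the "w/ Rack" figure) is an assumption, since the enterprise totals are cut off in this transcript.

```python
# Itemized prices as listed on the slide, in US dollars.
# Assumption: the server rack is kept separate so that both a
# "without rack" and a "with rack" total can be computed.
ENTERPRISE_ITEMS = {
    "Dell PowerEdge Server 1850 (LOCKSS)": 3500,
    "Dell PowerEdge Server 1850 (firewall)": 2500,
    "Dell/EMC AX100 SAN (3TB)": 10000,
    "RedHat Enterprise AS": 100,
    "UPS": 700,
}
SERVER_RACK = 1200

DESKTOP_ITEMS = {
    "Intel based desktop (LOCKSS, 200Gb)": 500,
    "Intel based desktop (firewall)": 350,
    "CentOS Linux": 0,
    "UPS": 50,
}


def total(items):
    """Sum the prices of all items in a setup."""
    return sum(items.values())


enterprise_total = total(ENTERPRISE_ITEMS)
enterprise_with_rack = enterprise_total + SERVER_RACK
desktop_total = total(DESKTOP_ITEMS)

print(f"Enterprise, items only: ${enterprise_total:,}")
print(f"Enterprise, with rack:  ${enterprise_with_rack:,}")
print(f"Desktop:                ${desktop_total:,}")
```

The desktop sum agrees with the $900.00 grand total shown on the slide.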

Future Refinements
Currently Testing Administrative Interface for LOCKSS Networks
  – Enables partners to verify ETD backup and LOCKSS quorum
  – Enables ingest control for preservation groups once OAI harvesting is set up and/or plugin is published
International ETD LOCKSS storage nodes
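
As a rough illustration of what "verifying backup and quorum" could mean in a distributed network, the Python sketch below compares content digests across nodes and checks whether enough of them agree. It is not the LOCKSS polling protocol and not the administrative interface mentioned on the slide. The function names, node names, and quorum value are all hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Tuple


def content_digest(data: bytes) -> str:
    """Return a SHA-256 hex digest of one node's copy."""
    return hashlib.sha256(data).hexdigest()


def quorum_status(copies: Dict[str, bytes], quorum: int) -> Tuple[bool, List[str], List[str]]:
    """Check whether at least `quorum` nodes hold identical content.

    `copies` maps a (hypothetical) node name to the bytes it holds.
    Returns (quorum reached, agreeing nodes, disagreeing nodes).
    """
    if not copies:
        return False, [], []
    digests = {node: content_digest(data) for node, data in copies.items()}
    # The most common digest is treated as the agreed version.
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    agreeing = sorted(n for n, d in digests.items() if d == majority_digest)
    disagreeing = sorted(n for n, d in digests.items() if d != majority_digest)
    return votes >= quorum, agreeing, disagreeing


# Hypothetical example: four nodes, one holding a damaged copy.
example_copies = {
    "node-a": b"thesis contents",
    "node-b": b"thesis contents",
    "node-c": b"thesis contents",
    "node-d": b"thesis c0ntents",
}
reached, agreeing, disagreeing = quorum_status(example_copies, quorum=3)
print(reached, agreeing, disagreeing)
```

A real preservation network would also need to handle unreachable nodes, ties between versions, and repair of the damaged copy, all of which this sketch leaves out.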

NDLTD and LOCKSS
“dedicated to … preservation … of electronic theses and dissertations.”
NDLTD Board of Directors endorsed ASERL and MetaArchive approaches to digital preservation
ETD-IPN: Electronic Thesis and Dissertation International Preservation Network