MetaArchive Cooperative Annual Membership Meeting Welcome & Overview Dr. Martin Halbert MetaArchive Annual Membership Meeting Atlanta, Georgia Friday,

Presentation transcript:

MetaArchive Cooperative Annual Membership Meeting
Welcome & Overview
Dr. Martin Halbert
MetaArchive Annual Membership Meeting
Atlanta, Georgia
Friday, October 24, 2008

Structure of the Meeting
0900 AM  Welcome to MetaArchive Membership Meeting and Overview of PLN Operations (Dogwood Boardroom)
1000 AM  MetaArchive Architecture, Conspectus database and cache manager interfaces (Dogwood Boardroom)
1100 AM  TECH TRACK: Archive Ingestion (Maple Boardroom)
1100 AM  ADMIN TRACK: MetaArchive Cooperative Membership Meeting (Dogwood Boardroom)
1230 PM  Lunch (Conference Center Dining Room)
0130 PM  TECH TRACK: MetaArchive Node Systems Administration (Maple Boardroom)
0130 PM  ADMIN TRACK: MetaArchive Cooperative Membership Meeting (Dogwood Boardroom)
0400 PM  Meeting Conclusion (Dogwood Boardroom)
10/24/2008 MetaArchive 2008 Membership Meeting

Welcome! to both Original and New Members
You are now members of the fastest growing digital preservation cooperative in the world
You and your institutions are now part of an extended community dedicated to the active preservation of the shared cultural memory of our society over the long term
Welcome to the first annual membership meeting of the MetaArchive Cooperative!

Session Questions
What is the MetaArchive Cooperative?
What is distributed digital preservation?
Why has MetaArchive embraced it as the most critically important strategy for the preservation of digital archives?
Why did we form it?
What differentiates it from other efforts?
How can we best conceptualize several new Archives within MetaArchive?

What led to MetaArchive?
Planning meetings by librarians and archivists on concerns about preserving digital archives
Sense that we needed to do something practical to help each other preserve our data
Not based on studies, just the observation of our collective anxieties about keeping our (expensive) digital materials preserved and viable

The Data Loss Problem
From NDIIPP Website on the Importance of Digital Preservation (

The Gap in Digital Preservation Programs
66% of cultural heritage institutions (academic libraries, archives, art museums, public libraries, and other similar kinds of institutions) report that no one is responsible for digital preservation activities
30% of all archives have been backed up one time or not at all
Source: 2005 NEDCC Survey by Bishoff and Clareson

The Need for Collaborative Approaches
“The increased number and diversity of those concerned with digital preservation—coupled with the current general scarcity of resources for preservation infrastructure—suggests that new collaborative relationships that cross institutional and sector boundaries could provide important and promising ways to deal with the data preservation challenge. These collaborations could potentially help spread the burden of preservation, create economies of scale needed to support it, and mitigate the risks of data loss.”
- The Need for Formalized Trust in Digital Repository Collaborative Infrastructure, NSF/JISC Repositories Workshop (April 16, 2007)

Backups versus Digital Preservation
What differentiates a schedule for data backups from a digital preservation program?
Backups are tactical measures. Backups are typically stored in a single location (often nearby or collocated with the servers backed up) and are performed only periodically. Backups are designed to address short-term data loss via minimal investment of money and staff time resources. Backups are better than nothing, but not a comprehensive solution to the problem of preserving information over time.
Digital preservation is strategic. Preserving information over long periods requires systematic attention rather than benign neglect or unthinking actions.

Institutional Repositories versus Digital Preservation
What differentiates an IR program from a distributed digital preservation program?
The IR is not distributed. The IR is a centralized approach aimed at managing information flow within the institution. It typically does not attempt to securely cache prioritized content at multiple geographically dispersed sites.
DDP mobilizes efforts of multiple institutions. A digital preservation program entails a geographically dispersed set of secure caches of critical information. A true digital preservation program will require multi-institutional collaboration and at least some ongoing investment to realistically address the issues involved in preserving information over time.

Secure and Distributed Cache Networks
Why are the characteristics of geographic distribution and security so important?
This strategy maximizes survivability of content in both individual and collective terms:
  o Security reduces the likelihood that any single cache will be compromised.
  o Distribution reduces the likelihood that the loss of any single cache will lead to a loss of the preserved content.
By creating a collaborative network for secure and distributed preservation, a group can also work together on more complex issues such as format migration.
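The effect of distribution on survivability can be illustrated with a small calculation. The sketch below is not from the slides: it assumes each cache is lost independently with the same probability, which real networks only approximate (shared software, shared funding, or correlated disasters break independence). The function name `loss_probability` and the 5% figure are hypothetical, chosen only for illustration.

```python
def loss_probability(p_single: float, copies: int) -> float:
    """Chance that ALL copies are lost, assuming independent, identical caches.

    p_single: assumed probability that one cache is lost (hypothetical input).
    copies:   number of geographically dispersed caches holding the content.
    """
    if not 0.0 <= p_single <= 1.0:
        raise ValueError("p_single must be between 0 and 1")
    if copies < 1:
        raise ValueError("copies must be at least 1")
    # Content is lost only if every single cache is lost.
    return p_single ** copies


if __name__ == "__main__":
    # Illustrative only: 5% is an assumed per-cache loss probability.
    for n in (1, 3, 6):
        print(f"{n} cache(s): {loss_probability(0.05, n):.2e}")
```

Under these assumptions each added cache multiplies the chance of total loss by the per-cache loss probability, which is why dispersal matters more than hardening any single site.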

Both Technical and Organizational Networking are Required
A single cultural heritage organization is unlikely to have the capability to operate several geographically dispersed and securely maintained servers
Collaboration between institutions on technological solutions is essential
Similarly, inter-institutional agreements must be put in place or there will be no commitment to act in concert over time

Shared Archiving Fails without a Pre-coordinated DDP Network in Place
Lessons from the NDIIPP Archive Ingest and Handling Test (AIHT) and other shared archiving experiments:
  o Many unexpected incompatibilities arise because of different systems and data packaging
  o Much of the cost of preserving digital material lies in coordinating the organizational and institutional imperatives of preservation, not in the technological cost of storage space

MetaArchive Cooperative
A distributed digital preservation cooperative for digital archives
Established under the auspices of and with funding from the National Digital Information Infrastructure and Preservation Program (NDIIPP) of the U.S. Library of Congress, but now international in scope and focus
A functioning DDP network and cooperative for libraries and other cultural heritage organizations
Sustained by cooperative fee memberships, LC contracts, and other sponsored funding
Provides training and models for other groups to establish similar distributed digital preservation networks
Fosters broader awareness of digital preservation issues

MetaArchive compared with Other Efforts
MetaArchive is a cooperative, not a vendor:
  o A cooperative (also co-op) is an organization that consists of a group of individuals who have joined together to perform a function more efficiently than each individual could do alone. The purpose of a cooperative is not to make profits, but to improve each member's situation and the situation of the surrounding society.
MetaArchive is a collaborative association of cultural memory organizations with a nonprofit administration
All hardware and software assets are owned by members
Membership fees go to a central pool of support for members’ co-op activities

MetaArchive Project – Phase I ( )
Developed a working model for distributed digital preservation (DDP) in which institutions with shared subject domain focus mobilize for mutual benefit
Developed a technical solution for DDP based on a reuse of LOCKSS technology, in the form of a separate network with higher capacity nodes
Created an administrative nonprofit corporation
Now preserving via DDP more than 120 collections from many different organizations

Collection Variety
Collections include:
  o Images
  o Text files
  o Multimedia files
  o Datasets
  o Program executables

MetaArchive Founding Members

Catalytic Efforts
Hosted first workshop in distributed digital preservation strategies in 2007
  o Instructed new MetaArchive members in processes
  o Advised other groups considering DDP approaches
Assisted in creation of two additional DDPNs
  o Alabama
  o Arizona

MetaArchive Cooperative – Phase II ( )
Established additional distributed archives
  o Transatlantic slave trade historical data
  o ETD distributed archive
Became international with the addition of Hull University in UK
Recent DDP workshops
  o ETD 2008
Plan to double in size each year for this period
With funding from NHPRC will provide consulting and outreach services on the MetaArchive model for DDP services

Institutional Roles
Preservation Sites are entities responsible for the ongoing activity of preserving digital content. At a minimum, every preservation site must include responsible staff and a node server of the relevant preservation network. Preservation sites collectively comprise a preservation network.
Development Sites are responsible for technical development of the computer systems that enable the preservation network. Obviously, development sites may also be preservation sites and/or contributing sites.
Contributing (Content) Sites are institutions that need to preserve digital content, and therefore decide to contribute digital content into the preservation network. The preservation network acts for the common good to preserve the at-risk content submitted by the contributing sites. Contributing sites may also be preservation sites.

Individual Roles
Program Managers are leaders who accept responsibility for coordinating the activities of a digital preservation network.
Data Wranglers are programmers and other technically adept workers who prepare local digital archives for ingestion into a preservation network.
System Administrators are staff members who maintain individual preservation node servers of the relevant preservation network.
Selectors are staff who identify and prioritize content to be preserved. They will most often be knowledgeable concerning the content of an institution’s digital archives, and may have been the same individuals who originally created or acquired the archives.

Archive Creation Process
Identify shared collection focus to mobilize institutions in creating a shared archive
Several shared Archives in MetaArchive:
  o Southern Digital Culture
  o ETD Repositories
  o Black Cultural History? Forced Migration?
  o Manuscripts and Rare Books
Conspectus Database enables collaborators to record metadata about collections for preservation decisions
Each new institution signs an agreement to mutually preserve others' collections securely

Archive Ingest Process
A “Plugin” is written for collections selected for preservation
Plugins are programs describing rules and structure for the “archival unit”
Either local staff or MetaArchive staff write these plugins and install them in the network
At least 6 dispersed sites are selected for repositing the archival unit
Caching process begins, with updates following if necessary
5/21/2008 Indiana Digital Preservation Summit - Halbert
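The site-selection step can be sketched in code. This is a hypothetical illustration only, not the LOCKSS or MetaArchive mechanism: the slides state only that at least 6 dispersed sites are chosen, not how. The sketch assumes each candidate node carries a region label and spreads the selection across regions in round-robin order. The names `select_sites`, `MIN_SITES`, and the node labels are invented for the example.

```python
from collections import defaultdict
from itertools import zip_longest

MIN_SITES = 6  # "at least 6 dispersed sites", per the slide


def select_sites(nodes, count=MIN_SITES):
    """Pick `count` nodes, spreading them across regions where possible.

    nodes: list of (node_name, region) tuples (hypothetical data model).
    Returns node names in the order they were chosen.
    """
    if count < MIN_SITES:
        raise ValueError(f"count must be at least {MIN_SITES}")
    if len(nodes) < count:
        raise ValueError("not enough candidate nodes")

    # Group node names by region, preserving input order.
    by_region = defaultdict(list)
    for name, region in nodes:
        by_region[region].append(name)

    chosen = []
    # Round-robin: take the first node of every region, then the second, etc.
    for round_of_nodes in zip_longest(*by_region.values()):
        for name in round_of_nodes:
            if name is not None and len(chosen) < count:
                chosen.append(name)
    return chosen
```

With eight candidates across six regions, the first pass already yields one node per region; with fewer regions than required sites, later passes fill the remainder from regions that have spare nodes.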

Factors to Consider
Recovery of data in the event of loss should be planned for, not put off for another day
Whose job is this going to be in the organization?
What are the highest priority items for distributed digital preservation?
What digital preservation challenges are most important to your institutional setting?
What expertise can you share with the other members of the cooperative?


New Archives Discussion
“Archives” in the MetaArchive identify and articulate a shared collection focus to mobilize institutions in contributing to a shared preservation agenda
New shared Archives under development in the MetaArchive Cooperative:
  o ETD Repositories (planning and implementation discussion will take place this afternoon)
  o Black Cultural History? Forced Migration? (we need to articulate the focus of the collaboration with York and Hull)
  o Manuscripts and Rare Books (the Folger will be the cornerstone of an obvious new Archive)

Black Cultural History? Forced Migration?
Founding members: York, Hull, Emory
Arguments for “Black Cultural History”:
  o Broad focus on major cultural area of study
  o Synergistic overlap with Southern Digital Culture Archive would jumpstart with many collections and enable Southeastern US MetaArchive institutions to logically participate
Arguments for “Forced Migration”:
  o Another broad area of focus, especially internationally
  o Would enable reaching out to other US and UK institutions (notably Oxford FMO)

Manuscripts and Rare Books
Addition of Folger Shakespeare Library brings a new focus to MetaArchive on digitized early modern literature
Many libraries and archives have digitized collections in these areas, offering a logical, broad area of unifying focus for a new Archive
Other possible formulations: early modern literature archive, English literature

MetaArchive 2008 Annual Meeting
WELCOME!