Mairéad Martin, Penn State University Commons Solutions Group Storage Workshop May 2010.

Slides:



Advertisements
Similar presentations
Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Advertisements

A Community Approach to Preservation: Experiences with Social Science Data ASIST Summit 2010 Jonathan Crabtree April 9, 2010.
OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
A centre of expertise in data curation and preservation DCC Workshop: Curating sApril 24 – 25, 2006 Funded by: This work is licensed under the Creative.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Trustworthy Repository Criteria, Virtual Organizations, and Infrastructure MacKenzie Smith, MIT Libraries NDIIPP Meeting, July 2010.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
1 What is RUcore?  A cyberinfrastructure for the Rutgers Community that includes:  An institutional repository, to preserve, manage and make accessible.
Richard MARCIANO Chien-Yi HOU School of Information and Library Science (SILS) Sustainable Archives & Leveraging Technologies Group (SALT) University of.
Digital preservation Hydra Europe, LSE 24 April 2015 Anders Conrad.
Promoting Digital Preservation Partnerships at the U.S. Library of Congress April 2004.
Toward a Distributed and Collaborative Framework for Preservation Martin Halbert, UNT Dean of Libraries David Minor, Chronopolis Program Manager Katherine.
New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
Social Science Data and ETDs: Issues and Challenges Joan Cheverie Georgetown University Myron Gutmann ICPSR – University of Michigan Austin McLean ProQuest.
DATA CURATION & PRESERVATION CSG Fall Meeting, Princeton Mairéad Martin Penn State September, 2012.
Trends & Challenges in Digital Object Storage Infrastructure: Notes from the National Digital Stewardship Alliance (NDSA) Infrastructure Working Group.
May 12, 2006Spring 2006 Common Solutions Group Archival, Digital Preservation, and Records Management David Millman, Columbia University Ron Thielen, University.
Open Access Symposium 2015 Open Access, the Law, and Public Information Mary Alice Baish UNT Dallas College of Law May 19, 2015 National Plan for Access.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
Cloud Task Replica Repository Preservation Tools Open Repositories Atlanta Richard Rodgers MIT Libraries.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
Katherine Skinner, Emory University Gail McMillan, Virginia Tech NDIIPP Annual Partners Meeting June 24, 2009.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
1 Designing Storage Architecture for Digital Collections 2012.
Preserving eScholarship and Digitized Special Collections Distributed Digital Preservation Bill Donovan
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Safeguarding the Freedom of Information: Digital Archive Initiatives in the United States Federal Government Michael Paul Huff Information Resource Officer.
Interoperability within the Grid NDIIPP Partners Meeting Arlington, VA July 9, 2008 Interoperability within the Grid Robert H. McDonald Digital Preservation.
Katherine Skinner, Executive Director, Educopia Institute ESOPI 2013 Chapel Hill, NC April 19, 2013.
Session 3.  Now you know WHY to make policies and WHAT they should contain…  But HOW do you implement policies?  And then HOW do you implement a program.
Martin Halbert President, MetaArchive Cooperative DigCCurr 2009 Meeting Chapel Hill, NC Friday, April 3, 2009.
Developments in long term preservation LIBER 2012, Marcel Ras.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Chronopolis – MetaArchive Improving and Strengthening Inter-Institutional Preservation.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Martin Halbert MetaArchive Cooperative Thursday, June 25, 2009 NDIIPP Annual Meeting Washington, D.C.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
N EXT - GENERATION R ESEARCH A ND THE U NIVERSITY OF C ALIFORNIA WHAT THE UC LIBRARIES BRING TO THE EQUATION Brian E. C. Schottlaender The Audrey Geisel.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
SAN DIEGO SUPERCOMPUTER CENTER Replication Policies for Federated Digital Repositories Robert H. McDonald Chronopolis Project Manager
The National Digital Information Infrastructure and Preservation Program (NDIIPP) Challenges and Solutions Laura E. Campbell Associate Librarian for Strategic.
Digital preservation of CBUC theses with MetaArchive 11th SELL Meeting Porto, June 4th 2011.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Katherine Skinner, Martin Halbert & Matt Schultz Educopia Institute and MetaArchive Cooperative NDSA Infrastructure Committee
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Digital Preservation MetaArchive Cooperative, Digital Preservation Policy Planning Workshop Boston College, Boston, MA October 26, 2010.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
CMU Libraries’ Digital Assets Preservation Strategy Presenter Gabrielle V. Michalek Principal Archivist and Head, Archives/Digital Library Initiatives.
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
Agenda’s for Preservation Research Micah Altman MIT Libraries Prepared for SAA Research Forum Atlanta August 2016.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Joseph JaJa, Mike Smorul, and Sangchul Song
Research data preservation in Canada
PASIG LOCKSS Seminar Agenda
The MetaArchive Model: Distributed Digital Preservation Networks
Archiving and preservation services in the cloud
Presentation transcript:

Mairéad Martin, Penn State University Commons Solutions Group Storage Workshop May 2010

 Designing and implementing storage architectures and systems to support data curation and preservation needs ◦ What does this entail? ◦ Who’s thinking about this? ◦ Who’s doing anything about this?

 Digital Preservation ◦ Managed activities to ensure long term retention, retrieval of, and access to data  Digital Curation ◦ Maintaining, preserving, and enhancing data throughout its lifecycle  Archival storage ◦ Depends on who you talk to  Information Lifecycle Management ◦ Storage industry term for the above  Object-based storage ◦ Data with metadata “container”

 eScience/eResearch data management needs  NSF requirement for data management plans  Compliance ◦ e-Discovery, FERPA, HIPAA, Sarbanes-Oxley ◦ Institutional record retention regulations and policies  Storage services for libraries, archives, cultural heritage entities  Great efficiencies

 Storage is cheap  Storage is smart  Stuff on the Internet is persistent  Digital safer than analog  Storage provider = curators and preservation experts  Repositories take care of preservation  Metadata will take care of it  Libraries will take care of it  The Cloud will take care of it

 New roles, new responsibilities, new collaborations, practices, workflows  Intellectual capital requirements – digital preservation/curation policy determination and implementation  Bar for trust is rising  Cloud antithetical to preservation?  Increased storage management requirements  Scaling issues with preservation requirements

 More likely to meet these today at the system level – DR & BC practices and tiered storage architectures  Immutable storage  Data integrity checking ◦ Mitigation of bit rot ◦ Auditing function  Mitigation of obsolescence ◦ File format migration  Deposition as important as retention  Need for storage management metadata ◦ Technical – file size, name, location, ACL, date, time, versioning,  Biggest need: system-independence

 iRODS (integrated Rule-based Data System)  Storage Resource Broker (SRB)  Content Addressable Storage (CAS) ◦ Fixed content storage, retrieval based on content rather than location  eXtensible Access Method (XAM) ◦ Emerging SNIA standard for an API for content- addressable storage objects

 NSF DataNet Program ◦ Data Conservancy project – JHU lead with 23 institutions to create curation, discovery, and preservation network  Chronopolis ◦ SDSC, UCSD, UMIACS, NCAR: Federated data grid using SRB/iRODS  LOCKSS (Lots of Copies Keep Things Safe) ◦ Replication of licensed journals and other content  MetaArchive – ◦ a private LOCKSS archive  Internet Archive

 National Digital Information Infrastructure & Preservation Program (NDIIP) ◦ Library of Congress program to “to develop a national strategy to collect, preserve and make available significant digital content via a preservation network of over 130 partners."

 California Digital Library ◦ Curation Micro-services  DuraSpace ◦ DuraCloud project to implement a preservation- oriented cloud storage service  HaithiTrust ◦ Repository and storage infrastructure initiated for CIC Google book project  Sun Preservation and Archiving SIG (PASIG)  Storage Networking Industry Association

 Content Stewardship Program – strategic collaboration between University Libraries and Information Technology Services (ITS)  Goal: a suite of services to support the lifecycle of the digital object – creation, discovery, access, storage, preservation and archiving  Hired Digital Library Architect and Digital Collections Curator  Governance in place

 Anchor projects/activities: ◦ Storage and Preservation strategy development  Prototyped the XAM standard for archival storage ◦ Institutional record repository ◦ Research data prototype ◦ Best practices for data management ◦ ETD platform replacement  Sponsoring curation technology workshop in August  LOCKSS member, recently joined MetaArchive  Exploration of California Digital Library’s curation micro-services  Application of service management principles and processes to the above

 What are CSG member institutions doing in this space?