The Digital Preservation Network at UT Austin Chris Jordan Texas Advanced Computing Center.

Slides:



Advertisements
Similar presentations
Texas Digital Library Services Preservation Network.
Advertisements

ETD Management in the Texas Digital Library Adam Mikeal Texas Digital Library ETD 08 Aberdeen, Scotland June 6, 2008.
An Overview From a Technical Perspective Sebastien Korner Representing the DPN Technical Team PASIG May 22, 2013.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
The Internet2 NET+ Services Program Jerry Grochow Interim Vice President CSG January, 2012.
The Frame NSF-funded national supercomputer centers Centers have hosted significant projects: TeraGrid, NPACI, GEON, SCEC, Chronopolis Fostered development.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
DPN Digital Preservation Network. Digital Preservation.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph JaJa, Mike Smorul, Mike McGann.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Preservation Collaboration: NDLTD & MetaArchive Cooperative Gail McMillan Digital Library and Archives, Virginia Tech Newcomers’ ETDs 2010 University.
Preservation In The Cloud Markus Wust NCSU Libraries.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
The University of Texas Research Data Repository : “Corral” A Geographically Replicated Repository for Research Data Chris Jordan.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Vireo: The TDL Solution to Electronic Thesis and Dissertation Submission and Management Brought to you by the Texas Digital Library
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
Near East Rural & Agricultural Knowledge and Information Network - NERAKIN Food and Agriculture Organization of the United Nations Near East and North.
DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.
TDL Forum WEDNESDAY, APRIL 16, Agenda - Updates & Announcements ◦TCDL 2014 (Kristi) ◦Vireo Users Group Meeting (Kristi) ◦Staffing (Ryan) ◦SHARE.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Texas Digital Library CENTRAL TEXAS AND SAN ANTONIO-AREA REGIONAL MEETING SEPTEMBER 5, 2013.
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
Research Data Management Victoria University Context Lyle Winton Adrian Gallagher Julie Gardner.
Corral: A Texas-scale repository for digital research data Chris Jordan Data Management and Collections Group Texas Advanced Computing Center.
Richard MarcianoChien-Yi Hou Caryn Wojcik University of University of State of Michigan North Carolina North Carolina Records Management ServicesSALT DCAPE.
Preserving ETDs: NDLTD & MetaArchive Collaboration Gail McMillan Digital Library and Archives, Virginia Tech Newcomers’ USETDA 2012.
Academic Preservation Trust Introducing APTrust. HOW THE DISCUSSION BEGAN In the beginning…
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
DuraCloud Enabling services for managing data in the cloud Michele Kimpton, CBO DuraSpace Bill Branan, Senior Developer DuraSpace.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
A consortium committed to digital preservation Academic Preservation Trust CNI Fall Member Meeting 12/10/12 1 Robin Ruggaber, University of Virginia Michele.
Session 3.  Now you know WHY to make policies and WHAT they should contain…  But HOW do you implement policies?  And then HOW do you implement a program.
Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
Chronopolis – MetaArchive Improving and Strengthening Inter-Institutional Preservation.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Ministry of Science and Technology Mozambique Research and Education Network - MoRENet Jussi Hinkkanen Ministry of Science and Technology Mozambique.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Replicate Research Data Safely eudat.eu/b2safe B2SAFE How to replicate your data using EUDAT’s B2SAFE Version 3 November 2015 This work is.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
Built on the Powerful Microsoft Azure Platform, Forensic Advantage Helps Public Safety and National Security Agencies Collect, Analyze, Report, and Distribute.
DPLAfest, April 15, 2016 Chip German, Program Director, APTrust and Senior Director, Content Stewardship, at the University of Virginia Library
A Shared Commitment to Digital Preservation and Access.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
DIGITAL PRESERVATION NETWORK DPLAfest 2016 Mary Molinaro DPN Chief Operating Officer.
BEST CLOUD COMPUTING PLATFORM Skype : mukesh.k.bansal.
Joseph JaJa, Mike Smorul, and Sangchul Song
Public Key Infrastructure from the Most Trusted Name in e-Security
HIMSS National Conference New Orleans Convention Center
Research data preservation in Canada
Michele Kimpton Project Director, DuraCloud NDIPP Partner meeting
The MetaArchive Model: Distributed Digital Preservation Networks
Presentation transcript:

The Digital Preservation Network at UT Austin Chris Jordan Texas Advanced Computing Center

DPN Member Repository DPN MEmber DPN Member Repository DPN Member Reposiitory What Is DPN? DPN Member 57 member organizations cooperatively investing in long-term, scalable, digital preservation

Preservation System DPN Member Repository DPN Member Repository DPN Member Reposiitory What Is DPN? DPN Member technical staff and systems from 5 large scale preservation repositories

Preservation System DPN Member Repository DPN MEmber DPN Member Repository DPN Member Reposiitory What Is DPN? DPN Member …working groups of experts in succession rights, business services, communications and research data…

DPN Node What is DPN? All building a digital preservation backbone for the academy

What Does DPN Do? 1.Establishes a network of heterogeneous, interoperable, trustworthy, preservation repositories (Nodes) 2.Replicates content across the network, to multiple nodes 3.Enables restoration of preserved content to any node in the event of data loss, corruption or disaster 4.Ensures the ongoing preservation of digital information from depositors in the event of dissolution or divestment of depositors or an individual repository 5.Provides the option to (technically and legally) "brighten content" preserved in the network over time

Initial DPN technical partners Initial DPN launch will feature five nodes: Academic Preservation Trust (APTrust) Chronopolis HathiTrust Stanford Digital Repository (SDR) University of Texas Data Repository (UTDR) And a participating partner: DuraSpace

DPN, UT and TDL TACC & TDL have an established partnership TACC also collaborates with UT Library on: – Data Management Planning – Local research support – HPC for Digital Libraries DPN extends these efforts to include design and implementation of a replicating node

What is UTDR? UT Research Cyberinfrastructure Initiative Supports all 15 UT System Schools with: – High Performance Computing – 10Gb Research Network – 5PB Replicated Data Repository Deployed in early 2012, now over 100 investigators, 100s of users, over 1PB allocated

TACC Capabilities Corral UTDR System – 5PB, geographically replicated online storage iRODS Data Management, Databases, Web applications Ranch – 100PB+ Tape Archive capacity Additional data-intensive systems this year Stampede/Lonestar/Longhorn – World-Class Supercomputing and Visualization

DPN Network Concepts “First Nodes” submit data packages “Replicating Nodes” hold copies of data Messaging framework and Registry track data submissions and replicas “Bags” are used to package data for preservation – contents are opaque to DPN Each node provides its own interfaces

DPN Design Principles Nodes should be as independent as possible Content owners should have control over format of data Network should be flexible – easy to add and remove nodes Diversity of implementation is crucial to successful long-term preservation

Components in Technical Architecture Messaging infrastructure to support federated services Registry to track objects within the federation, including copies, version, rights, brightening information Transfer mechanisms (rsync, https, gridFTP, etc.) Private PKI for securing transport layers Logging and reporting Other components we implement separately, but may be common, for example a secure transfer area. DPN objects that hold administrative content such as DPN framework agreements, DPN bagit profiles, versioned Brightening information for a collection/repository

TDL and DPN In DPN terms, TDL is a content provider and “first node” TDL retains primary responsibility for data DPN provides a backup function for institutional, technical, or other failures TACC provides storage for both TDL and DPN – Data packages will be separate – Content packaging will be different

UT DPN Implementation UT Library, TACC have significant presence in DPN leadership teams Participation in technical, sustainability, other DPN working groups Library will provide interfaces to TDL and other local repositories TACC will provide back-end storage and other implementation components

Other Repositories and DPN DPN is effectively a “dark archive” Repositories still must have their own solutions for access/data management/etc But DPN can provide preservation functions If you are a DPN member and can generate “bags” you can deposit data into DPN Many institutions are already DPN members Membership is open but fee-based

The DPN Technical Team APTrust Scott Turnbull Tim Sigmon Adam Soroka Chronopolis David Minor Mike Smorul Don Sutton DuraSpace Andrew Woods HathiTrust Sebastien Korner Bryan Hockey Stanford Tom Cramer James Simon Texas Data Repository Ladd Hanson Christopher Jordan