The DuraCloud Workshop Your hosts: Bill Branan & Carissa Smith.

Slides:



Advertisements
Similar presentations
Texas Digital Library Services Preservation Network.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Preserv: Preservation architecture and interface A brief overview of ideas wrt to the project plan For Preserv partners meeting, BL, London, 18th November.
OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Depositing e-material to The National Library of Sweden.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
Archivematica-Islandora Integration Module Evelyn McLellan
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Preservation In The Cloud Markus Wust NCSU Libraries.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
“Salesforce” - meet Amazon Cloud Upload unlimited files sizes to your salesforce Get Started On Demand Files Storage for Salesforce Cloud Storage, a Salesforce.
ArcGIS Workflow Manager An Introduction
Linux Operations and Administration
#watitis2014 ONTARIO LIBRARY RESEARCH CLOUD: BUILDING A PROVINCE-WIDE RESEARCH CLOUD FOR ONTARIO’S ACADEMIC LIBRARIES.
Customized cloud platform for computing on your terms !
DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.
Tutorial 11 Installing, Updating, and Configuring Software
“Filling the digital preservation gap” an update from the Jisc Research Data Spring project at York and Hull Jenny Mitcham Digital Archivist Borthwick.
TDL Forum WEDNESDAY, APRIL 16, Agenda - Updates & Announcements ◦TCDL 2014 (Kristi) ◦Vireo Users Group Meeting (Kristi) ◦Staffing (Ryan) ◦SHARE.
An Introduction to DuraCloud Michele Kimpton, Project Director Carissa Smith, Partner Specialist DuraSpace Webinar  Sept 2011.
Codeigniter is an open source web application. It occupies a very small amount of space in the memory and is most useful for developers who aim to develop.
A brief overview… “The Obama Administration is committed to the proposition that citizens deserve easy access to the results of scientific research their.
DuraCloud pilot program Michele Kimpton, CEO DuraSpace Richard Rodger, Dept Head Software development, M.I.T. Libraries Claire Stewart Dept Head Digital.
Texas Digital Library CENTRAL TEXAS AND SAN ANTONIO-AREA REGIONAL MEETING SEPTEMBER 5, 2013.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
DuraCloud Enabling services for managing data in the cloud Michele Kimpton, CBO DuraSpace Bill Branan, Senior Developer DuraSpace.
IT 456 Seminar 5 Dr Jeffrey A Robinson. Overview of Course Week 1 – Introduction Week 2 – Installation of SQL and management Tools Week 3 - Creating and.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
AIP Backup & Restore Sunita Barve NCRA, Pune. AIP The latest version of DSpace 1.7.0, supports backup and restore of all its contents as a set of AIP.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Preservation in the Cloud: Three Ways Michele Kimpton – CEO, DuraSpace Richard Rodgers, Mark Leggott, & Simon Waddington Open Repositories 2012  Edinburgh,
ArcGIS Server for Administrators
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
OAIS: From Requirements to Reality at OCLC FLICC / CENDI Symposium, Dec Pam Kircher Product Manager, Digital Archive OCLC Digital & Preservation.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
The New DRS Introduction. What is DRS? Digital repository for preservation and access – Maintains integrity of deposited content – Preserves content for.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
UC3 Services In-Depth: Data Curation for Practitioners 2012 Workshop.
CharMeck.org Contributer Training SharePoint 2013 Orientation and Basic Training.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
DuraCloud for Dummies Should I stay or should I go [to the cloud]? ~Carissa Smith, DuraSpace.
Licensed under Creative Commons Attribution-Share Alike 3.0 Unported License (CC BY-SA 3.0) To request other use: Integrating DSpace.
Archiving and Preservation Michele Kimpton CEO, DuraSpace Bryan Beecher Director, ICPSR DuraSpace Webinar November 2, 2011.
A Technical Overview Bill Branan DuraCloud Technical Lead.
Enterprise Messaging & Collaboration. e-Interact Modules.
IUScholarWorks Repository Update Jim Halliday, Stacy Konkiel & Jennifer Laherty.
Store, Manage, and Archive Content in the Cloud Michele Kimpton, DuraSpace CEO DuraSpace Nate Klingenstein, Internet 2 Internet 2 meeting, April 2013.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Here are some things you can do while you wait 1.Open your omeka.net site in your browser (e.g. 2.Open.
CloudBerry Explorer for S3. CB Explorer Free to use Browse and manage files PowerShell functions Open and edit files  CloudBerry Explorer is an easy.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Explore Various Options for Bulk File Transfer out of Alfresco Craig Tan Technical Account Manager.
Preservation support and data management services built on cloud infrastructure Features:  Ingest once, and make multiple copies to multiple storage.
APTrust and Georgetown University Library
Business Directory REST API
Amazon Storage- S3 and Glacier
Joseph JaJa, Mike Smorul, and Sangchul Song
Module 01 ETICS Overview ETICS Online Tutorials
Michele Kimpton Project Director, DuraCloud NDIPP Partner meeting
ArchivesSpace – Archivematica – DSpace Workflow Integration
Archiving and preservation services in the cloud
Presentation transcript:

The DuraCloud Workshop Your hosts: Bill Branan & Carissa Smith

1.The Basics 2.Lab: Getting Started with DuraCloud 10:30 (in Cosmopolitan AB) 4.Architecture and Integrations 5.Lab: DuraCloud Deep Dive 12:30 (in Cosmopolitan AB) What we plan to accomplish.

●Informal, let’s have some fun :) ●Ask questions! ●What do you want to learn about DuraCloud? The (un)format.

●First, there was DuraSpace: ○ 501(c)(3) not-for-profit ○ created from Fedora Commons, Inc. + DSpace Foundation ○ “committed to our shared digital future” ●DuraSpace = Projects + Services ○ DSpace, Fedora, VIVO ○ DuraCloud, DSpaceDirect, ArchivesDirect A little background.

1.Library of Congress funding 7/14/2009 a market assessment & initial development b pilot phase 1 (storage) c pilot phase 2 (interface, services) 2.DuraCloud Launched 11/1/ Integrations/Features Chronopolis/DPN Then there was DuraCloud.

●open source software ●offsite storage ●one interface ○ multiple cloud copies ○ multiple storage providers ●automated synchronization ●audited content health checks ●consolidated billing and agreements ●integrations with other services ○ DSpace, Archive-It, Archivematica, etc. This is DuraCloud.

●a repository replacement ●an Amazon account billed by DuraSpace ●intelligent about metadata ●a network drive or file system This is *not* DuraCloud.

●Why DuraCloud over Amazon? ●Reality: DuraCloud is built on Amazon (but with additional preservation services): ○ MD5 content health checks ○ content upload tools ○ synchronization (local to cloud & cloud copies) ○ integration with other storage providers ●Honest Truth: sometimes it makes sense to use DuraCloud & other times Amazon directly The burning question.

❏ Use DuraCloud when you want… ❏ multiple copies of your content ❏ cloud storage with different providers ❏ cloud storage in different geographic locations ❏ simple tooling for managing data transfer ❏ audited health checks of cloud provider ❏ a pathway to DPN ❏ integration with other open source technologies ❏ one annual predictable storage invoice ❏ personalized support So...

1.Go to duracloud.org. 2.Click “no waiting: try it now”. 3.Create a trial account. 4.Check your . 5.Log in at 6.Poke around. Let’s get started.

A brief tour.

1.Upload a file or two. 2.Copy a file. 3.Delete a file. 4.Add properties and tags. 5.Navigate between storage providers. Let’s kick the tires.

The administrator view.

●Create, edit, delete spaces. ●Make content “public”. ●Turn streaming on/off. ●Manage user/group permissions. ●View full account storage details. I have the power.

See you back here at 11:00am. Break time! image courtesy of

●Components ○ DuraStore - storage systems, the interface to all storage providers ○ DurAdmin - web-based user interface ○ DuraBoss - manages storage reporting ○ The Mill - handles auditing, manifest generation, bit integrity, and duplication ○ Standalone tools - synchronization, retrieval ●REST API The architecture.

A diagram.

DuraCloud instances.

The mill.

●Sync Tool o Tool for transferring local content to DuraCloud o Option of UI-based or command-line interface o Built-in transfer-rate optimization utility o Lots of options for selecting files and directories and specifying how you want the files stored ●Retrieval Tool o Command line tool for retrieving content that is stored in DuraCloud space o Can retrieve a content listing, all content in a space, or a specific set of desired content Standalone Tools

1.Log in to 2.Click on your trial space. 3.Click the “Upload” button. 4.Click “Want to make uploads automatic?”Want to make uploads automatic? 5.Download the synchronization utility (or as we call it “the sync tool”). Let’s upload more content.

The wizard.

●Chronopolis/DPN ●DSpace/DSpaceDirect ●Archivematica/ArchivesDirect ●Archive-It ●Upcoming: Fedora, Figshare So many integrations.

●Chronopolis = digital preservation storage network spanning multiple institutions and geographic regions ○ Comprised of UC San Diego, NCAR, and Univ of Maryland ○ Dark archive ○ Focused on long-term digital content preservation ●Extends the DuraCloud network to include a non-commercial, highly distributed, dark archive option DuraCloud + Chronopolis

●DuraCloud provides a pathway to Chronopolis ●Chronopolis provides a pathway to DPN ●DPN = Digital Preservation Network, a nascent national preservation system ●DuraCloud Vault: ○ DuraCloud + Chronopolis = DPN node ○ ●TPN (Texas Preservation Node) - another DPN node, also using DuraCloud DPN

The best of both worlds.

●DSpace = institutional repository ●DSpace generates AIPs ●DSpace Curation Task System configured to transfer new content to DuraCloud ●Can also perform complete batch backup of all stored data to DSpace ●Also used to manage restores DuraCloud + DSpace

●Hosted and managed DSpace repository ●Simple to start-up ●Wide selection of available features ●Free version upgrades ●Migration support ●Automatic backup to DuraCloud included at no additional cost ●

DSpace Curation Task Setup

●Archivematica = workflow tool for generating preservation ready AIP packages o Customizable OAIS-based workflow processing o Permanent ID generation o Checksumming o Virus checking o File format identification and validation o Metadata extraction and generation o Normalization ●DuraCloud as preservation storage DuraCloud + Archivematica

●Hosted Archivematica with pre-configured DuraCloud backup ●Launched in 2015! ●

●Archive-It = web archiving service from the Internet Archive ●Content automatically copied from Archive-It to DuraCloud as a secondary backup ●Provides Archive-It users with direct download access to their entire collection of WARC (web archive) files DuraCloud + Archive-It

●DuraCloud Vault release ●Figshare integration ●Fedora 4 integration On the horizon

DPN File System Data Amazon S3 Amazon Glacier Rackspace Cloud Files SDSC Cloud Chronopolis

Time to get our hands dirty! The REST API

●All urls will begin with: (https) ●Use the correct request type (GET, PUT, POST, HEAD, DELETE) Lab tips

●Contact ○ Bill Branan ○ Carissa Smith ●User Group ○ ●YouTube ○ Thank you!