Cloud Task Replica Repository Preservation Tools Open Repositories 2009 - Atlanta Richard Rodgers MIT Libraries.

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

Whos the Architect? Credential Provisioning Network Access Directory Services Authentication, Authorization and Accounting Federation Single.
Breakout Group 3: Distribution, Replication and Interfacing with Specialized Repositories.
Repositories, Learned Societies and Research Funders Stephen Pinfield University of Nottingham.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
KeepIt Kultur, eCrystals, EdShare (and NECTAR) – Preserve It! David Tarrant School of Electronics.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
An Overview From a Technical Perspective Sebastien Korner Representing the DPN Technical Team PASIG May 22, 2013.
Fedora Service Framework Simple Queue Services For fulfillment of the Mellon Grant June 29, 2009.
Copying Archives Project Group Members: Mushashu Lumpa Ngoni Munyaradzi.
Manajemen Basis Data Pertemuan 12 Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
Mairéad Martin, Penn State University Commons Solutions Group Storage Workshop May 2010.
The Digital Preservation Network at UT Austin Chris Jordan Texas Advanced Computing Center.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
A Very Brief Introduction to iRODS
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Richard MARCIANO Chien-Yi HOU School of Information and Library Science (SILS) Sustainable Archives & Leveraging Technologies Group (SALT) University of.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
1 Archival Storage for Digital Libraries Arturo Crespo Hector Garcia-Molina Stanford University.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
1 Awareness Services for Digital Libraries Arturo Crespo Hector Garcia-Molina Stanford University.
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
Richard MARCIANO Chien-Yi HOU School of Information and Library Science (SILS) Sustainable Archives & Leveraging Technologies Group (SALT) University of.
Preservation In The Cloud Markus Wust NCSU Libraries.
Towards smart storage for repository preservation services Steve Hitchcock, David Tarrant, Adrian Brown 1, Ben O’Steen 2, Neil Jefferies 2 and Leslie Carr.
Opensource for Cloud Deployments – Risk – Reward – Reality
SOA, BPM, BPEL, jBPM.
DCC Conference, Glasgow November, Digital Archive Policies and Trusted Digital Repositories MacKenzie Smith, MIT Libraries Reagan Moore, San Diego.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
Working Group Practical Policy based on slides and latest documents from the PP WG chaired by Reagan Moore, Rainer Stotzka presented by Johannes Reetz.
Rule-Based Preservation Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar Richard Marciano {moore, schroede, mwan, sekar,
ICOM TC Charter TC’s Scope –Specify the normative standards for collaboration objects, along with their attributes, relationships, constraints, and behavior,
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
©MIT LKTR Workshop, Digital Archive Policies and Trusted Digital Repositories MacKenzie Smith, MIT Libraries Reagan Moore, San Diego Supercomputer.
Replicate Research Data Safely eudat.eu/b2safe B2SAFE How to replicate your data using EUDAT’s B2SAFE Version 3 November 2015 This work is.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
SharePoint Fest 2013 Chicago What’s New and Exciting (and not so great) in SharePoint Designer 2013 Workflows Ira Fuchs – SharePoint Technical Specialist,
ETERE A Cloud Archive System. Cloud Goals Create a distributed repository of AV content Allows distributed users to access.
Autonomic aspects in cloud data management Alexandra Carpen-Amarie KerData.
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
Agenda’s for Preservation Research Micah Altman MIT Libraries Prepared for SAA Research Forum Atlanta August 2016.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Enhancements to Galaxy for delivering on NIH Commons
Data sharing and exchange: Experiences within the
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Joseph JaJa, Mike Smorul, and Sangchul Song
China Communications Standards Association ZTE Corporation, P.R. China
THE STEPS TO MANAGE THE GRID
KeepIt Kultur, eCrystals, EdShare (and NECTAR) – Preserve It!
Real IBM C exam questions and answers
Storage & Digital Asset Management CIO Council Update
Odum Institute iRODS Policies to Support Preservation
Service Oriented Architecture (SOA)
Institutional Repositories
Robin Dale RLG OAIS Functionality Robin Dale RLG
RDA uptake activities and plans: ESGF
ICOM TC Charter TC’s Scope Out of TC’s Scope Call for Participation
Presentation transcript:

Cloud Task Replica Repository Preservation Tools Open Repositories Atlanta Richard Rodgers MIT Libraries

cloud computing dynamic capacity: elastic high availability > storage: compute, database, more new programming model WOA - service bus in the sky lightweight protocols

problem space: replication replication != backup time decay of trust - needs maintenance coordination costs $$$$ who’s watching the detectives ? impermeable system boundaries sizing forecast uncertainty

reliable messaging enables asynchronous handling queue = list of messages coordination of work, non-persistent access controlled, encryptable cheap: $0.01 per 10k messages Amazon SQS + S3

roles decompose work into distinct replaceable agents archive = content home replicator = manages copies auditor = implements and enforces policy role != institution

process model a message queue for each role message post triggers activity asynchronously bucket brigade - message is a handoff or acknowledgment storage is abstracted (cloud in prototype)

workflow: replication archivereplicatorauditor S3

workflow: removal archivereplicatorauditor S3

workflow: audit archivereplicatorauditor S3

message semantics web-standard URI addressing entities: packages, ORE maps content model agnostic entity checksums for integrity standard identifiers for actors

self-managed deployment mit

peered deployment mitgatech

service provider deployment mitduracloudmit

todo plumbing only - replication requires more all policy definition and agreements OOB address business model content packaging/description expand skeletal prototype stress at scale

opportunities federated & large scale problems distributed registries metadata harvesting subject overlays preservation workflows, micro-services

thanks

extra credit