DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.

Presentation transcript:

DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace

Open Source Portfolio DuraCloud

Goals of DuraSpace
Stewardship:
– Support and align open source development communities for DSpace and Fedora
Innovation:
– Think beyond existing platforms
– New strategies for enabling access and preservation of digital content
Sustainability:
– Develop business model to sustain the non-profit and open technologies we support

DSpace and Fedora Installations
Largest share of open repositories worldwide … over 700 institutions tracked in our registries
- Universities
- Research Centers
- Libraries
- Archives
- Cultural Heritage
- Government
- More…

Challenges (from our communities)
- Digital preservation and archiving are hard to achieve, even just basic replication
- Making digital content more accessible and usable to researchers
- Easy and elastic provisioning of shared infrastructure (also across institutions!)
- Robust compute environments for data mining and analysis of large datasets

Implications for our future work
- more distributed
- more collaborative
- more web-oriented
- more open
- more interoperable

What About the Cloud? A style of computing where massively scalable IT-related capabilities are provided “as a service” using Internet technologies to multiple external customers. (Gartner, 6/08).

Cloud services

Public Cloud Services Elastic web-based infrastructure for storage and compute

Economies of Scale and Cost
Public cloud providers drive cost down through scale, location and virtualization technology
Large datacenters: tens of thousands of computers. Medium datacenters: thousands.

Technology   Cost in Medium Datacenter   Cost in Large Datacenter
Network      $95 per Mbit/sec/mo         $13 per Mbit/sec/mo
Storage      $2.20 per Gbyte/mo          $.40 per Gbyte/mo
Admin        140 servers/admin           >1000 servers/admin

Source: Hamilton, Internet-Scale Service Efficiency, LADIS Workshop (Sept 08)
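The scale advantage in the table above can be worked out as simple ratios. The following is a minimal sketch that uses only the figures quoted on the slide; the variable and function names are illustrative, and the admin figure for large datacenters is a lower bound (">1000"), so that ratio is a lower bound as well.

```python
# Figures copied from the table above (Hamilton, LADIS Workshop, Sept 08).
# Each entry is (medium datacenter, large datacenter).
costs = {
    "network_usd_per_mbit_sec_month": (95.0, 13.0),
    "storage_usd_per_gbyte_month": (2.20, 0.40),
}
# The slide gives ">1000" for large datacenters, so 1000 is a lower bound.
servers_per_admin = (140, 1000)


def scale_advantage(medium: float, large: float) -> float:
    """Return how many times cheaper the large datacenter is."""
    return medium / large


ratios = {
    name: round(scale_advantage(medium, large), 1)
    for name, (medium, large) in costs.items()
}
# Admin efficiency is servers per person, so the ratio is large / medium.
ratios["admin_servers_per_person"] = round(
    servers_per_admin[1] / servers_per_admin[0], 1
)

for name, ratio in ratios.items():
    print(f"{name}: about {ratio}x in favor of the large datacenter")
```

On these numbers the large datacenter comes out roughly 5 to 7 times more efficient on each line of the table.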

Study of 605 government IT Yet, only 13% utilizing cloud compute today

Barriers

Here to stay

DuraCloud Proposition: Trust and durability in the cloud
DuraCloud is a platform aimed at supporting libraries, universities, and other cultural heritage organizations that wish to provide perpetual access to their digital content. The service replicates and distributes content across multiple cloud providers and enables the deployment of services to support:
* access
* preservation
* re-use

DuraCloud: a web-based service enabling management of data in the cloud
[Diagram: DuraCloud as a mediating web service in front of Sun, EMC, Rackspace, Microsoft]

Vision: Preservation Support DuraCloud: content replication, auditing, and repair
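The replicate, audit and repair cycle named on this slide can be illustrated with checksums. The sketch below is not the DuraCloud API: the stores are plain in-memory dictionaries standing in for cloud providers, and every name (`replicate`, `audit`, `repair`, `provider_a`, `image-001`) is hypothetical. It assumes SHA-256 as the fixity checksum.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Fixity value for a piece of content (SHA-256 is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def replicate(content_id: str, data: bytes, stores: dict) -> str:
    """Write the same content to every store and return its checksum."""
    for store in stores.values():
        store[content_id] = data
    return checksum(data)


def audit(content_id: str, expected: str, stores: dict) -> list:
    """Return the names of stores whose copy is missing or corrupted."""
    return [
        name
        for name, store in stores.items()
        if content_id not in store or checksum(store[content_id]) != expected
    ]


def repair(content_id: str, expected: str, stores: dict) -> list:
    """Overwrite bad copies with a verified good copy; return repaired stores."""
    bad = audit(content_id, expected, stores)
    good = next(
        (store[content_id] for name, store in stores.items() if name not in bad),
        None,
    )
    if good is None:
        raise RuntimeError(f"no intact copy of {content_id} remains")
    for name in bad:
        stores[name][content_id] = good
    return bad


# Hypothetical stores standing in for three cloud providers.
stores = {"provider_a": {}, "provider_b": {}, "provider_c": {}}
original = b"digitized page scan"
digest = replicate("image-001", original, stores)

stores["provider_b"]["image-001"] = b"bit rot"    # simulate corruption
damaged = audit("image-001", digest, stores)      # finds provider_b
repaired = repair("image-001", digest, stores)    # restores from a good copy
print(damaged, repaired, audit("image-001", digest, stores))
```

Repair only works while at least one copy still matches the recorded checksum, which is the practical argument for keeping copies with more than one provider.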

Vision: Shared infrastructure DuraCloud: collaboration and data linking of stored objects

Vision: Data Analysis and Mining DuraCloud: running large compute jobs on stored content

DuraCloud Underlying software
Open core
- Core components available for others to build on and run
- Open source, Apache license
Architecture to create cloud networks
- Public clouds
- Private clouds
- University consortia
Also useful in research partnerships

Preservation Services
- ability to replicate content to multiple providers and locations
- ability to synchronize backup with primary store or repository system
- management, monitoring, audit and repair through web-based interface
Hosted by DuraSpace, a not-for-profit org
Partnerships with cloud providers
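Synchronizing a backup with a primary store, as listed among the preservation services above, comes down to comparing two inventories. The sketch below assumes each side can report a mapping of content ID to checksum; `plan_sync` and the sample IDs are hypothetical and do not reflect how DuraCloud implements synchronization.

```python
def plan_sync(primary: dict, backup: dict) -> tuple:
    """Compare two {content_id: checksum} inventories.

    Returns (to_copy, to_delete): IDs that are new or changed in the primary,
    and IDs that exist only in the backup.
    """
    to_copy = sorted(
        cid for cid, digest in primary.items() if backup.get(cid) != digest
    )
    to_delete = sorted(cid for cid in backup if cid not in primary)
    return to_copy, to_delete


# Hypothetical inventories: repository (primary) versus cloud backup.
primary = {"img-1": "aaa", "img-2": "bbb2", "img-3": "ccc"}
backup = {"img-1": "aaa", "img-2": "bbb", "img-9": "zzz"}

to_copy, to_delete = plan_sync(primary, backup)
print("copy to backup:", to_copy)
print("only in backup:", to_delete)
```

Whether items that exist only in the backup should actually be deleted is a policy decision; a preservation service may prefer to retain them.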

Software services
Other DuraSpace-provided services on top of content stored in the cloud
– Data mining
– Video streaming
– Format transformation
– Repository hosting
– Discovery

DuraCloud: run your application as a service on content Enable others to build and deploy services and apps in DuraCloud environment

Partners and Pilots
- Selected initial cloud providers
- Selected 2 initial pilot partners

NYPL pilot (Digital Gallery Collection)
- back up copy of 700k images (50 TB data)
- transformation from TIFF to JPEG
- run image server in cloud
- push JPEG 2000 back into Fedora Repository

BHL pilot (Biodiversity Heritage Library)
- back up copy of entire corpus (40 TB data)
- have multiple copies, including in Europe
- do compute-intensive data mining over corpus

Pilot use cases
NYPL
- Replication and preservation support
- Format conversion
- Instant provisioning of image server
- Synchronization with repository
BHL
- Replication and preservation support
- International collaborative infrastructure
- Researcher platform for data mining

Timeline
- Begin pilots (MOUs in place): September 2009
- DuraCloud Alpha Pilot release: Oct 2009
- Pilot data loading and testing: Fall 2009
- Beta for repository community: Q
- Pilot testing with software services: Q
- Cloud partner evaluations complete: Q
- Strategic cloud partnerships in place: Q
- Pricing model determined: Q
- Report pilot results: Q
- Launch production service: Q3 2010

Critical success factors
- Ease of use, simplicity
- Trusted partner for end user
- Cost effective
- Scalable/flexible
- Can establish key partnerships with service providers
- Can build community of developers and users

Thank You
For more information:
DuraSpace Organization:
Wiki: commons.org/confluence/display/duracloudpilot/