DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.

Slides:



Advertisements
Similar presentations
DuraSpace, Fedora and DuraCloud Triangle Research Libraries Network September, 2009.
Advertisements

DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Introducing Progress Arcade Roy Ellis
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Building community clouds to support access to scholarship Michele Kimpton CEO, DuraSpace Jonathan Markow CSO, DuraSpace.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Unified Logs and Reporting for Hybrid Centralized Management
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Manasa Guduru Sai Prasanth Sridhar Malini srinivasan Sinduja Narasimhan Reference: Aymerich, F. M., Fenu, G., & Surcis, S. (2008). An approach to a cloud.
Preservation In The Cloud Markus Wust NCSU Libraries.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
Plan Introduction What is Cloud Computing?
New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.
Cloud Computing Source:
Introduction to Cloud Computing
“ Does Cloud Computing Offer a Viable Option for the Control of Statistical Data: How Safe Are Clouds” Federal Committee for Statistical Methodology (FCSM)
Connect. Transact. Profit. Lessons Learned: 5 Reasons Cloud is CFO Friendly.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Opensource for Cloud Deployments – Risk – Reward – Reality
DuraSpace, Fedora and DuraCloud Thorny Staples Director, Community Strategy and Alliances ESIP Meeting, July 8, 2009.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
CLOUD COMPUTING. What is cloud computing ? History Virtualization Cloud Computing hardware Cloud Computing services Cloud Architecture Advantages & Disadvantages.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
An Introduction to DuraCloud Michele Kimpton, Project Director Carissa Smith, Partner Specialist DuraSpace Webinar  Sept 2011.
DuraCloud pilot program Michele Kimpton, CEO DuraSpace Richard Rodger, Dept Head Software development, M.I.T. Libraries Claire Stewart Dept Head Digital.
The Global Video Grid: DigitalWell Update & Plan For SRB Integration Myke Smith, Manager Streaming Media Technologies University of Washington / ResearchChannel.
Content in the Cloud Scalability NOVEMBER 9, :00 – 10:30 AM Conference B: Infrastructure for the CLOUD Scalability Daniel Kenyon Vice President Equilibrium.
1 © 2009 Cisco Systems, Inc. All rights reserved.Cisco Confidential Cloud Computing – The Value Proposition Wayne Clark Architect, Intelligent Network.
Cloud Computing John Engates CTO, Rackspace Presented: Rackspace Customer Conference, 2008 October 29, 2008.
DuraCloud Enabling services for managing data in the cloud Michele Kimpton, CBO DuraSpace Bill Branan, Senior Developer DuraSpace.
2009 Federal IT Summit Cloud Computing Breakout October 28, 2009.
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
Built on Azure, Moodle Helps Educators Create Proprietary Private Web Sites Filled with Dynamic Courses that Extend Learning Anytime, Anywhere MICROSOFT.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Discover the Newest Solution from Expertime: Magento + PimCore Running on Microsoft Azure MICROSOFT AZURE ISV PROFILE: EXPERTIME Expertime works with clients.
HUSKY CONSULTANTS FRANKLIN VALENCIA WIOLETA MILCZAREK ANTHONY GAGLIARDI JR. BRIAN CONNERY.
DuraCloud Open technologies and services for managing durable data in the cloud Michele Kimpton, CBO DuraSpace.
Nov 22/26 Tech Forum 2015 Roberto Trinconi Cloud the New Path to the Business Leadership.
DuraCloud for Dummies Should I stay or should I go [to the cloud]? ~Carissa Smith, DuraSpace.
Archiving and Preservation Michele Kimpton CEO, DuraSpace Bryan Beecher Director, ICPSR DuraSpace Webinar November 2, 2011.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Web Technologies Lecture 13 Introduction to cloud computing.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Store, Manage, and Archive Content in the Cloud Michele Kimpton, DuraSpace CEO DuraSpace Nate Klingenstein, Internet 2 Internet 2 meeting, April 2013.
Built on the Powerful Microsoft Azure Platform, Forensic Advantage Helps Public Safety and National Security Agencies Collect, Analyze, Report, and Distribute.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
MICROSOFT AZURE APP BUILDER PROFILE: RAVERUS LTD. Raverus is a customer-driven company engaged in providing software applications designed to improve and.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
Preservation support and data management services built on cloud infrastructure Features:  Ingest once, and make multiple copies to multiple storage.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
? What is Institutional Repository for Rutgers University
Recommendation 6: Using ‘cloud computing’ to meet the societal need ‘Faster and transparent access to public sector services’ Cloud computing Faster and.
Joseph JaJa, Mike Smorul, and Sangchul Song
Chapter 18 MobileApp Design
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Veeam Backup Repository
Introduction to Cloud Computing
AWS. Introduction AWS launched in 2006 from the internal infrastructure that Amazon.com built to handle its online retail operations. AWS was one of the.
Implementing an Institutional Repository: Part II
Media365 Portal by Ctrl365 is Powered by Azure and Enables Easy and Seamless Dissemination of Video for Enhanced B2C and B2B Communication MICROSOFT AZURE.
Michele Kimpton Project Director, DuraCloud NDIPP Partner meeting
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Archiving and preservation services in the cloud
Presentation transcript:

DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace

Open Source Portfolio DuraCloud

Goals of DuraSpace Stewardship: – Support and align open source development communities for DSpace and Fedora Innovation: – Think beyond existing platforms – New strategies for enabling access and preservation of digital content Sustainability: – Develop business model to sustain the non- profit and open technologies we support

Emergence of Infrastructure Source: Understanding Infrastructure: Lessons for New Scientific Infrastructure, Systems Integrate components Central control Dedicated/specialized gateways More closed More preconceived Integrate systems Distributed control Generic gateways More open More reconfigurable Networks

Vision: Federated Repositories and Cyberinfrastructure DuraCloud Heaven

What About the Cloud? A style of computing where massively scalable IT-related capabilities are provided “as a service” using Internet technologies to multiple external customers. (Gartner, 6/08).

Cloud Services Elastic web-based infrastructure for storage and compute

What have we learned from our users? Focus Groups Site Visits Forums Over 750 organizations using DSpace or Fedora worldwide

Challenge Tools and processes unproven Limited IT support Resources unavailable Task can be overwhelming (replication, migration, emulation, etc.) Digital preservation is essential but difficult to implement

Challenge Systems not interoperable Heterogeneous applications/platforms Lack of commons standards Non-elastic compute capability Barriers to making digital content more accessible and useful to researchers

Advantages – Cloud Services Flexibility Scalability Pay for use Easy to implement Cost

Economies of Scale and Cost Public cloud providers drive cost down through scale, location and virtualization technology Large Datacenters (tens of thousands of computers) Medium Datacenters (thousands) Source: Hamilton, Internet-Scale Service Efficiency,, LADIS Workshop (Sept 08) Technology* Cost Medium Datacenter Cost Large Datacenter Network$95 per Mbit/sec/mo$13 per Mbit/sec/mo Storage$2.20 per Gbyte/mo$.40 per Gbyte/mo Admin140 servers/admin>1000 servers/admin

Issues Stability Transparency Data lock in SLA’s Trust

DuraCloud Trusted management of and access to durable digital assets in the cloud DuraSpace Mediating Service Sun EMC Amazo n Microsoft

DuraCloud - basics Replicate to multiple storage providers Replicate to multiple geographic areas Monitor and audit digital assets Compute services in cloud next to content Hosted by DuraSpace not-for-profit org Partnerships with cloud providers “Pay for use” for services and storage Available to run internally- open source Chinese Menu of Service Options

Additional services Other DuraSpace-provided services on top of content stored in the cloud –Search –Aggregation –Streaming –Migration –Hosting repositories

Enable others to build and deploy services and apps in DuraCloud environment

Use Cases: DuraCloud with Cloud Storage Online backup for text, images, datasets, video, audio Enable preservation via multiple copies, geographies, administrations Elastic provisioning of temporary or permanent storage for projects or jobs

Streaming service for video Hosting JPEG2000 image engine Indexing and other processing heavy jobs Repositories in cloud Data and text mining over open data Aggregation and web 2.0 tools on open content and collections Use Cases: DuraCloud with Cloud Compute

DuraCloud Underlying software Open core Core components available for others to build on and run Open source - apache license Architecture to create cloud networks Public clouds Private clouds University consortia Also useful in research partnerships

Critical success factors Ease of use - simplicity Trusted partner within community Cost effective Elastic, scalable, flexible Establish key partnerships with cloud preferred cloud service providers Build community of developers and users

Partners and Pilots Selected initial cloud providers Selected 2 initial pilot partners

Pilot use cases Ingest large quantity of material Replicate to multiple cloud platforms Manage replication and monitoring Run services

Timeline Initial open source release– summer 2009 Begin pilots – September 2009 Pilot data loading and testing – Fall 2009 Plug-ins for repository platforms – Q Beta for repository community - Q Pilot testing with compute services Q Report pilot results – Q Launch production service Q2 2010

For more information: DuraSpace Organization: