Computer Science Department University of California, Santa Barbara

Slides:



Advertisements
Similar presentations
Building Clouds using Commodity, Open- Source Software Components Rich Wolski Chris Grzegorczyk, Dan Nurmi, Graziano Obertelli, Woody Rollins, Sunil Soman,
Advertisements

Understanding and Managing WebSphere V5
Additional SugarCRM details for complete, functional, and portable deployment.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Introduction to Cloud Computing
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
An Introduction to Progress Arcade ™ June 12, 2013 Rob Straight Senior Manager, OpenEdge Product Management.
Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Predicting Queue Waiting Time in Batch Controlled Systems Rich Wolski, Dan Nurmi, John Brevik, Graziano Obertelli Computer Science Department University.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Authors: Ronnie Julio Cole David
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
EUCALYPTUS: An Elastic Utility Computing Architecture for Linking Your Programs to Useful Systems Rich Wolski Chris Grzegorczyk, Dan Nurmi, Graziano Obertelli,
Trusted Virtual Machine Images the HEPiX Point of View Tony Cass October 21 st 2011.
EUCALYPTUS: An Open Source Infrastructure for Elastic Computing Research Rich Wolski Chris Grzegorczyk, Dan Nurmi, Graziano Obertelli, Shriram Rajagopalan,
SEMINAR ON.  OVERVIEW -  What is Cloud Computing???  Amazon Elastic Cloud Computing (Amazon EC2)  Amazon EC2 Core Concept  How to use Amazon EC2.
Workspace Management Services Kate Keahey Argonne National Laboratory.
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Unit 3 Virtualization.
New Paradigms: Clouds, Virtualization and Co.
Introduction to Cloud Technology
Grid and Cloud Computing
VMware ESX and ESXi Module 3.
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
Understanding The Cloud
Use of HLT farm and Clouds in ALICE
Computing Clusters, Grids and Clouds Globus data service
BEST CLOUD COMPUTING PLATFORM Skype : mukesh.k.bansal.
Current Generation Hypervisor Type 1 Type 2.
Cloud Security– an overview Keke Chen
Containers: The new network endpoint
IGE Globus Appliances Dr. Ioan Lucian Muntean, Dr. Adrian Colesa
GWE Core Grid Wizard Enterprise (
Quattor in Amazon Cloud
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Tools and Services Workshop Overview of Atmosphere
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
LQCD Computing Operations
Oracle Solaris Zones Study Purpose Only
AWS COURSE DEMO BY PROFESSIONAL-GURU. Amazon History Ladder & Offering.
Introduction to Cloud Computing
Cloud Computing.
Management of Virtual Execution Environments 3 June 2008
Cloud Computing B. Ramamurthy 9/19/2018 B. Ramamurthy.
Cloud Computing Dr. Sharad Saxena.
Dev Test on Windows Azure Solution in a Box
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Cloud Management Mechanisms
Cloud Computing Cloud computing refers to “a model of computing that provides access to a shared pool of computing resources (computers, storage, applications,
It’s a Mixed Up World David J. Wippich Chief Executive Officer Ensim Corp. Deploying Unified Communications and Collaboration in Mixed Environments.
Virtualization, Cloud Computing, and TeraGrid
Module 01 ETICS Overview ETICS Online Tutorials
Lecture 16B: Instructions on how to use Hadoop on Amazon Web Services
Cloud Computing: Concepts
Fundamental Concepts and Models
Using and Building Infrastructure Clouds for Science
Presentation transcript:

Computer Science Department University of California, Santa Barbara EUCALYPTUS: An Open Source Infrastructure for Elastic Computing Research Rich Wolski Chris Grzegorczyk, Dan Nurmi, Graziano Obertelli, Shriram Rajagopalan, Sunil Soman, Lamia Youseff, Dmitrii Zagorodnov The MAYHEM Lab Computer Science Department University of California, Santa Barbara

ElastaCloudility Computing What is the problem? -- there is a great deal of interest in cloud computing and no infrastructure for studying it Why should you care? -- because investing in an untried technology without prior study can be very expensive

The Basic Model: Computing as a Service SLAs Web Services Defining terms: What is cloud computing? -- At a high level, it is that the user gets compute service rather than direct access to computers Virtualization

Elasticity, Cloudiness, and Utility Elastic Computing, Cloud Computing, and Utility Computing are, at some level, synonyms SLA-driven interface Fee-for use and pay-as-you-go Allocations can vary dynamically Typical scenario User registers a credit card and gets credentials Provider hosts Linux “images” and/or application services User instantiates one or more images and pays an occupancy fee for each Local system administration not provided by hosting service SLAs and tools differentiate between systems Coordinated allocations Leases Installed software packages Management tools Definition continued: the trade press tries to claim the term -- need to be clear on what “we” mean when we say cloud computing for the purpose of this presentation

Commercial Offerings Amazon EC2 3Tera IBM Blue Cloud Linux image hosting 3 flavors of SLA (small, medium, large) S3 storage facility 3Tera Offers a variety hosting services and SLAs appLogic => visual configuration tool for SaaS IBM Blue Cloud Based on Tivoli “blade management” utility computing platform Basis for current Google/IBM collaboration Sun Microsystem’s network.com Grid-based solution with many pre-packaged applications About 10x as expensive as EC2 What are the current approaches? -- we didn’t invent this idea. What technologies are out there?

Open Source Clouds? Nimbus (Freeman and Keahey, University of Chicago) Client-side cloud-computing interface to Globus-enabled TeraPort cluster at U of C Based on GT4 and the Globus Virtual Workspace Service Lots of cool features Great if local resources are GT4 proficient Tutorial here in 4:00 PM session Enomalism Start-up company planning to distribute open source REST APIs User “dashboard” Downloads appear to be disabled What is the problem with the alternative approaches? -- they are all either proprietary, very complex, or non-exitant

Linux image hosting ala Amazon and IBM/Google Elastic Utility Computing Architecture Linking Your Programs To Useful Systems Web services based implementation of elastic/utility/cloud computing infrastructure Linux image hosting ala Amazon and IBM/Google Interface compatible with EC2 Works with command-line tools from Amazon w/o modification Eucalyptus 1.0 == EC2@Home Functions as a software overlay “One-button” install Version. 1.0 comes as an RPM and Rocks “roll” for easy cluster deployment “System Administrators are people too.” Overview of the solution -- summary of how it meets the requirements

Goals for Eucalyptus Foster research in elastic/cloud/utility computing models of service provisioning, scheduling, SLA formulation, hypervisor portability and feature enhancement, etc. Experimentation vehicle prior to buying commercial services “Tech Preview” using local machines with local system administration support Provide a debugging and development platform for EC2 (and other clouds) Allow the environment to be set up and tested before it is instantiated in a for-fee environment Provide a basic software development platform for the open source community E.g. the “Linux Experience” In solving this problem, what do we hope to achieve? -- what good things will happen if we succeed?

Challenges Extensibility Client-side interface Networking Security Simple architecture and open internal APIs Client-side interface Amazon’s EC2 interface and functionality (familiar and testable) Networking Virtual private network per cloud Must function as an overlay => cannot supplant local networking Security Must be compatible with local security policies Packaging, installation, maintenance system administration staff is an important constituency for uptake What challenges do we need to meet to solve the problem? -- enumerate them

Eucalyptus Architecture: WS-Cloud Amazon EC2 Interface Client-side API Translator Cloud Controller Cluster Controller Node Controller Challenge 1: extensibility -- solved by using a simple, straight forward architecture and open APIs that use web services

EC2 Compatibility Version 1.0 Interface is based on Amazon’s published WSDL 2008 compliant except for static IP address assignment Security groups “Availability” zones correspond to individual clusters Uses the EC2 command-line tools downloaded from Amazon REST interface S3 support/emulation: not yet, but on its way Images accessed by file system name instead of S3 handle for the moment Unless user wants to use the actual S3 and pay for the egress charges System administration is different Eucalyptus defines its own Cloud Admin. tool set for user accounting and cloud management Challenge 2: client interface -- solved by implementing Amazon’s EC2 interface -- familiar to users and already commercially successful

Networking Eucalyptus does not assume that all worker nodes will have publically routable IP addresses Each cloud allocation will have one or more public IP addresses All cloud images have access to a private network interface Two types of networks internal to a cloud allocation Virtual private network Uses VDE interfaced to Xen and VLANs set up dynamically Substantial performance hit within a cluster Allows a cloud allocation to span clusters High-performance private network (availability zone) Bypasses VDE and uses local cluster network for each allocation Runs at “native” network speed (I.e. with Xen) Cloud allocations cannot span clusters Availability zone approach fits with Amazon’s high-level semantics Challenge 3: networking -- solved using VDE but there is a performance problem -- performance problem can be addressed by leveraging Amazon’s mechanism => they have the same problem

Network Performance Comparison

Security All Eucalyptus components use WS-security for authentication Encryption of inter-component communication is not enabled by default Configuration option Ssh key generation and installation ala EC2 is implemented Cloud controller generates the public/private key pairs and installs them User sign-up is web based User specifies a password and submits sign-up request Cert is generated but withheld until admin. approves request User gains access to cert. through password-protected web page Similar to EC2 model without the credit cards Challenge 4: security -- solved using industry-standard web technologies

Packaging, Installation, and Deployment Rocks “Roll” per cluster One-button install Requires Rock V (the most curret release) for Xen support Multiple clusters requires a configuration file edit at Version 1.0 Multi-cluster configuration tools ala Rocks not readily available Requires Xen version 3.1 to be installed and functioning Does not require modification to dom0 Does require Xen-bridge (not an IP tables approach yet) All needed packages are bundled in the roll Rev. 1.0 is not smart enough to determine if local versions of the dependencies will work or not Full version (minus images) is 55 MB Challenge 5: installation and maintenance -- solved through the use of emerging tools that make standard software easier to install and maintain

Eucalyptus: The Movie The Movie

Software Technologies Axis2 and Axis2c version 1.4.0 Hibernate 3.2.2 HSQLDB 1.8.0 jetty 6.1.9 JiBX (March 30th sourceforge) Mule 2.0.1 Rampart version 1.3 libvirt version 0.4.2 socat-1.6.0 VDE version 2.2.0-pre2 How we did it

Release Status Eucalyptus version 1.0 will be available for public release 5/28/08 http://eucalyptus.cs.ucsb.edu EC2 interface Simple load-balancing cloud controller Simple web-based user accounting and system administration toolset In testing now Minor releases for version 1.0 will fix bugs and add limited IP tables support Static IPs and security groups Version 2.0 (planned) “smart” private networking S3 emulation/support SQS and SimpleDB What is the state of the system now?

Future Test deployment at SDSC using On-demand Cluster SLA research Shake down by June 1st Friendly user community after that SLA research We built Eucaltyptus so that we could study how SLAs could be automatically formulated Leverage TeraGrid QBETS technology to determine whether an SLA can be signed or not VGrADS at SC08 Virtual Grid Application Development Software project (NSF Large ITR) Single unifying programming abstraction for large-scale workflows Planning to demo TeraGrid, EC2, and multiple Eucalyptus Clouds using Linked Environments for Atmospheric Discovery (LEAD) workflows Development underway now What is there left to do?

Thanks and More Information National Science Foundation SDSC RightScale.com The Eucalyptus Development Team at UCSB is Chris Grzegorczyk -- grze@cs.ucsb.edu Dan Nurmi -- nurmi@cs.ucsb.edu Graziano Obertelli -- graziano@cs.ucsb.edu Shriram Rajagopalan -- shriram@cs.ucsb.edu Sunil Soman -- sunils@cs.ucsb.edu Lamia Youseff -- lyouseff@cs.ucsb.edu Dmitrii Zagordnov -- dmitrii@cs.ucsb.edu rich@cs.ucsb.edu http://eucalyptus.cs.ucsb.edu