Cloud Computing with Nimbus

Slides:



Advertisements
Similar presentations
First Steps in the Clouds
Advertisements

A Scalable Approach to Deploying and Managing Appliances Kate Keahey Rick Bradshaw, Narayan Desai, Tim Freeman Argonne National Lab, University of Chicago.
Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)
Virtual Appliances for Scientific Applications Kate Keahey Argonne National Laboratory University of Chicago.
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Chicago, October 2008 Kate Keahey, Renato Figueiredo, Jose Fortes, Tim.
Nimbus or an Open Source Cloud Platform or the Best Open Source EC2 No Money Can Buy ;-) Kate Keahey Tim Freeman University of Chicago.
Cloud Computing with Nimbus FNAL, January 2009 Kate Keahey University of Chicago Argonne National Laboratory.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
SaaS, PaaS & TaaS By: Raza Usmani
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
What Happens When Cloud Computing Meets HPC Dr. Dan Fraser Director, CDIGS (Community Driven Improvement of Globus Software)
Introduction to Cloud Computing
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Virtual Infrastructure in the Grid Kate Keahey Argonne National Laboratory.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Nimbus & OpenNebula Young Suk Moon. Nimbus - Intro Open source toolkit Provides virtual workspace service (Infrastructure as a Service) A client uses.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, October Kate Keahey University of Chicago Argonne National Laboratory.
Virtual Workspaces Kate Keahey Argonne National Laboratory.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Ian Gable University of Victoria 1 Deploying HEP Applications Using Xen and Globus Virtual Workspaces A. Agarwal, A. Charbonneau, R. Desmarais, R. Enge,
Web Technologies Lecture 13 Introduction to cloud computing.
Purdue RP Highlights TeraGrid Round Table May 20, 2010 Preston Smith Manager - HPC Grid Systems Rosen Center for Advanced Computing Purdue University.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Workspace Management Services Kate Keahey Argonne National Laboratory.
CS 6027 Advanced Networking FINAL PROJECT ​. Cloud Computing KRANTHI ​ CHENNUPATI PRANEETHA VARIGONDA ​ SANGEETHA LAXMAN ​ VARUN ​ DENDUKURI.
Dynamic Accounts: Identity Management for Site Operations Kate Keahey R. Ananthakrishnan, T. Freeman, R. Madduri, F. Siebenlist.
PaaS services for Computing and Storage
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Unit 3 Virtualization.
New Paradigms: Clouds, Virtualization and Co.
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
Use of HLT farm and Clouds in ALICE
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
StratusLab First Periodic Review
IGE Globus Appliances Dr. Ioan Lucian Muntean, Dr. Adrian Colesa
ATLAS Cloud Operations
Management of Virtual Machines in Grids Infrastructures
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Management of Virtual Machines in Grids Infrastructures
Introduction to Cloud Computing
Cloud Computing.
Management of Virtual Execution Environments 3 June 2008
Cloud Computing with Nimbus
OpenNebula Offers an Enterprise-Ready, Fully Open Management Solution for Private and Public Clouds – Try It Easily with an Azure Marketplace Sandbox MICROSOFT.
Cloud Computing Dr. Sharad Saxena.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Cloud Computing and Virtualization with Globus
Sky Computing on FutureGrid and Grid’5000
Brandon Hixon Jonathan Moore
Virtualization, Cloud Computing, and TeraGrid
Cloud Computing: Concepts
Fundamental Concepts and Models
Using and Building Infrastructure Clouds for Science
Day 2, Session 2 Connecting System Center to the Public Cloud
Sky Computing on FutureGrid and Grid’5000
Microsoft Virtual Academy
NSF Middleware Initiative: GridShib
06 | SQL Server and the Cloud
Presentation transcript:

Cloud Computing with Nimbus FNAL, January 2009 Kate Keahey (keahey@mcs.anl.gov) University of Chicago Argonne National Laboratory

Science Clouds Cloud Computing Elastic computing, Pay-as-you-go, Capital expense operational expense Science Clouds 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Everything-as-a-Service SaaS PaaS IaaS 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Quest Begins Code complexity Resource control 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

“Workspaces” Dynamically provisioned environments Environment control Resource control Hardware implementations vs virtualization 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

A Brief History of Nimbus STAR production runs on EC2 Xen released EC2 goes online Nimbus Cloud comes online 2003 2006 2009 Research on agreement-based services First Workspace Service release EC2 gateway available Support for EC2 interfaces 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Nimbus Overview Goal: open source, extensible, IaaS implementation and tools Specifically targeting scientific community A platform for experimentation with features for scientific needs Set up private clouds (privacy, expense considerations) Tools IaaS layer (Workspace Service) Orchestration layer (Context Broker, gateway) http://workspace.globus.org/ 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Workspace Service VWS Service Pool node Pool node Pool node VWS Service Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Workspace Service Trusted Computing Base (TCB) The workspace service publishes information on each workspace as standard WSRF Resource Properties. Pool node Pool node Pool node VWS Service Pool node Pool node Pool node Users can query those properties to find out information about their workspace (e.g. what IP the workspace was bound to) Pool node Pool node Pool node Pool node Pool node Pool node Users can interact directly with their workspaces the same way the would with a physical machine. Trusted Computing Base (TCB) 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Workspace Service Interfaces and Clients Web Services based Web Service Resource Framework (WSRF) GT-based Elastic Computing Cloud (EC2) Supported: ec2-describe-images, ec2-run-instances, ec2-describe-instances, ec2-terminate-instances, ec2-reboot-instances, ec2-add-keypair, ec2-delete-keypair Unsupported: availability zones, security groups, elastic IP assignment, REST Used alongside WSRF interfaces E.g., the University of Chicago cloud allows you to connect via the cloud client or via the EC2 client 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Security GSI authentication and authorization Secure access to VMs PKI credential required Works with Grid proxies VOMS, Shibboleth (via GridShib), custom PDPs Secure access to VMs EC2 key generation or accessed from .ssh Validating images and image data Collaboration with Vienna University of Technology 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Networking Network configuration External: public IPs or private IPs (via VPN) Internal: private network via a local cluster network Each VM can specify multiple NICs mixing private and public networks (WSRF only) E.g., cluster worker nodes on a private network, headnode on both public and private network 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Back Story Workspace back-end: Trusted Computing Base (TCB) Workspace WSRF front-end that allows clients to deploy and manage virtual workspaces VWS Service Pool node Pool node Pool node Workspace back-end: Pool node Pool node Pool node Resource manager for a pool of physical nodes Deploys and manages Workspaces on the nodes Pool node Pool node Pool node Each node must have a VMM (Xen)‏ installed, as well as the workspace control program that manages individual nodes Pool node Pool node Pool node Trusted Computing Base (TCB) 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Workspace Components workspace resource manager workspace workspace WSRF workspace service workspace control EC2 workspace pilot workspace client 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Workspace Control VM image propagation Image management and reconstruction Creating blank partitions, sharing partitions VM control Starting, stopping, pausing, etc. Integrating a VM into the network Assigning MAC addresses and IP addresses DHCP delivery tool Building up a trusted (non-spoofable) networking layer Contextualization information management Talks to the workspace service via ssh Standalone component Some functionality overlap with libvirt Implementations in Xen and KVM (queued up for release) 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Workspace Resource Manager Basic slot fitting Implements “immediate leases” Extensible vehicle to experiment with different leases Open source resource manager for multiple different VMMs Datacenter technology equivalent Can be replaced by OpenNebula or other datacenter technologies Deployment University of Chicago, University of Florida, Purdue, Masaryk University and all the other Science Cloud sites 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The Workspace Pilot Challenge: how can I provide a virtualization solution without disrupting the current operation of my cluster? Flying Low: the Workspace Pilot Integrates with popular LRMs (such as PBS, SGE) Implements “best effort” leases Glidein approach: submits a “pilot” program that claims a resource slot Includes administrator tools Deployment Testing @ U of Victoria (Atlas), Ian Gable and collaborators Adapting for the use of the Atlas experiment @ CERN, Omer Khalid 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Cloud Closure storage service workspace resource manager workspace WSRF workspace service workspace control EC2 workspace pilot cloud client workspace client 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

IaaS Gateway Goals Released as service, not as code Access to different IaaS infrastructures Account management Facilitate movement between academic and commercial clouds and creation of meta-clouds Combine higher-level tools and IaaS Released as service, not as code First online in June 2007, currently in a rewrite Used to move e.g., HEP STAR experiments between Science Clouds and EC2 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

The IaaS Gateway storage service workspace resource manager workspace WSRF workspace service workspace control EC2 workspace pilot IaaS gateway EC2 Enabling migration between commercial and academic resources 100 node STAR run potentially other providers cloud client workspace client 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

One-click Virtual Clusters Parameterizable appliance Tightly-coupled clusters IP1 HK1 IP2 HK2 IP3 HK3 MPI Reciprocal exchange of information: networking and security 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Context Broker Context Broker IP1 HK1 IP1 IP2 HK2 HK1 IP1 IP3 HK1 HK3 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Goals for Context Broker Can work with every appliance Appliance schema, can be implemented in terms of many configuration systems Can work with every cloud provider Simple and minimal conditions on generic context delivery Can work across multiple cloud providers, in a distributed environment 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Status for Context Broker Release history: In alpha testing since August ‘07 First released summer July ‘08 (v 1.3.3) Latest update January ‘09 (v 2.2) Used to contextualize 100s of nodes for EC2 STAR runs Contextualized images on workspace marketplace Working with rPath to make contextualizatin easier for the user 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

End of Nimbus Tour storage service workspace resource manager WSRF workspace service workspace control EC2 workspace pilot context broker IaaS gateway EC2 potentially other providers context client cloud client workspace client 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Science Clouds Make it easy for scientific projects to experiment with cloud computing Can cloud computing be used for science? Evolve software in response to the needs of scientific projects Start with EC2-like functionality and evolve to serve scientific projects: virtual clusters, diverse resource leases Federating clouds: moving between cloud resources in academic and commercial space What applications can use cloud computing and which ones cannot? What are the problems in practice? 2) NOT SOMETHING WE CAN RUN A SIMULATION FOR 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Science Cloud Resources University of Chicago (Nimbus): first cloud, online since March 4th 2008 16 nodes of UC TeraPort cluster, public IPs University of Florida Online since 05/08 16-32 nodes, access via VPN Other Science Clouds Masaryk University, Brno, Czech Republic (08/08), Purdue (09/08) Installations in progress: IU, Grid5K, others Using EC2 for overflow Minimal governance model http://workspace.globus.org/clouds 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Cloud Use ~100 DNs Utilization: Requests rejected: Overall: 16% Peak pw: 86% (week of 7/14) Requests rejected: None till 7/14 Lots afterwards ;-) Data scaled to the nubmer of days 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Who Runs on Nimbus? Project diversity: Science, CS, education, build&test… 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Hadoop over Many Clouds U of Florida U of Chicago ViNE router ViNE router CS research: investigate latency-sensitive apps, e.g. hadoop Need access to distributed resources, and high level of privilege to run a ViNE router Virtual workspace: ViNE router + application VMs Paper: “CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications” by Andréa Matsunaga, Maurício Tsugawa and José FortesFirst, accepted to eScience 2008. 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Alice HEP Experiment at CERN CHEP paper in preparation 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

STAR STAR: a high-energy physics experiment Need resources with the right configuration Complex environments: correct versions of operating systems, libraries, tools, etc all have to be installed. Consistent environments: require validation A virtual OSG STAR cluster OSG cluster OSG CE (headnode), gridmapfiles, host certificates, NSF, PBS STAR worker nodes: SL4 + STAR conf Requirements One-click virtual cluster deployment Migration: Science Clouds -> EC2 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

STAR (cntd) From proof-of-concept to production runs Performance ~2 years ago: proof-of-concept Last September: EC2 runs of up to 100 nodes (production scale, non-critical codes) Testing for critical production deployment Performance Within 10% of expected performance for applications Work by Jerome Lauret, Doug Olson, Leve Hajdu, Lidia Didenko 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Scalability Testing Motivation Workspaces Requirements Test scalability of various Globus components Test on a different platforms Workspaces Globus 101 + others Requirements very short-term but flexible access to diverse platforms Work by various members of the Globus Toolkit (Tom Howe and John Bresnahan) Resulted in provisioning a private cloud for Globus Typically very short-lived communities of one 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Montage Workflows Evaluating a cloud from user’s perspective Paper: “Exploration of the Applicability of Cloud Computing to Large-Scale Scientific Workflows”, C. Hoffa, T. Freeman, G. Mehta, E. Deelman, K. Keahey, SWBES08: Challenging Issues in Workflow Applications 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Cloud Computing Ecosystem Appliance Providers marketplaces commercial providers communities Deployment Orchestrator orchestrate the deployment of environments across possibly many cloud providers VMM/datacenter/IaaS User Environments 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Open Source IaaS Implementations Eucalyptus Open source implementation of EC2 UCSB, R. Wolski & team, 06/2008 OpenNebula Open source datacenter implementation University of Madrid, I. Llorente & team, 03/2008 Cloud-enabled Nimrod-G Monash University, MeSsAGE Lab, 01/2009 Industry efforts openQRM, Enomalism 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

Friends and Family Committers: Kate Keahey & Tim Freeman (ANL/UC), Ian Gable (UVIC) A lot of help from the community, see: http://workspace.globus.org/people.html Collaborations: Cumulus: S3 implementation (Globus team) EBS implementation with IU Appliance management: rPath and Bcfg2 project Virtual network overlays: University of Florida Security: Vienna University of Technology 10/20/08 The Nimbus Toolkit: http//workspace.globus.org

To the Future and Beyond Increasing Importance of Appliance Providers Cloud computing tools Increased interest in cloud interoperability Standards: “rough consensus & working code” Image formats, contextualization capabilities, cloud interfaces, etc. Cloud markets 10/20/08 The Nimbus Toolkit: http//workspace.globus.org