Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, 17-19 October Kate Keahey University of Chicago Argonne National Laboratory.

Slides:



Advertisements
Similar presentations
First Steps in the Clouds
Advertisements

A Scalable Approach to Deploying and Managing Appliances Kate Keahey Rick Bradshaw, Narayan Desai, Tim Freeman Argonne National Lab, University of Chicago.
Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)
On-Demand Virtual Workspaces: Quality of Life in the Grid Kate Keahey Argonne National Laboratory.
Virtual Appliances for Scientific Applications Kate Keahey Argonne National Laboratory University of Chicago.
Virtual Workspaces State of the Art and Current Directions Borja Sotomayor University of Chicago (Dept. of CS) Kate Keahey ANL/UC.
Virtualization: Towards More Flexible and Efficient Grids Kate Keahey Argonne National Laboratory.
Enabling Cost-Effective Resource Leases with Virtual Machines Borja Sotomayor University of Chicago Ian Foster Argonne National Laboratory/
Workspaces for CE Management Kate Keahey Argonne National Laboratory.
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Division of Labor: Tools for Growing and Scaling Grids Tim Freeman, Kate Keahey, Ian Foster, Abhishek Rana, Frank Wuerthwein, Borja Sotomayor.
Virtual Workspaces in the Grid Kate Keahey Argonne National Laboratory Ian Foster, Tim Freeman, Xuehai Zhang, Daniel Galron.
Cloud Computing and Virtualization with Globus Oakland, May 2008 Kate Keahey Tim Freeman University of Chicago.
Globus Virtual Workspaces An Update SC 2007, Reno, NV Kate Keahey Argonne National Laboratory University of Chicago
Nimbus or an Open Source Cloud Platform or the Best Open Source EC2 No Money Can Buy ;-) Kate Keahey Tim Freeman University of Chicago.
Globus Virtual Workspaces HEPiX Fall 2007, St Louis Kate Keahey Argonne National Laboratory University of Chicago
SLA-Oriented Resource Provisioning for Cloud Computing
System Center 2012 R2 Overview
Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
Xen , Linux Vserver , Planet Lab
 Max Planck Institute for Software Systems Towards trusted cloud computing Nuno Santos, Krishna P. Gummadi, and Rodrigo Rodrigues MPI-SWS.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
Cloud Computing and Virtualization Sorav Bansal CloudCamp 2010 IIT Delhi.
European Organization for Nuclear Research Virtualization Review and Discussion Omer Khalid 17 th June 2010.
Virtualization for Cloud Computing
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Cloud Don McGregor Research Associate MOVES Institute
Jaeyoung Yoon Computer Sciences Department University of Wisconsin-Madison Virtual Machines in Condor.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
System Center 2012 Setup The components of system center App Controller Data Protection Manager Operations Manager Orchestrator Service.
Copyright © 2010 Platform Computing Corporation. All Rights Reserved.1 The CERN Cloud Computing Project William Lu, Ph.D. Platform Computing.
Ocean Observatories Initiative Common Execution Environment Kate Keahey OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego,
Virtual Infrastructure in the Grid Kate Keahey Argonne National Laboratory.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Nimbus & OpenNebula Young Suk Moon. Nimbus - Intro Open source toolkit Provides virtual workspace service (Infrastructure as a Service) A client uses.
The Glidein Service Gideon Juve What are glideins? A technique for creating temporary, user- controlled Condor pools using resources from.
Copyright © 2011 EMC Corporation. All Rights Reserved. MODULE – 6 VIRTUALIZED DATA CENTER – DESKTOP AND APPLICATION 1.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
1 The Fast(est) Path to Building a Private/Hybrid Cloud October 25th, 2011 Paul Mourani RightScale.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
ServiceSs, a new programming model for the Cloud Daniele Lezzi, Rosa M. Badia, Jorge Ejarque, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group.
Presented by: Reem Alshahrani. Outlines What is Virtualization Virtual environment components Advantages Security Challenges in virtualized environments.
Virtual Workspaces Kate Keahey Argonne National Laboratory.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Virtualization One computer can do the job of multiple computers, by sharing the resources of a single computer across multiple environments. Turning hardware.
Purdue RP Highlights TeraGrid Round Table May 20, 2010 Preston Smith Manager - HPC Grid Systems Rosen Center for Advanced Computing Purdue University.
Predrag Buncic (CERN/PH-SFT) Software Packaging: Can Virtualization help?
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Workspace Management Services Kate Keahey Argonne National Laboratory.
Open Source Virtualization Andrey Meganov RHCA, RHCX Consultant / VDEL
New Paradigms: Clouds, Virtualization and Co.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
ATLAS Cloud Operations
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Cloud Computing with Nimbus
Managing Clouds with VMM
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Cloud Computing and Virtualization with Globus
Sky Computing on FutureGrid and Grid’5000
Virtualization, Cloud Computing, and TeraGrid
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, October Kate Keahey University of Chicago Argonne National Laboratory

10/18/07, ORION meetingVirtual Workspaces: Why Virtual Workspaces? 1)Configuration: finding environment tailored to my application 2) Leasing: negotiating a resource allocation tailored to my needs ?

10/18/07, ORION meetingVirtual Workspaces: Why Virtual Workspaces: Challenges l Quality of Service u We get: best-effort provisioning (one size fits all) u We need: advance reservations, urgent computing, periodic, best-effort, and others l Quality of Life u Commonly heard: “I have 512 nodes I cannot use” u We need nodes we can use l Separating environment/resource provisioning from job execution is simply a good idea u E.g. workflow based applications

10/18/07, ORION meetingVirtual Workspaces: Elastic Computing l Leases with clearly defined and enforceable service terms u when you need them and how you need them l A variety of lease shapes u Short-term as well as long-term leases u No “one size fits all”: suitable availability l Extending, reducing, renegotiating leases based on need l Workspaces: resources you can use u Configured with your environment

10/18/07, ORION meetingVirtual Workspaces: What are Virtual Workspaces? l A dynamically provisioned environment u Environment definition: we get exactly the (software) environment we need on demand. u Resource allocation: Provision the resources the workspace needs (CPUs, memory, disk, bandwidth, availability), allowing for dynamic renegotiation to reflect changing requirements and conditions. l Implementation u Traditional means: publishing, automated configuration, coarse-grained enforcement u Virtual Machines: encapsulated configuration and fine-grained enforcement Paper: “Virtual Workspaces: Achieving Quality of Service and Quality of Life in the Grid”

10/18/07, ORION meetingVirtual Workspaces: Virtual Machines Hardware Virtual Machine Monitor (VMM) / Hypervisor Guest OS (Linux) Guest OS (NetBSD) Guest OS (Windows) VM App Xen VMWare UML KVM etc. Parallels l Encapsulate the environment l Fast to deploy, enables short-term leasing l Excellent enforcement and performance isolation l Very good isolation l Also: suspend/resume -> migration

10/18/07, ORION meetingVirtual Workspaces: Deploying Workspaces Remotely Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Workspace -Workspace metadata -Pointer to the image -Logistics information -Deployment request -CPU, memory, node count, etc. VWS Service

10/18/07, ORION meetingVirtual Workspaces: Interacting with Workspaces Pool node Trusted Computing Base (TCB) Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node The workspace service publishes information on each workspace as standard WSRF Resource Properties. Users can query those properties to find out information about their workspace (e.g. what IP the workspace was bound to) Users can interact directly with their workspaces the same way the would with a physical machine. VWS Service

10/18/07, ORION meetingVirtual Workspaces: Workspace Service Components Pool node Trusted Computing Base (TCB) Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node VWS Service Workspace WSRF front-end that allows clients to deploy and manage virtual workspaces Resource manager for a pool of physical nodes Deploys and manages Workspaces on the nodes Each node must have a VMM (Xen) ‏ installed, as well as the workspace control program that manages individual nodes along with the workspace backend (software that manages individual nodes) ‏ Contextualization creates a common context for a virtual cluster

10/18/07, ORION meetingVirtual Workspaces: Workspace Service Components l GT4 WSRF front-end u Leverages GT core and services, notifications, security, etc. u Follows the OGF WS-Agreement provisioning model l Publishes available lease terms u Provides lease descriptions l Workspace Resource Manager (back-end) u Currently focused on Xen u Implements multiple deployment modes l Contextualization u Put the virtual appliance in its deployment context l Current release 1.2.3, available at: u

10/18/07, ORION meetingVirtual Workspaces: Workspace Resource Managers l Default resource manager (basic slot fitting) u Commercial datacenter technology would also fit l Amazon Elastic Compute Cloud (EC2) u Selling cycles as Xen VMs u Software similar to Workspace Service l No virtual clusters, contextualization, fine-grain allocations, etc. u Grid credential admission -> EC2 charging model l Workspace Pilot u Integrating VMs into current provisioning models (essentially a PBS glidein) l Long-term solutions u Interleaving soft and hard leases u Providing better articulated leasing models u Developed in the context of existing schedulers

10/18/07, ORION meetingVirtual Workspaces: Providing Resources: The Workspace Pilot l Challenge: find the simplest way to integrate VMs into current provisioning models l Glide-ins (Condor): poor man’s resource leasing u Best-effort semantics: submit a job “pilot” that claims resources but does not run a job l The Workspace Pilot u Resources run dom0 u Pilot adjusts memory u VWS leases “slots” to VMs u Kill-all facility

10/18/07, ORION meetingVirtual Workspaces: Workspace Resource Management (long term solutions) l Challenge: How can we provide semantically rich leases in a cost-effective way?

SHORT-TERM LEASE SHORT-TERM LEASE Scheduling the lease without using virtualization : Scheduling the lease using virtualization:

10/18/07, ORION meetingVirtual Workspaces: Interleaving Soft and Hard Leases Injected leases are short (1h-2h), very frequent (every 4 to 8 hours), large (number of nodes between 1/3 and ½ of the cluster)‏ Not using VMs (even with backfilling) results in a noticeable hit on runtime. In this case, the scheduler cannot readily start large parallel jobs because of the resource leases. With VMs, these can be started, and suspended before the leases start.

10/18/07, ORION meetingVirtual Workspaces: Middleware Development: Contextualization l Challenge: Putting a VM in the deployment context of the Grid, site, and other VMs u Assigning and sharing IP addresses, name resolution, application- level configuration, etc. l Management of Common Context Paper: “A Scalable Approach To Deploying And Managing Appliances”, TeraGrid conference 2007 u Configuration-dependent l provides&requires u Common understanding between the image “vendor” and deployer u Mechanisms for securely delivering the required information to images across different implementations contextualization agent Common Context IP hostname pk

10/18/07, ORION meetingVirtual Workspaces: Workspace Ecosystem Resource Providers: Local clusters, Grid resource providers (TeraGrid, OSG) Commercial providers: EC2, Sun, slicehost, Provisioning a resource, not a platform Appliance Providers: OSFarm, rPath, CohesiveFT, bcfg2, etc. marketplaces of all kinds Virtual Organizations: configuration, attestation, maintenance Middleware: appliances --> resources manage appliance deployment Combining networks and storage VWSEC2In-Vigo

10/18/07, ORION meetingVirtual Workspaces: STAR: Why Workspaces? l STAR: a high energy nuclear physics application l Complex experimental application codes u Developed over more than 10 years, by more than 100 scientists, comprises ~2 M lines of C++ and Fortran code l Require complex, customized environments u Rely on the right combination of compiler versions and available libraries u Dynamically load external libraries depending on the task to be performed l Environment validation u To ensure reproducibility and result uniformity across environments u Regression tests cannot be done on all OS flavors due to simple manpower considerations

10/18/07, ORION meetingVirtual Workspaces: Virtual Workspaces for STAR l STAR image configuration u A virtual cluster composed of an OSG headnode and STAR worker nodes l Using the workspace service over EC2 to provision resources u Allocations of up to 100 nodes u Dynamically contextualized for out of the box cluster

10/18/07, ORION meetingVirtual Workspaces: Running jobs : 300 PDSF Fermi VWS/EC2BNL Running jobs : 230 Running jobs : 150 Running jobs : 50 Running jobs : 150 Running jobs : 300Running jobs : 282Running jobs : 243Running jobs : 221Running jobs : 195Running jobs : 140Running jobs : 76Running jobs : 0 Running jobs : 200 Running jobs : 50 Running jobs : 150Running jobs : 142Running jobs : 124Running jobs : 109Running jobs : 94Running jobs : 73Running jobs : 42 Running jobs : 195Running jobs : 183Running jobs : 152Running jobs : 136Running jobs : 96Running jobs : 54Running jobs : 37Running jobs : 0 Running jobs : 42Running jobs : 39Running jobs : 34Running jobs : 27Running jobs : 21Running jobs : 15Running jobs : 9Running jobs : 0 Job Completion : File Recovery : WSU with thanks to Jerome Lauret and Doug Olson of the STAR project

10/18/07, ORION meetingVirtual Workspaces: Nersc PDSF EC2 (via Workspace Service) WSU Accelerated display of a workflow job state Y = job number, X = job state with thanks to Jerome Lauret and Doug Olson of the STAR project

10/18/07, ORION meetingVirtual Workspaces: Parting Thoughts l Division of labor u Resource providers u Appliance vendors l Outsourcing of resource provisioning u We need rich lease semantics l Moving toward a Grid where your environment can run on the available resources u Configure once, run many times l Local resources, Grid resources, commercially available resources

10/18/07, ORION meetingVirtual Workspaces: Credits l Workspace team: u Tim Freeman, u Borja Sotomayor l Guest appearances u Ian Foster, Frank Siebenlist l STAR collaborators: u Jerome Lauret (BNL), Doug Olson (LBNL)