Infrastructure-as-a-Service Cloud Computing for Science

Slides:



Advertisements
Similar presentations
First Steps in the Clouds
Advertisements

Mitesh Soni. Not an Expert Session… Only an Overview Please Ask Questions Stop me if I am throwing Bouncers Language Preference?
A Scalable Approach to Deploying and Managing Appliances Kate Keahey Rick Bradshaw, Narayan Desai, Tim Freeman Argonne National Lab, University of Chicago.
Virtualization, Cloud Computing, and TeraGrid Kate Keahey (University of Chicago, ANL) Marlon Pierce (Indiana University)
On-Demand Virtual Workspaces: Quality of Life in the Grid Kate Keahey Argonne National Laboratory.
Virtual Appliances for Scientific Applications Kate Keahey Argonne National Laboratory University of Chicago.
Virtual Workspaces State of the Art and Current Directions Borja Sotomayor University of Chicago (Dept. of CS) Kate Keahey ANL/UC.
Workspaces for CE Management Kate Keahey Argonne National Laboratory.
The VM deployment process has 3 major steps: 1.The client queries the VM repository, sending a list of criteria describing a workspace. The repository.
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Virtual Workspaces in the Grid Kate Keahey Argonne National Laboratory Ian Foster, Tim Freeman, Xuehai Zhang, Daniel Galron.
Cloud Computing and Virtualization with Globus Oakland, May 2008 Kate Keahey Tim Freeman University of Chicago.
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Chicago, October 2008 Kate Keahey, Renato Figueiredo, Jose Fortes, Tim.
Globus Virtual Workspaces An Update SC 2007, Reno, NV Kate Keahey Argonne National Laboratory University of Chicago
Nimbus or an Open Source Cloud Platform or the Best Open Source EC2 No Money Can Buy ;-) Kate Keahey Tim Freeman University of Chicago.
Cloud Computing with Nimbus FNAL, January 2009 Kate Keahey University of Chicago Argonne National Laboratory.
Globus Virtual Workspaces HEPiX Fall 2007, St Louis Kate Keahey Argonne National Laboratory University of Chicago
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
Cloud Resource Broker for Scientific Community By: Shahzad Nizamani Supervisor: Peter Dew Co Supervisor: Karim Djemame Mo Haji.
Computing Infrastructure
The Platform as a Service Model for Networking Eric Keller, Jennifer Rexford Princeton University INM/WREN 2010.
SZTAKI Cloud akadémiai vonatkozása Kecskeméti Gábor MTA-SZTAKI Párhuzamos és Elosztott rendszerek kutatólaboratórium.
1/16 Distributed Systems Architecture Research Group Universidad Complutense de Madrid An Introduction to Virtualization and Cloud Technologies to Support.
1 Cloud Computing Prof. Ravi Sandhu Executive Director and Endowed Chair April 12, © Ravi Sandhu World-Leading.
University of Notre Dame
System Center 2012 R2 Overview
Cloud Computing Imranul Hoque. Today’s Cloud Computing.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
Obstacles Security Culture Cloud Cloud Computing will be the primary delivery model, the big question is how fast is going to get there. The cost is the.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
Introduction to DoC Private Cloud
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
Cloud Computing Why is it called the cloud?.
CLOUD COMPUTING & COST MANAGEMENT S. Gurubalasubramaniyan, MSc IT, MTech Presented by.
Ocean Observatories Initiative Common Execution Environment Kate Keahey OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego,
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Virtual Infrastructure in the Grid Kate Keahey Argonne National Laboratory.
Nimbus & OpenNebula Young Suk Moon. Nimbus - Intro Open source toolkit Provides virtual workspace service (Infrastructure as a Service) A client uses.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Introduction to Cloud Computing
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Improving Network I/O Virtualization for Cloud Computing.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
Globus Virtual Workspaces OOI Cyberinfrastructure Design Meeting, San Diego, October Kate Keahey University of Chicago Argonne National Laboratory.
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
CLOUD COMPUTING RICH SANGPROM. What is cloud computing? “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a.
Web Technologies Lecture 13 Introduction to cloud computing.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
Nimbus Update March 2010 OSG All Hands Meeting Kate Keahey Nimbus Project University of Chicago Argonne National Laboratory.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Cloud Computing with Nimbus
StratusLab Final Periodic Review
StratusLab Final Periodic Review
GGF15 – Grids and Network Virtualization
Cloud Computing with Nimbus
Virtualization, Cloud Computing, and TeraGrid
Internet and Web Simple client-server model
Cloud Computing: Concepts
Using and Building Infrastructure Clouds for Science
Presentation transcript:

Infrastructure-as-a-Service Cloud Computing for Science April 2010 Salishan Conference Kate Keahey keahey@mcs.anl.gov Nimbus project lead Argonne National Laboratory Computation Institute, University of Chicago

Cloud Computing for Science Environment control Resource control

“Workspaces” Dynamically provisioned environments Implementations Environment control Resource control Implementations Via leasing hardware platforms: reimaging, configuration management, dynamic accounts… Via virtualization: VM deployment Isolation

The Nimbus Workspace Service Pool node Pool node Pool node VWS Service Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node Pool node

The Nimbus Workspace Service The workspace service publishes information about each workspace Pool node Pool node Pool node VWS Service Pool node Pool node Pool node Users can find out information about their workspace (e.g. what IP the workspace was bound to) Pool node Pool node Pool node Pool node Pool node Pool node Users can interact directly with their workspaces the same way the would with a physical machine.

Nimbus: Cloud Computing for Science Allow providers to build clouds Workspace Service: a service providing EC2-like functionality WSRF and WS (EC2) interfaces Allow users to use cloud computing Do whatever it takes to enable scientists to use IaaS Context Broker: turnkey virtual clusters, Also: protocol adapters, account managers and scaling tools Allow developers to experiment with Nimbus For research or usability/performance improvements Open source, extensible software Community extensions and contributions: UVIC (monitoring), IU (EBS, research), Technical University of Vienna (privacy, research) Nimbus: www.nimbusproject.org 3/27/2017 The Nimbus Toolkit: http//workspace.globus.org

Clouds for Science: a Personal Perspective OOI starts First STAR production run on EC2 Experimental Clouds for Science Xen released Science Clouds available EC2 released 2004 2006 2008 2010 “A Case for Grid Computing on VMs” In-Vigo, VIOLIN, DVEs, Dynamic accounts Policy-driven negotiation First Nimbus release Nimbus Context Broker release

STAR experiment Work by Jerome Lauret, Leve Hajdu, Lidia Didenko (BNL), Doug Olson (LBNL) STAR: a nuclear physics experiment at Brookhaven National Laboratory Studies fundamental properties of nuclear matter Problems: Complexity Consistency Availability

STAR Virtual Clusters Virtual resources A virtual OSG STAR cluster: OSG headnode (gridmapfiles, host certificates, NFS, Torque), worker nodes: SL4 + STAR One-click virtual cluster deployment via Nimbus Context Broker From Science Clouds to EC2 runs Running production codes since 2007 The Quark Matter run: producing just-in-time results for a conference: http://www.isgtw.org/?pid=1001735

Priceless? Compute costs: $ 5,630.30 Data transfer costs: $ 136.38 300+ nodes over ~10 days, Instances, 32-bit, 1.7 GB memory: EC2 default: 1 EC2 CPU unit High-CPU Medium Instances: 5 EC2 CPU units (2 cores) ~36,000 compute hours total Data transfer costs: $ 136.38 Small I/O needs : moved <1TB of data over duration Storage costs: $ 4.69 Images only, all data transferred at run-time Producing the result before the deadline… …$ 5,771.37

Cloud Bursting ALICE: Elastically Extend a Grid Elastically Extend a cluster React to Emergency OOI: Provide a Highly Available Service

Hadoop in the Science Clouds U of Chicago U of Florida Hadoop cloud Purdue Papers: “CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications” by A. Matsunaga, M. Tsugawa and J. Fortes. eScience 2008. “Sky Computing”, by K. Keahey, A. Matsunaga, M. Tsugawa, J. Fortes, to appear in IEEE Internet Computing, September 2009 3/27/2017 The Nimbus Toolkit: http//workspace.globus.org

Genomics: Dramatic Growth in the Need for Processing moving from big science (at centers) to many players democratization of sequencing we currently have more data than we can handle GB 454 Solexa Next gen Solexa From Folker Meyer, “The M5 Platform”

Ocean Observatory Initiative CI: Linking the marine infrastructure to science and users

Benefits and Concerns Now Environment per user (group) On-demand access “We don’t want to run datacenters!” Capital expense -> operational expense Growth and cost management Concerns Performance: “Cloud computing offerings are good, but they do not cater to scientific needs” Price and stability Privacy What are we hearing from people today? b. What prevents scientists from using clouds? Is cloud computing suitable for science? Ed Lazowska says 85% suitable. c. What specific challenges make it hard? d. What will be my “fusion slide” 5 years from now when I give a talk? What communities will have inspired the next advance in science?

Performance (Hardware) From Walker, ;login: 2008 Challenges Big I/O degradation Small CPU degradation Ethernet vs Infiniband No OS bypass drivers, no infiniband New development OS bypass drivers Potentially add graph from Walker’s paper What is the incoming/outgoing bandwidth exactly? Performance is half the problem – variability is worse! Relatively small scale: 800 nodes

Performance (Configuration) Trade-off: CPU vs. I/O VMM configuration Sharing between VMs “VMM latency” Performance instability Multi-core opportunity A price performance trade-off From Santos et al., Xen Summit 2007 Ultimately a change in mindset: “performance matters”!

From Performance… …to Price-Performance “Instance” definitions for science I/O oriented, well-described Co-location of instances To stack or not to stack Availability @ price point CPU: on-demand, reserved, spot pricing Data: high/low availability? Data access performance and availability Pricing Finer and coarser grained

Availability, Utilization, and Cost/Price Most of science today is done in batch The cost of on-demand Overprovisioning or request failure? Clouds + HTC = marriage made in heaven? Spot pricing courtesy of Rob Simmonds, example of WestGrid utilization

Data in the Cloud Storage clouds and SANs Challenges AWS Simple Storage Service (S3) AWS Elastic Block Store (EBS) Challenges Bandwidth performance and sharing Sharing data between users Sharing storage between instances Availability Data Privacy (the really hard issue) Descher et al., Retaining Data Control in Infrastructure Clouds, ARES (the International Dependability Conference), 2009.

What if my IaaS provider Cloud Markets Is computing fungible? Can it be fungible? Diverse paradigms: IaaS, PaaS, SaaS, and other aaS.. Interoperability Comparison basis What if my IaaS provider double their prices tomorrow?

IaaS Cloud Interoperability Cloud standards OCCI (OGF), OVF (DMTF), and many more… Cloud-standards.org Cloud abstractions Deltacloud, jcloud, libcloud, and many more… Appliances, not images rBuilder, BCFG2, CohesiveFT, Puppet, and many more…

Can you give us a better deal? In terms of… Performance, deployment, data access, price Based on relevant scientific benchmarks Comprehensive, current, and public The Bitsource: CloudServers vs EC2 LKC Cost by Instance

Science Cloud Ecosystem Goal: time to science in clouds -> zero Scientific appliances New tools “turnkey clusters”, cloud bursting, etc. Open source important Change in the mindset Education and adaptation “Profiles” New paradigm requires new approaches Teach old dogs some new tricks

The Magellan Project Joint project: ALCF and NERSC Funded by DOE/ASCR ARRA QDR InfiniBand Switch File Servers (8) (/home) 160TB Compute 504 Compute Nodes Nehalem Dual quad-core 2.66GHz 24GB RAM, 500GB Disk QDR IB link Totals 4032 Cores, 40TF Peak 12TB RAM, 250TB Disk Aggregation Switch ESNet 10Gb/s Apply at: http://magellan.alcf.anl.gov Login Nodes (4) ANI 100 Gb/s Router Gateway Nodes (~20)

The FutureGrid Project NSF-funded experimental testbed (incl. clouds) ~6000 total cores connected by private network Apply at: www.futuregrid.org

Parting Thoughts Opinions split into extremes: “Cloud computing is done!” “Infrastructure for toy problems” The truth is in the middle We know that IaaS represents a viable paradigm for a set of scientific applications… …it will take more work to enlarge that set Experimentation and open source are a catalyst Challenge: let’s make it work for science!