A Performance Evaluation of Azure and Nimbus Clouds for Scientific Applications Radu Tudoran KerData Team Inria Rennes ENS Cachan 10 April 2012 Joint work.

Slides:



Advertisements
Similar presentations
Sponsors and Acknowledgments This work is supported in part by the National Science Foundation under Grants No. OCI , IIP and CNS
Advertisements

Cloud Service Models and Performance Ang Li 09/13/2010.
SLA-Oriented Resource Provisioning for Cloud Computing
System Center 2012 R2 Overview
Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
Virtual Machine Usage in Cloud Computing for Amazon EE126: Computer Engineering Connor Cunningham Tufts University 12/1/14 “Virtual Machine Usage in Cloud.
Infrastructure as a Service (IaaS) Amazon EC2
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds School of Computer Engineering Nanyang Technological University,
Ken Birman. Massive data centers We’ve discussed the emergence of massive data centers associated with web applications and cloud computing Generally.
Authors: Thilina Gunarathne, Tak-Lon Wu, Judy Qiu, Geoffrey Fox Publish: HPDC'10, June 20–25, 2010, Chicago, Illinois, USA ACM Speaker: Jia Bao Lin.
Evaluating GPU Passthrough in Xen for High Performance Cloud Computing Andrew J. Younge 1, John Paul Walters 2, Stephen P. Crago 2, and Geoffrey C. Fox.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
5205 – IT Service Delivery and Support
Cloud computing Tahani aljehani.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
1 Integrating a Network IDS into an Open Source Cloud Computing Environment 1st International Workshop on Security and Performance in Emerging Distributed.
SUMMER VACATION SCHOLARSHIP | IM&T Scientific Computing in the Cloud.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Opensource for Cloud Deployments – Risk – Reward – Reality
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Windows Azure Conference 2014 Running Docker on Windows Azure.
1 NETE4631 Managing the Cloud and Capacity Planning Lecture Notes #8.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Workshop.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Your First Azure Application Michael Stiefel Reliable Software, Inc.
Improving Network I/O Virtualization for Cloud Computing.
Microsoft Virtual Academy.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Cloud Computing Instructor: Pankaj Mehra Teaching Assistant: Raghav Gautam Lec. 5 April 22, 2010 ISM 158.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Grid Data.
WINDOWS AZURE Scott Guthrie Corporate Vice President Windows Azure
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
What’s New with Windows Server 2012 and Microsoft System Center 2012 SP1 Vijay Tewari Principal Group Program Manager Microsoft Corporation.
Distributed FutureGrid Clouds for Scalable Collaborative Sensor-Centric Grid Applications For AMSA TO 4 Sensor Grid Technical Interchange Meeting March.
Virtual techdays INDIA │ august 2010 Cloud Computing – What and How ? Sandeep J Alur │ Microsoft India.
Rick Claus Sr. Technical Evangelist,
Zvezdan Pavković. Storage Non-Persistent Storage Persistent Storage Easily add additional storage. Networking Internal and Input Endpoints configured.
Cloud Computing Paradigms for Pleasingly Parallel Biomedical Applications Thilina Gunarathne, Tak-Lon Wu Judy Qiu, Geoffrey Fox School of Informatics,
Ian Gable HEPiX Spring 2009, Umeå 1 VM CPU Benchmarking the HEPiX Way Manfred Alef, Ian Gable FZK Karlsruhe University of Victoria May 28, 2009.
NC College of Engineering 1 Grid Computing: Harnessing Underutilized Resources Compiled by Compiled by Rajesh & Anju NCCE,Israna, Panipat.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
Windows Azure poDRw_Xi3Aw.
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
Scalable and elastic Enterprise scale and performance for the largest workloads Shared- nothing live migration Hyper-V Network.
Øg fleksibiliteten i din infrastruktur 32 virtual processors per VM 1 TB virtual machine memory New 64TB VHDX format Native 4k disk support Hyper-V.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Autonomic aspects in cloud data management Alexandra Carpen-Amarie KerData.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Work Package 5 Infrastructure.
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
CS 6027 Advanced Networking FINAL PROJECT ​. Cloud Computing KRANTHI ​ CHENNUPATI PRANEETHA VARIGONDA ​ SANGEETHA LAXMAN ​ VARUN ​ DENDUKURI.
Course: Cluster, grid and cloud computing systems Course author: Prof
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
C Loomis (CNRS/LAL) and V. Floros (GRNET)
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
Outline Virtualization Cloud Computing Microsoft Azure Platform
Sky Computing on FutureGrid and Grid’5000
Sky Computing on FutureGrid and Grid’5000
Client/Server Computing and Web Technologies
Deploying machine learning models at scale
Presentation transcript:

A Performance Evaluation of Azure and Nimbus Clouds for Scientific Applications Radu Tudoran KerData Team Inria Rennes ENS Cachan 10 April 2012 Joint work with Alexandru Costan, Gabriel Antoniu, Luc Bougé

Outline Context and motivation 2 cloud environments: Azure and Nimbus Metrics Evaluation and discussions Conclusion 10 April 2012 Performance Evaluation of Azure and Nimbus- 2

Scientific Context for Clouds Up to recent time, scientists mainly relied on grids and clusters for their experiments Clouds emerged as an alternative to these due to: -Elasticity -Easier management -Customizable environment -Experiments’ repeatability -Larger scales 10 April 2012 Performance Evaluation of Azure and Nimbus- 3

Requirements for Science Applications Performance: throughput, computation power, etc. Control Cost Stability Large storage capacity Reliability Data access and throughput Security Intra-machine communication 10 April 2012 Performance Evaluation of Azure and Nimbus- 4

Stages of a Scientific Application 10 April 2012 Performance Evaluation of Azure and Nimbus Computation Nodes Cloud Storage Local Host

Public Clouds: Azure On demand – pay as you go Computation (Web/Worker Role) separated from storage (Azure BLOBs) HTTP for storage access BLOB are structured in containers Multitenancy model 10 April 2012 Performance Evaluation of Azure and Nimbus- 6 VM TypeCPU CoresMemoryDisk Small11.75GB225GB Medium23.5GB490GB Large47GB1000GB ExtraLarge814GB2040GB VM Characteristics

Private Nimbus-powered Clouds Deployed in a controlled infrastructure Allows to lease a set of virtualized computation resources and to customize the environment 2 Phases deployment: - The cloud environment - The computation VMs Cumulus API - allows multiple storage - quota based storage - VM image repository 10 April 2012 Performance Evaluation of Azure and Nimbus- 7

Focus of Our Evaluation Initial phase - Deploying the environment (hypervisors, application, etc.) - Data staging - Metric: Total time Applications’ Performance and Variability -Computation - Real application ABrain - Metric: Makespan -Data transfer - Synthetic benchmarks - Metric: Throughput Cost - Pay-as you go - Infrastructure & Maintenance 10 April 2012 Performance Evaluation of Azure and Nimbus- 8

10 April 2012 Performance Evaluation of Azure and Nimbus- 9 Deployment time - Azure – 10 – 20 minutes - Nimbus - Phase 1 – 15 minutes - Phase 2 – 10 minutes (on Grid5000) Pre-processing Data moved from Local to Cloud Storage

10 April 2012 Performance Evaluation of Azure and Nimbus- 10 Pre-processing (2) Data moved from Cloud Storage to Computing nodes

Pre-processing Deployment time - Azure – 10 – 20 minutes - Nimbus - Phase 1 – 15 minutes - Phase 2 – 10 minutes (on Grid5000) 10 April 2012 Performance Evaluation of Azure and Nimbus- 11 Size GB AzureNimbus Local->AzureBlobsAzureBlobs->LocalLocal->CumulusCumulus->Local 0, Size GB AzureNimbus ExtraLarge->AzureBlobsAzureBlobs->ExtraLargeVM->CumulusCumulus->VM 0, Data moved from Local to Cloud Storage Data moved from Cloud Storage to Computing nodes

ABrain 10 April 2012 Performance Evaluation of Azure and Nimbus - 12 p(p(), Genetic data Brain image Y q~ N~2000 X p~10 6 – Anatomical MRI – Functional MRI – Diffusion MRI – DNA array (SNP/CNV) – gene expression data – others... finding associations:

Application Performance – Computation Time 10 April 2012 Performance Evaluation of Azure and Nimbus- 13 Repeated a run of ABrain 1440 times on each machine (Operations on large matrices) Evaluate only the local resources (CPU, Memory, Local storage)

10 April 2012 Performance Evaluation of Azure and Nimbus- 14 Computation Variability vs. Fairness Multitenancy model Variability of a VM sharing a node with others with respect to an isolated one

Data Transfer Throughput TCP - default and most commonly used - reliable transfer mechanism of data between VM instances RPC - based on HTTP - used by many applications as communication paradigm between their distributed entities 10 April 2012 Performance Evaluation of Azure and Nimbus- 15

Application Performance – Data Transfers 10 April 2012 Performance Evaluation of Azure and Nimbus- 16 Delivered TCP throughput at application level

Application Performance – Data Transfers 10 April 2012 Performance Evaluation of Azure and Nimbus- 17 RPC (HTTP) – delivered throughput at application level HTTP traffic control on nodes

Cost Analysis 10 April 2012 Performance Evaluation of Azure and Nimbus- 18

Conclusions Compared Nimbus and Azure clouds from the point of view of scientific applications stages Azure - lower cost - good TCP throughput and stability - faster transfer from storage to VM Nimbus - additional control - good RPC (HTTP) throughput - in general better data staging An analysis of how multitenancy model affects the fairness in public clouds 10 April 2012 Performance Evaluation of Azure and Nimbus- 19

thank you! Alexandru Costan Gabriel Antoniu Radu Tudoran Luc Bougé A Performance Evaluation of Azure and Nimbus Clouds for Scientific Applications