Www.egi.eu EGI-InSPIRE RI-261323 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 04/11/11 1 GPU Computing in EGI environment using Cloud approach F. Vella,

Slides:



Advertisements
Similar presentations
SLA-Oriented Resource Provisioning for Cloud Computing
Advertisements

Advanced Virtualization Techniques for High Performance Cloud Cyberinfrastructure Andrew J. Younge Ph.D. Candidate Indiana University Advisor: Geoffrey.
GPGPU Introduction Alan Gray EPCC The University of Edinburgh.
By Adam Balla & Wachiu Siu
Clouds C. Vuerli Contributed by Zsolt Nemeth. As it started.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
CLOUD COMPUTING AN OVERVIEW & QUALITY OF SERVICE Hamzeh Khazaei University of Manitoba Department of Computer Science Jan 28, 2010.
SDN and Openflow.
GPUs on Clouds Andrew J. Younge Indiana University (USC / Information Sciences Institute) UNCLASSIFIED: 08/03/2012.
Resource Management of Grid Computing
Public cloud definition Public cloud is a cloud in which Cloud infrastructure is available to the general public. Public cloud define cloud computing.
Evaluating GPU Passthrough in Xen for High Performance Cloud Computing Andrew J. Younge 1, John Paul Walters 2, Stephen P. Crago 2, and Geoffrey C. Fox.
© Copyright 2011 John Wiley & Sons, Inc.
Cloud Computing (101).
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Plan Introduction What is Cloud Computing?
DESIGN OF A PLATFORM OF VIRTUAL SERVICE CONTAINERS FOR SERVICE ORIENTED CLOUD COMPUTING Carlos de Alfonso Andrés García Vicente Hernández.
Cloud Computing.
1/8 Enhancing Grid Infrastructures with Virtualization and Cloud Technologies Ignacio M. Llorente Business Workshop EGEE’09 September 21st, 2009 Distributed.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
Software Architecture
DISTRIBUTED COMPUTING
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
PARALLEL COMPUTING overview What is Parallel Computing? Traditionally, software has been written for serial computation: To be run on a single computer.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Compiler and Runtime Support for Enabling Generalized Reduction Computations on Heterogeneous Parallel Configurations Vignesh Ravi, Wenjing Ma, David Chiu.
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
SDN Management Layer DESIGN REQUIREMENTS AND FUTURE DIRECTION NO OF SLIDES : 26 1.
Grid and Cloud Computing Alessandro Usai SWITCH Sergio Maffioletti Grid Computing Competence Centre - UZH/GC3
7. Grid Computing Systems and Resource Management
Full and Para Virtualization
Internet of Things. IoT Novel paradigm – Rapidly gaining ground in the wireless scenario Basic idea – Pervasive presence around us a variety of things.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Web Technologies Lecture 13 Introduction to cloud computing.
DIRAC Pilot Jobs A. Casajus, R. Graciani, A. Tsaregorodtsev for the LHCb DIRAC team Pilot Framework and the DIRAC WMS DIRAC Workload Management System.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI VM Management Chair: Alexander Papaspyrou 2/25/
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI MPI and Parallel Code Support Alessandro Costantini, Isabel Campos, Enol.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
Organizations Are Embracing New Opportunities
Introduction to Cloud Computing
Management of Virtual Execution Environments 3 June 2008
Cloud Computing Dr. Sharad Saxena.
Systems Analysis and Design 5th Edition Chapter 8. Architecture Design
Cloud Computing: Concepts
Presentation transcript:

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI /11/11 1 GPU Computing in EGI environment using Cloud approach F. Vella, R. Cefalà, A. Costantini, O. Gervasi, and C. Tanci Dept of Mathematics and Computer Science University of Perugia, Italy April 11, 2011 Vilnius (Lithuania) EGI User Forum 2011

EGI-InSPIRE RI Introduction GPU Computing has generated considerable interest in the scientific community Some EGI Virtual Organizations are reshap- ing their applications to exploit the new programming paradigm. Cloud Computing implements a transparent use of computational resources We provide the access to a GPU environ- ment on EGI using a Cloud approach 04/11/11 2

EGI-InSPIRE RI Introduction GPU Computing is pushing the development of distribution models aimed at integrating HPC and HTC resources. We provide on-demand execution environ- ments through the joint usage of g-Lite components and the EC2 web-service API. The entire Job flow has been defined, enabl- ing the request of GPU resources through JDL parameters, dynamically allocating the resources in a Cloud-like infrastructure 04/11/11 3

EGI-InSPIRE RI Grid and Cloud Grid and Cloud paradigms share some essential driving ideas and overlapping areas which lead to −Encapsulate the complexity of hardware resources and make them easily accessible by means of high-level user interfaces −Address the intrinsic scalability issues of large scale computational challenges −Cope with the need of resources that cannot be hosted locally 04/11/11 4

EGI-InSPIRE RI Grid and Cloud Among the key differences between Grid and Cloud there are those related to abstraction and computational models 04/11/11 5 Cloud definition: A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms, and services are delivered on demand to external customers over the Internet.

EGI-InSPIRE RI Grid and Cloud Clouds enable the users to choose between different computational models suited to match the requirements of a particular application This allow to encompass some limits of the batch model of the Grid, where resources are often statically managed and partitioned preventing any possible dynamical arrangement to match the application requirements. The workload in Grid is often unpredictable leading to unbalanced distribution on the use of resources and to a reduction of QoS 04/11/11 6

EGI-InSPIRE RI Grid and Cloud Cloud Computing presents still some weak aspects that require further developments which are instead well-established in Grid, i.e: −Security −Data management The more reasonable approach could be an integration model that combines the features of both paradigms 04/11/11 7

EGI-InSPIRE RI Grid and Cloud integration Hybrid with batch-dependent cloud-enabled LRMS −a single local batch system is used to schedule the jobs on a pool of dynamically provisioned re- sources on public/private clouds Hybrid batch independent −the local Grid site spawns resources on public/ private clouds on the basis of job requests. The integration is done at the Computing Element level. It enables the creation of multiple virtual clusters, including LRMS. 04/11/11 8

EGI-InSPIRE RI HPC in EGI Grid can be seen as the ideal computing infrastruc- ture to solve very large computational campaigns, distributing calculations on the available Grid resources in a secure way. 04/11/11 9

EGI-InSPIRE RI HPC in EGI: MPI EGI allows to the execution of HPC applica- tions, at Grid site level, using the MPI API, which gained an important role in many VOs, paying some efforts to integrate the parallel environment with gLite middleware. Nevertheless, the use of MPI in Grid takes some limitations due mainly to the policies adopted by the scheduler on each Grid site. This reduces the advantage of parallelization and may cause job failures 04/11/11 10

EGI-InSPIRE RI Some MPI tests We ran some tests using COMPCHEM VO applica- tions and results confirmed that there are several problems running MPI jobs. 04/11/11 11

EGI-InSPIRE RI HPC in EGI: Multi/Manycore The opportunity to using heterogeneous resources (CPUs and GPU) as compute units can solve the need of HPC resources. An application written in OpenCL or CUDA is inherently parallel but the management of its workflow is simple like a single job. The cost of communications among threads is smaller in a single node in respect of an MPI one An application as the like would be less dependent from the site peculiarities. Not all HPC scientific application can be implemented in a Many/Multicore fashion. 04/11/11 12

EGI-InSPIRE RI A simple integration model A typical Grid site runs an application that “speaks” with the CREAM CE BLAH component and an IaaS provider To create a working testbed we chose and implemented a simple infrastructure for the provision of GPU virtual instances using Eucalyptus on top of the Xen Hypervisor The interfaces to the Cloud Controller are Amazon EC2 compliant and we used boto API to develop our testbed 04/11/11 13

EGI-InSPIRE RI A simple integration model The physical GPU resources have been virtualized and made available in a Infrastructure as a Service (IaaS) private Cloud. A centralized mechanism that interfaces a generic EGI Grid site to an IaaS provider has been implemented to fulfill job requirements. The fully working testbed has been built with the adoption of Eucalyptus software system to implement a private Cloud over the cluster. We created virtual appliances for OpenCL or CUDA to match job requirements related to Multi/Manycore technology. 04/11/11 14

EGI-InSPIRE RI Eucalyptus architecture 04/11/11 15

EGI-InSPIRE RI Xen and GPU virtualization When hardware prerequisites are met, Xen permits to transparently assign PCIE devices to Virtual Machines. A single PCIE resource cannot be shared between Virtual Machines (unless one separate and decouple the graphic driver between front-end and back-end*). At the present time only a few GPUs officially support passing through Virtual Machines. The selection of commodity components, new chipsets, motherboards, CPUs and GPUs require a particular care since some components are not compatible with full hardware virtualization. * 04/11/11 16 F. Giunta, R. Montella, G. Laccetti, F. Isaila and F. J. García Blas. A GPU Accelerated High Performance Cloud Computing Infrastructure for Grid Computing Based Virtual Environmental Laboratory. Advances in Grdi Computing, Edited by Z. Constantinescu, pp , InTech publ. ISBN (2011)

EGI-InSPIRE RI OpenCL performances To assess OpenCL performance gap between a real machine and a virtual one a testbed has been implemented using an Asus P7P55 LX motherboard with support of both Intel Virtualization Technology (VT-x) and Intel Virtualization Technology for Directed I/O (VT-D) provided by I7 870 CPU, a 4 cores, 64-bit, x86 part from INTEL. Graphic Adapter: AMD FireStream We used Phoronix test suite benchmark The virtual GPU is 3% slower 04/11/11 17 Black=Real GPU Gray=Virtual GPU

EGI-InSPIRE RI The implemented system The main components are: Grid site CE and BLAH, a demon named resource-marshal and an IaaS provider that exposes EC2 interfaces. For each job the CE receives, after the BLAH parsing operation, we are able to gather all informa- tion for that job and marshal its execution. To this end we implemented a basic daemon that manages the allocation of instances according to Job information and simple policies 04/11/11 18

EGI-InSPIRE RI The implemented system Job information are used to determine the flavor of the required virtual appliance, specified via GlueSchema in the JDL If an instance is already available the job proceeds immediately. To implement such behavior a call to simple client application is interposed to the BLAH XXX_job_submit scripts that inquires the resource-marshal daemon about the availability of resources 04/11/11 19 Type = "job"; executable="/bin/sleep"; [...] CeRequirements= "other.GlueHostMainMemoryRAMSize > 2048 && (Member( "GPU\", other. GlueHostApplicationSoftwareRuntimeEnvironmen t))"; The daemon keeps track of available instances and sends requests of new ones to the IaaS provider via boto EC2 interfaces. We haven't applied a strict termination policy for the instances, however the possibilities are: Job termination events can be used to trigger the reclamation of unused instances Grid users proxy lifetime could be bound to the instances lifecycle. Some instances could be kept permanently running

EGI-InSPIRE RI Conclusions We presented a system to provide to EGI users a specific on-demand GPU environment to transparently execute jobs on GPU devices The proposed system uses a Cloud approach, based on EC2 Compliant Clouds, in order to control the specific GPU-enabled VO environment from EGI middleware interfaces. The proposed solution enables the users to run their applications in parallel, exploring the Many/Multicore capabilities of the Grid nodes. This innovative approach provides to the users an interesting alternative to MPI for running parallel jobs 04/11/11 20