Use cases and a practical example for Grid and Cloud integration Elisabetta Ronchieri, INFN CNAF EPIKH School, Beijing,

Slides:



Advertisements
Similar presentations
SLA-Oriented Resource Provisioning for Cloud Computing
Advertisements

Chapter 22: Cloud Computing and Related Security Issues Guide to Computer Network Security.
By Adam Balla & Wachiu Siu
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
Clouds C. Vuerli Contributed by Zsolt Nemeth. As it started.
Tunis, Tunisia, 28 April 2014 Business Values of Virtualization Mounir Ferjani, Senior Product Manager, Huawei Technologies 2.
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Workload Management Massimo Sgaravatto INFN Padova.
Lecture Nine Database Planning, Design, and Administration
Course Instructor: Aisha Azeem
Cloud Usability Framework
SaaS, PaaS & TaaS By: Raza Usmani
Cloud Computing Guide & Handbook SAI USA Madhav Panwar.
Grid and Cloud Operations Interoperability – An overview Alberto Di Meglio (CERN, EMI) Morris Riedel (FZJ, EMI) Shahbaz Memon (FZJ, EMI) Cal Loomis (CNRS/LAL,
Clouds on IT horizon Faculty of Maritime Studies University of Rijeka Sanja Mohorovičić INFuture 2009, Zagreb, 5 November 2009.
Introduction to Cloud Computing
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
McGraw-Hill/Irwin © The McGraw-Hill Companies, All Rights Reserved BUSINESS PLUG-IN B17 Organizational Architecture Trends.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
Cloud Use Cases, Required Standards, and Roadmaps Excerpts From Cloud Computing Use Cases White Paper
Grid Computing at The Hartford Condor Week 2008 Robert Nordlund
WNoDeS – Worker Nodes on Demand Service on EMI2 WNoDeS – Worker Nodes on Demand Service on EMI2 Local batch jobs can be run on both real and virtual execution.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
9 Systems Analysis and Design in a Changing World, Fourth Edition.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Trusted Virtual Machine Images a step towards Cloud Computing for HEP? Tony Cass on behalf of the HEPiX Virtualisation Working Group October 19 th 2010.
Security Vulnerabilities in A Virtual Environment
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Giuseppe Andronico INFN Sez. CT / Consorzio COMETA Beijing,
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
© 2006 Open Grid Forum INFN WNoDes and OCCI Michele Orru’, INFN-CNAF.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Web Technologies Lecture 13 Introduction to cloud computing.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Servizi di brokering Valerio Venturi CCR Giornata di formazione dedicata al Cloud Computing 6 Febbraio 2013.
Resource Provisioning EGI_DS WP3 consolidation workshop, CERN Fotis Karayannis, GRNET.
INFN OCCI implementation on Grid Infrastructure Michele Orrù INFN-CNAF OGF27, 13/10/ M.Orrù (INFN-CNAF) INFN OCCI implementation on Grid Infrastructure.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
WNoDeS – a Grid/Cloud Integration Framework Elisabetta Ronchieri (INFN-CNAF) for the WNoDeS Project
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
Trusted Virtual Machine Images the HEPiX Point of View Tony Cass October 21 st 2011.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Giuseppe Andronico INFN Sez. CT & Consorzio COMETA Workshop Clouds.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Dr. Ir. Yeffry Handoko Putra
Accessing the VI-SEEM infrastructure
Workload Management Workpackage
Chapter 6: Securing the Cloud
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
Cloud Management Mechanisms
Presentation transcript:

Use cases and a practical example for Grid and Cloud integration Elisabetta Ronchieri, INFN CNAF EPIKH School, Beijing, China May 18, 2011

Outline Clouds and Grids An example of Grid and Cloud Integration Summary 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

The compulsory slide on definitions: Grid vs. Cloud GRID: The essence of the [definition] can be captured in a simple checklist, according to which a Grid is a system that: – coordinates resources that are not subject to centralized control... –... using standard, open, general-purpose protocols and interfaces... –... to deliver nontrivial qualities of service. (I. Foster, What is the Grid? A three Point Checklist, 2002) CLOUD: Cloud Computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction (NIST Working Definition of Cloud Computing). 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

In practice Distributed computing infrastructures (whatever they incarnation is) should provide solutions for resource discovery, usage, policing honor contracted service level agreements ensure proper security enforcement measures (authentication, authorization) are taken. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

More pragmatically Grids are about federating resources. Clouds are about provisioning resources. Grids never found the right business model in the industry, so they failed to get industry uptake What will happen when we want to federate clouds? – Does anybody want to? How about authentication, authorization, accounting, and brokering? – Different domains, like industrial, e-Science, e-Government will have different requirements and perceived risks for cloud computing. ‘Cloud is a semaless extension of the Grid’, Dan Read, OGF30. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

The Grid, from a User’s Perspective How use Be part of a Virtual Organization. If you can’t find one, you must set it up Access the Grid via a User Interface, authenticating via x.509 digital certificate Specify your job requirements via a Job Description Language (JDL) Your job requirements will be matched against available resources. If suitable resources are found, your job will sooner or later run somewhere. You will be able to check job status, collect output, store, find and retrieve data. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School Architecture characteristics Emphasis on sharing resources at a virtual organizational level Mainly adopted by scientific communities, with limited industry uptake Typically batch-focused, with limited provision for interactive, dynamic usage.

The Cloud, from a User’s Perspective How use Identify a Service Provider Allocate your seemingly infinite desired resources, typically through Web Applications Gain access to your resources (which can be services, software organizations, hardware cores) through pay-as-you-go models 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School Architecture characteristics Emphasis on ease of access to resources for individual users Initiated within the cmmercial sector, with wide success Several level of abstraction are possible: infrastructure as a service, platform as a service, software as a service, and so on.

Which use cases are really there for the Cloud? See the Whitepaper by the Cloud Computing Use Case Discussion Group ( g/Cloud_Computing_Use_Cases_Whi tepaper-2_0.pdf) g/Cloud_Computing_Use_Cases_Whi tepaper-2_0.pdf 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Mantras: Service, Cost Savings, Consolidation, Infinity, Utility. All On-demands January 27, The UK government has unveiled a sweeping strategy to create its own internal cloud computing system, such as that used by Google, Microsoft and Amazon, as part of a radical plan that it claims could save up to £23bn a year from an annual bill of at least £16bn.J – The key part of the new strategy [...] will be the concentration of governement computing power into a series of about a dozen highly secure data centres, each costing up to $250m to build, which will replace more than 500 presently used by central government, police forces and local authorities. – By 2015, the strategy suggests, 80% of central governement desktops could be supplied through a shared utility service. – The new cloud system will not include the security services such as M15 or M16, which have their own, separate systems. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Can this be really used in the scientific world? It appears that at lest in some cases the answer can be yes. Take for example the Gaia project* of the European Space Agency, whose goal is to survey about a billion stars to make an extremely precise three-dimensional map of our galaxy: – For the full 1 billion star project numbers (the Gaia Science Operations Development team) calculated that they will analyze 100 million primary stars, plus 6 years of data, which will require a total of 16,200 hours of a 20-node EC2 cluster. That’s an estimated total computing cost of 344,000 EUR. By comparison, an in-house solution would cost roughly 720,000 EUR (at today’s prices) – which does not include electricity or storage or sys-admin costs. Storage alone would be an additional 100,000 EUR. *( 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Problems solved with Clouds? Not really Based on the successful multi-year experience built on Grids, some heavy users note that for what regards Clouds: Inter-operation of multiple cloud providers is not a reality yet, and vendor lock-in is a big issue; Political, legal, or security-related considerations discourage the idea of outsourcing control to external entities; These concerns are particularly acute in the case of the interconnection of different components: computing, storage and network resources; For example, given the level of optimization that was neded for the interaction between storage and computing resources in High-Energy-Physics experiments, it is debatable whether the same performance can be achieved by general purpose infrastructures, like commercial clouds. – Customers will pay either in terms of latencies, or in terms of extra costs. Can we adapt and reuse our existing Grid related know how and infrastructures? 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

New requirements to existing Grid Infrastructures While Grid Interfaces are widely used among large scientific communities, Cloud computing offers significant advantages for many reasons such as pay- as-you-go models, and simplified access. Ideally, though, one would like to adopt Cloud services so that: – Resources are shared between acess interfaces (Grid, Cloud or else) – Scalability is ensured – Existing services and agreements are not required to change substantially – Resource centers policies are honored and know-how is preserved – New services can attract both existing and new customers. These are both key challenges and opportunities for existing Grid infrastructures. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Examples of services requested today Some of the typical new service requests we receive: Customer-definable softwre environments. This is a feature that finds several uses in traditional Grids as well; Setting up dynamic pools of virtual servers such as user interfaces, or worker nodes for parallel interactive analysis. More generally, flexibility allocating hardware resources through complex advance-reservation requests. Instantiating pre-packaged, ready-to-go services. Truly distributed, on-demand, Cloud storage Not everybody, speaks Grid: providing access to distributed, traditional Grid infrastructures as if they were not Grids,. This might be offered to non-traditional users, like public administrations, or to the private sector. The key problem is one of integration between several access interfaces, that is resources and multiple access interfaces (Grid, Cloud or other). 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Grids and Clouds: common grounds Grids and Clouds (abstracting from the concept of a Grid job, which one should regard as an implementation detail) basically target the use of resources. The two terms come from differnt grounds, but really they are just different interfaces to access resources: – Users may actually benefit joining an existing infrastructure, rather than building a new one. – Sharing of data and resources across Grid and Cloud interfaces should be encouraged. – Leveraging on multi-year investments and know-how on Grids to incrementally evolve and build new services is a strategic decision. – Grids like EGEE/EGI are production infrastructures, serving the needs of many (big and small) research communities. The question is then how in practive can you integrate Grids and Cloud. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Interactions between computing, storage, network resources: a simple example A minimalistic workflow for analysis performed in the context of a physics experiment is such that in general a user will: Decide which analysis to perform for his new research Develop the code which performs it; this is typically a high-level macro or a plugin of some experiment-based software framework Ask a system about the data requirements – Which files contain the needed information. This info is often in an experiment-based metadata repository or file catalogue. Ask another system to process his analysis. This could happen via a Grid, a Cloud, local (virtual) batch farms, or possibly even one’s own computer. Collect the results. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

A possible approach to the workflow One could: Carefully choose where to send a processing job, for example to the place which best matches the needed data set; Use tools to create local replicas of the needed data files. High Energy Physics data files are typically big and fairly static, so it is better to exploit locality if possible. – In the right places – Eventually use tools also to push new produced data files to the official repositories. Think here of complementarity between Grid and Cloud tasks – If overdone, this can be quite time and resource consuming Any variation is possible. For example pre-populate everything before sending jobs. – What is best is left to you to decide 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Outline Clouds and Grids An example of Grid and Cloud Integration Summary 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

How to cloudify the [Scientific] World, then? An evolutionary model is needed possibly offering ways – to interconnect multiple Distributed Computing infrastructures – to continue to support existing scientific computing patterns A mix of public and private clouds is a concrete possibility 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

A practical Grid and Cloud integration example: WNoDeS The Worker Nodes on Demand Service (WNoDeS) is a framework developed by INFN. It is built around a tight integration with a LRMS (a batch system). It is running in production at the INFN Tier1 Computing Centre since November – It has been also deployed at INFN Legnaro. – Tests are ongoing on other INFN sites. It is focused on making resource polymorphism easy and on providing flexibility to both users and resource providers in a production environment. It provides transparent user interfaces for Grids and Clouds. It reuses several existing and proven components (like Grid authentication and authorization), current workflows, data center schedulers. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Key WNoDeS Characteristics The main architectural characteristics are: fully integration with existing computing resource scheduling, policing, monitoring and accounting workflows on-demands virtual resource provisioning VLAN support to dynamically isolate Virtual Machines depending on the service type and customer requests – by using Linux KVM as VM manager Support for users to select and access WNoDeS-based resources through Grid, Cloud interfaces, or also through direct job submissions – by using either command line or a Web portal Scalability having to handle thousands of VMs No concept of ‘Cloud over Grid’ or ‘Grid over Cloud’ – Just use all resources and present them to the user as the user wants to see them 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

WNoDeS: overall architecture framework 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School Key points: 1.On-demand virtual service provisioning; 2.Flexible, integrated scheduling policies; 3.Multiple access interfaces; 4.Multiple authentication methods; 5.Integrated access to existing infrasturctures; 6.Access to external resources.

WNoDeS: VM layer 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School Policing, resource allocation and scheduling are directly demanded to an underlying batch system called LRMS. All physical nodes have the following model: An hypervisor able to instantiate Virtual Machines; A special Virtual Machine called bait, part of LRMS, responsible to publish available local resources to the LRMS and to attract jobs on the local physical system; Zero or more running Virtual Machines to execute jobs, instantiated on-demand by the local hyerpvisor upon appropriate requests made by the local bait. Virtual Machines begin to exist in the configuration requested by the user, only when needed. When the Virtual Machine is not needed any more, it may be destroyed.

WNoDeS: Grid and Local Access 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School WNoDeS-based resources may be accessed transparently by users through the local batch system and the Grid. By the Grid: Jobs of some VOs are automatically directed on VMs without user intervention. Grid users may explicitly specify the VM they want to use, adding a CE requirement statement in the JDL for their jobs. Users must be authorized by resource providers to locally use the selected VM images through proper WNoDeS configuration. This works today with the current CE-CREAM software.

WNoDeS: Cloud Access 1/2 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School WNoDeS allows generic instantiation of compute cores. They are allocated to the user requesting them, and released only when the user explicitly says so User chooses resource size, operating system, and gets root access to them It can be used to develop, instal, test new software, run services, create small ad-hoc farms, and so on. Cloud resources are taken from the same resource pool used e.g. for Grid resources to optimize the usage of resources. WNoDeS does not need to dedicate resources to batch, Grid, Cloud, but one may do so if this is desired.

WNoDeS: Cloud Access 2/2 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School WNoDeS supports the Open Cloud Computing Interface (OCCI) standard, at the moment rarely used by users. Cloud instantiations typically happen through a Web-based portal in which VOMS/gLite Argus for authentication and authorization are integrated allowing flexible policies to determine who you are and what you are allowed to do.

WNoDeS: Virtual Interactive Pools Access 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School WNoDeS integrates the Virtual Interactive Pools interface. It is a way for a local user to dynamically and automatically instantiate a compute node for interactive access. User can specify RAM, number of CPUs, and so on. It can be used for software development, to submit jobs, and so on. It has been tested by the local CMS group in a Tier-3 environment hosted at CNAF and intergrated int he bigger Tier-1

WNoDeS: Authentication Gateway 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School WNoDeS integrates different mechanisms of user authentication through the authentication gateway layer. It lets Cloud users accessing Grid resources allowing Cloud users to exploit previous investments with minimal changes to the sites. This solution provides the user an X.509 short lived certificate and registers the user into a dedicated VO that needs to be accepted by the site. It lets Grid users accessing Cloud resources. using their Grid credentials. This solution supports the integration with VOMS/gLite Argus to validate user’s credentials and access policies.

Outline Clouds and Grids An example of Grid and Cloud Integration Summary 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Where are we? The WNoDeS framework is running at the INFN Tier-1 It is an example of a framework targeted at integrating Grid and Cloud resources – running in production mode at a sizable computing center. It encourages integration with regard to data production and consumption using both Grid and Cloud interfaces. – Data produced via the Cloud can be put in Grid-accessible storage repositories, and viceversa. It is being developed by following an integrated approach: – reusing some of the Grid developments of the past decade to achieve interconnection of multiple Clouds; – being the interconnection of Grids a reality today. WNoDeS 2 is targeted for public release in September 2011 – supporting among other things PBS/Torque and Platform LSF as batch systems. 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

What do we need? We need to ride the Cloud wave in an organized way to keep on offering our public (users and country) domains valuable distributed computing services. At the policy level – In relation to e-Infrastructures for e-Science or e-Governement; national and logical digital agendas; privacy, legal and ethical issues; and sustainability (are there really infinite resources?). At the technical level – Investigating and investing into new possibilities; evaluating feasibility of current solutions; working on development and implementation of Cloud frameworks. By following an integrated approach – Reusing our existing know-how and multi-years experience. ‘Cloud is a seamless extension of the Grid’, Dan Reed, OGF /05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

That’s it – It’s shoppable although perhaps not yet shippable [Cloud computing is] nothing more than a faddish term for the established concept of computers linked by network. A cloud is water vapor (Larry Ellison, co-founder and CEO, Oracle Corporation, September 2009). Q: What is Oracle’s Cloud Computing strategy? A: Oracle has two cloud computing objectives. The first is to ensure that [it] is fully enterprise- grade to enable enterprise adoption. [...] The second [is] to support both public and private cloud computing to give customers choice (Oracle Cloud Computing FAQ, October 2010). The truth is rarely pure an never simple (Oscar Wilde, The importance of Being Earnest, 1895) 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School

Acknowledgements Davide Salomoni, Further Info: WNoDeS Web: 18/05/2011Elisabetta Ronchieri, EPIKH School, Beijing School