Virtual Machines on BiG Grid INFN Annual Meeting May 2010 Sander Klous, Nikhef.

Slides:



Advertisements
Similar presentations
Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Advertisements

© 2012 Open Grid Forum Simplifying Inter-Clouds October 10, 2012 Hyatt Regency Hotel Chicago, Illinois, USA.
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
HEPiX Virtualisation Working Group Status, July 9 th 2010
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
VMs at a Tier-1 site EGEE’09, Sander Klous, Nikhef.
EGI-Engage Recent Experiences in Operational Security: Incident prevention and incident handling in the EGI and WLCG infrastructure.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
David Groep Nikhef Amsterdam PDP & Grid Traceability in the face of Clouds EGI-GEANT Symposium – cloud security track With grateful thanks for the input.
EGI-Engage Recent Experiences in Operational Security: Incident prevention and incident handling in the EGI and WLCG infrastructure.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Federated Cloud F2F Security Issues in the cloud Introduction Linda Cornwall,
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks David Kelsey RAL/STFC,
Virtual Workspaces Kate Keahey Argonne National Laboratory.
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
Virtualised Worker Nodes Where are we? What next? Tony Cass GDB /12/12.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
EGI-Engage Recent Experiences in Operational Security: Incident prevention and incident handling in the EGI and WLCG infrastructure.
Trusted Virtual Machine Images a step towards Cloud Computing for HEP? Tony Cass on behalf of the HEPiX Virtualisation Working Group October 19 th 2010.
Security Policy Update David Kelsey UK HEP Sysman, RAL 1 Jul 2011.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Security Vulnerabilities in A Virtual Environment
Security Policy: From EGEE to EGI David Kelsey (STFC-RAL) 21 Sep 2009 EGEE’09, Barcelona david.kelsey at stfc.ac.uk.
JSPG Update David Kelsey MWSG, Zurich 31 Mar 2009.
HEPiX Virtualisation Working Group Status, February 10 th 2010 April 21 st 2010 May 12 th 2010.
Why a Commercial Provider should Join the Academic Cloud Federation David Blundell Managing Director 100 Percent IT Ltd Simple, Flexible, Reliable.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Draft Security Virtualisation Policy (for Romain Wartel – CERN) EGI Technical.
Ian Collier, STFC, Romain Wartel, CERN Maintaining Traceability in an Evolving Distributed Computing Environment Introduction Security.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Questionnaires to Cloud technology providers and sites Linda Cornwall, STFC,
Trusted Virtual Machine Images the HEPiX Point of View Tony Cass October 21 st 2011.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
INFSO-RI Enabling Grids for E-sciencE Grid & Cloud Computing Introduction
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI John Gordon EGI Virtualisation and Cloud Workshop Amsterdam 13 th May 2011.
The HEPiX Virtualisation Working Group Towards a Grid of Clouds Tony Cass CHEP 2012 May 24 th 2012.
HEPiX Virtualisation working group Andrea Chierici INFN-CNAF Workshop CCR 2010.
INFSO-RI Enabling Grids for E-sciencE Workshop WLCG Security for Grid Sites Louis Poncet System Engineer SA3 - OSCT.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
1 EGI Federated Cloud Architecture Matteo Turilli Senior Research Associate, OeRC, University of Oxford Chair – EGI Federated Clouds Task Force
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
New Paradigms: Clouds, Virtualization and Co.
WLCG IPv6 deployment strategy
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
C Loomis (CNRS/LAL) and V. Floros (GRNET)
Sviluppi in ambito WLCG Highlights
Cloud Challenges C. Loomis (CNRS/LAL) EGI-TF (Amsterdam)
HEPiX Virtualisation working group
Dag Toppe Larsen UiB/CERN CERN,
Dag Toppe Larsen UiB/CERN CERN,
ATLAS Cloud Operations
Ian Bird GDB Meeting CERN 9 September 2003
StratusLab Final Periodic Review
StratusLab Final Periodic Review
LCG/EGEE Incident Response Planning
How to enable computing
Grid Computing.
Short update on the latest gLite status
WLCG Collaboration Workshop;
Management of Virtual Execution Environments 3 June 2008
Leigh Grundhoefer Indiana University
Fundamental Concepts and Models
ONAP Architecture Principle Review
Presentation transcript:

Virtual Machines on BiG Grid INFN Annual Meeting May 2010 Sander Klous, Nikhef

Contents Introduction Who are we? What do we do? Motivation Why, What and How with VMs? Status Challenges Conclusions May 2010BIG Grid - Virtualization working group2

Introduction Collaboration between NCF: national computing facilities Nikhef: national institute for subatomic physics NBIC: national bioinformatics center Participation from Philips, SARA, etc. Goal: “Enable access to grid infrastructures for scientific research in the Netherlands” May 2010BIG Grid - Virtualization working group3

BiG Grid VM Working Group Representatives from: User Communities/Support, Operations and Security, Management, “Independent Chair” Recipe for disaster Charge for phase 1: “Provide a design and Proof-of-Concept for virtualized worker nodes that fulfill requirements of different user communities.” Preliminary result: Extensive report on all aspects that need to be addressed before BiG Grid accepts VMs. May 2010BIG Grid - Virtualization working group4

Motivation: Why Virtual Machines? Site perspective Resource flexibility (e.g. SL4 / SL5) Resource management Scheduling / multi-core / sandboxing User perspective Isolation from environment Identical environment on multiple sites Identical environment on local machine May 2010BIG Grid - Virtualization working group5

Different VM classes Class 1: Site generated Virtual Machines No additional trust issues Benefits for system administration Class 2: Certified Virtual Machines Inspection and certification to establish trust Requirements for monitoring / integration Class 3: User generated Virtual Machines No trust relation Requires appropriate security measures May 2010BIG Grid - Virtualization working group6

STATUS Different VM Classes May 2010BIG Grid - Virtualization working group7

Resource management Site infrastructure Typical use case Class 1 VM May 2010BIG Grid - Virtualization working group8 Torque/PBS Box 2 “8 Virtual SL4 WNs” Box 3 “8 Virtual SL5 WNs” Virtual Machine Manager Job queue VM queue Box 1 “Normal WN”

Typical use case Class 2 VM Analysis on Virtual Machines Run minimal analysis on desktop/laptop Access to grid services Run full analysis on the grid Identical environment Identical access to grid services No interest to become system administrator Standard experiment software is sufficient May 2010BIG Grid - Virtualization working group9

Between Class 1 and 2/3: WNoDeS May 2010BIG Grid - Virtualization working group10

HEPiX VM working group Chaired by Tony Cass (CERN) Representatives from site operations Charge: “Enable virtual machine images created at one site to be used at other HEPiX (and WLGC) sites.” Preliminary results: Draft security policy VM Image Catalog (straw man model and interface definition) May 2010BIG Grid - Virtualization working group11

Draft Security Policy May 2010BIG Grid - Virtualization working group12 “The aim is to enable Grid Sites to trust and instantiate endorsed VM images that have been generated elsewhere.” Main points: VM combined image split in base image (root install), VO environment (user install) Complete image is endorsed by an individual to confirm that it respects the policy requirements Extendible with a site local policy

May 2010BIG Grid - Virtualization working group13

BiG Grid Policy Addendum VO environment endorsed by VO person Complete image endorsed by non-VO person Nikhef endorsers will defer responsibilities and liabilities for VO environment to VO person. Written and signed deferral of all responsibilities and liabilities of VO environment, including: Consequences of possible security leaks Consequences of possible license violations May 2010BIG Grid - Virtualization working group14

Other applicable policies Grid Security Policy, version 5.7a VO Portal Policy, version 1.0 (draft) Big Grid Security Policy, version Grid Acceptable Use Policy, version 3.1 Grid Site Operations Policy, version 1.4a LCG/EGEE Incident Handling and Response Guide, version 2.1 Grid Security Traceability and Logging Policy, version 2.0 VO-Box Security Recommendations and Questionnaire, version 0.6 (draft, not ratified) May 2010BIG Grid - Virtualization working group15

Relevant policy statements Network security is covered by site local security policies and practices A VO Box is part of the trusted network fabric. Privileged access is limited to resource administrators Software deployed in the grid must include sufficient and relevant site central logging. May 2010BIG Grid - Virtualization working group16

VM Image Catalog Endorsed means: Image created according to HEPiX policy requirements Endorser responsible/liable Approved means: Site will run this image Acceptable endorser Acceptable Meta Data May 2010BIG Grid - Virtualization working group17

May 2010BIG Grid - Virtualization working group18

How does this fit in the gLite middleware? May 2010BIG Grid - Virtualization working group19 Argus Authorization framework

Local Policy Enforcement May 2010BIG Grid - Virtualization working group20 = Argus framework

Execution Environment Service May 2010BIG Grid - Virtualization working group21

Deployment A lot of similarities with WNoDeS, Nimbus Additional benefits compared to WNoDeS Policy enforcement with standard middleware Modular: EES middleware sets environment OpenNebula is a standard VM Manager Disadvantages Not operational yet, scalability unproven May 2010BIG Grid - Virtualization working group22

Typical use case Class 3 VM Identification and classification of GPCRs Requires very specific software set Blast HMMER BioPython1.50 Even non-x86 (binary) applications! Specific software for a single user No common experiment software May 2010BIG Grid - Virtualization working group23

Cloudia project, SARA Make separate DMZ for Class 3 VMs SSH accessible (Amazon like, OpenNebula based) Comparable to “Guest networks” Only outbound connectivity Detection of compromised guests Extended security monitoring Packet inspection, netflows, hypervisor monitoring Small, but upgrade expected (500 kEuro) Floris Sluiter: May 2010BIG Grid - Virtualization working group24

Already made HPC weekly May 2010BIG Grid - Virtualization working group25

CHALLENGES Scalability, Reliability, Maintainability, Usability, *bilities May 2010BIG Grid - Virtualization working group26

Class 2/3 upload solution iSCSI/LVM Distributing VM images May 2010BIG Grid - Virtualization working group27 Box 3 “8 Class 2/3 VMs” Box 1 “Normal WN” Box 2 “8 Virtual WNs” Repository (SAN) Image

Cached copy-on-write May 2010BIG Grid - Virtualization working group28 Box 1 Repository Cache Image COW VM Box 2 Cache Image COW VM Image

Hybrid solution: integration May 2010BIG Grid - Virtualization working group29 Class 2 VMs Class 3 VMs Cloudia Interface

Reliability / Usability Experiences from WNoDeS and Cloudia EGI focus point See position paper on CloudScape II, _and_Professional_Profiles.pdf Industry developments Grid was so far a purely scientific endeavor Keep a close eye on industry for all the *bilities May 2010BIG Grid - Virtualization working group30

Conclusions Integration of Grids and VMs is a hot topic WNoDeS, HEPiX, BiG Grid, OpenNebula, etc. Many different approaches Trusted/Untrusted, Grid like/Cloud like Scalability, reliability, maintainability, usability remain issues of primary concern. Cloud like solutions: do they scale? Grid like solutions: are they user friendly? May 2010BIG Grid - Virtualization working group31