Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/i t PES Improving resilience of T0 grid services Manuel Guijarro.

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN-IT Plans on Virtualization Ian Bird On behalf of IT WLCG Workshop, 9 th July 2010.
Tunis, Tunisia, 28 April 2014 Business Values of Virtualization Mounir Ferjani, Senior Product Manager, Huawei Technologies 2.
-How To leverage Virtual Desktop for Manageability & Security -Desktop Computing “as a service” Andreas Tsangaris CTO, PERFORMANCE
By Sean Danko.  What is Virtualization  How does Virtualization Work  History of Virtualization  Why Should I Virtualize  Infrastructure  Advantages.
Towards High-Availability for IP Telephony using Virtual Machines Devdutt Patnaik, Ashish Bijlani and Vishal K Singh.
Virtual techdays INDIA │ 9-11 February 2011 Cross Hypervisor Management Using SCVMM 2008 R2 Vikas Madan │ Partner Consultant II, Microsoft Corporation.
Virtual Machines. Virtualization Virtualization deals with “extending or replacing an existing interface so as to mimic the behavior of another system”
European Organization for Nuclear Research Virtualization Review and Discussion Omer Khalid 17 th June 2010.
Virtualization for Cloud Computing
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Ewan Roche, Ulrich Schwickerath, Manuel Guijarro,
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
Tanenbaum 8.3 See references
Senior Design – Spring 2009 Richard Gory Focus: Networking & Web.
Real Security for Server Virtualization Rajiv Motwani 2 nd October 2010.
Copyright © 2010 Platform Computing Corporation. All Rights Reserved.1 The CERN Cloud Computing Project William Lu, Ph.D. Platform Computing.
CERN IT Department CH-1211 Genève 23 Switzerland t Virtualization with Windows at CERN Juraj Sucik, Emmanuel Ormancey Internet Services Group.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
ICT Day Term 4,  Virtualisation is growing in usage.  Current CPU’s are designed to support Virtualisation.  Businesses are looking at virtualisation.
Introduction to VMware Virtualization
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
Lets agree that an Optimized Data Center is not yet a Cloud +=
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 2.
Module 19 Managing Multiple Servers. Module Overview Working with Multiple Servers Virtualizing SQL Server Deploying and Upgrading Data-Tier Applications.
Virtual Machine and its Role in Distributed Systems.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Challenges towards Elastic Power Management in Internet Data Center.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Virtualization within FermiGrid Keith Chadwick Work supported by the U.S. Department of Energy under contract No. DE-AC02-07CH11359.
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Network as a Service Use cases for load balancing.
CERN IT Department CH-1211 Genève 23 Switzerland t Evolution of virtual infrastructure with Hyper-V Juraj Sucik, Slavomir Kubacka Internet.
Server Virtualization
Presented by: Reem Alshahrani. Outlines What is Virtualization Virtual environment components Advantages Security Challenges in virtualized environments.
©2015 EarthLink. All rights reserved. Private Cloud Hosting Create Your Own Private IT Environment.
Jose Castro Leon CERN – IT/OIS CERN Agile Infrastructure Infrastructure as a Service.
Virtualised Worker Nodes Where are we? What next? Tony Cass GDB /12/12.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
Private Cloud Hosting. IT Business Challenges I need to extend my on-premises virtualized environment to utilize the Cloud and manage the entire environment.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
VMware vSphere Configuration and Management v6
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN IT Department CH-1211 Genève 23 Switzerland PES 1 Ermis service for DNS Load Balancer configuration HEPiX Fall 2014 Aris Angelogiannopoulos,
CERN IT Department CH-1211 Genève 23 Switzerland t IT Configuration Activities Gavin McCance Online Cross-experiment Meeting, 14 June 2012.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Tim Bell 04/07/2013 Intel Openlab Briefing2.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
CERN IT Department CH-1211 Genève 23 Switzerland t SL(C) 5 Migration at CERN CHEP 2009, Prague Ulrich SCHWICKERATH Ricardo SILVA CERN, IT-FIO-FS.
Patricia Méndez Lorenzo Status of the T0 services.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Ideal information system - CMS Andrea Sciabà IS.
Ian Collier, STFC, Romain Wartel, CERN Maintaining Traceability in an Evolving Distributed Computing Environment Introduction Security.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
GPCF* Update Present status as a series of questions / answers related to decisions made / yet to be made * General Physics Computing Facility (GPCF) is.
CERN IT Department CH-1211 Genève 23 Switzerland The CERN internal Cloud Sebastien Goasguen, Belmiro Rodrigues Moreira, Ewan Roche, Ulrich.
Dag Toppe Larsen UiB/CERN CERN,
What is Virtualization Last Update
Infrastructure Orchestration to Optimize Testing
Dag Toppe Larsen UiB/CERN CERN,
EPAM Cloud Orchestration
Virtualization Dr. S. R. Ahmed.
Presentation transcript:

Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Improving resilience of T0 grid services Manuel Guijarro – IT/PES Steve Traylen– IT/PES EGI Community Forum 2012

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Outline Introduction One server, one application Virtualisation Service Consolidation DNS Load balancing Grid WMS Example Conclusion 2

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Introduction  Platform Support Section in IT-PES: Interactive Login Services and Batch Grid (mainly Computing) Services: –CEs, WMS, LB, VOMS, BDII, CVMFS, FTS, and LFC. Infrastructure Services: –Messaging Service –DNS Load Balancing Service –Service Consolidation Service –Internal Cloud Infrastructure 3

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Introduction II Grid Services not all HA by design. Need to increase their Availability Use in house infrastructure services: –Service Consolidation Service (Virtualisation) –DNS Load Balancing Service –Cheap solutions Do not provide real High Availability But greatly reduces down time of Grid Services 4

CERN IT Department CH-1211 Geneva 23 Switzerland t PES “one server, one application” Low Infrastructure Utilization –Typically one application per server to avoid the risk of vulnerabilities in one application affecting the availability of another application on the same server Increasing Physical Infrastructure Costs –Power consumption, cooling and facilities costs that do not vary with utilization levels Increasing IT Management Costs –Spend disproportionate time and resources on manual tasks associated with server maintenance, and thus require more personnel to complete these tasks Insufficient Failover and Disaster Protection –The threat of security attacks and natural disasters has elevated the importance of business continuity Operating System Application Server Operating System Application Server Operating System Application Server

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Virtualization Virtualization is the ability of running multiple independent virtual operating systems on a single physical computer

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Server consolidation Grid VOMS servers usage 7 CPU utilization –Grid VOMS cluster – March 2012

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Server consolidation Main advantages: –Multiple services in the same server –Hardware agnostic –No resources underutilization Operating System Application Server Operating System Application Server Operating System Application Server Computer Center (513) Hypervisor Server Hypervisor Server Hypervisor Server Computer Center (513) OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS ApAp ApAp ApAp ApAp ApAp ApAp ApAp ApAp OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS ApAp ApAp ApAp ApAp ApAp ApAp ApAp ApAp OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS ApAp ApAp ApAp ApAp ApAp ApAp ApAp ApAp 8

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Hypervisor Server Hypervisor Server Hypervisor Server Computer Center (513) OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS A1A1 A1A1 A1A1 A1A1 Ax OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS A1A1 A1A1 A2A2 A2A2 Hardware interventions Main advantages: –User transparent –No service degradation Hypervisor Server Hypervisor Server Hypervisor Server Computer Center (513) OSOS OSOS OSOS OSOS OSOS OSOS OSOS OSOS A1A1 A1A1 A1A1 A1A1 A1A1 A1A1 A2A2 A2A2 OSOS OSOS OSOS OSOS Ax OSOS OSOS OSOS OSOS Operating System Application 1 Server Operating System Application 1 Server Operating System Application 1 Server Computer Center (513) 9

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Virtualization tools There are different virtualization technologies: –XEN –KVM –Microsoft Hyper-V –VMware ESXi PES-PS tested XEN and currently we are using KVM and Microsoft Hyper-V 10

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Cloud Orchestration tools There are several cloud orchestration tools to build private clouds: –Openstack, OpenNebula, Platform ISF, Eucalyptus, Nimbus, Microsoft SCVMM, VMware vSphere,... PES-PS test(ed) Platform ISF, OpenNebula, Microsoft SCVMM and OpenStack For Service Consolidation currently using Microsoft SCVMM 11

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Is this the silver bullet? 90% of PES Grid services run on VMs Still some on real HWD (until it expires) Other saving excuse: –5-10% lost in CPU performance –20% lost on disk I/O –Overall performance still OK for most services Still exposed to (partial) interruptions: –OS or Grid Application upgrades –….. 12

CERN IT Department CH-1211 Geneva 23 Switzerland t PES DNS Load Balancing

CERN IT Department CH-1211 Geneva 23 Switzerland t PES WMS Example: setup 14 3 load-balancing DNS aliases for different configuration classes (“subclusters”) –SAM monitoring (wmssam.cern.ch), CMS (wmscms.cern.ch), other VOs (wmsshared.cern.ch) –Identical configuration for all nodes in a same subcluster (using central configuration mgmt) Node load taken into account to select a set of “best nodes” to be exposed in each DNS alias –Using metrics specific to WMS –Highly loaded nodes stop receiving new jobs Well supported by client software (gLite UI) –Users specify a single server name in their config: the DNS alias –DNS server returns a list of IP addresses for the alias –Client software randomly tries IP addresses from the list

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Benefits & limits Benefits –Flexibility: nodes can be added or removed from a DNS alias without users changing their configuration –Resource optimization: even load distribution on WMS nodes –Availability: highly loaded or sick nodes automatically removed from DNS alias –Transparent maintenance: nodes undergoing maintenance are not exposed to users But does not replace a full HA solution –Each job remains tied to a specific node (we use WMS+LB co-hosting) –WMS node unavailable = no job status update 15

CERN IT Department CH-1211 Geneva 23 Switzerland t PES Conclusion Service Consolidation via Virtualisation should become a common practise DNS Load balancing is cheap and helps The real challenge is ahead of us: –Running services in a(n) (internal) cloud –# of Nodes varies constantly –Dynamic Configuration becomes a must Will require service redesign for most of what we know. 16