Federico Calzolari 1, Silvia Arezzini 2, Alberto Ciampa 2, Enrico Mazzoni 2 1 Scuola Normale Superiore - Pisa, Italy 2 National Institute of Nuclear Physics.

Slides:



Advertisements
Similar presentations
Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.
Advertisements

Copyright © 2012 DataCore Software Corp. – All Rights Reserved. Practical High Availability NAS Cost-effective, non-stop disk access for clustered file.
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
A match made in heaven?. Who am I? Richard Barlow Systems Architect and Engineering Manager for the Virginia Credit Union Worked in IT for almost 20 years.
High Availability through Virtualization
High Performance Computing Course Notes High Performance Storage.
Exploiting SCI in the MultiOS management system Ronan Cunniffe Brian Coghlan SCIEurope’ AUG-2000.
Do MUCH More with Less Presented by: Jon Farley 2W Technologies.
Deployment Options Frank Bergmann
© 2005 DataCore Software Corp SANsymphony™ Application Support Services Fast iSCSI Boot Capability Disaster Recovery, Flexibility and Cost Savings DataCore,
Designing Storage Architectures for Preservation Collections Library of Congress, September 17-18, 2007 Preservation and Access Repository Storage Architecture.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
© 2010 VMware Inc. All rights reserved Data Protection Module 10.
1 Virtualization Services. 2 Cloud Hosting –Shared Virtual Servers –Dedicated Servers Managed Server Options Multiple Access Methods –EarthLink Business.
SAP on windows server 2012 hyper-v documentation
GDC Workshop Session 1 - Storage 2003/11. Agenda NAS Quick installation (15 min) Major functions demo (30 min) System recovery (10 min) Disassembly (20.
Methodologies, strategies and experiences Virtualization.
Paper on Best implemented scientific concept for E-Governance projects Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola.
Module 13: Configuring Availability of Network Resources and Content.
Quantitative Methodologies for the Scientific Computing: An Introductory Sketch Alberto Ciampa, INFN-Pisa Enrico Mazzoni, INFN-Pisa.
Los Angeles County eCloud Overview November 26, 2012.
May l Washington, DC l Omni Shoreham Nick Dobrovolskiy VP Parallels Open Platform May 19 th, 2008 Introducing Parallels Server.
Introduction to VMware Virtualization
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Sydney Region IT School Support Term Smaller Servers available on Contract.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Appendix B Planning a Virtualization Strategy for Exchange Server 2010.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 2.
1 © 2010 Overland Storage, Inc. © 2012 Overland Storage, Inc. Overland Storage The Storage Conundrum Neil Cogger Pre-Sales Manager.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
Storage Tank in Data Grid Shin, SangYong(syshin, #6468) IBM Grid Computing August 23, 2003.
Sandor Acs 05/07/
InstantGrid: A Framework for On- Demand Grid Point Construction R.S.C. Ho, K.K. Yin, D.C.M. Lee, D.H.F. Hung, C.L. Wang, and F.C.M. Lau Dept. of Computer.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
Virtualization for the LHCb Online system CHEP Taipei Dedicato a Zio Renato Enrico Bonaccorsi, (CERN)
©2015 EarthLink. All rights reserved. Private Cloud Hosting Create Your Own Private IT Environment.
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
LegendCorp What is System Center Virtual Machine Manager (SCVMM)? SCVMM at a glance Features and Benefits Components / Topology /
Private Cloud Hosting. IT Business Challenges I need to extend my on-premises virtualized environment to utilize the Cloud and manage the entire environment.
VMware vSphere Configuration and Management v6
(WINDOWS PLATFORM - ITI310 – S15)
HEP Computing Status Sheffield University Matt Robinson Paul Hodgson Andrew Beresford.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Final Implementation of a High Performance Computing Cluster at Florida Tech P. FORD, X. FAVE, K. GNANVO, R. HOCH, M. HOHLMANN, D. MITRA Physics and Space.
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.
From VMware to Proxmox Federico Calzolari Scuola Normale Superiore - INFN Pisa.
Office of Administration Enterprise Server Farm September 2008 Briefing.
Start out with questions? Does anyone work with Databases? Has anyone ever had their computer slow down, to a crash? Would it be more beneficial for your.
Liberty Mutual Group Asset Management Inc. Group Liberty Mutual Group Asset Management Inc. Business Continuity & Securing Your Data Our responsibilities.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Enterprise Vitrualization by Ernest de León. Brief Overview.
Open-E Data Storage Software (DSS V6)
Introduction to VMware Virtualization
Current Generation Hypervisor Type 1 Type 2.
Belle II Physics Analysis Center at TIFR
Virtualization OVERVIEW
Management of Virtual Machines in Grids Infrastructures
Enrico Bonaccorsi, (CERN) Loic Brarda, (CERN) Gary Moine, (CERN)
Welcome! Thank you for joining us. We’ll get started in a few minutes.
Virtualization Cloud and Fedora
Management of Virtual Machines in Grids Infrastructures
1. 2 VIRTUAL MACHINES By: Satya Prasanna Mallick Reg.No
Overview Introduction VPS Understanding VPS Architecture
Scuola Normale Superiore - INFN Pisa
Replibit.
Presentation transcript:

Federico Calzolari 1, Silvia Arezzini 2, Alberto Ciampa 2, Enrico Mazzoni 2 1 Scuola Normale Superiore - Pisa, Italy 2 National Institute of Nuclear Physics INFN - Pisa, Italy contact: Federico Calzolari 1, Silvia Arezzini 2, Alberto Ciampa 2, Enrico Mazzoni 2 1 Scuola Normale Superiore - Pisa, Italy 2 National Institute of Nuclear Physics INFN - Pisa, Italy contact: CHEP 2009 Prague Aims: A zero cost solution to the High availability problem. Requirements: Full exploitation of virtual environment features: start, stop and move virtual machines between physical hosts. Reliable shared storage infrastructure. Solution: Using virtualization, it is possible to achieve a redundancy system for all the services running on a data center, distributing the running virtual machines over the only up and running physical servers. Summary gridce.sns.it [SNS-Pisa Grid CE] crashes for system AM Scenario Grid data center Infrastructure: reliable shared Storage Unified management Local Controller and Monitoring service Installation tool: PXE technology Availability of all system components Spin-off: Host on-demand Host on-demand: basic concepts Virtualization and PXE architecture allows to bring up a server in a few minutes Possibility to offer host on-demand:  CPUn core  RAMn GB  DISKn TB  Operating System Linux [several distros], Windows  Middleware and Applications  for T time at the end of time T hosts will be erased! Spin-off: Host on-demand Host on-demand: basic concepts Virtualization and PXE architecture allows to bring up a server in a few minutes Possibility to offer host on-demand:  CPUn core  RAMn GB  DISKn TB  Operating System Linux [several distros], Windows  Middleware and Applications  for T time at the end of time T hosts will be erased! High Availability System design protocol that ensures a certain degree of operational continuity during a given period. High Availability System design protocol that ensures a certain degree of operational continuity during a given period. Virtualization Abstraction of computer resources. Abstraction layer that allows each physical server to run one or more virtual servers, decoupling operating system and applications from the underlying physical server. Virtualization Abstraction of computer resources. Abstraction layer that allows each physical server to run one or more virtual servers, decoupling operating system and applications from the underlying physical server. Classical solution Virtualized solution Operation in a real crash example Proposal RELAXED High availability service: A system able to restore any previously running application in less than ten minutes from the crash time. Proposal RELAXED High availability service: A system able to restore any previously running application in less than ten minutes from the crash time. Primary server Secondary server Pro & Contra  Zero cost solution  Server consolidation  Relaxed recovery time [~3 minutes]  Sessions are NOT kept alive Pro & Contra  Zero cost solution  Server consolidation  Relaxed recovery time [~3 minutes]  Sessions are NOT kept alive Outcomes RECOVERcrashedmachine in 3 min REINSTALLbrokenmachine in 9 min SNS-PISA is the first EGEE/LCG Grid node  fully virtualized (services + WN)  highly available  NO downtime after service crash Outcomes RECOVERcrashedmachine in 3 min REINSTALLbrokenmachine in 9 min SNS-PISA is the first EGEE/LCG Grid node  fully virtualized (services + WN)  highly available  NO downtime after service crash 3 Re-Cycle Finite state machine with Hysteresis  REBOOTVirtual Machine  RESTARTVirtual Layer  REINSTALLfrom scratch - PXE Finite state machine with Hysteresis  REBOOTVirtual Machine  RESTARTVirtual Layer  REINSTALLfrom scratch - PXE Goals  relaxed High Availability < 10 min  backup  each physical server can backup each virtual machine Goals  relaxed High Availability < 10 min  backup  each physical server can backup each virtual machine 3RC High Availability Project Requirements  Remote Redundant Controller  Reliable Storage:  SAN or NAS via FC or NFS  RAID over network DRBD Requirements  Remote Redundant Controller  Reliable Storage:  SAN or NAS via FC or NFS  RAID over network DRBD Experimental data Recovery time distribution Gaussian:mean181sec sigma10sec Reinstall time Gaussian:mean542sec sigma17sec NON Destructive test  overhead; shutdown DESTRUCTIVE test  rm /boot  dd 0 on filesystem  reboot crash test crash test Several redundancy strategies for several availability levels  Virtual machines/disks on external storage ►► problems if software crashes  Scheduled virtual machines dump: disk, ram, registers ►► scheduled dumps: T_{n-1}  Virtual machines ready to be mounted ►► virgin machine from disk copy  Install from scratch: operating system and middleware ►► virgin machine from real installation via PXE Several redundancy strategies for several availability levels  Virtual machines/disks on external storage ►► problems if software crashes  Scheduled virtual machines dump: disk, ram, registers ►► scheduled dumps: T_{n-1}  Virtual machines ready to be mounted ►► virgin machine from disk copy  Install from scratch: operating system and middleware ►► virgin machine from real installation via PXE