Pablo Pinés León – FTEC 2016 Program

Slides:



Advertisements
Similar presentations
Deployment Adavatages Provisioning time reduced to minutes, not days to weeks! Configure hardware Install OS Configure OS & Tools Assign IP Addr Configure.
Advertisements

Intel® Manager for Lustre* Lustre Installation & Configuration
High Availability through Virtualization
Introduction to Network Administration. Objectives.
OpenStack High Availability
Private Cloud or Dedicated Hosts Mason Mabardy & Matt Maples.
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
Implementing Failover Clustering with Hyper-V
Copyright © 2002 Wensong Zhang. Page 1 Free Software Symposium 2002 Linux Virtual Server: Linux Server Clusters for Scalable Network Services Wensong Zhang.
Paper on Best implemented scientific concept for E-Governance Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola By Nitin V. Choudhari, DIO,NIC,Akola.
Microsoft ® Application Virtualization 4.6 Infrastructure Planning and Design Published: September 2008 Updated: February 2010.
Client Management. Introduction In a typical organization there are a lot of client machines used for day to day operations Client management is a necessary.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Paper on Best implemented scientific concept for E-Governance projects Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola.
Sumit Kumar Archana Kumar Group # 4 CSE 591 : Virtualization and Cloud Computing.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Implementing Multi-Site Clusters April Trần Văn Huệ Nhất Nghệ CPLS.
Linux & UNIX Version 5.3 (Power) Version 6.1 (Power) Version 7.1 (Power) AIX Version 11iv2 (PA-RISC/IA64) Version 11iv3 (PA-RISC/IA64) HP-UX Version.
Microsoft ® Application Virtualization 4.6 Infrastructure Planning and Design Published: September 2008 Updated: November 2011.
Installation and Integration of Virtual Clusters onto Pragma Grid NAIST Nara, Japan Kevin Lam 06/28/13.
Submitted by: Shailendra Kumar Sharma 06EYTCS049.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
ITServices Virtualization Terry Black January 2013.
ALMA Integrated Computing Team Coordination & Planning Meeting #4 Santiago, November 2014 Reporting Software Principles M Chavan, ESO R Soto, ALMA.
Mike Gore School of Computer Science. This talk is on the Web in TWIKI format A more detailed copy of this talk can be access on with web page. Including.
 High-Availability Cluster with Linux-HA Matt Varnell Cameron Adkins Jeremy Landes.
 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
NTU Cloud 2010/05/30. System Diagram Architecture Gluster File System – Provide a distributed shared file system for migration NFS – A Prototype Image.
CERN IT Department CH-1211 Genève 23 Switzerland t IT Configuration Activities Gavin McCance Online Cross-experiment Meeting, 14 June 2012.
Computer Science 340 Software Design & Testing Software Architecture.
Construction of Computational Segment at TSU HEPI Erekle Magradze Zurab Modebadze.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
© ExplorNet’s Centers for Quality Teaching and Learning 1 Explain the purpose of Microsoft virtualization. Objective Course Weight 2%
High-Availability MySQL with DR:BD and Heartbeat: MTV Japan mobile services ©2008 MTV Networks Japan K.K.
MySQL HA An overview Kris Buytaert. ● Senior Linux and Open Source ● „Infrastructure Architect“ ● I don't remember when I started.
1 High Availability in 37 Easy Steps Tim Serong Senior Clustering Engineer
High Availability Low Dollar Clustered Storage
CERN IT Department CH-1211 Genève 23 Switzerland PES Version control services at CERN Status of Version Control and Twiki services at CERN.
High Availability Clusters in Linux Sulamita Garcia EDS Unix Specialist
RHEV Platform at LHCb Red Hat at CERN 17-18/1/17
Web application hosting with Openshift, and Docker images
IT Services Katarzyna Dziedziniewicz-Wojcik IT-DB.
Web application hosting with Openshift, and Docker images
Dag Toppe Larsen UiB/CERN CERN,
Dag Toppe Larsen UiB/CERN CERN,
Diskpool and cloud storage benchmarks used in IT-DSS
Software Defined Storage
Installing OS.
Windows in BE-CO Terminal Servers, VirtualPCs, Consoles
Cross Community CI (XCI)
Cluster Communications
A Complete Internet Company
HEPiX Fall 2017 CERN project Follow-up
Virtualization Cloud and Fedora
Dumps PDF Insstallation, Storage, and Compute with Windows Server 2016 Exam Code: Exam Name:
Welcome To : Group 1 VC Presentation
3.2 Virtualisation.
DHCP, DNS, Client Connection, Assignment 1 1.3
High Availability Low Dollar Clustered Storage
Windows Server 2016 Software Defined Storage
ODP Global node cluster model
Lab 1: Getting Started.
OpenStack Summit Berlin – November 14, 2018
Module 1: Overview of Systems Management Server 2003
CCNA 4 v3.1 Module 6 Introduction to Network Administration
KX-NS1000 Step by Step Guide Network Survivability
Presentation transcript:

Pablo Pinés León – FTEC 2016 Program BE5002 - Investigate redundant network filesystem technologies for CERN's Accelerator Control System servers Pablo Pinés León – FTEC 2016 Program

Introduction BE-CO-IN is responsible for the infrastructure of CERN’s accelerators control servers ACC-ADM team There is currently one dedicated NFS server for each of 4 subsystems Around 1500 being served in total An NFS server unavailability can affect many client machines Client machines cannot be rebooted Recovery can take significant effort (restart required) High Availability solution would be a clear improvement This infrastructure can be used for other services too

First steps Learning Ansible and git Evaluation of high-availability technologies E-groups management Benchmarking of physical and virtual systems (CPU/Storage/Network) Support on 2FA and deployment of servers on the Technical Network

investigation Previous effort with GlusterFS not completely successful CEPH suggested It was deemed not adequate due to its resources requirements Alternatives explored though online search (HA-NFS, Mars,…) Active/Active solution considered (GFS, OCFS2) Decision was taken to use Pacemaker and its associated stack Lots of documentation Support from big companies (Red Hat, Suse…)

Architecture Design Target OS is CentOS 7 Active/Passive on a 2 nodes setting 4 nodes configuration investigated Requires a newer software version for which no precompiled packages exist More complicated set up, specially in terms of connections 2 nodes was considered a “good enough” improvement Heartbeat deprecated in CentOS 7

Software Stack Virtual IP Active Sync NFS Master Slave Filesystem Ext4 DRBD Primary Secondary Resource Manager Pacemaker Messaging Corosync Hardware Node 1 Node 2

Test environments First on SLC6 machine on Openstack Network problems (multicast, virtual IP) Then VirtualBox on my Workstation First on SLC6, Python version too old Tried on Ubuntu, successful CentOS 7 successful too Currently, two Physical Machines (workstations) Floating Virtual IP Significant issues encountered

Cluster Status view

Virtual Personal Computers VPCs are VMs for developers in the accelerator servers on CERN’s Openstack BE, TE, EN developers ACC Operational support Support on the creation, configuration and troubleshooting Issues with new templates New ways to create VMs and meet users needs Phasing out old infrastructure Dealing with other groups Monitoring on DIAMON

NEXT STEPs Implement the solution on real servers Add more services Test dedicated connection for DRBD Add more services Continue the work on VPCs 3 screens configuration for CentOS 7 White Hat Challenge Egroups management and other ACC-ADM tasks

Any questions?