Operational procedures and tools for scheduled shutdowns at CC-IN2P3

Slides:



Advertisements
Similar presentations
Steve Lewis J.D. Edwards & Company
Advertisements

Systems Analysis and Design 9th Edition
J.B. Speed School of Engineering University of Louisville KEEPS Energy Management Toolkit Step 4: Create an Action Plan Toolkit 4: Create an Energy Management.
Development and Quality Plans
The Evolution of Fleet Management How do you get device information now? Manually? Configuration pages?
Configuration Management and Change Control Change is inevitable! So it has to be planned for and managed.
Preparing for Automation Dr. Dania Bilal IS 582 Spring 2007.
Web Site Steering Task Force Action Report Feb 2007.
State of Georgia Release Management Training
Compliance Management System. Intelex System Overview Focus Modules: –Permits Management –Monitoring & Measurement –Training Management –Document Control.
HNDIT23082 Lecture 10:Software Project Management
CHAPTER 2 SYSTEM PLANNING DFC4013 System Analysis & Design.
Start-SPPowerShell – Introduction to PowerShell for SharePoint Admins and Developers Paul BAker.
Web Technology Solutions
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Chapter 1 Computer Technology: Your Need to Know
Gu Minhao, DAQ group Experimental Center of IHEP February 2011
Task 4.2 Grid Coordination & DR Activation Module
Capstone Project W-CMS `.
Modern Systems Analysis and Design Third Edition
Stacy Kowalczyk WIC Technical Hour 3/30/2005
Software Project Configuration Management
Essentials of UrbanCode Deploy v6.1 QQ147
BUSINESS DRIVEN TECHNOLOGY
Machine operation and daily maintenance management in SOLEIL
Chapter 6: Database Project Management
Get to know SQL Manager SQL Server administration done right 
Testing Process Roman Yagodka ISS Test Leader.
PRESENTED BY MICHAEL PREMUZAK
Chapter 18 Maintaining Information Systems
IEEE Std 1074: Standard for Software Lifecycle
Cyril L’Orphelin (CC-IN2P3) COD-19, Bologna, March 30th 2009
Systems Analysis – ITEC 3155 Evaluating Alternatives for Requirements, Environment, and Implementation.
EOB Methodology Overview
Analysis Operations Monitoring Requirements Stefano Belforte
Task Management System (TMS)
CIT 470: Advanced Network and System Administration
University Event Management System
SharePoint Saturday Omaha April 2016
ITIL: Why Your IT Organization Should Care Service Support
SharePoint Administrative Communications Planning: Dynamic User Notifications for Upgrades, Migrations, Testing, … Presented by Robert Freeman (
ITIL: Why Your IT Organization Should Care Service Support
Chapter 1 (pages 4-9); Overview of SDLC
Gotcha! SharePoint Online Migration Mistakes to Avoid
Coventry, Jun, 2016 – Coordination of activities related to WP3 –
Organizational Consulting
[Work Order #] [ARB Date]
POP: Building Automation Around Secure Server Deployment
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
DPS-7/Espanola E-911 System Upgrade
1 Stadium Company Network. The Stadium Company Project Is a sports facility management company that manages a stadium. Stadium Company needs to upgrade.
AppointmentmentPeach Appointment Manager
Managing Large-scale Days of Service
Maintaining Information Systems (SAD- 18)
Health & Consumers DG SANCO Unit A.4 Information systems
ITIL: Why Your IT Organization Should Care Service Support
Chapter # 4 Development and Quality Plans
System Start-Up and Shutdown
Introduction to Web Authoring
Team 21: Project Design Team Members: Nathan Staley Steven Murray
The Survival Plan.
Modern Systems Analysis and Design Third Edition
Introduce myself & around table
Executive Project Kickoff
Horizon 2020 Project Grant Agreement
ESDEN - modernisation of data exchange in the ESS
Mid-term review: “CONTINUING REGIONAL SUPPORT FOR THE PERSISTENT ORGANIC POLLUTANTS (POPs) GLOBAL MONITORING PLAN (GMP2)” under the Stockholm Convention.
WORKSHOP Establish a Communication and Training Plan
Presentation transcript:

Operational procedures and tools for scheduled shutdowns at CC-IN2P3 Frédéric AZEVEDO, Philippe CORREIA, Thibaut SALANON 15 / 02 / 2017

When and why we started change management Developments Content When and why we started change management Developments Organizing a site maintenance using GOA Summary FJPPL 15/02/2017

Data center and Service provider Why we started change management Data center and Service provider A lot of services and components More or less dependencies within each others Common dependencies for everything : Power supplies Cooling system Network Since 2008 : 4 days of site maintenance per year Focus on high impact maintenances (implies service(s) outage) Major software updates Hardware renewal and upgrade Facility management components (chiller, power supplies, electrical components, …) Improve change management Avoid unexpected impacts on Services Have a global view on major changes Optimize downtime duration Improve communication (internal and external) FJPPL 15/02/2017

To help us with change management Developments To help us with change management A global CMDB (Configuration Management Data Base) Collect information from several sources Puppet DB, sysadmin DB, services configuration, … Including components and services dependencies Work still in progress Impact analysis tool based on the CMDB Web application Given a component, we can see the impact on others, on services and on users We can also guess possible causes for a given failure FJPPL 15/02/2017

Developments - Impact Analysis FJPPL 15/02/2017

Developments - GOA GOA Web application Admin LTE 2 for the web interface (based on Bootstrap 3) Symfony (PHP framework and components) DHMTLXGantt : Javascript and HTML5 Gantt library Main objective is to improve the way we manage high impact changes For each site maintenance Timeline with 28 pre-defined tasks (for organisation and communication) Allows anyone to ask for a change request Handles different roles (user / moderator) Gives a global view of what will be done Displays a Gantt diagram to make things easy to read FJPPL 15/02/2017

Timeline overview Collect the needs for changes Organizing a site maintenance using GOA Timeline overview Collect the needs for changes Analyse the impact on other services and on users/experiments Identify conflicts or opportunities Approve the maintenance requests with the management if needed Establish and approve the planning with the management and the steering committee On D-day coordination and internal/external communication Update the status for ongoing tasks Global review with all people involved in maintenance tasks 1 or 2 meetings 1 meeting FJPPL 15/02/2017

GOA – Main page FJPPL 15/02/2017

GOA – Timeline FJPPL 15/02/2017

GOA – Tasks list FJPPL 15/02/2017

GOA – Gantt FJPPL 15/02/2017

GOA – Final review Generally 2 days after, we have a final review Go through every task Identify difficulties See the gap between the time expected vs real FJPPL 15/02/2017

GOA – Final review FJPPL 15/02/2017

To improve our change management we choose to make some developments Summary To improve our change management we choose to make some developments CMDB to have a gathered view of all configurations items and their relations Impact analysis tool based on this CMDB Manuel check for now GOA : a web application Define a timeline Collect changes requests Approve and schedule requests View the “real time” status on D-day Final review of single tasks and of the global site maintenance Still some possibilities to automate things. Real meetings for discussions are still mandatory ! FJPPL 15/02/2017

Thank you for your attention. Any questions ? FJPPL 15/02/2017