PPD Computing “Business Continuity” David Kelsey 3 May 2012.

The RAL electrical work and risks
SSE will replace two old HV switch-boards in RAL main sub-station
  – Will take ~6 months from mid May 2012
Normally we have two 132 kV supplies and 11 kV transformers
  – One is sufficient to power RAL so we have a live spare
During the work
  – Only one transformer is live
  – If that fails we have no fast failover
  – But no digging allowed near the underground cables from Harwell
Estimated time for SSE to patch to second supply is <48 hours
Increased risk of power outages during this period
  – Increased risk is difficult to quantify
Bottom line
  – Need to plan for short breaks in electrical power and possibly up to ~48 hours
03/05/2012 Kelsey, PPD IT continuity

PPD Business Continuity planning
PPD has a Business Continuity Plan
  – Started with the Y2K problem
  – And Disaster Recovery plan
  – This is good practice and useful anyway
      E.g. What do we do if R1 burns down? Or RAL is closed for other reasons?
As part of this plan
  – PPD Computing Group has plans
  – for different time-scales
      1-2 days; ~1 week; several weeks or more
This is a good time to review and revise the plans!

If RAL power is off …

Services UP (generators):
Core network
  – Parts of R26, parts of R89
  – Off-site connections (JANET and DL)
CLRC Windows Domain
Exchange mail servers
VPN? (not yet sure?)
Also failover of some services to DL (e.g. Exchange servers)
  – We can VPN in to DL to access SSC services (from home)
Central STFC web server
  – For advice about RAL status

Most Services are DOWN:
Telephones
  – Landlines, Vodafone mast
Access control & gates
Fire Alarms
Catering
Water pumps
Many computer services
Etc etc etc
NO COFFEE :=(
RAL WILL BE SHUT!
  – Access only for small number of authorised staff

What will be down in PPD (R1)?
R1 will have no power
We (Computing Group) will not be here!
  – Unless coming in to retrieve machines and/or backups
Machine rooms will be down (we have no generators)
No PPD Windows or Linux servers (including file servers)
  – No H drive, no T drive, etc.
  – No web servers
PPD Windows domain will be down
No network
No printers
No Scientific Computing Tier 2/3 compute service
No dCache service
  – No access to scientific data
No video conferencing
Pointsec recovery will be unavailable

What is computing group doing?
Identifying those things that can be done now in advance
  – E.g. Check and test configuration of our UPS units (for orderly shutdown)
We will provide best efforts support to keep PPD working from homes or other institutes
  – But without PPD compute servers being up
Make changes in advance to help make laptops useable from elsewhere while PPD is down
  – E.g. Sophos (Windows) already reconfigured to failover to Sophos site for updates
Provide documentation in advance
  – How to re-configure devices
      Windows security updates etc
  – Advice on failover to Exchange at DL
  – Etc
  – To be automatically copied to laptops

What should PPD groups do?
We (CG) cannot make IT service plans for individuals or groups
Develop your own Business Continuity Plan
  – Only you know which services are critical
Establish communication means with all members of your group
  – Phone, non-STFC e-mail
Plan for lack of PPD computing services
  – Mission-critical software, data, computer power
      E.g. just before conferences!
Access to high-speed networking, videoconferencing, printing, web services not available
  – Negotiate alternative work locations for staff
This is all part of the wider PPD Business Continuity Plan

What do individuals need to do?
Have access to a laptop (or home PC)
Have a copy of all important files (H and T drives)
  – E.g. via Windows Offline Files
  – or rsync copy on Macs
  – And paper files from your office!
Have current documentation and contact details
For regular PPD Tier 3 analysis users
  – Make a plan
      What data do you need? How much CPU? Can you submit elsewhere? (the Grid or CERN or Amazon?)
  – Do not leave everything until the very last minute :=)
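
The "rsync copy" of the H and T drives mentioned above could be scripted roughly as follows. This is only a minimal sketch, not the Computing Group's procedure: the mount points (`/Volumes/H`, `/Volumes/T`), the local folder name `ppd-offline-copy` and both function names are hypothetical, and it assumes `rsync` is installed and the drives are already mounted. It defaults to a dry run so that nothing is copied until the file list has been checked.

```python
import subprocess
from pathlib import Path


def build_rsync_command(source: Path, destination: Path, dry_run: bool = False) -> list[str]:
    """Return the rsync argument list that copies the contents of `source` into `destination`."""
    # --archive keeps timestamps and permissions; --partial keeps partially
    # transferred files so an interrupted copy can be resumed.
    command = ["rsync", "--archive", "--partial"]
    if dry_run:
        # --dry-run only reports what would be transferred.
        command.append("--dry-run")
    # A trailing slash on the source tells rsync to copy the directory's
    # contents rather than the directory itself.
    command.append(str(source) + "/")
    command.append(str(destination))
    return command


def copy_drives(mounts: dict[str, Path], local_root: Path, dry_run: bool = True) -> None:
    """Run one rsync per mounted drive, e.g. {"H": Path("/Volumes/H")} (hypothetical path)."""
    for name, mount_point in mounts.items():
        target = local_root / name
        if not dry_run:
            # Only create the local folder when files will really be copied.
            target.mkdir(parents=True, exist_ok=True)
        subprocess.run(build_rsync_command(mount_point, target, dry_run), check=True)
```

For example, `copy_drives({"H": Path("/Volumes/H"), "T": Path("/Volumes/T")}, Path.home() / "ppd-offline-copy")` would list what a copy would transfer; passing `dry_run=False` performs it. Re-running the same call later only transfers files that have changed.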

Communication
Cascade: STFC senior management -> Director -> Div Heads -> Group leaders -> all staff
Collect and store important contact details
  – Phone numbers
  – Non-STFC addresses
  – Contact details for Computing Group
  – And not just kept on the PPD file server!

PPD IT Forum
A meeting of the “PPD IT Forum” (i.e. All Staff and Visitors welcome!) planned for
  – Thursday 17th May 2012
  – CR03 R61
  – 11:00 to 12:30
To present more details and discuss issues and concerns
Please come!