Can Your Data Center Recover from a Disaster?. Presenters Rick Boyer Senior Technology Sales Specialist Technology Sales Group GE Healthcare IT

Slides:

Advertisements

Similar presentations

1 The Basics of Business Continuity Presented by Mary F. Sandy, CBCP Business Continuity/Disaster Recovery Class DePaul University ©Mary F. Sandy, 2006.

Advertisements

Enterprise Architectures

Business Plug-In B4 MIS Infrastructures.

Protect Your Business and Simplify IT with Symantec and VMware Presenter, Title, Company Date.

Maximizing Uptime and Your Firm's Bottom Line: Understanding risk and budget when evaluating business continuity & disaster recovery protocols Michael.

Module – 9 Introduction to Business continuity

Business Continuity Section 3(chapter 8) BC:ISMDR:BEIT:VIII:chap8:Madhu N PIIT1.

VERITAS Confidential Disaster Recovery – Beyond Backup Jason Phippen – Director Product and Solutions Marketing, EMEA.

1 Vladimir Knežević Microsoft Software d.o.o.. 80% Održavanje 80% Održavanje 20% New Cost Reduction Keep Business Up & Running End User Productivity End.

1 Disaster Recovery “Protecting City Data” Ron Bergman First Deputy Commissioner Gregory Neuhaus Assistant Commissioner THE CITY OF NEW YORK.

Cloud Disaster Recovery. Typical Business Challenges How much does it cost me to have my IT environment off-line, and how quickly does my disaster recovery.

© 2009 EMC Corporation. All rights reserved. Introduction to Business Continuity Module 3.1.

VERITAS Confidential Copyright © 2002 VERITAS Software Corporation. All Rights Reserved. VERITAS, VERITAS Software, the VERITAS logo, and all other VERITAS.

1 Disk Based Disaster Recovery & Data Replication Solutions Gavin Cole Storage Consultant SEE.

1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.

Keith Burns Microsoft UK Mission Critical Database.

McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Business Plug-In B4 Enterprise Architecture.

Managing Cybersecurity Risk in the Emerging Healthcare Eco-System

1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.

CENTRICITY SUPPORT PRO-TIPS Pro-Tips for making Centricity life easier.

John Graham – STRATEGIC Information Group Steve Lamb - QAD Disaster Recovery Planning MMUG Spring 2013 March 19, 2013 Cleveland, OH 03/19/2013MMUG Cleveland.

CHAPTER OVERVIEW SECTION 5.1 – MIS INFRASTRUCTURE

November 2009 Network Disaster Recovery October 2014.

© 2010 Plexent – All rights reserved. 1 Change –The addition, modification or removal of approved, supported or baselined CIs Request for Change –Record.

Treasury in the Cloud Bob Stark – Vice President, Strategy September 17, 2014.

Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.

Centricity ™ LIVE 2015 Technology & serviceability… The new game Chris Jackson February 16, 2015.

IT Business Continuity Briefing March 3,  Incident Overview  Improving the power posture of the Primary Data Center  STAGEnet Redundancy  Telephone.

Business Continuity and Disaster Recovery Chapter 8 Part 2 Pages 914 to 945.

CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES

Chapter 2: Non functional Attributes.  It infrastructure provides services to applications  Many of these services can be defined as functions such.

David N. Wozei Systems Administrator, IT Auditor.

What is Ubiquity™? ubiquity (noun): the state of being, or seeming to be, everywhere at once Ubiquity™ is a powerful building management system that.

The Ins and Outs of ICD10 in Centricity™ Group Management May, 2015.

Co-location Sites for Business Continuity and Disaster Recovery Peter Lesser (212) Peter Lesser (212) Kraft.

1 Track Owners: Zinc Instructions Please update the chart to the right before submitting the presentation into Active Following review and approval in.

©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Disaster Recovery Michael Leonard.

Module 9 Planning a Disaster Recovery Solution. Module Overview Planning for Disaster Mitigation Planning Exchange Server Backup Planning Exchange Server.

Image consolidation: Helping transform department silos with vendor neutral solutions April 29, 2015.

Improving Workflow Efficiency with the Centricity™ Business Dictionary Sync Utility April 29 – May 2, 2015.

Future of the Server Room Tour. Ottawa Montreal Calgary Vancouver Toronto Future of Your Server Room Three Pillars of Windows Server 2008 Virtualization.

Enterprise Imaging Roundtable “Just what does a collaborative care network look like…?” Leslie Weisenbacher, Tom Zimmerman, Barbara Mullarky May, 2015.

Module 13 Implementing Business Continuity. Module Overview Protecting and Recovering Content Working with Backup and Restore for Disaster Recovery Implementing.

E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.

Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.

Application Center 2000 at Microsoft A solution for managing high availability Web applications built on Windows 2000 Published April 2002.

Slide 1 Security Engineering. Slide 2 Objectives l To introduce issues that must be considered in the specification and design of secure software l To.

Data Center Management Microsoft System Center. Objective: Drive Cost of Data Center Management 78% Maintenance 22% New Issue:Issue: 78% of IT budgets.

Component 8/Unit 9aHealth IT Workforce Curriculum Version 1.0 Fall Installation and Maintenance of Health IT Systems Unit 9a Creating Fault Tolerant.

CBIZ RISK & ADVISORY SERVICES BUSINESS CONTINUITY PLANNING Developing a Readiness Strategy that Mitigates Risk and is Actionable and Easy to Implement.

가상화 기반의 Workload 관리솔루션 : FORGE PlateSpin Virtualization and Workload Management 나영관 한국노벨 /

This courseware is copyrighted © 2016 gtslearning. No part of this courseware or any training material supplied by gtslearning International Limited to.

Business Continuity Planning 101

Azure Site Recovery For Hyper-V, VMware, and Physical Environments

Enterprise Architectures

Providing Application High Availability

Server Upgrade HA/DR Integration

Module – 9 Introduction to Business continuity

Business Continuity / Recovery

Maximum Availability Architecture Enterprise Technology Centre.

Test Upgrade Name Title Company 9/18/2018 Microsoft SharePoint

Audit Plan Michelangelo Collura, Folake Stella Alabede, Felice Walden, Matthew Zimmerman.

Audit Planning Presentation - Disaster Recovery Plan

Microsoft Azure P wer Lunch

TechEd /15/2018 8:51 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.

Business Continuity Technology

Using the Cloud for Backup, Archiving & Disaster Recovery

OPS-7: Building and Deploying a Highly Available Application

IT SERVICE CONTINUITY PLAN

Presentation transcript:

Can Your Data Center Recover from a Disaster?

Presenters Rick Boyer Senior Technology Sales Specialist Technology Sales Group GE Healthcare IT Tim Darling Senior Technology Sales Specialist Technology Sales Group (TSG) GE Healthcare IT

©2015 General Electric Company – All rights reserved. The results expressed in this document may not be applicable to a particular site or installation and individual results may vary. This document and its contents are provided to you for informational purposes only and do not constitute a representation, warranty or performance guarantee. GE disclaims liability for any loss, which may arise from reliance on or use of information, contained in this document. All illustrations are provided as fictional examples only. Your product features and configuration may be different than those shown. Information contained herein is proprietary to GE. No part of this publication may be reproduced for any purpose without written permission of GE. DESCRIPTIONS OF FUTURE FUNCTIONALITY REFLECT CURRENT PRODUCT DIRECTION, ARE FOR INFORMATIONAL PURPOSES ONLY AND DO NOT CONSTITUTE A COMMITMENT TO PROVIDE SPECIFIC FUNCTIONALITY. TIMING AND AVAILABILITY REMAIN AT GE’S DISCRETION AND ARE SUBJECT TO CHANGE AND APPLICABLE REGULATORY CLEARANCE. GE, the GE Monogram, Centricity, and imagination at work are trademarks of General Electric Company. All other product names and logos are trademarks or registered trademarks of their respective companies. General Electric Company, by and through its GE Healthcare division. 3

Agenda Definitions Cost of downtime Risks – causes of downtime Proactive planning for availability Planning for disasters - Failure and outage scenarios Q & A

Definitions

Business continuity – A general term that refers to the planning and design required to maintain the highest level of service consistency, availability, and recoverability Fault tolerance – Totally redundant system and application design with near 100% availability For example, the Centricity Enterprise Non-Stop is a fault tolerant hardware and software platform designed to provide 5 or 6 “9’s” uptime High-availability – Designing an IT infrastructure with minimal single points of failure, includes clustering or failover features

Definitions Disaster – Several definitions we’ve found include: A natural or man-made catastrophic event resulting in severe damage and/or loss of life; e.g. “Sandy” A crisis causing widespread damage which far exceeds the ability to easily recover A calamitous event causing considerable outage and/or unrecoverable disruption of services

Definitions Disaster recovery – The design and preparation needed to recover from a disaster within an organization’s recovery guidelines, goals, and objectives as defined by: RPO – Recovery Point Objective – Amount of data loss your business can sustain based on your business’ recovery rules RTO – Recovery Time Objective – Amount of downtime your business is capable of enduring

Definitions MTD – Maximum Tolerable Downtime* Maximum perceived downtime your users experience when trying to use a system that is available, or, length of time access to the applications users need to perform their job is unavailable (aka – the maximum downtime allowed by management before you get fired) * Thanks to Dave Hoffman from IBM Enterprise 2014

Cost of Downtime

Cost of downtime Questions for the audience: How quickly does your management expect your applications and systems to recover from an outage? RTO How much data is your business willing to lose during an outage? RPO What do you think your organization’s MTD is? How much do you think downtime costs your business per hour or per day, or does anyone in the audience know?

Cost of downtime Typical RTO and RPO after unplanned outage High-availability (clustered) system: RTO – approximately 60 sec to 10 min, RPO minimal data loss – depends on replication method & cause of outage * Disaster recovery with hot site: RTO – approximately 1 hour to >4 hours, RPO 24 hours * * Note: recovery depends on how your databases and systems will resume after a failure, and how quickly your IT staff can initialize failover processes and/or DR plans. No guarantees are provided or implied.

Cost of downtime Some estimates range from $50K/hr to an average of $200K/hr¹ or more, depending on type and size of the business – varies by business segment Some costs cannot be estimated Loss of reputation and “bad press” in healthcare, an outage can potentially result in loss of life Perception of downtime varies by role within any organization Management, front desk users, clinicians, etc. ¹From Availabilitydigest.com, : - page 3http://

Cost of downtime

From GE Web site:

Cost of downtime According to a Healthcare IT News article on 12/4/2013 entitled: Data center outages come with whopping $8K per minute price tag*: “Healthcare organizations face average costs of $690,000 per outage incident, according to the findings of a new Ponemon Institute/Emerson Network Power report, roughly a 41 percent increase since Larger groups with more extensive IT systems, however, could pay out nearly $1.74 million per incident” * pricetag?topic=06,17,19

Cost of downtime Developing a solid ROI will help to justify budgeting for a Business Continuity plan Wide range of options are available, but first you need to have reasonable estimates of what downtime costs your business Example: 9 hours of unplanned outage ( 3 “9’s” or 99.9% $200K per hour = $1.8M of financial exposure per year Unfortunately, most customers experience more unplanned downtime per year so it is the IT manager/director’s job to estimate annual exposure to outages so a reasonable ROI can be developed

Risks – Causes of Downtime

Risks – causes of downtime Operator error or human behavior Software or Application failures Servers, storage, power/AC, networks, physical security failures Risk aversion planning IT processes Testing Quality control Redundancy DR planning Service Contracts Facility Mgmt Monitoring Training Security

Proactive planning for availability

Application and/or software failures Establish consistent processes and quality control Strive to remain current on latest software versions Develop complete version testing and validation procedures Practice repeatable upgrade processes Invest in user training and/or certification

Proactive planning for availability Data Center Infrastructure, servers, storage, power, A/C, networking, physical security, etc. Configure redundancy of all major components in your infrastructure, eliminating single points of failure Develop a disaster recovery planning or failover processes –Clustering, load balancing, remote replication, DR warm or hot site, etc. Enhanced vendor services contracts for all hardware and infrastructure components and operating systems

Proactive planning for availability Data Center Infrastructure, servers, storage, power, A/C, networking, physical security, etc. Incorporate Service Oriented Architecture (SOA) facility management tools, such as GE Proficy SOA* to improve interoperability, monitoring, and control. * centers/12850/0/

Proactive planning for availability Use SOA tools for facility management to consolidate operation activities for enhanced data center reliability, monitoring and control From GE White Paper found at:

Proactive planning for availability Operator error or human behavior Deliver proper training & maintain accurate and updated documentation for all staff Set clear policies & procedures that the IT staff must follow Install and maintain enhanced system monitoring and alerting tools for notification and logging of system access, configuration changes, and resolutions to system issues

Planning for disasters Failure and outage scenarios

Failure and outage scenario #1 Software/application outage scenario: A weekend application upgrade causes major issues with the Scheduling application on Monday morning

Failure and outage scenario #1 Processes or steps you can implement do help avoid risk: IT upgrade procedures must in place to fully test and certify changes before code is merged into production There must be at least a separate test database, or preferably a test/development system, where all changes are validated IT needs to maintain the ability to revert back to the original environment in the event an issue occurs after upgrading

Failure and outage scenario #1 (Cont’d) Ensure all associated software versions are current and/or compatible, including operating systems, database software, etc. Develop repeatable processes that standardize all steps in upgrade planning, testing, and adoption methods

Failure and outage scenario #2 Data Center Hardware Infrastructure outage scenario: Production site server failure impacts user access to the applications.

Failure and outage scenario #2 Protecting against production data center hardware failures Configure redundant hardware and failover software to enhance availability Users connecting to the cluster are configured with load balancing to the remaining, redundant server(s)

Failure and outage scenario #2 (cont’d) Clustering type (active/active or active/passive) is determined by the operating system and clustering software installed Reduce single points of failure by designing your infrastructure with the highest level of redundancy available

Web Server Failure

Cache ’ Server Failur e

Failure and outage scenario #3 Data Center Hardware Infrastructure outage scenario: Production site experiences a catastrophic data center failure - fire, flood, explosion, tornado, or other event

Failure and outage scenario #3 Protecting against a catastrophic data center failure Databases should be periodically replicated to the DR site using some form of DR replication for all databases, with frequency determined by RPO Manual recovery of affected systems is recommended using proven scripts, procedures, and vendor assistance RPO and RTO depends on frequency of replication and failover processes

IDRR = Intelligent Disaster Recovery Replication

For more information on ISS IDRR and other related tools, please visit the ISS booth in the vendor exhibitor area.

Failure and outage scenario #4 Human-initiated failure scenario: A SAN storage manager discovers he is being terminated and deletes database files on a business critical system storage subsystem

Failure and outage scenario #4 Steps you can take to mitigate risk Install a secure, centralized access control tool for all consoles with audit trail monitoring and event alerting A product like TDi ConsoleWorks will provide this level of monitoring, audit trail tracking, and much more Ensure multiple points of secure database redundancy Organize IT for redundancy – more than one person should be responsible for access control and system security

Risks – causes of downtime Operator error or human behavior Software or Application failures Servers, storage, power/AC, networks, physical security failures Risk aversion planning IT processes Testing Quality control Redundancy DR planning Service Contracts Facility Mgmt Monitoring Training Security

Takeaways You should now be familiar with: 1)The different levels of availability – fault tolerance, high- availability, and disaster recovery and some available options to satisfy your organization’s RPO and RTO, 2)Some of the major causes of downtime and how they can impact your business, 3)Options your IT department can use to design an infrastructure, processes, and monitoring that can help reduce unplanned downtime and risk

Questions ?