Introduction to High Availability

Presentation transcript:

Introduction to High Availability H6487S I.02 Module 1

What Causes a System to Go Down?
[Figure labels: Up, Down, Planned, Unplanned]
H6487S I.02 © 2003 Hewlett-Packard Development Company, L.P.

Causes of Failures
[Pie chart. Segment labels: Application Failure, Hardware, IT Processes, Operator Errors. Values: 20%, 40%, 40%]
Source: Gartner Group, October 1999

Not a Big Deal? You Tell Me!

Average Cost per Hour of Downtime
[Bar chart; axis: cost per hour in dollars]
Financial - Brokerage Operations: $6.45 Million
Financial - Credit Card Sales: $2.6 Million
Media - Pay per view
Retail - Home Shopping (TV)
Retail - Home Catalog Sales
Transportation - Airline reservation
Media - Teleticket sales
Transportation - Package shipping
Finance - ATM fees
Source: Dataquest Perspective, Sept. 1996
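To make the hourly figures concrete, the following Python sketch estimates the cost of a single outage. It is only an illustration: it assumes cost scales linearly with outage length, it uses only the two hourly values that are legible on the slide, and the names `COST_PER_HOUR_USD` and `outage_cost` are hypothetical.

```python
# Hourly figures as quoted on the slide (Dataquest Perspective, Sept. 1996).
# Only the two values legible in the transcript are included.
COST_PER_HOUR_USD = {
    "brokerage operations": 6_450_000,
    "credit card sales": 2_600_000,
}


def outage_cost(business: str, outage_minutes: float) -> float:
    """Estimate the cost of one outage, assuming cost grows linearly with time."""
    if outage_minutes < 0:
        raise ValueError("outage_minutes must not be negative")
    return COST_PER_HOUR_USD[business] * outage_minutes / 60.0


if __name__ == "__main__":
    for business in COST_PER_HOUR_USD:
        cost = outage_cost(business, 30)
        print(f"30-minute outage, {business}: ${cost:,.0f}")
```

Real outage costs are rarely linear (a short outage at market open can cost more than a long one overnight), so the result is best read as an order-of-magnitude estimate.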

What Is High Availability?

A system is highly available if a single component or resource failure interrupts the system for only a brief time.

What is a system? (Computer? Network? Application?)
What is a resource? (Hardware? Software? OS? Database?)
What is a failure? (Disk crash? Too many packets? Full file system?)
What is an interruption? (Reboot? User reconnect? Poor performance?)
What is a brief time? (Minutes? Hours? Days?)

HIGH AVAILABILITY IS A DESIGN!
Depends on the viewpoint . . .

This definition of High Availability (HA) is very general and requires the terms system, resource, failure, interrupts and brief to be expanded. Their meaning varies with the viewpoint taken. For example, 'system' and 'resource' tend to mean 'hardware' to an administrator. To an application user, however, the 'killing' of a database process is a failure that results in loss of availability of a 'soft' resource. The 'system' is more than the hardware: it includes the operating system, application processes, data AND the ability of users to connect to and use the resources.

Some failures can be handled transparently without any interruption (e.g. a disk failure in a RAID array, or a single-bit memory error); others can result in complete loss of service and a restart. The priority of an HA system is to minimise the duration of the interruption (typically tens of minutes) and, by design, to reduce it to zero for many types of failure. The focus is TIME.

It should be noted that a well-designed HA configuration can also provide an excellent control environment for applications, helping to reduce interruptions to service from system upgrades and backups.

Computer System Availability
System: Computer
Resources: CPU, Memory, Disk
Failures: System crash, Disk failure
Interruption: System reboots, Replace failed hardware
Outage Time: Minutes to days

Network Availability
System: Network
Resources: Computers, routers, hubs, LAN cables, backbone, modems, phone lines
Failures: Failed network hardware, Bad cables, High packet collision rate
Interruption: Slow user response, User reconnects, Replace failed hardware
Outage Time: Minutes to days

Application Availability
System: Application
Resources: Computers, networks, operating system resources
Failures: System crash, Network component failure, Full file system, Performance paralysis
Interruption: Slow response time, System reboots, Replace failed hardware
Outage Time: Minutes to days

Three Pillars of High Availability
High Availability Alliances
Support Partnerships
IT Processes and People
Technology Infrastructure

5nines Support Partnerships

High Availability Terms
Downtime
Unplanned Outage
Availability
Fault Tolerant

High Availability Percentages

Availability (%)    Total Down Time (per year)
99.999              5 minutes
99.99               50 minutes
99.95               4.3 hours
99.90               8.8 hours
99.86               12 hours
99.73               24 hours
99.00               3.6 days
98                  7.2 days
97                  10.8 days
96                  14.4 days
95                  18 days

Type of System (from highest to lowest availability): Fault Tolerant, Top High Availability, Median High Avail, HP Standard Avail, Most Standard Avail
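The downtime figures on this slide are consistent with totals accumulated over one year. A minimal Python sketch of that arithmetic follows; it assumes a 365-day year, and the helper name `annual_downtime_hours` is illustrative rather than part of the course material.

```python
HOURS_PER_YEAR = 365 * 24  # assumption: non-leap year, 8760 hours


def annual_downtime_hours(availability_percent: float) -> float:
    """Return the hours of downtime per year implied by an availability percentage."""
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability must be between 0 and 100")
    return (1.0 - availability_percent / 100.0) * HOURS_PER_YEAR


if __name__ == "__main__":
    for pct in (99.999, 99.99, 99.9, 99.0, 95.0):
        hours = annual_downtime_hours(pct)
        print(f"{pct:7.3f}%  {hours * 60:10.1f} minutes  ({hours / 24:6.2f} days)")
```

For 99.999% this gives a little over five minutes per year, and for 99% about 3.65 days, in line with the rounded values on the slide.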

Availability Continuum Hierarchy
[Figure: Cost ($$) plotted against Availability. Levels, from highest to lowest: continuously available systems, highly available systems, highly resilient systems, reliable systems]