High Availability 24 hours a day, 7 days a week, 365 days a year… Vik Nagjee Product Manager, Core Technologies InterSystems Corporation.

Slides:

Advertisements

Similar presentations

Copyright © 2012 DataCore Software Corp. – All Rights Reserved. Practical High Availability NAS Cost-effective, non-stop disk access for clustered file.

Advertisements

Efficient Upgrades Steve Mallam, Sales Engineer. Highly available systems 24/7/365 Service Level Agreements Mission critical operations Time sensitive.

Introduction to DBA.

High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ

Ashesh Parekh Principal Product Manager Oracle Application Server Oracle Corporation.

Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data

June 23rd, 2009Inflectra Proprietary InformationPage: 1 SpiraTest/Plan/Team Deployment Considerations How to deploy for high-availability and strategies.

1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.

Lesson 17 – UNDERSTANDING OTHER NETWARE SERVICES.

Keith Burns Microsoft UK Mission Critical Database.

ArcGIS for Server Reference Implementations An ArcGIS Server’s architecture tour.

Lesson 1: Configuring Network Load Balancing

1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.

Lync /19/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.

National Manager Database Services

SharePoint Business Continuity Management with SQL Server AlwaysOn

Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.

Managing DHCP. 2 DHCP Overview Is a protocol that allows client computers to automatically receive an IP address and TCP/IP settings from a Server Reduces.

1 Oracle 9i AS Availability and Scalability Margaret H. Mei Senior Product Manager, ST.

SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.

Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.

High-Availability Methods Lesson 25. Skills Matrix.

Oracle Application Server 10g (9.0.4) Recommended Topologies Pavana Jain.

Module 12: Designing High Availability in Windows Server ® 2008.

Buffalo Data Storage Expansion June As organizations grow the amount of data storage capacity required to support it grows as well Increased data.

Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.

Clustering  Types of Clustering. Objectives At the end of this module the student will understand the following tasks and concepts. What clustering is.

INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.

Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.

Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.

© 2005 Mt Xia Technical Consulting Group - All Rights Reserved. HACMP – High Availability Introduction Presentation November, 2005.

IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.

Leaders Have Vision™ visionsolutions.com 1 Leading Edge Solutions, Proven Technologies Anne-Elisabeth Caillot Pre-Sales & Business Development

Module 4: Planning, Optimizing, and Troubleshooting DHCP

©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Disaster Recovery Michael Leonard.

Rajib Kundu Agenda Definitions Failover Cluster Database Snapshots Log shipping Database Mirroring.

High-Availability MySQL DB based on DRBD-Heartbeat Ming Yue September 27, 2007 September 27, 2007.

Module 10: Maintaining High-Availability. Overview Introduction to Availability Increasing Availability Using Failover Clustering Standby Servers and.

Overview of high availability in Microsoft SQL Server Szymon Wójcik.

7. Replication & HA Objectives –Understand Replication and HA Contents –Standby server –Failover clustering –Virtual server –Cluster –Replication Practicals.

OSIsoft High Availability PI Replication

CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.

 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.

High Availability in DB2 Nishant Sinha

70-412: Configuring Advanced Windows Server 2012 services

Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.

May l Washington, DC l Omni Shoreham Parallels Virtuozzo Containers Roadmap Andrey Moruga Virtualization Product Manager, Parallels.

High Availability Technologies for Tier2 Services June 16 th 2006 Tim Bell CERN IT/FIO/TSI.

LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.

Virtual Machine Movement and Hyper-V Replica

1 Chapter Overview Using Standby Servers Using Failover Clustering.

Deploying Highly Available SQL Server in Windows Azure A Presentation and Demonstration by Microsoft Cluster MVP David Bermingham.

Deploying Highly Available SAP in the Cloud

SQL Server High Availability Introduction to SQL Server high availability solutions.

All the things you need to know before setting up AlwaysOn Michael Steineke SQL & BI Solution Lead Enterprise Architect Concurrency, Inc.

Click to edit Master title style Sytel’s High Availability Strategy © 2012 Sytel Limited. All rights reservedVersion 2.5.

OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.

What is Hybrid Cloud Software?. What is Cloud Storage? Before talking about hybrid cloud software we have to know about cloud storage. Cloud Storage means.

Bentley Systems, Incorporated

Fault Tolerance Comparison

Services DFS, DHCP, and WINS are cluster-aware.

High Availability 24 hours a day, 7 days a week, 365 days a year…

iSCSI Storage Area Network

High Availability Linux (HA Linux)

Maximum Availability Architecture Enterprise Technology Centre.

A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)

SharePoint disaster recovery as a service

SQL Server High Availability Amit Vaid.

SpiraTest/Plan/Team Deployment Considerations

High Availability/Disaster Recovery Solution

Presentation transcript:

High Availability 24 hours a day, 7 days a week, 365 days a year… Vik Nagjee Product Manager, Core Technologies InterSystems Corporation

Topics What is High Availability (HA)? What is High Availability (HA)? Current HA strategies Current HA strategies What’s coming? What’s coming? Questions & Discussion Questions & Discussion

What is High Availability (HA)? Reliability Reliability Fault-tolerance Fault-tolerance High Uptime High Uptime Operational Continuity Operational Continuity Redundancy Redundancy Minimal Disruption Minimal Disruption Availability %Downtime per year Downtime per month Downtime per week 90%36.5 days72 hours16.8 hours 95%18.25 days36 hours8.4 hours 99%3.65 days7.20 hours1.68 hours 99.9%8.67 hours43.2 minutes10.1 minutes 99.99%52.6 minutes4.32 minutes1.01 minutes %5.26 minutes25.9 seconds6.05 seconds %31.5 seconds2.59 seconds0.605 seconds

High Availability vs. Disaster Recovery High Availability = fault detection & correction procedures to maximize availability of critical services and applications, often in an automated fashion. High Availability = fault detection & correction procedures to maximize availability of critical services and applications, often in an automated fashion. Disaster Recovery = process of preparing for recovery or continuation of technology infrastructure critical to an organization after a natural or human-induced disaster. Disaster Recovery = process of preparing for recovery or continuation of technology infrastructure critical to an organization after a natural or human-induced disaster. High Availability ≠ Disaster Recovery!

Current HA Strategies Failover = Automatic switch to redundant system Failover = Automatic switch to redundant system Uses some type of heartbeat software (e.g., HACMP) Uses some type of heartbeat software (e.g., HACMP) Current Failover Options: Current Failover Options: –Failover Clusters –Concurrent Clusters –ECP Clusters With Failover Cluster for DatabaseWith Failover Cluster for Database With Concurrent Cluster for DatabaseWith Concurrent Cluster for Database

Failover Clusters One active system (PROD), and one standby system (STDBY), with a heartbeat connection One active system (PROD), and one standby system (STDBY), with a heartbeat connection Windows Cluster, IBM HACMP, Sun Cluster, HP Serviceguard, Red Hat Cluster Suite, Veritas Cluster Services… Windows Cluster, IBM HACMP, Sun Cluster, HP Serviceguard, Red Hat Cluster Suite, Veritas Cluster Services… Needs shared disk for install directory, WIJ, database files, and journal files Needs shared disk for install directory, WIJ, database files, and journal files Users/Applications connect to a DNS which is mapped to PROD Users/Applications connect to a DNS which is mapped to PROD In event of failure, 3 rd party cluster software fails Caché to STDBY node In event of failure, 3 rd party cluster software fails Caché to STDBY node Caché performs recovery on STDBY node before allowing connections - open Tx’s are rolled back, open locks are released, etc… Caché performs recovery on STDBY node before allowing connections - open Tx’s are rolled back, open locks are released, etc…

Concurrent Clusters AKA Caché Clusters AKA Caché Clusters Can be configured on OpenVMS and Tru64 UNIX Can be configured on OpenVMS and Tru64 UNIX Two or more servers, each running an instance of Caché and each with access to all disks, concurrently provide access to all data Two or more servers, each running an instance of Caché and each with access to all disks, concurrently provide access to all data Users connect to either one of the clustered nodes; Caché provides data and lock synchronization across nodes Users connect to either one of the clustered nodes; Caché provides data and lock synchronization across nodes If one machine fails, users can immediately reconnect to any of the remaining cluster nodes If one machine fails, users can immediately reconnect to any of the remaining cluster nodes Caché performs cluster-wide recovery during failover – logical and physical data integrity is maintained Caché performs cluster-wide recovery during failover – logical and physical data integrity is maintained

ECP Clusters – with DB as Failover Cluster Enterprise Cache Protocol (ECP) provides a distributed, tiered system Enterprise Cache Protocol (ECP) provides a distributed, tiered system Typical configuration: Typical configuration: –N+1 application servers –Users load-balanced across app servers If any app server goes down, users can be reconnected to other remaining app servers If any app server goes down, users can be reconnected to other remaining app servers If database goes down, users on app servers will experience pause while DB failover completes (here DB is configured as a failover cluster) If database goes down, users on app servers will experience pause while DB failover completes (here DB is configured as a failover cluster) Application servers will reconnect after database has performed recovery Application servers will reconnect after database has performed recovery

ECP Clusters – with DB as Concurrent Cluster Similar to previous example, except DB server is configured as a concurrent cluster (OpenVMS or Tru64 UNIX) Similar to previous example, except DB server is configured as a concurrent cluster (OpenVMS or Tru64 UNIX) App servers can connect to any one of the nodes App servers can connect to any one of the nodes If any node fails, the app server(s) connected to that node will reconnect to another surviving node after failover If any node fails, the app server(s) connected to that node will reconnect to another surviving node after failover Caché performs cluster-wide recovery during failover – logical and physical data integrity is maintained Caché performs cluster-wide recovery during failover – logical and physical data integrity is maintained

High Availability: What’s Coming? Database Mirroring: Delivers faster, automated failover Delivers faster, automated failover Eliminates requirement for shared disk configurations Eliminates requirement for shared disk configurations Reduces dependency on 3 rd party clustering software Reduces dependency on 3 rd party clustering software Uses multiple redundant servers Uses multiple redundant servers Integrated ECP recovery Integrated ECP recovery

Database Mirroring Multiple servers in Mirror Set - one is Primary, others are Backup (1+) Multiple servers in Mirror Set - one is Primary, others are Backup (1+) TCP connections between mirror members TCP connections between mirror members Primary PUSHES journal updates to Backups, who ack and continuously de-journal Primary PUSHES journal updates to Backups, who ack and continuously de-journal Primary role can flip from one server to another within moments – automated failover Primary role can flip from one server to another within moments – automated failover All clients (except ECP) connect to a Mirror Virtual IP – mirror handles appropriate redirection to current Primary All clients (except ECP) connect to a Mirror Virtual IP – mirror handles appropriate redirection to current Primary ECP protocol is “mirror aware” – app servers will connect directly to current primary, and will fail over to new primary as appropriate. ECP will perform recovery on reconnection. ECP protocol is “mirror aware” – app servers will connect directly to current primary, and will fail over to new primary as appropriate. ECP will perform recovery on reconnection.

Wrap-up Questions & Discussion