ClusterLion Automatic switchover for NetApp MetroCluster Robert Graf

Presentation transcript:

ClusterLion Automatic switchover for NetApp MetroCluster Robert Graf CEO robert.graf@prolion.com +43 664 1314403 2nd Sept. 2019

We care about your data! Protect, manage, analyze.

High Availability in IT: High availability is critical in today’s IT world! Most businesses depend on their critical applications! “Always On” has become mandatory for many companies! Downtime impacts productivity, money and reputation!

Cost of Downtime: Is system uptime important to your business? Costs and studies vary depending on the industry. However, IT downtime causes significant damage! Source: ServiceMax, from GE Digital, commissioned Vanson Bourne to conduct a global study into unplanned downtime, “After The Fall: Cost, Causes and Consequences of Unplanned Downtime.” The study surveyed 450 IT and field services decision makers in the UK, US, France and Germany across the manufacturing, medical, oil and gas, energy and utilities, telecoms, distribution, logistics and transportation sectors, among others.

Always-On Availability: The main question is: do you need always-on availability for your IT applications? If yes: automatic switchover is needed! If no: manual switchover is a valid solution!

[Diagram: MetroCluster without ClusterLion. Labels: SRV1, SRV2, Ethernet, Srvc(a), Srvc(b), Fabric / ATTO’s, Grid and UPS at each site.]

[Diagram: MetroCluster with ClusterLion. Labels: SRV1, SRV2, Ethernet, Srvc(a), Srvc(b), Fabric / ATTO’s, Grid and UPS at each site (100m), Telco A, Telco B, Q, Switchover, Power-OFF.]

Avoid a Split-Brain Syndrome!!! Wikipedia: “High-availability clusters usually use a heartbeat or quorum connection which is used to monitor the health and status of each node in the cluster. For example, the split-brain syndrome may occur when all connections go down simultaneously, but the cluster nodes are still running, each one believing they are the only one running. The data sets of each cluster may then randomly serve I/O by their own, without any coordination with the other data sets. This may lead to data corruption or other data inconsistencies…”
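The split-brain scenario described above can be illustrated with a minimal sketch. It is a generic tie-breaker model, not ClusterLion’s or NetApp’s actual algorithm; the names `SiteView`, `may_serve_io` and `resolve` are hypothetical and exist only for this illustration. The idea is that a site which loses its partner heartbeat may continue serving I/O only if an independent witness grants it a single, exclusive lock.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class SiteView:
    """What one site can observe about the rest of the cluster (illustrative)."""
    name: str
    sees_partner: bool  # heartbeat to the other site is alive
    sees_witness: bool  # connection to the independent tie-breaker is alive


def may_serve_io(view: SiteView, witness_grants_lock: bool) -> bool:
    """Decide whether a site is allowed to keep (or take over) serving I/O."""
    if view.sees_partner:
        return True   # normal operation: both sites coordinate
    if not view.sees_witness:
        return False  # fully isolated: stop rather than risk split brain
    return witness_grants_lock


def resolve(site_a: SiteView, site_b: SiteView) -> Dict[str, bool]:
    """Model a witness that hands out one lock to the first isolated site it hears from."""
    lock_holder = None
    result = {}
    for view in (site_a, site_b):
        grant = False
        if not view.sees_partner and view.sees_witness and lock_holder is None:
            lock_holder = view.name
            grant = True
        result[view.name] = may_serve_io(view, grant)
    return result


if __name__ == "__main__":
    # Heartbeat lost, both sites still reach the witness: only one may continue.
    print(resolve(SiteView("A", False, True), SiteView("B", False, True)))
```

Without the witness, both sites would see only "partner unreachable" and could each decide to serve I/O independently, which is exactly the split-brain condition.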

The challenge for every Storage Cluster: Every storage vendor on the market needs a quorum, witness or tie-breaker to run automatic switchover in case of a site failure! This requires expensive infrastructure investments in a third data center location and highly redundant interconnects from the primary data centers to the quorum site! With ClusterLion, no such infrastructure investment is needed, which offers the lowest possible TCO for automatic switchover.

MetroCluster® Management and Disaster Recovery Guide: When all controller modules at a site fail because of power loss, replacement of equipment, or disaster, MetroCluster configurations typically cannot differentiate between failures and disasters. An administrator or the MetroCluster Tiebreaker software must determine that a disaster has occurred and perform the MetroCluster switchover. Tiebreaker should only be used for monitoring and alerting! AUSO (automatic unscheduled switchover) is not supported on MetroCluster IP configurations. Execute command: switchover -override-veto true -forced-on-disaster true
Source: https://library.netapp.com/ecm/ecm_download_file/ECMLP2495113

Why is ClusterLion the right solution?
Runs on totally independent infrastructure (mobile network and batteries)
Switchover actions are disabled if NVRAM or plexes are not in sync
Storage controllers are physically powered off before switchover is triggered
Thanks to the cloud quorum, no third data center is needed
Application integration with SAP, VMware, etc. (trigger post-scripts)
Switchover in case of network failure (NFO on Ethernet)
Tamper-proof! High-end security due to Layer-1 “firewall”
Automatic guidance through the giveback process
Proactive ProLion support and MetroCluster expertise
Proactive MetroCluster configuration checks
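Three of the points above (the sync check, the power-off before switchover, and the post-scripts) describe an ordering of steps. The following is a minimal sketch of that ordering only, assuming the last known sync state is available when the failure is detected; `SiteStatus` and `plan_switchover` are hypothetical names, and nothing here reflects the product’s real interfaces.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SiteStatus:
    """Last known mirror state before the failure (illustrative assumption)."""
    nvram_in_sync: bool
    plexes_in_sync: bool


def plan_switchover(status: SiteStatus, post_scripts: List[str]) -> List[str]:
    """Return the ordered steps for a switchover, or an alert if it must be blocked."""
    # Guard: never switch over onto data that was not fully mirrored.
    if not (status.nvram_in_sync and status.plexes_in_sync):
        return ["alert: switchover disabled, NVRAM or plexes not in sync"]

    steps = [
        # Fencing first, so the failed site cannot come back and serve stale I/O.
        "power off storage controllers at failed site",
        "trigger switchover on surviving site",
    ]
    # Application integration (e.g. SAP, VMware) runs after the switchover.
    steps += [f"run post-script: {name}" for name in post_scripts]
    return steps


if __name__ == "__main__":
    for step in plan_switchover(SiteStatus(True, True), ["restart_app.sh"]):
        print(step)
```

The script name `restart_app.sh` is a placeholder. The point of the ordering is that fencing (power-off) precedes the switchover, so two sites can never serve the same data at once.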

NFO (Network Failover Option)
[Diagram labels: SVR1, SVR2, Ethernet with ping checks, Srvc(a), Srvc(b), Fabric / ATTO’s, Grid and UPS at each site (100m), Telco A, Telco B, Q, Switchover, Power-OFF.]

MetroCluster IP: No automatic unplanned switchover (no AUSO)!!! Action required to switch over!!!
[Diagram labels: SVR1, SVR2, Ethernet, Srvc(a), Srvc(b), Fabric / ATTO’s, Grid and UPS at each site (100m), Telco A, Telco B, Q.]

Solutions for MetroCluster Switchover

Detailed Setup
Use case: Power outage
Monitoring: power supply, storage controller, partner status, heartbeat
1. Reporting: A2: active controller heartbeat. A1: lost cluster partner, NVRAM etc. B2: no controller heartbeat. B1: controller error.
2. Action: B2: power off. B1: power off. A2: active controller heartbeat. A1: force switchover. Q: open helpdesk ticket.
“Giveback”: customer support during giveback.
[Diagram labels: SVR1, SVR2, Ethernet / SAN, Srvc(a), Srvc(b), Fabric / ATTO’s, units A1, A2, B1, B2 (each site connected via 2x RS232 and 2x Ethernet), Grid and UPS at each site (100m), Telco A, Telco B, Q, Switchover, Power-OFF.]
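The power-outage use case above maps a set of reports from the units A1, A2, B1 and B2 to a set of actions. A minimal sketch of such a mapping follows; the rule used here (a site counts as failed when none of its units reports a healthy state, and as alive when at least one unit reports an active heartbeat) is an assumption made for illustration, and `Report`, `site_failed`, `site_alive` and `decide_actions` are hypothetical names.

```python
from enum import Enum
from typing import Dict, List


class Report(Enum):
    ACTIVE_HEARTBEAT = "active controller heartbeat"
    LOST_PARTNER = "lost cluster partner"
    NO_HEARTBEAT = "no controller heartbeat"
    CONTROLLER_ERROR = "controller error"


FAILED_REPORTS = {Report.NO_HEARTBEAT, Report.CONTROLLER_ERROR}


def site_failed(reports: Dict[str, Report]) -> bool:
    """Assumed rule: every unit at the site reports a failure state."""
    return all(r in FAILED_REPORTS for r in reports.values())


def site_alive(reports: Dict[str, Report]) -> bool:
    """Assumed rule: at least one unit still sees an active controller heartbeat."""
    return Report.ACTIVE_HEARTBEAT in reports.values()


def decide_actions(site_a: Dict[str, Report], site_b: Dict[str, Report]) -> List[str]:
    """Map the reports of both sites to actions; an empty list means no action."""
    for survivor, failed in ((site_a, site_b), (site_b, site_a)):
        if site_alive(survivor) and site_failed(failed):
            # Fence the failed site first, then switch over on the survivor.
            actions = [f"{unit}: power off" for unit in sorted(failed)]
            actions += [
                f"{unit}: force switchover"
                for unit, report in sorted(survivor.items())
                if report is Report.LOST_PARTNER
            ]
            actions.append("Q: open helpdesk ticket")
            return actions
    return []


if __name__ == "__main__":
    a = {"A2": Report.ACTIVE_HEARTBEAT, "A1": Report.LOST_PARTNER}
    b = {"B2": Report.NO_HEARTBEAT, "B1": Report.CONTROLLER_ERROR}
    for action in decide_actions(a, b):
        print(action)
```

With the reports from the slide, this yields power-off for B1 and B2, a forced switchover issued by A1, and a helpdesk ticket opened by Q.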

ClusterLion Hardware (front): ClusterLion without front cover, “hot swap” battery

ClusterLion Hardware (rear):
4x power input
Reset button
4x power output
2x serial console port
Cooling fan
4x Ethernet connectivity
PoE output for gateways
Fuse

ClusterLion Premium Support
Premium support contract:
24x7 support
Proactive customer notification
Proactive configuration check
Support during MetroCluster switchback
3rd party maintenance in EMEA

Do you need always-on availability? The question is not whether you can afford ClusterLion… the question is whether you can afford to run your MetroCluster without ClusterLion.

...we go the extra mile...