Find and fix the Root cause code Recover the client experience Repair the symptom Remove complexity.

Slides:



Advertisements
Similar presentations
1 Windows 2008 Failover Clustering Witness/Quorum Models.
Advertisements

Copyright © 2012 DataCore Software Corp. – All Rights Reserved. Practical High Availability NAS Cost-effective, non-stop disk access for clustered file.
3 4 AUGSEPOCTAUGSEPOCTAUGSEPTOCTAUGSEPOCT Americas 99.99% 99.95%99.97%99.98%99.99% 99.95%99.92% EMEA 99.99% 99.95%99.97%99.98%99.99% 99.95%99.92%
EXCHANGE SERVER 2010 HIGH AVAILABILITY CONCEPTS Scott Schnoll Principal Technical Writer Microsoft Corporation SESSION CODE: EXL303 (c) 2011 Microsoft.
Scott Schnoll Exchange Server 2013 Site Resilience.
How does Microsoft approach change management communication? What happens when I have an outage? What is the Service Health Dashboard? What is the future.
Topic Of This Hour Exchange 2013: Database Availability Group Sept 26, :00 PM EST / 12:00 PM CST / 10:00 AM PST / 7:00 PM CET / 6:00 PM UTC/10:30.
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
Princeton PC Users Group Hard Drive Disaster! By Paul Kurivchack March 14, 2005.
1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.
Keith Burns Microsoft UK Mission Critical Database.
ArcGIS for Server Reference Implementations An ArcGIS Server’s architecture tour.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.
Backup and Recovery Part 1.
Lync /19/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Active Directory lookupReplay RPC server wrapperTPR API manager Copy status lookupRemote data provider wrapperSupport API manager Replay core managerVssWriterServer.
Chapter 5 Configuring the RMAN Environment. Objectives Show command to see existing settings Configure command to change settings Backing up the controlfile.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
National Manager Database Services
Workflow Steps Perform a datacenter switchover for a database availability group Version 1.2 (Updated 12/2012)
Implementing High Availability
Copyright© Microsoft Corporation DAG Architecture.
CAS Array DAG MBX-A MBX-B DB1 Load Balancer.
Empower Hyper-V Improved Manageability Improved Manageability Continue Simplification Mission Expand Cluster Validation Flexible Migrations.
Course 6425A Module 9: Implementing an Active Directory Domain Services Maintenance Plan Presentation: 55 minutes Lab: 75 minutes This module helps students.
But what if there is a catastrophic event? Fire, flood, earthquake …
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Implementing Multi-Site Clusters April Trần Văn Huệ Nhất Nghệ CPLS.
CAS Exchange 2013 architecture For a given mailbox’s connectivity, the protocol being used is always served by the server that hosts the active database.
PPD Computing “Business Continuity” Windows and Mac Kevin Dunford May 17 th 2012.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
DATABASE MIRRORING  Mirroring is mainly implemented for increasing the database availability.  Is configured on a Database level.  Mainly involves two.
SQLCAT: SQL Server 2012 AlwaysOn Lessons Learned from Early Customer Deployments Sanjay Mishra Program Manager Microsoft Corporation DBI360.
Module 9 Planning a Disaster Recovery Solution. Module Overview Planning for Disaster Mitigation Planning Exchange Server Backup Planning Exchange Server.
Site Power OutageNetwork Disconnect Node Shutdown for Patching Node Crash Quorum Witness Failure How do I make sure my Cluster stays up ??... Add/Evict.
SQLCAT: SQL Server HA and DR Design Patterns, Architectures, and Best Practices Using Microsoft SQL Server 2012 AlwaysOn Sanjay Mishra Program Manager.
Module 10: Maintaining High-Availability. Overview Introduction to Availability Increasing Availability Using Failover Clustering Standby Servers and.
1 Administering Shared Folders Understanding Shared Folders Planning Shared Folders Sharing Folders Combining Shared Folder Permissions and NTFS Permissions.
DB4DB3 DB2 PassiveActiveLagged 4-member DAG 4 databases 4 copies of each database 4 databases per volume Symmetrical design.
Windows Server 2003 硬碟管理與磁碟機陣列 林寶森
"1"1 Introduction to Managing Data " Describe problems associated with managing large numbers of disks " List requirements for easily managing large amounts.
Scott Schnoll m Microsoft Corporation.
Online Snapshots (up to 512) Disk-based Recovery Tape-based Backup Data Protection Manager Up to Every 15 minutes Disaster Recovery with offsite replication.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
High Availability in DB2 Nishant Sinha
Peter Mattei HP Storage Consultant 16. May 2013
Best Practices for Implementing Unicenter NSM r11.1 in an HA MSCS Environment Part I -Last Revision April 24, 2006.
Enabling Disaster Recovery for Hyper-V Workloads Using Hyper-V Replica Vijay Sistla Senior Program Manager Microsoft Corporation VIR302.
Enhancing Scalability and Availability of the Microsoft Application Platform Damir Bersinic Ruth Morton IT Pro Advisor Microsoft Canada
Best Practices for Implementing Unicenter NSM r11 in an HA MSCS Environment Part I -Last Revision April 24, 2006.
1 Chapter Overview Using Standby Servers Using Failover Clustering.
VCS Building Blocks. Topic 1: Cluster Terminology After completing this topic, you will be able to define clustering terminology.
SQL Server 2012: AlwaysOn HA and DR Design Patterns, and Lessons Learned from Early Customer Deployments Sanjay Mishra SQLCAT.
A CHIEVING SITE R ESILIENCY WITH E XCHANGE SERVER Pradeep Kini.
Unit 10 ITT TECHNICAL INSTITUTE NT1330 Client-Server Networking II Date: 2/24/2016 Instructor: Williams Obinkyereh.
Seminar On Rain Technology
AlwaysOn In SQL Server 2012 Fadi Abdulwahab – SharePoint Administrator - 4/2013
Scott Schnoll Exchange Server 2013 High Availability.
High Availability 24 hours a day, 7 days a week, 365 days a year…
Managing Multi-User Databases
Implementing Exchange Server 2013
Network Load Balancing
Maximum Availability Architecture Enterprise Technology Centre.
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
Always on HA SQL Server Always ON feature is the new comprehensive high availability and disaster recovery solution which increases application availability.
VCS-257 Dumps Questions
Unit 10 NT1330 Client-Server Networking II Date: 8/16/2016
What’s new in SQL Server 2016 Availability Groups
Designing Database Solutions for SQL Server
Presentation transcript:

Find and fix the Root cause code Recover the client experience Repair the symptom Remove complexity

Easier deployment and management Fewer things that can fail

X

Forget about replacing disks as they fail Probability you’ll need to replace more than monthly: =(1-BINOM.DIST(spares + 1, disks per server, AFR/12, TRUE))*servers

Probe engine: data collection and notifications mechanism, feeding into… Monitor engine: contains business logic to evaluate health of customer- impacting features Responder engine: set of recovery actions that can be taken to recover degraded state of the monitored resource

… OneCopyMonitor UNHEALTHY

Maximum Preferred Actives Optimized for load Still allows mount Example: 19 Designed optimum Result of Redistribute- ActiveDatabases.ps1 Example: 14

Move-Activ boxDatabase -SkipMaximumActiveDatabaseChecks skips both

ToolParameterValueInstanceUsage Suspend- MailboxDatabaseCopy ActivationOnly N/APer database copy Keep active off a working but questionable drive Set-MailboxServer DatabaseCopyAutoActivationPolicy “Blocked” or “Unrestricted” Per server Used to control active/passive SR configurations and maintenance Can force admin move Set-MailboxServer DatabaseCopyActivationDisabledAndMoveNow $true or $falsePer server Used to do faster site failovers and maintain database availability Databases are not blocked from failing back Continuous move-off operation

Majority of 7 required

X X X Majority of 4 requiredMajority of 7 required

X X X X Majority of 3 required

X X X X X Majority of 2 required

X X X X X

X X X X X 1 0

X X X X X 0 1

X X X X X 0 1 X

X X X X X 0 1 X X

NameDynamicWeightNodeWeightState EX111Up

Deployment scenarioRecommendations DAG(s) deployed in a single datacenterLocate witness server in the same datacenter as DAG members; can share one server across DAGs DAG(s) deployed across two datacenters; No additional locations available Locate witness server in primary datacenter; can share one server across DAGs DAG(s) deployed across two+ datacentersLocate witness server in third location; can share one server across DAGs

Witness Offline Witness vote gets removed by the cluster Witness Online If necessary, Witness vote is added back by the cluster Witness Failure Witness vote gets removed by the cluster Windows Server 2012 R2 and later

cas3cas4cas1cas2

mbx1mbx2mbx3mbx4 Assuming MBX3 and MBX4 are operating and one of them can lock the witness.log file, automatic failover should occur witness

mbx1mbx2mbx3mbx4 1.Mark the failed servers/site as down: Stop-DatabaseAvailabilityGroup DAG1 –ActiveDirectorySite:Redmond 2.Stop the Cluster Service on Remaining DAG members: Stop-Clussvc 3.Activate DAG members in 2 nd datacenter: Restore-DatabaseAvailabilityGroup DAG1 –ActiveDirectorySite:Portland

mbx1mbx2mbx3mbx4 1.Mark the failed servers/site as down: Stop-DatabaseAvailabilityGroup DAG1 –ActiveDirectorySite:Redmond 2.Stop the Cluster Service on Remaining DAG members: Stop-Clussvc 3.Activate DAG members in 2 nd datacenter: Restore-DatabaseAvailabilityGroup DAG1 –ActiveDirectorySite:Portland