1EMC CONFIDENTIAL—INTERNAL USE ONLY Recovery Check FAQs Ibrahim Shamel.

Presentation transcript:

1EMC CONFIDENTIAL—INTERNAL USE ONLY Recovery Check FAQs Ibrahim Shamel

2 © Copyright 2014 EMC Corporation. All rights reserved.
Agenda
– What is SMLink Failure
– What is SMLink_Check Tool
– What is Recovery Check
– FAQs
– Q&A

3 What is SMLink Failure
SML stands for “Slice Manager Link”.
In R31, Slice Manager Link corruption could be present in Pool LUNs. The corruption is in the metadata of the Pool LUNs, not in the actual data. R32 has stricter validation of Pool LUNs built in, so after an upgrade to R32, Pool LUNs that are affected by the SMLink issue will go offline.

4 What is SMLink Failure (from KB 16126)
When planning a non-disruptive upgrade (NDU) from VNX OE to VNX OE and the array has run a version of prior to (Franklin) with FAST VP (auto-tiering) enabled at some point in its time of operation, it is important to run Recoverycheck on all pools.
There is no need to run Recoverycheck on arrays that have never run a version earlier than or versions earlier than that never ran FAST VP (auto-tiering).
Due to the addition of a file system check that checks for link corruption in VNX OE there is a chance that VP LUNs can be taken offline, causing a data unavailable (DU) condition after a non-disruptive upgrade (NDU) to VNX

5 What is SMLink_Check Tool
SMLink_check checks whether the array ever ran Release 31 with Autotiering (FAST VP) and whether a pool was ever created while the array was running an affected release of code.
SMLink_Check does not find corruption. It only attempts to categorize whether a pool can be affected by the SMLink footprint, and it does no data validation.
See emc to download SMLink_check

6 What is SMLink_Check Tool
If SMLink_Check advises that certain pools need to be reviewed, a Recovery Check activity is needed before we can proceed with an upgrade to R32.
What SMLink_Check does:
– Verifies if the array was ever running Elias code
– Verifies if the array has Autotiering installed
– Verifies if the pool(s) was created while the array was running Elias
If a pool matches the above rules, the pool is marked vulnerable to the issue and requires Recoverycheck.
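The three checks above can be sketched as a small decision function. This is only an illustration of the rules as the slide states them, not the SMLink_Check implementation: the `Pool` type, its fields and the function name are hypothetical, and the real tool derives these facts from the array itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Pool:
    # Hypothetical record; field names are assumptions for illustration.
    pool_id: int
    name: str
    created_under_elias: bool


def pools_needing_recoverycheck(
    array_ran_elias: bool,
    autotiering_installed: bool,
    pools: List[Pool],
) -> List[Pool]:
    """Return the pools that match all three rules from the slide."""
    # Rules 1 and 2 are array-wide: if either fails, no pool is flagged.
    if not (array_ran_elias and autotiering_installed):
        return []
    # Rule 3 is per pool: it must have been created while running Elias.
    return [p for p in pools if p.created_under_elias]
```

A pool created after the array moved off Elias is not flagged, even when the array ran Elias with Autotiering earlier in its life.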

7 SMLink_Check Output - Pass
    Welcome to the VNX SMLINK_check program
    SMLINK_check version
    Program start : 12/02/ :08:37
    SP information: FCN
    This array was running code : on : 2011/09/30 07:05:22
    This array was running code : on : 2011/12/14 13:38:27
    This array was running code : on : 2012/02/10 14:58:14
    This array was running code : on : 2012/03/21 14:54:12
    This array is NOT potentially affected. Exiting...
SMLink check tool completed and found no issues. It is OK to proceed with R31 to R32 NDU.

8 SMLink_Check Output - Fail
    Welcome to the VNX SMLink_check program
    SMLink_check version
    Program start : 12/03/ :33:18
    SP information: APM
    This array was running code : on : 2011/08/02 00:09:42
    This array was running code : on : 2011/08/25 18:51:45
    This array was running code : on : 2012/01/24 01:56:03
    This array was running code : on : 2012/04/20 22:39:10
    This array is running FAST Virtual Provisioning
    This array has one or more pools configured
    Post Elias Code installed on: 2012/01/24 01:56:03
    Pool ID 0x0 with name GP_FAST_R5_Pool_3 was created under ELIAS code
    Pool ID 0x1 with name GP_FAST_R5_Pool_4 was created under ELIAS code
    Pool ID 0x2 with name Hi_Perf_R5_Pool_5 was created under ELIAS code
    Pool ID 0x3 with name Hi_Perf_R10_Pool_6 was created under ELIAS code
    The following Pool ID's need a recovery check, please follow EMC306064(EMC Internal only) or escalate to EMC
    Pool ID 0x0 with name GP_FAST_R5_Pool_3
    Pool ID 0x1 with name GP_FAST_R5_Pool_4
    Pool ID 0x2 with name Hi_Perf_R5_Pool_5
    Pool ID 0x3 with name Hi_Perf_R10_Pool_6
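When many arrays are being reviewed, the list of pools that need a Recovery Check can be pulled out of a saved report with a few lines of Python. The sketch below assumes the report has one message per line, which the flattened transcript does not confirm, and the function name is hypothetical. It reads only the pool lines that follow the "need a recovery check" message.

```python
import re
from typing import List, Tuple

# Matches a bare "Pool ID 0x0 with name <name>" line (no trailing text).
POOL_LINE = re.compile(r"^Pool ID (0x[0-9a-fA-F]+) with name (\S+)\s*$")


def pools_flagged(report: str) -> List[Tuple[int, str]]:
    """Return (pool_id, pool_name) for pools listed as needing a check."""
    flagged = []
    in_section = False
    for raw in report.splitlines():
        line = raw.strip()
        if "need a recovery check" in line:
            in_section = True  # pool lines after this marker are the ones we want
            continue
        if in_section:
            match = POOL_LINE.match(line)
            if match:
                flagged.append((int(match.group(1), 16), match.group(2)))
    return flagged


# Excerpt of the failing report shown above (line layout assumed).
SAMPLE = """\
This array is running FAST Virtual Provisioning
Pool ID 0x0 with name GP_FAST_R5_Pool_3 was created under ELIAS code
Pool ID 0x1 with name GP_FAST_R5_Pool_4 was created under ELIAS code
The following Pool ID's need a recovery check, please follow EMC306064(EMC Internal only) or escalate to EMC
Pool ID 0x0 with name GP_FAST_R5_Pool_3
Pool ID 0x1 with name GP_FAST_R5_Pool_4
"""

if __name__ == "__main__":
    for pool_id, name in pools_flagged(SAMPLE):
        print(f"Pool {pool_id:#x} ({name}) needs a Recovery Check")
```

A passing report contains no "need a recovery check" message, so the function returns an empty list for it.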

9 What is Recovery Check
Recoverycheck is a detailed tool from Escalation Engineering.
The Recovery Check activity checks for possible corruption of the SMLink. Recoverycheck is a read-only tool and does not make any changes.
It is advisable to run Recoverycheck at times of lower I/O to prevent false positives. Realistically, this means a period that is not full production. If that cannot be avoided, it is acceptable but not ideal, as it may require many additional Recoverycheck attempts.

10 What is Recovery Check
Recovery Check is a read-only tool. It does not make any modifications to the storage system or the pools.
Before running the Recovery Check, we need to make sure there is no or very minimal I/O on the array, and:
– Disable FAST Cache
– Disable Autotiering
– Disable Compression
After running the Recovery Check tool, FAST Cache and Compression are re-enabled. Autotiering needs to be left disabled until the upgrade takes place.
More info on the Recovery Check:
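The feature states required at each stage can be written down as a small checklist. This is a sketch of the rules on this slide only: the phase names, feature labels and function are hypothetical, and the code does not query or change an array.

```python
from enum import Enum
from typing import Dict, List


class Phase(Enum):
    BEFORE_CHECK = "before Recovery Check"
    AFTER_CHECK = "after Recovery Check, before the upgrade"


# Features that must be disabled in each phase, per the slide.
MUST_BE_DISABLED = {
    Phase.BEFORE_CHECK: {"FAST Cache", "Autotiering", "Compression"},
    Phase.AFTER_CHECK: {"Autotiering"},
}


def violations(phase: Phase, enabled: Dict[str, bool]) -> List[str]:
    """List features that are enabled but must be disabled in this phase."""
    return sorted(
        feature
        for feature in MUST_BE_DISABLED[phase]
        if enabled.get(feature, False)
    )
```

For example, with FAST Cache and Compression re-enabled and Autotiering still off, `violations(Phase.AFTER_CHECK, ...)` returns an empty list, while the same state before the check reports both features.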

11 FAQs (Frequently Asked Questions)
How long does running Recovery Check take?
– It varies. When running Recovery Check you will need to generate SAT files from each SP, which will take anywhere from minutes (both can be run at the same time). Once this is complete, the actual Recovery Check run takes between 5 and 30 minutes per pool, depending on the configuration of the array and how many pool LUNs there are.
– FAST Cache, if enabled, increases the time needed to perform the Recovery Check, since we need to de-stage it first.
– As a rule of thumb:
    Recovery Check with FAST Cache: 6 hours
    Recovery Check without FAST Cache: 3 hours
– You can determine whether FAST Cache is enabled from the RCM TRiiAGE output:
    Array Serial Number: FNM
    Array Model: VNX5500 ( BLOCK )
    Array Software Revision:
    IP Address:
    SP Uptime: 10 days 04:25:05 10 days 04:35:44
    EFD/FAST Cache Feature Enabled: Yes
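The timing guidance above reduces to simple arithmetic, sketched here for planning purposes. The function names are hypothetical, the "No" value for the FAST Cache line is an assumption (the slide only shows "Yes"), and SAT file generation time is left out because the source does not give a usable figure.

```python
import re
from typing import Tuple


def fast_cache_enabled(triiage_text: str) -> bool:
    """Read the FAST Cache line from TRiiAGE output ("No" is assumed)."""
    match = re.search(r"EFD/FAST Cache Feature Enabled:\s*(Yes|No)", triiage_text)
    if match is None:
        raise ValueError("FAST Cache line not found in TRiiAGE output")
    return match.group(1) == "Yes"


def rule_of_thumb_hours(triiage_text: str) -> int:
    """Slide's rule of thumb: 6 hours with FAST Cache, 3 hours without."""
    return 6 if fast_cache_enabled(triiage_text) else 3


def check_minutes_range(pool_count: int) -> Tuple[int, int]:
    """Recovery Check run time alone: 5 to 30 minutes per pool."""
    return (5 * pool_count, 30 * pool_count)
```

For the four pools flagged in the failing SMLink_Check example, `check_minutes_range(4)` gives 20 to 120 minutes for the check itself; the 3 or 6 hour figure is the slide's overall planning estimate.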

12 FAQs (Frequently Asked Questions)
Why do you need to disable Data Compression?
– Data compression is a feature that analyzes the data on a disk and applies algorithms that reduce the size of repetitive sequences of bits that are inherent in some types of files. During the compression operation for a RAID group LUN, the software migrates and compresses the LUN data to a thin LUN in a pool, and the LUN becomes a compressed thin LUN. Compression operations for pool LUNs (thick and thin) take place within the pool in which the LUN being compressed resides. Whenever data is compressed, data is moved, and that movement affects the results of the Recovery Check.
Why do you need to disable Auto-Tiering?
– The auto-tiering feature migrates data between storage tiers or different storage media (EFD, FC and SATA). The purpose of tiered storage is to retain the most frequently accessed or important data on fast, high-performance (more expensive) drives, and to move the less frequently accessed and less important data to low-performance (less expensive) drives. As with Data Compression, Auto-Tiering involves data movement, which can also cause false results.
Why do you need to disable FAST Cache?
– Like the two features above, FAST Cache also involves data movement. When FAST Cache is enabled on a RAID group LUN or in a pool, data in FAST Cache that is used less frequently is moved from FAST Cache to the HDDs. That data is re-promoted to FAST Cache when it becomes busy or more frequently used.

13 FAQs (Frequently Asked Questions)
Does running Recovery Check cause LUNs to go offline?
– No, running Recoverycheck does not take LUNs offline.
Must pools be taken offline during recovery?
– This depends on the analysis of the data previously provided. Escalation Engineering will endeavor not to take the LUN offline and to repair the affected LUN via a scripted batch file. However, under certain circumstances the repair does require affected LUNs with SMLink corruption to be taken offline for recovery.
– If one or more affected LUNs are used for database or File Data Mover applications, the whole application may need to be brought offline. Keep in mind that there is always a risk of unintentionally taking a pool offline when performing this type of manual recovery. Recovery should be performed during no/low I/O time periods.

14 FAQs (Frequently Asked Questions)
Do FAST Cache and Compression need to remain disabled after running Recoverycheck, but before recovery?
– No, FAST Cache and Compression can be re-enabled on pool LUNs after Recoverycheck has run. Both features must be disabled on affected pools prior to recovery. This should be factored into the total time required for the recovery operation.
How long does running Recoverycheck take?
– It varies. When running Recoverycheck you will need to generate SAT files from each SP, which will take anywhere from minutes (both can be run at the same time). Once this is complete, the actual Recoverycheck run takes between 5 and 30 minutes, depending on the configuration of the array and how many pool LUNs there are.

15 FAQs (Frequently Asked Questions)
FAST Cache has been disabled, but there is no progress
– You need to make sure that setstats is enabled.
If FAST Cache is taking too long, can I start the Recovery Check?
– You can, and if the first run is clean, that is enough. However, if you find corruption, you will need to wait for FAST Cache to finish.

16 Q&A