Best Practices for Backing Up Your System

Slides:



Advertisements
Similar presentations
SQL-BackTrack for Sybase
Advertisements

Remus: High Availability via Asynchronous Virtual Machine Replication
© 2006 DataCore Software Corp DataCore Traveller Travel in Time : Do More with Time The Continuous Protection and Recovery (CPR) Solution Time Optimized.
INTRODUCTION TO ORACLE Lynnwood Brown System Managers LLC Backup and Recovery Copyright System Managers LLC 2008 all rights reserved.
Recovery Planning A Holistic View Adam Backman, President White Star Software
1 Disk Based Disaster Recovery & Data Replication Solutions Gavin Cole Storage Consultant SEE.
Delphix User Experience
Module – 11 Local Replication
Section 3 : Business Continuity Lecture 29. After completing this chapter you will be able to:  Discuss local replication and the possible uses of local.
Module 12: Backup and Recovery. Overview Backup and recovery methods available in Oracle and SQL Server 2008 Types of failure Types of recovery Formulating.
Database Backup and Recovery
CHAPTER 17 Configuring RMAN. Introduction to RMAN RMAN was introduced in Oracle 8.0. RMAN is Oracle’s tool for backup and recovery. RMAN is much more.
Backup Concepts. Introduction Backup and recovery procedures protect your database against data loss and reconstruct the data, should loss occur. The.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Navigating the Oracle Backup Maze Robert Spurzem Senior Product Marketing Manager
Storwize V7000 IP Replication solution explained
© 2010 IBM Corporation Kelly Beavers Director, IBM Storage Software Changing the Economics of Storage.
Paper on Best implemented scientific concept for E-Governance Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola By Nitin V. Choudhari, DIO,NIC,Akola.
BACKUP/MASTER: Immediate Relief with Disk Backup Presented by W. Curtis Preston VP, Service Development GlassHouse Technologies, Inc.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
IBM TotalStorage ® IBM logo must not be moved, added to, or altered in any way. © 2007 IBM Corporation Break through with IBM TotalStorage Business Continuity.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 14: Problem Recovery.
Backup & Recovery 1.
Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved.
PPOUG, 05-OCT-01 Agenda RMAN Architecture Why Use RMAN? Implementation Decisions RMAN Oracle9i New Features.
Module 10 Configuring and Managing Storage Technologies.
EMC ITEMPOINT FOR MICROSOFT EXCHANGE SERVER
SRUTHI NAGULAVANCHA CIS 764, FALL 2008 Department of Computing and Information Sciences (CIS) Kansas State University -1- Back up & Recovery Strategies.
Chapter 7 Making Backups with RMAN. Objectives Explain backup sets and image copies RMAN Backup modes’ Types of files backed up Backup destinations Specifying.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Backup & Recovery Backup and Recovery Strategies on Windows Server 2003.
Confidential1 Introducing the Next Generation of Enterprise Protection Storage Enterprise Scalability Enhancements.
| nectar.org.au NECTAR TRAINING Module 9 Backing up & Packing up.
© 2006 IBM Corporation Flash Copy Solutions im Windows Umfeld TSM for Copy Services Wolfgang Hitzler Technical Sales Tivoli Storage Management
RMAN: Your New Best Friend for Backup and Recovery Ruth Gramolini ORACLE DBA Vermont Department of Taxes.
15 Copyright © 2007, Oracle. All rights reserved. Performing Database Backups.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
10/23/2015ISYS366 - Installation1 ISYS366 Installation.
Distributed Backup And Disaster Recovery for AFS A work in progress Steve Simmons Dan Hyde University.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
High Availability in DB2 Nishant Sinha
| nectar.org.au NECTAR TRAINING Module 9 Backing up & Packing up.
18 Copyright © 2004, Oracle. All rights reserved. Backup and Recovery Concepts.
2 Copyright © 2007, Oracle. All rights reserved. Configuring for Recoverability.
8 Copyright © 2007, Oracle. All rights reserved. Using RMAN to Duplicate a Database.
18 Copyright © 2004, Oracle. All rights reserved. Recovery Concepts.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
CDP Competitive analysis of FalconStor CONFIDENTIAL DO NOT REDISTRIBUTE.
14 Copyright © 2007, Oracle. All rights reserved. Backup and Recovery Concepts.
Unit 8: Database and Storage Pool Backup and Recovery.
PHD Virtual Technologies “Reader’s Choice” Preferred product.
File-System Management
Integrating Disk into Backup for Faster Restores
Database Recovery Techniques
High Availability 24 hours a day, 7 days a week, 365 days a year…
Tivoli Storage Manager Product Family
iSCSI Storage Area Network
Module – 11 Local Replication
Demystifying Deduplication
SAN and NAS.
Database Backup and recovery
Prepared by Jaroslav makovski
Using the Cloud for Backup, Archiving & Disaster Recovery
February 11-13, 2019 Raleigh, NC.
IBM Tivoli Storage Manager
Presentation transcript:

Best Practices for Backing Up Your System Luca Ravazzolo Technology Architect

Types of backups Cold file-level backup Caché online backup Caché shutdown Server-level copy to disk/tape Caché restarted Caché online backup Caché’s backup tool copies data blocks from CACHE.DAT files to disk file or tape. Full or various incremental backups

Types of backups SAN or disk array backup Backup I/O stays within the SAN or the array Block level copy from device to device (disk, tape, virtual tape) All vendors have some type of software to control backups. To backup a consistent image, a point-in-time snapshot or clone is made of the source device.

Types of backups: others CDP: Continuous Data Protection (Near-CDP) Use of separate appliance to journal changes out-of- band allowing for recovery to any point-in-time, Depending on space available can restore to most any point-in-time. SAN-based Replication Provides a disk-to-disk copy within the SAN, perhaps over long distances, Destination can be archived to tape.

Advantages and Challenges

Caché online backup Advantages: Challenges: Caché stays up, users continue to work Simple to implement, may not need 3rd party software Challenges: Only backs up the CACHE.DAT data – must also backup journals, other files. Restores typically take multiple steps Create a Caché instance Restore “*<date>.cbk” files from storage Apply most recent full backup, then cumulative & incrementals Apply journal files

Disk array/SAN-based snapshot Advantages: Point-in-time copy of all data (Caché and otherwise) Requires no downtime (when using Caché write daemon freeze and thaw) Challenges: Requires snap/clone technology Requires additional software to coordinate There are two main types of storage snapshot, called the copy-on-write (or low-capacity) snapshot and the split-mirror snapshot. Utilities are available that can automatically generate either type. -A copy-on-write snapshot utility creates a snapshot of changes to stored data every time new data is entered or existing data is updated. This allows rapid recovery of data in case of a disk write error, corrupted file, or program malfunction. However, all previous snapshots must be available if complete archiving or recovery of all the data on a network or storage medium is needed. -A split-mirror snapshot utility references all the data on a set of mirrored drives. Every time the utility is run, a snapshot is created of the entire volume, not only of the new or updated data. This makes it possible to access data offline, and simplifies the process of recovering, duplicating, or archiving all the data on a drive. However, this is a slower process, and it requires more storage space for each snapshot. -- from http://searchstorage.techtarget.com/definition/storage-snapshot

CDP or replication Advantages Challenges CDP allows restore to nearly any point-in-time Replication allows geographically separated backups Challenges Non-Caché technologies require coordination with Caché, i.e. May end up with Caché in a crash-consistent state and require recovery before use Requires appliances and software CDP = Continuous Data Protection

Coordinating with Caché External Backup Coordinating with Caché

Freeze the write daemon(s) For a consistent database image on your backup media (i.e. a CACHE.DAT without integrity errors) the write daemon’s cycle must be complete. Use the Backup.General.ExternalFreeze() method Keeps write daemon from writing Waits for current write daemon cycle (if active) to finish Switches journal file Logs information to the cconsole.log file.

Freezing the write daemon ExternalFreeze command: OS command returns a code: 5 – successful 3 - failure While frozen, all updates are made as usual to database cache Processes continue to run normally UNLESS: Available buffers in the database cache falls too low. The ExternalFreeze lasts longer than the default limit (600 seconds) #csession cache –U%SYS “##class(Backup.General).ExternalFreeze()” #echo $? %SYS>SET rc=##class(Backup.General).ExternalFreeze()

Thaw the write daemon Use Backup.General.ExternalThaw to allow write daemon(s) to resume writing. Thaw command: OS-level command returns one of these codes: 5 – success 3 - failure #csession cache –U%SYS “##class(Backup.General).ExternalThaw()” #echo $? %SYS>SET rc=##class(Backup.General).ExternalThaw()

Another useful method Use Backup.General.ExternalSetHistory to log successful backups in the Backup History log is name of an externally created backup log desc is free text %SYS>SET log=“/var/logs/backup.log”,desc=“Full Backup” %SYS>S rc=##class(Backup.General).ExternalSetHistory(log,desc)

Who runs the freeze/thaw? The operating system user that executes the freeze/thaw command must have access to Caché. In normal install, the “backup” user must be a Caché user. %Service_Terminal must allow OS-level authentication. Caché “backup” user needs RW on the %DB_CACHESYS resource as well as use of %Admin_Operate and %Service_Terminal

Case Study: External Backup Using snapshots, a de-duplication appliance and replication for an external backup of Caché

External backup 1: Caché & snaps Backup software initiates the backup process from media server MEDIA Invoke script on server running Caché to FREEZE write daemon Backup software initiates clone or snapshot of all Caché arrays MEDIA Invoke script on server running Caché to THAW write daemon

External backup 2: Mount & copy Backup software mounts snapshot on the media server MEDIA Backup software does file level copy from snapshot to disk-based backup appliance. MEDIA Backup releases the snapshot via command-line interface call to disk controller MEDIA

Ext Backup 3: Replicate, verify & archive Backup software initiates a backup copy to a secondary data center MEDIA In secondary data center, replicated backup is restored, mounted in a Caché instance and an integrity check is run to verify structural integrity. Depending on space and policy backup is kept online and/or archived to tape for long term storage.

Timings and best practices Backup software initiates the backup process from media server MEDIA Backup software: Must be able to call freeze/thaw script on Caché server Must be able to initiate the snapshot Most commercial backup software will work well including EMC Networker, Symantec NetBackup, IBM Tivoli (TSM), etc.

Timings and best practices Sample scripts available from the WRC Time to freeze and return depends on Database activity Current write daemon phase (i.e. is it writing to disk?) Invoke script on server running Caché to FREEZE write daemon 04/02-02:30:00 (1098) 0 ExternalFreeze: Suspending system 04/02-02:30:00 (1098) 0 ExternalFreeze: Description: Backup Performed by TSM at: 2013-04-02 02:30:00 04/02-02:30:01 (1098) 0 ExernalFreeze: Start a journal restore for this backup with journal file: /jrn/20130402.003 04/02-02:30:02 (1098) 0 ExernalFreeze: System suspended

Timings and best practices Creating the clone or snap - this period is when write daemon(s) are frozen. Timing is based on array controller activity If greater than a few minutes there is a risk of running into freeze timeout. 50 seconds frozen with IBM DS5300 using FlashCopy on a few TB of data with active systems Backup software initiates clone or snapshot of all Caché arrays MEDIA 04/02-02:30:02 (1098) 0 ExernalFreeze: System suspended 04/02-02:30:52 (9109) 0 ExternalThaw: Resuming system

Timings and best practices Thawing the write daemon takes seconds at most. Best practice is to be sure to thaw the database on any error along the way. Perhaps have an independent job to check database status and thaw if frozen---so a failed backup will never leave Caché frozen. Invoke script on server running Caché to THAW write daemon

External backup 2: Mount & copy Backup software mounts snapshot on the media server MEDIA Backup software does file level copy from snapshot to disk-based backup appliance. MEDIA Use of a de-duplication appliance as the file- level backup target speeds backup and saves space. Timings vary a lot here---disk used, dedupe rate etc.

Ext Backup 3: Replicate, verify & archive Backup software initiates a backup copy to a secondary data center MEDIA SAN level replication or replication via de- duplication appliance. Timings vary a lot here based on bandwidth and de-dupe rate if applicable.

Ext Backup 3: Replicate, verify & archive Integrity checks vary in timing Another option is to have media server in primary data center run the check. In secondary data center, replicated backup is restored, mounted in a Caché instance and an integrity check is run to verify structural integrity. Depending on space and policy backup is kept online and/or archived to tape for long term storage.

Final points Considering cost and effort, Caché online backup works well for small to medium size databases (~ 100s of GB total) with generous RTOs Use InterSystems Mirroring in conjunction with your backup mechanism. Perhaps there will be no need to restore a backup If needed, the mirror destination will have CACHE.DAT files and journal files.

Final points Backup should have minimal impact on live database Using SAN/disk controller based backups offloads the work to other appliances/servers SAN/disk-based backups meet the fastest RTOs. Restore from backup RPOs are as good as the most recently available journal file.

Best Practices for Backing Up Your System Luca Ravazzolo Technology Architect