Stephen D. Mund, OCP Database Administrator Nevada System of Higher Education System Computing Services.

Slides:



Advertisements
Similar presentations
INTRODUCTION TO ORACLE Lynnwood Brown System Managers LLC Oracle High Availability Solutions RAC and Standby Database Copyright System Managers LLC 2008.
Advertisements

Chapter 14 Handling Online Redo Log Failures. Background RMAN doesn’t back up online redo logs You don’t use RMAN to recover from online redo log failures.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Backup and recovery Basics of Backup and restoration Types of recovery Defining strategy Starting up and shutting down 80/20 rule SLA’s.
Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data
FlareCo Ltd ALTER DATABASE AdventureWorks SET PARTNER FORCE_SERVICE_ALLOW_DATA_LOSS Slide 1.
1 Chapter 15 Duplicating Databases and Transporting Data.
5 Copyright © 2006, Oracle. All rights reserved. Database Recovery.
Database Backup and Recovery
RMAN Restore and Recovery
Backup and Recovery (2) Oracle 10g CAP364 1 Hebah ElGibreen.
Backup and Recovery Part 1.
Chapter 12 Performing Incomplete Recovery. Background Viewed as one of the more difficult chapters to write Thought it was important to put in material.
Harvard University Oracle Database Administration Session 11 Database Recovery.
CHAPTER 16 User-Managed Backup and Recovery. Introduction to User Managed Backup and Recovery Backup and recover is one of the most critical skills a.
Chapter 5 Configuring the RMAN Environment. Objectives Show command to see existing settings Configure command to change settings Backing up the controlfile.
CHAPTER 5 Managing Control Files, Online Redo Logs, and Archiving.
CHAPTER 17 Configuring RMAN. Introduction to RMAN RMAN was introduced in Oracle 8.0. RMAN is Oracle’s tool for backup and recovery. RMAN is much more.
Oracle9i Database Administrator: Implementation and Administration
Backup Concepts. Introduction Backup and recovery procedures protect your database against data loss and reconstruct the data, should loss occur. The.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
Oracle backup and recovery strategy
Introduction to Oracle Backup and Recovery
Using RMAN to Perform Recovery
Overview SAP Basis Functions. SAP Technical Overview Learning Objectives What the Basis system is How does SAP handle a transaction request Differentiating.
CHAPTER 16 User-Managed Backup and Recovery. Introduction to User Managed Backup and Recovery Backup and recover is one of the most critical skills a.
Building Highly Available Systems with SQL Server™ 2005 Vineet Gupta Evangelist – Data and Integration Microsoft Corp.
13 Copyright © Oracle Corporation, All rights reserved. RMAN Complete Recovery.
PPOUG, 05-OCT-01 Agenda RMAN Architecture Why Use RMAN? Implementation Decisions RMAN Oracle9i New Features.
Maintaining a Microsoft SQL Server 2008 Database SQLServer-Training.com.
Database Administration TableSpace & Data File Management
Chapter 7 Making Backups with RMAN. Objectives Explain backup sets and image copies RMAN Backup modes’ Types of files backed up Backup destinations Specifying.
11 Copyright © Oracle Corporation, All rights reserved. RMAN Backups.
Chapter 18: Windows Server 2008 R2 and Active Directory Backup and Maintenance BAI617.
Chapter 9 Scripting RMAN. Background Authors felt that scripting was a topic not covered well Authors wanted to cover both Unix/Linux and Windows environments.
ORACLE 10g DATAGUARD Ritesh Chhajer Sr. Oracle DBA.
© ViSolve.com All rights reserved. Privacy Statement April Oracle Disaster Recovery Implementation A Non-Technical Overview.
Backup & Recovery Backup and Recovery Strategies on Windows Server 2003.
11g(R1/R2) Data guard Enhancements Suresh Gandhi
Module 9 Planning a Disaster Recovery Solution. Module Overview Planning for Disaster Mitigation Planning Exchange Server Backup Planning Exchange Server.
9 Copyright © 2004, Oracle. All rights reserved. Flashback Database.
7202ICT – Database Administration
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
15 Copyright © Oracle Corporation, All rights reserved. RMAN Incomplete Recovery.
Week 3 Lecture 1 The Redo Log Files and Diagnostic Files.
IT Database Administration Section 09. Backup and Recovery Backup: The available options Full Consistent (cold) Backup Database shutdown, all files.
8 Copyright © Oracle Corporation, All rights reserved. Configuring the Database Archiving Mode.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
3 Copyright © 2006, Oracle. All rights reserved. Using Recovery Manager.
Configuring the Database Archiving Mode
Backup and Recovery Part 2. Online backup Online backup requirements: Backup all data files (file system copy) Backup all redo information produced during.
Overview of Oracle Backup and Recovery Darl Kuhn, Regis University.
18 Copyright © 2004, Oracle. All rights reserved. Backup and Recovery Concepts.
2 Copyright © 2007, Oracle. All rights reserved. Configuring for Recoverability.
2 Copyright © 2006, Oracle. All rights reserved. Configuring Recovery Manager.
18 Copyright © 2004, Oracle. All rights reserved. Recovery Concepts.
14 Copyright © 2005, Oracle. All rights reserved. Backup and Recovery Concepts.
20 Copyright © 2006, Oracle. All rights reserved. Best Practices and Operational Considerations.
9 Copyright © 2004, Oracle. All rights reserved. Incomplete Recovery.
14 Copyright © 2007, Oracle. All rights reserved. Backup and Recovery Concepts.
Oracle Standby Implementation Tantra Invedy. Standby Database Introduction Fail over Solution Disaster Recovery Solution if remote Ease of implementation.
1 Implementing Oracle Data Guard for the RLS database Kasia Pokorska CERN, IT-DB 30 th March 2004.
Maintaining Online Redo Log Files
Clonedb: The quick and easy cloning solution you never knew you had.
Oracle9i Database Administrator: Implementation and Administration
Workshop.
Performing Database Recovery
Chapter 5 The Redo Log Files.
Introduction.
Oracle Data Guard Broker Session-3
Presentation transcript:

Stephen D. Mund, OCP Database Administrator Nevada System of Higher Education System Computing Services

Introduction  Blackboard CE’s Section Backup.

Introduction  Blackboard CE’s Section Backup.  Using Oracle for backup of small datasets.

Introduction  Blackboard CE’s Section Backup.  Using Oracle for backup of small datasets.  Offloading I/O from backups.

Case Study  Great Basin College –  Uses Blackboard CE as a primary tool in their Distance Learning program serving students in over 62,000 sq. mi. of rural Nevada.

Case Study  Great Basin College –  Uses Blackboard CE as a primary tool in their Distance Learning program serving students in over 62,000 sq. mi. of rural Nevada.  Over 1200 courses are offered through the Blackboard CE solution.

Case Study  Great Basin College –  Uses Blackboard CE as a primary tool in their Distance Learning program serving students in over 62,000 sq. mi. of rural Nevada.  Over 1200 courses are offered through the Blackboard CE solution.  Section backup was performing nominally, issue was with the amount of coursework needed to be backed up.

Case Study (continued)  Server degradation in performance towards the end of each semester.

Case Study (continued)  Server degradation in performance towards the end of each semester.  Massive slowdown causing users to log off and log on later.

Case Study (continued)  Server degradation in performance towards the end of each semester.  Massive slowdown causing users to log off and log on later.  Hangs during both reading and writing.

Case Study (continued)  Server degradation in performance towards the end of each semester.  Massive slowdown causing users to log off and log on later.  Hangs during both reading and writing.  Caused by extensive I/O on the application server due to constant use of section backup.

Case Study (continued)  Server degradation in performance towards the end of each semester.  Massive slowdown causing users to log off and log on later.  Hangs during both reading and writing.  Caused by extensive I/O on the application server due to constant use of section backup.  Section Backup required use of the application.

Options Considered  Don’t use the section backup utility and use another database to perform Point-in-Time Recovery. Then use Section Backup to recovery the file.

Options Considered  Don’t use the section backup utility and use another database to perform Point-in-Time Recovery. Then use Section Backup to recovery the file.  PROS: Does not require ongoing use of storage. Relieves the I/O issue.

Options Considered  Don’t use the section backup utility and use another database to perform Point-in-Time Recovery. Then use Section Backup to recovery the file.  PROS: Does not require ongoing use of storage. Relieves the I/O issue.  CONS: Cumbersome requires time consuming restoration to another server. More involved, difficult to script.

Options Considered  Optimize Disk Performance. System currently uses RAID 5 could reconfigure to better RAID configuration like 1+0 or even RAID 1.

Options Considered  Optimize Disk Performance. System currently uses RAID 5 could reconfigure to better RAID configuration like 1+0 or even RAID 1.  PROS: I/O performance is increased at the storage level.

Options Considered  Optimize Disk Performance. System currently uses RAID 5 could reconfigure to better RAID configuration like 1+0 or even RAID 1.  PROS: I/O performance is increased at the storage level.  CONS: It is doubtful that the performance increase would be substantial enough. Requires application downtime to configure and is labor intensive.

Options Considered  Standby database with Flashback technology and roll forward.

Options Considered  Standby database with Flashback technology and roll forward.  PROS: Off loads most of the I/O leaving only a minor amount required for resynchronization.

Options Considered  Standby database with Flashback technology and roll forward.  PROS: Off loads most of the I/O leaving only a minor amount required for resynchronization.  CONS: Requires extensive ongoing disk space for flashback logs, archive logs and database files.

Options Considered  Delayed Standby database with Flashback technology

Options Considered  Delayed Standby database with Flashback technology  PROS: Has the advantages of previous option but uses about half the space.

Options Considered  Delayed Standby database with Flashback technology.  PROS: Has the advantages of previous option but uses about half the space.  CONS: More involved setup. Requires special configurations to use extra long delay. Space requirement is more than first two options and recoveries are more involved than third option.

Our Solution  Best solutions.

Our Solution  Best solutions.  Technical points option 3.

Our Solution  Best solutions.  Technical points option 3.  Our specific situation option 4.

Our Solution  Best solutions  Technical points option 3.  Our specific situation option 4.  Option 3 is the easiest to implement and simplest to maintain.

Our Solution  Best solutions  Technical points option 3.  Our specific situation option 4.  Option 3 is the easiest to implement and simplest to maintain.  Option 4 was the best solution for us. Because it uses least disk space.

Implementation  Set up a Standby database.

Implementation  Set up a Standby database.  This is well documented in Oracle® Data Guard Concepts and Administration 10g Release 2 (10.2) Part Number B ,Chapter 3, Chapter 12.7, Chapter 12.8 Appendix F.

Implementation  Remote Archivelog Destination.

Implementation  Remote Archivelog Destination.  Parameters ARCH.

Implementation  Remote Archivelog Destination.  Parameters ARCH.  DELAY=20160.

Implementation  Remote Archivelog Destination.  Parameters ARCH.  DELAY=  MAXIMUM PERFORMANCE.

Implementation  Standby Redo Logs.

Implementation  Standby Redo Logs.  Not needed in this application.

Implementation  Standby Redo Logs.  Not needed in this application.  Flashback Database.

Implementation  Standby Redo Logs.  Not needed in this application.  Flashback Database.  Only on the Standby Database.

Implementation  Special Configurations.

Implementation  Special Configurations.  Control file retention period greater than the amount of the delay (control_file_record_keep_time).

Implementation  Special Configurations.  Control file retention period greater than the amount of the delay (control_file_record_keep_time).  Archive logs will need to be maintained on both the primary and the standby for at least the delay time.

Implementation  Errors.

Implementation  Errors.  If you.see something akin to this in the alert log: FAL[client]: Failed to request gap sequence GAP - thread 1 sequence DBID branch FAL[client]: All defined FAL servers have been attempted.

Implementation  Errors.

Implementation  Errors.  Probably indicates that you are not keeping enough archive logs on the primary rather than keep time for the control file.

Implementation  Errors.  Probably indicates that you are not keeping enough archive logs on the primary rather that keep time for the control file.  You will need to manually ship the archived logs on the standby to the primary.

Implementation  Best Practice.

Implementation  Best Practice.  Place the standby database in it’s own home. During upgrades and patching you will need to patch the standby along with the primary.

Implementation  Best Practice.  Place the standby database in it’s own home. During upgrades and patching you will need to patch the standby along with the primary.  If you use a development or test home then you won’t be able to test the patch without upgrading everything.

Using the Recovery System  Recovery Overview.

Using the Recovery System  Recovery Overview. 1. Ascertain the last “known good” time. This would be provided by the end user. The time which the restore was known to be good.

Using the Recovery System  Recovery Overview. 2. Take the delayed standby out of the managed recovery mode. First you want to know the log that will be next applied to the standby. Use this query: SELECT NAME FROM V$ARCHIVED_LOG WHERE APPLIED = ‘NO’ AND SEQUENCE# = (SELECT SEQUENCE# FROM V$MANAGED_STANDBY WHERE PROCESS LIKE ‘MRP%’); Cancel the recovery on the standby and defer the log shipping on the primary. Make a Guaranteed Recovery Point so you can resynchronize the system.

Using the Recovery System  Recovery Overview. 3. Recover the standby to the correct point in time. RECOVER STANDBY DATABASE UNTIL TIME ‘ YYYY-MM-DD:HH24:MI:SS’;

Using the Recovery System  Recovery Overview. 3. Recover the standby to the correct point in time. RECOVER STANDBY DATABASE UNTIL TIME ‘ YYYY-MM-DD:HH24:MI:SS’; 4. Activate and open the standby database ALTER DATABASE ACTIVATE STANDBY DATABASE; ALTER DATABASE OPEN;

Using the Recovery System  Recovery Overview. 5. Point the application (development) to standby database. The ORACLE_SID of the development database needs to be change to the DB_UNIQUE_NAME of the standby database in the following files: CP-WebCTConnectionPool$PF-jdbc.xml CP-WebCTConnectionPool-jdbc.xml CP-WebCTJMSConnectionPool-jdbc.xml Then restart the applications server.

Using the Recovery System  Recovery Overview. 6. Run the section backup in order to extract the course backup.

Using the Recovery System  Recovery Overview. 6. Run the section backup in order to extract the course backup. 7. Restore the section backup on the production system.

Using the Recovery System  Resynchronization Overview.

Using the Recovery System  Resynchronization Overview. 1. Point the application (standby) to original database Change the following files back to the original sid: CP-WebCTConnectionPool$PF-jdbc.xml CP-WebCTConnectionPool-jdbc.xml CP-WebCTJMSConnectionPool-jdbc.xml.

Using the Recovery System  Resynchronization Overview. 1. Point the application (standby) to original database Change the following files back to the original sid: CP-WebCTConnectionPool$PF-jdbc.xml CP-WebCTConnectionPool-jdbc.xml CP-WebCTJMSConnectionPool-jdbc.xml. 2. Remount the database and flashback to original delay STARTUP FORCE MOUNT; FLASHBACK DATABASE TO RESTORE POINT ; ALTER DATABASE CONVERT TO PHYSICAL STANDBY; STARTUP FORCE MOUNT; Drop the restore point to avoid having too many flashback logs.

Using the Recovery System  Resynchronization Overview. 3. Manually register the next archivelog to pick up the delay information ALTER DATABASE REGISTER OR REPLACE LOGFILE ‘ ’;

Using the Recovery System  Resynchronization Overview. 3. Manually register the next archivelog to pick up the delay information. ALTER DATABASE REGISTER OR REPLACE LOGFILE ‘ ’; 4. Enable log shipping on the primary and restart the managed recovery.

Using the Recovery System  Resynchronization Overview. 3. Manually register the next archivelog to pick up the delay information. ALTER DATABASE REGISTER OR REPLACE LOGFILE ‘ ’; 4. Enable log shipping on the primary and restart the managed recovery. 5. Verify with log switch on the primary.

Using the Recovery System  Resynchronization Overview. 6. Check the standby alert log for appropriate response. You should see logs shipping virtually immediately and eventually you should see “Media Recovery Log” which means a log has applied.

Final Results

 Most notably was that GBC received NO complaints concerning access during the end of semesters.

Final Results  Most notably was that GBC received NO complaints concerning access during the end of semesters.  The change was so marked that it was commented on several times by the Web Campus administration team.

Final Results  Lisa Frasier, Director of Online Education and Curriculum Development at Great Basin stated: “Logon was really not the problem but the extreme lag time and problems with assignment errors, due to resource competition. Assignment errors has stopped almost completely. We haven’t had a help desk call complaining about slow server speed since the upgrade was completed.”

Final Results  Additional benefits are that GBC was able to extend testing of the latest major upgrade because of the reduction of performance problems made upgrading less crucial. Allowing added time for training of faculty and pilots for students.

Stephen D. Mund, OCP Database Administrator Nevada System of Higher Education System Computing Services

Stephen D. Mund, OCP Database Administrator Nevada System of Higher Education System Computing Services