Download presentation
Presentation is loading. Please wait.
Published byAngelica Garrison Modified over 9 years ago
2
How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id: 40164
3
3 From High Quality Parts to High Quality Systems Traditionally Low Cost = Low Quality High quality systems were built by combining high quality, high cost parts – Mainframe model Oracle enables a new model Oracle combines high volume inexpensive processors and storage to produce a high quality system Unbreakable Inexpensive Systems
4
4 Low Cost Fault Tolerance Grid Clusters Low Cost Fault Tolerance Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime
5
5 Commercial Grids and Availability Grid pools standard low cost nodes and modular disk arrays Perfect for RAC HA Failover can happen to any node on the grid Grid load balancing will redistribute load over time Designed to Tolerate Failures
6
6 New in 10g for Availability Integrated clusterware – Integrated, less moving parts, better tested – Avoid operator errors by reducing cross product coordination – Built in integration with Application Server for failing over connection pools Faster failover between nodes – Single digit seconds
7
7 New Economics for Data Protection & Recovery Disk Based Recovery Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime Trade cheap disk space for expensive downtime
8
8 New World: Disk Based Data Recovery Disk economics are close to tape Disk is better than tape – Random access to any data We rearchitected our recovery strategy to take advantage of these economics – Random access allows us to backup and recover just the changes to the database Backup and Recovery goes from hours to minutes 2000 ’ s - 200 GB 1980 ’ s - 200 MB 1000x increase
9
9 Resiliency using Low Cost Storage Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime Storage Failure Human Error Corruption Site Failure Four Failure Types
10
10 Data Mirroring with ASM ASM mirrors data across inexpensive modular storage arrays No additional logging or expensive NVRAM to recover mirrors – Database logging recovers mirrors Automatically remirrors when disk or array fails Designed to tolerate failures Failure Resiliency using Low Cost Storage
11
11 Collapsing the Cost of Human Error Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime Storage Failure Human Error Corruption Site Failure
12
12 Human Error Goal is to quickly analyze and repair – For Localized damage Need surgical analysis and repair Example – deleted wrong order – For Widespread damage Need complete back-out to avoid long downtime Example – batch job deletes this month’s orders Human Errors Other Downtime Single Biggest Cause of Downtime
13
13 Flashback Time Navigation Flashback Query – Query all data at point in time Flashback Versions Query – See all versions of a row between two times – See transactions that changed the row Flashback Transaction Query – See all changes made by a transaction Tx 1 Tx 2 Tx 3 Select * from Emp AS OF ‘2:00 P.M.’ where … Select * from Emp VERSIONS BETWEEN ‘2:00 PM’ and ‘3:00 PM’ where … Select * from DBA_TRANSACTION_QUERY where xid = ‘000200030000002D’;
14
14 Flashback Database A new strategy for point in time recovery Flashback Log captures old versions of changed blocks – Think of it as a continuous backup – Replay log to restore DB to time – Restores just changed blocks It’s fast - recover in minutes, not hours It’s easy - single command restore Flashback Database to ‘2:05 PM’ “Rewind” button for the Database Data FilesFlashback Log New Block Version Disk Write Old Block Version
15
15 Flashback Error Correction Recovery at all levels Database Level – Flashback Database restores the whole database to time Uses Flashback Logs Table Level – Flashback Table restores rows in a set of tables to time Uses UNDO in database – Flashback Drop restores a dropped table or a index Recycle bin for DROPs Row Level – Restore individual rows Uses Flashback Query Order Database Customer
16
16 Flashback for All Users END USER Flashback Query Flashback Versions Query DEVELOPER Flashback Versions Query Flashback Transaction Query Flashback Table DATABASE ADMIN Flashback Database Flashback Drop SYSTEM ADMIN Data Guard
17
17 Revolution in Recovery Flashback Revolutionizes Recovery – Operates on just the changed data – Time to correct error equals time to make error Minutes instead of hours Flashback is Easy – Single command instead of complex procedure Correction Time = Error Time + f(DB_SIZE)
18
18 Prevention & Recovery of Corruptions Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime Storage Failure Human Error Corruption Site Failure
19
19 Oracle End-to-end Data Validation H.A.R.D. – Hardware Assisted Resilient Data – Prevents corruption introduced in IO path between DB and storage Initially introduced in Oracle9iR2 10g HARD provides – Better checks – All file types & block sizes checked DB, log, archive, backup, etc. – A.S.M. enables HARD without using RAW devices Supported by major storage vendors A.S.M. Oracle SAN & Virtualization Blocks validated and Protection info added to block Protection info validated by storage device Volume Manager Operating System Host Bus Adapter SAN Interface Storage Device
20
20 Flash Recovery Area Fully automatic disk based backup and recovery – Set and Forget Nightly incremental backup rolls forward recovery area backup – Changed blocks are tracked in production DB Full scan is never needed – Dramatically faster (20x) – Blocks validated to prevent corruption of backup copy Use low cost ATA disk array for recovery area Two Independent Disk Systems Flash Recovery Area Nightly Apply Validated Incremental Weekly Archive To Tape Database Area
21
21 Low Cost No Compromise Disaster Recovery Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime Storage Failure Human Error Corruption Site Failure
22
22 Existing Site Recovery Tradeoffs User can delay log apply to protect from user errors but: – Failover takes hours – Reports run on hours old data After failing over to standby, production DB must be rebuilt – Production has updates that did not get to standby Production Database Standby Database Transaction Shipping 4 Hour Delay Apply Reporting On Delayed Data
23
23 Low Cost No Compromise Disaster Recovery Flashback DB removes need to delay apply of logs to correct errors Flashback DB removes the need to reinstantiate primary on failover Real-time log apply enables real-time reporting on standby Data Guard works transparently across GRID clusters – Standby can use fewer CPU resources than primary Flashback Log Production Database Standby Database Transaction Shipping (Real Time Apply) No Delay Reporting On Real Time Data Flashback Log Some Nodes Used for Other Computing
24
24 Highest Data Protection Lowest Cost Combine the Features to Achieve Any Level of Data Protection Dramatic Advances in Ease of Use ASM Mirroring Storage Failure Protection Flashback Human Error Protection Data Guard Site Failure Protection Flash Recovery Area Corruption Protection
25
25 Other Protection Enhancements Compression of archive logs and backups Automated failover to a previous backup when restore discovers a missing or corrupt backup Automated recovery through a previous point in time recovery - recovery through resetlogs Automated creation of new files during recovery Automated channel failover on backup or restore Automated tablespace point-in-time recovery Full DB “begin backup” command for faster mirror split Improved Recovery Parallelism (2 to 4 X) Tablespace Rename Proxy (third-party) Backup for archive logs Time window based throttling of backups
26
26 Other 10g Data Guard Enhancements SQL Apply Enhancements – Support for Longs – Support for multi-byte CLOBs and NCLOBs – Support for Index Organized Tables without overflow or LOB segments – Instantiation of logical standby with no quiesce of primary Generic Data Guard Enhancements – Data Guard Broker support for RAC
27
27 Allow any change to the system with no downtime Online Reconfiguration Goal No Cost System Changes Rolling Upgrades Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime
28
28 No Cost System Changes – Capacity on Demand CPU – Add/remove CPUs on SMP online Cluster Nodes – Add/remove cluster nodes online – No data movement needed Memory – Grow and shrink shared memory and buffer cache online – Auto tuning of memory online Disk – Add/remove disks online – Automatically rebalance – Move datafiles
29
29 Rolling Patch Upgrade using RAC Initial RAC ConfigurationClients on A, Patch B Oracle Patch Upgrades Operating System Upgrades Upgrade Complete Hardware Upgrades Clients Clients on B, Patch A Patch 12 34 AB AB A B AB B A
30
30 Rolling Release Upgrade using Data Guard Initial SQL Apply Config Run mixed to testSwitch to B, upgrade A Upgrade node B to X+1 Major Release Upgrades Patch Set Upgrades Cluster Software & Hardware Upgrades Clients Logs Ship Version X 1 Clients 2 Logs Ship 3 Clients Logs Ship 4 Upgrade Logs Queue XX+1 X
31
31 Online Redefinition Evolution without Interruption Competitive pressures demand continual change Need to change data with no interruption to the application – location, format, indexing, or even definition Goal No Cost Data Changes Computer Failures Data Failures Data Failures System Changes Data Changes Unplanned Downtime Planned Downtime
32
32 All indexing operations can be done online – Create new index, move index, defragment index Tables can be Reorganized & Redefined online – Table contents are copied to a new table Defragments and allows changing location, table type, partitioning – Contents can be transformed as they are copied Can change columns, types, sizes - specified using SQL “Select” Source Table Update Tracking TransformCopy Table Transform Updates GUI interface to make it Simple Result Table Continuous Queries & Updates Store Updates Online Redefinition
33
33 Online Redefinition Enhancements Enhanced Online Table Redefinition – Easy cloning of indexes, grants, constraints, etc. – Convert from long to LOB online – Allow unique index instead of primary key Change tables without recompiling stored procedures – Stored procedures can depend on the signature of a table instead of the table itself Online Segment Shrink – Return unused space within the blocks of a segment to the tablespace
34
34 Maximum Availability Architecture (MAA) Operational Practices are key – Technology alone is not enough MAA is a blueprint for achieving HA & DR – Tested, validated, and documented best practices Database, Storage, Cluster, Network 10 person year effort – otn.oracle.com/deploy/availability M.A.A. How to Prevent, Tolerate, & Recover From Outages Maximum Availability = Unbreakable Architecture + Best Practices
35
35 Highest Availability at Lowest Cost Highest Availability – Fault Tolerant Clusters – Flashback Error Correction – Automated Disk Backup – No Compromise Disaster Recovery – Rolling Upgrades – Online Redefinition At Lowest Cost – Low Cost Grid servers – Low Cost Modular Storage Arrays – Automated & Simple to Use Oracle10g is Unbreakable & Inexpensive
36
36 Next Steps High Availability Sessions from Oracle 11:00 AM How Oracle Database 10g Revolutionizes Availability and Enables the Grid 3:30 PM Oracle Recovery Manager (RMAN) 10g: Reloaded 5:00 PM Proven Techniques for Maximizing Availability 8:30 AM Oracle Database 10g - RMAN and ATA Storage in Action 11:00 AM Oracle Data Guard: Maximum Data Protection at Minimum Cost 1:00 PM Oracle Database 10g Time Navigation: Human-Error Correction 4:30 PM Data Guard SQL Apply: Back to the Future Wednesday in Moscone Room 304Tuesday in Moscone Room 304 For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/
37
37 Next Steps High Availability Sessions from Oracle 8:30 AM in Moscone Room 304 Oracle Database 10g Data Warehouse Backup and Recovery: Automatic, Simple, Reliable 8:30 AM in Moscone Room 104 Building RAC Clusters over InfiniBand Thursday For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/ Real Application Clusters Data Guard Database Backup & Recovery Flashback Recovery LogMiner, Online Redefinition, and Cross Platform Transportable Tablespaces Database HA Demos All Four Days In The Oracle Demo Campground
38
A Q & Q U E S T I O N S A N S W E R S
39
39 New Oracle Database 10g HA Features Clusters Portable Clusterware Cluster file system for Linux & Windows Automated Patching Data Guard SQL Apply Support for Longs Support for multi-byte CLOBs and NCLOBs Support for Index Organized Tables Simplified zero data loss failover Real time apply allows real time reporting Zero downtime instantiation Rolling Upgrades Rolling Upgrades Using Data Guard SQL Apply Online Redefinition Support of Unique Indexes One Step Cloning of Dependent Objects Columns can be Populated Using Sequences & Sysdate Signature Based Dependency Tracking Using Synonyms Online Segment Shrink Data Guard Generic Data Guard Broker support for RAC Named Data Guard Configurations Real Time Apply Flashback Standby Database Flashback Reinstantiation Improved Recovery Parallelism
40
40 New Oracle Database 10g HA Features Flash Backup & Recovery Automated Management of B&R Disk Space Simplified Backup Using Image Copy Change Aware Incremental Backups Incrementally Updated Backups Compressed archive logs Tuning Improved Recovery Parallelism Faster Instance Startup & Cache Warm Flashback Flashback Drop Flashback Row History Flashback Table Flashback Transaction History Flashback Database Better map of time to SCN for flashback query LogMiner Automated Specification of Logs to Mine Support for Shared Server Configurations Fine Grained Supplemental Logging Backup & Recovery Simplified Recovery Through Resetlogs Restore Tolerates Missing Backups Proxy Backup of Archives Automated TSPITR Instantiation Full DB Begin Backup Automated Backup Channel Failover Simplified RMAN cataloging of backup files Automated File Creation during Recovery Drop Database Rename Tablespace
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.