1 RAC Basics Julian Dyke Independent Consultant Web Version - February 2008 juliandyke.com © 2008 Julian Dyke.

Slides:



Advertisements
Similar presentations
Archive Task Team (ATT) Disk Storage Stuart Doescher, USGS (Ken Gacke) WGISS-18 September 2004 Beijing, China.
Advertisements

Tom Hamilton – America’s Channel Database CSE
ITEC474 INTRODUCTION.
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
INTRODUCTION TO ORACLE Lynnwood Brown System Managers LLC Oracle High Availability Solutions RAC and Standby Database Copyright System Managers LLC 2008.
Oracle9i Database Administrator: Implementation and Administration 1 Chapter 2 Overview of Database Administrator (DBA) Tools.
Introduction to DBA.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
High Availability 24 hours a day, 7 days a week, 365 days a year… Vik Nagjee Product Manager, Core Technologies InterSystems Corporation.
Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data
Smart Storage and Linux An EMC Perspective Ric Wheeler
1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.
Storage area Network(SANs) Topics of presentation
Lesson 1: Configuring Network Load Balancing
EIM April 19, Robin Weaver 13 Years with IBM Prior to Assignment at UNC Charlotte Range of Database Development/Data Management Projects and Products.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.
How to Cluster both Servers and Storage W. Curtis Preston President The Storage Group.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
© 2009 Oracle Corporation. S : Slash Storage Costs with Oracle Automatic Storage Management Ara Vagharshakian ASM Product Manager – Oracle Product.
1 © 2005 Julian Dyke Oracle 10.2 RAC New Features Julian Dyke Independent Consultant Web Version juliandyke.com.
High Availability & Oracle RAC 18 Aug 2005 John Sheaffer Platform Solution Specialist
1 Data Guard Basics Julian Dyke Independent Consultant Web Version - February 2008 juliandyke.com © 2008 Julian Dyke.
Storage Area Networks The Basics. Storage Area Networks SANS are designed to give you: More disk space Multiple server access to a single disk pool Better.
Increasing Manageability and Availability for SANs: SANPoint Foundation Suite HA.
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Module 10 Configuring and Managing Storage Technologies.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
Implementing Multi-Site Clusters April Trần Văn Huệ Nhất Nghệ CPLS.
Module 12: Designing High Availability in Windows Server ® 2008.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
Clustering  Types of Clustering. Objectives At the end of this module the student will understand the following tasks and concepts. What clustering is.
Best Practices for Backup in SAN/NAS Environments Jeff Wells.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Database Edition for Sybase Sales Presentation. Market Drivers DBAs are facing immense time pressure in an environment with ever-increasing data Continuous.
1 RAC Internals Julian Dyke Independent Consultant Web Version juliandyke.com © 2007 Julian Dyke.
1 Data Guard. 2 Data Guard Reasons for Deployment  Site Failures  Power failure  Air conditioning failure  Flooding  Fire  Storm damage  Hurricane.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Using NAS as a Gateway to SAN Dave Rosenberg Hewlett-Packard Company th Street SW Loveland, CO 80537
Copyright ©2003 Digitask Consultants Inc., All rights reserved Cluster Concepts Digitask Seminar November 29, 1999 Digitask Consultants, Inc.
Clustering In A SAN For High Availability Steve Dalton, President and CEO Gadzoox Networks September 2002.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
VMware vSphere Configuration and Management v6
High Availability in DB2 Nishant Sinha
Ashish Prabhu Douglas Utzig High Availability Systems Group Server Technologies Oracle Corporation.
Coupling Facility. The S/390 Coupling Facility (CF), the key component of the Parallel Sysplex cluster, enables multisystem coordination and datasharing.
Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Thomas Baus Senior Sales Consultant Oracle/SAP Global Technology Center Mail: Phone:
2 Copyright © 2006, Oracle. All rights reserved. RAC and Shared Storage.
Power Systems with POWER8 Technical Sales Skills V1
Storage Area Networks The Basics.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
High Availability 24 hours a day, 7 days a week, 365 days a year…
Maximum Availability Architecture Enterprise Technology Centre.
Cluster Communications
Introduction to Networks
Introduction to Networks
ASM-based storage to scale out the Database Services for Physics
Oracle10g RAC Service Architecture
Building continuously available systems with Hyper-V
Presentation transcript:

1 RAC Basics Julian Dyke Independent Consultant Web Version - February 2008 juliandyke.com © 2008 Julian Dyke

juliandyke.com © 2008 Julian Dyke 2 Agenda  Real Application Clusters  The Theory  The Reality

juliandyke.com © 2008 Julian Dyke 3 RAC The Theory

juliandyke.com © 2008 Julian Dyke 4 RAC Redundancy  Single Point of Failure  If component fails, system will be inaccessible  Redundancy  Duplicate components  If component fails another can be used  Active-Active or Active-Passive  Examples include  Power Supplies  RAID  Bonded Networks  IO Multipathing  Oracle RAC

juliandyke.com © 2008 Julian Dyke 5 RAC 4-node cluster Public Network Shared Storage Node 1 Instance 1 Node 2 Instance 2 Node 3 Instance 3 Node 4 Instance 4 Private Network (Interconnect) Storage Network

juliandyke.com © 2008 Julian Dyke 6 RAC Cache Coherency  RAC must ensure changes made by any instance  Are not overwritten by another instance  Maintain ACID properties  Current Blocks  Blocks can be updated by any instance  Only current version of a block can be updated  Only one current version of a block can exist across all instances  Consistent Read Blocks  Can have theoretically unlimited number of consistent versions of a block  in each instance  across all instances

juliandyke.com © 2008 Julian Dyke 7 RAC Cluster Manager  All clusters must have cluster management software  Manages node membership and evictions  Oracle Clusterware  Mandatory for RAC in Oracle 10.1 and above  Known as Cluster Ready Services (CRS) 10.1 only  Can be combined with vendor clusterware  IBM HA/CMP  HP ServiceGuard  Sun Cluster  Must be running before ASM/RDBMS instances can be started on a node  Can be used with non-RAC databases and applications  Oracle 10.2 and above

juliandyke.com © 2008 Julian Dyke 8 RAC Interconnect  Used for inter-node communication by:  Oracle Clusterware  ASM Instances  RDBMS Instances  Optimally high bandwidth / low latency  Typically 1GB Ethernet  Uses TCP / UDP protocols  NIC interfaces often bonded for availability  Other physical networks supported e.g. Infiniband

juliandyke.com © 2008 Julian Dyke 9 RAC Shared Storage  Required for:  Oracle Clusterware Files  Oracle Cluster Registry (OCR)  Voting Disk  Database Files  Control Files  Database  Online Redo Logs  Server Parameter File  Strongly recommended for  Archived redo logs  Backup copies

juliandyke.com © 2008 Julian Dyke 10 RAC Shared Storage  Can use:  Storage Area Network (SAN) e.g.:  EMC Clariion / Symmetrix  HP MSA / EVA / XP series  Hitachi  Fujitsu  Network Attached Storage (NAS) e.g.:  Network Appliance  Pillar Data System  Sun StorageTek  EMC Celerra  JBOD (with ASM)

juliandyke.com © 2008 Julian Dyke 11 RAC Shared Storage  Fibre Channel  SCSI protocol - block based  Normally 2Gb or 4Gb  Requires one or more Host Bus Adapters (HBA) per node  Requires fabric switches  iSCSI  SCSI protocol - block based  Packets sent over dedicated IP network  Can use standard network components  Processing often offloaded to NIC firmware  NFS  File-based  Uses standard network components

juliandyke.com © 2008 Julian Dyke 12 RAC Shared Storage  Cluster-aware File Systems:  Automatic Storage Management  Cluster File Systems  Oracle Cluster File System (OCFS/OCFS2)  Red Hat GFS  IBM GPFS  Sun Storedge QFS  Veritas CFS  Network File System  On supported Network Attached Storage only

juliandyke.com © 2008 Julian Dyke 13 RAC Automatic Storage Management (ASM)  Introduced in Oracle 10.1  Additional functionality in 10.2 and 11.1  Generic code (all supported platforms)  Available for both single-instance and RAC databases  Provides shared storage for RAC  Can optionally provide mirroring:  Normal Redundancy (mirrored)  High Redundancy (triple mirroring)  Useful with JBOD or extended clusters  Mandatory for Oracle 10g Standard Edition RAC  Presents storage as disk groups containing  Physical disks  Logical files  Requires additional ASM instance on each node

juliandyke.com © 2008 Julian Dyke 14 RAC Licensing  Standard Edition  RAC option free  Maximum two nodes  Maximum four CPUs  Must use Oracle Clusterware  Must use Automatic Storage Management (ASM)  No extended clusters  Enterprise Edition  RAC option 50% extra (per EE license)  No limit on number of nodes  No limit on number of CPUs  Can use any shared storage (ASM, CFS or NFS)  Can use Enterprise Manager Packs (Diagnostics, Tuning..)

juliandyke.com © 2008 Julian Dyke 15 Node 1Node 2 RAC Process Architecture Clusterware OPROCDOCSSDCRSDEVMD +ASM1 PMONSMONLGWRDBWnARCH LMONLCK0LMD0LMSnDIAG PROD1 PMONSMONLGWRDBWnARCH LMONLCK0LMD0LMSnDIAG Clusterware OPROCDOCSSDCRSDEVMD +ASM2 PMONSMONLGWRDBWnARCH LMONLCK0LMD0LMSnDIAG PROD2 PMONSMONLGWRDBWnARCH LMONLCK0LMD0LMSnDIAG

juliandyke.com © 2008 Julian Dyke 16 RAC Reasons For Deployment  Availability  Node failure  Instance failure  Scalability  Distribute workload across multiple instances  Scale out  Manageability  Economies of scale  Administration / Monitoring / Backups / Standby  Reduction in total cost of ownership  Database consolidation  Commodity hardware

juliandyke.com © 2008 Julian Dyke 17 RAC Availability  Ensure continued availability of database in event of node or instance failure  Automatic failover  No human intervention required  In the event of node or instance failure:  All sessions connected to failed node are terminated  Sessions connected to remaining nodes are  temporarily suspended while resources are re-mastered  resume after brown-out period  New sessions will be connected to remaining nodes only  Ensuring availability requires spare capacity during normal operations  Either additional node  Or reduction in service level

juliandyke.com © 2008 Julian Dyke 18 RAC Availability Public Network Shared Stoage Node 1 Instance 1 Node 2 Instance 2 Node 3 Instance 3 Node 4 Instance 4 Private Network (Interconnect) Storage Network

juliandyke.com © 2008 Julian Dyke 19 RAC Scalability  Workload can be distributed across multiple nodes  Workload can be balanced across all nodes using connection management  Client-side using Oracle Net  Server-side using listener processes  Workload can be directed to specific nodes using services  Level of scalability dependent on application Resources Throughput

juliandyke.com © 2008 Julian Dyke 20 RAC Scalability  Factors that can degrade scalability  Excessive parsing  Consistent reads  SELECT FOR UPDATE / user defined locking  DDL  Object-oriented code  Features that can improve scalability  Services  Automatic Segment Space Management  Partitioning  Sequences  Reverse indexes

juliandyke.com © 2008 Julian Dyke 21 RAC Manageability  Advantages  Consolidation  Economies of scale  Administration  Monitoring  Backup and recovery  Standby database  Disadvantages  Increased Planned downtime  Complexity  Dependencies  Skills

juliandyke.com © 2008 Julian Dyke 22 RAC Total Cost of Ownership  Benefits  Lower hardware costs - commodity hardware  Lower support costs  Management economies of scale  Costs  Redundant hardware  Servers, Storage, NIC, HBA, Switches, Fabric  Oracle licenses  Experienced staff  Application modifications

juliandyke.com © 2008 Julian Dyke 23 RAC Applications  Most applications should run on RAC without modification  Performance is not guaranteed  Applications that perform well in single-instance have best chance of scaling in RAC  Applications performing badly in single-instance will perform worse in RAC  Some features do not port easily to RAC e.g.:  DBMS_ALERT, DBMS_PIPE, External files  Applications that can be logically partitioned tend to scale best  Minimize use of interconnect  Maximize use of buffer caches  Implementation more likely to succeed if you have direct or indirect access to source code

juliandyke.com © 2008 Julian Dyke 24 RAC Database Services  Allow sessions with similar workload characteristics to be logically grouped and managed  Services can be assigned to  set of preferred instances - used if available  set of available instances - used if preferred instances not available  failover to available instances is automatic  failback to preferred instances is manual  Services can be configured to maximize instance affinity  Limited statistics reported at service level  Can also be reported at service / module / action level  Trace can be enabled at service level  Can also be enabled at service / module / action level

juliandyke.com © 2008 Julian Dyke 25 RAC Database Services Listener1 PROD1 Listener2 PROD2 PROD1PROD2 SERVICE1PREFERREDAVAILABLE SERVICE1 Listener1 PROD1 Listener2 PROD2 SERVICE1 AfterBefore

juliandyke.com © 2008 Julian Dyke 26 RAC Extended Clusters  Currently the Holy Grail of high availability  RAC nodes located at physically separate sites  Implicit disaster recovery  Requires Enterprise Edition licences + RAC option  In the event of a site failure, database is still available  Storage is duplicated at each site  Can use ASM or vendor-supplied storage technology  Active / Active configuration  Users can access database via either site  Configuration and performance tuning are complex  Cache fusion traffic between sites

juliandyke.com © 2008 Julian Dyke 27 RAC Extended Clusters Storage Network Public Network Database Storage Network Site2Site1 Private Network Quorum Site3 Instance 1 Node 1 Instance 2 Node 2 Database

juliandyke.com © 2008 Julian Dyke 28 RAC Disaster Recovery  Data Guard and RAC are fully compatible  Can configure any permutation e.g. PrimaryStandby Single-instanceSingle instance RACSingle instance RAC Single instanceRAC  All instances can participate in redo log shipping  Only one instance can perform managed recovery  Standby database might be a potential bottleneck

juliandyke.com © 2008 Julian Dyke 29 RAC Alternatives  Single Instance Databases  No RAC overhead  Simpler to install / configure / manage  Single point of failure  Oracle Products  Oracle Streams  Oracle Clusterware  Proprietary Clustering Solutions  HP ServiceGuard  IBM HA/CMP  Sun Cluster

juliandyke.com © 2008 Julian Dyke 30 RAC The Reality

juliandyke.com © 2008 Julian Dyke 31 RAC The Reality  Many sites running RAC  Mostly Oracle 10.2  A few still running Oracle 10.1  Still some Oracle 9.2  Most RAC users develop their own applications or use bespoke applications developed by a third-party  Probably around 20 extended clusters in production across Europe  Many Oracle 10.2 sites run ASM  Very few run OCFS or raw devices  Very few use third-party cluster file systems  Most sites using SAN - fewer using NAS  In UK most users currently deploy on Linux x86-64  Solaris very popular in other regions

juliandyke.com © 2008 Julian Dyke 32 RAC The Reality  Few Oracle 10g users run vendor clusterware  Most RAC deployments for availability  Decreased unplanned downtime  Increased planned downtime  Increasing number of deployments for scalability  Workload balancing  Services  Manageability benefits very doubtful  Economies of Scale versus Additional complexity  TCO reductions possible in some circumstances  Replace large SMP boxes  Replace legacy active-passive clusters

juliandyke.com © 2008 Julian Dyke 33 RAC The Reality  Most users run 2-node clusters  Some have 3-node or 4-node clusters  A handful run five nodes or more  Most users only have one database per cluster  Few grids  Oracle Clusterware scales well  Number of nodes does not impact performance  Oracle RAC databases might scale well  Dependent on application  Additional nodes may improve or degrade performance

juliandyke.com © 2008 Julian Dyke 34 RAC The Reality  ASM currently the most popular RAC storage technology  Deployed in numerous Oracle 10.2 RAC production systems  No operating system utilities  ASMCMD in Oracle 10.2 and above  Generally disliked by storage administrators  Too much control to DBAs  Acceptable performance  ASM instance provides metadata  RDBMS instances read and write blocks directly from files

juliandyke.com © 2008 Julian Dyke 35 Thank you for your interest  References   Questions 