Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper 32110.

Slides:

Advertisements

Similar presentations

Overview of performance tuning strategies Oracle Performance Tuning Allan Young June 2008.

Advertisements

The Architecture of Oracle

Acknowledgments Byron Bush, Scott S. Hilpert and Lee, JeongKyu

Automatic Storage Management The New Best Practice Steve Adams Ixora Rich Long Oracle Corporation Session id:

INTRODUCTION TO ORACLE DATABASE ADMINISTRATION Lynnwood Brown System Managers LLC Introduction – Lecture 1 Copyright System Managers LLC 2007 all rights.

Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data

Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank

1 - Oracle Server Architecture Overview

Harvard University Oracle Database Administration Session 2 System Level.

Module 14: Scalability and High Availability. Overview Key high availability features available in Oracle and SQL Server Key scalability features available.

Chapter 9 Overview  Reasons to monitor SQL Server  Performance Monitoring and Tuning  Tools for Monitoring SQL Server  Common Monitoring and Tuning.

Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.

Infiniband enables scalable Real Application Clusters – Update Spring 2008 Sumanta Chatterjee, Oracle Richard Frank, Oracle.

1 Storage Refinement. Outline Disk failures To attack Intermittent failures To attack Media Decay and Write failure –Checksum To attack Disk crash –RAID.

Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.

Selecting and Implementing An Embedded Database System Presented by Jeff Webb March 2005 Article written by Michael Olson IEEE Software, 2000.

Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.

CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.

IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.

Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.

Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.

Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.

DBMS Transactions and Rollback Recovery Helia / Martti Laiho.

1 Robert Wijnbelt Health Check your Database A Performance Tuning Methodology.

March 19981© Dennis Adams Associates Tuning Oracle: Key Considerations Dennis Adams 25 March 1998.

CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.

Data Warehousing at Acxiom Paul Montrose Data Warehousing at Acxiom Paul Montrose.

Architecture Rajesh. Components of Database Engine.

1 Moshe Shadmon ScaleDB Scaling MySQL in the Cloud.

Indiana University’s Name for its Sakai Implementation Oncourse CL (Collaborative Learning) Active Users = 112,341 Sites.

Oracle Tuning Considerations. Agenda Why Tune ? Why Tune ? Ways to Improve Performance Ways to Improve Performance Hardware Hardware Software Software.

1 Oracle Architectural Components. 1-2 Objectives Listing the structures involved in connecting a user to an Oracle server Listing the stages in processing.

Oracle Tuning Ashok Kapur Hawkeye Technology, Inc.

Data Dependent Routing may not be necessary when using Oracle RAC Ken Gottry Apr-2003 Through Technology Improvements in: Oracle 9i - RAC Oracle 9i - CacheFusion.

Oracle RAC and Linux in the real enterprise October, 02 Mark Clark Director Merrill Lynch Europe PLC Global Database Technologies October, 02 Mark Clark.

Backup and Recovery Overview Supinfo Oracle Lab. 6.

Oracle9i Performance Tuning Chapter 12 Tuning Tools.

Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.

Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.

11 Copyright © 2006, Oracle. All rights reserved. Checkpoint and Redo Tuning.

INTRODUCTION TO ORACLE DATABASE ADMINISTRATION Lynnwood Brown President System Managers LLC Introduction – Lecture 1 Copyright System Managers LLC 2003.

CERN Database Services for the LHC Computing Grid Maria Girone, CERN.

Oracle Cache Fusion Cache Fusion Concepts, Data Block Shipping, and Recovery with Cache Fusion.

CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.

Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.

CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,

Preface 1Performance Tuning Methodology: A Review Course Structure 1-2 Lesson Objective 1-3 Concepts 1-4 Determining the Worst Bottleneck 1-5 Understanding.

Copyright ©2003 Dell Inc. All rights reserved. Scaling-Out with Oracle® Grid Computing on Dell™ Hardware J. Craig Lowery, Ph.D. Software Architect and.

Oracle9i Performance Tuning Chapter 4 Tuning the Shared Pool Memory.

LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.

RAC aware change In order to reduce the cluster contention, a new scheme for the insertion has been developed. In the new scheme: - each “client” receives.

© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Database Growth: Problems & Solutions.

Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)

8 Copyright © 2006, Oracle. All rights reserved. Tuning the Shared Pool.

6 Copyright © Oracle Corporation, All rights reserved. Backup and Recovery Overview.

What is Oracle ? Oracle is a relational database management system. It is a management system which uses the relational data model. In the relational data.

SQL Advanced Monitoring Using DMV, Extended Events and Service Broker Javier Villegas – DBA | MCP | MCTS.

Oracle Database Architectural Components

1 PVSS Oracle scalability Target = changes per second (tested with 160k) changes per client 5 nodes RAC NAS 3040, each with one.

Chapter 21 SGA Architecture and Wait Event Summarized & Presented by Yeon JongHeum IDS Lab., Seoul National University.

CS 540 Database Management Systems

BDII Performance Tests

Oracle 11g Real Application Clusters Advanced Administration

Maximum Availability Architecture Enterprise Technology Centre.

Lecture 11: DMBS Internals

Overview of Real Application Clustering Features and Functionality

Performance And Scalability In Oracle9i And SQL Server 2000

Database administration

Presentation transcript:

Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper 32110

Agenda Why Oracle on Linux and RAC The Tests Scaling Performance Availability Choice of Interconnect Conclusion

Why Linux Lower Total Cost of Ownership Near commodity hardware and support Multiple O/S and hardware vendors Common platform (IA-32) for entire enterprise Unix look and feel New enterprise kernel No database conversions when changing Linux hardware or O/S

Why RAC on Linux Cost Ability to use near commodity systems (2-4 processors) Lower level of support needed on system units The need for availability Young and rapidly evolving O/S Near commodity hardware and support The need to scale database beyond 8 processors The need for large amounts of memory > 32GBytes

The Tests Real life workloads Not modified or partitioned to support RAC Used automatic space management Workload #1 Simple workload of small queries with little locking. Workload #2 Typical nasty workload with many inserts, updates and select for updates causing a lot of locking and blocking.

Workload #1 Single Instance Profile Load Profile ~~~~~~~~~~~~ Per Second Per Transaction Redo size: 77, , Logical reads: 4, Block changes: Physical reads: Physical writes: User calls: 11, Parses: Sorts: Executes: Transactions: % Blocks changed per Read: Recursive Call %: 0.68 Rollback per transaction %: 0.82 Rows per Sort: Top 5 Wait Events on a single instance Avg Total Wait wait Waits Event Waits Timeouts Time (s) (ms) /txn db file sequential read 560, , log file sync 180, log file parallel write 188, , latch free 87,584 6, db file parallel write 5,794 2,

Workload #2 Single Instance Profile Load Profile ~~~~~~~~~~~~ Per Second Per Transaction Redo size: 244, , Logical reads: 14, Block changes: 1, Physical reads: Physical writes: User calls: 2, Parses: Sorts: Executes: Transactions: % Blocks changed per Read: Recursive Call %: 4.16 Rollback per transaction %: 0.96 Rows per Sort: Top 5 wait events on a single instance Avg Total Wait wait Waits Event Waits Timeouts Time (s) (ms) /txn db file sequential read 346, , enqueue free buffer waits db file scattered read 141, log file sync 207,

The Hardware and Software Software Oracle Red Hat Advanced Server 2.1 (2.4.9-e.3) Hardware 3 types of clusters that each have 4 nodes 2 Pentium III Xeon 1.126GHz & 5 Gbytes of RAM 2 Pentium 4 Xeon DP 2.4GHz & 4 Gbytes of RAM 4 Pentium 4 Xeon MP 1.6GHz & 10 Gbytes of RAM Database files were on raw partitions

Scaling The ability to produce higher transactional volumes when adding additional processors or additional nodes.

Scaling of workload #1

Scaling of workload #2

Some workloads scale better

Some of the differences EventWaitsTime (s)%Total Elapsed Time CPU time2, global cache null to x62,6462, db file sequential read391,4741, buffer busy global cache15, log file sync158, Top 5 workload #1 timed events EventWaitsTime (s)%Total Elapsed Time global cache cr request1,324,75619, buffer busy global cache53,41111, enqueue38,79511, global cache null to x88,9086, CPU time5, Top 5 workload #2 timed events

Performance The time taken to perform a query is important Execution time influences transactional volume Can cause dramatic changes in the end user response time Stock Exchange Internet Retailer Bank Only you know what is reasonable for your database and application

Execution times for workload #1

Execution times for workload #2

Some ways to improve Make sure you database is well tuned for single instance operation Consider using different block sizes for hot indexes Hash partition hot tables and indexes Partition the workload

Availability Minimize failures by building clusters with as few single points of failure as possible. Setup your RAC cluster to recover from node and instance failure as quickly as possible.

Redundant RAC Configuration

Instance recovery time MTTR Target=120MTTR Target=240MTTR Target Not Set Cluster Reconfigured 222 Recovery Started91012 Redo Log First Pass 1113 Redo Log Second Pass Total Time fast_start_mttr_target is the key

Node failure recovery time Recovery Time= Failure detection + Instance recovery Failure detection = (MissCount * 1 second) MissCount parameter in found in cmcfg.ora When MissCount = 20 and fast_start_mttr_target=120 All workload #2 processing resumed in less than 1 minute after crashing a node.

Impact of a single node failure Node failedCluster Reconfigured CM ejects nodeRecovery Complete

Choice of Interconnect 1000Mbit (Gigabit) Ethernet Latency ~ 0.07 ms Transfer Rate MBytes per second More expensive but becoming common with the advent of gigabit over copper. 100Mbit Ethernet Latency ~ 0.20 ms Transfer Rate - 10 MBytes per second Common and inexpensive

100mbit vs. Gigabit

Conclusions RAC scaled at 90% on a simple workload RAC scaled consistently at 55+% on a complex workload There is an impact to query performance depending on your workload You can recover from failures in less than 1 minute When configured correctly a RAC cluster can scale, perform and be highly available.

A Q & Q U E S T I O N S A N S W E R S