RAC parameter tuning for remote access Carlos Fernando Gamboa, Brookhaven National Lab, US Frederick Luehring, Indiana University, US Distributed Database.

Slides:



Advertisements
Similar presentations
Client/Server Computing (the wave of the future) Rajkumar Buyya School of Computer Science & Software Engineering Monash University Melbourne, Australia.
Advertisements

© 2003 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Performance Measurements of a User-Space.
Current Testbed : 100 GE 2 sites (NERSC, ANL) with 3 nodes each. Each node with 4 x 10 GE NICs Measure various overheads from protocols and file sizes.
DB server limits (process/sessions) Carlos Fernando Gamboa, BNL Andrew Wong, TRIUMF WLCG Collaboration Workshop, CERN Geneva, April 2008.
© 2010 VMware Inc. All rights reserved Confidential Performance Tuning for Windows Guest OS IT Pro Camp Presented by: Matthew Mitchell.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
Performance Analysis of Orb Rabin Karki and Thangam V. Seenivasan 1.
Peer-to-peer Multimedia Streaming and Caching Service Jie WEI, Zhen MA May. 29.
Mapping a Network by Latency (and other things) Client connecting to , UDP port.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
Module 1: Database and Instance. Overview Defining a Database and an Instance Introduce Microsoft’s and Oracle’s Implementations of a Database and an.
BNL Oracle database services status and future plans Carlos Fernando Gamboa RACF Facility Brookhaven National Laboratory, US Distributed Database Operations.
© 2011 IBM Corporation 11 April 2011 IDS Architecture.
Process-to-Process Delivery:
KEK Network Qi Fazhi KEK SW L2/L3 Switch for outside connections Central L2/L3 Switch A Netscreen Firewall Super Sinet Router 10GbE 2 x GbE IDS.
INTRODUCTION TO ORACLE Lynnwood Brown System Managers LLC Client-Server – Lecture 4 Copyright System Managers LLC 2007 all rights reserved.
5 Copyright © 2007, Oracle. All rights reserved. Configuring the Oracle Network Environment.
Oracle 10g Administration Oracle Shared Server Copyright ©2006, Custom Training Institute.
IRODS performance test and SRB system at KEK Yoshimi KEK Building data grids with iRODS 27 May 2008.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Slide 1 DESIGN, IMPLEMENTATION, AND PERFORMANCE ANALYSIS OF THE ISCSI PROTOCOL FOR SCSI OVER TCP/IP By Anshul Chadda (Trebia Networks)-Speaker Ashish Palekar.
Protocol(TCP/IP, HTTP) 송준화 조경민 2001/03/13. Network Computing Lab.2 Layering of TCP/IP-based protocols.
Boosting Event Building Performance Using Infiniband FDR for CMS Upgrade Andrew Forrest – CERN (PH/CMD) Technology and Instrumentation in Particle Physics.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 29 – Buffer Management (Part 2) Klara Nahrstedt Spring 2012.
Oracle Tuning Ashok Kapur Hawkeye Technology, Inc.
Network Tests at CHEP K. Kwon, D. Han, K. Cho, J.S. Suh, D. Son Center for High Energy Physics, KNU, Korea H. Park Supercomputing Center, KISTI, Korea.
1 Wenguang WangRichard B. Bunt Department of Computer Science University of Saskatchewan November 14, 2000 Simulating DB2 Buffer Pool Management.
6 Copyright © 2009, Oracle. All rights reserved. Configuring the Oracle Network Environment.
Oracle 10g Database Administrator: Implementation and Administration Chapter 2 Tools and Architecture.
Data transfer over the wide area network with a large round trip time H. Matsunaga, T. Isobe, T. Mashimo, H. Sakamoto, I. Ueda International Center for.
10GE network tests with UDP
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
DONE-08 Sizing and Performance Tuning N-Tier Applications Mike Furgal Performance Manager Progress Software
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Srihari Makineni & Ravi Iyer Communications Technology Lab
Chapter 2 Applications and Layered Architectures Sockets.
Operation System Support for Multi-User, Remote, Graphical Interaction Alexander Ya-li Wong, Margo Seltzer Harvard University, Division of Engineering.
Atlas LCG 3D Oracle cluster migration strategy at BNL Carlos Fernando Gamboa On behalf of database group Grid Group, RACF Facility, Brookhaven National.
DYNES Storage Infrastructure Artur Barczyk California Institute of Technology LHCOPN Meeting Geneva, October 07, 2010.
Performance measurement with ZeroMQ and FairMQ
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
DB Questions and Answers open session Carlos Fernando Gamboa, BNL WLCG Collaboration Workshop, CERN Geneva, April 2008.
Online-Offsite Connectivity Experiments Catalin Meirosu *, Richard Hughes-Jones ** * CERN and Politehnica University of Bucuresti ** University of Manchester.
1 MSRBot Web Crawler Dennis Fetterly Microsoft Research Silicon Valley Lab © Microsoft Corporation.
Mr. P. K. GuptaSandeep Gupta Roopak Agarwal
Lecture 4 Mechanisms & Kernel for NOSs. Mechanisms for Network Operating Systems  Network operating systems provide three basic mechanisms that support.
BNL Oracle database services status and future plans Carlos Fernando Gamboa, John DeStefano, Dantong Yu Grid Group, RACF Facility Brookhaven National Lab,
Module 16: Distributed System Structures Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Apr 4, 2005 Distributed.
5 Copyright © Oracle Corporation, All rights reserved. Usage and Configuration of the Oracle Shared Server.
Usage and Configuration of the Oracle Shared Server Supinfo Oracle Lab. 5.
INDIANAUNIVERSITYINDIANAUNIVERSITY Tsunami File Transfer Protocol Presentation by ANML January 2003.
CS Spring 2011 CS 414 – Multimedia Systems Design Lecture 24 – Client-Server Buffer Management Klara Nahrstedt Spring 2011.
Deployment and operations of a high availability infrastructure for relational databases in a heterogeneous Tier 1 workload environment Carlos Fernando.
Trojan Express Network II Goal: Develop Next Generation research network in parallel to production network to address increasing research data transfer.
Connect communicate collaborate Performance Metrics & Basic Tools Robert Stoy, DFN EGI TF, Madrid September 2013.
CSEN 404 Introduction to Networks Amr El Mougy Lamia AlBadrawy.
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
G. Russo, D. Del Prete, S. Pardi Frascati, 2011 april 4th-7th The Naples' testbed for the SuperB computing model: first tests G. Russo, D. Del Prete, S.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
iperf a gnu tool for IP networks
Speaker : Che-Wei Chang
Oracle 11g Real Application Clusters Advanced Administration
Abhishek Singh Rana UC San Diego
Transport Protocols over Circuits/VCs
Conditions Data access using FroNTier Squid cache Server
Oracle Streams Performance
Process-to-Process Delivery:
Scalable Database Services for Physics: Oracle 10g RAC on Linux
Process-to-Process Delivery: UDP, TCP
Presentation transcript:

RAC parameter tuning for remote access Carlos Fernando Gamboa, Brookhaven National Lab, US Frederick Luehring, Indiana University, US Distributed Database Operations Workhop CERN Geneva, November 2008

Outline Introduction Overview to key OS/Database network parameters Case overview Tests Results Conclusions 1Carlos Fernando Gamboa

Introduction -Database Service- COND. DB USER Site 1 USER Site 1 USER Site 2 USER Site 2 USERT Site N USERT Site N WAN LAN LOCAL USER DB DATA SOURCE SITE DB DATA SOURCE SITE 2Carlos Fernando Gamboa

Introduction - tuning strategy - -Tuning SQLnet: Main function of SQLnet: -to read and write to the TCP send/receive buffers -to distribute and retrieve data to and from an application such as SQL*Plus and the Oracle Database. -Can be done at application level: Setting the ARRAY Fetch Size in the Application (Metalink doc id. Note: ) 3Carlos Fernando Gamboa

Overview network parameters - Operative system - Kernel parameters: Over 521 different parameter around 365 are tcp parameters Focus on TCP kernel parameters for a kernel level 2.6, specially on: tcp_moderate_rcvbuf : If setup as 1 activates autotuning. Thus the receiver buffer size and TCP window size is dynamically updated per connection basis. Memory per connection allocation parameters: – tcp_rmem: Memory allocated for TCP rcv buffers – tcp_wmem: Memory allocated for TCP snd buffers 4Carlos Fernando Gamboa

Overview network parameters - Operative system- Memory per connection allocation parameters: – tcp_rmem: Memory allocated for TCP rcv buffers. net.ipv4.tcp_rmem = minimum default maximum – tcp_wmem: Memory allocated for TCP snd buffers net.ipv4.tcp_wmem = minimum default maximum Used to set constrains to autotune and controls memory usage under memory stress. Maximum buffer size for socket buffer declared via the SO_SNDBUF and SO_RECVBF that application can request can be constrained with: net.core.rmem_max net.core.wmem_max 5Carlos Fernando Gamboa

Overview network parameters - Operative System- ParameterMindefaultMax tcp_moderate_rcvbuf1 net.ipv4.tcp_rmem4k *2 net.ipv4.tcp_wmem4k16K128K net.core.rmem_max net.core.wmem_max Table 1. Summary default values RHEL 4 kernel level 2.6 6Carlos Fernando Gamboa

Overview network parameters -Oracle database- Key Network files on oracle listener.ora (server): Contains information related to listening protocol addresses, about supported services, and parameters that control its Listener process runtime behavior. sqlnet.ora (client, server) : contains the parameters that specify preferences for how a client or server uses Oracle’s Network protocol features. tnsnames.ora (client,server) : Maps net services names to connection descriptions. This parameters can be defined as well on this file; Example configuration on 3D twiki Network FileParameter sqlnet.oraDEFAULT_SDU_SIZERECV_BUF_SIZESEND_BUF_SIZE tnsnames.ora Listener.ora SDU_SIZERECV_BUF_SIZESEND_BUF_SIZE 7Carlos Fernando Gamboa

Overview network parameters - Oracle database - Session Data Unit (SDU). – Allows limited control over the packet sizes sent to the NT layer. – Possible values: 512 to bytes, 2048 default. – Minimizing overhead adjust it to the Maximum Segment Size (MSS) of the network protocol being used. Thus, MSS=Maximum Transmission Unit – (TCP and IP) header size =1500 (Ethernet) -20 bytes (TCP) – 20 IP =1460 bytes – Negotiated by client and server for data retrieval. Minimum value when client and server differ 8Carlos Fernando Gamboa

Overview network parameters - Oracle database- RECV_BUF_SIZE SEND_BUF_SIZE -Alters the TCP send and receive windows. -Not setting this parameters OS buffers sizes will be used -This parameters depends on: -Network latency among client and database -SDU size negotiated - OS TCP kernel parameters (previously mentioned) _SEND/RCV_BUFFER proportional to the Band Product Delay (BDP) BDP is proportional to the network latency and the bandwidth -From Metalink doc “Oracle does not recommend, suggest or dictate any values for these Parameters” Neither do I… 9Carlos Fernando Gamboa

Hardware used on this test -general specification - INDIANA CLIENT CPU Intel(R) Xeon(TM) 4 CORES 2.80GHz MEMORY: 5GB OS: RHEL 4 Kernel level 2.6 NIC: 1000Gb/s BNL ORACLE CLUSTER DATABSE server 2 No des CPU= 2 dual core 3GHz, 64 bits Architecture Memory: 9GB SGA, 16GB RHEL 4 ES kernel level 2.6 NICs= 1000Gb/s Dual controller 512MB cache per controller, 4Gbps FCP Physical disks available:2 arrays of 10 disks each. Disk type: IBM SAS 300GBDrive type: Serial Attached SCSI (SAS) Capacity: 300GBSpeed: 15Krpm 10Carlos Fernando Gamboa

Sysctl.Conf -window r/w parameters- BNL Cond DB. ParametersminDefaultMax net.ipv4.tcp_rmem net.ipv4.tcp_wmem net.core.rmem_default net.core.rmem_max net.core.wmem_default net.core.wmem_max Indiana Client ParametersminDefaultMax net.ipv4.tcp_rmem 4k *2 net.ipv4.tcp_wmem 4k16K128K net.core.rmem_default net.core.rmem_max net.core.wmem_default net.core.wmem_max Parameter (Tuned for Streams Data Replication) SDU_SIZERECV_BUF_SIZESEND_BUF_SIZ E Carlos Fernando Gamboa

Sysctl.Conf -window r/w parameters- BNL Cond DB. ParametersminDefaultMax net.ipv4.tcp_rmem net.ipv4.tcp_wmem net.core.rmem_default net.core.rmem_max net.core.wmem_default net.core.wmem_max ParametersminDefaultMax net.ipv4.tcp_rmem 4k *2 net.ipv4.tcp_wmem 4k16K128K net.core.rmem_default net.core.rmem_max net.core.wmem_default net.core.wmem_max CERN Cond DB. Tuned for Streams data replication from CERN 12Carlos Fernando Gamboa

Problem overview -Reconstruction Job- Observation from client side Long execution time observed at client side when running Reco job. RECO jobs last for minutes. Different thread database activity. Not high load on client side observed Reco Job resolultion Database Server observation 7 Threads connected to the database. 1 Thread on constant activity ( sending/receiving data, about 30MB). Longest idle time observe was 37 minutes. Not database locks observed Not database load at job execution Data retrieved from DB cache RTT from BNL DB to client 30ms CLIENTBNL COND DBCERN COND DB BNL Worker Node ~3 minutes CERN Worker Node ~3 minutes 13Carlos Fernando Gamboa

Observation from Database side Trace files show significant wait times events SQL*Net message from client Different “select count(*) ……” queries were found as well. 14Carlos Fernando Gamboa

Monitoring -OEM- 15Carlos Fernando Gamboa

Idle times during job execution Monitoring idle time activity per job run 16Carlos Fernando Gamboa Idle state

IPERF tests - First set of IPERF tests - TestTCP window (MB) Time (seconds) Results (MBytes) Troughput Mbits/s IPERF warning message on every test result TCP window size: 256 KByte (WARNING: requested 2.00 MByte) IPERF client at IDIANA to IPERF server at BNL database cluster (node 1) Indiana parameters OS kernel parameters 17Carlos Fernando Gamboa

IPERF tests - Second set of IPERF tests - TestTCP window (MB) Time (seconds) Result (MBytes) Troughput (Mbits/s) IPERF warning message on last test result TCP window size: 32.0 MByte (WARNING: requested 16.0 MByte) Indiana new set of OS kernel parameters tcp parameters IPERF client at IDIANA to IPERF server at BNL database cluster after tuning 18Carlos Fernando Gamboa

IPERF tests -Third set of IPERF tests - TestTCP window (MB) Time sec Result (MBytes) Troughput Mbits/s IPERF warning message on last test result TCP window size: 32.0 MByte (WARNING: requested 16.0 MByte) Indiana new set of OS kernel parameters tcp parameters IPERF client at IDIANA to IPERF server at BNL database cluster after tuning 19Carlos Fernando Gamboa

TEST RESULTS Date TestsTest / tnsnames.ora SDU (Bytes) SND_BUFFER (Bytes) RCV_BUFFER (Bytes) Finish Time Reco Job (mm:ss:ms) Max Idle observed minutes 10/28/08Prior tuning2048OS 35:00-40:00~ /28/08Job default (DBrelease) 2048OS 19:49.25~18 10/28/ :09:75~ /28/ :14:21~15 11/04/ :29.98~18 11/04/ :20:83~16 11/04/ :08:95~17 11/06/ OS 21:05.31~17:15 11/06/ :30:22~ /06/ :45:22~15 20Carlos Fernando Gamboa

Conclusions The Indiana Client TCP kernel configuration affected the resolution time when retrieving data from BNL database. Network latency affected Job time resolution. Although the initial idle time was considerably reduced there still a significant overhead of idle time which for my simple TRT histogram job seems to be ~15 minutes after tuning of the client side. Contact your local Network admin. 21Carlos Fernando Gamboa

Acknowledgements Many thanks to: THOM SALUKE Indiana University 22Carlos Fernando Gamboa

References Metalink Doc. – – – – Linux Man page: -TCP -sysctl.conf Oracle Database 10g Real Application Clusters Handbook, McGraw Hill Osborne Media; 1 edition (November 22, 2006) Online documentation Oracle database concepts Carlos Fernando Gamboa