Case studies – Atlas and PVSS Oracle archiver

Slides:



Advertisements
Similar presentations
Extreme Performance with Oracle Data Warehousing
Advertisements

Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005.
Tuning a Very Large Data Warehouse Pichai Bala. About Me Working in the IT industry for the past 17 years Working in Oracle since Working in Data.
Exadata Distinctives Brown Bag New features for tuning Oracle database applications.
DB server limits (process/sessions) Carlos Fernando Gamboa, BNL Andrew Wong, TRIUMF WLCG Collaboration Workshop, CERN Geneva, April 2008.
Delphix User Experience
IS 4420 Database Fundamentals Chapter 6: Physical Database Design and Performance Leon Chen.
Harvard University Oracle Database Administration Session 2 System Level.
07-June Database futures workshop, June 2011 CERN Enhance the ATLAS database applications by using the new Oracle 11g features Gancho Dimitrov.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
1 Physical Data Organization and Indexing Lecture 14.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
Oracle9i Database Administrator: Implementation and Administration 1 Chapter 9 Index Management.
Online Database Support Experiences Diana Bonham, Dennis Box, Anil Kumar, Julie Trumbo, Nelly Stanfield.
Luca Canali, CERN Dawid Wojcik, CERN
1 Robert Wijnbelt Health Check your Database A Performance Tuning Methodology.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
March 19981© Dennis Adams Associates Tuning Oracle: Key Considerations Dennis Adams 25 March 1998.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Table Compression in Oracle9i R2 Plamen Zyumbyulev INSIDE OUT,, Let someone k n o w ”
Oracle Tuning Ashok Kapur Hawkeye Technology, Inc.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
D Copyright © Oracle Corporation, All rights reserved. Loading Data into a Database.
08-Nov Database TEG workshop, Nov 2011 ATLAS Oracle database applications and plans for use of the Oracle 11g enhancements Gancho Dimitrov.
CERN IT Department CH-1211 Geneva 23 Switzerland t Oracle Tutorials CERN June 8 th, 2012 Performance Tuning.
University of Sunderland COM 220 Lecture Ten Slide 1 Database Performance.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
SQL/Lesson 7/Slide 1 of 32 Implementing Indexes Objectives In this lesson, you will learn to: * Create a clustered index * Create a nonclustered index.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Copyright 2007, Information Builders. Slide 1 Machine Sizing and Scalability Mark Nesson, Vashti Ragoonath June 2008.
CERN IT Department CH-1211 Genève 23 Switzerland t DBA Experience in a multiple RAC environment DM Technical Meeting, Feb 2008 Miguel Anjo.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Chapter 5 Index and Clustering
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Database Performance Eric Grancher - Nilo Segura Oracle Support Team IT/DES.
11-Nov Distr. DB Operations workshop - November 2008 The PVSS Oracle DB Archive in ATLAS ( life cycle of the data ) Gancho Dimitrov (LBNL)
ORACLE & VLDB Nilo Segura IT/DB - CERN. VLDB The real world is in the Tb range (British Telecom - 80Tb using Sun+Oracle) Data consolidated from different.
Copyright © 2009 Rolta International, Inc., All Rights Reserved Michael R. Messina, Management Consultant Rolta-TUSC, Oracle Open World 2009 (60 min) ID#:
RAC aware change In order to reduce the cluster contention, a new scheme for the insertion has been developed. In the new scheme: - each “client” receives.
Database Systems, 8 th Edition SQL Performance Tuning Evaluated from client perspective –Most current relational DBMSs perform automatic query optimization.
October 15-18, 2013 Charlotte, NC Accelerating Database Performance Using Compression Joseph D’Antoni, Solutions Architect Anexinet.
DB Questions and Answers open session (comments during session) WLCG Collaboration Workshop, CERN Geneva, 24 of April 2008.
Oracle Announced New In- Memory Database G1 Emre Eftelioglu, Fen Liu [09/27/13] 1 [1]
1 PVSS Oracle scalability Target = changes per second (tested with 160k) changes per client 5 nodes RAC NAS 3040, each with one.
The PVSS Oracle Archiver FW WG 6 th July Credits Many people involved IT/DES: Eric Grancher, Nilo Segura, Chris Lambert IT/PSS: Luca Canali ALICE:
Eric Grancher CERN IT department Overview of Database Technologies Computing and Astroparticle Physics 2 nd ASPERA Workshop /1.
Scaling HDFS to more than 1 million operations per second with HopsFS
DCS Status and Amanda News
Table General Guidelines for Better System Performance
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Backup & Recovery of Physics Databases
Physical Database Design
Running virtualized Hadoop, does it make sense?
PVSS Evolution in Relation to Databases
Elizabeth Gallas - Oxford ADC Weekly September 13, 2011
Physical Database Design and Performance
Exadata for Oracle DBAs
Database Performance Tuning and Query Optimization
Oracle Storage Performance Studies
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Instant Add Columns in MySQL
ASM-based storage to scale out the Database Services for Physics
Data Lifecycle Review and Outlook
Oracle Streams Performance
Oracle Solutions for Data Archiving
Table General Guidelines for Better System Performance
Chapter 11 Database Performance Tuning and Query Optimization
Presentation transcript:

Case studies – Atlas and PVSS Oracle archiver Luca Canali, IT-DM Database Developers’ Workshop July 8th, 2008

PVSS Oracle Archiver Commercial SCADA system CERN use-case for the PVSS Oracle archiver has required several optimizations In particular PVSS ‘out of the box’ in 2005 could insert at about ~100Hz Requirement from the experiments: 3 orders of magnitude higher (at peak time)

PVSS Optimization A joint effort Input from experiments, IT/CO, ETM (vendor) and DBAs Performances have been improved considerably Thorough stress testing Many lessons learned A few slides follow focusing on techniques that can be of interest to HEP application developers

PVSS and high insert performance Bulk insert Fastest insert method Direct path insert, i.e. insert /*+ append*/ .. Bypasses the database cache, very fast, does not see slowdown from cluster cache management It’s a bulk operation Implementation detail: data are buffered in temporary tables before ‘bulk’ insert The table is further partitioned The insert process is partition-aware Clients insert into a given partition

PVSS and large data volumes Very large amounts of data are created Data is inserted into an ‘active table’ at a given time When the table reaches a threshold it is archived A new active table is used. Space-speed tradeoff When insert rate is slow direct path insert leaves behind blocks with a lot of free space The outcome is space ‘wastage’

Space Management Compacting space Column order Archived tables can be recreated (‘alter table move’) to improve space filling Implementation note: an update on the metadata tables is needed when changing tablespace Extra gain by using oracle compressed table 3x to 10x improvement measured (Atlas) Column order Placing ‘null columns at the end’ saves space ~20% in some tests

Experimenting with IOTs Index Organized Tables Data is stored in a index Excellent clustering of data with the primary key i.e. retrieval by PK is optimized Saves space for the extra index compared to a ‘normal (heap) table The problem of space wastage is drastically reduced Can have performance problems if additional indexes are needed Direct path insert is not available max rate is about 80KHz (on quadcore machines) Currently work in progress (not ‘certified’ yet)..

High Availability PVSS can work using a filesystem based buffer This allows for the DB to be unavailable for short times Fits in the service failover-clustering approach to Oracle availability (RAC)

Resource Usage Examining PVSS write-only activity (100KHz): Notes: HW: 4-node RAC (dual single core Xeon) and ~32 SATA disks (4 arrays of 8 disks in RAID 10 configuration) Data flow: 8 MB/sec (i.e. about 1TB in a WE) 70 MB/sec writes to datafiles (of which 2/3 to undo segments!) 40 MB/sec redo writes Notes: PVSS is IO intensive, ultimately IO-bound Oracle needs quite a few CPU cycles for I/O ops With quadcore, 1 node has enough CPU for 100KHz (tested, with CPU at about 50%)

Query Tuning Running arbitrary queries on the system will not perform neither scale The queries have been defined and ‘packaged’ for many use cases Queries that retrieve data by PK are the most performing Additional indexes are possible but with a cost on insert Optionally the replicated offline schema can afford more/different indexing Agreement with the users community on the ‘allowed’ list of queries is fundamental

Conclusions PVSS Oracle archiver has been tuned to scale and perform at the peak insert rate values expected from LHC experiments Main lessons Tuning works best if you put together a team of developers, application owners and DB experts In addition, stress testing, and performance measuring is fundamental Users requirements need to be defined as soon as possible

Acknowledgments Many thanks to Gancho, Florbela, Jim (Atlas), Manuel, Wayne, and Piotr (IT-CO), Eric and Anton (IT-DES), Edward (ETM/ Siemens), Dawid, Eva, Jacek, Maria, Miguel (IT-DM).