ASM-based storage to scale out the Database Services for Physics
Database Service Meeting - April 11th, 2006
Luca Canali, CERN
Outline
- Storage for Oracle 10g
- A solution based on cost/performance
- Some performance measurements and benchmarks
- ASM at CERN, lessons learned
Architectural Goal
- High-end performance, scalability and HA at low cost
- Conventional approach -> scale up; grid-like approach (RAC + ASM) -> scale out
Storage Solutions for RAC
- RAC implements 'shared everything' clustering
- Common storage solutions for RAC:
  - High-end SAN (ex: EMC Symmetrix, Hitachi)
  - Specialized NAS (ex: NetApp filer)
  - SAN + low-cost storage + ASM:
    - Infortrend storage arrays with FC controllers and SATA disks
    - QLogic HBAs and FC switches
- Less common solutions:
  - Direct-attached storage
  - Solid state disks
Automatic Storage Management (ASM)
- ASM is a volume manager and cluster filesystem for Oracle DB files
  - Raw IO, direct IO, asynchronous IO
- Implements S.A.M.E. (stripe and mirror everything)
- 'The software glue' to scale out for performance and HA
- Online storage reconfiguration (ex: in case of disk failure)
- ASM 'filesystems' -> disk groups (ex: DiskGrp1, DiskGrp2), with striping across all disks and mirroring across failure groups
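As an illustration of how such a disk group is defined, a minimal sketch in SQL (disk group name, failure group names and raw device paths are hypothetical, not our actual configuration):

  -- Normal-redundancy disk group: ASM stripes extents across all disks
  -- and mirrors each extent across the two failure groups
  CREATE DISKGROUP DiskGrp1 NORMAL REDUNDANCY
    FAILGROUP fgrp1 DISK '/dev/raw/raw1', '/dev/raw/raw2'
    FAILGROUP fgrp2 DISK '/dev/raw/raw3', '/dev/raw/raw4';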
Performance, Capacity and Cost
- ASM proven to 'scale out' low-cost storage:
  - I/Os per second: tests at CERN showed good scalability up to the maximum tested (64 HDs), ~100 IOPS per disk (SATA disks, small random IO)
  - Sequential throughput: scales out, but limited by the fabric to 2 Gbps per HBA; tests on a 4-node RAC at CERN -> ~800 MB/s for sequential read
  - High capacity: leverages SATA disks (typical DB size 5-10 TB)
- Comparison with the top performer, solid state disks (SSD):
  - SSD has the highest performance: ~100K IOPS, latency < 1 ms
  - BUT cost/capacity (SSD vs. SATA) > 1000, while cost/IOPS ~ 1
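A rough consistency check of these figures (assuming ~100 IOPS per SATA disk for small random IO and ~200 MB/s of usable payload per 2 Gb/s FC link):

\[ 64 \text{ disks} \times \sim 100 \text{ IOPS/disk} \approx 6\,400 \text{ random IOPS} \]
\[ 4 \text{ nodes} \times \sim 200 \text{ MB/s per 2 Gb/s HBA} \approx 800 \text{ MB/s sequential read} \]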
Orion 10.2, Sequential IO (RO)
- Bottleneck: disk array controller = 2 Gb/s (from other tests, HBA = 2 + 2 Gb/s)
Sequential IO measured with SQL
- 4 disk arrays = 4 x 2 Gb/s (measured with parallel query)
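A sketch of the kind of query used to drive sequential reads from all nodes (table name and parallel degree are illustrative, not the exact test setup):

  -- Parallel full scan to generate large sequential reads across the cluster
  SELECT /*+ FULL(t) PARALLEL(t, 16) */ COUNT(*) FROM big_table t;

  -- Aggregate read volume per instance can then be derived from the IO statistics
  SELECT inst_id, value FROM GV$SYSSTAT WHERE name = 'physical read total bytes';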
Orion 10.2, Small Random IO (RO)
Small Random IO, with SQL
- 8,675 IOPS total, ~135 IOPS per disk (uniformly distributed)
- Extra tuning: equivalent to using only the outer edges of the disks
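One way to verify that the IO load is indeed spread uniformly across the disks is to compare per-disk counters in the ASM views; a sketch (column aliases are illustrative):

  -- Per-disk read/write counters; with S.A.M.E. they should be roughly equal
  SELECT dg.name AS diskgroup, d.name AS disk, d.reads, d.writes
  FROM   v$asm_disk d, v$asm_diskgroup dg
  WHERE  d.group_number = dg.group_number
  ORDER  BY dg.name, d.name;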
Lessons Learned
- Performance: ASM and ASMLib scale out, tested with 64 HDs. IO is spread uniformly across disks and mirror pairs for HA and performance (no data yet on write activity, but no issues expected).
- Administration:
  - We experienced a few stability issues with 10.1, but all were fixed in 10.2. No pending issues with ASM.
  - 'ASM DBAs' need storage admin and sysadmin skills. Ex: we configure HBA multipathing, LUN mapping and SAN switch zoning ourselves.
  - We don't use the arrays' RAID, but deploy a custom ASM config using JBOD (HA and performance in return for the added complexity); a sketch of a typical failed-disk replacement follows below.
  - Single disk failure rate so far as expected: MTBF = 60 years.
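A minimal sketch of the failed-disk replacement mentioned above (disk group, failure group, disk name and device path are hypothetical):

  -- Drop the failed disk; ASM rebalances its data onto the surviving disks online
  ALTER DISKGROUP DiskGrp1 DROP DISK DGRP1_0007 FORCE;
  -- After the physical swap, add the new disk back into the same failure group
  ALTER DISKGROUP DiskGrp1 ADD FAILGROUP fgrp2 DISK '/dev/raw/raw8' NAME DGRP1_0007;
  -- Optionally raise the rebalance speed to finish the resync sooner
  ALTER DISKGROUP DiskGrp1 REBALANCE POWER 5;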
Conclusions
- Storage for RAC on Linux at PSS successfully consolidated using Infortrend disk arrays and Oracle ASM.
- Scalability and performance tests are positive.
- Extra effort for administration:
  - Specialized operations required mainly during installation and replacement of failed disks
  - DBAs occasionally need to wear the storage admin hat
- More info on the wiki: https://twiki.cern.ch/twiki/bin/view/PSSGroup/HAandPerf