Experience in running relational databases on clustered storage CERN, IT Department CHEP 2015, Okinawa, Japan 13/04/2015.

Slides:



Advertisements
Similar presentations
Storage-Virtualisierung: NetApp arbeitet für 2. Und für die Hälfte. Helmut Putzenlechner Systems Engineer NetApp Austria GmbH.
Advertisements

Tom Hamilton – America’s Channel Database CSE
© 2006 DataCore Software Corp SANmotion New: Simple and Painless Data Migration for Windows Systems Note: Must be displayed using PowerPoint Slideshow.
Tag line, tag line Provisioning Manager 4.0 Customer Strategic Presentation March 2010.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. Provisioning Storage for Oracle Database with ZFS and NetApp Mike Carew Oracle University.
Evolution of database services
Managing storage requirements in VMware Environments October 2009.
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
Updates from Database Services at CERN Andrei Dumitru CERN IT Department / Database Services.
On behalf DBoD team, IT Department HEPIX 2014 Nebraska Union, University of Nebraska – Lincoln, USA.
NetApp The perfect fit for Virtual Desktop Infrastructure
Experience in running relational databases on clustered storage On behalf IT-DB storage team, IT Department HEPIX 2014.
IBM® Spectrum Storage Virtualize™ V V7000 Unified in a nutshell
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP StorageWorks LeftHand update Marcus.
Backup and restore of Oracle databases: introducing a disk layer
CERN IT Department CH-1211 Geneva 23 Switzerland t CERN IT Department CH-1211 Geneva 23 Switzerland t
NetApp Rapid Cloning Utility (RCU) Internal Training August, 2009
Upgrading the Platform - How to Get There!
IBM Storwize v3700 More performance. More efficiency. No compromises.
© 2009 Oracle Corporation. S : Slash Storage Costs with Oracle Automatic Storage Management Ara Vagharshakian ASM Product Manager – Oracle Product.
1© Copyright 2013 EMC Corporation. All rights reserved. EMC XtremSW Cache Performance. Intelligence. Protection.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
1© Copyright 2013 EMC Corporation. All rights reserved. EMC and Microsoft SharePoint Server Performance Name Title Date.
CERN IT Department CH-1211 Genève 23 Switzerland t Data Protection with Oracle Data Guard Jacek Wojcieszuk, CERN/IT-DM Distributed Database.
SnapManager 2.0 for Virtual Infrastructure Product Manager Lisa Haut-Mikkelsen Technical Marketing Engineer Amrita Das.
1 © Copyright 2009 EMC Corporation. All rights reserved. Agenda Storing More Efficiently  Storage Consolidation  Tiered Storage  Storing More Intelligently.
DBI336. Applications Microsoft Data ONTAP ™ Powershell Toolkit for Integration and Automation FlexClone ® for Rapid Provisioning ApplianceWatch for Health.
Experience and Lessons learnt from running High Availability Databases on Network Attached Storage Ruben Gaspar Manuel Guijarro et al IT/DES.
Database storage at CERN
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
Introducing Snap Server™ 700i Series. 2 Introducing the Snap Server 700i series Hardware −iSCSI storage appliances with mid-market features −1U 19” rack-mount.
CERN IT Department CH-1211 Geneva 23 Switzerland t Experience with NetApp at CERN IT/DB Giacomo Tenaglia on behalf of Eric Grancher Ruben.
XenDesktop Built on FlexPod Flexible IT Infrastructure for Desktop Virtualization.
IT Infrastructure Chap 1: Definition
Content from Analyst Day November 2004 NY Pressures on IT Groups Do more with less Keep running globally 24x7 Adapt rapidly to changes Maximize value.
1 © 2010 Overland Storage, Inc. © 2012 Overland Storage, Inc. Overland Storage The Storage Conundrum Neil Cogger Pre-Sales Manager.
Building the Infrastructure Grid: Architecture, Design & Deployment Logan McLeod – Database Technology Strategist.
VMware View built on FlexPod Flexible IT Infrastructure for Desktop Virtualization.
CERN - IT Department CH-1211 Genève 23 Switzerland t Experience and Lessons learnt from running High Availability Databases on Network Attached.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
Engr. Gideon Fatunmbi 05/08/2015 FAS2200 Series Customer Presentation.
Microsoft Azure Storage. Networking Compute Storage Virtual Machine Operating System Applications Data & Access Runtime Provision.
1 Data Guard. 2 Data Guard Reasons for Deployment  Site Failures  Power failure  Air conditioning failure  Flooding  Fire  Storm damage  Hurricane.
CERN IT Department CH-1211 Geneva 23 Switzerland t IT/DB Tests and evolution SSD as flash cache.
WS2012 File and Storage Services Management Name Jeff Alexander Technical Evangelist – Windows Infrastructure Microsoft Australia
VMware vSphere Configuration and Management v6
1© Copyright 2012 EMC Corporation. All rights reserved. EMC VNX5700, EMC FAST Cache, SQL Server AlwaysOn Availability Groups Strategic Solutions Engineering.
Paul Allen (Perforce) Agnes Jacob (NetApp) P4 Flex 1.
Paul Allen (Perforce) Agnes Jacob (NetApp) P4 Flex 1.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
CERN IT Department CH-1211 Genève 23 Switzerland t Storage Overview and IT-DM Lessons Learned Luca Canali, IT-DM DM Group Meeting
To provide the world with a next generation storage platform for unstructured data, enabling deployment of mobile applications, virtualization solutions,
Workshop sullo Storage da Small Office a Enterprise Class Presentato da:
Scalable Oracle 10g for the Physics Database Services Luca Canali, CERN IT January, 2006.
Database CNAF Barbara Martelli Rome, April 4 st 2006.
PIC port d’informació científica Luis Diaz (PIC) ‏ Databases services at PIC: review and plans.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
NetApp: for the Cloud All Flash SDN © 2015 NetApp, Inc. All rights reserved. NetApp Confidential – Limited Use 1.
REMINDER Check in on the COLLABORATE mobile app Best Practices for Oracle on VMware - Deep Dive Darryl Smith Chief Database Architect Distinguished Engineer.
3 Copyright © 2006, Oracle. All rights reserved. Installation and Administration Basics.
Considerations and Benefits for Archive and Compliance Deploying Enterprise Vault on NetApp Storage 1.
PHD Virtual Technologies “Reader’s Choice” Preferred product.
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Kako sa novim tehnologijama graditi datacentar budućnosti?
Real IBM C exam questions and answers
Oracle Storage Performance Studies
IBM Tivoli Storage Manager
DataOptimizer Transparent File Tiering for NetApp Storage Robert Graf
Presentation transcript:

Experience in running relational databases on clustered storage CERN, IT Department CHEP 2015, Okinawa, Japan 13/04/2015

Agenda 3 Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions

CERN’s Databases ~100 Oracle databases, most of them RAC Mostly NAS storage plus some SAN with ASM ~600 TB of data files for production DBs in total Using a variety of Oracle technologies: Active Data Guard, Golden Gate, Clusterware, etc. Examples of critical production DBs: LHC logging database ~250 TB, expected growth up to ~90 TB / year 13 production experiments’ databases ~15-25 TB in each Read-only copies (Active Data Guard) Database on Demand (DBoD) single instances 172 MySQL Open community databases (5.6.17) 19 PostgreSQL databases (9.2.9) 9 Oracle11g databases ( ) 4

A few 7-mode concepts 5 Private network FlexVolume Remote Lan Manager Service Processor Rapid RAID Recovery Maintenance center (at least 2 spares) raid_dp or raid4 raid.scrub.schedule raid.media_scrub.rat e once weekly constantly reallocate Thin provisioning File access Block access NFS, CIFSFC,FCoE, iSCSI client access Independent HA pairs

Private network Cluster interconnect Cluster mgmt network A few C-mode concepts cluster node shell systemshell C-mode cluster ring show RDB: vifmgr + bcomd + vldb + mgmt Vserver (protected via Snapmirror) Global namespace Logging files from the controller no longer accessible by simple NFS export client access 6 Cluster should never stop serving data

Agenda 7 Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions

8 NAS evolution at CERN (last 8 years) FAS3000 FAS6200& FAS % FC disks Flash pool/cache = 100% SATA disk + SSD DS14 mk4 FC DS4246 6gbps 2gbps Data ONTAP® 7-mode Data ONTAP® Clustered-Mode scaling up scaling out

Network architecture 9 Bare metal server 2x10GbE Public Network Private Network 10GbE trunking 1GbE 10 GbE Just cabling of first element of each type is shown cabled Each switch is in fact a set of switches (4 in our latest setup) managed as one by HP Intelligent Resilient Framework (IRF) ALL our databases run with same network architecture NFSv3 is used for data access Cluster interconnect Cluster mgmt network Storage network mtu 1500 mtu 9000

Disk shelf cabling: SAS 10 Owned by 1 st Controller Owned by 2 nd Controller SAS loop at 6gpbs 12gbps per stack due to multi-pathing ~3GB/s per controller SSD

Mount options Oracle and MySQL are well documented Mount Options for Oracle files when used with NFS on NAS devices (Doc ID ) Best Practices for Oracle Databases on NetApp Storage, TR What are the mount options for databases on NetApp NFS? KB ID: PostgreSQL not popular with NFS, though it works well if properly configured MTU 9000, reliable NFS stack e.g. NetApp NFS server implementation Do not underestimate impact 11

12 After setting new mount points options (peaks due to autovacuum):

Mount options: database layout 13 global namespace Oracle RAC, cluster database: MySQL and PostgreSQL single instance

Agenda 14 Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions

Flash Technologies 15 Depending where SSD are located. Controllers → Flash Cache Disk shelf → Flash Pool Flash pool (hybrid aggregates) based on a Heat Map in order to decide which block stays and for how long Sequential data is not cached ( > 16KB). Data can not be pinned Works on random reads and writes workloads Writes (μs) warm up cache much faster (ms) Data is not sensible to cluster takeover/givebacks → it reduces warm-up period Flash Cache Flash Pool

Agenda 16 Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions

17 Backup management using snapshots Backup workflow: mysql> FLUSH TABLES WITH READ LOCK; mysql> FLUSH LOGS; or Oracle>alter database begin backup; Or Postgresql> SELECT pg_start_backup('$SNAP'); mysql> UNLOCK TABLES; Or Oracle>alter database end backup; or Postgresql> SELECT pg_stop_backup(), pg_create_restore_point('$SNAP'); snapshot resume … some time later new snapshot

Snapshots for Backup and Recovery 18 Storage-based technology Strategy independent of the RDBMS technology in use Speed-up of backups/restores: from hours/days to seconds SnapRestore requires a separate license API can be used by any application, not just RDBMS Consistency should be managed by the application 8 secs Oracle ADCR: 29TB size, ~ 10 TB archivelogs/day Backup & Recovery API Alert log:

Cloning of RDBMS 19 Based on snapshot technology (FlexClone) on the storage. Requires license. FlexClone is a snapshot with a RW layer on top Space efficient: at first blocks are shared with parent file system We have developed our own API, RDBMS independent Archive logs are required to make the database consistent Solution being developed initially for MySQL and PostgreSQL on our DBoD service. Many use cases: Check application upgrade, database version upgrade, general testing … Check state of your data on a snapshot (backup) Both clone and parent present similar performance

Cloning of RDBMS (II) 20

Agenda 21 Brief introduction Our setup Caching technologies Snapshots Data motion, compression & dedup Conclusions

Vol move Powerful feature: rebalancing, interventions,… whole volume granularity Transparent but watch-out on high IO (writes) volumes Based on SnapMirror technology rac50::> vol move start -vserver vs1rac50 -volume movemetest -destination-aggregate aggr1_rac5071 -cutover- window 45 -cutover-attempts 3 -cutover-action defer_on_failure Example vol move command:

Compression & deduplication 23 Mainly used for Read Only data and our backup to disk solution (Oracle) It is transparent to applications NetApp compression provides similar gains as Oracle12c low compression level. It may vary depending on datasets compression ratio Savings due to compression and dedup: 682TB Total Space used: 641TB ~51.5% savings

Conclusions 24 Positive experience so far running on C-mode Data safety features (raid_dp, scrubbing, checksum,…) has been proven to be very reliable but bugs may be encountered, relying on e.g. checksums at the application layer when available is advisable. Mid to high end NetApp NAS provide good performance using the flash pool SSD caching solution Design of stacks and network access require careful planning Cluster resilience has being proven in a number of planned interventions and unplanned incidents Online interventions are key for critical services Good contacts with vendor specialists has been proven to be very effective Flexibility with clustered ONTAP, helps to reduce the investment Same infrastructure used to provide iSCSI object storage via CINDER New service functionality being built based on storage features

Questions 25

Flash Technologies 26 Depending where SSD are located. Controllers → Flash Cache Disk shelf → Flash Pool Flash pool based on a Heat Map Flash Cache Flash Pool Write to disk read overwrite Eviction scanner Insert into SSD read write Every 60 secs & SSD consumption > 75% hot warm neutral cold evict cold neutral

Flash pool + Oracle directNFS Oracle12c, enable dNFS by: $ORACLE_HOME/rdbms/lib/make -f ins_rdbms.mk dnfs_on