Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005.

Slides:



Advertisements
Similar presentations
Tom Hamilton – America’s Channel Database CSE
Advertisements

13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.
MINERVA: an automated resource provisioning tool for large-scale storage systems G. Alvarez, E. Borowsky, S. Go, T. Romer, R. Becker-Szendy, R. Golding,
Module – 3 Data protection – raid
5 Copyright © 2005, Oracle. All rights reserved. Managing Database Storage Structures.
Introduction to DBA.
1 Magnetic Disks 1956: IBM (RAMAC) first disk drive 5 Mb – Mb/in $/year 9 Kb/sec 1980: SEAGATE first 5.25’’ disk drive 5 Mb – 1.96 Mb/in2 625.
Automatic Storage Management The New Best Practice Steve Adams Ixora Rich Long Oracle Corporation Session id:
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Allocation Methods - Contiguous
MOST COMMON ORACLE SYSTEM-LEVEL TUNING OPPORTUNITIES MARCH2009 Jeff Lippe Master Solution Architect.
5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model.
OS and Hardware Tuning. Tuning Considerations Hardware  Storage subsystem Configuring the disk array Using the controller cache  Components upgrades.
Simplify your Job – Automatic Storage Management Angelo Session id:
© 2009 Oracle Corporation. S : Slash Storage Costs with Oracle Automatic Storage Management Ara Vagharshakian ASM Product Manager – Oracle Product.
Chapter 4 Physical Database Layouts Database Processing Chapter 4.
Backup & Recovery Concepts for Oracle Database
High Availability & Oracle RAC 18 Aug 2005 John Sheaffer Platform Solution Specialist
Scaling Oracle Spatial using Real Application Cluster! John Abel Executive Consultant.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
Database Storage Considerations Adam Backman White Star Software DB-05:
Oracle Storage Overview Tomáš Vencelík – Storage sales leader.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Oracle on Windows Server Introduction to Oracle10g on Microsoft Windows Server.
Luca Canali, CERN Dawid Wojcik, CERN
Andrew Mendelsohn Senior Vice President Database Oracle Corporation.
1 Robert Wijnbelt Health Check your Database A Performance Tuning Methodology.
7202ICT Database Administration Lecture 7 Managing Database Storage Part 2 Orale Concept Manuel Chapter 3 & 4.
CERN - IT Department CH-1211 Genève 23 Switzerland t Tier0 database extensions and multi-core/64 bit studies Maria Girone, CERN IT-PSS LCG.
Virtualization for Storage Efficiency and Centralized Management Genevieve Sullivan Hewlett-Packard
Designing and Deploying a Scalable EPM Solution Ken Toole Platform Test Manager MS Project Microsoft.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Log-structured Memory for DRAM-based Storage Stephen Rumble, John Ousterhout Center for Future Architectures Research Storage3.2: Architectures.
5 Copyright © 2005, Oracle. All rights reserved. Managing Database Storage Structures.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
ASM Configuration Review Luca Canali, CERN-IT Distributed Database Operations Workshop CERN, November 26 th, 2009.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
6 Copyright © 2007, Oracle. All rights reserved. Managing Database Storage Structures.
CERN Database Services for the LHC Computing Grid Maria Girone, CERN.
Oracle 10g Automatic Storage Management Overview of ASM as a Storage Option for Oracle 10g.
1© Copyright 2012 EMC Corporation. All rights reserved. EMC VNX5700, EMC FAST Cache, SQL Server AlwaysOn Availability Groups Strategic Solutions Engineering.
Oracle Database Architecture By Ayesha Manzer. Automatic Storage Management Spreads database data across all disks Creates and maintains a storage grid.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Page 1 Mass Storage 성능 분석 강사 : 이 경근 대리 HPCS/SDO/MC.
Scalable Oracle 10g for the Physics Database Services Luca Canali, CERN IT January, 2006.
Database CNAF Barbara Martelli Rome, April 4 st 2006.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
2 Copyright © 2006, Oracle. All rights reserved. RAC and Shared Storage.
Indexing strategies and good physical designs for performance tuning Kenneth Ureña /SpanishPASSVC.
Monitoring Storage Systems for Oracle Enterprise Manager 12c
Sarah Diesburg Operating Systems COP 4610
Flash Storage 101 Revolutionizing Databases
Diskpool and cloud storage benchmarks used in IT-DSS
Maximum Availability Architecture Enterprise Technology Centre.
Scalable Database Services for Physics: Oracle 10g RAC on Linux
Monitoring Storage Systems for Oracle Enterprise Manager 12c
Oracle Database Monitoring and beyond
Oracle Storage Performance Studies
Case studies – Atlas and PVSS Oracle archiver
ASM-based storage to scale out the Database Services for Physics
Data Orgnization Frequently accessed data on the same storage device?
How to Thrive as a DBA in an Oracle10g World
Mark Zbikowski and Gary Kimura
Scalable Database Services for Physics: Oracle 10g RAC on Linux
Sarah Diesburg Operating Systems CS 3430
Presentation transcript:

Scalable Storage Configuration for the Physics Database Services Luca Canali, CERN IT LCG Database Deployment and Persistency Workshop October, 2005

Database Workshop, October 2005L.Canali, CERN2Outline In this talk I will discussIn this talk I will discuss –Storage configuration for scalable database services Main challenges Best practices –An implementation of scalable storage Impacts on DB logical to physical mapping Performance and resource allocations –How we can help you to size new database projects or to scale up existing applications Performance testing Benchmark data

Database Workshop, October 2005L.Canali, CERN3 Oracle Scalable Architecture Goal: A database infrastructure that provides the required system resources to the end-users and applications. How: A modular architecture that can scale up to a large number of components

Database Workshop, October 2005L.Canali, CERN4 RAID 1: HA Storage MirroringMirroring –2-Way mirroring (RAID 1) protects against single point of failures –Can be used to redistribute I/O load (performance)

Database Workshop, October 2005L.Canali, CERN5 RAID 0: Scalable Performances RAID 0 (Striping) automatically redistributes files across multiple disks.RAID 0 (Striping) automatically redistributes files across multiple disks. Performance and scalability are increasedPerformance and scalability are increased Error resiliency is decreasedError resiliency is decreased

Database Workshop, October 2005L.Canali, CERN6 Mechanical and Geometrical constraints The external part of the disk providesThe external part of the disk provides –More throughput –Less latency

Database Workshop, October 2005L.Canali, CERN7 S.A.M.E Strategy Goal: optimize storage I/O utilizationGoal: optimize storage I/O utilization S.A.M.E. (Stripe And Mirror Everything) StrategyS.A.M.E. (Stripe And Mirror Everything) Strategy –Built on the concepts of RAID –Proposed by J. Loaiza (Oracle) in 1999 –Replaces old recipes: manual balancing across volumes Need a Software or Hardware Volume ManagerNeed a Software or Hardware Volume Manager –ASM is Oracles solution with 10g S.A.M.E. out of the box –Other solutions available from different vendors require configuration

Database Workshop, October 2005L.Canali, CERN8 Storage Configuration Guidelines Use all available disk drivesUse all available disk drives Place frequently used data at outer half of diskPlace frequently used data at outer half of disk –Fastest transfer rate –Minimize seek time Stripe data at 1MB extentsStripe data at 1MB extents –Distribute the workload across disks –Eliminate hot spots –Optimum sequential bandwidth gained with1MB I/O Stripe redo logs across multiple drivesStripe redo logs across multiple drives –Maximize write throughput for small writes –Smaller stripe size (128KB) and/or dedicated disks Use cache on the controllerUse cache on the controller –Write-back cache –Battery-backed cache

Database Workshop, October 2005L.Canali, CERN9 Oracles ASM Main Features Mirror protection: –2-way and 3-way mirroring available. –Mirror on a per-file basis –Can mirror across storage arrays Data striping across the volume: –1MB and 128KB stripes available Supports clustering and single instance Dynamic data distribution –A solution to avoid hot spots –On-line add/drop disk with minimal data relocation –Automatic database file management Database File System with performance of RAW I/O

Database Workshop, October 2005L.Canali, CERN10 ASMs Configuration – Examples ASM is a volume manager, its output are disk groups (DG) that Oracle databases can mount to allocate their filesASM is a volume manager, its output are disk groups (DG) that Oracle databases can mount to allocate their files RECOVERY-DGDATA-DG Config 1: Disk groups created with dedicated disks Config 2: Disk groups created by horizontal slicing

Database Workshop, October 2005L.Canali, CERN11 Proposed Storage Configuration Proposed storage configuration:Proposed storage configuration: –High availability - Allows backups to disk –High performance - Allows clusterware mirroring (10.2) –DBs have dedicated resources Storage Arrays Oracle RAC Nodes Data DG-2Recovery DG-1Data DG-1Recovery DG-2 Disk Groups (ASM) DB N.1DB N.2

Database Workshop, October 2005L.Canali, CERN12 FAQ 1: Datafiles Do I need to worry on the number and names of the datafiles allocated for each tablespace?Do I need to worry on the number and names of the datafiles allocated for each tablespace? Traditional storage allocation across multiple volumes:Traditional storage allocation across multiple volumes: –Requires a careful allocation of multiple datafiles across logical volumes and/or filesystems –Datafile-to-filesystem and filesystem-to-physical storage mappings have to be frequently tuned S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disksS.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disks –There is NO NEED, for performance reasons, to allocate multiple datafiles per tablespace. –10g new feature bigfile tablespace allows for tablespaces with a single datafile that can grow up to 32 TB (db_block_size=8k)

Database Workshop, October 2005L.Canali, CERN13 FAQ 2: Data and Index Tablespaces Do I need dedicated tablespaces for indexes and tables?Do I need dedicated tablespaces for indexes and tables? Separation of indexes and tables has often been advised to:Separation of indexes and tables has often been advised to: –Distribute I/O –Reduce fragmentation –Allow separate backup of tables and indexes S.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disksS.A.M.E. storage, such as Oracle ASM, provides balanced I/O access across disks –No performance gains are expected by using dedicated tablespaces for INDEXes and TABLEs. Additional Notes:Additional Notes: –Tablespaces fragmentation has little impact when using locally managed TBSs and automatic segment space management (9i and 10g) –Very large database can profit from using multiple tablespaces for admin purposes and logical separation of objects

Database Workshop, October 2005L.Canali, CERN14 FAQ 3: Sizing Storage Storage sizing for database should not take size as the first requirementStorage sizing for database should not take size as the first requirement –Bandwidth and performance metrics are bound to the number of disk spindles –Magnetic HD technology has improved the GByte/$ ratio –The rest of HD technology has not seen much improvements in the last 5 years (since 15K rpms HDs) Sizing for storage requirements shouldSizing for storage requirements should –Be based on stress test measurements –Past performance measurements on comparable systems –New projects can leverage benchmark data. Extra HD space is not wastedExtra HD space is not wasted –Can be used to strengthen the B&R policy with Disk Backups

Database Workshop, October 2005L.Canali, CERN15 IO Benchmark Measurements Benchmark data can be used forBenchmark data can be used for –Sizing of new projects and upgrades –Performance baseline, testing for new hardware The following metrics have been measured:The following metrics have been measured: –Sequential throughput (full scans) –Random access (indexed access) –I/O per second (indexed access) –Metrics are measured as a function of workload Other test detailsOther test details –Benchmark tool: Oracles ORION –Infortrend Storage array: 16 SATA x 400 GB disks, 1 controller and 1 GB cache

Database Workshop, October 2005L.Canali, CERN16 IO Benchmark Data Throughput: + 25%

Database Workshop, October 2005L.Canali, CERN17 IO Benchmark Data Latency: - 35%

Database Workshop, October 2005L.Canali, CERN18 IO Benchmark Data I/O /sec: +60%

Database Workshop, October 2005L.Canali, CERN19Conclusions The Database Services for Physics can provideThe Database Services for Physics can provide –Scalable Database services Scalable on CPU and Memory resources Scalable on Storage resources –Sizing for new projects or upgrades Stress/Performance testing Integration and Validation Testing Benchmark data for capacity planning

Database Workshop, October 2005L.Canali, CERN20 Additional Benchmark Data

Database Workshop, October 2005L.Canali, CERN21 Additional Benchmark Data

Database Workshop, October 2005L.Canali, CERN22 Additional Benchmark Data