Flash Storage 101 Revolutionizing Databases

Slides:



Advertisements
Similar presentations
Performance Tuning Methods Author: Vladimir Andreev Semantec GmbH Lector: Stoyan Ivanov Semantec Bulgaria OOD Semantec GmbH Benzstr. 32 D Herrenberg,
Advertisements

Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Adam Jorgensen Pragmatic Works Performance Optimization in SQL Server Analysis Services 2008.
RAID Redundant Array of Independent Disks
Virtualisation From the Bottom Up From storage to application.
Big Data Working with Terabytes in SQL Server Andrew Novick
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
Automatic Storage Management The New Best Practice Steve Adams Ixora Rich Long Oracle Corporation Session id:
Chapter 8 Operating System Support
5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model.
OS and Hardware Tuning. Tuning Considerations Hardware  Storage subsystem Configuring the disk array Using the controller cache  Components upgrades.
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
Russ Houberg Senior Technical Architect, MCM KnowledgeLake, Inc.
SQL Server 2008 & Solid State Drives Jon Reade SQL Server Consultant SQL Server 2008 MCITP, MCTS Co-founder SQLServerClub.com, SSC
Convergence /20/2017 © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
1 CSE544 Database Architecture Tuesday, February 1 st, 2011 Slides courtesy of Magda Balazinska.
Module 8: Server Management. Overview Server-level and instance-level resources such as memory and processes Database-level resources such as logical.
Troubleshooting SQL Server Enterprise Geodatabase Performance Issues
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
1 Lecture 8: Secondary-Storage Structure 2 Disk Architecture Cylinder Track SectorDisk head rpm.
Introduction. Outline What is database tuning What is changing The trends that impact database systems and their applications What is NOT changing The.
© Pearson Education Limited, Chapter 16 Physical Database Design – Step 7 (Monitor and Tune the Operational System) Transparencies.
1 Moshe Shadmon ScaleDB Scaling MySQL in the Cloud.
SESSION CODE: BIE07-INT Eric Kraemer Senior Program Manager Microsoft Corporation.
Designing and Deploying a Scalable EPM Solution Ken Toole Platform Test Manager MS Project Microsoft.
SQL Server 2014: Overview Phil ssistalk.com.
Remote Site C Pilot Scheduler Pilots and Pilot Schedulers Jobs Statistics Production Dashboard Dynamic Data Movement Monitor Panda Server (Apache) Development.
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Page 1 Mass Storage 성능 분석 강사 : 이 경근 대리 HPCS/SDO/MC.
Best Practices in Loading Large Datasets Asanka Padmakumara (BSc,MCTS) SQL Server Sri Lanka User Group Meeting Oct 2013.
AA202: Performance Enhancers for Laserfiche Connie Anderson, Technical Writer.
Violin Memory Inc. Proprietary 1 Bottlenecks SQL Server on flash storage Matt Henderson Consulting Engineer.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Indexing strategies and good physical designs for performance tuning Kenneth Ureña /SpanishPASSVC.
sponsored by HP Enterprise
Flash Storage 101 Matt Henderson.
Understanding and Improving Server Performance
Processes and threads.
Process Management Process Concept Why only the global variables?
Multiple Platters.
RAID, Programmed I/O, Interrupt Driven I/O, DMA, Operating System
Using non-volatile memory (NVDIMM-N) as block storage in Windows Server 2016 Tobias Klima Program Manager.
Data Warehouse in the Cloud – Marketing or Reality?
HPE Persistent Memory Microsoft Ignite 2017
The Client/Server Database Environment
Building Modern Transaction Systems on SQL Server
Software Architecture in Practice
Storage Virtualization
TCB 9 RK/MSFT Benchmark Results
Oracle Storage Performance Studies
Parallel Database Maintenance with 24/7 Systems and Huge DBs
RAID RAID Mukesh N Tekwani
April 30th – Scheduling / parallel
Real world In-Memory OLTP
Proving Hardware Bottlenecks &
Physical Database Design
Ch 4. The Evolution of Analytic Scalability
Data Orgnization Frequently accessed data on the same storage device?
Mid Term review CSC345.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
Specialized Cloud Architectures
RAID RAID Mukesh N Tekwani April 23, 2019
Chapter 3: Processes Process Concept Process Scheduling
Presentation transcript:

Flash Storage 101 Revolutionizing Databases Matt Henderson Consulting Engineer

Inside the Server 10-40% utilization Many threads Many cores Many time slices 10-40% utilization Why is the CPU not at 100%? -> Resource constrained

SQL Server Process Management One SQL scheduler per logical core SQL scheduler time-slices between users

Queues Running Waiting Runnable Currently executing process Waiting for a resource (IO, network, locks, latches, etc) Runnable Resource ready, waiting to get on CPU

Storage Latency Latency issues multiplied by the number of cores in the system

Over 50 IO in the same amount of time Visualizing Latency Block of data requested HDD Storage 8ms latency 8ms Block of data returned Time I/O Bound Apps Oracle DB2 SQL Server Etc… Total latency = seek time + rotational latency Flash Storage 0.15ms (150 microsecond) latency Over 50 IO in the same amount of time Time

Legacy Architecture Aggregation & Segregation Model Cost Inefficient Many autonomous parts Separate RAID groups Data locality Hot spots Transient data / usage Tiering software / admin Cost Inefficient Each workload bottlenecked by a different subset of components Flash != Flash SSDs are a modernization – disk substitution Distributed block all-flash is a revolution One 64k inbound write is split into 16 separate 4k packets Each 4k packet has parity calculated creating a total of 5k of data to store The 5k final packet is split into five 1k packet and one is sent to each of the five VIMMS in a RAID group

Distributed Block Flash VIMM – custom built flash storage cards and controllers Packets striped across VIMM’s flash wafers at the bit level Each VIMM chip splits a 1k packet to the bit level and stores them all in parallel to the many flash chip layers comprising each nand chip 12 RAID groups 5 VIMMS each

Massive Parallelism Batches split into 4k blocks All blocks split over VIMMS VIMMS parallelize to the bit Memory-like addressing 12 RAID groups 5 VIMM cards per group One 64k inbound write is split into 16 separate 4k packets Each 4k packet has parity calculated creating a total of 5k of data to store The 5k final packet is split into five 1k packet and one is sent to each of the five VIMMS in a RAID group

Distributed Block Flash for Databases Simplicity Less to design, plan. Easy to scale. No performance variance Faster, Quicker, More. Speed and Scale Reduced admin time No LUN/stripe mapping Chasing down / resolving performance issues Chores and tasks run faster ETL’s Backup/Restores – reduced RTO No index rebuilding for logical fragmentation Reduced admin costs / issues No costly tiering/locality management software No quarterly new-spindle purchases Turn asynch mirroring into synch mirroring Less hardware to do the same work (context switches) Drop from Enterprise to Standard edition SQL Server

Modern = Simplicity Traditional Architecture Aggregation & Segregation Violin Memory Storage Distributed Block

NTFS Tuning – Need several LUNs

SQL Tuning – IDENTITY Column & Clustered Indexes As PK = OK As Clustered index = NOT OK Huddles data onto one data page Bottlenecks on inserts Removes locks/latches as source of bottleneck

SQL Tuning – In and Outs Out In Any number of files, partitions Many files for speed / data placement Many partitions for speed / data placement Index defragging to re-order data MAXDOP <= 4 Admin in serial mode (one at a time) Table loads (ETL) Index maintenance Backups Archiving Admin only during off hours Segregated workloads In Any number of files, partitions MAXDOP = number of cores in server Admin in parallel Admin during business processing Mixed workloads (many servers & DB’s on same storage device)