Buying Database Hardware Adam Backman – President White Star Software, LLC.

Slides:



Advertisements
Similar presentations
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Advertisements

Buying Database Hardware Adam Backman – President White Star Software, LLC.
Ted Krueger SQL Server MVP Data Architect Building a SQL Server Test Lab.
Copyright © 2006 Quest Software SQL 2005 Disk I/O Performance By Bryan Oliver SQL Server Domain Expert.
T OP N P ERFORMANCE T IPS Adam Backman Partner, White Star Software.
MUNIS Platform Migration Project WELCOME. Agenda Introductions Tyler Cloud Overview Munis New Features Questions.
Database Storage for Dummies Adam Backman President – White Star Software, LLC.
Chapter 4 Infrastructure as a Service (IaaS)
Numbers, We don’t need no stinkin’ numbers Adam Backman Vice President DBAppraise, Llc.
Recovery Planning A Holistic View Adam Backman, President White Star Software
SAM SPENCER Server Virtualization. Agenda Introduction History Server Virtualization Software Server Virtualization Hardware Determining Server Hardware.
Take your CMS to the cloud to lighten the load Brett Pollak Campus Web Office UC San Diego.
10 REASONS Why it makes a good option for your DB IN-MEMORY DATABASES Presenter #10: Robert Vitolo.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
1 Magnetic Disks 1956: IBM (RAMAC) first disk drive 5 Mb – Mb/in $/year 9 Kb/sec 1980: SEAGATE first 5.25’’ disk drive 5 Mb – 1.96 Mb/in2 625.
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Empowering Business in Real Time. © Copyright 2009, OSIsoft Inc. All rights Reserved. Virtualization and HA PI Systems: Three strategies to keep your PI.
Adam Duffy Edina Public Schools.  The heart of virtualization is the “virtual machine” (VM), a tightly isolated software container with an operating.
Ken Birman. Massive data centers We’ve discussed the emergence of massive data centers associated with web applications and cloud computing Generally.
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
Reliability Week 11 - Lecture 2. What do we mean by reliability? Correctness – system/application does what it has to do correctly. Availability – Be.
High Performance Computing Course Notes High Performance Storage.
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Storage System: RAID Questions answered in this lecture: What is RAID? How does one trade-off between: performance, capacity, and reliability? What is.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Module 3 - Storage MIS5122: Enterprise Architecture for IT Auditors.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Effectively Explaining the Cloud to Your Colleagues.
Evolved Virtual Data Center Reserved Resource Pools.
Redundant Array of Independent Disks
Database Storage Considerations Adam Backman White Star Software DB-05:
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Best Practices for Backup in SAN/NAS Environments Jeff Wells.
Top 10 Performance Hints Adam Backman White Star Software
Virtualization for Storage Efficiency and Centralized Management Genevieve Sullivan Hewlett-Packard
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
LOGO Service and network administration Storage Virtualization.
Session objectives Discuss whether or not virtualization makes sense for Exchange 2013 Describe supportability of virtualization features Explain sizing.
1 U.S. Department of the Interior U.S. Geological Survey Contractor for the USGS at the EROS Data Center EDC CR1 Storage Architecture August 2003 Ken Gacke.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
DONE-08 Sizing and Performance Tuning N-Tier Applications Mike Furgal Performance Manager Progress Software
IST Storage & Backup Group 2011 Jack Shnell Supervisor Joe Silva Senior Storage Administrator Dennis Leong.
VMware vSphere Configuration and Management v6
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Using Progress® Analytical Tools Adam Backman White Star Software DONE-05:
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
Component 8/Unit 9aHealth IT Workforce Curriculum Version 1.0 Fall Installation and Maintenance of Health IT Systems Unit 9a Creating Fault Tolerant.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Infrastructure for the DBA: An Introduction Peter Shore SQL Saturday Chicago 2016.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
PHD Virtual Technologies “Reader’s Choice” Preferred product.
Integrating Disk into Backup for Faster Restores
Adam Backman Chief Cat Wrangler – White Star Software
Managing Multi-User Databases
Operating System Structure
Customer Profile (Target)
CS 295: Modern Systems Organizing Storage Devices
Presentation transcript:

Buying Database Hardware Adam Backman – President White Star Software, LLC.

About the speaker President – White Star Software A leader in the Progress OpenEdge services sector for over 30 years Vice President – DBAppraise Managed database services backed up by experienced Progress OpenEdge professionals Author – Progress Software’s Expert Series Over 25 years of Progress OpenEdge experience – Support – Training – Consulting (Database and System configuration, management and tuning)

Agenda Understanding the process Understanding system resources Picking the right vendor Where to spend your money – CPU fast vs. many – Memory – can you ever have too much – Disk – where all the data starts – Network and other parts of the system Conclusion

Understanding the Process Know where you are now Important portions of the application – Core OLTP (call center, orders, shipping) – Subset of daily management tasks – Subset of weekly/monthly tasks Know your direction Plan your direction

Know where you are now Know your model of operation CPU centric – More application logic – More calculations Data centric – Request centric – MRP, Accounting inventory

Have data for comparison Timings – Just time critical items with a watch Frequently used functions Time critical portions of the application Performance Information – System level – SAR, iostat, vmstat – Progress Level – ProTop, promon, custom VST programs

Understanding system resources Supported architectures Understand your options Performance tradeoffs

Architectures supported by OpenEdge Database engine – Database with no portion of the application Host-based system – Database, clients and background all on one system Pure client/server – Database on one machine and clients on other machines n-tier architecture – Database and background on Machine A – AppServers on Machine B – Clients on individual machines

Understand your options Single large system vs. 2 or more smaller machines Virtualization Single platform or multi-platform Cloud vendors SAN vs. Direct attached storage Network considerations

Single large machine vs. 2 or more smaller machines Single large machine – Pros Highest potential performance by eliminating network layer Easier to manage as everything is in one place – Cons A single machine will have limited scalability Usually two mid-range systems are more cost effective than a single high-end system Potential license cost issues (CPU-Based pricing)

Single large machine vs. 2 or more smaller machines (cont.) Multi-machine – Pros Flexibility – ability to repurpose machines Scalability – ability to add additional machines to solution Recoverability – ability to use AppServer machine as the database engine – Cons Cost – duplication of items, power, maintnenace Adding network layer can hurt performance Management – more machines to manage Maintenance – more things to break

Purchase guidance Databases tend to use disk extensively – Spend on disk subsystem – Allow for a minimum of 10% of the database size for database buffers (-B memory) – Do not forget other memory allocations OS buffers can be reduced to 10% or less of total memory Applications are memory and CPU intensive – Generally better to buy fewer faster cores – Memory can greatly reduce I/O via –B -Bp -Bt, -mmax, … Examine your use cases for the machine and buy with both primary use and most likely alternative uses in mind

Purchase guidance Most people over spend on CPU You can have all the CPU in the world but it will do you no good unless you can get data to them efficiently People should focus on the performance “food chain” – Network – Disk – Memory – CPU Slower resources should be addressed before faster resources

Virtualization A thousand ways to go wrong and very few ways to do it right Everyone is doing it but why? – Ability to build new environments – Ability to recover quickly (part of a DR solution) – Potential for better resource saturation (unused CPU) Why not? – Performance – Complexity – Cost (VMWare is not free – More applications affected by an outage

Options: N-tier option Database engine – Fast Disk – Moderate memory (over 10% of DB + OS and extras) – Relatively little CPU AppServer machine – Internal disk – setup well but not crazy – Higher memory usage – CPU intensive Client machine – Web/Mobile – Desktops – Citrix/Windows terminal server

Cloud: Make it someone else’s problem

Cloud Watch for variable performance – Measure throughput (Disk and memory) – Measure compute capacity – Measure at different days/times Performance guaranty from vendor Iops/sec. vs. perception (real measurements)

Why is disk important CPU capacity doubles every 18 months Network bandwidth doubling every 12 months Memory is 37,000+ times faster than disk Disk (per disk I/O rate) fairly static (150 – 200 iops/sec.) Storage will generally cost more than servers and this is particularly true for database servers

Buy better storage Many disks – 150 iops/sec. per disk – Look at you buffer hit rate and total request load – Don’t forget temporary file I/O which can account for a significant percentage of your total I/O load Larger cache – Some systems require you to expand cache when you expand your storage but most don’t – Adding cache is akin to adding database buffers to a database SSD – save money buy fewer devices – SSDs are a real solution now and prices are competitive though not cheap when compared to conventional storage on a per GB basis

Do better disk configuration Still no RAID 5, No RAID S, No RAID 6, No RAID 7 RAID 10 still king for database storage – really there are a bunch of really cool stats to prove this out Large stripe widths – Performance improved with stripe width through 2MB Use best portion of rotating disk (rotating rust) – Using outer edge of disk will provide the best performance which may be as much as 15% better vs. inner portion of disk Even usage across all disks – Eliminate disk variance – Think of ALL sources of I/O (DB, BI, AI, Temp files, OS, …)

Storage Direct attached – Less expensive in most cases – Less complex – Single machine tuning OS and Array – High performance – Disks dedicated SAN – generalized business storage NAS – file optimized storage SAN – Purpose-built high performance Why SAN twice? There is a huge difference in SANs and you need to buy for your need not for their marketing

Direct-attached storage Pros – Not shared with other hosts (isolation is bliss) – Easier problem resolution – Massive controller throughput for little money – Cheaper to maintain Cons – Not shared with other hosts (no cost sharing)

SAN: Generalized business storage Pros – Best option in virtualized environment – Share one powerful storage system with many hosts – One stop storage system for all hosts Cons – High initial cost – Single point of failure unless array mirroring/clustering is in place – Not optimized to individual tasks – Complex

SAN: Purpose-built Pros: – Excellent performance – Additional control at array level – Massively scalable – Ability to dedicate resources to hosts – Reliable (fault tolerant) Cons – Single point of failure unless array mirroring is in place – Cost – Complexity

SAN monitoring More difficult as there are many moving parts Multiple hosts need to be monitored SAN needs to be monitored Monitoring data needs to be synchronized Work loads need to be balanced across hosts

NAS: file optimized storage Pros – Sharable across hosts – Generally cheaper than SAN – Good service for application files Cons – File optimized not block optimized – Not database optimized – Not client temporary file optimized

Network options Simple – Put a single quad card in the server and bind the ports for performance Moderate – Multiple cards bound with a two networks. One for Data and the other for client traffic Complex – Multiple machine – Multiple networks (vlan) – Dedicated networks for DB, replication, client traffic, AppServer

Network Try to use your network efficiently – -Mm 8192 to increase throughput – Remember to move to jumbo frames (client, server, switches, …) Move invasive processes to separate network – Backup – Replication – System syncronizations

Picking the right vendor – The less of two evils

Picking the right vendor Better support nearly always beats a better upfront price Look at quality of “local” support infrastructure – Response time (SLA) – In country – In the correct language Always comparison shop even if you “know” what you want – This keeps vendors honest – Choosing historic rivals helps drive down price Simplify to enhance support – Bundle Linux support under hardware contract – Single vendor simplicity

Paying for support Buy all support with the initial purchase Allows easier (capital) write-off Years 4+ of support can cost as much as the initial price if purchased later

Picking the wrong solution NetApp for database storage. Performance will be non-optimal (Performance will be HORRIBLE) NUMA Architecture – Good vendors make bad solutions – All CPUs allocated to a Progress domain must come from the same book/shelf/node – All Memory must meet the same criteria as CPU Using client/server for reporting – Kill the network access whenever possible – Use AppServer for complex OLTP

Where to spend your money Disks Storage SAN SSD Really, look at storage first then concern yourself with other trivial issues such as memory and CPU This is the problem over 9 out of 10 times

Questions, Comments, … 问题 Preguntas

THANK YOU Thank you for your time