Outperform the Competition with Azure SQL Data Warehouse

Slides:



Advertisements
Similar presentations
Information managers are seeking innovative DBMS’s which are able to handle large data volumes in new ways or to optimize existing products and processes.
Advertisements

Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
An Introduction To Big Data For The SQL Server DBA.
The Derivitec Risk Portal Provides Powerful, Cost-Effective Risk Management Solutions, Powered by Azure, that Deploy in Minutes MICROSOFT AZURE ISV PROFILE:
Business Insights Play briefing deck.
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
4/18/2018 6:56 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Platform and Analytics Foundational Training
Data Platform Modernization
UnixBench System Score
System Center Marketing
System Center Marketing
Business Critical Application Platform
SQL 2016 new Hosting Offers Secure Database Hybrid HyperScale
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
Trial.iO Makes it Easy to Provision Software Trials, Demos and Training Environments in the Azure Cloud in One Click, Without Any IT Involvement MICROSOFT.
Example of a page header
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Azure Hybrid Use Benefit Overview
Navision Business Analytics
Hosted on Azure, LoginRadius’ Customer Identity
Business Critical Application Platform
Exploring Azure Event Grid
A developers guide to Azure SQL Data Warehouse
Built on the Powerful Microsoft Azure Platform, Lievestro Delivers Care Information, Capacity Management Solutions to Hospitals, Medical Field MICROSOFT.
Azure SQL Data Warehouse for SQL Server DBAS
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
Azure SQL Data Warehouse Scaling: Configuration and Guidance
Take Control of Insurance Product Management: Build, Test, and Launch Any Product Globally 10x Faster, 10x More Cheaply with INSTANDA on Azure Partner.
SQL Server 2012 Licensing Overview.
Welcome! Power BI User Group (PUG)
FACTON Provides Businesses with a Cloud Solution That Elevates Enterprise Product Cost Management to a New Level Using the Power of Microsoft Azure MICROSOFT.
What is the Azure SQL Datawarehouse?
What Azure have to offer for your data
Yellowfin: An Azure-Compatible Business Intelligence Platform That Connects People with Their Data for Better Decision Making MICROSOFT AZURE APP BUILDER.
Scalable SoftNAS Cloud Protects Customers’ Mission-Critical Data in the Cloud with a Highly Available, Flexible Solution for Microsoft Azure MICROSOFT.
Case Study Modernizing an Operational Data Architecture
Data Platform Modernization
Azure SQL Data Warehouse Performance Tuning
Through the Microsoft Azure Platform, TARGIT Decision Suite Enables Organizations to Analyze Critical Data, Giving Them the Courage to Act MICROSOFT AZURE.
Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix.
I-POWER JAPAN Gives Small Businesses the Ability to Get Their Work Done from Anywhere, Even a Construction Site, by Using Microsoft Azure MICROSOFT AZURE.
Azure SQL Data Warehouse for SQL Server DBAS
Server & Tools Business
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Data Security for Microsoft Azure
Excelian Grid as a Service Offers Compute Power for a Variety of Scenarios, with Infrastructure on Microsoft Azure and Costs Aligned to Actual Use MICROSOFT.
Welcome! Power BI User Group (PUG)
Unitrends Enterprise Backup Solution Offers Backup and Recovery of Data in the Microsoft Azure Cloud for Better Protection of Virtual and Physical Systems.
MyCloudIT Enables Partners to Drive Their Cloud Profitability Using CSP-Enabled Desktop Hosting Automation with Microsoft Azure and Office 365 MICROSOFT.
Partner Logo Azure Provides a Secure, Scalable Platform for ScheduleMe, an App That Enables Easy Meeting Scheduling with People Outside of Your Company.
Azure's Performance, Scalability, SQL Servers Automate Real Time Data Transfer at Low Cost MINI-CASE STUDY “Azure offers high performance, scalable, and.
20 Questions with Azure SQL Data Warehouse
MARMIND’s New Service Delivers a Single Centralized Marketing Plan That Connects Teams, Campaigns and Outcomes by Using the Power of the Azure Platform.
Azure SQL DWH: Optimization
Appcelerator Arrow: Build APIs in Minutes. Connect to Any Data Source
XtremeData on the Microsoft Azure Cloud Platform:
Quasardb Is a Fast, Reliable, and Highly Scalable Application Database, Built on Microsoft Azure and Designed Not to Buckle Under Demand MICROSOFT AZURE.
Context about the Data Warehouse
Guarantee Hyper-V, System Center Performance and Autoscale to Microsoft Azure with Application Performance Control System from VMTurbo MICROSOFT AZURE.
Power BI with Analysis Services
Introduction to Dataflows in Power BI
Let’s Build a Tabular Model in Azure
Moving your on-prem data warehouse to cloud. What are your options?
Introduction to Azure Data Lake
Comparison of commercial cloud implementations
The Database World of Azure
Presentation transcript:

Outperform the Competition with Azure SQL Data Warehouse Bob Rubocki – Practice Manager, BI Architect March 12, 2019

Agenda Azure SQL DW product overview Cloud Scale Analytics Market GigaOm benchmark study, product comparison What’s new with Azure SQL Data Warehouse Demo

Bob Rubocki Practice Manager & BI Architect, Pragmatic Works brubocki@pragmaticworks.com linkedin.com/in/robertrubocki @BobRubocki bobrubocki.wordpress.com

Azure SQL DW – Massive Parallel Processing Compute nodes are separate from data storage Client tools/apps connect to control node, just like SQL Server Scale up/down to add/remove compute nodes Pause compute when not in use ($) Control node distributes query to compute nodes and distributions Compute nodes read from blob storage, send results to control node

Why Azure SQL DW? Massive data scale Designed for analytic and aggregate query loads

Market Features Cloud based Relational (SQL) Structured and semi-structured data Scale out architecture Columnar compression

The Competitors

Industry-leading price performance You can download the report as well as sign up to use ADW for free at the Azure.com link above https://azure.microsoft.com/en-us/services/sql-data-warehouse/compare/

Background TPC-H decision support benchmark TPC - Transactional Processing Performance Council Benchmarks originally created to standardize OLTP testing OLTP testing grew from ATM testing http://www.tpc.org/ http://www.tpc.org/information/about/history.asp

TPC Members

Methodology Based on TPC-H benchmark 22 queries Schema design Each query executed 3 times, fastest time used for results. Queries to execute ~30 TB data set Test Environments  Comparable performance tiers BigQuery not configurable Azure SQL DW Amazon Redshift Snowflake Google BigQuery 3 1

Pricing Summary Report (TPC-H Q1) Performance Showing Query 1 as a sample of the output format, and an example where Microsoft Azure SQL DW outperformed the competition. Azure SQL DW performed faster than all competitors for TPC-H Query 1

Performance Summary

Shipping Priority (TPC-H Q3) Performance One Amazon Redshift tier performed best with Query 3

Global Sales Opportunity (TPC-H Query 22) Performance Snowflake outperformed Azure SQL DW and Amazon Redshift on TPC-H Query 22. (Subqueries)

Customer Distribution (TPC-H Query 13) - Performance The 1 of 66 queries where Google BigQuery outperformed Azure SQL DW

Price Per Performance Total duration of 22 test queries Cost of operating service for that duration BigQuery charges by data volume processed, not by time https://gigaom.com/report/data-warehouse-cloud-benchmark/#post-id-959633

Azure SQL DW vs Amazon Redshift

Azure SQL DW vs Snowflake

Azure SQL DW vs Google BigQuery https://azure.microsoft.com/en-us/services/sql-data- warehouse/compare/

What’s New In Azure SQL DW Azure SQL DW Gen 2 released April, 2018 Includes new, more powerful Azure hardware Addresses challenges with I/O operations on remote storage New “optimized for compute” SKUs

Adaptive Caching New Azure hardware Compute nodes include NVMe solid state disks (Non-Volatile Memory Express) Based on query history and patterns, algorithm determines column store data likely to be used in queries, caches data on SSD on compute node Queries satisfied with data in cache do NOT read from remote blob storage Faster query performance

Adaptive Caching

Max Concurrent Query Limit 32 Gen 1 (pre-Gen 2) 128 Gen 2 Gen 1 (post-Gen 2)

Additional Performance Tiers Gen 1 – max 6,000 DWU Gen 2 – max 30,000 DWU Gen 2 – new lower priced tiers (DW100c, DW200c, DW300c, DW400c, DW500c) Gen 2 pricing originally started at DW1000c (more expensive to get started with Gen 2)

DEMO

Azure SQL Data Warehouse (ADW) Developer productivity Industry-leading security Intelligent workload management Data flexibility Best in class price-performance Here are the 5 reasons that we think ADW provides compelling business value: First, as you build your cloud analytics solution, you can use the similar set of widely available developer skills & tools that you are using to manage on-prem SQL environments. The fact that ADW is a managed Azure cloud service means that you can set up your DW environments in minutes Second, while the Azure platform leads the industry on security & compliance certifications, ADW’s built in granular security at the row and column level means that you can make the power of your analytics solution available to a wider set of users (e.g. geo diverse employees and vendors) , without the need to create multiple copies of the DW. Third, the most significant piece of innovation has been to separate compute & storage that allows customers to not only control costs, but also finely align the workloads with performance. This means that for the time that you don’t run the DW, you don’t pay for the compute resources…just the storage. This also allowed us to provide high performance storage cache close to compute, thereby driving the major performance improvements over the last several quarters. ADW further supports business agility by allowing you to define workload prioritization…so that the most business critical workloads and requests can take priority on the DW resources (enabled via defining workload classification and importance) Fourth, ADW is one single solution that lets to work with a variety of datatypes, and seamlessly works with a number of first-party Azure and ISV partner services for ingestion, transformation, modeling, and serving of data Last, and the most important piece is the continued market leadership of ADW around – both on price, as well as raw performance and price per performance ============ Azure SQL Data Warehouse Azure SQL Data Warehouse storage is separate from the compute Data Warehouse Unit (DWU). This enables Azure SQL Data Warehouse to scale columnar storage capacity and compute resources independently. This capability adjusts to various workload demands, offering potential cost savings when demand is low. Azure SQL Data Warehouse can pause and resume compute billing, where only storage is billed during the paused time. Azure SQL Data Warehouse achieves good balance in both configurability and simplicity, in a way that is both easy to administer and flexible in handling almost any usage pattern. Azure SQL Data Warehouse is fully ANSI-SQL compliant and users familiar with SQL Server will be very comfortable using this environment. Azure SQL Data Warehouse can export data to a local file the same way an on-premises SQL Server can, e.g., via the SQL Server Import and Export Wizard. Although concurrency was not tested in the benchmark, Azure SQL Data Warehouse supports 128 concurrent queries. This is many more than BigQuery, which supports a maximum concurrency of 50 per project. Snowflake’s maximum concurrency is difficult to calculate because it is a function of the number of queries, the submitted queries’ execution plan, the size of the warehouse, and the maximum number of multi-cluster setting. In our experience we saw an X-Large (16 node) Snowflake warehouse run 6 concurrent simple scan queries (SELECT with a single column filter WHERE clause) before starting to queue. Thus, if we set the maximum multi-clusters at 5, we would likely hit a max concurrency of 30 in that scenario. Your results may vary. Enterprise class application lifecycle management Defense-in-depth security and 99.9% financially backed availability SLA  Separation of compute and storage Prioritize resources for the most valuable workloads Query directly over the Data Lake Support for structured and semi-structured data Up to 94% less expensive than competitors

We Can Help! Pragmatic Works can help you migrate or manage your data warehouse environment in Azure.   Respond YES to the exit survey for more information.

Thanks! GigaOm Analyst Report - https://gigaom.com/report/data-warehouse-cloud-benchmark/ TPC-H Benchmark spec - http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.17.3.pdf Microsoft Azure SQL DW Comparison - https://azure.microsoft.com/en-us/services/sql-data-warehouse/compare/ Loading NYC Taxi data to Azure SQL DW - https://docs.microsoft.com/en-us/azure/sql-data-warehouse/load-data-from-azure-blob-storage-using-polybase