Azure SQL DWH: Tips and Tricks for developers

Slides:



Advertisements
Similar presentations
Dos and don’ts of Columnstore indexes The basis of xVelocity in-memory technology What’s it all about The compression methods (RLE / Dictionary encoding)
Advertisements

SQL Server Integration Services (SSIS) Presented by Tarek Ghazali IT Technical Specialist Microsoft SQL Server (MVP) Microsoft Certified Technology Specialist.
Windows Azure Tour Benjamin Day Benjamin Day Consulting, Inc.
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
What’s new in Tabular 2016? Polonychko Yevgen. SQLSat Kyiv Team Vitaliy Popovych Mykola Pobyivovk Yevhen Nedashkivskyi Olena Smoliak Oksana Borysenko.
Review DirectQuery in SSAS 2016, best practices and use cases
A deep dive into Azure AD B2C
Service Broker in action
Cloud BI with Azure Analysis Services
Azure SQL Data Warehouse for Beginners
5/9/2018 7:28 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
Operational Analytics in SQL Server 2016 and Azure SQL Database
Microsoft /2/2018 3:42 PM BRK3129 Query Big Data using the Expanded T-SQL footprint with PolyBase in SQL Server 2016 Casey Karst Program Manager.
Cloud BI with Azure Analysis Services
Why Is My SQL DW Query Slow?
Microsoft /23/2018 1:11 AM BRK3180 Migrate CRM OnPremise organizations to CRM Online cloud using Dynamics Lifecycle Services (LCS) Aditya Varma Ganapathy.
Performing a Seamless Migration in Azure SQL DB
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
7/22/2018 9:21 PM BRK3270 Building a Better Data Solution: Microsoft SQL Server and Azure Data Services Joey D’Antoni Principal Consultant Denny Cherry.
Microsoft Ignite /6/2018 3:11 PM THR3055
Encryption in SQL Server
Design Seamless Upgrades to SQL Server 2016 with Query Store
TechEd /13/2018 7:46 PM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Machine Learning, Analytics, & Data Science Conference
A developers guide to Azure SQL Data Warehouse
Azure SQL Data Warehouse for SQL Server DBAS
SSAS Tabular Toolbelt Sergiy Lunyakin.
Azure SQL Data Warehouse Scaling: Configuration and Guidance
Data Science that’s scale
Analytics for Apps: Landing and Loading Data into SQL Data Warehouse
Introducing the SQL Server 2016 Query Store
What is the Azure SQL Datawarehouse?
Statistics for beginners
Please support our sponsors
Azure SQL Data Warehouse Performance Tuning
Introduction to AWS Redshift
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Cloud BI with Azure Analysis Services
Azure SQL Data Warehouse for SQL Server DBAS
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
BRK2279 Real-World Data Movement and Orchestration Patterns using Azure Data Factory Jason Horner, Attunix Cathrine Wilhelmsen, Inmeta -
A developers guide to Azure SQL Data Warehouse
Azure SQL DWH: Tips and Tricks for developers
MPP – Maximize Parallel Productivity
20 Questions with Azure SQL Data Warehouse
Cloud BI with Azure Analysis Services
Explore the Azure Cosmos DB with .NET Core 2.0
Azure SQL DWH: Optimization
Managing batch processing Transient Azure SQL Warehouse Resource
SQL Server Performance Tuning Nowadays
TechEd /15/2019 8:08 PM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
What query folding means to self-service BI projects
Context about the Data Warehouse
Azure Data Factory v2: What’s new?
Azure SQL DWH: Tips and Tricks for developers
SQL Database on IoT devices could you? should you? would you?
Power BI with Analysis Services
Fewer cursors since SQL Server 2012 Came Along
4/11/2019 6:29 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Azure SQL DWH: Tips and Tricks for developers
Azure Machine Learning on Databricks
ETL Patterns in the Cloud with Azure Data Factory
Get data insights faster with Data Wrangling
Using Columnstore indexes in Azure DevOps Services. Lessons learned
Using Columnstore indexes in Azure DevOps Services. Lessons learned
Cloud BI with Azure Analysis Services
SQL Like Languages in Azure IoT
Visual Data Flows – Azure Data Factory v2
Visual Data Flows – Azure Data Factory v2
Architecture of modern data warehouse
Presentation transcript:

Azure SQL DWH: Tips and Tricks for developers Sergiy Lunyakin, ITMagination Azure SQL DWH: Tips and Tricks for developers

SQLSat Kyiv Team Yevhen Nedashkivskyi Alesya Zhuk Eugene Polonichko Oksana Borysenko Mykola Pobyivovk Oksana Tkach

Sponsor Sessions Starts at 13:10 Don’t miss them, they might be providing some interesting and valuable information! Room A Room B Room C 13:00 - 13:20 DevArt Microsoft Eleks 13:20 - 13:50 DB Best Intapp DataArt NULL means no session in that room at that time 

Our Awesome Sponsors

Session will begin very soon :) Please complete the evaluation form from your pocket after the session. Your feedback will help us to improve future conferences and speakers will appreciate your feedback! Enjoy the conference!

About me DWH/BI Consultunt at ITMagination Data Platform MVP, MCSE BI, MCSA Cloud Platform Leader of Speaker at SQLSaturdays Organizer of SQLSaturday Lwow Contacts: sergey.lunyakin@gmail.com @slunyakin

Agenda What is Azure SQL DW Architecture of Azure SQL DW Limitations Check compatibility Handling cross-database query Handling Identity Handling ANSI Update/Delete/Merge/SCD Handling Compute columns Handling Cursor

What is Azure SQL DW Microsoft Azure Platform as a Service It’s a Massively Parallel Processing system (MPP) Distributed Compute and Distributed Storage Scale up and down in couple minutes Pause compute resources Supports a subset of T-SQL Join with external data in Azure Blob Storage

Architecture of Azure SQL DW Dist_DB_1 Dist_DB_2 Dist_DB_15 Dist_DB_16 Dist_DB_17 Dist_DB_30 Dist_DB_46 Dist_DB_47 Dist_DB_60 … … … … … …

Logical Overview Control Compute Storage Microsoft Build 2016 11/30/2018 2:09 PM Logical Overview Compute Control Storage © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Distributed queries Query Result Control Compute Storage Microsoft Build 2016 11/30/2018 2:09 PM Distributed queries Query Result Control Compute Storage Scale-out distributed query engine © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Distributed Query SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT SUM(*) FROM dbo.[FactInternetSales] ; Control Compute SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ;

Limitations Primary/Foreign Keys Identity Computed Columns Triggers Cross-database joins Sequences Cursors MERGE ANSI joins on updates/deletes More limitations: https://azure.microsoft.com/en-us/documentation/articles/sql-data-warehouse-migrate-code/ https://feedback.azure.com/forums/307516-sql-data-warehouse

Check compatibility Data warehouse migration utility Free tool Helps to identify unsupported features Helps to identify HASH distribution column Migrate scheama Migrate data (BCP tool)

Cross-database query Azure SQL DW doesn’t support cross-database query. Use ELT approach. Separate schemas. Use External tables as staging tables.

CTAS CTAS is super-charched version of SELECT...INTO Parallelized Better for Data import Data copy Workarounds CREATE TABLE [dbo].[FactInternetSales_new] WITH ( DISTRIBUTION = ROUND_ROBIN , CLUSTERED COLUMNSTORE INDEX ) AS SELECT * FROM [dbo].[FactInternetSales];

Identity Handle it on source side IDENTITY property Explicit import Doesn’t support CTAS Custom Identity with ROW_NUMBER

ANSI JOINS Update/Del/Merge Update/Delete doesn’t support JOINS in FROM Use CTAS for preparing interim table with JOINS Use CTAS for Merge workaround Split Merge to operation steps and use UNION ALL Use interim table for big number of steps Use partitioning for big tables, don’t reload the whole table

Compute columns Handle it in source system Use CTAS during import Create a View Use explicit data type and nullability check Wrong data during migration Schema error during partition switch

Cursor WHILE for lopping List of elements as a table Loop through list using While clause and variable

Summary MPP PaaS Service in Azure Cloud Storing and processing huge amount of structure data Limitation: Identity, ANSI JOINS, MERGE CTAS - Super-charched version of SELECT...INTO CTAS good way for workarounds Better reload data with CTAS than Row-By-Row operations

Question?

Our Awesome Sponsors