Moving your on-prem data warehouse to cloud. What are your options?

Slides:



Advertisements
Similar presentations
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
Advertisements

Microsoft Ignite /16/2017 5:47 PM
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Introduction to Hadoop and HDFS
An Introduction to HDInsight June 27 th,
Modern Data Warehouse: Microsoft APS Alain Dormehl June 2015.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
By N.Gopinath AP/CSE.  The data warehouse architecture is based on a relational database management system server that functions as the central repository.
SQL Server 2016 New Innovations. Microsoft Data Platform Relational Beyond Relational On-premises Cloud Comprehensiv e Connected Choice SQL Server Azure.
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary.
SQL Server as a Cloud Service November 11th 2015 Warner Chaves SQL MVP/MCM.
An Introduction To Big Data For The SQL Server DBA.
Apache Hadoop on Windows Azure Avkash Chauhan
Redmond Protocols Plugfest 2016 Casey Karst PolyBase in SQL Server 2016.
SQL Server as a Cloud Service April 15th 2016 Warner Chaves Data Platform MVP/SQL Server MCM.
Cloud Database Platforms for the SQL DBA
BUILD BIG DATA ENTERPRISE SOLUTIONS FASTER ON AZURE HDINSIGHT
Connected Infrastructure
4/18/2018 6:56 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Platform and Analytics Foundational Training
Azure SQL Data Warehouse for Beginners
SAS users meeting in Halifax
Smart Building Solution
Hadoop and Analytics at CERN IT
Welcome! Power BI User Group (PUG)
Microsoft /2/2018 3:42 PM BRK3129 Query Big Data using the Expanded T-SQL footprint with PolyBase in SQL Server 2016 Casey Karst Program Manager.
Why Is My SQL DW Query Slow?
The Model Architecture with SQL and Polybase
Smart Building Solution
Example of a page header
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Connected Infrastructure
Cloudy with a Chance of Data
02 | Design and implement database
A developers guide to Azure SQL Data Warehouse
Azure SQL Data Warehouse for SQL Server DBAS
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
11/9/2018 5:08 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
What is the Azure SQL Datawarehouse?
Please support our sponsors
Azure SQL Data Warehouse Performance Tuning
Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix.
Azure SQL Data Warehouse for SQL Server DBAS
Microsoft Connect /22/2018 9:50 PM
Microsoft Connect /24/ :05 AM
MPP – Maximize Parallel Productivity
20 Questions with Azure SQL Data Warehouse
Power BI for large databases
Managing batch processing Transient Azure SQL Warehouse Resource
Overview of big data tools
Data analytics with Hadoop In the Microsoft Azure cloud
Azure Data Lake for First Time Swimmers
Microsoft Azure for SQL Server Professionals
Context about the Data Warehouse
Azure SQL DWH: Tips and Tricks for developers
Power BI with Analysis Services
Cloud Computing for Data Analysis Pig|Hive|Hbase|Zookeeper
Azure SQL DWH: Tips and Tricks for developers
Understanding Azure Data Engineering Options Finding Clarity in a Vast & Changing Landscape Cameron Snapp.
HDInsight & Power BI By Łukasz Gołębiewski.
Azure Data Storage Options
Big-Data Analytics with Azure HDInsight
Introduction to Azure Data Lake
Data Wrangling for ETL enthusiasts
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Architecture of modern data warehouse
The Database World of Azure
Presentation transcript:

Moving your on-prem data warehouse to cloud. What are your options? Asanka Padmakumara Moving your on-prem data warehouse to cloud. What are your options?

Who am I? Asanka Padmakumara Business Intelligence Consultant, More than 8 years in BI and Data Warehousing Blog: asankap.wordpress.com Linked In: linkedin.com/in/asankapadmakumara Twitter: @asanka_e Facebook: facebook.com/asankapk

Thanks to our Sponsors SILVER: PASS: VENUE: Global Alliance Partner Sri Lanka

Why you need to pay attention now? Cleaver

How do you define a DW? Central Organized Integrated Read Optimized Reporting/ Analysis SMP/MPP

Symmetric Multi Processing(SMP) Multiple processors, common OS and Memory Shared Everything Architecture Scale Up NUMA -> Dedicated Memory area for a processor NUMA-non-uniform memory access

Massively Parallel Processing (MPP) Multiple processors, own OS and Memory Shared Nothing Architecture Scale Out

SMP or MPP? SMP MPP Best suited for small to medium data sets Many small read/write operations Multiple row-by-row operations Best suited for big data, analytical, batch-oriented workloads Complex Queries If your data sizes already exceed 1 TB and are expected to continually grow

For your Data Warehousing needs SMP Azure SQL Database SQL Server in a VM MPP Azure Data Warehouse Apache Hive on HDInsight Interactive Query (Hive LL AP) on HDInsight symmetric multiprocessing massively parallel processing

Azure SQL Database SQL Database as a Service No CDC,CLR Primary file group only No database mail No cross data base queries Automatic Backup Managed Service No PolyBase Can’t pause Support In-Memory Dynamic Scalability SQL Database as a Service One or more databases in a logical sever Single/Elastic Pool/Managed Instance Max Size : 4 TB Pricing: DTU (Database Transaction Units) Premium 5 to 4000 DTUs

SQL Server in a VM SQL server installed and configured in a VM Can pause the VM High no of concurrent connections Support In-Memory Not a managed Service No Dynamic Scalability Seamless migration SQL server installed and configured in a VM Windows Server, Linux (Red Hat ,SUSE ,Ubuntu) MAX Size : 256 TB Pricing: VCore and RAM basic

Azure SQL DW PaaS One Control Node- Multiple Compute Nodes Distributed Computing Distributed Storage DWUs (Data Warehouse Units) Parallel Data Warehouse Distributed or replicated tables 60 distribution if 400DWUs- 4 compute nodes , each 15 Distributions – max 60 compute nodes Hash/Replicated/Round Robin Global temporary tables that begin with ## are not supported.

Azure Data Warehouse is not for Everyone Should I use Azure Data Warehouse? www.sqlchick.com/ No In-Memory No Real Time Limited Concurrency Data Encryption Managed Can pause Dynamic Scalability Faster Processing /Computation Melissa Coates No Primary key, Foreign keys, Unique

Demo Behavior of a Small DW in Azure SQL Database and Azure SQL Data Warehouse

Apache Hive on HDInsight Hadoop Data warehouse system Convert SQL like Queries (Hive QL) to MapReduce, Tez , or Spark jobs Batch Processing Works well with Semi-Structured Data Pausing -> Delete and re-create No Dynamic Scalability

Hive Interactive Query Also called Hive LLAP, or Live Long and Process In-Memory Caching Suite for real time analytics

I want to pause my resources when not using I have a real time reporting needs I want to manage my own server. I need to support a large number of concurrent users and connections . My data is mostly unstructured. Azure SQL Database Azure SQL Data warehouse SQL Server in a VM Apache Hive

Azure Data Lake Not a traditional DW, But … Highly scalable data storage and analytics service Largely intended for big data storage and analysis Consists of three services: Analytics Storage HDInsight (“managed clusters”) Azure Data Lake Analytics Azure Data Lake Storage

A DW solution using ADL Curated Data Staging Area Persisted(Raw) Data Azure Data Lake Store Staging Area Curated Data Persisted(Raw) Data Sandbox End user/Report authors Business Analysts (exploratory analysis) Azure Data Factory Azure Data Lake Analytics Azure Analysis Services Data Warehouse

Your Feedback is Important Paste Feedback QR Code here

Thanks to our Sponsors SILVER: PASS: VENUE: Global Alliance Partner Sri Lanka