Published by Kerrie Sims. Modified over 5 years ago.
1
Moving your on-prem data warehouse to the cloud. What are your options?
Asanka Padmakumara
2
Who am I?
Asanka Padmakumara
Business Intelligence Consultant with more than 8 years in BI and data warehousing
Blog: asankap.wordpress.com
LinkedIn: linkedin.com/in/asankapadmakumara
Facebook: facebook.com/asankapk
3
Thanks to our Sponsors SILVER: PASS: VENUE: Global Alliance Partner
Sri Lanka
4
Why do you need to pay attention now?
Cleaver
5
How do you define a DW?
Central, organized, integrated, read-optimized
Reporting/Analysis
SMP/MPP
6
Symmetric Multi-Processing (SMP)
Multiple processors, common OS and memory
Shared-everything architecture
Scale up
NUMA (non-uniform memory access): a dedicated memory area for each processor
7
Massively Parallel Processing (MPP)
Multiple processors, each with its own OS and memory
Shared-nothing architecture
Scale out
8
SMP or MPP?
SMP: best suited for small to medium data sets; many small read/write operations; multiple row-by-row operations
MPP: best suited for big data, analytical, batch-oriented workloads; complex queries; data sizes that already exceed 1 TB and are expected to grow continually
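The rule of thumb on this slide can be sketched as a small Python helper. The function name `suggest_architecture`, its parameters, and the order in which the criteria are checked are assumptions made for illustration; only the 1 TB figure and the workload traits come from the slide.

```python
def suggest_architecture(data_size_tb, expected_to_grow, many_small_writes):
    """Return "SMP" or "MPP" using the slide's rule of thumb.

    Hypothetical helper: the parameter names and the order of the
    checks are illustrative assumptions, not an official sizing rule.
    """
    # Many small read/write or row-by-row operations favor SMP.
    if many_small_writes:
        return "SMP"
    # Data already above 1 TB and still growing favors MPP.
    if data_size_tb > 1 and expected_to_grow:
        return "MPP"
    # Small to medium data sets default to SMP.
    return "SMP"
```

For example, `suggest_architecture(5, True, False)` returns `"MPP"`, while the same data size with a row-by-row workload returns `"SMP"`.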
9
For your Data Warehousing needs
SMP (symmetric multiprocessing): Azure SQL Database, SQL Server in a VM
MPP (massively parallel processing): Azure SQL Data Warehouse, Apache Hive on HDInsight, Interactive Query (Hive LLAP) on HDInsight
10
Azure SQL Database
SQL Database as a Service
One or more databases in a logical server
Single / Elastic Pool / Managed Instance
Max size: 4 TB
Pricing: DTU (Database Transaction Units); Premium 5 to 4000 DTUs
Strengths: automatic backup; managed service; supports In-Memory; dynamic scalability
Limitations: no CDC or CLR; primary filegroup only; no Database Mail; no cross-database queries; no PolyBase; can't pause
11
SQL Server in a VM
SQL Server installed and configured in a VM
Windows Server or Linux (Red Hat, SUSE, Ubuntu)
Max size: 256 TB
Pricing: based on vCores and RAM
Strengths: can pause the VM; high number of concurrent connections; supports In-Memory; seamless migration
Limitations: not a managed service; no dynamic scalability
12
Azure SQL DW
PaaS
One control node, multiple compute nodes
Distributed computing, distributed storage
DWUs (Data Warehouse Units)
Parallel Data Warehouse
Distributed or replicated tables: Hash / Replicated / Round Robin
60 distributions: at 400 DWUs there are 4 compute nodes with 15 distributions each; maximum of 60 compute nodes
Global temporary tables (names that begin with ##) are not supported.
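The distribution arithmetic on this slide (60 distributions spread over the compute nodes) and the hash and round-robin placement options can be sketched in Python. This is a minimal illustration, not the service's implementation: the function names are hypothetical, the even split across nodes is assumed, and CRC32 merely stands in for whatever hash function the service actually uses.

```python
import zlib

TOTAL_DISTRIBUTIONS = 60  # fixed number of distributions, per the slide


def distributions_per_node(compute_nodes):
    """Distributions hosted on each compute node, assuming an even split."""
    if compute_nodes < 1 or TOTAL_DISTRIBUTIONS % compute_nodes != 0:
        raise ValueError("compute_nodes must evenly divide 60")
    return TOTAL_DISTRIBUTIONS // compute_nodes


def hash_distribution(key):
    """Map a distribution-column value to one of the 60 distributions.

    CRC32 is a stand-in (assumption); the real hash function is not
    described in the slides.
    """
    return zlib.crc32(str(key).encode("utf-8")) % TOTAL_DISTRIBUTIONS


def round_robin_distribution(row_number):
    """Round-robin placement: rows cycle through the distributions in order."""
    return row_number % TOTAL_DISTRIBUTIONS
```

With 4 compute nodes, `distributions_per_node(4)` gives 15, matching the 400 DWU example. Hash placement always sends the same key to the same distribution, while round-robin spreads rows evenly regardless of their values.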
13
Azure Data Warehouse is not for Everyone
Should I use Azure Data Warehouse?
Strengths: data encryption; managed; can pause; dynamic scalability; faster processing/computation
Limitations: no In-Memory; no real time; limited concurrency; no primary key, foreign key, or unique constraints
Melissa Coates
14
Demo
Behavior of a small DW in Azure SQL Database and Azure SQL Data Warehouse
15
Apache Hive on HDInsight
Hadoop data warehouse system
Converts SQL-like queries (HiveQL) to MapReduce, Tez, or Spark jobs
Batch processing
Works well with semi-structured data
Pausing: delete and re-create the cluster
No dynamic scalability
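To make the "SQL-like query to MapReduce job" idea concrete, here is a minimal Python sketch of how a `COUNT(*) ... GROUP BY` query breaks down into map, shuffle, and reduce steps. It only illustrates the programming model; the table, column, and function names are hypothetical, and it is not the plan Hive actually generates.

```python
from collections import defaultdict


def map_phase(rows):
    """Emit (key, 1) for each row, like the map side of COUNT(*) ... GROUP BY."""
    for row in rows:
        yield row["region"], 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values for each key to produce the per-group count."""
    return {key: sum(values) for key, values in grouped.items()}


# Hypothetical sample rows, roughly equivalent to:
#   SELECT region, COUNT(*) FROM sales GROUP BY region
sales = [{"region": "EU"}, {"region": "US"}, {"region": "EU"}]
counts = reduce_phase(shuffle(map_phase(sales)))
print(counts)  # {'EU': 2, 'US': 1}
```

On a real cluster the map and reduce steps run in parallel on different nodes, which is why this style suits batch processing more than interactive queries.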
16
Hive Interactive Query
Also called Hive LLAP (Live Long and Process)
In-memory caching
Suited for real-time analytics
17
I want to pause my resources when they are not in use.
I have real-time reporting needs.
I want to manage my own server.
I need to support a large number of concurrent users and connections.
My data is mostly unstructured.
Options: Azure SQL Database, Azure SQL Data Warehouse, SQL Server in a VM, Apache Hive
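One way to work through these requirements is to record the traits listed on the earlier slides and filter on them. The sketch below is a hedged illustration: the dictionary layout, the trait names, and the `matching_services` helper are assumptions, and `None` marks a trait the slides do not state for that option.

```python
# Traits copied from the earlier slides; None means the slides do not say.
SERVICES = {
    "Azure SQL Database": {
        "can_pause": False, "managed": True, "dynamic_scalability": True},
    "SQL Server in a VM": {
        "can_pause": True, "managed": False, "dynamic_scalability": False},
    "Azure SQL Data Warehouse": {
        "can_pause": True, "managed": True, "dynamic_scalability": True},
    # Hive on HDInsight can only be "paused" by deleting and re-creating it.
    "Apache Hive": {
        "can_pause": False, "managed": None, "dynamic_scalability": False},
}


def matching_services(**required):
    """Return services whose recorded traits equal every required value."""
    return [
        name
        for name, traits in SERVICES.items()
        if all(traits.get(trait) == value for trait, value in required.items())
    ]
```

For example, `matching_services(can_pause=True)` returns SQL Server in a VM and Azure SQL Data Warehouse, and `matching_services(managed=False)` returns only SQL Server in a VM. Requirements such as concurrency or unstructured data would need further traits added to the table.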
18
Azure Data Lake
Not a traditional DW, but …
Highly scalable data storage and analytics service
Largely intended for big data storage and analysis
Consists of three services: Analytics (Azure Data Lake Analytics), Storage (Azure Data Lake Storage), HDInsight ("managed clusters")
19
A DW solution using ADL
Azure Data Lake Store: Staging Area, Persisted (Raw) Data, Curated Data, Sandbox
Users: end users/report authors; business analysts (exploratory analysis)
Services: Azure Data Factory, Azure Data Lake Analytics, Azure Analysis Services, Data Warehouse
20
Your Feedback is Important