Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix.

Slides:



Advertisements
Similar presentations
Chapter 22: Cloud Computing and Related Security Issues Guide to Computer Network Security.
Advertisements

FAST FORWARD WITH MICROSOFT BIG DATA Vinoo Srinivas M Solutions Specialist Windows Azure (Hadoop, HPC, Media)
Virtual techdays INDIA │ November 2010 SQL Azure Data Sync Shilpa Nirmale │ Associate Manager, Accenture.
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Windows Azure SQL Database and Storage Name Title Organization.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Introduction To Windows Azure Cloud
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Modern Data Warehouse: Microsoft APS Alain Dormehl June 2015.
PaaSport Introduction on Cloud Computing PaaSport training material.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
Advanced Technology Days 18. i 19. studenog 2015., Cinestar Arena Centar.
Information managers are seeking innovative DBMS’s which are able to handle large data volumes in new ways or to optimize existing products and processes.
Breaking points of traditional approach What if you could handle big data?
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
CLOUD COMPUTING WHAT IS CLOUD COMPUTING?  Cloud Computing, also known as ‘on-demand computing’, is a kind of Internet-based computing,
Hello Cloud… Mike Benkovich
SQL Server 2012 Session: 1 Session: 4 SQL Azure Data Management Using Microsoft SQL Server.
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
Azure HDInsight And Excel Analyze unstructured data at scale, then visualize! George Walters Sr. Technical Solutions Professional, Data Platform Microsoft.
Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary.
Big Data analytics in the Cloud Ahmed Alhanaei. What is Cloud computing?  Cloud computing is Internet-based computing, whereby shared resources, software.
AZ PASS User Group Azure Data Factory Overview Josh Sivey, Solution Partner October
An Introduction To Big Data For The SQL Server DBA.
©2015 DesignMind. All Rights Reserved.. 2 About DesignMind.
BI 202 Data in the Cloud Creating SharePoint 2013 BI Solutions using Azure 6/20/2014 SharePoint Fest NYC.
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Business Insights Play briefing deck.
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Migrating to and Integrating with SQL Azure
Unit 3 Virtualization.
Connected Infrastructure
Chapter 6: Securing the Cloud
Organizations Are Embracing New Opportunities
Data Platform and Analytics Foundational Training
Azure SQL Data Warehouse for Beginners
Big Data Enterprise Patterns
Welcome! Power BI User Group (PUG)
Where Should My Data Live (and Why)?
What is Cloud Computing - How cloud computing help your Business?
Why Is My SQL DW Query Slow?
Cloud Computing Solutions |
Cloud Data platform (Cloud Application Development & Deployment)
Welcome! Power BI User Group (PUG)
Connected Infrastructure
9/13/2018 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Chapter 21: Cloud Computing and Related Security Issues
APACHE HAWQ 2.X A Hadoop Native SQL Engine
Chapter 22: Cloud Computing Technology and Security
Cloudy with a Chance of Data
Azure SQL Data Warehouse for SQL Server DBAS
Welcome! Power BI User Group (PUG)
Cloudy with a Chance of Data
What is the Azure SQL Datawarehouse?
Azure SQL Data Warehouse for SQL Server DBAS
Azure SQL DWH: Tips and Tricks for developers
Welcome! Power BI User Group (PUG)
Managing batch processing Transient Azure SQL Warehouse Resource
"Cloud services" - what it is.
Power BI with Analysis Services
Cloud Computing: Concepts
MS AZURE By Sauras Pandey.
Where Should My Data Live (and Why)?
HDInsight & Power BI By Łukasz Gołębiewski.
Moving your on-prem data warehouse to cloud. What are your options?
Introduction to Azure Data Lake
Cloudy with a Chance of Data
Presentation transcript:

Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix

Agenda What “kind” of MPP are we talking about? Benefits of using Azure for MPP solutions Comparing Hadoop MPP vs. SQL MPP Hadoop (Azure HDInsight) SQL (Azure SQL Data Warehouse) Discuss PaaS vs. IaaS Demos! Wrap-up SQL Saturday #597 | PHOENIX 2017

What “kind” of MPP are we talking about? massively parallel refers to the use of a large number of processors (or separate computers) to perform a set of coordinated computations in parallel (simultaneously). Share-Nothing Infrastructure Easily Scales Out SQL Saturday #597 | PHOENIX 2017

Benefits of using Azure for MPP solutions Ease / Speed of Deployment No Infrastructure Selection / Procurement Reduced Maintenance Cost Pay only for what you use Scale Out and Up SQL Saturday #597 | PHOENIX 2017

PaaS vs. IaaS Infrastructure-as-a-Service (IaaS) Equipment Servers, Storage, Networking Platform-as-a-Service (PaaS) Complete Solution Ecosystem Equipment and Software SQL Saturday #597 | PHOENIX 2017

PaaS vs. IaaS Platform-as-a-Service (PaaS) Infrastructure-as-a-Service Decreased Maintenance Abstracted Complexity of Architecture New versions/features Automatically Rolled Out Infrastructure-as-a-Service Fine grain control of environment Choice of Software Versions Customizable SQL Saturday #597 | PHOENIX 2017

Hadoop MPP vs. SQL MPP Hadoop MPP SQL MPP Hadoop Ecosystem HDFS, Hive, Tez, Impala, … Structured, Semi-Structured, Unstructured Data SQL MPP SQL Server on MPP Architecture T-SQL For Queries SSMS SQL Saturday #597 | PHOENIX 2017

Demos – What are we going to show? SQL Saturday #597 | PHOENIX 2017

Demo #1 – Hadoop MPP Demo HDInsight via Azure Marketplace Potential Use Cases Dev / POC Cases without impacting Production Testing Version Upgrades Peak Processing Offload Backups SQL Saturday #597 | PHOENIX 2017

Demo #1 – Hadoop MPP Demo Resulting Architecture SQL Saturday #597 | PHOENIX 2017

Demo #1 – Hadoop MPP Demo Review Used Cloudera Distribution from Azure Marketplace to Provision a Hadoop Cluster Connected to commonly-used Hadoop Tools in the Cloud Updated HDFS configuration to allow to connect to Azure Blob Storage Copied Data into HDFS Connected Client Tool to Cloud Cluster SQL Saturday #597 | PHOENIX 2017

Demo #2 – SQL MPP Demo Azure SQL Data Warehouse Potential Use Cases via Azure Marketplace Potential Use Cases Building a new cloud based Data Warehouse Hybrid data source scenarios High-Performance Computing Agility and Elastic Scale SQL Saturday #597 | PHOENIX 2017

Demo #2 – SQL MPP Demo - Architecture By combining MPP architecture and Azure storage capabilities, SQL Data Warehouse can: Grow or shrink storage independent of compute. Grow or shrink compute without moving data. Pause compute capacity while keeping data intact. Resume compute capacity at a moment's notice. SQL Saturday #597 | PHOENIX 2017

Demo #2 – SQL MPP Demo Review Azure SQL Data Warehouse - PaaS Azure SQL DW is a cloud-based, scale-out database capable of processing massive volumes of data Increase, decrease, pause, or resume compute in seconds. Fully fault tolerant with automatic back-ups. Develop with familiar SQL Server T-SQL and tools. SQL Saturday #597 | PHOENIX 2017

Thank you Sponsors!

Thank You