Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary.

Slides:



Advertisements
Similar presentations
1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Advertisements

2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN TechTalk Beste Skalierbarkeit dank massiv.
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
1. Aim High with Oracle Real World Performance Andrew Holdsworth Director Real World Performance Group Server Technologies.
FAST FORWARD WITH MICROSOFT BIG DATA Vinoo Srinivas M Solutions Specialist Windows Azure (Hadoop, HPC, Media)
Microsoft Ignite /16/2017 5:47 PM
Extreme Performance Data Warehousing
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Analytics Map Reduce Query Insight Hive Pig Hadoop SQL Map Reduce Business Intelligence Predictive Operational Interactive Visualization Exploratory.
Hadoop Basics -Venkat Cherukupalli. What is Hadoop? Open Source Distributed processing Large data sets across clusters Commodity, shared-nothing servers.
Introduction to Hadoop and HDFS
+ Administering Microsoft SQL Server 2012 Databases Implementing a Data Warehouse with Microsoft SQL Server = Querying Microsoft SQL.
An Introduction to HDInsight June 27 th,
Microsoft SQL Server 2008 R2 IT:Network:Applications.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN SQL Server 2012 Parallel Data Warehouse.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Vorstellung Parallel.
Modern Data Warehouse: Microsoft APS Alain Dormehl June 2015.
1 Top Five Tips and Tricks for DBAs and Storage Admins Deploying Oracle Database 12c Gagan Singh Sr. Database Architect Technology and Manufacturing Group.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
By N.Gopinath AP/CSE.  The data warehouse architecture is based on a relational database management system server that functions as the central repository.
PolyBase Query Hadoop with ease Sahaj Saini SQL Server, Microsoft.
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
Azure HDInsight And Excel Analyze unstructured data at scale, then visualize! George Walters Sr. Technical Solutions Professional, Data Platform Microsoft.
Microsoft Analytics Platform System Stefan Cronjaeger, Microsoft.
SQL Server Evolution New innovations Jen Underwood Sr. Program Manager of Business Intelligence & Analytics Microsoft George Walters Sr. Technical Solutions.
Making Data Work for Everyone Gordon Phillips May 28, 2014.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Data Warehousing The Easy Way with AWS Redshift
An Introduction To Big Data For The SQL Server DBA.
PolyBase Query Hadoop with ease Sahaj Saini Program Manager, Microsoft.
©2015 DesignMind. All Rights Reserved.. 2 About DesignMind.
SQL Server 2016 editions – what’s new Express Mission critical performance SecurityData warehousing Business intelligence Advanced Analytics Hybrid cloud.
Redmond Protocols Plugfest 2016 Casey Karst PolyBase in SQL Server 2016.
JET INFOSYSTEMS The main approach to Big Data parallel processing: Oracle way Aleksey Struchenko Database Department Leader.
…the secret sauce! Diagrams and video from Microsoft white papers and slide decks.
Henk van der Valk Microsoft*
Data Platform and Analytics Foundational Training
Azure SQL Data Warehouse for Beginners
SAS users meeting in Halifax
Big Data Enterprise Patterns
System Center Marketing
Welcome! Power BI User Group (PUG)
System Center Marketing
Microsoft /2/2018 3:42 PM BRK3129 Query Big Data using the Expanded T-SQL footprint with PolyBase in SQL Server 2016 Casey Karst Program Manager.
Why Is My SQL DW Query Slow?
Informix Red Brick Warehouse 5.1
Data Warehousing: SQL Server Parallel Data Warehouse AU3 update
Azure SQL Datawarehouse - Datawarehouse on Cloud
Azure SQL Data Warehouse for SQL Server DBAS
Analytics for Apps: Landing and Loading Data into SQL Data Warehouse
What is the Azure SQL Datawarehouse?
Mapping the Data Warehouse to a Multiprocessor Architecture
Azure SQL Data Warehouse Performance Tuning
SQL 2014 In-Memory OLTP What, Why, and How
Massively Parallel Processing in Azure Comparing Hadoop and SQL based MPP architectures in the cloud Josh Sivey SQL Saturday #597 | Phoenix.
Azure SQL Data Warehouse for SQL Server DBAS
Azure SQL DWH: Tips and Tricks for developers
MPP – Maximize Parallel Productivity
Azure SQL DWH: Tips and Tricks for developers
Ch 4. The Evolution of Analytic Scalability
Azure SQL DWH: Optimization
Managing batch processing Transient Azure SQL Warehouse Resource
Introduction to Teradata
Azure SQL DWH: Tips and Tricks for developers
Azure SQL DWH: Tips and Tricks for developers
What is New in SQL Server 2016 BI Stack
Moving your on-prem data warehouse to cloud. What are your options?
Architecture of modern data warehouse
Presentation transcript:

Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary Alain MCSE Data Platforms (SQL 2012)

A bit about me …  10 years using SQL Server  MCSE Data Platforms & MCT  Work at: P-Cubed  Research, Development and Continuity  More Recently: APS and Azure DW   Feel Free to ask any questions … 05/09/2015 PASS SQL Saturday Johannesburg - #4252 |

Agenda  SMP Architecture  MPP Architecture  Unstructured Data you ask !?  The best kept secret !  Something for the DBAs  Why MPP over SMP (scale out vs scale up)  QA 05/09/2015 PASS SQL Saturday Johannesburg - #4253 |

Traditional SMP Implementation 05/09/2015 PASS SQL Saturday Johannesburg - #4254 | QUERY RESOURCE CONTENTION

MPP Implementation QUERY 05/09/2015 PASS SQL Saturday Johannesburg - #4255 | SHARE NOTHING CONTROL

Analytics Platform System 05/09/2015 PASS SQL Saturday Johannesburg - #4256 | D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 C1 C2 C3 C4 C5 C6 CONTROL NODE QUERY Even distribution of data, parallel query execution

AZURE STORAGE Azure Data Warehouse 05/09/2015 PASS SQL Saturday Johannesburg - #4257 | AZURE DW CONTROL SERVICE QUERY D3D4D5 D6D7D8D9D10 D2 D1 D13D14D15 D16D17D18D19D20 D12 D11 D23D24D25 D26D27D28D29D30 D22 D21 D33D34D35 D36D37D38D39D40 D32 D31 D43D44D45 D46D47D48D49D50 D42 D41 D53D54D55 D56D57D58D59D60 D52 D51 Even distribution of data, parallel query execution and scale to use

The Modern Data Warehouse 05/09/2015 PASS SQL Saturday Johannesburg - #4258 | Data sources Non-relational data

The Best Kept Secret – POLYBASE 05/09/2015 PASS SQL Saturday Johannesburg - #4259 | PolyBase – Single Query for Structured and Unstructured  SEAMLESS & HIGH SPEED Access  Query and join Hadoop tables with relational tables in parallel  Use SQL query language  Leverages the power of MPP to enhance the query execution performance  No need to duplicate Hadoop data into DW or vice versa  Works with all major Hadoop distributions  Predicate pushdown onto Hadoop platform to minimize data transfer  Data compression Existing SQL Skillset No IT Intervention Save Time and Costs Analyze All Data Types

MPPs’ Best Kept Secret – POLYBASE 05/09/2015 PASS SQL Saturday Johannesburg - #42510 | ~294 billion rows

Fastest way to backup SQL201x 05/09/2015 PASS SQL Saturday Johannesburg - #42511 | BACKUP DATABASE [TPCH_1TB]TO DISK = N'C:\DSI3400\LUN00\backup\TPCH_1TB-Full', DISK = N'C:\DSI3500\LUN00\backup\File2', DISK = N'C:\DSI3500\LUN00\backup\File3', DISK = N'C:\DSI3500\LUN00\backup\File4', DISK = N'C:\DSI3500\LUN00\backup\File5', DISK = N'C:\DSI3500\LUN00\backup\FileX' WITH NOFORMAT, INIT,NAME = N'TPCH_1TB-Full Database Backup', SKIP, NOREWIND, NOUNLOAD, COMPRESSION,STATS = 10 – Magic:,BUFFERCOUNT = 2200,BLOCKSIZE = 65536,MAXTRANSFERSIZE= GO

APS Backups 05/09/2015 PASS SQL Saturday Johannesburg - #42512 | Parallel Node Backup to External fileshare by default Benefit from 56 GBit Infiniband network cards for best throughput Backup sets are compressed automatically

Why APS when I have SQL Server ? 05/09/2015 PASS SQL Saturday Johannesburg - #42511 | Manageable Costs Appliance Simplicity: HW + SW Query Performance Scale Out SQL MPP versus Scale Up SMP “Small, Big & Huge Data” Integration…

Questions and Answers   Special Thank  Henk van der Valk - Microsoft EMEA APS and Big Data Lead   James Rowland-Jones (Big Bang