Download presentation
Presentation is loading. Please wait.
Published byJoshua Baker Modified over 8 years ago
1
Modern Data Warehousing Symmetric Multi-Processing SQL (SMP) vs Massive Parallel Processing SQL (MPP) Alain Dormehl P-Cubed Session Level : Intermediary Alain Dormehl @APSolutely MCSE Data Platforms (SQL 2012)
2
A bit about me … 10 years using SQL Server MCSE Data Platforms & MCT Work at: P-Cubed Research, Development and Continuity More Recently: APS and Azure DW Twitter: @APSolutely Feel Free to ask any questions … 05/09/2015 PASS SQL Saturday Johannesburg - #4252 |
3
Agenda SMP Architecture MPP Architecture Unstructured Data you ask !? The best kept secret ! Something for the DBAs Why MPP over SMP (scale out vs scale up) QA 05/09/2015 PASS SQL Saturday Johannesburg - #4253 |
4
Traditional SMP Implementation 05/09/2015 PASS SQL Saturday Johannesburg - #4254 | QUERY RESOURCE CONTENTION
5
MPP Implementation QUERY 05/09/2015 PASS SQL Saturday Johannesburg - #4255 | SHARE NOTHING CONTROL
6
Analytics Platform System 05/09/2015 PASS SQL Saturday Johannesburg - #4256 | D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 D1D2D3 D4D5D6D7D8 C1 C2 C3 C4 C5 C6 CONTROL NODE QUERY Even distribution of data, parallel query execution
7
AZURE STORAGE Azure Data Warehouse 05/09/2015 PASS SQL Saturday Johannesburg - #4257 | AZURE DW CONTROL SERVICE QUERY D3D4D5 D6D7D8D9D10 D2 D1 D13D14D15 D16D17D18D19D20 D12 D11 D23D24D25 D26D27D28D29D30 D22 D21 D33D34D35 D36D37D38D39D40 D32 D31 D43D44D45 D46D47D48D49D50 D42 D41 D53D54D55 D56D57D58D59D60 D52 D51 Even distribution of data, parallel query execution and scale to use
8
The Modern Data Warehouse 05/09/2015 PASS SQL Saturday Johannesburg - #4258 | Data sources Non-relational data
9
The Best Kept Secret – POLYBASE 05/09/2015 PASS SQL Saturday Johannesburg - #4259 | PolyBase – Single Query for Structured and Unstructured SEAMLESS & HIGH SPEED Access Query and join Hadoop tables with relational tables in parallel Use SQL query language Leverages the power of MPP to enhance the query execution performance No need to duplicate Hadoop data into DW or vice versa Works with all major Hadoop distributions Predicate pushdown onto Hadoop platform to minimize data transfer Data compression Existing SQL Skillset No IT Intervention Save Time and Costs Analyze All Data Types
10
MPPs’ Best Kept Secret – POLYBASE 05/09/2015 PASS SQL Saturday Johannesburg - #42510 | ~294 billion rows
11
Fastest way to backup SQL201x 05/09/2015 PASS SQL Saturday Johannesburg - #42511 | BACKUP DATABASE [TPCH_1TB]TO DISK = N'C:\DSI3400\LUN00\backup\TPCH_1TB-Full', DISK = N'C:\DSI3500\LUN00\backup\File2', DISK = N'C:\DSI3500\LUN00\backup\File3', DISK = N'C:\DSI3500\LUN00\backup\File4', DISK = N'C:\DSI3500\LUN00\backup\File5', DISK = N'C:\DSI3500\LUN00\backup\FileX' WITH NOFORMAT, INIT,NAME = N'TPCH_1TB-Full Database Backup', SKIP, NOREWIND, NOUNLOAD, COMPRESSION,STATS = 10 – Magic:,BUFFERCOUNT = 2200,BLOCKSIZE = 65536,MAXTRANSFERSIZE=2097152 GO
12
APS Backups 05/09/2015 PASS SQL Saturday Johannesburg - #42512 | Parallel Node Backup to External fileshare by default Benefit from 56 GBit Infiniband network cards for best throughput Backup sets are compressed automatically
13
Why APS when I have SQL Server ? 05/09/2015 PASS SQL Saturday Johannesburg - #42511 | Manageable Costs Appliance Simplicity: HW + SW Query Performance Scale Out SQL MPP versus Scale Up SMP “Small, Big & Huge Data” Integration…
14
Questions and Answers Alaind@p-cubed.co.za Alaind@p-cubed.co.za Special Thank Henk van der Valk - Microsoft EMEA APS and Big Data Lead http://henkvandervalk.com/ http://henkvandervalk.com/ James Rowland-Jones (Big Bang Data) @jrowlandjones @jrowlandjones
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.