2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Vorstellung Parallel.

Slides:



Advertisements
Similar presentations
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Highlights BI.
Advertisements

2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Effiziente Data.
Tableau Software Australia
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN TechTalk WrapUp, Q&A Mark Wunderli Meinrad.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN TechTalk Beste Skalierbarkeit dank massiv.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1.
Living with Exadata Presented by: Shaun Dewberry, OS Administrator, RDC Tom de Jongh van Arkel, Database Administrator, RDC Komaran Hansragh, Data Warehouse.
Microsoft Data Warehouse Vision Massive Scalability at Low Cost Improved Business Agility and Alignment Democratized Business Intelligence Hardware.
Doug Lanman Data Warehousing SSP North Central, Midwest and Heartland Districts SQL Server Data Warehousing.
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
A Fast Growing Market. Interesting New Players Lyzasoft.
Danny Tambs Solution Architect. VOLUME (Size) VARIETY (Structure) VELOCITY (Speed)
Microsoft Ignite /16/2017 5:47 PM
Components and Architecture CS 543 – Data Warehousing.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model.
Designing a Data Warehouse
Microsoft Data Warehousing Vision SQL Server 2008 R2 DW enhancements High speed connectors Change Data Capture Star join enhancement features Partition.
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
April 10-12, Chicago, IL PDW Architecture Gets Real: Customer Implementations Brian Walker | Microsoft Corporation PDW Center of Excellence Murshed Zaman.
Build it yourself Custom configurations High IT expertise “Cooking recipe” Probably higher success Can be ‘sold’ to customers Tied to HW vendor Very.
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN TechTalk Hochperformante und kostengünstige.
Designing a Data Warehouse Issues in DW design. Three Fundamental Processes Data Acquisition Data Storage Data a Access.
PMIT-6102 Advanced Database Systems
SQL Server Warehousing (Fast Track 4.0 & PDW)
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information herein is subject to change without notice. HP Restricted. HP AppSystem for.
SQL Server Data Warehousing Overview
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
DBI332 ilikesql brianwmitchelll UNSTRUCTURED UNBALANCED UNPREDICTABLE.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Enterprise Data Warehouse.
Information Explosion. Reality: New Machine-Generated Data Non-relational and relational data outside of the EDW † Source: Analytics Platforms – Beyond.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN SQL Server 2012 Parallel Data Warehouse.
Building BI Solutions with SQL Server PDW AU3 Ruwen Hess Senior Program Manager Microsoft Corporation DBI321.
Introduction to Microsoft Windows 2000 Welcome to Chapter 1 Windows 2000 Server.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
CS338Parallel and Distributed Databases11-1 Parallel and Distributed Databases Lecture Topics Multi-CPU and distributed systems Monolithic system Client–server.
Solution to help customers and partners accelerate their data.
By N.Gopinath AP/CSE.  The data warehouse architecture is based on a relational database management system server that functions as the central repository.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Columnstore Indexes.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Einführung in.
Rushabh Mehta Managing Director (India) | Solid Quality Mentors
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
SMP MPP with PDW ** Workload requirements usually drive the architecture decision.
QlikView Architecture Overview
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
Thomas Baus Senior Sales Consultant Oracle/SAP Global Technology Center Mail: Phone:
Microsoft Analytics Platform System Stefan Cronjaeger, Microsoft.
SQL Server 2008 R2 Parallel Data Warehouse: Under the Hood Brian Mitchell Senior Premier Field Engineer.
SQL Server 2008 R2 Introduction Dejan Sarka Solid Quality Mentors
BIG DATA/ Hadoop Interview Questions.
Scaling PostgreSQL with GridSQL. Who Am I? Jim Mlodgenski – Co-organizer of NYCPUG – Founder of Cirrus Technologies – Former Chief Architect of EnterpriseDB.
…the secret sauce! Diagrams and video from Microsoft white papers and slide decks.
Data Platform and Analytics Foundational Training
Data warehouse and OLAP
Data Warehouse in the Cloud – Marketing or Reality?
SQL Server 2008 R2 – The Newest and the Best
Data Warehousing: SQL Server Parallel Data Warehouse AU3 update
Azure SQL Data Warehouse for SQL Server DBAS
Overview of Fast Track and PDW
What is the Azure SQL Datawarehouse?
20 Questions with Azure SQL Data Warehouse
Microsoft Analytics Platform System 03 – Distribution Theory & Design
Dell EMC SQL Server Solutions Doug Bernhardt
Moving your on-prem data warehouse to cloud. What are your options?
Presentation transcript:

2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Vorstellung Parallel Data Warehouse 1 November 2012 Meinrad Weiss

2012 © Trivadis Data Warehouse – Products Positioning Minimal HW tune- up/optimization; supports mixed workloads Balanced solution for mostly scan-centric workloads. Max HW tune-up for most DW scenarios. 4 4 Most flexible architecture for handling all DW scenarios. Scale Complexity HA by default SW-HW integration SQL Server 2008 R2 Fast Track SQL Server 2008 R2 Enterprise PDW SQL Server 2008 R2 Data Center PDW with Distributed Data Architecture November Vorstellung Parallel Data Warehouse

2012 © Trivadis Microsoft Data Warehousing Solutions Scalable and reliable platform for data warehousing on any hardware Reference Architectures offering best price performance for data warehousing Scalable and reliable platform for data warehousing on any hardware Appliance for high-end data warehousing requiring highest scalability, performance, or complexity Ideal for data marts or small to mid-sized EDWs Ideal for data marts or small to mid-sized DWs with scan- centric workloads Ideal for large data marts or mid-sized EDWs Offers flexibility in hardware and architecture Software only Reference Architectures (software and hardware) Software only DW appliance (fully integrated software and hardware) Scale-up DW Scale-out DW with MPP 10s of TB 2 – 80 TB 10s of TB 10s - 100s of TB November Vorstellung Parallel Data Warehouse

2012 © Trivadis Data Warehouse – Products Positioning 100% SQL Server 2008 R2 Compatibility Scale Complexity HA by default SW-HW integration SQL Server 2008 R2 with Fast Track Reference Architecture SQL Server 2008 R2 Enterprise PDW SQL Server 2008 R2 Data Center PDW with Distributed Data Architecture November Vorstellung Parallel Data Warehouse

2012 © Trivadis MPP vs. SMP November 2012 Vorstellung Parallel Data Warehouse 5  MPP - Massively Parallel Processing  Uses many separate CPUs running in parallel to execute a single program  Each CPU has its own memory and disks  High-speed communications between nodes  Applications must be segmented SMP MPP  SMP - Symmetric Multiprocessing  Multiple CPUs used to complete individual processes simultaneously  All CPUs share the same memory, disks, and network controllers  All SQL Server implementations up until now have been SMP

2012 © Trivadis Two hardware vendors: HP and Dell November 2012 Vorstellung Parallel Data Warehouse 6 Microsoft+Dell Parallel Data Warehouse Appliance Microsoft+HP Enterprise Data Warehouse Appliance

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control RackData Rack(s) November Vorstellung Parallel Data Warehouse

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control Rack SQL  Client connections always go through the control node  Windows Failover Cluster for Availability  Contains no persistent user data  Processes SQL requests  Prepares execution plan  Orchestrates distributed execution  Local SQL Server processes final query plan and aggregates results November Vorstellung Parallel Data Warehouse

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control Rack SQL  Provides Support and Patching for the Appliance  Holds image for re-deployment of compute node  Holds Active Directory November Vorstellung Parallel Data Warehouse

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control Rack SQL  Provides high-capacity storage for data files from ETL processes  Is available as a sandbox for other applications and scripts that run on the internal network  Provides SQL Server Integration Services Source Landing Zone Files Data Loader Compute Nodes DWLoader or SQL Server Integration Services November Vorstellung Parallel Data Warehouse

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control Rack SQL  Provides Integrated Backup Solution  Integrates with 3rd party backup products  Orderable in different sizes November Vorstellung Parallel Data Warehouse

2012 © Trivadis SQL Control Node Management Node Landing Zone Backup Node Control RackData Rack(s)  Data Rack Servers 5/10 active + 1 passive per Rack  InfiniBand, FC and Ethernet switching  Expansion Grow from 1/2–4 data racks, storage options, test/dev system  Consists of COMPUTE NODES and STORAGE NODES  Shared Nothing  Spare Node provides failover in case of node failure November Vorstellung Parallel Data Warehouse

2012 © Trivadis Connectivity and Tools Nexus Query Chameleon DWSQL November 2012 Vorstellung Parallel Data Warehouse 13

2012 © Trivadis Creating a Database CREATE DATABASE PDW WITH (AUTOGROW = ON, REPLICATED_SIZE = 1024 GB, -- (per Node) DISTRIBUTED_SIZE = GB, -- (whole System) LOG_SIZE = 1024 GB); November Vorstellung Parallel Data Warehouse

2012 © Trivadis Distribution and Replication of Data: Replicate November 2012 Vorstellung Parallel Data Warehouse 15 Time Dim Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Store Dim Store Dim ID Store Name Store Mgr Store Size Store Dim ID Store Name Store Mgr Store Size Product Dim Prod Dim ID Prod Category Prod Sub Cat Prod Desc Prod Dim ID Prod Category Prod Sub Cat Prod Desc Mktg Campaign Dim Mktg Campaign Dim Mktg Camp ID Camp Name Camp Mgr Camp Start Camp End TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD Smaller (<5GB ) Dimension Tables are Replicated on Every Compute Node TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD Sales Facts Date Dim ID Store Dim ID Prod Dim ID Mktg Camp Id Qty Sold Dollars Sold SF -1 SF -2 SF -3 SF -4 Result: Fact -Dimension Joins can be performed locally

2012 © Trivadis Create Replicated Table November 2012 Vorstellung Parallel Data Warehouse 16 CREATE TABLE DimProduct( ProductId BIGINT NOT NULL, Description VARCHAR(50), CategoryId INT NOT NULL, ListPrice DECIMAL(12,2)) WITH (DISTRIBUTION = REPLICATE); CREATE TABLE DimProduct( ProductId BIGINT NOT NULL, Description VARCHAR(50), CategoryId INT NOT NULL, ListPrice DECIMAL(12,2)) WITH (DISTRIBUTION = REPLICATE);  Creates tables on each of the individual compute nodes and assigns them to the REPLICATED file group.  Data Compression is automatically turned on

2012 © Trivadis Distribution and Replication of Data: Distribute November 2012 Vorstellung Parallel Data Warehouse 17 SF -1 Sales Facts Date Dim ID Store Dim ID Prod Dim ID Mktg Camp Id Qty Sold Dollars Sold Larger (> 10 GB) Fact Table is Hash Distributed Across All Compute Nodes SF -1 SF -2 SF -3 SF -4 Time Dim Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Store Dim Store Dim ID Store Name Store Mgr Store Size Store Dim ID Store Name Store Mgr Store Size Product Dim Prod Dim ID Prod Category Prod Sub Cat Prod Desc Prod Dim ID Prod Category Prod Sub Cat Prod Desc Mktg Campaign Dim Mktg Campaign Dim Mktg Camp ID Camp Name Camp Mgr Camp Start Camp End

2012 © Trivadis November 2012 Vorstellung Parallel Data Warehouse Distribution on a PDW PDW Node 1 Create Table _a Create Table _b … Create Table _h 8 Tables per Node PDW Node 2 Create Table _a Create Table _b … Create Table _h PDW Node 10 Create Table _a Create Table _b … Create Table _h PDW Node … Final Result: 80 individual tables across a 10 node (1 data rack) appliance CREATE TABLE myTable (column Defs) WITH (DISTRIBUTION = HASH (id)); CREATE TABLE myTable (column Defs) WITH (DISTRIBUTION = HASH (id)); 18

2012 © Trivadis Reference Case: Today’s process flow / Building blocks DB_ GSAPOP DB_ MasterTables DB_ ReportTables FinanceCube Baseline : Once data extracted from SAP: Time taken to create end-end Reports and Cubes insights 13+ hours (In production typical 20+ hours with multiple companies) DW_Finance Transactions MasterFinance table population 6 hours 21min 6 hours 1 hour Suspicious words Reports 3hr21min

2012 © Trivadis Reference Case: Audit Process with PDW DB_ GSAPOP DB_ MasterTables DB_ ReportTables FinanceCube Once data is extracted from SAP: Creating 5 CM Reports & FSCP Finance Cube; Time taken: 30 Minutes Once data is extracted from SAP: Creating 5 CM Reports & FSCP Finance Cube; Time taken: 30 Minutes DW_Finance Transactions MasterFinance table population 8m50sec load from FlatFile 23min 10m10sec 11 min All 5 Reports within 6min (80)

2012 © Trivadis Appliance Update AU3 November 2012 Vorstellung Parallel Data Warehouse 21  Performance – up to 10x improvement  Data Movement Services  New cost based Query Optimizer  New Data Movement Service  1/2 rack appliances from HP and Dell  System Center 2012 Integration (SCOM pack)  And YES … Support for Stored Procedures (subset)  Collations: Full support for international data  Native SQL Server drivers

2012 © Trivadis Landing Zone ETL Tools Hub and Spoke Departmental Reporting Regional Reporting High-Performance Reporting Central EDW Hub Regional Reporting with Business Decision Appliance Third-Party RDBMS Third-Party Data Integration Mobile Applications November Vorstellung Parallel Data Warehouse

2012 © Trivadis Web-BasedManagement Dashboard November 2012 Vorstellung Parallel Data Warehouse 23

2012 © Trivadis System Center (SCOM) November 2012 Vorstellung Parallel Data Warehouse 24

2012 © Trivadis SQL Server Compute Nodes System Throughput Regular SQL Server ( 1 Node) Seamless Scalability Half Rack PDW ( 5 Nodes) Full Rack PDW ( 10 Nodes) 2 Rack PDW ( 20 Nodes) 3 Rack PDW ( 30 Nodes) 4 Rack PDW ( 30 Nodes) November Vorstellung Parallel Data Warehouse

2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Let‘s go. November 2012 Vorstellung Parallel Data Warehouse 26 Wettbewerb