Designing a Data Warehouse from the Ground Up

Slides:



Advertisements
Similar presentations
Author: Graeme C. Simsion and Graham C. Witt Chapter 11 Logical Database Design.
Advertisements

Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
 Definition  Components  Advantages  Limitations Contents  Definition Definition  Normal Forms Normal Forms  First Normal Form First Normal Form.
Dimensional Modeling Business Intelligence Solutions.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
MIS 451 Building Business Intelligence Systems Logical Design (5) – Aggregate.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Module 1: Introduction to Data Warehousing and OLAP
Operational vs. Informational System. Operational System Operational systems maintain records of daily business transactions whereas a Data Warehouse.
Normalized model vs dimensional model
MIS2502: Data Analytics Dimensional Data Modeling
UNIT-II Principles of dimensional modeling
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
ADVANCED TOPICS IN RELATIONAL DATABASES Spring 2011 Instructor: Hassan Khosravi.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Business Intelligence Training Siemens Engineering Pakistan Zeeshan Shah December 07, 2009.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
By A Sai Krishna Geethika Lokanadham Mithun Rajanna KV Kumar Data warehousing for Risk Analysis.
NORMALIZATION Handout - 4 DBMS. What is Normalization? The process of grouping data elements into tables in a way that simplifies retrieval, reduces data.
Information Systems in Organizations Managing the business: decision-making Growing the business: knowledge management, R&D, and social business.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Data Warehouse/Data Mart It’s all about the data.
Jaclyn Hansberry MIS2502: Data Analytics The Things You Can Do With Data The Information Architecture of an Organization Jaclyn.
Fundamental Relational Database Design
Data Warehouse.
Data warehouse and OLAP
Developing, Managing & Using Customer-related Databases
Chapter 13 The Data Warehouse
Data Warehouse—Subject‐Oriented
MIS2502: Data Analytics Dimensional Data Modeling
Data Warehousing Business Intelligence
Star Schema.
Applying Data Warehouse Techniques
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Overview and Fundamentals
Database Design Using Normalization
Dimensional Model January 14, 2003
Retail Sales is used to illustrate a first dimensional model
MIS2502: Data Analytics Dimensional Data Modeling
CMPE 226 Database Systems April 11 Class Meeting
Traveling in time with SQL Server 2017
CUSTOMER RELATIONSHIP MANAGEMENT CONCEPTS AND TECHNOLOGIES
Applying Data Warehouse Techniques
An Introduction to Data Warehousing
MIS2502: Data Analytics Dimensional Data Modeling
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Retail Sales is used to illustrate a first dimensional model
Applying Data Warehouse Techniques
Designing SSIS Packages for Performance
Database Design Agenda
Data Warehousing Data Model –Part 1
Dimensional Modeling.
MIS2502: Data Analytics Dimensional Data Modeling
Making Sense of the Power BI Ecosystem
Retail Sales is used to illustrate a first dimensional model
Building your First Cube with SSAS
Reporting on a Cube with SSRS
DWH – Dimesional Modeling
Introduction to Database Design
Applying Data Warehouse Techniques
Making Sense of the Power BI Ecosystem
Building a Microsoft BI solution step-by-step
Analytics, BI & Data Integration
Applying Data Warehouse Techniques
Data Warehouse and OLAP Technology
Presentation transcript:

Designing a Data Warehouse from the Ground Up Dustin Ryan Data Platform Architect sqldusty@gmail.com

Dustin Ryan Designing analytics solutions for past 10+ years Currently Data Platform Solution Architect for Microsoft for past 3+ years Blogs at SQLDusty.com Author, technical editor, speaker Facebook.com/SQLDusty Twitter.com/SQLDusty Baby wrangler and chicken farmer Live in Jacksonville, Florida with my wife, Angela, and three kiddos, Dallas, Bradley, Andrew LinkedIn.com/in/SQLDusty YouTube.com/dustinryan

Why a Data Warehouse? OLTP Data Warehouse Purpose Execution of business Analysis of business Primary Interaction Single transaction Aggregated transactions Interaction Method Insert, Update, Delete Select Temporal Focus Current Current/historic Design Optimization Update concurrency High-performance queries Design Principle 3NF Star Schema

Four Steps Identify the business process Identify the grain Choose the dimensions Choose the measures

Identify the Business Process Business process NOT business department If just starting, choose high impact, low risk area of the business The business can help you here For this example  Retail Sales High Impact Low Impact Low Risk High Risk

Identify the Grain What does one fact row represent? Choose the most atomic level We can’t predict the queries! “One row represents a movie rented by a customer from an employee in a store on a day.”

Define the Dimensions Who, what, where, when? De-normalized design focuses on high performance reads Best attributes are descriptive Use smallest data type possible

Define the Measures How the business measures success Best measures are fully additive Non-additive measures should be handled in SSAS

Resources

http://tinyurl.com/DesignDWGroundUp Dustin Ryan Facebook.com/SQLDusty Twitter.com/SQLDusty Dustin Ryan Data Platform Solution Architect SQLDusty@gmail.com SQLDusty.com LinkedIn.com/in/SQLDusty YouTube.com/dustinryan