Design and ETL 2017. 6.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Dimensional Modeling.
Tips and Tricks for Dimensional Modeling
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Analyses for all areas of your business Analysis Suite by Taurus Software Analysis Suite by Taurus Software.
James Serra – Data Warehouse/BI/MDM Architect
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Presented by Brad Gall Using BI Techniques for Database Statistics.
Technical BI Project Lifecycle
Dimensional Modeling Business Intelligence Solutions.
Data Manager Best Practices Business Intelligence Solutions.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
ETL Process in Data Warehouse Chirayu Poundarik. Outline ETL Extraction Transformation Loading.
ETL Design and Development Michael A. Fudge, Jr.
ETL By Dr. Gabriel.
ETL and Metadata. The ETL Process Source Systems Extract Transform Staging Area Load Presentation System.
Introducing ETL: Components & Architecture Michael A. Fudge, Jr.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
SHIFALI CHOUBEY GISE LAB IITB Decision Support System For Farmers.
Physical Design Michael A. Fudge, Jr.
UNIT-II Principles of dimensional modeling
Competitive (Business) Intelligence Systems The Road to Denormalization (starring Charlie Sheen & other Random Celebrities)
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Fact Table The fact table stores business events. The attributes explain the conditions of the entity at the time the business event happened.
7 Strategies for Extracting, Transforming, and Loading.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Two-Tier DW Architecture. Three-Tier DW Architecture.
INCREMENTAL AGGREGATION After you create a session that includes an Aggregator transformation, you can enable the session option, Incremental Aggregation.
Base SAS ® vs. SAS ® Data Integration Studio Greg Nelson and Danny Grasse.
Houston Petroleum Valve Company Data-Mining Project Data Modeling Phase Fouad Alibrahim Mohammad H. Monakes University of Houston Clear Lake University.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Building the Corporate Data Warehouse Pindaro Demertzoglou Lally School of Management Data Resource Management.
ETL Process in Data Warehouse
ETL Design - Stage Philip Noakes May 9, 2015.
Building Data ware House
Informix Red Brick Warehouse 5.1
Summarized from various resources Modern Database Management
Data Warehouse.
Star Schema.
Applying Data Warehouse Techniques
Overview and Fundamentals
Dimensional Model January 14, 2003
SSIS Demo Michael A. Fudge, Jr.
CMPE 226 Database Systems April 11 Class Meeting
Data Base System Lecture 6: Relational Model
Insert, Update, Delete Manipulating Data.
Applying Data Warehouse Techniques
INFS 3220 Systems Analysis & Design
Typically data is extracted from multiple sources
MIS2502: Data Analytics Dimensional Data Modeling
Database solutions Chosen aspects of the relational model Marzena Nowakowska Faculty of Management and Computer Modelling Kielce University of Technology.
Applying Data Warehouse Techniques
Designing SSIS Packages for Performance
Warehouse Architecture
Dimensional Modeling.
MIS2502: Data Analytics Dimensional Data Modeling
The Road to Denormalization
Retail Sales is used to illustrate a first dimensional model
DWH – Dimesional Modeling
Applying Data Warehouse Techniques
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Applying Data Warehouse Techniques
Dimension Load Patterns with Azure Data Factory Data Flows
Dmytro Polishchuk BI Developer DB Best Technologies
Implementing ETL solution for Incremental Data Load in Microsoft SQL Server Ganesh Lohani SR. Data Analyst Lockheed Martin
Presentation transcript:

Design and ETL 2017. 6

Loading a Star Dimension table 다음에 fact table 데이터 로딩 필요에 따라서 referential integrity를 만족하도록 dimension, fact table에 동시에 로딩도 가능 Dimension table 간에는 dependency가 거의 없음

Load a Dimension Table Incremental process Necessary to inspect the data sources of dimension tables for new and changed information on a regular basis What a dimension load must achieve

Load a Dimension Table

Load a Dimension Table Preparing records for processing Row-wise processing Source data pivoted, transposed

Load a Dimension Table

Load a Dimension Table (Type 1 change)

Load a Dimension Table (Type 1 change) brand_name (Type 1 attribute) not fully dependent on the natural key sku -> brand_code (type 2) ->(weakly determine) brand_name (type 1)

Load a Dimension Table (Type 2 change) Cook, Dan insertion

Loading the Fact Table What a fact table load must achieve

Loading the Fact Table Restructuring A single record

Loading the Fact Table Aggregation If the source data is given at a finer grain than is required by the fact table, => aggregation ! NK 기준, 해당하는 record를 병합, 요약 작업

Loading the Fact Table Identification of surrogate keys Source 데이터 레코드는 Natural Key를 가지고 추출 추출한 레코드가 fact table에 들어갈 때는 Surrogate Key가 필요

Optimizing the Load Eliminating lookup Caching lookup Dimension table에 대하여 Type 1 change 검사를 굳이 하지 않음 Caching lookup Dimension table 전체 또는 필요 컬럼들을 memory에 로드

Optimizing the Load

Cleansing the Data Cleansing the dimensional data 표준 코드

Cleansing the Data Facts with invalid details

Housekeeping Columns Housekeeping columns for ETL process

Housekeeping Columns

How to Design and Document a Dimensional Model

Dimensional Modeling Kimball’s guideline Each star corresponds to a discrete process

Dimensional Modeling may be grouped into a fact table !

Dimensional Modeling

Dimensional Modeling

Dimensional Modeling

Dimensional Modeling

Dimensional Modeling

Dimensional Modeling

Dimensional Modeling (detailed)

Dimensional Modeling (detailed)

Dimensional Modeling (detailed)

Dimensional Modeling (detailed)