Agile data warehouse design

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Dimensional Modeling.
Cognos 8 Training Session
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Data Warehouse IMS5024 – presented by Eder Tsang.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Data Warehousing ISYS 650. What is a data warehouse? A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data.
DATA WAREHOUSE (Muscat, Oman).
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Tanvi Madgavkar CSE 7330 FALL Ralph Kimball states that : A data warehouse is a copy of transaction data specifically structured for query and analysis.
Implementing a Data Warehouse with SQL Server Jump Start
1 Components of A Successful Data Warehouse Chris Wheaton, Co-Founder, Client Advocate.
Business Intelligence Instructor: Bajuna Salehe Web:
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Business Intelligence
DATA WAREHOUSING IN SQL SERVER 2005/2008 BUSINESS INTELLIGENCE.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
1 Brett Hanes 30 March 2007 Data Warehousing & Business Intelligence 30 March 2007 Brett Hanes.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Loading Ola Ekdahl IT Mentors 9/12/08.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Architecture. Inmon’s Corporate Information Factory The enterprise data warehouse is not intended to be queried directly by analytic applications,
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Atlanta Microsoft Database Forum Introduction to Data Warehousing Concepts Brian Thomas Solution Builders, Inc. Presented by March 8, 2004
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
© 2007 by Prentice Hall 1 Introduction to databases.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Module 1: Introduction to Data Warehousing and OLAP
IMS 4212: Data Warehousing / Business Intelligence 1 Dr. Lawrence West, Management Dept., University of Central Florida Data Warehousing.
BI Terminologies.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Dimensional Modeling Overview. Agenda ■ Dimensional Modeling Overview ■ Dimensional Modeling Steps ■ Dimensional Modeling Framework ■ Retail Sales - Case.
Basic Model: Retail Grocery Store
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
IMS 6217: Data Warehousing / Business Intelligence 1 Dr. Lawrence West, Management Dept., University of Central Florida Database Performance.
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
Creating the Dimensional Model
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Two-Tier DW Architecture. Three-Tier DW Architecture.
NAVYA SIRISHA RAGHAVA SHANKER ARAVIND SRIKANTH. OUTLINE Background Business Need Dimensions and Fact table Data Model Cube Implementation BI Information.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling.
1 Dimensional Modelling III. 2 Dimensional Models A denormalized relational model Made up of tables with attributes Relationships defined by keys and.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
COMP 430 Intro. to Database Systems Denormalization & Dimensional Modeling.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Jaclyn Hansberry MIS2502: Data Analytics The Things You Can Do With Data The Information Architecture of an Organization Jaclyn.
Introduction to Data Warehousing
Data Warehouse.
Applying Data Warehouse Techniques
Overview and Fundamentals
Competing on Analytics II
Applying Data Warehouse Techniques
Applying Data Warehouse Techniques
Applying Data Warehouse Techniques
Applying Data Warehouse Techniques
Data Warehousing.
Presentation transcript:

Agile data warehouse design Dao Vo Confidential

Overview of data warehousing agenda Overview of data warehousing Designing and implementing a data warehouse Waterfall BI/WH development Agile BI/WH development framework Q&A Audience? Confidential

Overview of data warehousing What is a data warehouse? Confidential

OVERVIEW OF DATA WAREHOUSING The business problem What is a data warehouse? BI/WH Architectures Confidential

Module 1: Introduction to Data Warehousing Course 10777A Module 1: Introduction to Data Warehousing The Business Problem Key business data is distributed across multiple systems Emphasize that the drivers for a data warehousing solution are typically business-related rather than technical. Ask students about the various applications and data stores in their organizations. How easy or difficult is it for students to collate data from them to create an overall, business-wide view of key measures that drive strategic business decision making?

Module 1: Introduction to Data Warehousing Course 10777A Module 1: Introduction to Data Warehousing The Business Problem Finding the information required for business decision making is time-consuming and error-prone Emphasize that the drivers for a data warehousing solution are typically business-related rather than technical. Ask students about the various applications and data stores in their organizations. How easy or difficult is it for students to collate data from them to create an overall, business-wide view of key measures that drive strategic business decision making?

Module 1: Introduction to Data Warehousing Course 10777A Module 1: Introduction to Data Warehousing The Business Problem Fundamental business questions are hard to answer Emphasize that the drivers for a data warehousing solution are typically business-related rather than technical. Ask students about the various applications and data stores in their organizations. How easy or difficult is it for students to collate data from them to create an overall, business-wide view of key measures that drive strategic business decision making?

What Is a Data Warehouse? Course 10777A Module 1: Introduction to Data Warehousing What Is a Data Warehouse? Explain that this course uses a fairly generic definition for a data warehouse. There are many very specific definitions in use throughout the data warehousing industry, and students may be aware of some of the more common schools of thought with regard to database design, including those of Bill Inmon and Ralph Kimball. This course does not advocate one approach over another, although the lab solutions and data warehouse schema design that are discussed in this course are more in line with a Kimball-based approach than any other.

What Is a Data Warehouse? Course 10777A Module 1: Introduction to Data Warehousing What Is a Data Warehouse? A centralized store of business data for reporting and analysis Typically, a data warehouse: Contains large volumes of historical data Is optimized for querying data (as opposed to inserting or updating) Is incrementally loaded with new business data at regular intervals Provides the basis for enterprise business intelligence solutions Explain that this course uses a fairly generic definition for a data warehouse. There are many very specific definitions in use throughout the data warehousing industry, and students may be aware of some of the more common schools of thought with regard to database design, including those of Bill Inmon and Ralph Kimball. This course does not advocate one approach over another, although the lab solutions and data warehouse schema design that are discussed in this course are more in line with a Kimball-based approach than any other.

Designing and implementing a data warehouse How to design a data warehouse and BI solution? Confidential

DESIGN AND IMPLEMENT WH Introduction to Dimensional Modeling Star Schemas Considerations for Dimension Tables Considerations for Fact Tables Snowflake Schemas Confidential

Warehouse Modeling Confidential

Introduction to Dimensional Modeling Module 3: Designing and Implementing a Data Warehouse Course 10777A Introduction to Dimensional Modeling Business questions focus on measures that are aggregated by business dimensions Measures are facts about the business Dimensions are ways in which the measures can be aggregated Time Product Line At this stage, the focus is on identifying the measures and dimensions, not on defining specific fact and dimension tables. Point out that the measures and dimensions by which the tables will be aggregated are generally identified through discussions with the stakeholders who will use the data warehouse. Quantity Revenue Cost Profit Region Customer Sales person Product

Module 3: Designing and Implementing a Data Warehouse Course 10777A Module 3: Designing and Implementing a Data Warehouse Star Schemas DimSalesPerson SalesPersonKey SalesPersonName StoreName StoreCity StoreRegion Group related dimensions into dimension tables Group related measures into fact tables Relate fact tables to dimension tables by using foreign keys Encourage students to suggest other examples of business events that include measures, and the dimensions by which they may be aggregated. FactOrders CustomerKey SalesPersonKey ProductKey ShippingAgentKey TimeKey OrderNo LineItemNo Quantity Revenue Cost Profit DimCustomer CustomerKey CustomerName City Region DimDate DateKey Year Quarter Month Day DimProduct ProductKey ProductName ProductLine SupplierName DimShippingAgent ShippingAgentKey ShippingAgentName

Module 3: Designing and Implementing a Data Warehouse Course 10777A Module 3: Designing and Implementing a Data Warehouse Snowflake Schemas DimSalesPerson SalesPersonKey SalesPersonName StoreKey DimStore StoreKey StoreName GeographyKey DimDate DateKey Year Quarter Month Day FactOrders CustomerKey SalesPersonKey ProductKey ShippingAgentKey TimeKey OrderNo LineItemNo Quantity Revenue Cost Profit Alert students to the danger that is inherent in refactoring a star schema to a snowflake schema. Most database professionals are experienced in designing OLTP databases, and can have a tendency toward normalization. When you design a data warehouse, you need to keep this tendency in check and ensure that you only create normalized snowflake dimensions when they can be justified by one of the considerations described in this topic. DimGeography GeographyKey City Region DimCustomer CustomerKey CustomerName GeographyKey DimShippingAgent ShippingAgentKey ShippingAgentName DimProductLine ProductLineKey ProductLineName DimProduct ProductKey ProductName ProductLineKey SupplierKey DimSupplier SupplierKey SupplierName

Warehouse Modeling Confidential

Waterfall BI/WH development Traditional SDLC to develop a BI/WH product Confidential

Waterfall BI/WH development SDLC Overview Confidential

Waterfall BI/WH development Confidential

SDLC OVERVIEW Confidential

Agile BI/WH development framework Incremental development framework for BI/WH product Confidential

Agile BI/WH development framework Agile BI/WH life cycle Agile DW design overview Agile ETL Solution Confidential

Agile BI/WH Life Cycle Confidential

Agile BI/WH Life Cycle Confidential

Agile DW design overview How to design to answer business question? Confidential

Agile DW design overview How do we ask question? The 7Ws framework Design using natural language Straightforward methodology Model storming BEAM methodology Confidential

How do we ask question? Events/Transactions Interrogatives: A immutable "fact" that occurs in a time and place Interrogatives: Who, What, When, Where, Why Descriptive context that fully describes the event A set of “dimensions" that describe events Immutable: Do not change Confidential

The 7Ws framework Why Where How Who When What How Many Confidential

The 7Ws framework HOW – FACTs Much Many Often £ $ € Who Customer Employee Seller Organization What Product Service Transactions Booking Event Why Causal Promotion Reason Weather Competition Where Location Geographic Store Ship to Hospital When Time Day Month Year The 7Ws framework

Design using natural language Verbs – Events – Relationships – Fact Tables Nouns – Details – Entities – Dimensions Main Clause – Subject-Verb-Object Prepositions – connect additional details to the main clause Interrogatives – The 7Ws – Dimension Types Confidential

Straightforward methodology 11111 4 Who 1 5 What Subject-Verb-Object 2 3 When Declare Event Type 6 Initial Data Examples Where 7 How (many) Quantities - Facts 8 Why Sufficient Detail Fact Granularity How 9 Confidential

Design using natural language Verbs – Events – Relationships – Fact Tables Nouns – Details – Entities – Dimensions Main Clause – Subject-Verb-Object Prepositions – connect additional details to the main clause Interrogatives – The 7Ws – Dimension Types Confidential

Business Event Analysis and Modeling (BEAM✲) An agile approach to dimensional modeling Confidential

Quick Inclusive Interactive Fun Model Storming Data Modeler Inclusive: Tinh toan ven Interactive Fun Confidential Data Modeler BI Stakeholders

BEAM✲ BEAM ✲ Methodology Structured, non-technical, collaborative working conversation directly with BI Users BEAM✲ Logical and Physical Dimensional Data Models Example data Detailed and Testable ETL Specification DW Prototype BI User’s Business Process, Organizational, Hierarchical, and Data Knowledge Focused Data Profiling Data Modeler BI Stakeholders Confidential

Q&A

Thank you.