Business Intelligence Process Grain of the Fact Table Dr. Chang Liu

Slides:



Advertisements
Similar presentations
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Advertisements

DeepSee Embedded Real-Time BI Russia Symposium 2008.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Dimensional Modeling Business Intelligence Solutions.
Data Warehousing - 2 ISYS 650. Data Warehouse Design - Star Schema - Dimension tables – contain descriptions about the subjects of the business such as.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Dimensional Modeling – Part 2
Exploiting the DW data DW is a platform for creating a wide array of reports It solves data feed problems, but does not lead to specific decision support.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Lecture 5 CS.456 DATABASE DESIGN.
Building a Data Warehouse with SQL Server Presented by John Sterrett.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
Chapter 9: data warehousing
1.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehousing.
Chapter 9: data warehousing
DIMENSIONAL MODELING MIS2502 Data Analytics. So we know… Relational databases are good for storing transactional data But bad for analytical data What.
Operation Data Analysis Hints and Guidelines EGN 5621 Enterprise Systems Collaboration Summer B, 2014.
MIS2502: Data Analytics Dimensional Data Modeling
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
McGraw-Hill/Irwin ©2009 The McGraw-Hill Companies, All Rights Reserved CHAPTER 6 DATABASES AND DATA WAREHOUSES CHAPTER 6 DATABASES AND DATA WAREHOUSES.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Foundations of Business Intelligence: Databases and Information Management.
Chapter 11: Data Warehousing Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
An inside look into Retail Sales & Purchases. Refresh: (About US Census Bureau) Agency of the Federal Statistical System Accumulates and reports on American.
Business Intelligence Training Siemens Engineering Pakistan Zeeshan Shah December 07, 2009.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Lecture 14: Data Warehousing Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Data Warehousing Design DT211/4. Designing Data Warehouses To begin a data warehouse project, we need to find answers for questions such as: – Which user.
Operation Data Analysis Hints and Guidelines
Chapter 13 Business Intelligence and Data Warehouses
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Summarized from various resources Modern Database Management
MIS2502: Data Analytics Dimensional Data Modeling
Data Warehouse.
Star Schema.
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Overview and Fundamentals
Competing on Analytics II
MIS2502: Data Analytics Dimensional Data Modeling
CMPE 226 Database Systems April 11 Class Meeting
MANAGING DATA RESOURCES
Data Warehouse and OLAP
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Building your First Cube with SSAS
Analytics, BI & Data Integration
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

Business Intelligence Process Grain of the Fact Table Dr. Chang Liu

Business Intelligence Process Analysis Services Multidimensional Cube Data Operational Data Sources (Normalized) Staging Database Data Warehouse (Denormalized) Client Distribution Data Data Mart Data Mart * Cubes are normally created as part of a Business Intelligence process.

Business Intelligence Process Demo Create a dimensional model for a BI application Use Excel as a front end tool to analyze data

BI Benefits in Modern Organizations Improvement of Operational Performance Improvement in Customer Service Identification of New Opportunities

BI Obstacles/Challenges BI requires large initial investment BI requires substantial ongoing costs BI return-on-investment is difficult to justify Organizations lack of preparation for BI: Business events are not consistently defined throughout the enterprise BI Tools may be difficult to use for certain users

BI for Competitive Advantages? IT and Business together must tackle their data issues by answering the following questions: Data Relevance – what data is needed to complete on analytics? Data Sourcing – where can this data be obtained? Data Quantity – How much data is needed? Data Quality – How can the data be made more accurate and valuable for analysis? Data Governance – What rules and processes are needed to manage data from its creation through its retirement?

Reading Assignment Predicts 2014: Business Intelligence and Analytics will remain CIO’s Top Technology Priority Analytics 3.0

BI Tools Personal BI Team BI Organizational BI

PowerPivot for EXCEL (Personal BI)

Example: Sales Data in DB

The Star Schema Years Customers Sales Months SalesMen DayOfWeek Year _ID Year Customers Customer _ID Customer_Name Sales Customer_ID Salesman_ID Year_ID Month_ID Day_ID Amount Months Month _ID Month SalesMen Salesman _ID Salesman_Name DayOfWeek Day _ID Day

The Star Schema (2) Periods Customers Sales SalesMen Period_ID Date Customer _ID Customer_Name Sales Customer_ID Salesman_ID Period_ID Amount SalesMen Salesman _ID Salesman_Name

What is a Cube? A cube can be thought of as a multidimensional pivot or crosstab. It stores numeric values for all combinations of values of the business dimensions.

Grain of the Fact Table Granularity of Fact Table–what level of detail do you want? Finer grains  better market basket analysis capability Finer grain  more dimension tables, more rows in fact table In Web-based commerce, finest granularity is a click

Star schema example Fact table provides statistics for sales broken down by product, period and store dimensions

Size of Fact Table Depends on the number of dimensions and the grain of the fact table Number of rows = product of number of possible values for each dimension associated with the fact table Example: assume the following for Figure 1: Total rows calculated as follows (assuming only half the products record sales for a given month):

Size of Fact Table (2) The size of the fact table is many times larger than the dimension tables! Estimate the size (in bytes) of the fact table: Each of the above 6 fields average about 4 bytes in length Total Size = ? The size of the fact table depends on the number of the dimensions and the grain of the fact table. Suppose we’d like to request the daily totals be accumulated in the fact table (assuming 20% of all products record sales on a given day) Number of rows in the fact table?

Advantages of a Star Schema The star schema is a denormalized schema The star schema has several benefits: Simplified the database structure Easy to query because there is only one level of joins Queries run much faster compared to the normalized structure Easy to maintain Modeled around business entities

Class Exercise – Size of a Fact Table

PowerPivot SAP Business Object Explorer Exercises