BI Terminologies.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Chapter 4 Tutorial.
Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Data Warehousing M R BRAHMAM.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Dimensional Modeling – Part 2
Chapter 3. 2 Chapter 3 - Objectives Terminology of relational model. Terminology of relational model. How tables are used to represent data. How tables.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
Lab3 CPIT 440 Data Mining and Warehouse.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Tanvi Madgavkar CSE 7330 FALL Ralph Kimball states that : A data warehouse is a copy of transaction data specifically structured for query and analysis.
Designing OLAP Dimensions. Enabling Various Views Finance Operations Profit by Division by Country by Month by Actual/Budget Revenue by Product by Region.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Agenda Common terms used in the software of data warehousing and what they mean. Difference between a database and a data warehouse - the difference in.
SQL Analysis Services Microsoft® SQL Server 2005 Analysis Services provides unified, fully integrated views of your business data to support online.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Ahsan Abdullah 1 Data Warehousing Lecture-12 Relational OLAP (ROLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Analysis Services 101 Dave Fackler, MCDBA, MCSE, MCT Director, Business Intelligence Practice Intellinet Corporation.
On-Line Analytic Processing Chetan Meshram Class Id:221.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
2005 SPRING CSMUIntroduction to Information Management1 Organizing Data John Sum Institute of Technology Management National Chung Hsing University.
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
OnLine Analytical Processing (OLAP)
Chapter 3 The Relational Model. 2 Chapter 3 - Objectives u Terminology of relational model. u How tables are used to represent data. u Connection between.
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Data Warehousing.
Module 1: Introduction to Data Warehousing and OLAP
UNIT-II Principles of dimensional modeling
1 On-Line Analytic Processing Warehousing Data Cubes.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
BI Practice March-2006 COGNOS 8BI TOOLS COGNOS 8 Framework Manager TATA CONSULTANCY SERVICES SEEPZ, Mumbai.
Centre of Competence on data warehouse Workshop Helsinki Database Cube and Browsing the Cube Mark Rantala.
What is OLAP?.
CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Warehousing.
DAT 378 SQL Server 2000 Bringing The Best of Reporting Services and Analysis Services Together Sean Boon Program Manager, BI Systems
1 Online Analytical Processing (OLAP) Anjali Gupta Mithun Arora Aameek Singh Kranthi Kumar.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Physical Layer of a Repository. March 6, 2009 Agenda – What is a Repository? –What is meant by Physical Layer? –Data Source, Connection Pool, Tables and.
1 Copyright © 2006, Oracle. All rights reserved. Defining OLAP Concepts.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Or How I Learned to Love the Cube…. Alexander P. Nykolaiszyn BLOG:
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Extending and Creating Dynamics AX OLAP Cubes
11/20/ :11 AMData Mining 1 Data Mining – CSE 9033 Chapter – 1; Data Warehousing Dr. Goutam Sarker, B.E., M.E., Ph.D.(Engineering), Fellow: IE(I),
Defining Data Warehouse Structures Data Warehouse Data Access End User Data Access Data Sources Staging Area Data Marts Data Extract, Transform, and Load.
Introduction to SQL Server Analysis Services
On-Line Analytic Processing
Using Partitions and Fragments
Star Schema.
Databases & Data Warehouses
CMPE 226 Database Systems April 11 Class Meeting
Implementing Data Models & Reports with Microsoft SQL Server
Retail Sales is used to illustrate a first dimensional model
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Presentation transcript:

BI Terminologies

Agenda OLAP Data Warehouse Data Warehouse design Cube Dimensions Measures Facts

Agenda Cont’d Measure Groups Calculated Measures Measure Expression Hierarchies Dimension Members Discretization Calculated Members Custom Member Formulas

Agenda Cont’d Cell and Tuple Set and Named Set KPI MDX Dimension Types Dimension relationship to Facts Summary Questions

OLAP

Data Warehouse Repository of an organization’s electrically stored data. Designed to facilitate reporting and analysis. How is it used in the BI solution?

Data Warehouse design Star Schema: Consists of few fact tables, referencing directly any number of dimension tables

Data Warehouse design Cont’d Snowflake Schema: Consists of a centralized fact tables which are connected to multiple dimensions, and these dimensions are normalized into multiple related tables

Data Warehouse design Cont’d Combination Schema: Mix approach leading to some dimension tables being completely a star schema while other dimension tables are snow flake schema to save space.

Cube Data structure that allows fast analysis of data. Cube Operations: Slice (Where) Dice (Select) Drill up/down Types to save Cube aggregations: MOLAP (high performance, high storage) ROLAP (less performance, better storage) HOLAP (benefit from both)

Dimensions The specific data which the user will be concerned to view his data by this categorization

Measures The fields in which the system is concerned to measure.

Facts Group of measures that can be categorized by dimensions.

Example If we have a cube connected to a data source that contains the sales amount data and date time data, and the user want to view the sales amount by time, (ex. view the sales amount per month). Then we shall have a dimension “” as this what data would be categorized by. Then we will have a measure which will be the “”

Example If we have a cube connected to a data source that contains the sales amount data and date time data, and the user want to view the sales amount by time, (ex. view the sales amount per month). Then we shall have a dimension “Date Time” as this what data would be categorized by. Then we will have a measure which will be the “Sales amount” And in the fact table we will have each measure labeled to a certain period of time using foreign keys.

Measure Groups It is a group of measures; a fact is considered a measure group.

Calculated Measure A measure that is calculated by a formula, not by a direct aggregation. Example: If we have a measure to calculate the sum of salaries of employees, but we want to get the salaries including the 10% taxes, then we will create a calculated measure which will take the value of the salary and multiply it by 0.1. Also it can be used if we want to multiply 2 measures with each other; this also is considered a calculated measure

Measure Expression Works on the least level of members Formula is executed before the aggregation process

Example Calculated Measure Vs Measure Expression Branches B1 SE1 SE2 B2 SE3 SE4

Example Calculated Measure Vs Measure Expression Cont’d Quotes Q1 [1000$] SE1 [20%] SE3 [80%] Q2[1000$] SE1 [10%] SE2 [10%] SE3 [40%] SE4 [40%] Total amount for B1???

Example Calculated Measure Vs Measure Expression Cont’d It will aggregate first, then execute the multiplication so for B1 we will have the total amount (20%+10%+10%) * (2000$) which will be 800$. Measure expression: It will execute the multiplication on the lowest level then aggregate, so for B1 we will have the total amount (20% * 1000$) + (10% * 1000) + (10% * 1000) which will be 400$. And this is the correct value.

Attribute It is the columns within a dimension table.

Hierarchies It is a grouping of attributes ordered to reflect their relationship with other attributes Example: In the time dimension, we could make the following hierarchy ‘YearQuarterMonthWeekDay’, which means that every year will be divided into quarters, and each quarter will be divided into months, and so on

Dimension Members It is the value of each cell in the dimension table Example: In time dimension, in the year attribute, we have ‘2005’,’2006’,’2007’,.. As member of this attribute.

Discretization The process of grouping members of an attribute into a number of member groups. Example: If we have an attribute “City”, that contain a large number of members (ex. 500,000 city) we can discretize this attribute into groups to facilitate viewing of this attribute (ex. Cities from A-B, then C-D,.. and so on.

Calculated Members It is a member that is calculated by a formula not as the normal members of the attribute. Example: “Year to Date” added to the members of the attribute, you should be aware that the calculated members have different behavior than the normal members, so they should be tested along with the normal members.

Custom Member Formulas It is a member value that is calculated or set, for certain conditions, it will not be generic as calculated members, and it will only be executed with certain members of other attributes Example: If we want to multiply the salaries of employees in united states only by 10, then we will create a custom member formula and set its value with united states only with salary*10.

Custom Member Vs Calculated Member Exists in dimension table It can be considered as a modification or altering to the aggregation formula of already existing members Calculated Member: It is a new virtual member not existing in the dimension table Considered as a dimension member.

Cell and Tuple Cell: Tuple: Certain value within a dimension or/and fact. Tuple: Certain row within a dimension or/and fact.

Set and Named Set Set: Named Set: It is a group of rows of data. It can be considered as a calculated set, also it can be considered as a “view” with certain conditions other than the normal behavior if the set.

Example on Named Set If you want to select employees whose salaries are more than 3000, and you will use this set of data in multiple queries, then you can put the output of this query in a named set where you can use in the multiple queries you want to use, and if there is a change in any condition of this set you will not have to change in each query, you will just have to change in the named set, and each query which uses this named set will be affected with the change.

KPI Measurement for measuring business success. Frequently evaluated over time and varies from one organization to the other. Consists of: Goal – [ Target ] Value – [ Actual ] Status – [ Score ] Trend – [ Upward, Downward] Status indicator Trend indicator Parent KPI Weight – [ In case of parent KPI ]

MDX Language for OLAP databases Stands for?

Dimension Types Standard Dimension Time Dimension Server Time Dimension

Dimension Relationship to Facts Regular: A regular dimension relationship between a cube dimension and a measure group (Fact) exists when the key column for the dimension is joined directly to the fact table. This direct relationship is based on a primary key–foreign key relationship in the underlying relational database, but might also be based on a logical relationship that is defined in the data source view

Dimension Relationship to Facts Cont’d Reference (Snowflake): A reference dimension relationship between a cube dimension and a measure group exists when the key column for the dimension is joined indirectly to the fact table through a key in another dimension table.

Dimension Relationship to Facts Cont’d Fact (Degenerate): It is a fact table that acts as a dimension for itself, rather than using a separate dimension table. Usually it is used for drill to details information.

Dimension Relationship to Facts Cont’d Many To Many: It is the same as the many to many relationship in the relational database, and we should overcome this by using an intermediate table.

Summary OLAP Data Warehouse Dimensions Facts Measure Members Dimension Relationship to Facts

Questions

THANK YOU… A VERY BIG THANK YOU TO Sherif Anwar 