© 2007 Robert T. Monroe Carnegie Mellon University ©2006 - 2008 Robert T. Monroe 45-875 BI Tools and Techniques Administrivia – HW #2 Homework #2 OLAP.

Slides:



Advertisements
Similar presentations
The Organisation As A System An information management framework The Performance Organiser Data Warehousing.
Advertisements

Business Intelligence Simon Pease. Experience with BI Developing end-to-end BI prototype for Plan International Developing end-to-end BI prototype for.
DENORMALIZATION CSCI 6442 © Copyright 2015, David C. Roberts, all rights reserved.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Technical BI Project Lifecycle
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Implementing Business Analytics with MDX Chris Webb London September 29th.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Warehouse and Data Cube Lecture Notes for Chapter 3 Introduction to Data Mining By.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
1 Basic concepts of On-Line Analytical processing DT211 /4.
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
SQL Analysis Services Microsoft® SQL Server 2005 Analysis Services provides unified, fully integrated views of your business data to support online.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
OnLine Analytical Processing (OLAP)
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
Faster and Smarter Data Warehouses with Oracle OLAP 11g.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
Data Warehousing.
Building the cube – Chapter 9 & 10 Let’s be over with it.
BI Terminologies.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
DEFINING the BUSINESS REQUIREMENTS. Introduction OLTP and DW planning is different in term of requirements clarity Planning DW is about solving users’
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
CS 157B: Database Management Systems II April 3 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
UNIT-II Principles of dimensional modeling
Presented By: Solutions Delivery Managing Reports in CRMnext.
Building Dashboards SharePoint and Business Intelligence.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
1 Agenda – 04/02/2013 Discuss class schedule and deliverables. Discuss project. Design due on 04/18. Discuss data mart design. Use class exercise to design.
Administrivia HW #1 management option due now – please submit
Analytics & Reporting Tool.  Outline how to access SAS OLAP Cubes through SAS AMO  Review SAS OLAP Cube creation and how it relates to integration with.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Oracle Business Intelligence Foundation - Commonly Used Features in Repository.
OLTP, OLAP, Datawarehousing and Mining. OLTP Online transaction processing, or OLTP, refers to a class of systems that facilitate and manage transaction-oriented.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Operation Data Analysis Hints and Guidelines
Chapter 13 Business Intelligence and Data Warehouses
Using Partitions and Fragments
Data storage is growing Future Prediction through historical data
Data Warehouse.
Star Schema.
Competing on Analytics II
CMPE 226 Database Systems April 11 Class Meeting
MIS2502: Data Analytics Dimensional Data Modeling
DataMart (Data Warehouse) Tool:
MIS2502: Data Analytics Dimensional Data Modeling
Analysis Services Analysis Services vs. the Data Warehouse vs. OLTP DB
Presentation transcript:

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Administrivia – HW #2 Homework #2 OLAP option is posted to the wiki It seems to be challenging to get Analysis Services installed and working properly on student laptops I strongly encourage you to do the reporting option, unless you have significant system administration skills, patience, and a willingness to go to ‘plan B’ if the original plan doesn’t work out… That said, it’s an interesting assignment to work with the OLAP tools. Your call…

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Online Analytical Processing (OLAP) BI Tools and Techniques Robert Monroe April 8, 2008

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Key Take Aways OLAP tools support interactive analysis and exploration of large and complex dimensional data sets Much of the power of OLAP comes from the use of a standard data model (cubes) and offline processing, aggregation, and analysis of data To use OLAP tools effectively, you need to have a basic understanding of how and why data is structured in cubes and the kinds of analyses that this structure makes readily available to you

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Core OLAP Concepts

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques What Are OLAP Tools? OLAP tools provide a mechanism for interactive analysis and exploration of dimensional data –Interactive: users need to be able to easily specify queries –Analysis: it should be possible to perform (and reuse) complex analyses of the dimensional data –Exploration: answering one question with an OLAP tool frequently raises numerous subsequent questions A good OLAP tool allows the user to quickly pose follow-on queries –Dimensional: OLAP tools operate on dimensional data – data structured as facts and dimensions

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques OLAP’s Role In Decision Making Source: O’Brien, Management Information Systems, 6 th ed. OLAP excels at exploring complex, structured questions OLAP Sweet-Spot

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Quick OLAP Tools Demo Contour Components OLAP cube browser –Open in IE 6.0 or higher –Ok the installation of any ActiveX controls that the site requests – Use the Samples > Government > Regional Employee Turnover menu in the upper left of the screen to open up sample OLAP cube. Demo requires IE 6.0 or later and ActiveX install –Installation for class is optional For first demo we will browse regional emloyee turnover data

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Why Not Just Write SQL Queries? Performance Complexity Exploration Presentation Difficulty in dealing with hierarchies Difficult or impossible to specify some desired queries

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Why Not Just Use Spreadsheets? Complexity (with > 2 dimensions) Presentation is tied to representation Does not scale to large data sets or many dimensions –Storage and representation is ill-suited to the task Inability to deal with hierarchies

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques OLAP’s Place In A Business Intelligence Solution Reconcile Data Derive Data OLAP Cube OLAP Tools Analyze Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Dimensional Modeling with HyperCubes: Basic Concepts

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Representing Dimensional Databases as Cubes OLAP tools represent dimensional data as cubes –Cubes are also sometimes referred to as hypercubes Dimension tables are represented as cube dimensions Facts are represented using measures –Measures can be thought of as the values stored in individual cells of the cube –Measures consist of two parts: A numerical value that represents the basic fact A formula for combining multiple measures into a single measure

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Quick Review: Dimensional Modeling Example Fact table provides statistics for sales broken down by product, period and store dimensions Dimension tables provides details on stores, products, and time periods Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Quick Review: Dimensional Example With Data Product (dimension) Period (dimension) Store (dimension) Sales (fact) Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Multiple Fact Tables It is frequently useful to store more than one type of fact in a single multidimensional database (star schema) This can be handled by using multiple fact tables that share dimensions Example: modeling products sold and products purchased Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Factless Fact Tables – Tracking Events “Factless” fact tables store only foreign keys, no facts Factless fact tables allow the tracking of what types of events happened, and under what circumstances they happened Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Conformed Dimensions When dimensions are shared across multiple fact tables they must be conformed dimensions Conformed dimensions –One or more dimension tables associated with two or more fact tables for which the dimension tables have the same business meaning and primary key with each fact table Conformed dimensions allow users to: –Query across multiple fact tables –Improve consistency of meaning and structure for derived and retrieved information

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Tabular Representation of Measures and Dimensions Simple example of viewing OLAP data in a grid: –Row headings (Store) represent dimension members –Columns represent different measures Store Sales Data for 2004 StoreGross SalesQuotaProfitsSales vs. Quota Chicago$3,250,000$2,750,000$624,352+ $500,000 New York$4,500,000$3,550,000$100,000+ $950,000 Pittsburgh$1,600,000$1,700,000$250,000- $100,000 Measures Dimension

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Tabular Representation of Measures and Dimensions Example 2: Store sales by year and store location –Column and row headings represent dimension values in this case –Cells represent measures, Name of table describes measure Store Sales Data Store Chicago$3,250,000$3,500,000$3,000,000$3,900,000 New York$4,500,000$4,350,000$5,100,000$5,450,000 Pittsburgh$1,600,000$1,700,000$1,800,000$1,650,000 Dimensions Measures

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Cube Representation of Measures and Dimensions Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Dimension Hierarchies Dimension tables are represented as cube dimensions –Cube dimensions use levels to represent hierarchies –Each sub-level subdivides the parent level with finer granularity Dimensions can be of fixed or variable height (jagged) Examples –Dimension: Time Period Levels – Year :: Quarter :: Month :: Week :: Day –Dimension: Organization Levels – Company :: Division :: Department :: Employee

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Measures Measures represent the interesting data at the intersection of different dimensions There is a space for a measure at every intersection of every level of every dimension –Base facts are stored in the intersections of lowest-level dimensions (either simple or calculated measures) –Aggregate or computed values are stored at the intersections of where all of the dimensions are not at the lowest level (aggregate values must be calculated measures)

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Three Categories Of Measures Additive measures can be meaningfully combined along any dimensions –Example: total sales by product, location, or time Semi-additive measures cannot be combined along one or more dimensions –Example: summing inventory levels across time Non-additive measures cannot be combined along any dimensions –Example: weighted averages without weight information Exercise: –Identify three measures of interest for a cube that tracks sales data –Be sure to identify numeric value tracked and aggregation function Definition source: Pedersen and Jensen, Multidimensional Database Technology, IEEE Computer 12/01

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Why OLAP Performs So Well Pre-computation of aggregates, and other values at cube-building time enable very rapid responses to many common queries Ability to specify other formulas/values to precompute on cube build Use of standardized structure and dimensional model allows query engine to make many assumptions about how to best answer queries and take advantage of pre- computed values

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Dimensions Examples What dimensions are available in the regional employee turnover example? –Are there any important dimensions missing that you might want to use for an analysis if you were a governmental official trying to improve the employment outlook in your region? The worldwide population cube has an example of a hierarchical dimension –Which one is hierarchical? –Is it a fixed or jagged dimension? –What are the measures in this cube?

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Analytics Analytics are specific analyses that can be performed on an OLAP cube –Simple pre-defined analytics (sums, counts, percentages) –Complex pre-canned analytics defined as part of the cube model/build –Ad-hoc exploration Examples: –Actual sales vs. quota by sales region –Supplier count by commodity category by division –Deviation from contracted pricing by supplier, commodity category, and division over the previous 3 years –Examples of analytics related to sourcing or procurement?

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Analytics Examples Revenue cube analytics Automobile traffic analytics Marketing dynamics cube (multiple slices preset)

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Drilling Down The drilling down operation analyzes the data presently displayed in greater detail. Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Slicing The slicing operation selects specific values for one or more dimensions of a cube and renders measures for those dimensions in a two-dimensional table Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Filtering Filtering reduces the elements included in a calculation Filtering can cross multiple slices Example: filter previous results to only show February, April, May Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques In-Class Exercise Open the Contour Cubes Automobile Traffic sample Which intersection and day in London has the most overutilization of the roads? Which intersection has the worst overutilization of roads across all of the days? Which intersection has the highest overall hourly traffic flow?

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Pivoting Data OLAP tools generally let you pivot dimensions –This involves switching which dimensions are displayed horizontally and which are displayed vertically This can be useful when exploring and trying to visualize data Store Sales Data ‘97 – ‘00 ($ Millions) Store Chicago$3.25$3.5$3.0$3.9 NY$4.5$4.35$5.1$5.45 Pgh$1.6$1.7$1.8$1.65 Annual Sales, By Store ‘97 – ‘00 ($ Millions) YearChicagoNYPGH 1997$3.25$4.5$ $3.5$4.35$ $3.0$5.1$ $3.9$5.45$1.65 Pivot

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Modeling Hierarchies Dimension tables frequently model hierarchies Example: –Customers dimension stores data about your customers –You may sell to several divisions of a single company –You want to be able to analyze sales to the individual divisions and also capture “rolled-up” values for the parent company Divisions of ABC Automotive Diagram Source: Hoffer, Prescott, McFadden, Modern Database Management, 7 th ed.

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Modeling Hierarchies With Denormalized Tables (I) Hierarchical dimensions are frequently represented with denormalized tables Simplifies and speeds queries at the cost of introducing anomalies This example represents a ‘jagged’ or ‘arbitrary’ hierarchy Customer_Dimension Parent_CompanyCustomer_KeyNameAddressType C000001ABC Automotive100 1 st St.Dealer C000001C000002ABC Auto Sales110 1 st St.Sales C000001C000003ABC Repair130 1 st St.Service C000002C000004ABC Auto New Sales110 1 st St.Sales C000002C000005ABC Auto Used Sales110 1 st St.Sales C000006Bubba’s House O’ Cars5432 Maple LnDealer

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Modeling Hierarchies With Denormalized Tables (II) Similar example but with a well-defined hierarchy depth –Same number of levels for all entries in the dimension table –Simpler structureThis approach requires a fixed height to hierarchy –, CityID serves as primary key for the whole table City_Geography_Dimension CityIDCityNameStateIDStateNameTimeZone 45Little Rock2ArkansasCentral 263Denver15ColoradoMountain 423Aspen15ColoradoMountain 522Pittsburgh36PennsylvaniaEastern 771Philadelphia36PennsylvaniaEastern

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Wrap Up

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques Key Take Aways OLAP tools support interactive analysis and exploration of large and complex dimensional data sets Much of the power of OLAP comes from the use of a standard data model (cubes) and offline processing, aggregation, and analysis of data To use OLAP tools effectively, you need to have a basic understanding of how and why data is structured in cubes and the kinds of analyses that this structure makes readily available to you

© 2007 Robert T. Monroe Carnegie Mellon University © Robert T. Monroe BI Tools and Techniques 7 th Inning Stretch