1 COMP 3503 Deductive Modeling with OLAP with Daniel L. Silver Daniel L. Silver.

Slides:



Advertisements
Similar presentations
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Advertisements

Online Analytical Processing OLAP
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Database Systems: Design, Implementation, and Management Tenth Edition
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
CS2032 DATA WAREHOUSING AND DATA MINING
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
CS346: Advanced Databases
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
1 Basic concepts of On-Line Analytical processing DT211 /4.
CISB594 – Business Intelligence
Data Warehousing & OLAP Nuosang Du Jon B. Arnason CSCI 5707 November 19, 2013.
Chetan Bhirud Raza Mohammad Abinash Sahoo Online Marketing Giant.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Warehouse & Data Mining
IBM Start Now Business Intelligence Solutions. Agenda Overview of BI Who will buy and why Start Now BI solution Benefit to customer.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
OnLine Analytical Processing (OLAP)
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Online analytical processing (OLAP) is a category of software technology that enables analysts, managers, and executives to gain insight into data through.
Data Warehousing.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
BUSINESS ANALYTICS AND DATA VISUALIZATION
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Lexmark By Rosanna Nadal & Irina Yermolovich. Lexmark International Global manufacturer of printing products and solutions for customers in more then.
1 On-Line Analytic Processing Warehousing Data Cubes.
Building Dashboards SharePoint and Business Intelligence.
OLAP in DWH Ján Genči PDT. 2 Outline OLAP Definitions and Rules The term OLAP was introduced in a paper entitled “Providing On-Line Analytical.
CISB594 – Business Intelligence Business Analytics and Data Visualization Part I.
A POWER OF OLAP TECHNOLOGY National Technical University of Ukraine “Kiev Polytechnic Institute” Heat and energy design faculty Department of automation.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
What is OLAP?.
Data Warehousing.
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
12 1 Database Systems: Design, Implementation, & Management, 6 th Edition, Rob & Coronel 12.4 Online Analytical Processing OLAP creates an advanced data.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
1 Copyright © 2006, Oracle. All rights reserved. Defining OLAP Concepts.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Data Warehousing COMP3017 Advanced Databases Dr Nicholas Gibbins –
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
Data Mining & OLAP What is Data Mining? Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Chapter 13 Business Intelligence and Data Warehouses
Chapter 13 The Data Warehouse
OLAP – On Line Analytical Processing
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Online Analytical Processing OLAP
Introduction of Week 9 Return assignment 5-2
OLAP in DWH Ján Genči PDT.
Online analytical processing (OLAP) is a category of software technology that enables analysts, managers, and executives to gain insight into data through.
Presentation transcript:

1 COMP 3503 Deductive Modeling with OLAP with Daniel L. Silver Daniel L. Silver

2 Agenda  What is OLAP?  OLAP, MOLAP and ROLAP  OLAP Functionality  Overview of IBM Cognos Insight  OLAP Pros and Cons

3 What is OLAP?

4 On-Line Analytical Processing OLAP  Term coined by E.F. Codd in a document published in 1993 sponsored by Arbor Software Corp (ESSBASE)  In contrast to OLTP and traditional RDBMS  Defined requirements for databases and tools to implement decision support and business intelligence systems.  Has had a significant impact on the database and business software market.

5 OLAP Definition  Online Analytical Processing = OLAP refers to technology that allows users of multidimensional databases to generate on-line descriptive or comparative summaries ("views") of data and other analytic queries.  OLAP facilities should be integrated into enterprise-wide data base systems allow analysts and managers to monitor the performance of the businessallow analysts and managers to monitor the performance of the business e.g. –number of transactions / sales at different locations by product class by timee.g. –number of transactions / sales at different locations by product class by time Courtesy Anders Stjarne

6 Multidimensional Requirements  Example: Sales volume as a function of product, time, and geography. Product Geography Time Dimensions: Product, Geography, Time Measure: ‘ Sales Volume ’ Courtesy Anders Stjarne More than three dimensional data cube is referred to as a hypercube

7 q Deductive Modelling and Analysis Quarter Month Type Customer Line Brand Number Country Branch Sales Rep Quantity Cost Margin Combination 1 Quarter Month Type Customer Line Brand Number Country Branch Sales Rep Quantity Cost Margin Combination 2 When? Time (1997) Who? Customers (Channels) What? Product (Type) Where? Location (Region) Result? Indicator (Revenue) Comprehensive Sales Analysis Courtesy Anders Stjarne

8 On-Line Analytical Processing 12 Rules of an OLAP Environment by E.F. Codd  Multi-dimensional - data-cubes or hypercubes  Transparent access  Navigation aids  Consistent reporting  Client-sever based  Generic dimensionality  Efficient data storage  Multi-user support  Unrestricted cross- dimensional operations  Intuitive data manipulation  Flexible reporting  Unlimited levels of aggregation

9 On-Line Analytical Processing  Strong connection to multi-dimensional database (MDBMS) model  MOLAP  Data-cubes are typically constructed off-line due to time required to build indices  Dimensions, values, and aggregations are limited to that within data-cube  On-line cube development has allowed RDBMS vendors to survive as major players in OLAP market  ROLAP

10 OLAP, MOLAP and ROLAP

11 OLAP Distributed Framework OLAP functions are independent of: Front-end user interfaceFront-end user interface Back-end data storageBack-end data storage Courtesy Anders Stjarne

12 MDBMS  Relational versus Dimensional Data Rwhttp:// Rwhttp:// Rwhttp:// Rw  ROLAP = Representing dimensional data with RDBMS Star SchemaStar Schema o P.html P.html P.html More details:More details: o o igningtheStarSchemaDatabase/tabid/101/Default.aspx igningtheStarSchemaDatabase/tabid/101/Default.aspx igningtheStarSchemaDatabase/tabid/101/Default.aspx

13 MOLAP vs. ROLAP Multidimensional difficulty handling sparcity efficiently difficulty handling sparcity efficiently direct representation of the data “ cube ” direct representation of the data “ cube ” rapid drill down on summary data rapid drill down on summary data proprietary solutions proprietary solutions better performance response better performance response does not scale well to handle large amounts of detail does not scale well to handle large amounts of detail thin client, analytical processing done on server thin client, analytical processing done on server REF: White, “MOLAP vs ROLAP,” (B&A-15) Relational multidimensional view built on a Relational DBMS hampered by the limitations of SQL handles sparcity automatically stores summary and detail data equally easily easy to share common dimensions across DWs scales well using well-developed relational technology depends on efficient processing of STAR joins and indexes analytical processing done on the client (or middle server) Courtesy Anders Stjarne

14 OLAP Functionality

15 On-Line Analytical Processing Deductive Modeling with OLAP Deductive Modeling with OLAP  Model is developed within the users mind as data is explored  Verification or rejection is facilitated by multi-dimensional functions which display data numerically and graphically  Best practices: Determine suspected variable interaction Determine suspected variable interaction Verify/reject model through exploration Verify/reject model through exploration Drill-down to refine model Drill-down to refine model Maintain record of exploratory findings Maintain record of exploratory findings

16 On-Line Analytical Processing Basic OLAP Functionality Basic OLAP Functionality  Dimension selection - slice & dice  Rotation - allows change in perspective  Filtration -value range selection  Hierarchies of aggregation levels drill-downs to lower levels drill-downs to lower levels roll-ups to higher levels roll-ups to higher levels Tremendous tool for decision support and executive information delivery and analysis

17 OLAP - Sample Operations  Roll up: summarize data total sales volume last year by product category by region total sales volume last year by product category by region  Roll down, drill down, drill through: go from higher level summary to lower level summary or detailed data For a particular product category, find the detailed sales data for each salesperson by date For a particular product category, find the detailed sales data for each salesperson by date  Slice and dice: select and project Sales of beverages in the West over the last 6 months Sales of beverages in the West over the last 6 months  Pivot or rotate: change visual dimensions Courtesy Anders Stjarne

18 OLAP and Data Mining  The final results from OLAP exploration can lead to inductive data mining  Data Mining techniques can be applied to the data views and summaries generated by OLAP to provide more in- depth and often more multidimensional knowledge  Data Mining techniques can be considered analytic extension of OLAP

19 q Multi-dimensional Cubes  A cube is a structure that stores data multi-dimensionally and provides: secure data accesssecure data access fast retrieval of data.fast retrieval of data.  Cubes can be distributed across a network or to individual computers.

20 Measures  The numeric (continuous) data that is collected and stored by your organization.  The performance measures used to evaluate your business. Examples: RevenueRevenue CostCost Quantity soldQuantity sold Units on-handUnits on-hand Hours per JobHours per Job Number of callsNumber of calls Defective units.Defective units. q #% Revenue - Cost = Profit Margin Basic Derived

21 q Dimensions and Levels  Dimensions are a broad group of descriptive data about the major aspects of your business.  Levels represent established hierarchy within dimensions. Dimensions Levels When? Date What? Products Where? Locations Years Months Days Line Type Product Region Branch Country Courtesy Anders Stjarne

22 q Levels and Categories A category is a data item that populates a level in a dimension.A category is a data item that populates a level in a dimension. Levels CategoriesDimension Locations RegionCountryBranchEuropeUnited Kingdom London, U.K. Manchester, U.K. Courtesy Anders Stjarne

23 Application Development Process q Plan measures and dimensions Create the cube Obtain the required data Develop the MDBMS model Explore the cube data using Insight Courtesy Anders Stjarne

24 Basic OLAP Operations Selection (Filter) – within the range of a dimension Selection (Filter) – within the range of a dimension Scope – the range on a dimension Scope – the range on a dimension Slice – a two dimensional ‘ page ’ from the cube Slice – a two dimensional ‘ page ’ from the cube Dice – chopping up along the dimensions Dice – chopping up along the dimensions Drill down analysis - to the detail beneath summary data Drill down analysis - to the detail beneath summary data Rollup/ Consolidate Rollup/ Consolidate Rotate (Pivot) – change dimension orientation Rotate (Pivot) – change dimension orientation o Swap rows and columns o Swap on or off o Change nesting order Reach Through – to the source data detail Reach Through – to the source data detail Calculations / Derivation formulas on the measured facts Calculations / Derivation formulas on the measured facts o Ratios, Rankings, etc. o E.g., NetSales = GrossSales – Cost; NetSales = GrossSales*(1 - Margin) REFS: INMON, Building, Ch. 7, p. 243; White, “MOLAP vs ROLAP,” (B&A-15) Courtesy Anders Stjarne

25 Advanced OLAP Operations  Trend analysis - over broad vistas of time handling time series data, time calculationshandling time series data, time calculations  Key ratio indicator measurement and tracking  Comparisons - present to: past, plan, and others competitive market analysiscompetitive market analysis  Problem monitoring - of variables within control limits  Alerts and Event-Driven Agent Processing Courtesy Anders Stjarne

26 OLAP Pros and Cons

27 On-Line Analytical Processing Strengths of OLAP  Powerful visualization ability via GUI  Fast, interactive response times  Analysis of time series  Deductive discovery of clusters/exceptions  Many OLAP products available and integrated to DB products

28 On-Line Analytical Processing Weaknesses of OLAP  Does not handle continuous variables  Does not automatically discover patterns and models  Generation of a complex hypercubes require some training and experience  Hypercube generation and update - MOLAP Vs. ROLAP

29 On-Line Analytical Processing Products and Suppliers Products and Suppliers  of_OLAP_Servers of_OLAP_Servers of_OLAP_Servers

30 Overview of IBM Cognos Insight OLAP Intro: ugczSGNVXlU ugczSGNVXlU ugczSGNVXlU In depth: ?v=bNw89HUHKEk ?v=bNw89HUHKEk ?v=bNw89HUHKEk

31 Tutorial  IBM Cognos Insight

32 THE END