1 Lecture 09: OLAP www.cl.cam.ac.uk/Teaching/current/Databases/

Slides:



Advertisements
Similar presentations
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Advertisements

Data Warehousing Willem Visser RW334. Somebody is watching! Everybody seems to be recording your every move Loyalty cards Cookies – Facebook, Twitter,…
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Introduction to Data Warehousing CPS Notes 6.
Data Warehousing M R BRAHMAM.
The Role of Data Warehousing and OLAP Technologies CS 536 – Data Mining These slides are adapted from J. Han and M. Kamber’s book slides (
Data Warehousing Xintao Wu. Evolution of Database Technology (See Fig. 1.1) 1960s: Data collection, database creation, IMS and network DBMS 1970s: Relational.
Dr. M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2010 COMP207: Data Mining Data Warehousing COMP207: Data Mining.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Warehouse and Data Cube Lecture Notes for Chapter 3 Introduction to Data Mining By.
Lab3 CPIT 440 Data Mining and Warehouse.
1 9 Adv. DBMS Data Warehouse CSC5301 Review Hachim Haddouti.
Microsoft SQL Server 2012 Analysis Services (SSAS) Reporting Services (SSRS)
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
DATA WAREHOUSE (Muscat, Oman).
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
Chapter 4 Tutorial.
CS346: Advanced Databases
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
Components of the Data Warehouse Michael A. Fudge, Jr.
1 Data Warehouses C hapter 2. 2 Chapter 2 Outline Chapter 2 Outline – Introduction –Data Warehouses –Data Warehouse in Organisation – OLTP vs. OLAP –Why.
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
OLAP OPERATIONS. OLAP ONLINE ANALYTICAL PROCESSING OLAP provides a user-friendly environment for Interactive data analysis. In the multidimensional model,
Dr. Bernard Chen Ph.D. University of Central Arkansas
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
1 Cube Computation and Indexes for Data Warehouses CPS Notes 7.
OnLine Analytical Processing (OLAP)
CS 157B: Database Management Systems II March 20 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Data Warehousing Xintao Wu. Can You Easily Answer These Questions? What are Personnel Services costs across all departments for all funding sources? What.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
T.ROKAYAH BAYAN OLAP IN THE DATA WAREHOUSE. CHAPTER OBJECTIVES  Review the major features and functions of OLAP in detail  Grasp the intricacies of.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Data Warehousing. Databases support: Transaction Processing Systems –operational level decision –recording of transactions Decision Support Systems –tactical.
Dr. N. MamoulisAdvanced Database Technologies1 Topic 6: Data Warehousing & OLAP Defined in many different ways, but not rigorously. A decision support.
SHIFALI CHOUBEY GISE LAB IITB Decision Support System For Farmers.
Data Mining Data Warehouses.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2009 This is the full course notes, but not quite complete. You.
CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
January 21, 2016Data Mining: Concepts and Techniques 1 Chapter 3: Data Warehousing and OLAP Technology: An Overview What is a data warehouse? A multi-dimensional.
Advanced Database Concepts
CS 157B: Database Management Systems II April 10 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
Datawarehousing and OLAP C.Eng 714 Spring
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Data Mining: Data Warehousing
Information Management course
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
A B D C G5b Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product PC
On-Line Analytic Processing
What is OLAP OLAP allows to model data in a multidimensional way like a data cube in order to look for the data from many perspectives.
OLAP Concepts and Techniques
Data Warehousing and OLAP Technology for Data Mining
Data Warehouse and OLAP
Lecture 4: From Data Cubes to ML
Data Warehousing and Decision Support Chapter 25
Introduction of Week 9 Return assignment 5-2
Data Mining: Concepts and Techniques
Data Warehouse.
Data Warehouse and OLAP
Data Warehousing.
Presentation transcript:

1 Lecture 09: OLAP

2 2+2 /* Microsoft SQL Server 2005 */ /* By the way, it is just VHVyaW5nIG1hY2hpbmU= :-) */ WITH SubQuery(t, s, a, b) AS ( SELECT 0, 's', CAST (' 1 */ SELECT '4', '1', '4', '1', 'r' UNION ALL /* find 0 or _; left */ SELECT '4', '_', '5', '_', 'l' UNION ALL SELECT '4', '0', '5', '0', 'l' UNION ALL SELECT '5', '1', '6', '0', 's' UNION ALL /* 1 -> 0 */ SELECT '6', '1', '6', '1', 'l' UNION ALL /* rewind */ SELECT '6', '0', '6', '0', 'l' UNION ALL SELECT '6', '<', 's', '<', 's' /* restart */ ) AS prog(currS, currZ, newS, newZ, mv) WHERE curr.s = currS AND Right(curr.a, 1) = currZ ) SELECT CharIndex('0', a + b) - 2 FROM SubQuery WHERE s = 'a' OPTION (MAXRECURSION 0); /* SELECT t, s, a + '.' + b FROM SubQuery OPTION (MAXRECURSION 0); */ David Srbecky

3 Acknowledgments DB2/400: Mastering Data Warehousing Functions. (IBM Redbook) Chapters 1 & 2 only. Data Warehousing and OLAP Hector Garcia-Molina (Stanford University) Data Warehousing and OLAP Technology for Data Mining Department of Computing London Metropolitan University

4 Buzz Words Buzz Words Buzz Words Buzz Words Buzz Words Data Warehouse (DW) Decision Support (DS) Data Marts (DM) Data Mining (DM) Enterprise Dashboard (ED) Multi-Dimensional Modeling (MDM) Online Analytic Processing (OLAP) Extract, Transform, and Load (ETL) MOLAP vs. ROLAP Three Letter Acronym (TLR) Drill Down, Roll up (DD+RU) Data vs. Knowledge (DvK) Data Cube vs. Sugar Cube (DCvSC) Don’t be surprised to see this sort of BDB (Blah-Dee-Blah) in the trade press: “The ED lets you transform enterprise data into knowledge with at-a-glance DS/DM and MDM, allowing interactive DD/RU over large DCs.”

5 OLTP vs. OLAP Database is operational Data is up-to-date Mostly updates Need to support high levels of update transactions Normal form schemas are important Database is for analysis Data is historical Mostly reads Need to efficiently support complex queries, and only bulk loading of data Schema optimized for query processing

6 Decision Support Systems Information SourcesData Warehouse Server (Tier 1) OLAP Servers (Tier 2) Clients (Tier 3) Operational DB’s Semistructured Sources Extract Transform Load Data Marts Data Warehouse e.g., MOLAP e.g., ROLAP serve Analysis Query/Reporting Data Mining serve From Enrico Franconi CS 636

7 xOLAP Multi-dimensional OLAP (MOLAP) –‘A k-dimensional matrix based on a non relational storage structure.’ [Agrawal et al] Relational OLAP (ROLAP) –‘A relational back-end wherein operations of the data are translated to relational queries.’ [Agrawal et al] Hybrid OLAP (HOLAP) –Integration of MOLAP with ROLAP. Desktop OLAP (DOLAP) –Simplified versions of MOLAP or ROLAP. ZOLAP –Speak with your chemist (normally only prescribed for death march victims)

8 Beware of Data Warehouse Death March Edward Yourdon, 1997, Death March: The Complete Software Developer’s Guide to Surviving “Mission Impossible Projects” Death March projects “use a forced march imposed upon relatively innocent victims, the outcome of which is usually a high casualty rate.” Data Warehouses and Decision Support systems are among the most complex and demanding in the IT world. Failure rates are very high….

9 Relational data model based on a single structure of data values in a two dimensional table CUSTOMER ORDER Cus_idCus_name… 001Robert… 002Lyn… ……… Ord_noOrd_dateCus_id… 0102 Dec 02002… 0203 Dec 02Lyn… …………

10 Data warehousing ___Multidimensional Data Sales volume as a function of product, month, and region Product Region Dimensions: Product, Location, Time Month

11 A Sample Data Cube Total annual sales of TV in U.S.A. Date Product Country sum TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico sum

12 A Concept Hierarchy for Dimension Location all EuropeNorth_America MexicoCanadaSpainGermany Vancouver M. WindL. Chan... all region office country TorontoFrankfurtcity

13 Cuboids Corresponding to the Cube all product date country product,dateproduct,countrydate, country product, date, country 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D(base) cuboid

14 Multidimensional Data: A University Sample Data Cube Students’ marks as a function of student, department, and year Average Mark of Abraham in Year 1. Module Student Time Avg Abraham Caroline Bridget Art Business Computing Year 1 Year 2 Year 3 Avg Design

15 Data Warehousing “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management’s decision-making process.” —W. H. Inmon

16 OLAP Operations Roll up (drill-up): summarize data –by climbing up hierarchy or by dimension reduction Drill down (roll down): reverse of roll-up –from higher level summary to lower level summary or detailed data, or introducing new dimensions Slice and dice: –project and select Pivot (rotate): –reorient the cube, visualization, 3D to series of 2D planes. Other operations –drill across: involving (across) more than one fact table –drill through: through the bottom level of the cube to its back- end relational tables (using SQL)