© 2014 Zvi M. Kedem 1 Unit 11 Online Analytical Processing (OLAP) Basic Concepts.

Slides:



Advertisements
Similar presentations
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Advertisements

Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Data Warehousing M R BRAHMAM.
Jennifer Widom On-Line Analytical Processing (OLAP) Introduction.
Database Systems: Design, Implementation, and Management Tenth Edition
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
1 Lecture 10: More OLAP - Dimensional modeling
Lab3 CPIT 440 Data Mining and Warehouse.
Lead Black Slide. © 2001 Business & Information Systems 2/e2 Chapter 7 Information System Data Management.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
1 9 Adv. DBMS Data Warehouse CSC5301 Review Hachim Haddouti.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
DATA WAREHOUSE (Muscat, Oman).
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
On-Line Analytic Processing Chetan Meshram Class Id:221.
IMS 6217: Data Warehousing / Business Intelligence Part 3 1 Dr. Lawrence West, Management Dept., University of Central Florida Analysis.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
OnLine Analytical Processing (OLAP)
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
Lead Black Slide Powered by DeSiaMore1. 2 Chapter 7 Information System Data Management.
Data Warehouse & OLAP Kuliah 1 Introduction Slide banyak mengambil dari acuan- acuan yang dipakai.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
BI Terminologies.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
BUSINESS ANALYTICS AND DATA VISUALIZATION
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Next Back MAP 3-1 Management Information Systems for the Information Age Copyright 2002 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Data.
Winter 2006Winter 2002 Keller, Ullman, CushingJudy Cushing 19–1 Warehousing The most common form of information integration: copy sources into a single.
1 On-Line Analytic Processing Warehousing Data Cubes.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Data Warehousing Multidimensional Analysis
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
OLAP On Line Analytic Processing. OLTP On Line Transaction Processing –support for ‘real-time’ processing of orders, bookings, sales –typically access.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
ISAM 5931: Data Warehousing & Data Mining Group Project submitted by : Mudassar Hakim & Gaurav Wadhwani.
CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Data Warehousing.
The Data Warehouse Chapter Operational Databases = transactional database  designed to process individual transaction quickly and efficiently.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
1 Online Analytical Processing (OLAP) Anjali Gupta Mithun Arora Aameek Singh Kranthi Kumar.
SQL Server Analysis Services Understanding Unified Dimension Model (UDM)
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Data Warehouses and OLAP 1.  Review Questions ◦ Question 1: OLAP ◦ Question 2: Data Warehouses ◦ Question 3: Various Terms and Definitions ◦ Question.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Or How I Learned to Love the Cube…. Alexander P. Nykolaiszyn BLOG:
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CSE6011 Implementing a Warehouse  Monitoring: Sending data from sources  Integrating: Loading, cleansing,...  Processing: Query processing, indexing,...
Data Analysis Decision Support Systems Data Analysis and OLAP Data Warehousing.
11/20/ :11 AMData Mining 1 Data Mining – CSE 9033 Chapter – 1; Data Warehousing Dr. Goutam Sarker, B.E., M.E., Ph.D.(Engineering), Fellow: IE(I),
On-Line Application Processing
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
On-Line Analytic Processing
On-Line Analytical Processing (OLAP)
MIS2502: Data Analytics Dimensional Data Modeling
MIS2502: Data Analytics Dimensional Data Modeling
Introduction of Week 9 Return assignment 5-2
On-Line Application Processing
DATA CUBES E0 261 Jayant Haritsa Computer Science and Automation
Online analytical processing (OLAP) is a category of software technology that enables analysts, managers, and executives to gain insight into data through.
Presentation transcript:

© 2014 Zvi M. Kedem 1 Unit 11 Online Analytical Processing (OLAP) Basic Concepts

© 2014 Zvi M. Kedem 2 OLAP in Context

© 2014 Zvi M. Kedem 3 OLAP vs. OLTP uWe have focused until now on OLTP: Online Transaction Processing uThis dealt with storing data both logically and physically and managing transactions querying and modifying the data uWe will now focus providing support for analytical queries, essentially statistical and summary information for decision-makers, that is on OLAP: Online Analytical Processing uThis may be accomplished by preprocessing, for efficiency purposes, and producing special types of views, which are also not necessarily up to date  Not up to date may not be a problem in OLAP uData for OLAP (and more generally for data mining) is frequently stored in a Data Warehouse

© 2014 Zvi M. Kedem 4 Example uOur company has several stores and sells several products uThe stores are in different locations uThe locations, identified by (city,state) pairs are grouped into several regions uWe divide the time of sale into four quarters uThe quarters are grouped into two half-years

© 2014 Zvi M. Kedem 5 Our Company StoreStore#CityStateRegion AlphaNew YorkNYNE BetaAlbanyNYNE QuarterQuarter#Half_Year 1First 2 3Second 4 ProductProduct#WeightPrice Book4100 Glass15200

© 2014 Zvi M. Kedem 6 Our Sales SaleStore#Product#Quarter#$ AlphaBook170,000 AlphaGlass190,000 BetaBook190,000 BetaGlass180,000 AlphaBook290,000 AlphaGlass290,000 BetaBook260,000 BetaGlass250,000 AlphaBook360,000 AlphaGlass380,000 BetaBook350,000 BetaGlass390,000 AlphaBook450,000 AlphaGlass450,000 BetaBook470,000 BetaGlass470,000

© 2014 Zvi M. Kedem 7 Star Schema uWe want to support queries useful for statistical analysis by computing various sums, averages, etc. uThe structure we have is a star schema uIn the middle we have our facts table

© 2014 Zvi M. Kedem 8 Snowflake Schema: Normalized Star Schema uOne could also normalize, as table Store is not normalized, since State  Region uThen, one could get, which we will not consider further, a snowflake schema

© 2014 Zvi M. Kedem 9 Cube uWe could think of each row of fact table as occupying a voxel (volume element) in a cube uCube, in general, can have any number of dimensions; in our example there are three uThis cube can then be sliced and diced

© 2014 Zvi M. Kedem 10 Slice uSELECT Store#, Product#, SUM($) FROM Sale GROUP BY Store#, Product# uWe can do all kinds of such slices

© 2014 Zvi M. Kedem 11 Dimension Hierarchies uDimensions could have hierarchies (or more generally even lattices) uWe have two very simple hierarchies  One temporal: quarters are in half years  One geographical: cities are in states are in regions

© 2014 Zvi M. Kedem 12 Using Hierarchies uSELECT Sale.Product#, Quarter.Half_Year, SUM($) FROM Sale, Quarter WHERE Sale.Quarter# = Quarter.Quarter# GROUP BY Half_Year; uWill produce summaries by half years, not quarters

© 2014 Zvi M. Kedem 13 New Operator: CUBE uSELECT Store#, Product#, SUM($) FROM Sale GROUP BY CUBE (Store#,Product#); uWill produce all possible aggregations based on subsets of {Store#,Product#}, best explained by looking at what will come out Store#Product#$ AlphaBook270,000 AlphaGlass310,000 BetaBook270,000 BetaGlass290,000 AlphaNULL580,000 BetaNULL560,000 NULLBook540,000 NULLGlass600,000 NULL 1,140,000

© 2014 Zvi M. Kedem 14 New Operator: ROLLUP uROLLUP produces only some of the aggregate operators produced by CUBE, best explained by example uSELECT Store#, Product#, SUM($) FROM Sale GROUP BY ROLLUP (Store#,Product#); Store#Product#$ AlphaBook270,000 AlphaGlass310,000 BetaBook270,000 BetaGlass290,000 AlphaNULL580,000 BetaNULL560,000 NULL 1,140,000

© 2014 Zvi M. Kedem 15 ROLAP And MOLAP uROLAP: Relational OLAP uThat is what we have been doing: OLAP information was stored as a set of star (or more generally snowflakes) schemas uMOLAP: Multidimensional OLAP uInformation not stored as a relational database, but essentially as a cube

© 2014 Zvi M. Kedem 16 Oracle uOracle supports OLAP uFollowing is the SQL code

© 2014 Zvi M. Kedem 17 Oracle: Defining the Database create table store( sid char(20) primary key, city char(20), state char(20), region char(20) ); create table product( pid char(20) primary key, weight number, price number ); create table quarter( qid number primary key, half_year char(10) );

© 2014 Zvi M. Kedem 18 Oracle: Defining the Database create table sale( sid char(20), pid char(20), qid number, profit number, primary key(qid, pid, sid), foreign key(qid) references quarter(qid), foreign key(pid) references product(pid), foreign key(sid) references store(sid) );

© 2014 Zvi M. Kedem 19 Oracle: OLAP Query select sid, pid, sum(profit) from sale group by rollup(sid, pid); select sid, pid, sum(profit) from sale group by cube(sid, pid);

© 2014 Zvi M. Kedem 20 Key Ideas uOLAP vs. OLTP uStar schema uSnowflake schema uCube uSlicing and dicing uDimension hierarchies uROLAP uMOLAP