Remember the Sales Data Cube? Each cell contains a sales measurement, e.g., the number of sales (may contain many other measurements of product-date-country.

Slides:



Advertisements
Similar presentations
1 Multi-way Algorithm for Cube Computation CPS Notes 8.
Advertisements

Arrays of charges Contents: Basic concept: Sum of forces Linear arrays Example Whiteboards 2-D Arrays Example Whiteboards.
5.1Database System Concepts - 6 th Edition Chapter 5: Advanced SQL Advanced Aggregation Features OLAP.
Introduction to Data Warehousing CPS Notes 6.
1 Lecture 09: OLAP
Oct 28, 2003Murali Mani Relational Algebra B term 2004: lecture 10, 11.
30 September 2004Comp 120 Fall September 2004 Chapter 4 – Logic Gates Read in Chapter 4 pages , , section 4.8 through top of page.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Warehouse and Data Cube Lecture Notes for Chapter 3 Introduction to Data Mining By.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Microsoft SQL Server 2012 Analysis Services (SSAS) Reporting Services (SSRS)
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
CS346: Advanced Databases
OLAP OPERATIONS. OLAP ONLINE ANALYTICAL PROCESSING OLAP provides a user-friendly environment for Interactive data analysis. In the multidimensional model,
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals Presenter : Parminder Jeet Kaur Discussion Lead : Kailang.
2/14 Warm-ups Consider the surface area formula for a rectangular prism. 1.What part of the formula gives the area of the top and bottom faces? 2.What.
1 Cube Computation and Indexes for Data Warehouses CPS Notes 7.
1 Vicky :: Cao Hui Ping Sherman :: Chow Sze Ming CTH :: Chong Tsz Ho Ronald :: Woo Lok Yan Ken :: Yiu Man Lung Implementing Data Cubes Efficiently.
Methods for Multiplying. Standard Algorithm Partial Products Draw table with dimensions of digits in each number. Ex.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Operations with Rational Numbers. When simplifying expressions with rational numbers, you must follow the order of operations while remembering your rules.
Prof. Bayer, DWH, Ch.4, SS Chapter 4: Dimensions, Hierarchies, Operations, Modeling.
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
Verklaren van exceptionele waarden in multi-dimensionele bedrijfsdatabanken Emiel Caron, November 14, 2013.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Extended Aggregation in SQL:1999 The cube operation computes.
T.ROKAYAH BAYAN OLAP IN THE DATA WAREHOUSE. CHAPTER OBJECTIVES  Review the major features and functions of OLAP in detail  Grasp the intricacies of.
BI Terminologies.
Non Linear Arrays of charges Contents: 2-D Arrays Example Whiteboards.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Data Mining Data Warehouses.
Sales Dim Date Dim Customers Dim Products Dim Categories Dim Geography The data warehouse is a simple and standard one, after all we.
Online Analytical Processing (OLAP) An Overview Kian Win Ong, Nicola Onose Mar 3 rd 2006.
Data Warehouse [ Example ] J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2001, ISBN Data Mining: Concepts and.
Determine the sequence of genes along a chromosome based on the following recombination frequencies A-C 20% A-D 10% B-C 15% B-D 5%
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
Surface area of cuboids and prisms Grade C work. Find the area of these shapes.
SURFACE AREA & VOLUME RECTANGULAR PRISM – AREA RECTANGULAR PRISM – VOLUME.
Lecture 34 Section 6.7 Wed, Mar 28, 2007
Data Analysis Decision Support Systems Data Analysis and OLAP Data Warehousing.
Data Mining: Data Warehousing
Data Analysis and OLAP Dr. Ms. Pratibha S. Yalagi Topic Title
BlinkDB.
9 Area and Volume [I] Case Study 9.1 Areas of Polygons
Information Management course
Introduction Polynomials are often added, subtracted, and even multiplied. Doing so results in certain sums, differences, or products of polynomials.
A B D C G5b Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product PC
BlinkDB.
Divide into meridian sections – Gore development
What is OLAP OLAP allows to model data in a multidimensional way like a data cube in order to look for the data from many perspectives.
Chapter 5: Advanced SQL Database System concepts,6th Ed.
تصنيف التفاعلات الكيميائية
Data warehouse Design Using Oracle
Bridging the Analysis Gap: Multidimensional Analysis
Objective - To find the surface area of a rectangular prism.
Geometric Dimensioning and Tolerancing
Lecture 4: From Data Cubes to ML
Chapter 4: Dimensions, Hierarchies, Operations, Modeling
What is Data Warehouse? Defined in many different ways.
What is Data Warehouse? Defined in many different ways.
Parent Information Session Common Language and Methodology
The General Triangle C B A.
Lecture 6 Karnaugh maps (K-maps) K-maps with “don’t cares”
The General Triangle C B A.
The Essentials of 2-Level Design of Experiments Part I: The Essentials of Full Factorial Designs Developed by Don Edwards, John Grego and James Lynch.
Multiplication of Matrices
Chapter 3 Orthographic projection.
Parent Information Session Common Language and Methodology
3 ways to slice a 3-d figure
Chapter 3 Orthographic projection.
Presentation transcript:

Remember the Sales Data Cube? Each cell contains a sales measurement, e.g., the number of sales (may contain many other measurements of product-date-country instances) We will attempt to apply this technology to the task of finding bicliques later, after reviewing the technology. Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product PC U.S.A VCR Canada Country Mexico

Total of all product sales by country and quarter Total sales by country and date Rollup (aggregate under +) along product (e.g., using the aggregate, sum) Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product Total of all product sales by country and quarter PC U.S.A VCR Canada Country Mexico

Rollup along date (e.g., using the aggregate, sum) Total annual sales by country and product Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product PC U.S.A VCR Canada Country Mexico

Rollup along country (e.g., using the aggregate, sum) Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product PC U.S.A VCR Canada Country Mexico Total of all product sales by product and date Total of all product sales by product and date

All rollups (e.g., using the aggregate, sum) Date 1Qtr 2Qtr 3Qtr 4Qtr TV Product sales by product, country PC U.S.A sales by product, country and quarter VCR sales by country, date sales by country sales by country Canada Country Mexico sales by product sales by product, country sales by product sales by date sales by date Total sales Total sales Total sales

Date Product Country 1Qtr 2Qtr 3Qtr 4Qtr TV U.S.A VCR PC Canada Mexico Partial Rollup: climbing up a concept hierarchy (instead of eliminating Product altogether by summing over all products, rollup partially on Product, from (VCR, PC, TV) to computer (includes PC only) and non-computer (includes VCR + TV) Date 1Qtr 2Qtr 3Qtr 4Qtr Product TV U.S.A non-comp comp VCR PC Canada Country Mexico

SLICE e.g., slice off PC Date Product Country 1Qtr 2Qtr 3Qtr 4Qtr TV U.S.A VCR PC Canada Country Mexico

DICE (e.g. dice off PC, the last two quarters, the country Mexico) Date 1Qtr 2Qtr 3Qtr 4Qtr Product TV U.S.A VCR PC Canada Country Mexico

Pivot/Rotate Country Date Product Date Country Product Mexico Canada secondary Pivot/Rotate Date Product Country TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico tertiary primary Date Product Country TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico

Now let’s apply this technology to finding all bicliques. 1 2 3 A B C G5b1 bCLQ3s centered on numbers. 1AC 2AB 2AC 2BC 1 2 3 A A B 2AB 2AB C 1AC 2AC B 2BC C

1 2 3 A A B C B C 2ABC 1AC 2AC 2BC 2AB 1AC 2AC 2 3 A B C G5b1 bCLQ3s centered on numbers. 1AC 2AB 2AC 2BC 1 2 3 A 2ABC RollUp along the front-to-back dimension using the hub intersection and spoke union gives the expanded hub-and-spoke biclique, hub={2}, spokes={A,B,C} or hub={2A}, spokes={B,C} or the hub-union (of hubs {B},{C}), spoke-intersection (of spokes {2,A}). Rather than view it as an intersection-union of hubs and spokes, I think it suffices to just take the union??? A B 2AB C 1AC 2AC 1AC 2AC B 2BC C

1 2 3 A B C G5b1 bCLQ3s centered on numbers. 1AC 2AB 2AC 2BC 1 2 3 A 12AC RollUp along the left-right dimension using the hub intersection and the spoke union gives the one expanded biclique, (hub={AC}, spokes={1,2} B 2AB C 1AC 2AC A 2BC B C

1 2 3 A B C A B C 1AC 2AC 2AB 2BC 2ABC 2ABC 2 3 A B C G5b1 bCLQ3s centered on numbers. 1AC 2AB 2AC 2BC 1 2 3 A 1AC 2AC B 2AB C 2BC A B 2ABC RollUp along the top-bottom dim using hub intersection and spoke union gives the expanded hub-and-spoke biclique, (hub={2}, spokes={A,B,C} 2ABC RollUp along the top-bottom dim using hub intersection and spoke union gives the expanded hub-and-spoke biclique, (hub={2}, spokes={A,B,C} C

1 2 3 A A B C B C 12 AC 2AB 1AC 2AC 2ABC 2BC 2ABC 2 3 A B C G5b1 bCLQ3s centered on numbers. 1AC 2AB 2AC 2BC 1 2 3 A 12 AC A B 2AB C 1AC 2AC 2ABC B 2BC C 2ABC

1 2 3 A A B C B C 2AB 1AC 2AC 3AC 1BC 2BC 3BC 1AB 2AB 3AB G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 3AB 3AC 3BC 1 2 3 A A B 1AB 2AB 2AB 3AB C 1AC 2AC 3AC B 1BC 2BC 3BC C

1 2 3 A B C A B C 2AB 1AC 2AC 3AC 1BC 2BC 3BC 1AB 2AB 3AB G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 3AB 3AC 3BC 1 2 3 A B 1AB 2AB 2AB 3AB C 1AC 2AC 3AC A 1BC 2BC 3BC B C

1 2 3 A B C A B C 1ABC 2ABC 3ABC 1AC 2AC 1BC 2BC 3BC 1AB 2AB 3AB 1AC G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 3AB 3AC 3BC 1 2 3 A 1ABC 2ABC 3ABC B 1AB 2AB 3AB C 1AC 2AC 1AC 2AC 3AC A 1BC 2BC 3BC B C

AB 1 2 3 A B C A B C 1A BC 2A 3A 1AC 2AC 1BC 2BC 3BC 1AB 2AB 3AB 123 G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 2AB 3AC 3BC 1 2 3 A 1A BC 2A 3A B 1AB 2AB 3AB 123 BC AB AC UnionRollUp along front-back dim gives expanded bicliques, hub={1,A} spoke={B,C}. hub={2,A} spoke={B,C}, hub={3,A} spoke={B,C}. UnionRollUp along left-right dim hub={A,B} spokes={1,2,3}, hub={A,C} spokes={1,2,3}, hub={B,C} spokes={1,2,3}. Note: hub is always the combo of fixed values. C 1AC 2AC 1AC 2AC 3AC A 1BC 2BC 3BC B C

1 2 3 A B C A B C 1A BC 2A 3A 123 BC AB AC 1AC 2AC 123A BC 1BC 2BC 3BC G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 3AB 3AC 3BC 1 2 3 A 1A BC 2A 3A B 1AB 2AB 3AB 123 BC AB AC UnionRollUp along front-back dim gives expanded bicliques, hub={1,A} spoke={B,C}. hub={2,A} spoke={B,C}, hub={3,A} spoke={B,C}. UnionRollUp along left-right dim hub={A,B} spokes={1,2,3}, hub={A,C} spokes={1,2,3}, hub={B,C} spokes={1,2,3}. Note: hub is always the combo of fixed values. C 1AC 2AC 1AC 2AC 3AC 123A BC A 1BC 2BC 3BC B C 1C AB 2C 3C UnionRollUp along the top-bottom dim gives the expanded biclique, hub={1,C} spokes={A,B}; hub={2,C} spokes={A,B}; hub={3,C} spokes={A,B}.

DICE (e.g. dice off 3 AND C.) 1 2 3 A B C A B C 1AB 2AB G5b2 bCLQ3s centered on numbers. 1AB 1AC 1BC 2AB 2AC 2BC 3AB 3AC 3BC DICE (e.g. dice off 3 AND C.) 1 2 3 A B 1AB 2AB C A B C 1 2 A B G5b3 bCLQ3s 1AB 2AB