Cube Explorer: Online Exploration of Data Cubes Jiawei Han, Jianyong Wang, Guozhu Dong, Jian Pei, Ke Wang.

Slides:



Advertisements
Similar presentations
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Advertisements

OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
1 Profit Mining: From Patterns to Action Ke Wang, Senqiang Zhou, Jiawei Han Simon Fraser University.
Our New Progress on Frequent/Sequential Pattern Mining We develop new frequent/sequential pattern mining methods Performance study on both synthetic and.
When to use Data Mining. Introduction An important question that should be answered before you commence any data mining project is whether data mining.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
Database Management: Getting Data Together Chapter 14.
Unlock Your Data Rich connectivity Robust data integration Enterprise-class manageability Deliver Relevant Information Intuitive design environment.
1 9 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 9 Database Management Approaches.
Business Intelligence. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
WebSphere -DB2 Integration Web Browser Web Server (Apache) WebSphere –JSP/Servlet/EJB DB2 JDBC, SQL HTTP.
Chapter 14 The Second Component: The Database.
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
Chapter 13 The Data Warehouse
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
DBMiner 2.0 Adnan Rahi Prabhat Vivekanandan. Brief History of DBMiner Technology Inc. Research on data mining since International reputation and.
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Understanding Analysis Services Architecture. Microsoft Data Warehousing Overview OLTP Source DTS DW Storage Analysis Services Clients OLE DB for OLAP,
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
1 Dr. Panagiotis Symeonidis Data Engineering Laboratory Data Warehouse implementation: Part B.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Data warehousing Data Mining.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
On-Line Analytic Processing Chetan Meshram Class Id:221.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Multi-Dimensional Databases and On-Line Analytical Processing Dominic Cutuli John Lundgren Clark Mitchell Leslie Theiss Ken Zenkevich.
DATA ANALYTICS Russell Ridley INFRASTRUCTURE Compliment your Existing Structure Production, Development, and Quality Environments Think Virtual Disk.
The DM Process – MS’s view (DMX). The Basics  You select an algorithm, show the algorithm some examples called training example and, from these examples,
Cube Intro. Decision Making Effective decision making Goal: Choice that moves an organization closer to an agreed-on set of goals in a timely manner Goal:
Microsoft Business Intelligence Environment Overview.
1 Multi-dimensional Sequential Pattern Mining Helen Pinto, Jiawei Han, Jian Pei, Ke Wang, Qiming Chen, Umeshwar Dayal ~From: 10th ACM Intednational Conference.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Peer to Peer | Greater Scale | More Voices | Faster AXUG Summit 2011 RPT07: Understanding PowerPivot Noah Kluge—PrecisionPoint Software.
MIS2502: Data Analytics The Information Architecture of an Organization.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
SO RELIABLE Iain Bray Sales Engineer InterSystems Corporation.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
Lexmark By Rosanna Nadal & Irina Yermolovich. Lexmark International Global manufacturer of printing products and solutions for customers in more then.
Business Intelligence. Topics Chart Online Analytical Process, OLAP – Excel’s Pivot table – Data visualization with dashboard Scenario Management Data.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
What is OLAP?.
Evaluation of DBMiner By: Shu LIN Calin ANTON. Outline  Importing and managing data source  Data mining modules Summarizer Associator Classifier Predictor.
Information Systems in Organizations
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Applications and Trends in Data Mining Pertemuan 13 Matakuliah: M0614 / Data Mining & OLAP Tahun : Feb
To SSAS or not to SSAS, that is the question Ayman Senior PFE - Microsoft.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Introduction to Business Analytics
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Mary Ledbetter, Systems Sales Engineer. What is a Data Warehouse, really? Operational systems - not designed for Analysis Complex and slow for Analytical.
Bartek Doruch, Managing Partner, Kamil Karbowiak, Managing Partner, Using Power BI in a Corporate.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Information Systems in Organizations
Reporting and Analysis With Microsoft Office
Chapter 13 The Data Warehouse
Data Mining It's not the size of your data it's what you do with it
Fundamentals of Information Systems
What is business intelligence?
Jiawei Han Department of Computer Science
©Jiawei Han and Micheline Kamber Slides contributed by Jian Pei
©Jiawei Han and Micheline Kamber
©Jiawei Han and Micheline Kamber Slides contributed by Jian Pei
©Jiawei Han and Micheline Kamber Slides contributed by Jian Pei
Online Analytical Processing Stream Data: Is It Feasible?
Presentation transcript:

Cube Explorer: Online Exploration of Data Cubes Jiawei Han, Jianyong Wang, Guozhu Dong, Jian Pei, Ke Wang

2 Mining Guided Cube Explorer Novel Algorithms and Methods:  Faster Creation of Iceberg Cube  Predictive Gradient Analysis  Multi-dimensional Gradient Mining  Association and Sequence Cube Analysis Integrated with commercial software such as Microsoft OLE DB for DM, OLAP, RDMS, and DBMiner

3 Cube Explorer Benefits For Users:  Superior performance and scalability  Saving analysis cost and time –reusable mining queries, work directly on OLAP and relational data,  Easy to use – SQL like mining, integrated with data sources  Leverage OLAP & data warehouse engines– versatile functionality and strong synergy

4 Iceberg Cube Exploration Demo  Novel H-Tree Iceberg Cube Creation  Cube Computation with Complex Measures Dataset: large retail POS transaction data

5 Iceberg Cube Exploration Results  3D Visualization (Scatter Plot)

6 Gradient Mining Issues 1: “What products sold with ‘TV’ will significantly change profits of ‘TV’ ?” Answer: -TV profit is up 10% when sold with DVD -TV profit is down 5% when sold with VCR 2: “What are changes of housing price in Big City in 2001 comparing against 2000?” Answer: -downtown apartments go up 15% while houses in suburb go down 5%

7 How to Mine Meaningful Changes? 1 Naïve and manual method  Compute two sub-cubes  Big City housing in 2000  Big City housing in 2001  Tremendous costs  Space  Time 2 Innovation Only interesting changes wanted  “gradient constraint” to capture and predict significant changes automatically

8 Gradient Mining Prediction Demo  What products sold with ‘Muffins’ will change Sales of ‘Muffins’? Select ‘Muffins’ as promotion Itemset, Sales average as Measure:

9 Gradient Mining: Results 1  Most profitable patterns (Ratio >1) Rule #1: cereal increases ‘muffins” avg. sales by 8%

10 Gradient Mining: Results 2  Least profitable patterns (Ratio <1) Rule #1: Ice Cream reduces ‘muffins” avg. sales by 4%

11 Gradient Mining: Visualization  Results plotted using 3D bar graph