A Decision Tree Approach to Cube Construction Patrick Kelly.

Slides:



Advertisements
Similar presentations
Materialization and Cubing Algorithms. Cube Materialization Each cell of the data cube is a view consisting of an aggregation of interest. The values.
Advertisements

OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc. All rights reserved. 8-1 BUSINESS DRIVEN TECHNOLOGY Chapter Eight: Viewing and Protecting Organizational.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Manajemen Basis Data Pertemuan 8 Matakuliah: M0264/Manajemen Basis Data Tahun: 2008.
1 Data & Database Development. 2 Data File Bit Byte Field Record File Database Entity Attribute Key field Key file management concepts include:
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
6/25/2015 Acc 522 Fall 2001 (Jagdish S. Gangolly) 1 Data Mining I Jagdish Gangolly State University of New York at Albany.
13 Chapter 13 The Data Warehouse Hachim Haddouti.
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
Chapter 13 The Data Warehouse
Business Intelligence System September 2013 BI.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
1 Basic concepts of On-Line Analytical processing DT211 /4.
XP Information Information is everywhere in an organization Employees must be able to obtain and analyze the many different levels, formats, and granularities.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Understanding Data Analytics and Data Mining Introduction.
 First two parts of class ◦ Part 1: What is business intelligence and why should organizations consider incorporating more technology-related intelligence.
CIS 429—Chapter 8 Accessing Organizational Information—Data Warehouse.
Data Warehouse & Data Mining
Dr. Russell Anderson Dr. Musa Jafar West Texas A&M University.
Datawarehouse Objectives
Using SAS® Information Map Studio
BUS1MIS Management Information Systems Semester 1, 2012 Week 6 Lecture 1.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Data Warehousing.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
BUSINESS ANALYTICS AND DATA VISUALIZATION
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Next Back MAP 3-1 Management Information Systems for the Information Age Copyright 2002 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Data.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Fox MIS Spring 2011 Data Warehouse Week 8 Introduction of Data Warehouse Multidimensional Analysis: OLAP.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Multidimensional analysis model for a document warehouse that includes textual measures KIM JEONG RAE UOS.DML
Centre of Competence on data warehouse Workshop Helsinki Database Cube and Browsing the Cube Mark Rantala.
What is OLAP?.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Business Intelligence Training Siemens Engineering Pakistan Zeeshan Shah December 07, 2009.
Advanced Database Concepts
The Data Warehouse Chapter Operational Databases = transactional database  designed to process individual transaction quickly and efficiently.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
12 1 Database Systems: Design, Implementation, & Management, 6 th Edition, Rob & Coronel 12.4 Online Analytical Processing OLAP creates an advanced data.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
SF-Tree and Its Application to OLAP Speaker: Ho Wai Shing.
Business intelligence systems. Data warehousing. An orderly and accessible repositery of known facts and related data used as a basis for making better.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Business Intelligence Overview. What is Business Intelligence? Business Intelligence is the processes, technologies, and tools that help us change data.
WEB BASED DSS Aaron Atuhe. KEY CONCEPTS When software vendors propose implementing a Web-Based Decision Support System, they are referring to a computerized.
Chapter 13 The Data Warehouse
Chapter 13 – Data Warehousing
COSC 6340 Projects & Homeworks Spring 2002
Data Mining Concept Description
Data Warehouse and OLAP
Chapter 1 Database Systems
DATA MINING.
An Introduction to Data Warehousing
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
DataMart (Data Warehouse) Tool:
Introduction of Week 9 Return assignment 5-2
Chapter 13 The Data Warehouse
Building your First Cube with SSAS
Chapter 1 Database Systems
Data Warehouse and OLAP
Presentation transcript:

A Decision Tree Approach to Cube Construction Patrick Kelly

Data Cube Attributes Designed for quick viewing and decision-making Built to optimize the users time Cube construction is labor intensive Users can only travel along pre-planned path of data analysis Dimensions are designed from pre- existing warehouse tables Reports are structured and ridges like the construction of the cube

Decision Tree Characteristics Decision Tree analysis is a excellent analysis tool used in data exploration Decision Trees are designed to explicitly address the need to identify the relationships between the data’s variables. Decision Tree looks though more relationships than the multidimensional cube. Decision Tree verifies and validates relationships as statistically sound.

Comparison of Multidimensional Cubes and Decision Trees Pg 134 of Decision trees for Business Intelligence and Data Mining Multidimensional CubeDecision Tree Shows tabular views of data as tables with relatively fixed dimensions; dimensions are determined primarily on the basis of business rules Shows tabular views of data within relevant dimensions as determined by computational algorithms and business rules Has database that is pre-built to support anticipated queries Has database that is pre-built to support numerous unanticipated queries Provides quick view retrievalHas lengthy retrieval Tends to limit number of cross-views or relevant factors Has few limitations on the relevant factors Makes it difficult, almost impossible to identify novel results Emphasizes novel results and the identification of important versus unimportant contributions

Construction of Decision Trees Using Data Cube Lixin Fu Decision Trees are not suited for large aggregated data sets Use a Sparse Statistics Tree (SST) to design the OLAP cube. Create Decision Tree using the most current data stored in the cube

Conclusion Decision Trees Decision tree analysis is a good analysis to be done in data exploration Decision Tree analysis is a possible validation test for Cube