Evaluation of DBMiner By: Shu LIN Calin ANTON. Outline  Importing and managing data source  Data mining modules Summarizer Associator Classifier Predictor.

Slides:



Advertisements
Similar presentations
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Advertisements

The database approach to data management provides significant advantages over the traditional file-based approach Define general data management concepts.
Chapter 13 Business Intelligence and Data Warehouses
When to use Data Mining. Introduction An important question that should be answered before you commence any data mining project is whether data mining.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Classifier Decision Tree A decision tree classifies data by predicting the label for each record. The first element of the tree is the root node, representing.
Fundamentals of Information Systems, Second Edition 1 Organizing Data and Information Chapter 3.
About ISoft … What is Decision Tree? Alice Process … Conclusions Outline.
Chapter 5 Database Concepts.
1 Data & Database Development. 2 Data File Bit Byte Field Record File Database Entity Attribute Key field Key file management concepts include:
6/25/2015 Acc 522 Fall 2001 (Jagdish S. Gangolly) 1 Data Mining I Jagdish Gangolly State University of New York at Albany.
Mgt 20600: IT Management & Applications Databases
Evaluation of MineSet 3.0 By Rajesh Rathinasabapathi S Peer Mohamed Raja Guided By Dr. Li Yang.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
1 Chapter 2 Reviewing Tables and Queries. 2 Chapter Objectives Identify the steps required to develop an Access application Specify the characteristics.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
DBMiner 2.0 Adnan Rahi Prabhat Vivekanandan. Brief History of DBMiner Technology Inc. Research on data mining since International reputation and.
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Chapter 5 Data mining : A Closer Look.
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Chapter 6: The Traditional Approach to Requirements
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Data Mining Techniques
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Fundamentals of Information Systems, Fifth Edition
Data-mining & Data As we used Excel that has capability to analyze data to find important information, the data-mining helps us to extract information.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
Chapter 11 Business Intelligence Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall 11-1.
Structured Analysis.
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Reporting and Analysis With Microsoft Office. Reporting and Analysis Business User Reporting & Analysis OLAP Data Warehouse.
Building Dashboards SharePoint and Business Intelligence.
Database Management Supplement 1. 2 I. The Hierarchy of Data Database File (Entity, Table) Record (info for a specific entity, Row) Field (Attribute,
Analytics & Reporting Tool.  Outline how to access SAS OLAP Cubes through SAS AMO  Review SAS OLAP Cube creation and how it relates to integration with.
Centre of Competence on data warehouse Workshop Helsinki Database Cube and Browsing the Cube Mark Rantala.
Advanced Database Concepts
DBMiner Introduction Advisor:Dr. Hsu Graduate:ZenJohn Huang Min-Hong Lin IDSL seminar 2001/11/13.
Fundamentals of Information Systems, Sixth Edition Chapter 3 Database Systems, Data Centers, and Business Intelligence.
Hierarchical Modeling.  Explain the 3 different types of model for which computer graphics is used for.  Differentiate the 2 different types of entity.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Managing Data Resources File Organization and databases for business information systems.
Data Mining Functionalities
Reporting and Analysis With Microsoft Office
Chapter 13 Business Intelligence and Data Warehouses
Databases and Information Management
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
©Jiawei Han and Micheline Kamber
Chapter 13 – Data Warehousing
©Jiawei Han and Micheline Kamber
MANAGING DATA RESOURCES
Data Mining Concept Description
Data Warehouse and OLAP
University of Houston-Clear Lake Kaiser Permanente San Jose
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
Data Mining: Characterization
PolyAnalyst Web Report Training
Data Warehouse and OLAP
Presentation transcript:

Evaluation of DBMiner By: Shu LIN Calin ANTON

Outline  Importing and managing data source  Data mining modules Summarizer Associator Classifier Predictor  Conclusions

View of Warehouses and Hierarchies  Tables  Datamarts Columns Dimensions Measurements Cubes

Creating Data Warehouse  Importing relational data: only the default database that DBMiner provides can be used.  Easy to import a datamart and difficult to create one. Datamart can be associated with only one table/view. Creation of datamart is not easy enough for usual users and not flexible enough for professional users

Creating Data Warehouse (continued)  Concept hierarchy is automatically generated when a dimension is created, but the user can scarcely customize it.  Only numerical attributes are allowed as measures  We think it would be better if DBMiner had a wizard to help user build a data cube.

 The visualization of data cube is very instructive.  User can perform some OLAP operations but slicing.  Data cube browser also presents the data dispersion. Browsing Data Cube

Browsing Data Cube (continued)

Data Mining  Mining functions: summarization, association, classification, prediction.  OLAP functions are integrated with mining functions.  The mining process is performed on line.  A wizard guides user through mining process.

The Summarizer  Generalize/summarize data at high abstraction levels  The output is presented in six different forms: crosstab, 3D bar chart, 3D area chart, 3D cluster bar, 2D bar chart and 2D line char.  The output can be presented on only two dimensions at a time, and not all the combinations of two dimensions are possible.

The Summarizer (continued)

The Associator  Find association among a set of attributes and their values.  We find a bug in this module.  User can change the settings or constraints during association mining to make the association rules more accurate.  Association rule is visualized as table, bar chart, and ball chart.

The Associator (continued)

The Classifier  Based on the features present in the class_labeled training data, develop a description or model for each class.  The output is presented as a complete classification tree (decision tree) good in the sense that user can get a clear impression of classification process. Redundant if user only cares about the classification rules.

The Classifier (continued)

The Predictor  Predict data value distributions based on the available data.  The output is presented as a set of curves if the predictive attribute has continuous numeric values; otherwise, a set of pie charts is used.  The output is present only on one dimension. It cannot show how the combination of two predictive attributes affect the predicted attribute.

The Predictor (continued)

Conclusions It integrates OLAP functions with mining functions. It works on line, i.e., it is fast. It generates multiple forms of output: graphics, tables, and different kinds of charts; It has a user-friendly interface, and for each mining function it has a wizard, which guides the user through the mining process.