This example shows the development of a classification tree using DMVis, a data visualisation tool. The tool allows visualisation of relational structures.

Slides:



Advertisements
Similar presentations
Database management system (DBMS)  a DBMS allows users and other software to store and retrieve data in a structured way  controls the organization,
Advertisements

CSE544 Database Statistics Tuesday, February 15 th, 2011 Dan Suciu , Winter
Frequent Closed Pattern Search By Row and Feature Enumeration
AB 11 22 33 44 55 66 77 88 99 10  20  19  18  17  16  15  14  13  12  11  21  22  23  24  25  26  27  28.
Copyright © Cengage Learning. All rights reserved. CHAPTER 5 SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION.
Factors and Prime Factorization. Definitions Factors ~ Whole numbers that are multiplied to find a product Factors ~ Whole numbers that are multiplied.
Presented by Russell Myers Paper by Ming-Chuan Wu and Alejandro P. Buchmann.
Knowledge Representation. 2 Outline: Output - Knowledge representation  Decision tables  Decision trees  Decision rules  Rules involving relations.
Balanced Graph Partitioning Konstantin Andreev Harald Räcke.
Knowledge Representation. 2 Outline: Output - Knowledge representation  Decision tables  Decision trees  Decision rules  Rules involving relations.
16.5 Introduction to Cost- based plan selection Amith KC Student Id: 109.
Washington, 08/27/03 Washington, 08/27/03 Martin Pfeifle, Database Group, University of Munich Representatives for Visually Analyzing Cluster Hierarchies.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Some Introductory Programming 1. Structured Query Language - used for queries. - a standard database product. 2. Visual Basic for Applications - use of.
TCP/IP Protocol Suite 1 Chapter 17 Upon completion you will be able to: Domain Name System: DNS Understand how the DNS is organized Know the domains in.
1 times table 2 times table 3 times table 4 times table 5 times table
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Chapter 25 Domain Name System
HOW TO OPTIMIZE A HIERARCHY IN SQL SERVER Louis Davidson (drsql.org)
Strategy Development Tools. SCENARIOS GUIDING PRINCIPLES / POLICIES (Core Values of the Company) STRATEGIC GOALS (How we wish to relate to the environment)
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Automatic methods for functional annotation of sequences Petri Törönen.
Lecture 5 – Gantt Chart GANTT Charts Constructing GANTT Charts
Slides for “Data Mining” by I. H. Witten and E. Frank.
Welcome to BIM Jeopardy. GeneralAccessWordExcelPowerPoint
 WFS for fun and profit Paul Handcock Julian Parker.
Data Mining – A First View Roiger & Geatz. Definition Data mining is the process of employing one or more computer learning techniques to automatically.
Physical DB Issues, Indexes, Query Optimisation Database Systems Lecture 13 Natasha Alechina.
Bug Localization with Machine Learning Techniques Wujie Zheng
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Index and Hash 2004, Spring Pusan National University Ki-Joune Li.
too.
Database Management Systems.  Database management system (DBMS)  Store large collections of data  Organize the data  Becomes a data storage system.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Opener: Check each number below for divisibility by 2, 3, 5, 6, 9 and 10. Opener: Check each number below for divisibility by 2, 3, 5, 6, 9 and )
1 Trees 4: AVL Trees Section 4.4. Motivation When building a binary search tree, what type of trees would we like? Example: 3, 5, 8, 20, 18, 13, 22 2.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Using Microsoft Access Now that data has been entered we are ready to sort or data To do this, first click on the field label which you want to sort If.
Lesson Objective: To compare traditional hard engineering to more modern holistic approaches.
CS411 Database Systems Kazuhiro Minami 11: Query Execution.
IB Computer Science – Logic
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Introduction We have seen how to Integrate in C1 In C2 we start to use Integration, to work out areas below curves It is increasingly important in this.
Denary (our numbers) Binary
EOC Review: Explicit & Recursive.
$100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300.
1Computer Sciences Department. 2 Advanced Design and Analysis Techniques TUTORIAL 7.
Factors and Prime Factorization
Tables Learning Support
a/b by n Division of Fractions Dividing a fraction by a whole number. To divide a fraction by a whole number simply multiply it by the reciprocal of the.
BY: Mark Gruszecki.  What is a Recursive Query?  Definition(s) and Algorithm(s)  Optimization Techniques  Practical Issues  Impact of each Optimization.
Geog. 314 Working with tables.
Unsupervised Learning: Clustering
Unsupervised Learning: Clustering
Introduction Applications Balance Factor Rotations Deletion Example
Using and Managing Data: A Case Study in Three Scenes
Times Tables.
Advantages and disadvantages of types of graphs
Chapter 8 Tutorial.
Database.
The Penal Code: A Schematic Design & Discussion
The BIRCH Algorithm Davitkov Miroslav, 2011/3116
Access Review.
Suggestion Schemes: Job Evaluation
Text Categorization Berlin Chen 2003 Reference:
Domain Name System: DNS
Operator Precedence and Associativity
PHYLOGENETIC TREES.
3 times tables.
6 times tables.
BETONLINEBETONLINE A·+A·+
Presentation transcript:

This example shows the development of a classification tree using DMVis, a data visualisation tool. The tool allows visualisation of relational structures. The example is that of a double glazing estimate database Part of the data model is shown below

Visualisation of the whole database of estimates It shows 5 (or 6?) clear groups Cluster A Cluster B Cluster C Cluster D Cluster E Data Base

Tool selects Cluster A to visualise separately AB C D E Data Base

Visualisation of cluster A shows that it has 4 clear sub clusters A B C D E Data Base A1A2 A3 A4

Classification by visualisation The tree produced by use of the visualisation tool showed advantages over other clustering techniques: The resulting classification had a natural hierarchy. The hierarchy had an easily understood basis for human understanding. Problems as to degree of branching ( at the first level, 4 at the second for cluster A ) are easily resolved visually. Long thin trees, such as are produced by a with binary division system, or short fat trees, with too many classes are un-natural, and hard to understand. They frequently need re- structuring. The visualisation method can be applied to relational and recursive database structures, and doesn’t require a single table or query to be identified.