This kind of tables known from Rough Sets, Decision Trees, etc Data Mining

Slides:



Advertisements
Similar presentations
The Software Infrastructure for Electronic Commerce Databases and Data Mining Lecture 4: An Introduction To Data Mining (II) Johannes Gehrke
Advertisements

Some Slides from: U.C. Berkeley, U.C. Berkeley, Alan Mishchenko, Alan Mishchenko, Mike Miller, Mike Miller, Gaetano Borriello Gaetano Borriello Introduction.
VIII - Working with Sequential Logic © Copyright 2004, Gaetano Borriello and Randy H. Katz 1 Finite state machine optimization State minimization  fewer.
Ensemble Methods An ensemble method constructs a set of base classifiers from the training data Ensemble or Classifier Combination Predict class label.
Data Mining Techniques: Clustering
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Soft computing Lecture 6 Introduction to neural networks.
Efficient Decomposition of Large Fuzzy Functions and Relations Marek Perkowski + Portland State University, Dept. Electrical Engineering, Portland, Oregon.
Graph Coloring Algorithms for Fast Evaluation of Curtis Decomposition Marek Perkowski, Rahul Malvi, Stan Grygiel, Mike Burns, and Alan Mishchenko Portland.
ECE Synthesis & Verification - Lecture 19 1 ECE 667 Spring 2009 ECE 667 Spring 2009 Synthesis and Verification of Digital Systems Functional Decomposition.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Grover’s Algorithm in Machine Learning and Optimization Applications
Efficient Decomposition of Large Fuzzy Functions and Relations.
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
Midterm Review Rao Vemuri 16 Oct Posing a Machine Learning Problem Experience Table – Each row is an instance – Each column is an attribute/feature.
Combinatorial Algorithms Unate Covering Binate Covering Graph Coloring Maximum Clique.
Computational Intelligence: Methods and Applications Lecture 19 Pruning of decision trees Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Inferring Decision Trees Using the Minimum Description Length Principle J. R. Quinlan and R. L. Rivest Information and Computation 80, , 1989.
Graph Coloring. Introduction When a map is colored, two regions with a common border are customarily assigned different colors. We want to use a small.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Combinational Problems: Unate Covering, Binate Covering, Graph Coloring and Maximum Cliques Example of application: Decomposition.
Fast Kernel-Density-Based Classification and Clustering Using P-Trees Anne Denton Major Advisor: William Perrizo.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Sum-of-Products (SOP)
Unate Covering, Binate Covering, Graph Coloring Maximum Cliques part B.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
By N.Gopinath AP/CSE.  A decision tree is a flowchart-like tree structure, where each internal node (nonleaf node) denotes a test on an attribute, each.
Reducing Structural Bias in Technology Mapping
Antara Ghosh Jignashu Parikh
Chapter 11 Analysis of Variance
A very naïve introduction to few methods of Machine Learning
Shortest Path Problem Under Triangular Fuzzy Neutrosophic Information
Design and Documentation
What Is Cluster Analysis?
Finite state machine optimization
Finite state machine optimization
Artificial Neural Networks
A New Logic Synthesis, ExorBDS
EEE2135 Digital Logic Design Chapter 1. Introduction
Software Engineering (CSI 321)
Rule Induction for Classification Using
Fast Kernel-Density-Based Classification and Clustering Using P-Trees
PROBABILISTIC AND LOGIC APPROACHES TO MACHINE LEARNING AND DATA MINING
QUIZZ Select any one from the following six problems.
Neural Networks Dr. Peter Phillips.
Welcome Final Year Project Oral Presentation
UNIT-4 BLACKBOX AND WHITEBOX TESTING
Portland State University
Dr. Clincy Professor of CS
Example of application: Decomposition
A Boolean Paradigm in Multi-Valued Logic Synthesis
Data Mining Practical Machine Learning Tools and Techniques
Objective of This Course
PROBLEM 1 Training Examples: Class 1 Training Examples: Class 2
Revision (Part II) Ke Chen
Revision (Part II) Ke Chen
Dr. Clincy Professor of CS
Lecture 3 VLSI Physical Design VLSI design flow VLSI physical design
Hierarchical Modeling & Constructive Solid Geometry
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Combinational Problems: Unate Covering, Binate Covering, Graph Coloring and Maximum Cliques Unit 6 part B.
An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Avoid Overfitting in Classification
Roc curves By Vittoria Cozza, matr
ECE 352 Digital System Fundamentals
CSE 370 – Winter Sequential Logic-2 - 1
UNIT-4 BLACKBOX AND WHITEBOX TESTING
Presentation transcript:

Generalization of the Ashenhurst-Curtis decomposition model to MV functions

This kind of tables known from Rough Sets, Decision Trees, etc Data Mining

At every step many decompositions exist Decomposition is hierarchical At every step many decompositions exist

Decomposition Algorithm Find a set of partitions (Ai, Bi) of input variables (X) into free variables (A) and bound variables (B) For each partitioning, find decomposition F(X) = Hi(Gi(Bi), Ai) such that column multiplicity is minimal, and calculate DFC Repeat the process for all partitioning until the decomposition with minimum DFC is found.

Algorithm Requirements Since the process is iterative, it is of high importance that minimization of the column multiplicity index is done as fast as possible. At the same time, for a given partitioning, it is important that the value of the column multiplicity is as close to the absolute minimum value

Generalizations of the Ashenhurst-Curtis decomposition model to MV relations

Compatibility of columns for Relations is not transitive ! C0, C1 and C2 are pairwise compatible but not compatible as a group This is an important difference between decomposing functions and relations

Four Extensions of Ashenhurst Decomposition

Fist Extension: Dealing with noise in data

Compatibility graph construction for data with noise Numbers treated as symbols Incompatible in 3 of 4 cases compatible Kmap Compatibility Graph for Threshold 0.75 Compatibility Graph for Threshold 0.25 Figure 12: Create a graph with edges labeled by percent of incompatibility Remove nodes with values above the selected threshold Create a graph by disregarding labels. Use this graph for coloring, follow the standard procedure The method can be repeated for various values of thresholds All edges with values above 0.25 are removed

Second Extension: Dealing with numerical (metric) versus symbolic (nominal) data

Compatibility graph for metric data Numbers treated as metric (numbers) Compatibility Graph for nominal (symbolic) data Compatibility Graph for metric data Figure 13: Kmap If the difference is only 1 then nodes are treated as compatible The method can be repeated for various values of thresholds of differences

Third Extension: Dealing with data in contingency tables

MV relations can be created from contingency tables THRESHOLD 50 THRESHOLD 70 Number of cases for the given combination of inputs The method can be repeated for various values of thresholds

Fourth Extension: Removing some assumption of AC method: Column multiplicity

Example of decomposing a “Curtis non-decomposable” function. Function f in Kmap has a multiplicity of 3 Function f in Kmap cannot be decomposed according to Curtis decomposition It can be encoded by two function values One of them is AND , other is OR, they are both “cheap functions”. (see c) Now we can create a new table (see d) Next we can create circuit from e Next we create table from f Next we create circuit a

APPLICATIONS FPGA SYNTHESIS VLSI LAYOUT SYNTHESIS DATA MINING AND KNOWLEDGE DISCOVERY MEDICAL DATABASES EPIDEMIOLOGY ROBOTICS FUZZY LOGIC DECOMPOSITION CONTINUOUS FUNCTION DECOMPOSITION

Importance of Intepretability in decomposition methods You do not only want to have high accuracy. You want to be able to understand and interpret the results.

Example of an application Knowledge discovery in data with no error

Michalski’s Trains Figure 5: Trains problem

Michalski’s Trains Multiple-valued functions. There are 10 trains, five going East, five going West, and the problem is to nd the simplest rule which, for a given train, would determine whether it is East or Westbound. The best rules discovered at that time were: 1. If a train has a short closed car, then it Eastbound and otherwise Westbound. 2. If a train has two cars, or has a car with a jagged roof then it is Westbound and otherwise Eastbound. Espresso format. MVGUD format.

Michalski’s Trains

Michalski’s Trains

Michalski’s Trains Attribute 33: Class attribute (east or west) direction (east = 0, west = 1) The number of cars vary between 3 and 5. Therefore, attributes referring to properties of cars that do not exist (such as the 5 attributes for the “5th" car when the train has fewer than 5 cars) are assigned a value of “-". Applied to the trains problem our program discovered the following rules: 1. If a train has triangle next to triangle or rectangle next to triangle on adjacent cars then it is Eastbound and otherwise Westbound. 2. If the shape of car 1 (s1) is jagged top or open rectangle or u-shaped then it is Westbound and otherwise Eastbound.

Another examples of meaning of intermediate variables Figure 5: Trains problem

MV benchmarks: zoo Figure 9:

MV benchmarks: shuttle Figure 6:

MV benchmarks: lenses Figure 7:

MV benchmarks: breastc Figure 8:

Evaluation of variants Decomposition versus DNF versus Decision Tree

Evaluation of results for learning 1. Learning Error 2. Occam Razor , complexity

A machine learning approach versus several logic synthesis approaches Decomposition has small error. Decomposition needs less samples to learn

Finding the error, DFC, and time of the decomposer on the benchmark kdd5.

The average error over 54 benchmark functions.

Example of a application Gait control of a robot puppet for Oregon Cyber Theatre

Model with a gripper

Model with an internet camera

camera camera stamp Universal Logic Machine radio DecStation radio Spider I control camera Universal Logic Machine radio camera DEC PERLE DecStation stamp radio radio Turbochannel

The following formula describes the exact motion of the shaft of every servo. Theta, the angle of the servo’s shaft, is a function of time. Theta naught is a base value corresponding to the servo’s middle position. Theta naught will be the same for all the servos. ‘A’ is called the amplitude of the oscillation. It relates to how many degrees the shaft is able to rotate through. Omega relates to how fast the servo’s shaft rotates back and forth. Currently, for all servos, there are only four possible value that omega may take Phi is the relative phase angle.

And a familiar table again

Medical Applications Ovulation prediction (Perkowski, Grygiel, Pierzchala, students) Melanoma cancer (Anika, Lizzy Zhao) Breast cancer (Clemen) Drought in California (Robin) Endangered species in Oregon (Nashita) HIV (Leo) Forest Fire in Oregon (Leo) Poisonous spiders in Oregon Poisonous Mushrooms Robin on heart bits

Conclusion Stimulated by practical hard problems: Machine Learning, Data Mining. Robotics (hexapod gaits, face recognition), Field Programmable Gate Arrays (FPGA), Application Specific Integrated Circuits (ASIC) High performance custom design (Intel) Very Large Scale of Integration (VLSI) layout-driven synthesis for custom processors,

Conclusion Developed 1989-present Intel, Washington County epidemiology office, Northwest Family Planning Services, Lattice Logic Corporation, Cypress Semiconductor, AbTech Corp., Air Force Office of Scientific Research, Wright Laboratories. A set of tools for decomposition of binary and multi-valued functions and relations. Extended to fuzzy logic, reconstructability analysis and real-valued functions.

Conclusion Our recent software allows also for bi-decomposition, removal of vacuous variables and other preprocessing/postprocessing operations. Variants of our software are used in several commercial companies. The applications of the method are unlimited and it can be used whenever decision trees or artificial neural nets are used now. The quality of learning was better than in the top decision tree creating program C4.5 and various neural nets. The only problem that remains is speed in some applications.

Conclusion On our WWW page, http:// www.ee.pdx.edu/~cfiles/papers.html the reader can find many benchmarks from various disciplines that can be used for comparison of machine learning and logic synthesis programs. We plan to continue work on decomposition and its various practical applications such as epidemiology or robotics which generate large real-life benchmarks. We work on FPGA-based reconfigurable hardware accelerator for decomposition to be used on a mobile robot.

Questions and Problems What is Ashenhurst Decomposition? What is Curtis Decomposition? Give example of decomposition of MV data. What is a difference of decomposing functions and relations? Discuss one example of dealing with noise in data while using decomposition. Discuss decomposition of numeric (metric) data. Discuss how to use decomposition to deal with contingency data.

Questions and Problems Discuss one generalization to Ashenhurst/Curtis Decomposition? Explain the problem of intepretability of results. Find a new solution to Michalski’s Trains Problem. Discuss how to compare DNF, Decision Tree, Decomposition and other ML methods. Discuss how to use decomposition to learn gaits for a hexapod robot. Find a new application of A/C decomposition.