Ontology Learning Mining Functional Dependencies from Data Hong Yao and Howard J. Hamilton Presented By Stephen Lynn.

Slides:



Advertisements
Similar presentations
CSC411Artificial Intelligence 1 Chapter 3 Structures and Strategies For Space State Search Contents Graph Theory Strategies for Space State Search Using.
Advertisements

Machine Learning: Lecture 9
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
Association Analysis. Association Rule Mining: Definition Given a set of records each of which contain some number of items from a given collection; –Produce.
Data Mining Techniques So Far: Cluster analysis K-means Classification Decision Trees J48 (C4.5) Rule-based classification JRIP (RIPPER) Logistic Regression.
Data Mining Association Analysis: Basic Concepts and Algorithms Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Mining Association Rules. Association rules Association rules… –… can predict any attribute and combinations of attributes … are not intended to be used.
Logic in general Logics are formal languages for representing information such that conclusions can be drawn Syntax defines the sentences in the language.
Data Integration Aggregate Query Answering under Uncertain Schema Mappings Avigdor Gal, Maria Vanina Martinez, Gerardo I. Simari, VS Subrahmanian Presented.
Chapter 6 Methodology Logical Database Design for the Relational Model Transparencies © Pearson Education Limited 1995, 2005.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
1 Inference Rules and Proofs Z: Inference Rules and Proofs.
Sai Moturu. Introduction Current approaches to microarray data analysis –Analysis of experimental data followed by a posterior process where biological.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
Mining Negative Rules in Large Databases using GRD Dhananjay R Thiruvady Supervisor: Professor Geoffrey Webb.
The Implicit Mapping into Feature Space. In order to learn non-linear relations with a linear machine, we need to select a set of non- linear features.
From Association Analysis to Causal Discovery Prof Jiuyong Li University of South Australia.
Thesis Proposal Mini-Ontology GeneratOr (MOGO) Mini-Ontology Generation from Canonicalized Tables Stephen Lynn Data Extraction Research Group Department.
Basic Data Mining Techniques
Abrar Fawaz AlAbed-AlHaq Kent State University October 28, 2011
Propositions and Truth Tables
Comparing the Parallel Automatic Composition of Inductive Applications with Stacking Methods Hidenao Abe & Takahira Yamaguchi Shizuoka University, JAPAN.
1 SciCSM: Novel Contrast Set Mining over Scientific Datasets Using Bitmap Indices Gangyi Zhu, Yi Wang, Gagan Agrawal The Ohio State University.
Mohammad Ali Keyvanrad
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
1 A Theoretical Framework for Association Mining based on the Boolean Retrieval Model on the Boolean Retrieval Model Peter Bollmann-Sdorra.
Treatment Learning: Implementation and Application Ying Hu Electrical & Computer Engineering University of British Columbia.
Data Mining with Oracle using Classification and Clustering Algorithms Presented by Nhamo Mdzingwa Supervisor: John Ebden.
110/19/2015CS360 AI & Robotics AI Application Areas  Neural Networks and Genetic Algorithms  These model the structure of neurons in the brain  Humans.
Procedures for managing workflow components Workflow components: A workflow can usually be described using formal or informal flow diagramming techniques,
Mining Reference Tables for Automatic Text Segmentation Eugene Agichtein Columbia University Venkatesh Ganti Microsoft Research.
CEMiner – An Efficient Algorithm for Mining Closed Patterns from Time Interval-based Data Yi-Cheng Chen, Wen-Chih Peng and Suh-Yin Lee ICDM 2011.
1 Discovering Robust Knowledge from Databases that Change Chun-Nan HsuCraig A. Knoblock Arizona State UniversityUniversity of Southern California Journal.
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
Detecting Group Differences: Mining Contrast Sets Author: Stephen D. Bay Advisor: Dr. Hsu Graduate: Yan-Cheng Lin.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Decision Trees. What is a decision tree? Input = assignment of values for given attributes –Discrete (often Boolean) or continuous Output = predicated.
1 Appendix D: Application of Genetic Algorithm in Classification Duong Tuan Anh 5/2014.
Association Rule Mining
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Design of an Expert System for Enhancing.
Patch Based Prediction Techniques University of Houston By: Paul AMALAMAN From: UH-DMML Lab Director: Dr. Eick.
Charles Tappert Seidenberg School of CSIS, Pace University
CS 103 Discrete Structures Lecture 13 Induction and Recursion (1)
Summary „Rough sets and Data mining” Vietnam national university in Hanoi, College of technology, Feb.2006.
Data Mining and Decision Support
Reasoning with Propositional Logic automated processing of a simple knowledge base CD.
1 Boolean Satisfiability (SAT) Class Presentation By Girish Paladugu.
Association Rules Carissa Wang February 23, 2010.
Chap 6: Association Rules. Rule Rules!  Motivation ~ recent progress in data mining + warehousing have made it possible to collect HUGE amount of data.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
Methodology - Logical Database Design. 2 Step 2 Build and Validate Local Logical Data Model To build a local logical data model from a local conceptual.
Biological data representation and data mining Xin Chen
March 3, 2016Introduction to Artificial Intelligence Lecture 12: Knowledge Representation & Reasoning I 1 Back to “Serious” Topics… Knowledge Representation.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
Inference and search for the propositional satisfiability problem
Knowledge Representation and Reasoning
Association Rule Mining
NATURE SUNDAY ACADEMY BOOLEAN ALGEBRA.
Mulitvalued Dependencies
Semantics In Text: Chapter 3.
Back to “Serious” Topics…
Lecture 11 (Market Basket Analysis)
Summary of the Rules A>B -A B -(A>B) A -B --A A AvB A B
Presentation transcript:

Ontology Learning Mining Functional Dependencies from Data Hong Yao and Howard J. Hamilton Presented By Stephen Lynn

Ontology Learning Rule Mining  Algorithmic process that takes data as input and yields rules such as:  Association Rules  Implications  Functional dependencies

Ontology Learning Overview  Goals/Objectives  Implication/Functional Dependencies  Base Algorithm  4 Pruning Rules  Evaluation  Analysis

Ontology Learning Goals and Objectives Design an efficient rule discovery algorithm for mining functional dependencies from a dataset.

Ontology Learning Implication  Describes relationship between one specific combination of attribute-value pairs.  Binary Data  Propositional Logic {milk, eggs} → {bread}

Ontology Learning Functional Dependency  Describe relationship between all possible combinations of attribute-value pairs.  Disjoint attributes  True regardless of how many possible attribute values  antecedent → consequent postcode → areacode

Ontology Learning Search Space

Ontology Learning Armstrong’s Axioms

Ontology Learning Equivalent Attributes

Ontology Learning Nontrivial Closure

Ontology Learning Base Algorithm  Generate all possible antecedents then test with possible consequents (1 level at a time)

Ontology Learning Pruning Rules

Ontology Learning FD_Mine

Ontology Learning Experimental Summary  15 Datasets from UCI Machine Learning Repository (2005)

Ontology Learning Results

Ontology Learning Results

Ontology Learning Runtime

Ontology Learning Analysis  Strengths  Nicely drawn proofs  Weaknesses  Missing good example  Nice to show results with/without pruning  Future Work  Find multivalued dependencies  Find conditional dependencies  Data cleaning