FLAIRS 20011 Graph-Based Concept Learning Jesus Gonzalez, Lawrence Holder and Diane Cook Department of Computer Science and Engineering The University.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Explanation-Based Learning (borrowed from mooney et al)
TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
1 CS 391L: Machine Learning: Rule Learning Raymond J. Mooney University of Texas at Austin.
Is the shape below a function? Explain. Find the domain and range.
Hierarchical Floorplanning of Chip Multiprocessors using Subgraph Discovery Javier de San Pedro Jordi Cortadella Antoni Roca Universitat Politècnica de.
1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan.
Inductive Logic Programming: The Problem Specification Given: –Examples: first-order atoms or definite clauses, each labeled positive or negative. –Background.
Nadia Andreani Dwiyono DESIGN AND MAKE OF DATA MINING MARKET BASKET ANALYSIS APLICATION AT DE JOGLO RESTAURANT.
Organizing research publications in Web 3 enviroment Anastasiou Lucas Vasilis Tzouvaras
GRAPH-BASED HIERARCHICAL CONCEPTUAL CLUSTERING by Istvan Jonyer, Lawrence B. Holder and Diane J. Cook The University of Texas at Arlington.
Automated Layout and Phase Assignment for Dark Field PSM Andrew B. Kahng, Huijuan Wang, Alex Zelikovsky UCLA Computer Science Department
FLAIRS '991 Applying the SUBDUE Substructure Discovery System to the Chemical Toxicity Domain Ravindra N. Chittimoori, Diane J. Cook, Lawrence B. Holder.
CS590D: Data Mining Prof. Chris Clifton April 21, 2005 Multi-Relational Data Mining.
Structural Web Search Using a Graph-Based Discovery System Nitish Manocha, Diane J. Cook, and Lawrence B. Holder University of Texas at Arlington
1Ellen L. Walker Segmentation Separating “content” from background Separating image into parts corresponding to “real” objects Complete segmentation Each.
CSCI 5582 Fall 2006 CSCI 5582 Artificial Intelligence Lecture 21 Jim Martin.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Relational Learning of Pattern-Match Rules for Information Extraction Mary Elaine Califf Raymond J. Mooney.
Discovering Substructures in Chemical Toxicity Domain Masters Project Defense by Ravindra Nath Chittimoori Committee: DR. Lawrence B. Holder, DR. Diane.
Graph-Based Concept Learning Jesus A. Gonzalez, Lawrence B. Holder, and Diane J. Cook Department of Computer Science and Engineering University of Texas.
Structural Knowledge Discovery Used to Analyze Earthquake Activity Jesus A. Gonzalez Lawrence B. Holder Diane J. Cook.
Graph-Based Data Mining Diane J. Cook University of Texas at Arlington
Subdue Graph Visualizer by Gayathri Sampath, M.S. (CSE) University of Texas at Arlington.
Learning to Match Ontologies on the Semantic Web AnHai Doan Jayant Madhavan Robin Dhamankar Pedro Domingos Alon Halevy.
GUI implementation for Supervised and Unsupervised SUBDUE System.
Graph-based Learning and Discovery Diane J. Cook University of Texas at Arlington
Workshop1 Efficient Mining of Graph-Based Data Jesus Gonzalez, Istvan Jonyer, Larry Holder and Diane Cook University of Texas at Arlington Department.
CSCI 5582 Fall 2006 CSCI 5582 Artificial Intelligence Lecture 22 Jim Martin.
Part I: Classification and Bayesian Learning
Graph-Based Data Mining and Applications István Jónyer Department of Computer Science Oklahoma State University.
Some Thoughts to Consider 6 What is the difference between Artificial Intelligence and Computer Science? What is the difference between Artificial Intelligence.
Inductive Logic Programming Includes slides by Luis Tari CS7741L16ILP.
V. Vranić — AspectJ Paradigm Model — GCSE 2001 AspectJ Paradigm Model A Basis for Multi-Paradigm Design for AspectJ Valentino Vranić
1 SUBSTRUCTURE DISCOVERY IN REAL WORLD SPATIO-TEMPORAL DOMAINS Jesus A. Gonzalez Supervisor:Dr. Lawrence B. Holder Committee:Dr. Diane J. Cook Dr. Lynn.
Introduction to ILP ILP = Inductive Logic Programming = machine learning  logic programming = learning with logic Introduced by Muggleton in 1992.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, February 7, 2001.
Using Hyperlink structure information for web search.
Turing Clusters into Patterns: Rectangle-based Discriminative Data Description Byron J. Gao and Martin Ester IEEE ICDM 2006 Adviser: Koh Jia-Ling Speaker:
CS Learning Rules1 Learning Sets of Rules. CS Learning Rules2 Learning Rules If (Color = Red) and (Shape = round) then Class is A If (Color.
Dimitrios Skoutas Alkis Simitsis
November 10, Machine Learning: Lecture 9 Rule Learning / Inductive Logic Programming.
Predicate Calculus Syntax Countable set of predicate symbols, each with specified arity  0. For example: clinical data with multiple tables of patient.
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
Overview Concept Learning Representation Inductive Learning Hypothesis
Methods in Computational Linguistics II Queens College Lecture 8: Dynamic Programming.
University at BuffaloThe State University of New York Lei Shi Department of Computer Science and Engineering State University of New York at Buffalo Frequent.
Clustering I. 2 The Task Input: Collection of instances –No special class label attribute! Output: Clusters (Groups) of instances where members of a cluster.
Learning, page 1 CSI 4106, Winter 2005 Symbolic learning Points Definitions Representation in logic What is an arch? Version spaces Candidate elimination.
For Monday Finish chapter 19 No homework. Program 4 Any questions?
ECML 2001 A Framework for Learning Rules from Multi-Instance Data Yann Chevaleyre and Jean-Daniel Zucker University of Paris VI – LIP6 - CNRS.
For Monday Finish chapter 19 Take-home exam due. Program 4 Any questions?
CS 5751 Machine Learning Chapter 10 Learning Sets of Rules1 Learning Sets of Rules Sequential covering algorithms FOIL Induction as the inverse of deduction.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Chap. 10 Learning Sets of Rules 박성배 서울대학교 컴퓨터공학과.
1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.
Representation and Search The function of a representation is to capture the critical features of the problem domain –and make the information accessible.
How many …?. What shape can you see? I can see some _____. Q1 Q1 stars.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Gspan: Graph-based Substructure Pattern Mining
Improving Parallelism in Structural Data Mining Min Cai, Istvan Jonyer, Marcin Paprzycki Computer Science Department, Oklahoma State University, Stillwater,
Rule Induction for Classification Using
What shape am I? I am a plane shape. I have 4 sides.
Fractions 1/2 1/8 1/3 6/8 3/4.
Rule Learning Hankui Zhuo April 28, 2018.
Shapes.
Class Project Guidelines
Automated Layout and Phase Assignment for Dark Field PSM
Can you work out the next shape in the pattern?
2D Shapes Rectangle Circle Triangle Rectangle. What shape is the door? Rectangle.
Presentation transcript:

FLAIRS Graph-Based Concept Learning Jesus Gonzalez, Lawrence Holder and Diane Cook Department of Computer Science and Engineering The University of Texas at Arlington

FLAIRS Outline Relational concept learning Graph-based concept learning Conceptual graphs and Galois lattice Graph-based discovery in Subdue SubdueCL Empirical results Conclusions

FLAIRS Relational Concept Learning Inductive Logic Programming (ILP) FOIL Progol First-order logic vs. graphs Expressiveness Interpretability Conceptual graphs

FLAIRS Conceptual Graphs Logic-based knowledge representation Object On Shape Triangle SquareObject Shape shape(X,triangle) shape(Y,square) on(X,Y)

FLAIRS Conceptual Graphs Graph Logic PAC-learning CGs [Jappy & Nock] Size of CG class and generalization (projection) operator polynomial in Number of relations Number of concepts Number of labels

FLAIRS Galois Lattice Each node consists of a description graph and set of subsumed examples Begins with positive examples Generalization operator Most specific generalization Union of example sets

FLAIRS Galois Lattice [ 1][ 2][ 3][ 4] [ 1, 2 ][ 3, 4 ] [ 1, 2, 3 ][ 1, 2, 4 ][ 1, 3, 4 ][ 2, 3, 4 ] [ 1, 2, 3, 4 ] [ 1][ 2][ 3][ 4] [ 1, 2 ][ 3, 4 ] [ 1, 2, 3 ][ 1, 2, 4 ][ 1, 3, 4 ][ 2, 3, 4 ] [ 1, 2, 3, 4 ] [ 1][ 2][ 3][ 4] [ 1, 2 ][ 3, 4 ] [ 1, 2, 3 ][ 1, 2, 4 ][ 1, 3, 4 ][ 2, 3, 4 ] [ 1, 2, 3, 4 ] [ 1][ 2][ 3][ 4] [ 1, 2 ][ 3, 4 ] [ 1, 2, 3 ][ 1, 2, 4 ][ 1, 3, 4 ][ 2, 3, 4 ] [ 1, 2, 3, 4 ] [ 1, 2 ][ 3, 4 ] [ 1, 2, 3 ] [ 1, 2, 4 ] [ 1, 3, 4 ] [ 2, 3, 4 ] [ 1, 2, 3, 4 ] triangle on square on circle on rectangle triangle on triangle on rectangle triangle on triangle on [ 1, 2 ] triangle on square on circle on rectangle triangle on triangle on rectangle triangle on triangle on [ 1, 2 ] triangle on square on triangle on square on circle on rectangle circle on rectangle triangle on triangle on triangle on rectangle triangle on rectangle triangle on triangle on

FLAIRS Galois Lattice Galois lattice creation O(n 3 p) n examples p nodes in lattice Tractable for poly-time generalization GRAAL system

FLAIRS Graph-Based Discovery Finding “interesting” and repetitive substructures (connected subgraphs) in data represented as a graph object triangle R1 C1 T1 S1 T2 S2 T3 S3 T4 S4 Input DatabaseSubstructure S1 (graph form) Compressed Database R1 C1 object square on shape

FLAIRS Graph-Based Discovery “Interesting” defined according to the Minimum Description Length principle min [DL(S) + DL(G|S)] General-to-specific beam search through substructure space Poly-time inexact graph match Subdue system S

FLAIRS Subdue System Graph-based… Discovery Concept learning Hierarchical conceptual clustering Background knowledge Parallel/distributed capability

FLAIRS Graph-Based Concept Learning object on triangle square shape

FLAIRS Graph-Based Concept Learning Extension to graph-based discovery Input now a set of positive graphs and a set of negative graphs Set-covering approach Iterate until all positive graphs and no negative graphs covered Result is a substructure DNF

FLAIRS Graph-Based Concept Learning Solution 1 Find substructure compressing positive graphs, but not negative graphs Compress graphs and iterate until no further compression Problem Compressing, instead of removing, partially-covered positive graphs leads to overly-specific hypotheses

FLAIRS Graph-Based Concept Learning Solution 2 Find substructure covering positive graphs, but not negative graphs Remove covered positive graphs and iterate until all covered Substructure value = 1 - Error

FLAIRS Empirical Results Comparison with ILP systems Non-relational domains from UCI repository GolfVoteDiabetesCreditTicTacToe FOIL Progol SubdueCL

FLAIRS Empirical Results Comparison with ILP systems Relational domains: Chess endgame WKC WKR WRRWRC BKC BKR pos adj pos adj pos eq WK WR BK lt

FLAIRS Empirical Results: Chess FOIL: 11 rules, 99.34% Progol: 5 rules, 99.74% SubdueCL: 7 rules, 99.74% WKC pos BKCWKRBKR adj lt WKC pos BKCWKRBKR adj WKC pos BKC WKR BKR adj lt WKC pos BKC WKR BKR adj

FLAIRS Empirical Results Relational domain: Cancer SubdueCL: 62% Progol: 64% [72%] compound amine p chromaberr has_group compound amine p chromaberr has_group compound amine p chromaberr has_group compound p drosophila_slrl compound p drosophila_slrl compound p drosophila_slrl

FLAIRS Empirical Results Relational domain: Web Professor (+) vs. student (-) websites Hyperlink structure and page content

FLAIRS Empirical Results Relational domain: Web Computer store (+) vs. professor (-) websites Hyperlink structure only

FLAIRS Conclusions Theoretical analysis of graph-based concept learning PAC-learning conceptual graphs Galois lattice Next step: Relax graph constraints Empirical analysis Competitive with other relational concept learners (ILP) Next step: More relational domains SubdueCL (