Multi-Relational Data Mining: An Introduction Joe Paulowskey.

Slides:



Advertisements
Similar presentations
1 CS 391L: Machine Learning: Rule Learning Raymond J. Mooney University of Texas at Austin.
Advertisements

Machine Learning: Lecture 9
Ch2 Data Preprocessing part3 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Induction and Decision Trees. Artificial Intelligence The design and development of computer systems that exhibit intelligent behavior. What is intelligence?
Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb.
Building Global Models from Local Patterns A.J. Knobbe.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
CS590D: Data Mining Prof. Chris Clifton April 21, 2005 Multi-Relational Data Mining.
1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
A First Attempt towards a Logical Model for the PBMS PANDA Meeting, Milano, 18 April 2002 National Technical University of Athens Patterns for Next-Generation.
Mining Sequence Patterns from Wind Tunnel Experimental Data Zhenyu Liu †, Wesley W. Chu †, Adam Huang ‡, Chris Folk ‡, Chih-Ming Ho ‡
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Knowledge Representation. 2 Outline: Output - Knowledge representation  Decision tables  Decision trees  Decision rules  Rules involving relations.
Data Mining and Data Warehousing – a connected view.
Research Project Mining Negative Rules in Large Databases using GRD.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Video Mining Learning Patterns of Behaviour via an Intelligent Image Analysis System.
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Neural Networks in Data Mining “An Overview”
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining and Decision Tree CS157B Spring 2006 Masumi Shimoda.
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Overview of Distributed Data Mining Xiaoling Wang March 11, 2003.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Inductive Logic Programming (for Dummies) Anoop & Hector.
Inductive Logic Programming Includes slides by Luis Tari CS7741L16ILP.
Notes for Chapter 12 Logic Programming The AI War Basic Concepts of Logic Programming Prolog Review questions.
INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.
Slides for “Data Mining” by I. H. Witten and E. Frank.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
1 A Theoretical Framework for Association Mining based on the Boolean Retrieval Model on the Boolean Retrieval Model Peter Bollmann-Sdorra.
Mining Multidimensional Sequential Patterns over Data Streams Chedy Raїssi and Marc Plantevit DaWak_2008.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Chapter 13 Query Processing Melissa Jamili CS 157B November 11, 2004.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
November 10, Machine Learning: Lecture 9 Rule Learning / Inductive Logic Programming.
Data Mining By Dave Maung.
Decision Trees. Decision trees Decision trees are powerful and popular tools for classification and prediction. The attractiveness of decision trees is.
1 Discovering Robust Knowledge from Databases that Change Chun-Nan HsuCraig A. Knoblock Arizona State UniversityUniversity of Southern California Journal.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
A Learning System for Decision Support in Telecommunications Filip Železný, Olga Štěpánková (Czech Technical University in Prague) Jiří Zídek (Atlantis.
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
Detecting Group Differences: Mining Contrast Sets Author: Stephen D. Bay Advisor: Dr. Hsu Graduate: Yan-Cheng Lin.
DATA MINING By Cecilia Parng CS 157B.
ECML/PKDD 2003 Discovery Challenge Attribute-Value and First Order Data Mining within the STULONG project Anneleen Van Assche, Sofie Verbaeten,
CS 5751 Machine Learning Chapter 10 Learning Sets of Rules1 Learning Sets of Rules Sequential covering algorithms FOIL Induction as the inverse of deduction.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
DECISION TREE Ge Song. Introduction ■ Decision Tree: is a supervised learning algorithm used for classification or regression. ■ Decision Tree Graph:
Decision Tree Learning Presented by Ping Zhang Nov. 26th, 2007.
Data Mining and Decision Support
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Clustering Algorithms Sunida Ratanothayanon. What is Clustering?
DECISION TREES Asher Moody, CS 157B. Overview  Definition  Motivation  Algorithms  ID3  Example  Entropy  Information Gain  Applications  Conclusion.
Chap. 10 Learning Sets of Rules 박성배 서울대학교 컴퓨터공학과.
Optimization of Association Rules Extraction Through Exploitation of Context Dependent Constraints Arianna Gallo, Roberto Esposito, Rosa Meo, Marco Botta.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
DATA MINING © Prentice Hall.
RESEARCH APPROACH.
Data Mining 101 with Scikit-Learn
Proper Refinement of Datalog Clauses using Primary Keys
Research Areas Christoph F. Eick
Discriminative Frequent Pattern Analysis for Effective Classification
CS490D: Introduction to Data Mining Prof. Chris Clifton
Presentation transcript:

Multi-Relational Data Mining: An Introduction Joe Paulowskey

Overview Introduction to Data Mining Relational  Data  Patterns Inductive Logic Programming (ILP) Relational Association Rules Relational Decision Trees Relation Distance-Based Approaches

Relation Data Relational Database  Multiple Tables  Defined Views Tables

Relational Pattern Multiple Relations from a relational database  More Expressive Opens up  Classification  Association  Regression

Relational Pattern (Cont.) Expressed in Subsets of First Order Logic

Data Mining Look for patterns in data What do you discover?  Associations  Sequences  Classifications Goals of Data Mining  Predict  Identify  Classify  Optimize Uses  Business Data  Environmental/Traffic Engineering  Web Mining  Drug Design

Data Mining: Relational Databases Most Data Mining approaches deal with single tables  Not safe to merge multiple tables into one single table Number of patterns increases  Explicit constraints required

Inductive Logic Programming (ILP) Logic Programs used to find patterns Clauses  Head and Body  Literals  Types Definite Program

ILP (Cont) Predicate  Relations in relational database  Arguments -> Attributes Attributes are Typed Database Clauses are typed program clauses Deductive Database

Relational Rule Induction ILP Learn logical definitions of relations Classification  Rules can be found by decision trees  Simple Algorithm Dealing with noisy/incomplete data

ILP Problems to Propositional Forms Propositional  attribute-value Use Single Table Data Mining algorithms LINUS  Background Knowledge

ILP/RDM Algorithms Share  Learning as a Search Paradigm Differences  Representation of Data, Patterns  Refinement operators  Testing Coverage Upgrading from Propositional to Relational

Relational Association Rules Frequent Patterns  Determining Frequency  Itemsets Association Rules  Obtained by frequent itemsets

Relational Decision Trees Used for Prediction Binary Trees First Order Decision List

Relational Distance-Based Approaches Calculated distance between two objects Statistical Approaches

Conclusion