CS490D: Introduction to Data Mining Prof. Chris Clifton April 19, 2004 More on Associations/Rules.

Slides:



Advertisements
Similar presentations
1-A Introduction to Biology
Advertisements

Frequent Itemset Mining Methods. The Apriori algorithm Finding frequent itemsets using candidate generation Seminal algorithm proposed by R. Agrawal and.
Data Mining Techniques Association Rule
Machine Learning: Lecture 9
Association Analysis (Data Engineering). Type of attributes in assoc. analysis Association rule mining assumes the input data consists of binary attributes.
Data Mining (Apriori Algorithm)DCS 802, Spring DCS 802 Data Mining Apriori Algorithm Spring of 2002 Prof. Sung-Hyuk Cha School of Computer Science.
gSpan: Graph-based substructure pattern mining
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
Organizing Your Argument The Argumentative Essay.
Adaptive Frequency Counting over Bursty Data Streams Bill Lin, Wai-Shing Ho, Ben Kao and Chun-Kit Chui Form CIDM07.
Rakesh Agrawal Ramakrishnan Srikant
Data Mining Association Analysis: Basic Concepts and Algorithms Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
ICS 421 Spring 2010 Data Mining 1 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 4/6/20101Lipyeow Lim.
CS590D: Data Mining Prof. Chris Clifton April 21, 2005 Multi-Relational Data Mining.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Decision Tree Algorithm
Summarization of Frequent Pattern Mining. What is FPM? Why being frequent is so important? Application of FPM Decision make/Business Software Debugging.
David Lo Siau-Cheng Khoo Chao Liu DASFAA 2008 Efficient Mining of Recurrent Rules from a Sequence Database 1.
Association Rule Mining (Some material adapted from: Mining Sequential Patterns by Karuna Pande Joshi)‏
Lecture14: Association Rules
1 Fast Algorithms for Mining Association Rules Rakesh Agrawal Ramakrishnan Srikant Slides from Ofer Pasternak.
Association Analysis (5) (Mining Word Associations)
Descriptive Research. D Used to obtain information concerning the current status of a phenomena. D Purpose of these methods is to describe “what exists”
THE MODEL OF ASIS FOR PROCESS CONTROL APPLICATIONS P.Andreeva, T.Atanasova, J.Zaprianov Institute of Control and System Researches Topic Area: 12. Intelligent.
Group practice in problem design and problem solving
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Notes for Chapter 12 Logic Programming The AI War Basic Concepts of Logic Programming Prolog Review questions.
林俊宏 Parallel Association Rule Mining based on FI-Growth Algorithm Bundit Manaskasemsak, Nunnapus Benjamas, Arnon Rungsawang.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
1 Mining Association Rules Mohamed G. Elfeky. 2 Introduction Data mining is the discovery of knowledge and useful information from the large amounts of.
Derivatives of Exponential Functions Lesson 4.4. An Interesting Function Consider the function y = a x Let a = 2 Graph the function and it's derivative.
The interaction of measurement, models and accountability: How values affect our growth model choices. Jennifer L. Dunn Center for Assessment.
November 10, Machine Learning: Lecture 9 Rule Learning / Inductive Logic Programming.
C-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Extended Learning Module.
Association Rules Plan for this week Progress report Midterm review In class Minute paper.
Frequent Subgraph Discovery Michihiro Kuramochi and George Karypis ICDM 2001.
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
The Volcano Optimizer Generator Extensibility and Efficient Search.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
Instructor : Prof. Marina Gavrilova. Goal Goal of this presentation is to discuss in detail how data mining methods are used in market analysis.
CS Introduction to AI Tutorial 8 Resolution Tutorial 8 Resolution.
Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules S.D. Lee, David W. Cheung, Ben Kao The University of Hong.
CS4026 Formal Models of Computation Part II The Logic Model Lecture 2 – Prolog: History and Introduction.
1 Efficient Algorithms for Incremental Update of Frequent Sequences Minghua ZHANG Dec. 7, 2001.
Ch. 13 Ch. 131 jcmt CSE 3302 Programming Languages CSE3302 Programming Languages (notes?) Dr. Carter Tiernan.
For Wednesday Read 20.4 Lots of interesting stuff in chapter 20, but we don’t have time to cover it all.
Solving Literal Equations and Formulas Objectives: I can solve literal equations (formulas) for a specified variable. I can apply these skills to solve.
Logic Programming. Formal Logics- Recap Formulas – w/out quantifiers Free Variables Bound Variables Assignments and satisfaction Validity and satisfiability.
9.1c SKM & PP 1 Quadratic Equations. 9.1c SKM & PP 2 Square Roots: Know Your Squares! Consider this equation: What number(s) could be squared to give.
HEMANTH GOKAVARAPU SANTHOSH KUMAR SAMINATHAN Frequent Word Combinations Mining and Indexing on HBase.
1998 년 8 월 7 일 Data Engineering Lab 성 유진 1 Exploratory Mining and Pruning Optimization of Constrained Associations Rules.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
Gspan: Graph-based Substructure Pattern Mining
COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI
A Research Oriented Study Report By :- Akash Saxena
Data Mining Association Analysis: Basic Concepts and Algorithms
DIRECT HASHING AND PRUNING (DHP) ALGORITHM
SEG 4630 E-Commerce Data Mining — Final Review —
Transactional data Algorithm Applications
Homework 1 (parts 1 and 2) For the general system described by the following dynamics: We have the following algorithm for generating the Kalman gain and.
Data Mining Association Analysis: Basic Concepts and Algorithms
CSE 573 Introduction to Artificial Intelligence Decision Trees
Lesson 2.2a EQ: How do I solve a literal equation for a specified variable? Standard(CED.4) Equations.
CS490D: Introduction to Data Mining Prof. Chris Clifton
Market Basket Analysis and Association Rules
Heritage Definition: something that is passed down from preceding generations.
Lecture 11 (Market Basket Analysis)
Limits and Continuity An introduction to Limits and how we will be using them.
Section 5.5: The Substitution Rule
Presentation transcript:

CS490D: Introduction to Data Mining Prof. Chris Clifton April 19, 2004 More on Associations/Rules

Project: Association Several people have suggested association rule mining –Good idea Issues –Most data is not “itemsets”, but scaled values –Is apriori the right algorithm? Ideas –Try Tertius –Decision rules (need to define class)

Tertius First-order logic learning –Similar to ILP we have been discussing Provide structural schema definition –Individual: What is an individual (for counting purposes) –Structural: Define relationships between individuals –Properties: Things that describe an individual ning/Tertius/index.htmlhttp:// ning/Tertius/index.html

Tertius: Use Call: Specify number of literals, number of variables in rules –Finds strongest k rules Predicate File –Parent 2 person person cwa –Daughter 2 person person cwa –Male 1 person cwa Fact File –Parent(Chris, Denise) –Daughter(Denise, Chris) –Female(Denise)

JRIP/Ripper Decision Rules –Like association rules –But need target class (right hand side) Idea: –Grow rule –Prune rule –If good then keep –Repeat Growth: Based on information gain Prune: p+N-n / P+N