Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research National University of Computers & Emerging Sciences, Islamabad
CLASSIFICATION
ESTIMATION
PREDICTION
MARKET BASKET ANALYSIS
98% of people who purchased items A and B also purchased item C A Y B Ask graphics to replace pictures of items with similar pictures MARKET BASKET ANALYSIS
Discovering Association Rules Cola, Diaper, Milk5 Juice, Bread, Diaper, Milk4 Juice, Cola, Diaper, Milk3 Juice, Bread2 Bread, Cola, Milk1 ItemsTID Rules: {Milk} {Cola} {Diaper, Milk} {Juice}
Task of segmenting a heterogeneous population into a number of more homogenous sub-groups or clusters. CLUSTERING
Examples of Clustering Applications Marketing: Insurance: Land use: Seismic studies:
Ambiguity in Clustering How many clusters? Two clusters Four clusters Six clusters
DESCRIPTION
Comparing Methods Accuracy: Speed: Robustness: Scalability: Interpretability: Simplicity:
Where does Data Mining fits in? Data Preprocessing Selection Cleaning Transformation Feature Extraction Knowledge Data Mining Identify Patterns Generate Models Interpretation/ Evaluation Validation Tests Visualization Data Mining is one step of Knowledge Discovery in Databases (KDD)