Data Warehousing Lecture-31 Supervised vs. Unsupervised Learning Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.

Slides:



Advertisements
Similar presentations
Lecture-19 ETL Detail: Data Cleansing
Advertisements

Data Mining Techniques: Clustering
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-5 Types & Typical Applications of DWH Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Data Mining: A Closer Look Chapter Data Mining Strategies.
ICS 421 Spring 2010 Data Mining 2 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 4/8/20101Lipyeow Lim.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 3: Classification.
Basic Data Mining Techniques Chapter Decision Trees.
What is Cluster Analysis
Basic Data Mining Techniques
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-1 DTS: Introduction Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Lecture-33 DWH Implementation: Goal Driven Approach (1)
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Hazırlayan NEURAL NETWORKS Radial Basis Function Networks II PROF. DR. YUSUF OYSAL.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Lecture-1 Introduction and Background
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-2 Lab Data Set Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Ahsan Abdullah 1 Data Warehousing Lecture-12 Relational OLAP (ROLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Unsupervised Learning. CS583, Bing Liu, UIC 2 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Ahsan Abdullah 1 Data Warehousing Lecture-17 Issues of ETL Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
1 Advanced Databases Data Mining Dr Theodoros Manavis
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Knowledge Discovery and Data Mining Evgueni Smirnov.
1 Lecture 10 Clustering. 2 Preview Introduction Partitioning methods Hierarchical methods Model-based methods Density-based methods.
Data Warehousing 1 Lecture-24 Need for Speed: Parallelism Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-37 Case Study: Agri-Data Warehouse Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
1 Data Warehousing Lecture-13 Dimensional Modeling (DM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Ahsan Abdullah 1 Data Warehousing Lecture-7De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-4 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Ahsan Abdullah 1 Data Warehousing Lecture-18 ETL Detail: Data Extraction & Transformation Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. &
Ahsan Abdullah 1 Data Warehousing Lecture-9 Issues of De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing 1 Lecture-28 Need for Speed: Join Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
1 Data Warehousing Lecture-14 Process of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
1 Data Warehousing Lecture-15 Issues of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-29 Brief Intro. to Data Mining Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
DATA MINING By Cecilia Parng CS 157B.
Prepared by: Mahmoud Rafeek Al-Farra
Clustering Clustering is a technique for finding similarity groups in data, called clusters. I.e., it groups data instances that are similar to (near)
Ch. Eick: Introduction to Hierarchical Clustering and DBSCAN 1 Remaining Lectures in Advanced Clustering and Outlier Detection 2.Advanced Classification.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-22 DQM: Quantifying Data Quality Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Compiled By: Raj Gaurang Tiwari Assistant Professor SRMGPC, Lucknow Unsupervised Learning.
Ahsan Abdullah 1 Data Warehousing Lecture-6Normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Cluster Analysis This lecture node is modified based on Lecture Notes for Chapter.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
Ahsan Abdullah 1 Data Warehousing Lecture-8 De-normalization Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Introduction to Data Mining Clustering & Classification Reference: Tan et al: Introduction to data mining. Some slides are adopted from Tan et al.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-21 Introduction to Data Quality Management (DQM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof.
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Data Mining Classification and Clustering Techniques Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction to Data Mining.
Lecture-3 Introduction and Background
Semi-Supervised Clustering
Lecture-32 DWH Lifecycle: Methodologies
Basic Intro Tutorial on Machine Learning and Data Mining
Prepared by: Mahmoud Rafeek Al-Farra
Data Mining 資料探勘 分群分析 (Cluster Analysis) Min-Yuh Day 戴敏育
Lecture-38 Case Study: Agri-Data Warehouse
Supervised vs. unsupervised Learning
Lecture-35 DWH Implementation: Pitfalls, Mistakes, Keys
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Presentation transcript:

Data Warehousing Lecture-31 Supervised vs. Unsupervised Learning Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research National University of Computers & Emerging Sciences, Islamabad

Data Structures in Data Mining Data matrix –Table or database –n records and m attributes, –n >> m C 1,1 C 1,2 C 1,3 C 1,m C 2,1 C 2,2 C 2,3 C 2,m C 3,1 C 3,2 C 3,3 C 3,m C n,1 C n,2 C n,3 C n,m … … S 1,2 S 1,3 S 1,n S 2,1 1S 2,3 S 2,n S 3,1 S 3,2 1S 3,n S n,1 S n,2 S n,3 1 … … Similarity matrix –Symmetric square matrix –n x n or m x m

Main types of DATA MINING Supervised Bayesian Modeling Decision Trees Neural Networks Etc. Unsupervised One-way Clustering Two-way Clustering Type and number of classes are NOT known in advance Type and number of classes are known in advance

Clustering: Min-Max Distance Age Salary outlier Inter-cluster distances are maximized Intra-cluster distances are minimized

How Clustering works?

One-way clustering example INPUT OUTPUT Black spots are noise White spots are missing data

Data Mining Agriculture data INPUT Clustered OUTPUT clusters

Which class? Classifier (model) Unseen Data Classification

Output Confidence Level Inputs How Classification work?

Classification Process (1): Model ConstructionTrainingData ClassificationAlgorithms IF time/items >= 6 THEN gender = ‘F’ Classifier(Model) (observations, measurements, etc.) Relationship between shopping time and items bought

Classification Process (2): Use the Model in PredictionTestingData Unseen Data (Firdous, Time= 15 Items = 1) Classifier Gender?

Clustering vs. Cluster Detection

Clustering vs. Cluster Detection ExampleA B

The K-Means Clustering

The K-Means Clustering: Example A B D C

The K-Means Clustering: Comment