Prepared by: Mahmoud Rafeek Al-Farra

Slides:



Advertisements
Similar presentations
Text Categorization.
Advertisements

COMPUTER AIDED DIAGNOSIS: CLASSIFICATION Prof. Yasser Mostafa Kadah –
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
K-NEAREST NEIGHBORS AND DECISION TREE Nonparametric Supervised Learning.
Data Mining Classification: Alternative Techniques
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
1 Machine Learning: Lecture 7 Instance-Based Learning (IBL) (Based on Chapter 8 of Mitchell T.., Machine Learning, 1997)
Kansas State University Department of Computing and Information Sciences Laboratory for Knowledge Discovery in Databases (KDD) KDD Group Research Seminar.
1er. Escuela Red ProTIC - Tandil, de Abril, Instance-Based Learning 4.1 Introduction Instance-Based Learning: Local approximation to the.
K nearest neighbor and Rocchio algorithm
SAK 5609 DATA MINING Prof. Madya Dr. Md. Nasir bin Sulaiman
1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 3: Classification.
Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Nearest Neighbor Models (Reading: Chapter.
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Neural Networks Chapter Feed-Forward Neural Networks.
Basic concepts of Data Mining, Clustering and Genetic Algorithms Tsai-Yang Jea Department of Computer Science and Engineering SUNY at Buffalo.
Data-intensive Computing Algorithms: Classification Ref: Algorithms for the Intelligent Web 6/26/20151.
Aprendizagem baseada em instâncias (K vizinhos mais próximos)
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Recommender systems Ram Akella November 26 th 2008.
INSTANCE-BASE LEARNING
CS Instance Based Learning1 Instance Based Learning.
Introduction to Data Mining Engineering Group in ACL.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
1 Text Categorization  Assigning documents to a fixed set of categories  Applications:  Web pages  Recommending pages  Yahoo-like classification hierarchies.
Supervised Learning and k Nearest Neighbors Business Intelligence for Managers.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Chapter 13 Genetic Algorithms. 2 Data Mining Techniques So Far… Chapter 5 – Statistics Chapter 6 – Decision Trees Chapter 7 – Neural Networks Chapter.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.7: Instance-Based Learning Rodney Nielsen.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Prepared by: Mahmoud Rafeek Al-Farra
October 2-3, 2015, İSTANBUL Boğaziçi University Prof.Dr. M.Erdal Balaban Istanbul University Faculty of Business Administration Avcılar, Istanbul - TURKEY.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Outline K-Nearest Neighbor algorithm Fuzzy Set theory Classifier Accuracy Measures.
K nearest neighbors algorithm Parallelization on Cuda PROF. VELJKO MILUTINOVIĆ MAŠA KNEŽEVIĆ 3037/2015.
Eick: kNN kNN: A Non-parametric Classification and Prediction Technique Goals of this set of transparencies: 1.Introduce kNN---a popular non-parameric.
Debrup Chakraborty Non Parametric Methods Pattern Recognition and Machine Learning.
Introduction to Data Mining Clustering & Classification Reference: Tan et al: Introduction to data mining. Some slides are adopted from Tan et al.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Data-intensive Computing Algorithms: Classification
k-Nearest neighbors and decision tree
Prepared by: Mahmoud Rafeek Al-Farra
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Data Mining 101 with Scikit-Learn
Data Mining: Concepts and Techniques (3rd ed
Data Mining Techniques So Far…
Instance Based Learning (Adapted from various sources)
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
Nearest-Neighbor Classifiers
Research Areas Christoph F. Eick
Prepared by: Mahmoud Rafeek Al-Farra
Prepared by: Mahmoud Rafeek Al-Farra
Text Categorization Assigning documents to a fixed set of categories
COSC 4335: Other Classification Techniques
Objectives Data Mining Course
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Prepared by: Mahmoud Rafeek Al-Farra
Machine Learning: UNIT-4 CHAPTER-1
Nearest Neighbors CSC 576: Data Mining.
Naive Bayes Classifier
A task of induction to find patterns
Presentation transcript:

Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining Chapter 4_3: Classification Methods (Examples) Prepared by: Mahmoud Rafeek Al-Farra 2013 www.cst.ps/staff/mfarra

Course’s Out Lines Introduction Data Preparation and Preprocessing Data Representation Classification Methods Evaluation Clustering Methods Mid Exam Association Rules Knowledge Representation Special Case study : Document clustering Discussion of Case studies by students

Out Lines Machine learning techniques k-Nearest Neighbors Naïve Bayesian Classifiers

k-Nearest Neighbors Also called instance based learning. K-nearest neighbor is a supervised learning algorithm where the result of new instance query is classified based on majority of K-nearest neighbor category. The purpose of this algorithm is to classify a new object based on attributes and training samples.

k-Nearest Neighbors The classifiers do not use any model to fit and only based on memory. Given a query point, We find K number of objects or (training points) closest to the query point. The classification is using majority vote among the classification of the K objects.

k-Nearest Neighbors - Algorithm Given a new instance x, find its nearest neighbor <x’,y’> Return y’ as the class of x

Advantage and Disadvantage Robust to noisy training data Effective if the training data is large Disadvantage Need to determine value of parameter K Distance based learning is not clear which type of distance to use and which attribute to use to produce the best results. Computation cost is quite high because we need to compute distance of each query instance to all training samples.

Next … Naïve Bayesian Classifiers Artificial Neural Networks

Thanks