School of Computer Science & Engineering

Slides:



Advertisements
Similar presentations
Rule extraction in neural networks. A survey. Krzysztof Mossakowski Faculty of Mathematics and Information Science Warsaw University of Technology.
Advertisements

Huge Raw Data Cleaning Data Condensation Dimensionality Reduction Data Wrapping/ Description Machine Learning Classification Clustering Rule Generation.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
OUTLINE Course description, What is pattern recognition, Cost of error, Decision boundaries, The desgin cycle.
Week 9 Data Mining System (Knowledge Data Discovery)
Learning From Data Chichang Jou Tamkang University.
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
CS Instance Based Learning1 Instance Based Learning.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
This week: overview on pattern recognition (related to machine learning)
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
COMP3503 Intro to Inductive Modeling
Pattern Recognition & Machine Learning Debrup Chakraborty
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Data Mining: An Introduction Billy Mutell. “The Library of Babel” Analogy Network of bookshelves with every book ever written All the books one could.
Introduction to Artificial Intelligence and Soft Computing
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Machine Learning Tutorial Amit Gruber The Hebrew University of Jerusalem.
Classification Techniques: Bayesian Classification
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Week 1 - An Introduction to Machine Learning & Soft Computing
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Chapter1: Introduction Chapter2: Overview of Supervised Learning
Data Mining and Decision Support
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
The Hebrew University of Jerusalem School of Engineering and Computer Science Academic Year: 2011/2012 Instructor: Jeff Rosenschein.
Brief Intro to Machine Learning CS539
Big data classification using neural network
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
CS 9633 Machine Learning Support Vector Machines
Machine Learning for Computer Security
Deep Learning Amin Sobhani.
Introduction Machine Learning 14/02/2017.
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Intro to Machine Learning
School of Computer Science & Engineering
Prepared by: Mahmoud Rafeek Al-Farra
CSSE463: Image Recognition Day 11
Data Mining: Concepts and Techniques (3rd ed
What is Pattern Recognition?
Machine Learning Week 1.
Basic Intro Tutorial on Machine Learning and Data Mining
Advanced Embodiment Design 26 March 2015
Classification Techniques: Bayesian Classification
CSSE463: Image Recognition Day 11
Introduction to Artificial Intelligence and Soft Computing
Prepared by: Mahmoud Rafeek Al-Farra
An Introduction to Supervised Learning
School of Computer Science & Engineering
Overview of Machine Learning
Artificial Intelligence Lecture No. 28
Data Mining: Introduction
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
CSSE463: Image Recognition Day 11
CSSE463: Image Recognition Day 11
A task of induction to find patterns
A task of induction to find patterns
Presentation transcript:

School of Computer Science & Engineering Artificial Intelligence Machine Learning, Pattern Recognition, Data Mining Dae-Won Kim School of Computer Science & Engineering Chung-Ang University

AI Scope 3. Machine Learning/ Pattern Recognition/ Data Mining 1. Search-based optimization techniques for real-life problems Hill climbing, Branch and bound, A*, Greedy algorithm Simulated annealing, Tabu search, Genetic algorithm 2. Reasoning: Logic, Inference, and knowledge representation Logical language: Syntax and Semantics Inference algorithm: Forward/Backward chaining, Resolution, and Expert System 3. Machine Learning/ Pattern Recognition/ Data Mining Classification: Bayesian algorithm, Nearest-neighbor algorithm, Neural network Clustering: Hierarchical algorithm, K-Means algorithm 4. Uncertainty based on Probability theory 5. Planning, Scheduling, Robotics, and Industry Automation

Have you ever heard of Big Data?

Progress in digital data acquisition and storage technology has resulted in the growth of huge databases.

Data mining is the extraction of implicit, previously unknown, and potentially useful information from data.

We build algorithms that sift through databases automatically, seeking patterns.

Strong patterns, if found, will likely generalize to make accurate predictions on future data.

Algorithms need to be robust enough to cope with imperfect data and to extract patterns that are inexact useful.

Machine learning provides the technical basis of data mining.

We will study simple machine learning methods, looking for patterns in data.

People has been seeking patterns in data since human life began. e.g., Samsung Galaxy: Samsung Pay, Managers in Samsung want to find consuming patterns of users so that it’d provide personalized services.

In data mining, computer algorithm is solving problems by analyzing data in databases.

Data mining is defined as the process of (knowledge) discovering patterns in data.

Data mining is defined as the process of (knowledge) discovering patterns in data.

We start with a simple example.

Q: Tell me the name of this fish.

Algorithm ??

We have 100 fishes, and measured their lengths. (e. g We have 100 fishes, and measured their lengths. (e.g., fish: x=[length]t)

Our algorithm can measure the length of a new fish, and estimate its label.

Yes, it is a typical prediction task through classification technique Yes, it is a typical prediction task through classification technique. But, it is often inexact and unsatisfactory.

Next, we measured their lightness. (e.g., fish: x=[lightness])

Lightness is better than length.

Let us use both lightness and width. (e. g Let us use both lightness and width. (e.g., fish: x=[lightness, width])

Each fish is represented a point (vector) in 2D x-y coordinate space.

Everything is represented as N-dimensional vector in coordinate space.

The world is represented as matrix

We assume that you have learned the basic concepts of linear algebra.

The objective is to find a line that effectively separates two groups.

How to find the line using a simple Math from high school?

We can build a complex nonlinear line to provide exact separation.

The formal procedure is given as:

This shows a predictive task of data mining, often called as pattern classification/ recognition/ prediction.

The act of taking in raw data and making an action based on the category of the pattern.

We build a machine that can recognize or predict patterns.

Q: How to represent and classify texts? Opinion mining Sentiment analysis

Another famous task of data mining is a descriptive task Another famous task of data mining is a descriptive task. Cluster analysis is the well-known group discovery algorithm.

We will experience the basic issues in the prediction task (pattern classification) in forthcoming weeks.

Some terms should be defined.

Given training data set : ‘n x d’ pattern/data matrix: Fish Lightness Length Weight Width Class Label Fish-1 10 70.3 6.0 36 Salmon Fish-2 75.5 8.8 128 Fish-3 29 51.1 9.4 164 Sea bass Fish-4 49.9 8.4 113

Given training data set : ‘n x d’ pattern/data matrix: ‘d’ features (attributes, variables, dimensions, fields) Fish Lightness Length Weight Width Class Label Fish-1 10 70.3 6.0 36 Salmon Fish-2 75.5 8.8 128 Fish-3 29 51.1 9.4 164 Sea bass Fish-4 49.9 8.4 113 ‘n’ patterns (objects, observations, vectors, records)

Classification General description Specific terms Supervised pattern classification Labeled training patterns, the groups are known a priori Constructs rules for classifying new data into the known groups Specific terms Pattern=object=observation is represented as a feature vector Distance measure for numeric and categorical data Training set (answer database) and test set (new observation) Prediction performance by accuracy, sensitivity, specificity, … ex) Bayesian classifier, Nearest-neighbor classifier, SVM, NN, LDA, …

Each pattern is represented as a feature vector.

The training pattern matrix is stored in a file or database.

Given labeled training patterns, the class groups are known a priori.

We constructs algorithms to classify new data into the known groups.

Training data vs. Test data

Training data are used as answers Training data are used as answers. We are learning algorithms using training data.

Test data are a set of new unseen data Test data are a set of new unseen data. We predict class labels using the learned algorithm.

Training data Test data # of data # of features data index feature-1 … feature-N class label Feature-1 Test data # of data # of features data index feature-1 feature-2 … feature-N Feature-1

For example, we try to classify the tumor type of breast cancer patients

Breast-cancer-training.txt 100 30 Patient-1 165 52 … 210 cancer Patient-2 170 50 230 normal Patient-100 160 47 250 Breast-cancer-test.txt 10 163 55 215 155 240 Patient-10 45 235

To evaluate the performance of prediction algorithms, we need a performance measure (Accuracy).

Accuracy = (True Positive + True Negative) / Gold Standard (Truth) Positive Negative Prediction Result True Positive False Positive False Negative True Negative Suspicious Patients with Breast Cancer Positive (Cancer) Negative (Normal) Accuracy = (True Positive + True Negative) / (True Positive + False Positive + False Negative + True Negative)

Accuracy = (30 + 55) / (30 + 5 + 10 + 55) = 0.85 (85%) Gold Standard (Truth) Positive Negative Prediction Result True Positive False Positive False Negative True Negative Suspicious Patients with Breast Cancer Positive (Cancer) Negative (Normal) 30 5 10 55 Accuracy = (30 + 55) / (30 + 5 + 10 + 55) = 0.85 (85%)

References # Textbooks # Advanced Topic: Emotional data mining 1) R.O. Duda, et al., “Pattern Classification” 2) S. Theodoridis., “Pattern Recognition” 3) T.M. Mitchell., “Machine Learning” # Advanced Topic: Emotional data mining Personalized music recommendation (Nov.)