ECE 471/571 – Lecture 12 Perceptron
Different Approaches - More Detail
Pattern Classification
  Statistical Approach
    Supervised
      Basic concepts: Bayesian decision rule (MPP, LR, Discri.)
      Parametric learning (ML, BL)
      Non-parametric learning (kNN)
      NN (Perceptron, BP)
    Unsupervised
      Basic concepts: Distance
      Agglomerative method
      k-means
      Winner-take-all
      Kohonen maps
    Dimensionality Reduction: Fisher's linear discriminant, K-L transform (PCA)
    Performance Evaluation: ROC curve; TP, TN, FN, FP
    Stochastic Methods: local optimization (GD), global optimization (SA, GA)
  Syntactic Approach
ECE471/571, Hairong Qi
Perceptron
[Diagram: inputs x1, x2, ..., xd with weights w1, w2, ..., wd feed a summation node S, which produces output z; a constant input 1 carries weight w0, the bias or threshold]
Perceptron
A program that learns "concepts" based on examples and correct answers
It can only respond with "true" or "false"
A single-layer neural network
Through training, the weights and bias of the network are changed until it classifies the training set with 100% accuracy
Perceptron Criterion Function
J_p(a) = sum_{y in Y} (-a'y)
where Y is the set of samples misclassified by the weight vector a
Gradient descent learning: since the gradient is sum_{y in Y} (-y), the update is
a(k+1) = a(k) + eta(k) * sum_{y in Y} y
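The batch update above can be sketched in a few lines. This is a minimal illustration, not the course code: the 2-D sample values and learning rate are made up, and class-2 samples are sign-normalized (negated) so that correct classification means a'y > 0 for every y.

```python
# Batch perceptron on the criterion J_p(a) = sum over misclassified y of (-a'y).
# Samples are augmented as [1, x1, x2]; class-2 samples are negated (assumed data).

def dot(a, y):
    return sum(ai * yi for ai, yi in zip(a, y))

samples = [
    [ 1.0,  2.0,  2.0],   # class 1: point (2, 2)
    [ 1.0,  3.0,  1.0],   # class 1: point (3, 1)
    [-1.0,  1.0,  1.0],   # class 2: point (-1, -1), negated
    [-1.0,  0.0,  2.0],   # class 2: point (0, -2), negated
]

a = [0.0, 0.0, 0.0]       # augmented weight vector
eta = 0.5                 # learning rate (assumed constant)

for epoch in range(100):
    misclassified = [y for y in samples if dot(a, y) <= 0]
    if not misclassified:
        break             # every sample satisfies a'y > 0
    # gradient of J_p is sum of (-y), so step along +sum(y)
    for j in range(len(a)):
        a[j] += eta * sum(y[j] for y in misclassified)

print(a)
```

On separable data like this, the loop stops as soon as an epoch finds no misclassified samples.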
Perceptron Learning Rule
[Diagram: same perceptron as before, with the bias input on the constant 1 drawn as -w0]
w(k+1) = w(k) + (T - z) x
-w0(k+1) = -w0(k) + (T - z)
T is the expected output
z is the real output
Training
Step 1: Present the samples to the network
Step 2: If the output is correct, make no change; otherwise, update the weights and bias based on the perceptron learning rule
Step 3: An entire pass through the training set is called an "epoch". If no change was made during the epoch, stop; otherwise, go back to Step 1
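The three steps above can be sketched as a short loop. This uses the OR gate rather than the AND gate covered later; the zero initial weights are an assumption (the slides' MATLAB demo initializes them randomly).

```python
# Epoch-based perceptron training with the rule w(k+1) = w(k) + (T - z) x,
# demonstrated on the OR gate (any linearly separable set would work).

X = [(0, 0), (0, 1), (1, 0), (1, 1)]
T = [0, 1, 1, 1]                      # OR truth table

w = [0.0, 0.0]                        # weights (assumed zero start)
w0 = 0.0                              # threshold: output z = 1 iff w.x > w0

while True:
    changed = False
    for x, t in zip(X, T):            # Step 1: present each sample
        z = 1 if w[0]*x[0] + w[1]*x[1] > w0 else 0
        if z != t:                    # Step 2: update only on error
            w[0] += (t - z) * x[0]
            w[1] += (t - z) * x[1]
            w0   -= (t - z)           # -w0 update from the learning rule
            changed = True
    if not changed:                   # Step 3: stop after an error-free epoch
        break

print(w, w0)
```

After convergence the network reproduces the OR truth table exactly.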
Exercise (AND Logic)
[Diagram: two-input perceptron with weights w1, w2 and bias input -w0]

 x1  x2 | T
  0   0 | 0
  1   0 | 0
  0   1 | 0
  1   1 | 1

(Track w1, w2, -w0 after each update.)
% demonstration on perceptron
% AND gate
%
% Hairong Qi

input = 2;                        % number of inputs
tr = 4;                           % number of training samples
w = rand(1,input+1);              % generate the initial weight vector
                                  % (w(input+1) holds the threshold w0)
x = [0 0; 1 0; 0 1; 1 1];         % the training set (or the inputs)
T = [0; 0; 0; 1];                 % the expected output

% learning process (training process)
finish = 0;
while ~finish
    disp(w)
    for i = 1:tr
        z(i) = w(1:input) * x(i,:)' > w(input+1);
        w(1:input) = w(1:input) + (T(i)-z(i)) * x(i,:);
        w(input+1) = w(input+1) - (T(i)-z(i));
    end
    if sum(abs(z - T')) == 0      % error-free epoch: training is done
        finish = 1;
        disp(z)
    end
    pause                         % press a key to run the next epoch
end
Visualizing the Decision Rule
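The decision rule of a trained two-input perceptron is the straight line w1*x1 + w2*x2 = w0: inputs on one side output 1, the other side 0. A quick check, using one valid (assumed) solution for the AND gate rather than weights produced by training:

```python
# Decision boundary x1 + x2 = 1.5 separates (1,1) from the other three inputs.
w1, w2, w0 = 1.0, 1.0, 1.5        # one valid AND solution (assumed, not trained)

for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    z = 1 if w1*x1 + w2*x2 > w0 else 0
    print((x1, x2), "->", z)      # only (1, 1) lands above the line
```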
Limitations
The output takes only two values (1 or 0)
It can only classify samples that are linearly separable (separable by a straight line or plane)
It cannot learn functions that are not linearly separable, such as XOR
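The XOR failure can be seen empirically: no line w1*x1 + w2*x2 = w0 puts (0,1) and (1,0) on one side and (0,0) and (1,1) on the other, so no epoch can ever be error-free. A sketch capping training at an assumed 1000 epochs:

```python
# Running the perceptron learning rule on XOR never reaches an error-free epoch.
X = [(0, 0), (0, 1), (1, 0), (1, 1)]
T = [0, 1, 1, 0]                      # XOR truth table

w = [0.0, 0.0]
w0 = 0.0
converged = False
for _ in range(1000):                 # cap (assumed); XOR exhausts it
    changed = False
    for x, t in zip(X, T):
        z = 1 if w[0]*x[0] + w[1]*x[1] > w0 else 0
        if z != t:
            w[0] += (t - z) * x[0]
            w[1] += (t - z) * x[1]
            w0   -= (t - z)
            changed = True
    if not changed:                   # would require linear separability
        converged = True
        break

print("converged:", converged)
```

The weights simply cycle forever; a multi-layer network (trained with backpropagation, covered later) is needed for XOR.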
Examples