10-10-05Prof. Pushpak Bhattacharyya, IIT Bombay 1 CS 621 Artificial Intelligence Lecture 23 - 10/10/05 Prof. Pushpak Bhattacharyya Linear Separability,

Slides:



Advertisements
Similar presentations
CS344: Principles of Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 11, 12: Perceptron Training 30 th and 31 st Jan, 2012.
Advertisements

G53MLE | Machine Learning | Dr Guoping Qiu
CS344: Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 15, 16: Perceptrons and their computing power 6 th and.
Linear Discriminant Functions
L15:Microarray analysis (Classification) The Biological Problem Two conditions that need to be differentiated, (Have different treatments). EX: ALL (Acute.
20.5 Nerual Networks Thanks: Professors Frank Hoffmann and Jiawei Han, and Russell and Norvig.
Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Prof. Carla P. Gomes Module: Neural Networks Expressiveness.
2806 Neural Computation Support Vector Machines Lecture Ari Visa.
Artificial Neural Networks
CS623: Introduction to Computing with Neural Nets (lecture-10) Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 16– Linear and Logistic Regression) Pushpak Bhattacharyya CSE Dept., IIT Bombay.
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 25: Perceptrons; # of regions;
CS344: Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 36-37: Foundation of Machine Learning.
CS621: Artificial Intelligence Lecture 11: Perceptrons capacity Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
Instructor: Prof. Pushpak Bhattacharyya 13/08/2004 CS-621/CS-449 Lecture Notes CS621/CS449 Artificial Intelligence Lecture Notes Set 6: 22/09/2004, 24/09/2004,
CS344 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 29 Introducing Neural Nets.
CS623: Introduction to Computing with Neural Nets Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
CS621: Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 41,42– Artificial Neural Network, Perceptron, Capacity 2 nd, 4 th Nov,
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 31: Feedforward N/W; sigmoid.
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 30: Perceptron training convergence;
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 21 Computing power of Perceptrons and Perceptron Training.
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 29: Perceptron training and.
Linear Classification with Perceptrons
SUPPORT VECTOR MACHINES. Intresting Statistics: Vladmir Vapnik invented Support Vector Machines in SVM have been developed in the framework of Statistical.
Instructor: Prof. Pushpak Bhattacharyya 13/08/2004 CS-621/CS-449 Lecture Notes CS621/CS449 Artificial Intelligence Lecture Notes Set 4: 24/08/2004, 25/08/2004,
Prof. Pushpak Bhattacharyya, IIT Bombay 1 CS 621 Artificial Intelligence Lecture /08/05 Prof. Pushpak Bhattacharyya Fuzzy Set (contd) Fuzzy.
CSSE463: Image Recognition Day 14 Lab due Weds, 3:25. Lab due Weds, 3:25. My solutions assume that you don't threshold the shapes.ppt image. My solutions.
CS621 : Artificial Intelligence
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 32: sigmoid neuron; Feedforward.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 21: Perceptron training and convergence.
CS344: Introduction to Artificial Intelligence (associated lab: CS386) Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 36, 37: Hardness of training.
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
CS623: Introduction to Computing with Neural Nets (lecture-7) Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
Prof. Pushpak Bhattacharyya, IIT Bombay1 CS 621 Artificial Intelligence Lecture 12 – 30/08/05 Prof. Pushpak Bhattacharyya Fundamentals of Information.
Announcements 1. Textbook will be on reserve at library 2. Topic schedule change; modified reading assignment: This week: Linear discrimination, evaluating.
CS623: Introduction to Computing with Neural Nets (lecture-9) Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
Prof. Pushpak Bhattacharyya, IIT Bombay1 CS 621 Artificial Intelligence Lecture /08/05 Prof. Pushpak Bhattacharyya Fuzzy Inferencing.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 20: Neural Net Basics: Perceptron.
CS621: Artificial Intelligence Lecture 10: Perceptrons introduction Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
Pattern Recognition Lecture 20: Neural Networks 3 Dr. Richard Spillman Pacific Lutheran University.
CS621 : Artificial Intelligence
CS623: Introduction to Computing with Neural Nets (lecture-5)
CS344: Introduction to Artificial Intelligence (associated lab: CS386)
Pushpak Bhattacharyya Computer Science and Engineering Department
CS621: Artificial Intelligence
CS623: Introduction to Computing with Neural Nets (lecture-2)
CS621: Artificial Intelligence Lecture 17: Feedforward network (lecture 16 was on Adaptive Hypermedia: Debraj, Kekin and Raunak) Pushpak Bhattacharyya.
Artificial Intelligence Chapter 3 Neural Networks
Pushpak Bhattacharyya Computer Science and Engineering Department
CS623: Introduction to Computing with Neural Nets (lecture-4)
CS623: Introduction to Computing with Neural Nets (lecture-9)
Pushpak Bhattacharyya Computer Science and Engineering Department
Artificial Intelligence Chapter 3 Neural Networks
CS344 : Introduction to Artificial Intelligence
CS 621 Artificial Intelligence Lecture /10/05 Prof
CS621: Artificial Intelligence
Artificial Intelligence Chapter 3 Neural Networks
CS 621 Artificial Intelligence Lecture 27 – 21/10/05
Artificial Intelligence Chapter 3 Neural Networks
ARTIFICIAL INTELLIGENCE
CS621: Artificial Intelligence Lecture 12: Counting no
CS623: Introduction to Computing with Neural Nets (lecture-5)
CS623: Introduction to Computing with Neural Nets (lecture-3)
CS344 : Introduction to Artificial Intelligence
CS621: Artificial Intelligence Lecture 22-23: Sigmoid neuron, Backpropagation (Lecture 20 and 21 taken by Anup on Graphical Models) Pushpak Bhattacharyya.
Prof. Pushpak Bhattacharyya, IIT Bombay
CS621: Artificial Intelligence Lecture 14: perceptron training
CS621: Artificial Intelligence Lecture 18: Feedforward network contd
Artificial Intelligence Chapter 3 Neural Networks
CS621: Artificial Intelligence Lecture 17: Feedforward network (lecture 16 was on Adaptive Hypermedia: Debraj, Kekin and Raunak) Pushpak Bhattacharyya.
Presentation transcript:

Prof. Pushpak Bhattacharyya, IIT Bombay 1 CS 621 Artificial Intelligence Lecture /10/05 Prof. Pushpak Bhattacharyya Linear Separability, Introduction of Feedforward Network

Prof. Pushpak Bhattacharyya, IIT Bombay 2 Test for Linear Separability (LS) Theorem: A function is linearly separable iff the vectors corresponding to the function do not have a Positive Linear Combination (PLC) PLC – Both a necessary and sufficient condition. X 1, X 2, …, X m - Vectors of the function Y 1, Y 2, …, Y m - Augmented negated set Prepending -1 to the 0-class vector X i and negating it, gives Y i

Prof. Pushpak Bhattacharyya, IIT Bombay 3 Example (1) - XNOR The set {Y i } has a PLC if ∑ P i Y i = 0, 1 ≤ i ≤ m –where each P i is a non-negative scalar and –atleast one P i > 0 Example : 2 bit even- parity (X-NOR function) X1X1 +Y1Y1 X2X2 -Y2Y2 X3X3 -Y3Y3 X4X4 +Y4Y4

Prof. Pushpak Bhattacharyya, IIT Bombay 4 Example (1) - XNOR P 1 [ ] T + P 2 [ ] T + P 3 [ ] T + P 4 [ ] T = [ ] T All P i = 1 gives the result. For Parity function, PLC exists => Not linearly separable.

Prof. Pushpak Bhattacharyya, IIT Bombay 5 Example - AND AND does not have PLC. Suppose not,  P 1, P 2, P 3,P 4 s.t. 4  P i X i T = 0 i=1

Prof. Pushpak Bhattacharyya, IIT Bombay 6 AND (Contd) X 1 = [1,0,0] X 2 = [1,0,-1] X 3 = [1,-1,0] X 4 = [-1,1,1] P 1 [1,0,0] T + P 2 [1,0,-1] T + P 3 [1,-1,0] T + P 4 [-1,1,1] T = [0,0,0] T P 1 + P 2 + P 3 - P 4 = 0- (1) - P 3 + P 4 = 0- (2) - P 2 + P 4 = 0- (3)

Prof. Pushpak Bhattacharyya, IIT Bombay 7 AND (contd) This can be satisfied if P 1 = P 2 = P 3 = P 4 = 0 So, PLC does not exist. So, AND is computable by perceptron. However finding PLC is not efficient.

Prof. Pushpak Bhattacharyya, IIT Bombay 8 Exercise Try to learn the SNNS package (available on CS621 homepage). Try PLC test for 1.Different boolean functions. 2.Majority function (1 if #1s > #0s) 3.Comparator function (1 if decimal(Y) > decimal(X) 4.Odd parity 5.IRIS data

Prof. Pushpak Bhattacharyya, IIT Bombay 9 Study of Linear Separability W. Xj = 0 defines a hyperplane in the (n+1) dimension. => W vector and Xj vectors are perpendicular to each other.... θ w1w1 w2w2 w3w3 wnwn x2x2 x3x3 xnxn

Prof. Pushpak Bhattacharyya, IIT Bombay 10 Linear Separability X k+1 - X k X m - X1+X1+ + X 2 Xk+Xk+ + Positive set : w. Xj > 0  j≤k Negative set : w. Xj k Separating hyperplane

Prof. Pushpak Bhattacharyya, IIT Bombay 11 Linear Separability w. X j = 0 => w is normal to the hyperplane which separates the +ve points from the –ve points. In this computing paradigm, computation means “placing hyperplanes”. Functions computable by the perceptron are called –“threshold functions” because of comparing ∑ w i X i with θ (the threshold) –“linearly separable” because of setting up linear surfaces to separate +ve and –ve points

Prof. Pushpak Bhattacharyya, IIT Bombay 12 Concept of Separability All these need the concept of separability Perceptrons Support Vector Machines Feed-forward networks

Prof. Pushpak Bhattacharyya, IIT Bombay 13 SVMs Variations of the idea of linear separability – Separating plane Vapnik – Statistical Learning Theory

Prof. Pushpak Bhattacharyya, IIT Bombay 14 Feed-Forward Networks Motivation: Most real life data is not linearly separable. If you can’t separate by a single plane, use more planes.

Prof. Pushpak Bhattacharyya, IIT Bombay 15 Feed-Forward Networks (contd) (0,0) (0,1) (1,0) (1,1)0-class 1-class 0-class 1-class x2x2 x1x1 h2h2 h1h1 y hidden layers