G5AIAI Introduction to AI

Slides:



Advertisements
Similar presentations
Multi-Layer Perceptron (MLP)
Advertisements

Learning in Neural and Belief Networks - Feed Forward Neural Network 2001 년 3 월 28 일 안순길.
1 Neural networks. Neural networks are made up of many artificial neurons. Each input into the neuron has its own weight associated with it illustrated.
Artificial Neural Network
G5BAIM Artificial Intelligence Methods Graham Kendall Neural Networks.
Introduction to Artificial Intelligence (G51IAI)
Biological and Artificial Neurons Michael J. Watts
Simple Neural Nets For Pattern Classification
Neural Networks.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
Connectionist Modeling Some material taken from cspeech.ucd.ie/~connectionism and Rich & Knight, 1991.
Before we start ADALINE
Data Mining with Neural Networks (HK: Chapter 7.5)
The McCulloch-Pitts Neuron. Characteristics The activation of a McCulloch Pitts neuron is binary. Neurons are connected by directed weighted paths. A.
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
1 Introduction to Artificial Neural Networks Andrew L. Nelson Visiting Research Faculty University of South Florida.
MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way
1 st Neural Network: AND function Threshold(Y) = 2 X1 Y X Y.
Learning in neural networks Chapter 19 DRAFT. Biological neuron.
Neural Networks AI – Week 23 Sub-symbolic AI Multi-Layer Neural Networks Lee McCluskey, room 3/10
Artificial Intelligence Neural Networks ( Chapter 9 )
Semiconductors, BP&A Planning,
1 Machine Learning The Perceptron. 2 Heuristic Search Knowledge Based Systems (KBS) Genetic Algorithms (GAs)
Artificial Intelligence Methods Neural Networks Lecture 4 Rakesh K. Bissoondeeal Rakesh K. Bissoondeeal.
Artificial Neural Networks. The Brain How do brains work? How do human brains differ from that of other animals? Can we base models of artificial intelligence.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
Introduction to Artificial Intelligence (G51IAI) Dr Rong Qu Neural Networks.
ADVANCED PERCEPTRON LEARNING David Kauchak CS 451 – Fall 2013.
Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.
Introduction to Neural Networks Introduction to Neural Networks Applied to OCR and Speech Recognition An actual neuron A crude model of a neuron Computational.
IE 585 History of Neural Networks & Introduction to Simple Learning Rules.
COSC 4426 AJ Boulay Julia Johnson Artificial Neural Networks: Introduction to Soft Computing (Textbook)
1 Perceptron as one Type of Linear Discriminants IntroductionIntroduction Design of Primitive UnitsDesign of Primitive Units PerceptronsPerceptrons.
Artificial Intelligence Methods Neural Networks Lecture 1 Rakesh K. Bissoondeeal Rakesh K.
Perceptrons Michael J. Watts
Previous Lecture Perceptron W  t+1  W  t  t  d(t) - sign (w(t)  x)] x Adaline W  t+1  W  t  t  d(t) - f(w(t)  x)] f’ x Gradient.
Artificial Intelligence CIS 342 The College of Saint Rose David Goldschmidt, Ph.D.
Artificial Intelligence Methods Neural Networks Lecture 3 Rakesh K. Bissoondeeal Rakesh K. Bissoondeeal.
Start with student evals. What function does perceptron #4 represent?
Bab 5 Classification: Alternative Techniques Part 4 Artificial Neural Networks Based Classifer.
Neural networks.
Fall 2004 Backpropagation CS478 - Machine Learning.
Neural Networks.
Learning with Perceptrons and Neural Networks
Learning in Neural Networks
Artificial neural networks:
Real Neurons Cell structures Cell body Dendrites Axon
Announcements HW4 due today (11:59pm) HW5 out today (due 11/17 11:59pm)
with Daniel L. Silver, Ph.D. Christian Frey, BBA April 11-12, 2017
Artificial Neural Networks
FUNDAMENTAL CONCEPT OF ARTIFICIAL NETWORKS
Hebb and Perceptron.
Binary Decision Diagrams
CSE (c) S. Tanimoto, 2004 Neural Networks
of the Artificial Neural Networks.
Perceptron as one Type of Linear Discriminants
CS623: Introduction to Computing with Neural Nets (lecture-4)
Perceptrons Introduced in1957 by Rosenblatt
Artificial Intelligence Lecture No. 28
CSE (c) S. Tanimoto, 2001 Neural Networks
CSE (c) S. Tanimoto, 2002 Neural Networks
Artificial neurons Nisheeth 10th January 2019.
Artificial Intelligence 12. Two Layer ANNs
Artificial Neural Networks
CSE (c) S. Tanimoto, 2007 Neural Nets
Introduction to Neural Network
The McCullough-Pitts Neuron
David Kauchak CS158 – Spring 2019

Artificial Neural Network
Presentation transcript:

G5AIAI Introduction to AI Graham Kendall Neural Networks

Neural Networks AIMA – Chapter 19 Fundamentals of Neural Networks : Architectures, Algorithms and Applications. L, Fausett, 1994 An Introduction to Neural Networks (2nd Ed). Morton, IM, 1995

Neural Networks McCulloch & Pitts (1943) are generally recognised as the designers of the first neural network Many of their ideas still used today (e.g. many simple units combine to give increased computational power and the idea of a threshold)

Neural Networks Hebb (1949) developed the first learning rule (on the premise that if two neurons were active at the same time the strength between them should be increased)

Neural Networks During the 50’s and 60’s many researchers worked on the perceptron amidst great excitement. 1969 saw the death of neural network research for about 15 years – Minsky & Papert Only in the mid 80’s (Parker and LeCun) was interest revived (in fact Werbos discovered algorithm in 1974)

Neural Networks

Neural Networks We are born with about 100 billion neurons A neuron may connect to as many as 100,000 other neurons

Neural Networks Signals “move” via electrochemical signals The synapses release a chemical transmitter – the sum of which can cause a threshold to be reached – causing the neuron to “fire” Synapses can be inhibitory or excitatory

The First Neural Neural Networks McCulloch and Pitts produced the first neural network in 1943 Many of the principles can still be seen in neural networks of today

The First Neural Neural Networks -1 2 X1 X2 X3 Y The activation of a neuron is binary. That is, the neuron either fires (activation of one) or does not fire (activation of zero).

The First Neural Neural Networks -1 2 X1 X2 X3 Y For the network shown here the activation function for unit Y is f(y_in) = 1, if y_in >= θ else 0 where y_in is the total input signal received θ is the threshold for Y

The First Neural Neural Networks -1 2 X1 X2 X3 Y Neurons in a McCulloch-Pitts network are connected by directed, weighted paths

The First Neural Neural Networks -1 2 X1 X2 X3 Y If the weight on a path is positive the path is excitatory, otherwise it is inhibitory

The First Neural Neural Networks -1 2 X1 X2 X3 Y All excitatory connections into a particular neuron have the same weight, although different weighted connections can be input to different neurons

The First Neural Neural Networks -1 2 X1 X2 X3 Y Each neuron has a fixed threshold. If the net input into the neuron is greater than or equal to the threshold, the neuron fires

The First Neural Neural Networks -1 2 X1 X2 X3 Y The threshold is set such that any non-zero inhibitory input will prevent the neuron from firing

The First Neural Neural Networks -1 2 X1 X2 X3 Y It takes one time step for a signal to pass over one connection.

The First Neural Neural Networks AND Function 1 X1 X2 Y Threshold(Y) = 2

The First Neural Neural Networks AND Function OR Function 2 X1 X2 Y Threshold(Y) = 2

The First Neural Neural Networks AND NOT Function -1 2 X1 X2 Y Threshold(Y) = 2

The First Neural Neural Networks XOR Function 2 -1 Z1 Z2 Y X1 X2 X1 XOR X2 = (X1 AND NOT X2) OR (X2 AND NOT X1)

The First Neural Neural Networks If we touch something cold we perceive heat If we keep touching something cold we will perceive cold If we touch something hot we will perceive heat

The First Neural Neural Networks To model this we will assume that time is discrete If cold is applied for one time step then heat will be perceived If a cold stimulus is applied for two time steps then cold will be perceived If heat is applied then we should perceive heat

The First Neural Neural Networks X1 X2 Z1 Z2 Y1 Y2 Heat Cold 2 1 -1 Hot

The First Neural Neural Networks X1 X2 Z1 Z2 Y1 Y2 Heat Cold 2 1 -1 Hot It takes time for the stimulus (applied at X1 and X2) to make its way to Y1 and Y2 where we perceive either heat or cold At t(0), we apply a stimulus to X1 and X2 At t(1) we can update Z1, Z2 and Y1 At t(2) we can perceive a stimulus at Y2 At t(2+n) the network is fully functional

The First Neural Neural Networks We want the system to perceive cold if a cold stimulus is applied for two time steps Y2(t) = X2(t – 2) AND X2(t – 1)

The First Neural Neural Networks We want the system to perceive heat if either a hot stimulus is applied or a cold stimulus is applied (for one time step) and then removed Y1(t) = [ X1(t – 1) ] OR [ X2(t – 3) AND NOT X2(t – 2) ]

The First Neural Neural Networks The network shows Y1(t) = X1(t – 1) OR Z1(t – 1) Z1(t – 1) = Z2( t – 2) AND NOT X2(t – 2) Z2(t – 2) = X2(t – 3) Substituting, we get Y1(t) = [ X1(t – 1) ] OR [ X2(t – 3) AND NOT X2(t – 2) ] which is the same as our original requirements

The First Neural Neural Networks You can confirm that Y2 works correctly You can also check it works on the spreadsheet

Modelling a Neuron aj :Activation value of unit j wj,I :Weight on the link from unit j to unit i inI :Weighted sum of inputs to unit i aI :Activation value of unit i g :Activation function

Activation Functions Stept(x) = 1 if x >= t, else 0 Sign(x) = +1 if x >= 0, else –1 Sigmoid(x) = 1/(1+e-x) Identity Function

Simple Networks

Simple Networks t = 0.0 y x W = 1.5 W = 1 -1

Perceptron Synonym for Single- Layer, Feed-Forward Network First Studied in the 50’s Other networks were known about but the perceptron was the only one capable of learning and thus all research was concentrated in this area

Perceptron A single weight only affects one output so we can restrict our investigations to a model as shown on the right Notation can be simpler, i.e.

What can perceptrons represent?

What can perceptrons represent? 0,0 0,1 1,0 1,1 AND XOR Functions which can be separated in this way are called Linearly Separable Only linearly Separable functions can be represented by a perceptron

What can perceptrons represent? Linear Separability is also possible in more than 3 dimensions – but it is harder to visualise

Training a perceptron Aim

Training a perceptrons y x -1 W = 0.3 W = -0.4 W = 0.5

Learning While epoch produces an error Present network with next inputs from epoch Err = T – O If Err <> 0 then Wj = Wj + LR * Ij * Err End If End While

Learning While epoch produces an error Present network with next inputs from epoch Err = T – O If Err <> 0 then Wj = Wj + LR * Ij * Err End If End While Epoch : Presentation of the entire training set to the neural network. In the case of the AND function an epoch consists of four sets of inputs being presented to the network (i.e. [0,0], [0,1], [1,0], [1,1])

Learning While epoch produces an error Present network with next inputs from epoch Err = T – O If Err <> 0 then Wj = Wj + LR * Ij * Err End If End While Training Value, T : When we are training a network we not only present it with the input but also with a value that we require the network to produce. For example, if we present the network with [1,1] for the AND function the training value will be 1

Learning While epoch produces an error Present network with next inputs from epoch Err = T – O If Err <> 0 then Wj = Wj + LR * Ij * Err End If End While Error, Err : The error value is the amount by which the value output by the network differs from the training value. For example, if we required the network to output 0 and it output a 1, then Err = -1

Learning Output from Neuron, O : The output value from the neuron While epoch produces an error Present network with next inputs from epoch Err = T – O If Err <> 0 then Wj = Wj + LR * Ij * Err End If End While Output from Neuron, O : The output value from the neuron Ij : Inputs being presented to the neuron Wj : Weight from input neuron (Ij) to the output neuron LR : The learning rate. This dictates how quickly the network converges. It is set by a matter of experimentation. It is typically 0.1

Learning After First Epoch Note I1 point = W0/W1 I2 point = W0/W2 0,0 0,1 1,0 1,1 I1 I2 After First Epoch Note I1 point = W0/W1 I2 point = W0/W2 0,0 0,1 1,0 1,1 I1 I2 At Convergence

And Finally…. “If the brain were so simple that we could understand it then we’d be so simple that we couldn’t” Lyall Watson

G5AIAI Introduction to AI Graham Kendall End of Neural Networks