An Introduction To The Backpropagation Algorithm: Who gets the credit?
Slides copyright G. A. Tagliarini, PhD

Presentation transcript:


Basic Neuron Model In A Feedforward Network
– Inputs $x_i$ arrive through pre-synaptic connections
– Synaptic efficacy is modeled using real weights $w_i$
– The response of the neuron is a nonlinear function $f$ of its weighted inputs
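The slide states the model in words; in symbols (a standard formulation consistent with the notation used later in the deck, not given explicitly on the slide), the response to inputs $x_1, \ldots, x_n$ is

$$y = f\!\left(\sum_i w_i x_i\right),$$

where $f$ is the nonlinear response function sampled on the Output slide below.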

Inputs To Neurons
– Arise from other neurons or from outside the network
– Nodes whose inputs arise outside the network are called input nodes and simply copy values
– An input may excite or inhibit the response of the neuron to which it is applied, depending upon the weight of the connection

Weights
– Represent synaptic efficacy and may be excitatory or inhibitory
– Normally, positive weights are considered excitatory while negative weights are thought of as inhibitory
– Learning is the process of modifying the weights in order to produce a network that performs some function

Output
– The response function is normally nonlinear
– Samples include:
  – Sigmoid
  – Piecewise linear
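The slide's formulas were figures that did not survive extraction. As a sketch, the two response functions in their standard forms (the piecewise-linear breakpoints here are illustrative assumptions, not taken from the deck):

```python
import numpy as np

def sigmoid(x):
    """Sigmoid response: smooth, bounded in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))

def piecewise_linear(x, lo=0.0, hi=1.0):
    """Piecewise-linear response: identity clipped to [lo, hi].
    The breakpoints lo/hi are illustrative; the slide's figure is lost."""
    return np.clip(x, lo, hi)
```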

Backpropagation Preparation
– Training Set: a collection of input-output patterns that are used to train the network
– Testing Set: a collection of input-output patterns that are used to assess network performance
– Learning Rate η: a scalar parameter, analogous to step size in numerical integration, used to set the rate of adjustments

Network Error
– Total-Sum-Squared-Error (TSSE)
– Root-Mean-Squared-Error (RMSE)
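The slide's formulas were images that did not survive extraction. The standard definitions, consistent with the notation used later ($T_{pj}$ the target and $O_{pj}$ the actual output of output neuron $j$ for pattern $p$), are

$$\mathrm{TSSE} = \sum_{p}\sum_{j} \left(T_{pj} - O_{pj}\right)^2, \qquad \mathrm{RMSE} = \sqrt{\frac{\mathrm{TSSE}}{P\,J}},$$

where $P$ is the number of patterns and $J$ the number of output neurons; one common convention, assumed here, normalizes RMSE by $P \cdot J$.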

A Pseudo-Code Algorithm
– Randomly choose the initial weights
– While error is too large
  – For each training pattern (presented in random order)
    – Apply the inputs to the network
    – Calculate the output for every neuron from the input layer, through the hidden layer(s), to the output layer
    – Calculate the error at the outputs
    – Use the output error to compute error signals for pre-output layers
    – Use the error signals to compute weight adjustments
    – Apply the weight adjustments
  – Periodically evaluate the network performance
(A runnable sketch of this loop follows.)
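A minimal numpy sketch of this loop for one hidden layer, assuming the sigmoid response and the error-signal and update rules given on the following slides. Layer sizes, initial weight range, learning rate, and error threshold are illustrative choices, not values from the deck:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    """Sigmoid response function, as on the Output slide."""
    return 1.0 / (1.0 + np.exp(-x))

def train(patterns, n_hidden=2, eta=0.5, max_epochs=10000, tol=0.01):
    """Backpropagation training for a one-hidden-layer network.

    patterns -- list of (input_vector, target_vector) pairs
    eta      -- learning rate
    tol      -- TSSE threshold below which error is no longer "too large"
    """
    n_in, n_out = len(patterns[0][0]), len(patterns[0][1])
    # Randomly choose the initial weights; bias folded in as a final column
    W_hid = rng.uniform(-0.5, 0.5, (n_hidden, n_in + 1))
    W_out = rng.uniform(-0.5, 0.5, (n_out, n_hidden + 1))
    for _ in range(max_epochs):
        tsse = 0.0
        # For each training pattern, presented in random order
        for idx in rng.permutation(len(patterns)):
            x = np.asarray(patterns[idx][0], dtype=float)
            t = np.asarray(patterns[idx][1], dtype=float)
            # Apply the inputs and feed forward, layer by layer
            h = sigmoid(W_hid @ np.append(x, 1.0))
            o = sigmoid(W_out @ np.append(h, 1.0))
            # Error signal at the outputs, then for the pre-output layer
            delta_out = (t - o) * o * (1.0 - o)
            delta_hid = h * (1.0 - h) * (W_out[:, :-1].T @ delta_out)
            # Compute and apply the weight adjustments
            W_out += eta * np.outer(delta_out, np.append(h, 1.0))
            W_hid += eta * np.outer(delta_hid, np.append(x, 1.0))
            tsse += float(np.sum((t - o) ** 2))
        if tsse < tol:
            break
    return W_hid, W_out
```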

Possible Data Structures
– Two-dimensional arrays
  – Weights (at least for input-to-hidden-layer and hidden-to-output-layer connections)
  – Weight changes ($\Delta W_{ij}$)
– One-dimensional arrays
  – Neuron layers
    – Cumulative current input
    – Current output
    – Error signal for each neuron
  – Bias weights
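As a sketch, the corresponding array declarations; the sizes are illustrative, and separate bias vectors are shown here rather than the folded-in bias column used in the training sketch above:

```python
import numpy as np

n_in, n_hidden, n_out = 2, 2, 1         # illustrative layer sizes

# Two-dimensional arrays: weights and weight changes (kept for momentum)
W_hid  = np.zeros((n_hidden, n_in))     # input-to-hidden weights
W_out  = np.zeros((n_out, n_hidden))    # hidden-to-output weights
dW_hid = np.zeros_like(W_hid)           # previous weight changes
dW_out = np.zeros_like(W_out)

# One-dimensional arrays, one entry per neuron in a layer
net_h   = np.zeros(n_hidden)            # cumulative current input
out_h   = np.zeros(n_hidden)            # current output
delta_h = np.zeros(n_hidden)            # error signal
bias_h  = np.zeros(n_hidden)            # bias weights
```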

Apply Inputs From A Pattern
– Apply the value of each input parameter to the corresponding input node
– Input nodes compute only the identity function
(Figure: feedforward network, inputs on the left, outputs on the right)

Calculate Outputs For Each Neuron Based On The Pattern
– The output from neuron $j$ for pattern $p$ is
$$O_{pj} = \frac{1}{1 + e^{-\mathrm{net}_{pj}}}, \qquad \mathrm{net}_{pj} = \sum_{k} W_{jk}\, O_{pk},$$
where $k$ ranges over the input indices and $W_{jk}$ is the weight on the connection from input $k$ to neuron $j$
(Figure: feedforward network, inputs to outputs)

Calculate The Error Signal For Each Output Neuron
– The output neuron error signal $\delta_{pj}$ is given by
$$\delta_{pj} = (T_{pj} - O_{pj})\, O_{pj}\,(1 - O_{pj})$$
– $T_{pj}$ is the target value of output neuron $j$ for pattern $p$
– $O_{pj}$ is the actual output value of output neuron $j$ for pattern $p$
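The slide does not spell this out, but the factor $O_{pj}(1 - O_{pj})$ is the derivative of the sigmoid: writing $f'(\mathrm{net}_{pj}) = O_{pj}(1 - O_{pj})$, the error signal is simply the output error scaled by the local slope of the activation,

$$\delta_{pj} = (T_{pj} - O_{pj})\, f'(\mathrm{net}_{pj}).$$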

Calculate The Error Signal For Each Hidden Neuron
– The hidden neuron error signal $\delta_{pj}$ is given by
$$\delta_{pj} = O_{pj}\,(1 - O_{pj}) \sum_{k} \delta_{pk}\, W_{kj},$$
where $\delta_{pk}$ is the error signal of a post-synaptic neuron $k$ and $W_{kj}$ is the weight of the connection from hidden neuron $j$ to the post-synaptic neuron $k$

Calculate And Apply Weight Adjustments
– Compute weight adjustments $\Delta W_{ji}$ at time $t$ by
$$\Delta W_{ji}(t) = \eta\, \delta_{pj}\, O_{pi}$$
– Apply weight adjustments according to
$$W_{ji}(t+1) = W_{ji}(t) + \Delta W_{ji}(t)$$
– Some add a momentum term $\alpha\, \Delta W_{ji}(t-1)$
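As a sketch, the update with momentum; the learning rate eta, momentum coefficient alpha, and the example arrays are all illustrative values not given in the deck:

```python
import numpy as np

eta, alpha = 0.5, 0.9                 # illustrative learning rate and momentum
delta = np.array([0.1])               # error signals of post-synaptic neurons
out_pre = np.array([0.7, 0.3])        # outputs O_pi of pre-synaptic neurons
dW_prev = np.zeros((1, 2))            # DeltaW(t-1), the previous adjustment
W = np.zeros((1, 2))                  # W(t)

dW = eta * np.outer(delta, out_pre) + alpha * dW_prev  # DeltaW(t) with momentum
W += dW                                                # W(t+1) = W(t) + DeltaW(t)
dW_prev = dW                                           # remember for the next step
```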

An Example: Exclusive "OR"
Training set
– ((0.1, 0.1), 0.1)
– ((0.1, 0.9), 0.9)
– ((0.9, 0.1), 0.9)
– ((0.9, 0.9), 0.1)
Testing set
– Use at least 121 pairs equally spaced on the unit square (an 11 × 11 grid) and plot the results
– Omit the training set (if desired)
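A hypothetical run of this exercise using the `train()` and `sigmoid()` sketch from the pseudo-code slide above; the grid evaluation mirrors the slide's 121-point suggestion:

```python
import numpy as np
# Assumes train() and sigmoid() from the training-loop sketch above.

xor_patterns = [([0.1, 0.1], [0.1]),
                ([0.1, 0.9], [0.9]),
                ([0.9, 0.1], [0.9]),
                ([0.9, 0.9], [0.1])]

W_hid, W_out = train(xor_patterns, n_hidden=2, eta=0.5)

# Testing set: 121 pairs equally spaced on the unit square (an 11 x 11 grid)
grid = np.linspace(0.0, 1.0, 11)
for x1 in grid:
    for x2 in grid:
        h = sigmoid(W_hid @ np.array([x1, x2, 1.0]))
        o = sigmoid(W_out @ np.append(h, 1.0))
        # plot or tabulate (x1, x2, o) here
```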

An Example (continued): Network Architecture
(Figure: network diagram, inputs on the left, output(s) on the right)

An Example (continued): Network Architecture
(Figure: the network with a sample input applied and its target output labeled)

Feedforward Network Training by Backpropagation: Process Summary
– Select an architecture
– Randomly initialize weights
– While error is too large
  – Select training pattern and feed forward to find actual network output
  – Calculate errors and backpropagate error signals
  – Adjust weights
– Evaluate performance using the test set

An Example (continued): Network Architecture
(Figure: the network with a sample input applied; the actual output is left for the reader to compute, target output 0.9)