Artificial Neural Networks : An Introduction

Slides:

Advertisements

Similar presentations

Learning in Neural and Belief Networks - Feed Forward Neural Network 2001 년 3 월 28 일 안순길.

Advertisements

Introduction to Artificial Neural Networks

Introduction to Neural Networks Computing

5/16/2015Intelligent Systems and Soft Computing1 Introduction Introduction Hebbian learning Hebbian learning Generalised Hebbian learning algorithm Generalised.

Artificial neural networks:

Artificial Neural Networks - Introduction -

Machine Learning: Connectionist McCulloch-Pitts Neuron Perceptrons Multilayer Networks Support Vector Machines Feedback Networks Hopfield Networks.

Machine Learning Neural Networks

Artificial Intelligence (CS 461D)

Simple Neural Nets For Pattern Classification

Neural Networks.

Prénom Nom Document Analysis: Artificial Neural Networks Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.

Slides are based on Negnevitsky, Pearson Education, Lecture 8 Artificial neural networks: Unsupervised learning n Introduction n Hebbian learning.

PERCEPTRON. Chapter 3: The Basic Neuron  The structure of the brain can be viewed as a highly interconnected network of relatively simple processing.

1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.

AN INTERACTIVE TOOL FOR THE STOCK MARKET RESEARCH USING RECURSIVE NEURAL NETWORKS Master Thesis Michal Trna

Before we start ADALINE

Artificial Neural Networks : An Introduction

Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.

Artificial neural networks:

Machine Learning. Learning agent Any other agent.

Neurons, Neural Networks, and Learning 1. Human brain contains a massively interconnected net of (10 billion) neurons (cortical cells) Biological.

MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way

Artificial Intelligence Lecture No. 28 Dr. Asad Ali Safi Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.

Presentation on Neural Networks.. Basics Of Neural Networks Neural networks refers to a connectionist model that simulates the biophysical information.

Artificial Neural Networks

Neural NetworksNN 11 Neural netwoks thanks to: Basics of neural network theory and practice for supervised and unsupervised.

Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.

Chapter 9 Neural Network.

Artificial Neural Network Unsupervised Learning

Chapter 3 Neural Network Xiu-jun GONG (Ph. D) School of Computer Science and Technology, Tianjin University

1 Chapter 6: Artificial Neural Networks Part 2 of 3 (Sections 6.4 – 6.6) Asst. Prof. Dr. Sukanya Pongsuparb Dr. Srisupa Palakvangsa Na Ayudhya Dr. Benjarath.

Artificial Neural Networks : An Introduction G.Anuradha.

1 Machine Learning The Perceptron. 2 Heuristic Search Knowledge Based Systems (KBS) Genetic Algorithms (GAs)

Artificial Neural Network Supervised Learning دكترمحسن كاهاني

NEURAL NETWORKS FOR DATA MINING

LINEAR CLASSIFICATION. Biological inspirations  Some numbers…  The human brain contains about 10 billion nerve cells ( neurons )  Each neuron is connected.

Artificial Neural Networks. The Brain How do brains work? How do human brains differ from that of other animals? Can we base models of artificial intelligence.

Neural Networks and Fuzzy Systems Hopfield Network A feedback neural network has feedback loops from its outputs to its inputs. The presence of such loops.

1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.

Artificial Neural Networks An Introduction. What is a Neural Network? A human Brain A porpoise brain The brain in a living creature A computer program.

Introduction to Artificial Intelligence (G51IAI) Dr Rong Qu Neural Networks.

Feed-Forward Neural Networks 主講人 : 虞台文. Content Introduction Single-Layer Perceptron Networks Learning Rules for Single-Layer Perceptron Networks – Perceptron.

METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Part 8: Neural Networks.

Features of Biological Neural Networks 1)Robustness and Fault Tolerance. 2)Flexibility. 3)Ability to deal with variety of Data situations. 4)Collective.

Neural Networks Steven Le. Overview Introduction Architectures Learning Techniques Advantages Applications.

Features of Biological Neural Networks 1)Robustness and Fault Tolerance. 2)Flexibility. 3)Ability to deal with variety of Data situations. 4)Collective.

Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.

1 Lecture 6 Neural Network Training. 2 Neural Network Training Network training is basic to establishing the functional relationship between the inputs.

Lecture 5 Neural Control

Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.

Neural Networks Teacher: Elena Marchiori R4.47 Assistant: Kees Jong S2.22

Supervised learning network G.Anuradha. Learning objectives The basic networks in supervised learning Perceptron networks better than Hebb rule Single.

Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.

IE 585 History of Neural Networks & Introduction to Simple Learning Rules.

Chapter 6 Neural Network.

1 Neural networks 2. 2 Introduction: Neural networks The nervous system contains 10^12 interconnected neurons.

“Principles of Soft Computing, 2 nd Edition” by S.N. Sivanandam & SN Deepa Copyright  2011 Wiley India Pvt. Ltd. All rights reserved. CHAPTER 2 ARTIFICIAL.

Introduction To Artificial Neural System. Artificial Neuron Y =f( Σw i X i ) Σ X 1 X 2 X 3 X n.

Supervised Learning – Network is presented with the input and the desired output. – Uses a set of inputs for which the desired outputs results / classes.

Learning in Neural Networks

Artificial neural networks:

DEPARTMENT: COMPUTER SC. & ENGG. SEMESTER : VII

with Daniel L. Silver, Ph.D. Christian Frey, BBA April 11-12, 2017

FUNDAMENTAL CONCEPT OF ARTIFICIAL NETWORKS

Hebb and Perceptron.

CSE 573 Introduction to Artificial Intelligence Neural Networks

The Network Approach: Mind as a Web

Introduction to Neural Network

Presentation transcript:

Artificial Neural Networks : An Introduction G.Anuradha

Learning Objectives Fundamentals of ANN Comparison between biological neuron and artificial neuron Basic models of ANN Different types of connections of NN, Learning and activation function Basic fundamental neuron model-McCulloch-Pitts neuron and Hebb network

Fundamental concept NN are constructed and implemented to model the human brain. Performs various tasks such as pattern-matching, classification, optimization function, approximation, vector quantization and data clustering. These tasks are difficult for traditional computers

ANN ANN posess a large number of processing elements called nodes/neurons which operate in parallel. Neurons are connected with others by connection link. Each link is associated with weights which contain information about the input signal. Each neuron has an internal state of its own which is a function of the inputs that neuron receives- Activation level

Artificial Neural Networks x1 x2 X1 X2 w1 w2 Y y

Neural net of pure linear eqn. Input X m Y mx

Information flow in nervous system

Biological Neural Network

Neuron and a sample of pulse train

Biological Neuron Has 3 parts End of axon splits into fine strands Soma or cell body:- cell nucleus is located Dendrites:- nerve connected to cell body Axon: carries impulses of the neuron End of axon splits into fine strands Each strand terminates into a bulb-like organ called synapse Electric impulses are passed between the synapse and dendrites Synapses are of two types Inhibitory:- impulses hinder the firing of the receiving cell Excitatory:- impulses cause the firing of the receiving cell Neuron fires when the total of the weights to receive impulses exceeds the threshold value during the latent summation period After carrying a pulse an axon fiber is in a state of complete nonexcitability for a certain time called the refractory period.

Characteristics of ANN Neurally implemented mathematical model There exist a highly interconnected processing units called neurons Interconnections with their weighted linkages hold information ANN have the ability to learn, recall, generalize by adjustment of weights Computational power is by collective behavior and no single neuron carries specific information

Evolution of ANN Year Neural Network Designer Description 1943 McCulloch and Pitts Neuron Mcculloch Pitts Logic gates 1949 Hebb Strength increases if neurons are active 1958-1988 Perceptron Rosenblatt Weights of path can be adjusted 1960 Adaline Widrow and Hoff Mean squared error 1972 SOM Kohenen Clustering 1982 Hopfield John Hopfield Associative memory nets 1986 Back Propagation Rumelhard Multilayer 1987-90 ART Carpenter Used for both binary and analog

McCulloch-Pitts Neuron Model

Features of McCulloch-Pitts model Allows binary 0,1 states only Operates under a discrete-time assumption Weights and the neurons’ thresholds are fixed in the model and no interaction among network neurons Just a primitive model

General symbol of neuron consisting of processing node and synaptic connections

Neuron Modeling for ANN Is referred to activation function. Domain is set of activation values net. Scalar product of weight and input vector Neuron as a processing node performs the operation of summation of its weighted input.

Activation function Bipolar binary and unipolar binary are called as hard limiting activation functions used in discrete neuron model Unipolar continuous and bipolar continuous are called soft limiting activation functions are called sigmoidal characteristics.

Activation functions Bipolar continuous Bipolar binary functions

Activation functions Unipolar continuous Unipolar Binary

Common models of neurons Binary perceptrons Continuous perceptrons

Comparison between brain verses computer ANN Speed Few ms. Few nano sec. massive ||el processing Size and complexity 1011 neurons & 1015 interconnections Depends on designer Storage capacity Stores information in its interconnection or in synapse. No Loss of memory Contiguous memory locations loss of memory may happen sometimes. Tolerance Has fault tolerance No fault tolerance Inf gets disrupted when interconnections are disconnected Control mechanism Complicated involves chemicals in biological neuron Simpler in ANN

Basic models of ANN

Classification based on interconnections Feed forward Single layer Multilayer Feed Back Recurrent Network architecture is the arrangement of neurons to form layers and connection patterns formed within and between layers

Single layer Feedforward Network

Feedforward Network Its output and input vectors are respectively Weight wij connects the i’th neuron with j’th input. Activation rule of ith neuron is where EXAMPLE

Multilayer feed forward network Can be used to solve complicated problems

Example of feedforward network

Feedback network When outputs are directed back as inputs to same or preceding layer nodes it results in the formation of feedback networks. The input is given only once and then removed

Categories of feedback Discrete feedback Continuous feedback

Discrete feedback network The mapping of o(t) with o(t+∆) is given by The time step is unity and time instances are indexed as positive integers. For discrete feedback network A system with discrete time inputs and discrete data representation is called an automaton. (Recurrent Neural networks)

Recurrent n/ws Single node with own feedback Competitive nets Feedback networks with closed loop are called Recurrent Networks. The response at the k+1’th instant depends on the entire history of the network starting at k=0. Automaton: A system with discrete time inputs and a discrete data representation is called an automaton Single node with own feedback Competitive nets Single-layer recurrent nts Multilayer recurrent networks

Single node with own feedback

Single layer Recurrent Networks

Competitive networks

Lateral feedback If the feedback of the output of the processing elements is directed back as input to the processing elements in the same layer then it is called lateral feedback

Continuous feedback networks Continuous time networks employ neurons with continuous activation function. Used in electrical circuits. Relation between resistance, inductance and capacitance Concepts from zurada

Basic models of ANN

Learning It’s a process by which a NN adapts itself to a stimulus by making proper parameter adjustments, resulting in the production of desired response Two kinds of learning Parameter learning:- connection weights are updated Structure Learning:- change in network structure

Training The process of modifying the weights in the connections between network layers with the objective of achieving the expected output is called training a network. This is achieved through Supervised learning Unsupervised learning Reinforcement learning

Classification of learning Supervised learning Unsupervised learning Reinforcement learning

Supervised Learning Child learns from a teacher Each input vector requires a corresponding target vector. Training pair=[input vector, target vector] Neural Network W X Y (Actual output) (Input) Error (D-Y) signals Error Signal Generator (Desired Output)

Supervised learning contd. Supervised learning does minimization of error

Unsupervised Learning How a fish or tadpole learns All similar input patterns are grouped together as clusters. If a matching input pattern is not found a new cluster is formed

Unsupervised learning

Self-organizing In unsupervised learning there is no feedback Network must discover patterns, regularities, features for the input data over the output While doing so the network might change in parameters This process is called self-organizing

Reinforcement Learning X NN W Y (Input) (Actual output) Error signals Error Signal Generator R Reinforcement signal

When Reinforcement learning is used? If less information is available about the target output values (critic information) Learning based on this critic information is called reinforcement learning and the feedback sent is called reinforcement signal Feedback in this case is only evaluative and not instructive

Basic models of ANN

Activation Function Identity Function f(x)=x for all x Binary Step function Bipolar Step function Sigmoidal Functions:- Continuous functions Ramp functions:- sivanandem

Some learning algorithms we will learn are Supervised: Adaline, Madaline Perceptron Back Propagation multilayer perceptrons Radial Basis Function Networks Unsupervised Competitive Learning Kohenen self organizing map Learning vector quantization Hebbian learning

Neural processing Recall:- processing phase for a NN and its objective is to retrieve the information. The process of computing o for a given x Basic forms of neural information processing Auto association Hetero association Classification Zurada

Neural processing-Autoassociation Set of patterns can be stored in the network If a pattern similar to a member of the stored set is presented, an association with the input of closest stored pattern is made

Neural Processing- Heteroassociation Associations between pairs of patterns are stored Distorted input pattern may cause correct heteroassociation at the output

Neural processing-Classification Set of input patterns is divided into a number of classes or categories In response to an input pattern from the set, the classifier is supposed to recall the information regarding class membership of the input pattern.

Important terminologies of ANNs Weights Bias Threshold Learning rate Momentum factor Vigilance parameter Notations used in ANN sivanendem

Weights Each neuron is connected to every other neuron by means of directed links Links are associated with weights Weights contain information about the input signal and is represented as a matrix Weight matrix also called connection matrix

Weight matrix W= =

Weights contd… wij –is the weight from processing element ”i” (source node) to processing element “j” (destination node) X1 1 Xi Yj Xn w1j wij wnj bj

Activation Functions Used to calculate the output response of a neuron. Sum of the weighted input signal is applied with an activation to obtain the response. Activation functions can be linear or non linear Already dealt Identity function Single/binary step function Discrete/continuous sigmoidal function.

Bias Bias is like another weight. Its included by adding a component x0=1 to the input vector X. X=(1,X1,X2…Xi,…Xn) Bias is of two types Positive bias: increase the net input Negative bias: decrease the net input

Why Bias is required? The relationship between input and output given by the equation of straight line y=mx+c C(bias) Input X Y y=mx+C

Threshold Set value based upon which the final output of the network may be calculated Used in activation function The activation function using threshold can be defined as

Learning rate Denoted by α. Used to control the amount of weight adjustment at each step of training Learning rate ranging from 0 to 1 determines the rate of learning in each time step

Other terminologies Momentum factor: Vigilance parameter: used for convergence when momentum factor is added to weight updation process. Vigilance parameter: Denoted by ρ Used to control the degree of similarity required for patterns to be assigned to the same cluster

Neural Network Learning rules c – learning constant

Hebbian Learning Rule FEED FORWARD UNSUPERVISED LEARNING The learning signal is equal to the neuron’s output

Features of Hebbian Learning Feedforward unsupervised learning “When an axon of a cell A is near enough to exicite a cell B and repeatedly and persistently takes place in firing it, some growth process or change takes place in one or both cells increasing the efficiency” If oixj is positive the results is increase in weight else vice versa

Perceptron Learning rule Learning signal is the difference between the desired and actual neuron’s response Learning is supervised

Delta Learning Rule Only valid for continuous activation function Used in supervised training mode Learning signal for this rule is called delta The aim of the delta rule is to minimize the error over all training patterns

Delta Learning Rule Contd. Learning rule is derived from the condition of least squared error. Calculating the gradient vector with respect to wi Minimization of error requires the weight changes to be in the negative gradient direction

Widrow-Hoff learning Rule Also called as least mean square learning rule Introduced by Widrow(1962), used in supervised learning Independent of the activation function Special case of delta learning rule wherein activation function is an identity function ie f(net)=net Minimizes the squared error between the desired output value di and neti

Winner-Take-All learning rules

Winner-Take-All Learning rule Contd… Can be explained for a layer of neurons Example of competitive learning and used for unsupervised network training Learning is based on the premise that one of the neurons in the layer has a maximum response due to the input x This neuron is declared the winner with a weight

Summary of learning rules

Linear Separability Separation of the input space into regions is based on whether the network response is positive or negative Line of separation is called linear-separable line. Example:- AND function & OR function are linear separable Example EXOR function Linearly inseparable. Example

Hebb Network Hebb learning rule is the simpliest one The learning in the brain is performed by the change in the synaptic gap When an axon of cell A is near enough to excite cell B and repeatedly keep firing it, some growth process takes place in one or both cells According to Hebb rule, weight vector is found to increase proportionately to the product of the input and learning signal.

Flow chart of Hebb training algorithm Start 1 Initialize Weights Activate output y=t Weight update For Each s:t n Bias update b(new)=b(old) + y y Activate input xi=si Stop 1