Weight Initialization for Backpropagation with Genetic Algorithms Anoop Kunchukuttan Sandeep Limaye Ashish Lahane.

Slides:



Advertisements
Similar presentations
Genetic Algorithms Chapter 3. A.E. Eiben and J.E. Smith, Introduction to Evolutionary Computing Genetic Algorithms GA Quick Overview Developed: USA in.
Advertisements

CS6800 Advanced Theory of Computation
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
Genetic Algorithms Contents 1. Basic Concepts 2. Algorithm
Genetic Algorithms Representation of Candidate Solutions GAs on primarily two types of representations: –Binary-Coded –Real-Coded Binary-Coded GAs must.
Institute of Intelligent Power Electronics – IPE Page1 Introduction to Basics of Genetic Algorithms Docent Xiao-Zhi Gao Department of Electrical Engineering.
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
Genetic Algorithm for Variable Selection
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2002.
Genetic Algorithms Nehaya Tayseer 1.Introduction What is a Genetic algorithm? A search technique used in computer science to find approximate solutions.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2004.
Genetic Algorithms Overview Genetic Algorithms: a gentle introduction –What are GAs –How do they work/ Why? –Critical issues Use in Data Mining –GAs.
Optimization of thermal processes2007/2008 Optimization of thermal processes Maciej Marek Czestochowa University of Technology Institute of Thermal Machinery.
Genetic Algorithms: A Tutorial
Genetic Algorithm.
A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Integrating Neural Network and Genetic Algorithm to Solve Function Approximation Combined with Optimization Problem Term presentation for CSC7333 Machine.
Soft Computing Lecture 18 Foundations of genetic algorithms (GA). Using of GA.
SOFT COMPUTING (Optimization Techniques using GA) Dr. N.Uma Maheswari Professor/CSE PSNA CET.
Genetic algorithms Prof Kang Li
Optimization in Engineering Design Georgia Institute of Technology Systems Realization Laboratory Mixed Integer Problems Most optimization algorithms deal.
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Genetic Algorithms Michael J. Watts
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
What is Genetic Programming? Genetic programming is a model of programming which uses the ideas (and some of the terminology) of biological evolution to.
Optimization Problems - Optimization: In the real world, there are many problems (e.g. Traveling Salesman Problem, Playing Chess ) that have numerous possible.
Neural and Evolutionary Computing - Lecture 5 1 Evolutionary Computing. Genetic Algorithms Basic notions The general structure of an evolutionary algorithm.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
Genetic Algorithms Genetic Algorithms – What are they? And how they are inspired from evolution. Operators and Definitions in Genetic Algorithms paradigm.
Genetic Algorithms Introduction Advanced. Simple Genetic Algorithms: Introduction What is it? In a Nutshell References The Pseudo Code Illustrations Applications.
Soft Computing A Gentle introduction Richard P. Simpson.
Genetic Algorithms Siddhartha K. Shakya School of Computing. The Robert Gordon University Aberdeen, UK
Neural and Evolutionary Computing - Lecture 9 1 Evolutionary Neural Networks Design  Motivation  Evolutionary training  Evolutionary design of the architecture.
GENETIC ALGORITHM A biologically inspired model of intelligence and the principles of biological evolution are applied to find solutions to difficult problems.
1 Genetic Algorithms K.Ganesh Introduction GAs and Simulated Annealing The Biology of Genetics The Logic of Genetic Programmes Demo Summary.
Genetic Algorithms Przemyslaw Pawluk CSE 6111 Advanced Algorithm Design and Analysis
Chapter 9 Genetic Algorithms.  Based upon biological evolution  Generate successor hypothesis based upon repeated mutations  Acts as a randomized parallel.
09/20/04 Introducing Proteins into Genetic Algorithms – CSIMTA'04 Introducing “Proteins” into Genetic Algorithms Virginie LEFORT, Carole KNIBBE, Guillaume.
Edge Assembly Crossover
Genetic Algorithms What is a GA Terms and definitions Basic algorithm.
ECE 103 Engineering Programming Chapter 52 Generic Algorithm Herbert G. Mayer, PSU CS Status 6/4/2014 Initial content copied verbatim from ECE 103 material.
Genetic Algorithms. 2 Overview Introduction To Genetic Algorithms (GAs) GA Operators and Parameters Genetic Algorithms To Solve The Traveling Salesman.
Probabilistic Algorithms Evolutionary Algorithms Simulated Annealing.
Chapter 12 FUSION OF FUZZY SYSTEM AND GENETIC ALGORITHMS Chi-Yuan Yeh.
EE749 I ntroduction to Artificial I ntelligence Genetic Algorithms The Simple GA.
Genetic Algorithms. The Basic Genetic Algorithm 1.[Start] Generate random population of n chromosomes (suitable solutions for the problem) 2.[Fitness]
CITS7212: Computational Intelligence An Overview of Core CI Technologies Lyndon While.
Parallel Genetic Algorithms By Larry Hale and Trevor McCasland.
Waqas Haider Bangyal 1. Evolutionary computing algorithms are very common and used by many researchers in their research to solve the optimization problems.
Innovative and Unconventional Approach Toward Analytical Cadastre – based on Genetic Algorithms Anna Shnaidman Mapping and Geo-Information Engineering.
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
Genetic Algorithms. Underlying Concept  Charles Darwin outlined the principle of natural selection.  Natural Selection is the process by which evolution.
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
Overview Last two weeks we looked at evolutionary algorithms.
Genetic Algorithms. Solution Search in Problem Space.
EVOLUTIONARY SYSTEMS AND GENETIC ALGORITHMS NAME: AKSHITKUMAR PATEL STUDENT ID: GRAD POSITION PAPER.
Genetic Algorithms And other approaches for similar applications Optimization Techniques.
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
 Presented By: Abdul Aziz Ghazi  Roll No:  Presented to: Sir Harris.
Presented By: Farid, Alidoust Vahid, Akbari 18 th May IAUT University – Faculty.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Genetic Algorithms.
CSC 380: Design and Analysis of Algorithms
Genetic Algorithms Chapter 3.
EE368 Soft Computing Genetic Algorithms.
CSC 380: Design and Analysis of Algorithms
Population Methods.
Presentation transcript:

Weight Initialization for Backpropagation with Genetic Algorithms Anoop Kunchukuttan Sandeep Limaye Ashish Lahane

Seminar Outline Weight Initialization Problem Introduction to Genetic Algorithms GA based solution

Problem Definition Part - I

Backpropagation Feed-forward neural network training method Minimizes Mean Square Error Gradient descent Greedy Method

Weight Initialization Problem  Backpropagation is sensitive to initial conditions  Initial conditions such as - initial weights - learning factor - momentum factor

Effect of Initial Weights  Can get stuck in local minima  May converge too slowly  Achieved generalization can be poor

Solution Use of global search methods to get the final weights, such as: Genetic algorithms, Simulated Annealing etc. Use global search methods to get closer to global minimum & then run local search (BP) to get exact solution We will concentrate on the latter method using Genetic algorithms

Genetic Algorithms A primer Part - II

Motivation Inspired by evolution in living organisms “Survival of the fittest” “Fitter” individuals in the population preferred to reproduce to create new generation

Other algorithms inspired by nature Neural networks – neurons in brain Simulated annealing – physical properties of metals GA, NN have partly overlapped application areas – pattern recognition, ML, image processing, expert systems

Basic Steps Coding the problem Determining “fitness” Selection of parents for reproduction Recombination to produce new generation

Coding GA in natureGA in CS GenesParameters of solution Chromosome = collection of genesChromosome = concatenated parameter values Genotype = set of parameters represented in the chromosome Phenotype = Finished “construction” of the individual (values assigned to parameters)

Generic Model for Genetic Algorithm (1)

Generic Model for Genetic Algorithm (2) BEGIN /* genetic algorithm */ generate initial population WHILE NOT finished DO BEGIN compute fitness of each individual IF population has converged THEN finished := TRUE ELSE /* reproductive cycle */ apply genetic operators to produce children END

Determining fitness Fitness function Varies from problem to problem Usually returns a value that we want to optimize Example: Strength / Weight ratio in a civil engineering problem Unimodal / Multimodal fitness functions – Multimodal: several peaks

Selection Individuals selected to recombine and produce offspring Selection of parents based on “Roulette Wheel” algorithm Fitter parents may get selected multiple times, not- so-fit may not get selected at all Child can be less fit than parents, but such a child will probably “die out” without reproducing in the next generation.

Roulette Wheel selection

Selection - variations Fitness-based v/s rank-based

Recombination - Crossover Crossover probability (typically 0.6 < CP < 1) If no crossover, child is identical to parent

Crossover – variations (1) One-point crossover

Crossover – variations (2) Two-point crossover

Crossover – variations (3) Uniform crossover

Recombination - Mutation Crossover – rapidly searches a large problem space Mutation – allows more fine-grained search

Convergence Gene convergence: when 95% of the population has the same value for that gene Population convergence: when all the genes reach convergence

Design Issues Goldberg’s principles of coding (building block hypothesis): – Related genes should be close together – Little interaction between genes Is it possible, (and if yes, how) to find coding schemes that obey the principles? If not, can the GA be modified to improve its performance in these circumstances?

GA - Plus points Robust: Wide range of problems can be tackled “Acceptably good” solutions in “acceptably quick” time Explore wide range of solution space reasonably quickly Combination of Exploration and Exploitation Hybridizing existing algorithms with GAs often proves beneficial

GA - shortcomings Need not always give a globally optimum solution! Probabilistic behaviour

Weight Initialization using GA Part - III

Applying GA to Neural Nets Optimizing the NN using GA as the optimization algorithm Weight initialization Learning the network topology Learning network parameters

Why use genetic algorithms? Global heuristic search (Global Sampling) Get close to the optimal solution faster Genetic operators create a diversity in the population, due to which a larger solution space can be explored.

The Intuition Optimizing using only GA is not efficient. – Might move away after getting close to the solution. Gradient Descent can zoom in to a solution in a local neighbourhood. Exploit the global search of GA with local search of backpropagation. This hybrid approach has been observed to be efficient.

The Hybrid (GA-NN) Approach GA provides ‘seeds’ for the BP to run.

Basic Architecture GABP-FF Random encoded weights Fittest Candidates from GA for BP Next generation candidates Evaluate Fitness Function Final weights

Considerations Initially high learning rate – To reduce required number of BP iterations Learning rate and number of BP iterations as a function of fitness criteria: – To speed up convergence – To exploit local search capability of BP Tradeoff between number of GA generations and BP iterations.

Encoding Problem Q. What to encode? Ans. Weights. Q. How? Ans. Binary Encoding

Binary Weight Encoding

Issues Order of the weights Code length Weights are real valued

Code Length ‘l ‘ Code length determines - resolution - precision - size of solution space to be searched Minimal precision requirement  minimum ‘ l ’  limits the efforts to improve GA search by reducing gene length

Real Value Encoding Methods Gray-Scale DPE (Dynamic Parameter Encoding)

Gray-Code Encoding Gray-code : Hamming distance between any two consecutive numbers is 1 Better than or at least same as binary encoding - Example

Randomization + Transformation

Drawbacks Too large search space  large ‘l ’  higher resolution  high precision  long time for GA to converge Instead: search can proceed from coarse to finer precision: i.e. DPE

DPE (Dynamic Parameter Encoding) use small ‘l’  coarse precision  search for most favored area  refine mapping  again search  iterate till convergence with desired precision achieved Zooming operation

Zooming

Fitness Function & Selection Mean Square Error Rate of change of Mean Square Error

Genetic operators – problem-specific variations (1) UNBIASED-MUTATE-WEIGHTS – With fixed p, replace weight by new randomly chosen value BIASED-MUTATE-WEIGHTS – With fixed p, add randomly chosen value to existing weight value – biased towards existing weight value – exploitation

Genetic operators – problem-specific variations (2) MUTATE-NODES – Mutate genes for incoming weights to n non-input neurons – Intuition: such genes form a logical subgroup, changing them together may result in better evaluation MUTATE-WEAKEST-NODES – Strength of a node = (n/w evaluation) – (n/w evaluation with this node “disabled” i.e. its outgoing weights set to 0) – Select m weakest nodes and mutate their incoming / outgoing weights – Does not improve nodes that are already “doing well”, so should not be used as a single recombination operator

Genetic operators – problem-specific variations (3) CROSSOVER-WEIGHTS – Similar to uniform crossover CROSSOVER-NODES – Crossover the incoming weights of a parent node together – Maintains “logical subgroups” across generations

Genetic operators – problem-specific variations (4) OPERATOR-PROBABILITIES – Decides which operator(s) from the operator pool get applied for a particular recombination – Initially, all operators have equal probability – An adaptation mechanism tunes the probabilities as the generations evolve

Computational Complexity Training time could be large due to running multiple generations of GA and multiple runs of BP. Total Time = Generations*PopulationSize*Training Time There is a tradeoff between the number of generations and number of trials to each individual

Conclusion and Future work Hybrid approach promising Correct choice of encoding and recombination operators are important. As part of course project – Implement the hybrid approach – Experiment with different learning rates and number of iterations and adapting them

References David Beasley, David Bull, Ralph Martin: An Overview of Genetic Algorithms, University Computing, Genetic Algorithm Tutorial - Zbygniew Michalewicz, Genetic Algorithms + Data Structures = Evolution Programs, Springer-Verlag, 1992 Training Feedforward networks using genetic algorithms, Montana, Davis, ICJAI’89 Evolving Networks-Using the Genetic Algorithm with Connectionist Learning, Belew,Schraudolf Backpropagation is sensitive to initial conditions, Kolen Combining genetic algorithms and neural nets-The Encoding Problem Dynamic Parameter Encoding For Genetic Algorithms, Belew,Schraudolf