Design of an Evolutionary Algorithm M&F, ch. 7 why I like this textbook and what I don’t like about it!
M&F’s model of Evolutionary Algorithm Five aspects of specification 1.(7.1) representation (e.g., set of variable values) 2.(7.5) initial population (e.g., random) 3.(7.2) evaluation function ( e.g., fitness) 4.(7.4) selection procedure (next generation pop.) 5.(7.3) random variation operators (e.g., crossover, mutation)
7.1 Representation SAT representation, n propositions –vector of n boolean values effective when variables are independent (no or few constraints between variables) --> vector of parameters or symbols
7.1.1 Fixed length vector V = {v 1, v 2, …, v n }, v i D i size of search space is |D 1 | x |D 2 | x…x |D n | e.g., SAT, sum of integers puzzle
7.1.2 permutation for TSP and similar problems, the vector representation includes too many bad solutions (repeated or missing cities) –because all variables are from same domain {D, A, E, C, B}“without replacement” n variables as permutation size of search space is n!
Vector vs Permutation Vector {x 1, x 2, x 3 } Space {1,1,1}, {1,1,2}, {1,1,3}, {1,2,1}, {1,2,2}, {1,2,3}, {1,3,1}, {1,3,2}, {1,3,3}, {2,1,1}, {2,1,2}, {2,1,3}, {2,2,1}, {2,2,2}, {2,2,3}, {2,3,1}, {2,3,2}, {2,3,3}, {3,1,1}, {3,1,2}, {3,1,3}, {3,2,1}, {3,2,2}, {3,2,3}, {3,3,1}, {3,3,2}, {3,3,3} Permutation {x 1, x 2, x 3 } Space {1,2,3} {1,3,2} {2,1,3} {2,3,1} {3,1,2} {3,2,1}
Permutation applications TSP job shop scheduling –k processing machines –n jobs each job must be processed at some subset of the k machines in a specific order and takes some time to be processed at each machine –determine best order of job processing at each machine to minimize time of processing schedule
7.1.3 Finite state machines extension of vector representation: e.g., DFA (Q, ∑, , q 0, F) –Q: set of states {q i } –∑: alphabet of input symbols {s j } – : transition table Q x ∑ --> Q –F: subset of Q, accepting states
Searching for a DFA : transition table Q x ∑ --> Q s1s1 s2s2 s3s3 s4s4 s5s5 q1q1 q2q2 q 22 q3q3 q4q4 (Q, ∑, , q 0, F ) –Q: set of states {qi} –∑: alphabet of input symbols {sj} – : transition table Q x ∑ --> Q –F : subset of Q, accepting states representation*: {q 11, …, q 45,q 0, f 1, …, f 4 } *could include number of states - variable length vector
7.1.4 Symbolic expressions searching for an expression defining a function based on input data and a set of available operations –learning programs –Spore game of evolution
Symbolic expression syntax tree, size variable + sin. y3 x example representation: PreOrder traversal {+, x, sin,., 3, y}
Symbolic expression syntax tree, size variable v v, a value, belongs to domain that includes the input data (e.g., x and y) and the set of possible operations with operand(s) (tree is extended) op op, an operator, belongs to domain that includes the set of possible operations with same number of operands and the input data (tree is pruned)
Symbolic expression x + sin. y3 x + yx + x x / yx transformation to neighbour
7.2 Evaluation function may be the fitness function of the problem, …, or not must measure progress toward an optimal or satisficing solution should operate on the chosen representation (avoid transformation)
Desirable evaluation features efficient - searches typically spend most resources on evaluation incremental - revise evaluation when going from one solution to a neighbour, instead of redo
7.3 Variation operators equivalent of neighbourhood definition in local search: –rules for getting next candidate solution(s) determining the connectivity of the solution space (edges) and the metric dependent on any number of current solutions: –0: e.g., random –1: e.g., mutation –2: e.g., crossover
variation of fixed length vectors one parent: change value of one of n variables (mutation) –densely connected space - maximum n variation between any two solution –less densely connected - vary by single step, 1 two or more parents: crossover, many variations that each connect space differently –one or more cut points –random choice for each variable
one parent: –2-opt, etc. two or more parents: –many attempts to simulate crossover most need ‘repair’ to avoid repetition most are more costly of resources most inherit some properties but destroy others e.g., p175 variation of permutations
variation of finite state machines like vectors except when states and symbols can be added or deleted
variations of symbolic expressions one parent: see representation section two parents: –exchange subtrees
7.4 Selection two selection factors 1.which solutions should contributed to variation? 2.which solutions should survive to next generation? s 1,s 2,s 3,s 4,s 5,s 6 at t SELECTION VARIATION s 1,s 2,s 3,s 4,s 5,s 6, c 1,c 2,c 3,c 4 SELECTION s 1,s 2,s 3,s 4,s 5,s 6 at t+1
selection strategies deterministic evaluation comparison faster convergence stochastic probabilistic selection 1.based on fitness 2.best of subsets 3.ranking in subsets broader sampling
7.5 Initial populations uniform sampling –try to span the search space random sampling informed sampling –“guidance” - use good solutions from other sources