Evolution of descent directions
Alejandro Sierra, Escuela Politécnica Superior, Universidad Autónoma de Madrid
Iván Santibáñez Koref, Bionik und Evolutionstechnik, Technical University of Berlin
Outline
- Estimation of distribution algorithms (EDA)
- A naive EDA
- Beyond the naive EDA: IDEA, MBOA, CMA-ES
- Classical optimization algorithms
- Evolution of descent directions (ED²)
Estimation of distribution algorithms
An EDA is an optimization algorithm that samples from a probability density function (pdf). The pdf is updated in an evolutionary way: a population of samples is drawn, and the best representatives are used to update the parameters of the pdf.
A naive EDA
Initialization of each Normal pdf: random means, standard deviations = 1.
Repeat until a good solution is found:
- Take λ samples from the product of the univariate Normal pdfs.
- Recalculate the means and deviations from the μ best samples (μ << λ).
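The naive EDA can be sketched as follows. The population sizes, initialization range, generation count, and the sphere test function are all illustrative choices, not settings from the talk:

```python
import numpy as np

def naive_eda(f, dim, lam=50, mu=10, generations=200, seed=0):
    """Naive EDA: one independent Normal pdf per variable."""
    rng = np.random.default_rng(seed)
    means = rng.uniform(-2, 2, dim)  # random initial means (range is illustrative)
    stds = np.ones(dim)              # initial standard deviations = 1
    for _ in range(generations):
        # Take lambda samples from the product of univariate Normals.
        pop = rng.normal(means, stds, size=(lam, dim))
        # Keep the mu best samples (mu << lambda).
        elite = pop[np.argsort([f(x) for x in pop])[:mu]]
        # Recalculate means and deviations from the elite.
        means = elite.mean(axis=0)
        stds = elite.std(axis=0) + 1e-12  # tiny floor avoids total collapse
    return means

# Example: minimize the separable sphere function.
sol = naive_eda(lambda x: float(np.sum(x**2)), dim=5)
```

Because each variable has its own independent Normal, this sketch works well on separable problems like the sphere but cannot model dependencies between variables, which is exactly the limitation discussed next.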
Beyond the naive EDA
Two ways out:
- Use a full multidimensional Normal distribution (CMA-ES).
- Use Bayesian networks to learn more complex joint probability relationships (IDEA, MBOA).
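A quick numerical check of why the full multivariate Normal matters (the 0.9 correlation value and sample size are just an illustration): a product of univariate Normals always produces uncorrelated samples, while a full covariance matrix, as used by CMA-ES, can capture dependencies between variables.

```python
import numpy as np

rng = np.random.default_rng(0)
cov = np.array([[1.0, 0.9],
                [0.9, 1.0]])  # strong correlation between the two variables
# Full multivariate Normal: samples reproduce the correlation.
full = rng.multivariate_normal(np.zeros(2), cov, size=10000)
# Product of independent univariate Normals: no correlation possible.
naive = rng.normal(0.0, 1.0, size=(10000, 2))

r_full = np.corrcoef(full.T)[0, 1]   # close to 0.9
r_naive = np.corrcoef(naive.T)[0, 1] # close to 0
```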
IDEA and MBOA have very heavy learning procedures. We would like to keep the naive approach without giving up variable dependencies, taking classical minimization algorithms as inspiration.
Classical optimization algorithms
Classical minimization of a function f(x):
1. Generate a random point x.
2. Generate a random direction v.
3. Run a line minimization algorithm to find the step λ_v along v.
4. Update x.
5. Update v and go back to step 3.
[Figures: f(x) with the initial point, and the point found by line minimization.]
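A minimal sketch of this loop, assuming a golden-section search as the line minimization algorithm and the sphere function as a stand-in objective (both are illustrative choices, not taken from the talk):

```python
import numpy as np

def line_minimize(f, x, v, lo=-5.0, hi=5.0, iters=60):
    """Golden-section search for the step t minimizing f(x + t*v).
    Assumes f is unimodal along the line within [lo, hi]."""
    phi = (np.sqrt(5.0) - 1) / 2
    a, b = lo, hi
    c, d = b - phi * (b - a), a + phi * (b - a)
    for _ in range(iters):
        if f(x + c * v) < f(x + d * v):
            b, d = d, c                  # minimum lies in [a, d]
            c = b - phi * (b - a)
        else:
            a, c = c, d                  # minimum lies in [c, b]
            d = a + phi * (b - a)
    return (a + b) / 2

def classical_minimize(f, dim, steps=200, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.uniform(-5, 5, dim)          # 1. generate a random point x
    for _ in range(steps):
        v = rng.normal(size=dim)         # 2./5. generate (update) a direction v
        v /= np.linalg.norm(v)
        t = line_minimize(f, x, v)       # 3. line minimization gives the step
        x = x + t * v                    # 4. update x
    return x

x_min = classical_minimize(lambda x: float(np.sum(x**2)), dim=5)
```

Picking each new direction at random, as above, is the simplest choice for step 5; smarter classical schemes (e.g. conjugate directions) reuse information from previous line minimizations.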
ED²: the naive EDA of descent directions
Directions are sampled from a factorized probability density function. Each direction is a model of the correlation between variables, so a product of Normal distributions is enough.
ED²: the algorithm
The initial Normal distributions are randomly generated and the best point is initialized. Then the following steps are repeated:
- Each direction is used to improve the best point by interpolation.
- The fitness of a direction is the drop in the objective value it produces.
- The pdfs are updated from the best directions, and new directions are sampled from the product of Normal distributions.
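A toy sketch of these steps, assuming one parabolic-interpolation step through t = -1, 0, 1 as the "improvement by interpolation" and an ad hoc floor on the deviations to keep direction diversity; all constants and the sphere objective are illustrative, not the authors' settings:

```python
import numpy as np

def ed2(f, dim, lam=20, mu=5, generations=100, seed=0):
    """Sketch of ED²: a naive per-component EDA over descent directions."""
    rng = np.random.default_rng(seed)
    d_means = rng.normal(size=dim)   # randomly generated initial Normals
    d_stds = np.ones(dim)
    x = rng.uniform(-5, 5, dim)      # initialize the best point
    fx = f(x)
    for _ in range(generations):
        dirs = rng.normal(d_means, d_stds, size=(lam, dim))
        drops = np.empty(lam)
        for i, v in enumerate(dirs):
            # Improve the best point by parabolic interpolation
            # through the three points t = -1, 0, 1 along v.
            fm, fp = f(x - v), f(x + v)
            denom = fm - 2 * fx + fp             # curvature along v
            t = 0.5 * (fm - fp) / denom if denom > 1e-12 else 0.0
            cand = x + t * v
            fc = f(cand)
            drops[i] = fx - fc                   # fitness = drop in f
            if fc < fx:                          # keep the improvement
                x, fx = cand, fc
        # Update the pdfs from the mu best directions; the floor on the
        # deviations (an ad hoc choice) keeps some direction diversity.
        elite = dirs[np.argsort(-drops)[:mu]]
        d_means = elite.mean(axis=0)
        d_stds = np.maximum(elite.std(axis=0), 0.3)
    return x, fx

x_best, f_best = ed2(lambda x: float(np.sum(x**2)), dim=5)
```

Note that only the directions evolve; the best point is improved greedily, and the direction pdf gradually concentrates on directions that produce large drops.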
32
ED2: Results for the cigar function FunctionCMA-ESIDEAMBOAED 2 Cigar1 (3840)4.6122.2 Rotated cigar1 (3840)3821003.7 CMA-ES takes 3840 function evaluations till reaching f(x)=10 -10 IDEA takes 4.6 times more evaluations
33
Conclusions ED 2 : Evolution of descent directions Sampling of directions from a product of normal distributions ED 2 is very fast Future work: More complex line minimization algorithms