Introducing Mixtures of Generalized Mallows in Estimation of Distribution Algorithms Josian Santamaria Josu Ceberio Roberto Santana Alexander Mendiburu.

Introducing Mixtures of Generalized Mallows in Estimation of Distribution Algorithms Josian Santamaria Josu Ceberio Roberto Santana Alexander Mendiburu Jose A. Lozano X Congreso Español de Metaheurísticas, Algoritmos Evolutivos y Bioinspirados - MAEB2015

Outline: Background; The Mallows and Generalized Mallows models; Mixtures of Generalized Mallows models; Experimentation; Conclusions and future work.

Estimation of distribution algorithms: Definition.

Despite their success, estimation of distribution algorithms (EDAs) perform poorly on permutation problems.

Permutation optimization problems: Definition. Combinatorial problems whose solutions are naturally represented as permutations.

Permutation optimization problems: Notation. A permutation σ is a bijection of the set {1, ..., n} onto itself.

Permutation optimization problems: Goal. To find the permutation that minimizes a fitness function. The search space consists of n! solutions.

Permutation optimization problems: Examples. Travelling Salesman Problem (TSP), Permutation Flowshop Scheduling Problem (PFSP), Linear Ordering Problem (LOP), Quadratic Assignment Problem (QAP).

Permutation Flowshop Scheduling Problem: Definition. Objective: total flow time (TFT). [Figure: example with 5 jobs (j1 to j5) and 4 machines (m1 to m4); the processing times form a 5 x 4 matrix.]
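
The total flow time objective can be sketched in a few lines. This is a minimal illustration that assumes the usual flowshop recurrence (a job starts on a machine once both the machine and the job's previous operation are free); the function name `total_flow_time` and the 0-based job indices are choices made here, not taken from the slides.

```python
from typing import Sequence


def total_flow_time(perm: Sequence[int], p: Sequence[Sequence[int]]) -> int:
    """Total flow time of the job order `perm`.

    p[job][machine] is the processing time of `job` on `machine`.
    Jobs and machines are 0-based (an assumption of this sketch).
    """
    n_machines = len(p[0])
    # finish[k]: completion time on machine k of the last job scheduled so far
    finish = [0] * n_machines
    tft = 0
    for job in perm:
        for k in range(n_machines):
            # The job can start on machine k when machine k is free and
            # the job itself has left machine k-1.
            ready = finish[k - 1] if k > 0 else 0
            finish[k] = max(finish[k], ready) + p[job][k]
        # Flow time of a job = its completion time on the last machine.
        tft += finish[-1]
    return tft


if __name__ == "__main__":
    times = [[1, 2], [2, 1]]  # 2 jobs x 2 machines, made-up data
    print(total_flow_time([0, 1], times), total_flow_time([1, 0], times))
```

With the made-up 2 x 2 instance above, the two possible job orders give different objective values, which is what makes the ordering itself the decision variable.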

Estimation of distribution algorithms: Why poor performance? Because of the mutual exclusivity constraints associated with permutations. Our proposal: probability models defined on permutation spaces, namely Mallows, Generalized Mallows, and Plackett-Luce.

The Mallows model: Definition. A distance-based exponential probability model defined by a central permutation, a spread parameter, and a distance on permutations.
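
To make the three ingredients concrete, the sketch below enumerates a Mallows distribution for a tiny n. It assumes the usual form, in which the probability of a permutation is proportional to exp(-θ · distance to the central permutation), and it uses Kendall's-τ as the distance. The names `mallows_pmf`, `sigma0`, and `theta` are illustrative, and brute-force normalization is only practical for very small n.

```python
import math
from itertools import combinations, permutations


def kendall_tau(a, b):
    """Number of position pairs that a and b order differently."""
    return sum((a[i] - a[j]) * (b[i] - b[j]) < 0
               for i, j in combinations(range(len(a)), 2))


def mallows_pmf(sigma0, theta):
    """Probability of every permutation under a Mallows model.

    sigma0: central permutation (tuple of 0..n-1); theta: spread parameter.
    The normalization constant is obtained by enumerating all n! permutations.
    """
    perms = list(permutations(range(len(sigma0))))
    weights = [math.exp(-theta * kendall_tau(s, sigma0)) for s in perms]
    psi = sum(weights)
    return {s: w / psi for s, w in zip(perms, weights)}


if __name__ == "__main__":
    for perm, prob in sorted(mallows_pmf((0, 1, 2), 1.0).items()):
        print(perm, round(prob, 4))
```

With θ = 0 the model is uniform; as θ grows, the probability mass concentrates around the central permutation.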

The Generalized Mallows model: Definition. If the distance can be decomposed as a sum of n-1 terms, the Mallows model can be generalized to the Generalized Mallows model, which has n-1 spread parameters, one per term.
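
For reference, the two models are usually written as follows. The symbols used here (σ0 for the central permutation, θ for the spread parameters, ψ for the normalization constant, S_j for the terms of the distance D) are assumptions of this sketch and may differ from the notation on the original slides.

```latex
% Mallows model: a single spread parameter \theta
P(\sigma) = \frac{\exp\bigl(-\theta\, D(\sigma,\sigma_0)\bigr)}{\psi(\theta)}

% Decomposition of the distance into n-1 terms
D(\sigma,\sigma_0) = \sum_{j=1}^{n-1} S_j(\sigma,\sigma_0)

% Generalized Mallows model: one spread parameter \theta_j per term
P(\sigma) = \frac{\exp\Bigl(-\sum_{j=1}^{n-1} \theta_j\, S_j(\sigma,\sigma_0)\Bigr)}{\psi(\boldsymbol{\theta})}
```

Setting all θ_j to the same value recovers the Mallows model.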

The Generalized Mallows model: Kendall’s-τ distance. It counts the number of pairwise disagreements between two permutations.
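
The sketch below computes Kendall's-τ through the decomposition that the Generalized Mallows model relies on: the distance between σ and π equals the number of inversions of the composition σπ⁻¹, and the inversions can be counted position by position. The function names and the 0-based list representation are assumptions made for illustration.

```python
def compose_with_inverse(sigma, pi):
    """Return the permutation sigma * pi^{-1}, i.e. k -> sigma[pi^{-1}[k]]."""
    inv = [0] * len(pi)
    for i, v in enumerate(pi):
        inv[v] = i
    return [sigma[inv[k]] for k in range(len(pi))]


def inversion_vector(gamma):
    """V_j = number of later positions holding a smaller value (n-1 terms)."""
    n = len(gamma)
    return [sum(gamma[j] > gamma[i] for i in range(j + 1, n))
            for j in range(n - 1)]


def kendall_tau(sigma, pi):
    """Kendall's-tau distance as the sum of the n-1 decomposition terms."""
    return sum(inversion_vector(compose_with_inverse(sigma, pi)))


if __name__ == "__main__":
    print(inversion_vector([2, 0, 1]))        # the n-1 terms
    print(kendall_tau([2, 1, 0], [0, 1, 2]))  # reversed order vs identity
```

Each entry of the inversion vector plays the role of one term of the distance, so each can receive its own spread parameter.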

The Generalized Mallows model: Learning and sampling. Learning in 2 steps: (1) calculate the central permutation; (2) estimate the spread parameters by maximum likelihood. Sampling in 2 steps: (1) sample a vector of distance terms from the model; (2) build a permutation from the vector and the central permutation.
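
The two sampling steps can be sketched as follows for the Kendall's-τ case. This is an illustrative reading, not the authors' implementation: it assumes that each term V_j takes values 0 .. n-1-j with probability proportional to exp(-θ_j · r), and that the sampled vector is decoded as an inversion vector and then composed with the central permutation. Learning is not covered here. All names are hypothetical.

```python
import math
import random


def sample_gmm_kendall(sigma0, thetas, rng):
    """Draw one permutation from a Generalized Mallows model (Kendall's-tau).

    sigma0: central permutation as a list of 0..n-1.
    thetas: n-1 spread parameters, one per distance term.
    rng:    a random.Random instance.
    """
    n = len(sigma0)
    # Step 1: sample each term V_j independently, P(V_j = r) ~ exp(-theta_j * r).
    v = []
    for j in range(n - 1):
        support = list(range(n - j))
        weights = [math.exp(-thetas[j] * r) for r in support]
        v.append(rng.choices(support, weights=weights)[0])
    # Step 2a: decode the vector into the permutation gamma whose
    # inversion vector is v (pick the v_j-th smallest unused value).
    remaining = list(range(n))
    gamma = [remaining.pop(r) for r in v]
    gamma.append(remaining[0])
    # Step 2b: compose with the central permutation so that
    # sigma * sigma0^{-1} = gamma.
    return [gamma[sigma0[i]] for i in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(3):
        print(sample_gmm_kendall([2, 0, 3, 1], [1.0, 1.0, 1.0], rng))
```

Large spread parameters make every term almost surely zero, so the samples collapse onto the central permutation; small ones approach uniform sampling.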

Drawbacks: The Generalized Mallows model is unimodal, so it may not capture the different modes of a heterogeneous population.

Mixtures of Generalized Mallows models.

Mixtures of Generalized Mallows models: Learning. Given a data set of permutations, we compute the maximum likelihood parameters with Expectation Maximization (EM).

Mixtures of Generalized Mallows models: Expectation Maximization (EM). Initialization: initialize the weights and randomly initialize the models in the mixture. E step: estimate the membership weight of each permutation to each cluster. M step: compute the mixture weights from the membership weights, and compute the parameters of each model with the data weighted by membership.
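
A minimal sketch of the E step and of the weight update in the M step is given below. It assumes the textbook EM formulas for mixtures and treats each component as an arbitrary function returning the probability of a permutation; re-estimating the central permutation and spread parameters of each component is left out. The function names and the toy component tables are hypothetical.

```python
def e_step(data, weights, component_pmfs):
    """Membership weight z[i][g] of permutation i in component g.

    weights:        current mixture weights, one per component.
    component_pmfs: one callable per component, mapping a permutation
                    to its probability under that component.
    """
    z = []
    for sigma in data:
        joint = [w * pmf(sigma) for w, pmf in zip(weights, component_pmfs)]
        total = sum(joint)
        z.append([x / total for x in joint])
    return z


def update_weights(z):
    """M step for the mixture weights: average membership per component."""
    n_components = len(z[0])
    return [sum(row[g] for row in z) / len(z) for g in range(n_components)]


if __name__ == "__main__":
    # Toy components over permutations of size 2 (made-up probabilities).
    comp_a = {(0, 1): 0.9, (1, 0): 0.1}.get
    comp_b = {(0, 1): 0.1, (1, 0): 0.9}.get
    data = [(0, 1), (0, 1), (1, 0)]
    z = e_step(data, [0.5, 0.5], [comp_a, comp_b])  # equal start is an assumption
    print(z, update_weights(z))
```

In a full implementation the two functions would alternate with a weighted refit of each component until the likelihood stops improving.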

Mixtures of Generalized Mallows models: Sampling. Stochastic Universal Sampling.
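
Stochastic Universal Sampling places equally spaced pointers over the cumulative weights, using a single random offset. The sketch below assumes it is used to decide how many new solutions to draw from each component of the mixture; that reading, and the name `sus_counts`, are assumptions rather than details confirmed by the slides.

```python
import random


def sus_counts(weights, n_samples, rng):
    """Number of samples assigned to each component by SUS.

    weights:   non-negative mixture weights (need not sum to 1).
    n_samples: total number of solutions to draw.
    rng:       a random.Random instance.
    """
    step = sum(weights) / n_samples
    start = rng.uniform(0, step)      # one random offset for all pointers
    counts = [0] * len(weights)
    g = 0
    cumulative = weights[0]
    for k in range(n_samples):
        pointer = start + k * step
        # Advance to the component whose cumulative interval holds the pointer.
        while pointer > cumulative and g < len(weights) - 1:
            g += 1
            cumulative += weights[g]
        counts[g] += 1
    return counts


if __name__ == "__main__":
    print(sus_counts([0.25, 0.75], 8, random.Random(1)))
```

Compared with drawing the component of each sample independently, this keeps the number of samples per component close to its expected value.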

Experiments: Settings. Problems: Permutation Flowshop Scheduling Problem (10 instances) and Quadratic Assignment Problem (10 instances).

The quadratic assignment problem (QAP)

Experiments: Settings. Algorithms: Generalized Mallows EDA with Kendall’s-τ; Mixtures of Generalized Mallows EDA with Kendall’s-τ; Generalized Mallows EDA with Cayley; Mixtures of Generalized Mallows EDA with Cayley.

Other distances: Cayley distance. It is the minimum number of swap operations needed to convert one permutation into another.
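
The minimum number of swaps can be computed without searching: it equals n minus the number of cycles of the composition σπ⁻¹. A minimal sketch under that standard identity, with 0-based lists and an illustrative function name:

```python
def cayley_distance(sigma, pi):
    """Minimum number of swaps (of any two items) turning sigma into pi."""
    n = len(sigma)
    inv = [0] * n
    for i, v in enumerate(pi):
        inv[v] = i
    gamma = [sigma[inv[k]] for k in range(n)]  # sigma * pi^{-1}
    # Count the cycles of gamma; each cycle of length c needs c-1 swaps.
    seen = [False] * n
    cycles = 0
    for start in range(n):
        if not seen[start]:
            cycles += 1
            k = start
            while not seen[k]:
                seen[k] = True
                k = gamma[k]
    return n - cycles


if __name__ == "__main__":
    print(cayley_distance([2, 1, 0], [0, 1, 2]))
```

The reversed permutation of size 3 is one swap away from the identity under Cayley but three adjacent transpositions away under Kendall’s-τ, which illustrates how differently the two distances shape the neighbourhood of the central permutation.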

Experiments: Settings. Two models in the mixture (G=2). Performance measure: Average Relative Percentage Deviation (ARPD) over 20 repetitions. Stopping criterion: 100n-1 generations.
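
The ARPD measure can be sketched as below. This assumes the common definition, where each repetition's result is compared against a reference value for the instance (for example the best known solution) and the percentage deviations are averaged; which reference value the authors used is not stated, so `reference` is a placeholder.

```python
def arpd(results, reference):
    """Average Relative Percentage Deviation for a minimization problem.

    results:   fitness values obtained in the independent repetitions.
    reference: reference fitness for the instance (assumed, e.g. best known).
    """
    deviations = [100.0 * (f - reference) / reference for f in results]
    return sum(deviations) / len(deviations)


if __name__ == "__main__":
    print(arpd([110.0, 120.0], 100.0))  # made-up values
```

Lower values are better, and a value of 0 means every repetition matched the reference.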

Experiments: Settings. The algorithms are implemented as an extension of the MATEDA toolbox for the Matlab mathematical computing environment.

Experimentation: Results on QAP. [Table: ten QAP instances identified by size n; columns Instance, GM ken, Mix ken, GM cay, Mix cay.]

Experimentation: Results on PFSP. [Table: ten PFSP instances identified by size n; columns Instance, GM ken, Mix ken, GM cay, Mix cay.]

Results summary. Generalized Mallows EDA: Kendall’s-PFSP, Kendall’s-QAP. Mixtures of Generalized Mallows EDA: Cayley-PFSP, Cayley-QAP.

Conclusions: Mixture models show promising results.

Future work: Investigate why the distances behave differently.

Future work: Evaluate the performance of mixtures with more components (G>2) and implement methods that tune the parameter G automatically.

Future work: Extend the experimentation to larger instances and more problems.
