Hydra-MIP: Automated Algorithm Configuration and Selection for Mixed Integer Programming
Lin Xu, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown
Department of Computer Science, University of British Columbia
Solving MIP More Effectively
– Portfolio-based algorithm selection (SATzilla) [Xu et al., 2007; 2008; 2009]
  – But where do the solvers come from? Parameter settings of a single solver (e.g., CPLEX)
– How to find good settings? An automated algorithm configuration tool [Hutter et al., 2007; 2009]
– How to find good candidates for algorithm selection? Algorithm configuration with a dynamic performance metric [Xu et al., 2010]
Hydra
Hydra combines two lines of work:
– Portfolio-based algorithm selection; particularly related work: [Rice, 1976]; [Leyton-Brown, Nudelman & Shoham, 2003; 2009]; [Guerri & Milano, 2004]; [Nudelman, Leyton-Brown, Shoham & Hoos, 2004]
– Automated algorithm configuration; particularly related work: [Gratch & Dejong, 1992]; [Balaprakash, Birattari & Stuetzle, 2007]; [Hutter, Babic, Hoos & Hu, 2007]; [Hutter, Hoos, Stuetzle & Leyton-Brown, 2009]
Outline
– Improving algorithm selection
  – SATzilla
  – Drawbacks of SATzilla
  – New SATzilla with cost-sensitive classification
  – Results
– Reducing the construction cost
  – Hydra
  – The cost
  – Making full use of configuration
  – Results
– Conclusion
SATzilla: Portfolio-Based Algorithm Selection [Xu, Hutter, Hoos & Leyton-Brown, 2007; 2008]
Given:
– a training set of instances
– a performance metric
– candidate solvers
– a portfolio builder (incl. instance features)
Training:
– collect performance data
– the portfolio builder learns predictive models
At runtime:
– predict each solver's performance on the novel instance
– select the solver with the best prediction
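To make the training/runtime split concrete, here is a minimal Python sketch of regression-based selection. The feature and runtime arrays are made-up stand-ins, and scikit-learn's ridge regression on log runtimes stands in for SATzilla's empirical hardness models:

```python
import numpy as np
from sklearn.linear_model import Ridge

# Made-up stand-in data: 100 training instances with 10 features each,
# and observed runtimes for two candidate solvers.
rng = np.random.default_rng(0)
X = rng.random((100, 10))
runtimes = {"solverA": rng.random(100) + 0.1,
            "solverB": rng.random(100) + 0.1}

# Training: one regression model per candidate solver,
# predicting log runtime from instance features.
models = {name: Ridge().fit(X, np.log(times))
          for name, times in runtimes.items()}

def select_solver(x):
    """At runtime: predict each solver's runtime and pick the lowest."""
    preds = {name: m.predict(x.reshape(1, -1))[0] for name, m in models.items()}
    return min(preds, key=preds.get)

print(select_solver(rng.random(10)))
```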
Drawback of SATzilla
Algorithm selection in SATzilla is based on regression:
– predict each solver's performance independently
– select the solver with the best prediction
– i.e., classification implemented via regression
But the goals differ. Regression aims to accurately predict each solver's performance; algorithm selection aims to pick solvers on a per-instance basis so as to minimize an overall performance metric. Better regression does not necessarily yield better algorithm selection.
Cost-Sensitive Classification for SATzilla
Loss function: the performance difference between solvers
– misclassifications are punished in direct proportion to their impact on portfolio performance
– no need to predict runtime at all
Implementation: binary cost-sensitive classifiers, realized as decision forests (DFs)
– build one DF for each pair of candidate solvers
– each DF casts one vote for the better solver of its pair
– the solver with the most votes is selected
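A minimal sketch of this pairwise voting scheme, with made-up data; scikit-learn's RandomForestClassifier with performance-difference sample weights approximates the paper's cost-sensitive decision forests:

```python
import numpy as np
from itertools import combinations
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.random((100, 10))                        # made-up instance features
runtimes = {s: rng.random(100) for s in ("A", "B", "C")}

# One cost-sensitive classifier per solver pair: the label says which
# solver of the pair is faster, and the sample weight is the runtime
# difference, so costly misclassifications dominate the training loss.
pair_models = {}
for s1, s2 in combinations(runtimes, 2):
    y = (runtimes[s2] < runtimes[s1]).astype(int)    # 1 iff s2 is better
    w = np.abs(runtimes[s1] - runtimes[s2])          # cost of getting it wrong
    pair_models[(s1, s2)] = RandomForestClassifier().fit(X, y, sample_weight=w)

def select_solver(x):
    """Each pairwise forest casts one vote; the most-voted solver wins."""
    votes = dict.fromkeys(runtimes, 0)
    for (s1, s2), model in pair_models.items():
        winner = s2 if model.predict(x.reshape(1, -1))[0] == 1 else s1
        votes[winner] += 1
    return max(votes, key=votes.get)

print(select_solver(rng.random(10)))
```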
SATzilla DF Performance

Dataset   LR: avg. time / solved   DF: avg. time / solved   DF speedup over LR
RAND      – / –                    – / –                    1.08×
HAND      – / –                    – / –                    1.16×
INDU      – / –                    – / –                    1.12×

LR: linear regression as used in previous SATzilla; DF: cost-sensitive decision forest.
MIPzilla DF Performance

Dataset      LR: avg. time / solved   DF: avg. time / solved   DF speedup over LR
–            – / –                    – / –                    1.00×
–            – / –                    – / –                    1.04×
ISAC (new)   – / –                    – / –                    1.18×
MIX          56 s / 99.6%             48 s / 99.6%             1.05×

LR: linear regression as used in previous SATzilla; DF: cost-sensitive decision forest.
Hydra Procedure: Iterations 1–3
[Diagram, shown over three build slides: in each iteration, the algorithm configurator takes the training set, the performance metric, and the parameterized algorithm, produces a new candidate solver, and adds it to the candidate solver set; the portfolio builder then constructs a portfolio-based algorithm selector from that set.]
Hydra Procedure: After Termination
Output: a portfolio-based algorithm selector that maps each novel instance to a selected solver.
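In pseudocode, the loop shown in the diagrams looks roughly as follows; this is a sketch, and `configure`, `marginal_metric`, and `build_selector` are hypothetical stand-ins for the algorithm configurator, Hydra's dynamic performance metric, and the portfolio builder:

```python
def hydra(training_set, metric, parameterized_algorithm, n_iterations):
    """Sketch of the Hydra loop: grow the candidate solver set by one
    configuration per iteration, then build a selector over the set."""
    solvers = []
    for _ in range(n_iterations):
        # Configure the parameterized algorithm to optimize its marginal
        # contribution to the current solver set (the dynamic metric).
        new_solver = configure(parameterized_algorithm, training_set,
                               marginal_metric(metric, solvers))
        solvers.append(new_solver)
    # After termination: a portfolio-based algorithm selector is the output.
    return build_selector(solvers, training_set, metric)
```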
We Are Wasting Configuration Results!
[Diagram: the algorithm configurator, given the training set, metric, and parameterized algorithm, evaluates many configurations but returns only a single candidate solver.]
Make Full Use of Configurations
[Diagram: the same configurator run instead returns its k best configurations, yielding k candidate solvers.]
Make Full Use of Configurations
Advantages:
– adds k solvers instead of 1 in each iteration (good for algorithm selection)
– no need for a validation step after configuration (saves time)
Disadvantage:
– runtime data must be collected for more solvers (costs time)
In our experiments, the time saved roughly equaled the extra cost (for k = 4); the change is sketched below.
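Using the same hypothetical helpers as the earlier sketch, Improvement II changes only the inner step: the configurator's k best configurations are all kept, and the validation step is skipped. `configure_k` is again a hypothetical stand-in:

```python
def hydra_k(training_set, metric, parameterized_algorithm, n_iterations, k=4):
    """Hydra with Improvement II: keep the configurator's k best
    configurations per iteration instead of one validated winner."""
    solvers = []
    for _ in range(n_iterations):
        # configure_k is a hypothetical stand-in that returns the k best
        # configurations found in one run, skipping the validation step.
        new_solvers = configure_k(parameterized_algorithm, training_set,
                                  marginal_metric(metric, solvers), k=k)
        solvers.extend(new_solvers)   # the solver set grows by k per iteration
    return build_selector(solvers, training_set, metric)
```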
Experimental Setup: Hydra's Inputs
– Portfolio builders: MIPzilla_LR (SATzilla for MIP) [Xu et al., 2008]; MIPzilla_DF (MIPzilla using cost-sensitive DFs)
– Parameterized solver: CPLEX 12.1
– Algorithm configurator: FocusedILS [Hutter, Hoos & Leyton-Brown, 2009]
– Performance metric: penalized average runtime (PAR)
– Instance sets: 4 heterogeneous sets built by combining homogeneous subsets [Hutter et al., 2010]; [Kadioglu et al., 2010]; [Ahmadizadeh et al., 2010]
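For concreteness, PAR averages runtimes while counting unsolved runs at a multiple of the cutoff. A minimal sketch, assuming the common PAR-10 penalty factor (the exact factor is not stated on this slide):

```python
def penalized_average_runtime(runtimes, cutoff, penalty_factor=10):
    """PAR: average runtime where unsolved runs (runtime >= cutoff)
    are counted as penalty_factor * cutoff."""
    return sum(t if t < cutoff else penalty_factor * cutoff
               for t in runtimes) / len(runtimes)

# e.g., with a 3600 s cutoff, one timeout counts as 36000 s:
# penalized_average_runtime([10.0, 3600.0, 42.0], cutoff=3600)
```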
Three Versions of Hydra for MIP
– Hydra_{LR,1}: original Hydra for MIP [Xu et al., 2010]
– Hydra_{DF,1}: Hydra for MIP with Improvement I (cost-sensitive DF selection)
– Hydra_{DF,4}: Hydra for MIP with Improvements I and II (k = 4 solvers per iteration)
MIP-Hydra Performance on MIX
– Hydra_{DF,*} performs better than Hydra_{LR,1}
– Hydra_{DF,4} performs similarly to Hydra_{DF,1}, but converges faster
– performance is close to that of the Oracle and MIPzilla_DF
Conclusion
– Cost-sensitive-classification-based SATzilla outperforms the original SATzilla.
– The new Hydra-MIP outperforms the CPLEX default, algorithm configuration alone, and the original Hydra on four heterogeneous MIP sets.
Technical contributions:
– cost-sensitive classification yields better algorithm selection for SAT and MIP
– using multiple configurations per iteration speeds up the convergence of Hydra