CP 2001: Formal Models of Heavy-Tailed Behavior in Combinatorial Search. Hubie Chen, Carla P. Gomes, and Bart Selman

Presentation transcript:

Formal Models of Heavy-Tailed Behavior in Combinatorial Search
Hubie Chen, Carla P. Gomes, and Bart Selman
Department of Computer Science, Cornell University

Background
Randomized backtrack search methods show high variability in run time on a fixed instance: heavy-tailed behavior (Gomes et al., CP '97, JAR '00).
This gives new insights into the design of search algorithms, in particular restart strategies.
Randomization and restart strategies are now an integral part of state-of-the-art SAT solvers (Chaff, GRASP, RELSAT, SATZ-Rand).

Goals
Research on heavy tails in search has thus far been based largely on empirical studies. Our goals:
- Formal analysis of tree search models: show under what conditions heavy-tailed distributions can and cannot arise.
- Understand when restart strategies are and are not effective.

Intuition
How does heavy-tailed behavior arise?
The procedure is characterized by large variability, which leads to very different search trees from run to run.
Wrong branching decisions may lead the search procedure to explore exponentially large subtrees of the search space that contain no solutions.
A lucky sequence of good branching decisions may lead the search to find a solution after exploring only a small subtree.

Intuition Pump: Restarts
When are restarts effective? Suppose a search procedure requires, on inputs of size n:
- time p(n), for a polynomial p, with probability ½;
- time 2^n with probability ½.
No restarts: the expected time is exponential, equal to ½ * (p(n) + 2^n).
Restart with time interval p(n): the expected time drops to polynomial, equal to 2 * p(n).
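The arithmetic of this two-outcome example can be checked with a short sketch. The polynomial `poly` (here n^2) and all function names are illustrative assumptions, not part of the talk; the restart calculation assumes independent runs, so the number of runs is geometric with mean 2 and every run costs exactly p(n) steps.

```python
from fractions import Fraction


def poly(n):
    # Hypothetical polynomial p(n) = n**2, chosen only for illustration.
    return n * n


def expected_time_no_restarts(n, p):
    # A run takes p(n) steps or 2**n steps, each with probability 1/2.
    return Fraction(1, 2) * (p(n) + 2 ** n)


def expected_time_with_restarts(n, p):
    # Cut every run off after p(n) steps. Each run succeeds independently
    # with probability 1/2, so the expected number of runs is 1 / (1/2) = 2,
    # and each run (failed or successful) costs exactly p(n) steps.
    expected_runs = 1 / Fraction(1, 2)
    return expected_runs * p(n)


for n in (10, 20, 40):
    print(n, expected_time_no_restarts(n, poly), expected_time_with_restarts(n, poly))
```

The gap between the two columns grows exponentially with n, which is the point of the example.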

Outline of Talk
- Empirical evidence of heavy-tailed behavior
- Tree search models
  - Balanced tree search model
  - Imbalanced tree search model
  - Bounded heavy-tailed behavior: finite distributions

Empirical Evidence of Heavy-Tailed Behavior

Quasigroups or Latin Squares: An Abstraction for Real-World Applications
A quasigroup is an n-by-n matrix such that each row and each column is a permutation of the same n colors.
Figure: a quasigroup or Latin square of order 4, and an instance with 32% preassignment (Gomes and Selman 96).

Randomized Backtrack Search
Easy instance, 15% preassigned cells (Gomes et al. 97).
Figure: run times of repeated runs on the same instance. Time:(*)3011(*)7
(*) no solution found, reached cutoff: 2000

Erratic Behavior of Search Cost: Quasigroup Completion Problem
Figure: sample mean of the search cost versus number of runs. Median = 1, sample mean = 3500.

Heavy-Tailed Distributions

Heavy-Tailed Distributions
Infinite variance, infinite mean.
Introduced by Pareto in the 1920's as a "probabilistic curiosity."
Mandelbrot established the use of heavy-tailed distributions to model real-world fractal phenomena.
Examples: stock market, earthquakes, weather, web traffic...

Decay of Distributions
Standard distributions (finite mean and variance): exponential decay, e.g. Normal.
Heavy-tailed distributions: power-law decay, e.g. Pareto-Levy.
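For reference, the two kinds of tail decay contrasted on this slide have the following standard forms. The notation (the constant C and the tail index α) is assumed here and is not necessarily what the slide used.

```latex
% Standard normal: the tail decays faster than any power of x.
\Pr[X > x] \sim \frac{1}{x\sqrt{2\pi}}\, e^{-x^{2}/2} \qquad (x \to \infty)

% Power-law (Pareto-Levy type) tail with index \alpha:
\Pr[X > x] \sim C\, x^{-\alpha} \qquad (x \to \infty), \quad 0 < \alpha < 2
```

With a power-law tail of index α, the mean is infinite when α ≤ 1 and the variance is infinite when α < 2.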

Visualization of Heavy-Tailed Behavior
The log-log plot of the tail of the distribution should be approximately linear.
The slope gives the value of the tail index α: α ≤ 1 means infinite mean and infinite variance; 1 < α < 2 means infinite variance.
Figure: unsolved fraction 1-F(x) (log scale) versus number of backtracks (log scale). Annotated curves: 18% unsolved (=> infinite mean) and 0.002% unsolved.
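The log-log diagnostic can be tried on synthetic data. The following is a minimal sketch, assuming samples drawn from a pure Pareto distribution with Pr[X > x] = x^(-alpha) for x >= 1; the function names, the sample size, and the choice to ignore the most extreme 1% of points are all illustrative assumptions, not the procedure used for the plots in the talk.

```python
import math
import random


def pareto_samples(alpha, n, seed=0):
    # Inverse-transform sampling: if U is uniform on (0, 1], then
    # U ** (-1 / alpha) has survival function x ** (-alpha) for x >= 1.
    rng = random.Random(seed)
    return [(1.0 - rng.random()) ** (-1.0 / alpha) for _ in range(n)]


def loglog_tail_slope(samples, min_survival=0.01):
    # Least-squares slope of log(empirical survival) against log(x).
    # Points with survival below min_survival are dropped because the
    # extreme tail of an empirical distribution is very noisy.
    xs = sorted(samples)
    n = len(xs)
    points = []
    for i, x in enumerate(xs):
        survival = (n - i) / n  # empirical Pr[X >= x]
        if survival >= min_survival:
            points.append((math.log(x), math.log(survival)))
    mean_x = sum(px for px, _ in points) / len(points)
    mean_y = sum(py for _, py in points) / len(points)
    sxx = sum((px - mean_x) ** 2 for px, _ in points)
    sxy = sum((px - mean_x) * (py - mean_y) for px, py in points)
    return sxy / sxx


alpha = 0.5  # alpha <= 1: infinite mean and infinite variance
slope = loglog_tail_slope(pareto_samples(alpha, 20000, seed=1))
print("estimated tail index:", -slope)
```

The negated slope is an estimate of α; for run-time data one would apply the same fit to the observed number of backtracks per run.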

Exploiting Heavy-Tailed Behavior
Heavy-tailed behavior has been observed in several domains: QCP, graph coloring, planning, scheduling, circuit synthesis, decoding, etc.
Consequence for algorithm design: use restarts or parallel / interleaved runs to exploit the extreme variance in performance.
Restarts provably eliminate heavy-tailed behavior (Gomes et al. 2000).
Figure: unsolved fraction 1-F(x) versus number of backtracks (log scale). Annotations: 70% unsolved; 250 (62 restarts); 0.001% unsolved.

Tree Search Models: Balanced Tree Model

Balanced Tree Model, Described
Trees:
- All leaves occur at the same depth
- Branching factor 2
- Exactly one "satisfying" leaf
Search algorithm:
- Chronological backtrack search model
- Random child selection with no propagation mechanisms

Balanced Tree Model: Analysis
Let T(n) denote the runtime: the number of leaf nodes visited (including the "satisfying" leaf) on a tree of depth n.
Let X_i denote the choice at the (unique) node at depth i above the satisfying leaf, with the root at depth 1: 1 = bad choice, 0 = good choice.
Then T(n) = X_1 * 2^(n-1) + X_2 * 2^(n-2) + ... + X_n * 2^0 + 1.
There is exactly one zero-one assignment to the variables X_1, ..., X_n for each possible value of T(n), and any such assignment has probability 2^(-n), so T(n) has a uniform distribution.
Figure: example search trees with T = 4 and T = 64.
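The uniformity claim can be checked by brute force on a small tree. The sketch below simulates chronological backtracking on a full binary tree with one satisfying leaf and enumerates every combination of good and bad choices along the path to that leaf. The function name, the encoding of the satisfying leaf as a bit tuple `target`, and the 0-based indexing of levels are assumptions made for illustration.

```python
from itertools import product


def leaves_visited(n, target, bad_choices):
    # Depth-first search on a full binary tree of depth n.
    # target[level] is the branch (0 or 1) leading toward the satisfying leaf.
    # bad_choices[level] == 1 means the wrong child is tried first at the
    # node on the solution path at that level.
    def search(level, on_path):
        if level == n:
            return 1, on_path  # one leaf visited; it satisfies iff on the path
        good = target[level]
        if on_path and not bad_choices[level]:
            order = (good, 1 - good)
        else:
            order = (1 - good, good)  # off the path the order is irrelevant
        visited = 0
        for child in order:
            count, found = search(level + 1, on_path and child == good)
            visited += count
            if found:
                return visited, True
        return visited, False

    total, found = search(0, True)
    assert found
    return total


n = 4
target = (1, 0, 1, 1)  # arbitrary position of the satisfying leaf
runtimes = sorted(leaves_visited(n, target, x) for x in product((0, 1), repeat=n))
print(runtimes)  # each value from 1 to 2**n appears exactly once
```

Since the 2^n choice patterns are equally likely under random child selection and map one-to-one onto the runtimes 1 through 2^n, the runtime is uniform.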

Balanced Tree Model: Distribution
The expected run time and the variance scale exponentially in the height of the search tree (the number of variables).
The run time distribution is uniform, so its shape is not heavy-tailed.
(see paper for formal proofs)

Balanced Tree Model: Restarts
Restart strategies are not effective for this model: no restart strategy achieves expected polynomial time.
Define a restart strategy to be a sequence of times t_1, t_2, t_3, ... It is applied to a search procedure by running the procedure for time t_1, restarting and running for time t_2, and so on, until a solution is found.
Luby et al. (IPL '93) show that optimal performance (minimum expected time) is obtained by a purely uniform restart strategy: t, t, t, ...
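Given the Luby et al. result, it is enough to look at fixed-cutoff strategies to see why restarts do not help here. The sketch below computes the exact expected total time when the runtime is uniform on {1, ..., N} (with N = 2^n) and every run is cut off after t steps. The function name is illustrative, and independence between runs is assumed.

```python
from fractions import Fraction


def expected_time_uniform_restart(N, t):
    # Runtime T is uniform on {1, ..., N}; each run is cut off after t steps.
    q = Fraction(t, N)                  # Pr[a run finishes within the cutoff]
    failed_runs = (1 - q) / q           # expected number of failed runs
    success_cost = Fraction(t + 1, 2)   # E[T | T <= t] for a uniform T
    return failed_runs * t + success_cost


N = 2 ** 10
best = min(expected_time_uniform_restart(N, t) for t in range(1, N + 1))
print(best)  # the minimum is reached at t = N, i.e. with no restarts at all
```

The expression simplifies to N - (t - 1) / 2, which is smallest at t = N, where it equals (N + 1) / 2. Every fixed cutoff therefore gives expected time of at least about 2^(n-1), which is exponential in n.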

Balanced Tree Model
What sort of improvements can be made to an algorithm so that it does not behave like backtrack search in the balanced tree model?
- Very clever search heuristics that lead quickly to the solution node, but that is hard in general.
- A combination of pruning, propagation, and dynamic variable ordering: prune subtrees that do not contain the solution, allowing for runs that are short. The resulting trees may vary dramatically from run to run.

Tree Search Models: Imbalanced Tree Model

Imbalanced Tree Model
The algorithm requires time b^i with probability (1-p)p^i, for i = 0, 1, 2, ...
Intuition: lower p corresponds to "smarter" search.
Let T denote the runtime of the algorithm: the number of leaf nodes visited up to and including the successful node.
Figure: example tree with b = 2.

Imbalanced Tree Model

Imbalanced Tree Model: Three Regimes of Behavior
Regime 1 (p < 1/b^2): finite expected time, finite variance
Regime 2 (1/b^2 <= p < 1/b): finite expected time, infinite variance
Regime 3 (p >= 1/b): infinite expected time, infinite variance
Tail: when L = b^k we have Pr[T >= L] = p^k = L^(log_b p), a power law.
(see paper for formal proofs)
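The regime boundaries follow from the definition Pr[T = b^i] = (1-p)p^i: the k-th moment is E[T^k] = (1-p) * sum over i of (p * b^k)^i, a geometric series that converges exactly when p * b^k < 1. The sketch below classifies (p, b) on that basis and shows partial sums either settling or growing. The function names are illustrative, and the thresholds are derived from the definition rather than quoted from the paper.

```python
def regime(p, b):
    # E[T] is finite iff p * b < 1; E[T^2] is finite iff p * b**2 < 1.
    if p * b * b < 1:
        return 1  # finite mean, finite variance
    if p * b < 1:
        return 2  # finite mean, infinite variance
    return 3      # infinite mean, infinite variance


def partial_moment(p, b, k, terms):
    # Partial sum of E[T^k] = sum_i (1 - p) * p**i * (b**i)**k.
    return sum((1 - p) * p ** i * (b ** i) ** k for i in range(terms))


def tail_probability(p, k):
    # Pr[T >= b**k] = sum_{i >= k} (1 - p) * p**i = p**k.
    return p ** k


for p in (0.2, 0.3, 0.6):
    print(p, regime(p, 2),
          partial_moment(p, 2, 1, 50), partial_moment(p, 2, 1, 100),
          partial_moment(p, 2, 2, 50), partial_moment(p, 2, 2, 100))
```

For b = 2, the values p = 0.2, 0.3, and 0.6 fall in regimes 1, 2, and 3 respectively: the partial sums of the mean stabilize in the first two cases, while those of the second moment stabilize only in the first.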

Bounded Imbalanced Tree Model

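Bounded Imbalanced Tree Model
Unbounded model: a single infinite distribution.
Bounded model: an infinite number of distributions, one for each n. It arises from truncating successively larger finite segments of the unbounded distribution.
Given that the unbounded model has Pr[T = b^i] = (1-p)p^i, the bounded distribution for each n is defined by keeping a finite range of values of i and rescaling the probabilities with a normalizing constant so that they sum to 1.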

Bounded Imbalanced Tree Model: Three Regimes of Behavior
Regime 1: polynomial expected time, polynomial variance
Regime 2: polynomial expected time, exponential variance
Regime 3: exponential expected time, exponential variance
(see paper for formal proofs)
Restart strategy: expected polynomial time.
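The effect of restarts in the bounded model can be illustrated numerically. The sketch below assumes one specific truncation, namely i = 0, ..., n with the probabilities (1-p)p^i renormalized to sum to 1; the exact range used in the paper may differ. The function names and the parameter values (b = 2, p = 3/4, n = 20, which fall in the exponential-expected-time regime) are illustrative choices.

```python
from fractions import Fraction


def bounded_distribution(p, b, n):
    # ASSUMPTION: truncate the unbounded model to i = 0..n and renormalize.
    weights = [(1 - p) * p ** i for i in range(n + 1)]
    total = sum(weights)
    return [(b ** i, w / total) for i, w in enumerate(weights)]


def expected_time(dist):
    # Expected runtime without restarts.
    return sum(t * prob for t, prob in dist)


def expected_time_with_cutoff(dist, cutoff):
    # Fixed-cutoff restarts with independent runs.
    q = sum(prob for t, prob in dist if t <= cutoff)  # per-run success prob.
    success_cost = sum(t * prob for t, prob in dist if t <= cutoff) / q
    failed_runs = (1 - q) / q                         # expected failed runs
    return failed_runs * cutoff + success_cost


p, b, n = Fraction(3, 4), 2, 20
dist = bounded_distribution(p, b, n)
print(float(expected_time(dist)))                 # grows roughly like (p*b)**n
print(float(expected_time_with_cutoff(dist, 1)))  # stays below 1 / (1 - p)
```

With a cutoff of a single step, each run succeeds with probability slightly above 1 - p, so the expected total time stays below 1 / (1 - p) = 4 regardless of n, while the expected time without restarts grows exponentially.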

Bounded Heavy-Tailed Behavior

Balanced, Unbounded, and Imbalanced Trees

Conclusions

Conclusions
Heavy-tailed behavior yields insight into backtrack search methods, providing an explanation for the effectiveness of restart strategies.
Tree search models can be analyzed rigorously:
- Balanced tree search model: uniform distribution (not heavy-tailed); restarts are not effective.
- Imbalanced tree search model (bounded/unbounded): heavy-tailed; restarts are effective.
Consequence for algorithm design: aim for strategies that have highly asymmetric distributions.
