Distributions of Randomized Backtrack Search Key Properties: I Erratic behavior of mean II Distributions have “heavy tails”.

Slides:



Advertisements
Similar presentations
10/7/2014 Constrainedness of Search Toby Walsh NICTA and UNSW
Advertisements

GRAPH BALANCING. Scheduling on Unrelated Machines J1 J2 J3 J4 J5 M1 M2 M3.
1 Backdoor Sets in SAT Instances Ryan Williams Carnegie Mellon University Joint work in IJCAI03 with: Carla Gomes and Bart Selman Cornell University.
Dynamic Restarts Optimal Randomized Restart Policies with Observation Henry Kautz, Eric Horvitz, Yongshao Ruan, Carla Gomes and Bart Selman.
Information Networks Small World Networks Lecture 5.
CPSC 322, Lecture 15Slide 1 Stochastic Local Search Computer Science cpsc322, Lecture 15 (Textbook Chpt 4.8) February, 6, 2009.
Constraint Systems Laboratory Oct 21, 2004Guddeti: MS thesis defense1 An Improved Restart Strategy for Randomized Backtrack Search Venkata P. Guddeti Constraint.
Statistical Regimes Across Constrainedness Regions Carla P. Gomes, Cesar Fernandez Bart Selman, and Christian Bessiere Cornell University Universitat de.
CP Formal Models of Heavy-Tailed Behavior in Combinatorial Search Hubie Chen, Carla P. Gomes, and Bart Selman
Heavy-Tailed Behavior and Search Algorithms for SAT Tang Yi Based on [1][2][3]
1 An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem Matthew Streeter & Stephen Smith Carnegie Mellon University NESCAI, April
Implicit Hitting Set Problems Richard M. Karp Harvard University August 29, 2011.
Master Class on Experimental Study of Algorithms Scientific Use of Experimentation Carla P. Gomes Cornell University CPAIOR Bologna, Italy 2010.
AAAI00 Austin, Texas Generating Satisfiable Problem Instances Dimitris Achlioptas Microsoft Carla P. Gomes Cornell University Henry Kautz University of.
Analysis of Algorithms CS 477/677
Short XORs for Model Counting: From Theory to Practice Carla P. Gomes, Joerg Hoffmann, Ashish Sabharwal, Bart Selman Cornell University & Univ. of Innsbruck.
1 Backdoors To Typical Case Complexity Ryan Williams Carnegie Mellon University Joint work with: Carla Gomes and Bart Selman Cornell University.
Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Randomization in Complete Tree Search.
Structure and Phase Transition Phenomena in the VTC Problem C. P. Gomes, H. Kautz, B. Selman R. Bejar, and I. Vetsikas IISI Cornell University University.
Ch 13 – Backtracking + Branch-and-Bound
CS 584. Discrete Optimization Problems A discrete optimization problem can be expressed as (S, f) S is the set of all feasible solutions f is the cost.
CP-AI-OR-02 Gomes & Shmoys 1 The Promise of LP to Boost CSP Techniques for Combinatorial Problems Carla P. Gomes David Shmoys
1 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Satisfiability (Reading R&N: Chapter 7)
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
1 Understanding Problem Hardness: Recent Developments and Directions Bart Selman Cornell University.
Carla P. Gomes School on Optimization CPAIOR02 Exploiting Structure and Randomization in Combinatorial Search Carla P. Gomes
Lukas Kroc, Ashish Sabharwal, Bart Selman Cornell University, USA SAT 2010 Conference Edinburgh, July 2010 An Empirical Study of Optimal Noise and Runtime.
1 Dynamic Models for File Sizes and Double Pareto Distributions Michael Mitzenmacher Harvard University.
Ryan Kinworthy 2/26/20031 Chapter 7- Local Search part 2 Ryan Kinworthy CSCE Advanced Constraint Processing.
Foundations of Constraint Processing, Fall 2004 November 18, 2004More on BT search1 Foundations of Constraint Processing CSCE421/821, Fall 2004:
1 Combinatorial Problems in Cooperative Control: Complexity and Scalability Carla Gomes and Bart Selman Cornell University Muri Meeting March 2002.
Hardness-Aware Restart Policies Yongshao Ruan, Eric Horvitz, & Henry Kautz IJCAI 2003 Workshop on Stochastic Search.
Why SAT Scales: Phase Transition Phenomena & Back Doors to Complexity slides courtesy of Bart Selman Cornell University.
Learning to Search Henry Kautz University of Washington joint work with Dimitri Achlioptas, Carla Gomes, Eric Horvitz, Don Patterson, Yongshao Ruan, Bart.
Stochastic Approximation and Simulated Annealing Lecture 8 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working.
Distributed Constraint Optimization Michal Jakob Agent Technology Center, Dept. of Computer Science and Engineering, FEE, Czech Technical University A4M33MAS.
Binomial Distributions Calculating the Probability of Success.
Structure and Phase Transition Phenomena in the VTC Problem C. P. Gomes, H. Kautz, B. Selman R. Bejar, and I. Vetsikas IISI Cornell University University.
Introduction to Job Shop Scheduling Problem Qianjun Xu Oct. 30, 2001.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
Heavy-Tailed Phenomena in Satisfiability and Constraint Satisfaction Problems by Carla P. Gomes, Bart Selman, Nuno Crato and henry Kautz Presented by Yunho.
Discrete Structures Lecture 12: Trees Ji Yanyan United International College Thanks to Professor Michael Hvidsten.
Many random walks are faster than one Noga AlonTel Aviv University Chen AvinBen Gurion University Michal KouckyCzech Academy of Sciences Gady KozmaWeizmann.
1 Chapter 9 Introducing Probability. From Exploration to Inference p. 150 in text Purpose: Unrestricted exploration & searching for patterns Purpose:
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
15.053Tuesday, April 9 Branch and Bound Handouts: Lecture Notes.
1 Markov Decision Processes Infinite Horizon Problems Alan Fern * * Based in part on slides by Craig Boutilier and Daniel Weld.
Butterfly model slides. Topological Model: “Butterfly” Objective: Develop model to help explain behavioral mechanisms that cause observed properties,
On the Relation between SAT and BDDs for Equivalence Checking Sherief Reda Rolf Drechsler Alex Orailoglu Computer Science & Engineering Dept. University.
The thermodynamical limit of abstract composition rules T. S. Bíró, KFKI RMKI Budapest, H Non-extensive thermodynamics Composition rules and formal log-s.
Quality of LP-based Approximations for Highly Combinatorial Problems Lucian Leahu and Carla Gomes Computer Science Department Cornell University.
SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.
CS 584. Discrete Optimization Problems A discrete optimization problem can be expressed as (S, f) S is the set of all feasible solutions f is the cost.
Accelerating Random Walks Wei Wei and Bart Selman.
1 Combinatorial Problems in Cooperative Control: Complexity and Scalability Carla P. Gomes and Bart Selman Cornell University Muri Meeting June 2002.
Balance and Filtering in Structured Satisfiability Problems Henry Kautz University of Washington joint work with Yongshao Ruan (UW), Dimitris Achlioptas.
Eliminating non- binary constraints Toby Walsh Cork Constraint Computation Center.
CSCE350 Algorithms and Data Structure Lecture 21 Jianjun Hu Department of Computer Science and Engineering University of South Carolina
Lecture 8 Randomized Search Algorithms Part I: Backtrack Search CSE 573 Artificial Intelligence I Henry Kautz Fall 2001.
Constraint Programming for the Diameter Constrained Minimum Spanning Tree Problem Thiago F. Noronha Celso C. Ribeiro Andréa C. Santos.
Shortcomings of Traditional Backtrack Search on Large, Tight CSPs: A Real-world Example Venkata Praveen Guddeti and Berthe Y. Choueiry The combination.
1 P NP P^#P PSPACE NP-complete: SAT, propositional reasoning, scheduling, graph coloring, puzzles, … PSPACE-complete: QBF, planning, chess (bounded), …
Extensions to BT Search Foundations of Constraint Processing
Extensions to BT Search Foundations of Constraint Processing
Partly Verifiable Signals (c.n.)
Constraint Programming and Backtracking Search Algorithms
Extensions to BT Search Foundations of Constraint Processing
Extensions to BT Search Foundations of Constraint Processing
Constraints and Search
Extensions to BT Search Foundations of Constraint Processing
Presentation transcript:

Distributions of Randomized Backtrack Search Key Properties: I Erratic behavior of mean II Distributions have “heavy tails”.

Median = 1! sample mean 3500! Erratic Behavior of Search Cost Quasigroup Completion Problem number of runs

1

75%<=30 Number backtracks Proportion of cases Solved 5%>100000

Heavy-Tailed Distributions … infinite variance … infinite mean… infinite variance … infinite mean Introduced by Pareto in the 1920’s --- “probabilistic curiosity.” Mandelbrot established the use of heavy-tailed distributions to model real-world fractal phenomena. Examples: stock-market, earth-quakes, weather,...

Decay of Distributions Standard --- Exponential Decay e.g. Normal: Heavy-Tailed --- Power Law Decay e.g. Pareto-Levy:

Standard Distribution (finite mean & variance) Power Law Decay Exponential Decay

Normal, Cauchy, and Levy Normal - Exponential Decay Cauchy -Power law Decay Levy -Power law Decay

Tail Probabilities (Standard Normal, Cauchy, Levy)

Example of Heavy Tailed Model (Random Walk) Random Walk: Start at position 0 Toss a fair coin: –with each head take a step up (+1) –with each tail take a step down (-1) X --- number of steps the random walk takes to return to position 0.

The record of 10,000 tosses of an ideal coin (Feller) Zero crossing Long periods without zero crossing

Random Walk Heavy-tails vs. Non-Heavy-Tails Normal (2, ) Normal (2,1) O,1%> % 2 Median=2 1-F(x) Unsolved fraction X - number of steps the walk takes to return to zero (log scale)

How to Check for “Heavy Tails”? Log-Log plot of tail of distribution should be approximately linear. Slope gives value of infinite mean and infinite variance infinite mean and infinite variance infinite variance infinite variance

Number backtracks (log) (1-F(x))(log) Unsolved fraction => Infinite mean Heavy-Tailed Behavior in QCP Domain 18% unsolved 0.002% unsolved

Formal Models of Heavy- Tailed Behavior in Combinatorial Search Chen, Gomes, Selman 2001

MotivationMotivation Research on heavy-tails has been largely based on empirical studies of run time distribution. Goal: to provide a formal characterization of tree search models and show under what conditions heavy-tailed distributions can arise. Intuition: Heavy-tailed behavior arises: from the fact that wrong branching decisions may lead the procedure to explore an exponentially large subtree of the search space that contains no solutions; the procedure is characterized by a large variability in the time to find a solution on different runs, which leads to highly different trees from run to run;

Balanced vs. Imbalanced Tree Model Balanced Tree Model: chronological backtrack search model; fixed variable ordering; random child selection with no propagation mechanisms; (show demo)

T(n) - the number of leaf nodes visited - choice at level i; (1 - bad choice; 0 -good choice) (note : there is exactly one choice of zero-one assignments to the variables for each possible value of T(n); any such assignment has probability. T(n) follows an Uniform distribution

The run time distribution of chronological backtrack search on a complete balanced tree is uniform (therefore not heavy-tailed). Both the expected run time and variance scale exponentially

Balanced Tree Model –The expected run time and variance scale exponentially, in the height of the search tree (number of variables); –The run time distribution is Uniform, (not heavy tailed ). –Backtrack search on balanced tree model has no restart strategy with exponential polynomial time. Chen, Gomes & Selman 01

How can we improve on the balanced serach tree model? Very clever search heuristic that leads quickly to the solution node - but that is hard in general; Combination of pruning, propagation, dynamic variable ordering that prune subtrees that do not contain the solution, allowing for runs that are short. ---> resulting trees may vary dramatically from run to run.

T - the number of leaf nodes visited up to and including the successful node; b - branching factor Formal Model Yielding Heavy-Tailed Behavior b = 2 (show demo)

Expected Run Time (infinite expected time) Variance (infinite variance) Tail (heavy-tailed)

Bounded Heavy-Tailed Behavior (show demo)

No Heavy-tailed behavior for Proving Optimality

Proving Optimality

Small-World Vs. Heavy-Tailed Behavior Does a Small-World topology (Watts & Strogatz) induce heavy-tail behavior? The constraint graph of a quasigroup exhibits a small-world topology (Walsh 99)

Exploiting Heavy-Tailed Behavior Heavy Tailed behavior has been observed in several domains: QCP, Graph Coloring, Planning, Scheduling, Circuit synthesis, Decoding, etc. Consequence for algorithm design: Use restarts or parallel / interleaved runs to exploit the extreme variance performance. Restarts provably eliminate heavy-tailed behavior. (Gomes et al. 97, Hoos 99, Horvitz 99, Huberman, Lukose and Hogg 97, Karp et al 96, Luby et al. 93, Rish et al. 97, Wlash 99)

XXXXX solved 10 Sequential: = 51 seconds Parallel: 10 machines second 51 x speedup Super-linear Speedups Interleaved (1 machine): 10 x 1 = 10 seconds 5 x speedup

Restarts 70% unsolved 1-F(x) Unsolved fraction Number backtracks (log) no restarts restart every 4 backtracks 250 (62 restarts) 0.001% unsolved

Example of Rapid Restart Speedup (planning) ~100 restarts Cutoff (log) Number backtracks (log) ~10 restarts

Sketch of proof of elimination of heavy tails Let’s truncate the search procedure after m backtracks. Probability of solving problem with truncated version: Run the truncated procedure and restart it repeatedly.

Y - does not have Heavy Tails