Sampling Combinatorial Space Using Biased Random Walks Jordan Erenrich, Wei Wei and Bart Selman Dept. of Computer Science Cornell University.

Slides:



Advertisements
Similar presentations
CSCI 3160 Design and Analysis of Algorithms Tutorial 4
Advertisements

Gibbs sampler - simple properties It’s not hard to show that this MC chain is aperiodic. Often is reversible distribution. If in addition the chain is.
1 Backdoor Sets in SAT Instances Ryan Williams Carnegie Mellon University Joint work in IJCAI03 with: Carla Gomes and Bart Selman Cornell University.
Computing the Density of States of Boolean Formulas Stefano Ermon, Carla Gomes, and Bart Selman Cornell University, September 2010.
Counting Solution Clusters Divide-and-Conquer Recursively:  Arbitrarily pick a variable, say x, of formula F  Count how many clusters contain solutions.
Ch 11. Sampling Models Pattern Recognition and Machine Learning, C. M. Bishop, Summarized by I.-H. Lee Biointelligence Laboratory, Seoul National.
BAYESIAN INFERENCE Sampling techniques
CS774. Markov Random Field : Theory and Application Lecture 16 Kyomin Jung KAIST Nov
. PGM: Tirgul 8 Markov Chains. Stochastic Sampling  In previous class, we examined methods that use independent samples to estimate P(X = x |e ) Problem:
CP Formal Models of Heavy-Tailed Behavior in Combinatorial Search Hubie Chen, Carla P. Gomes, and Bart Selman
Computational statistics 2009 Random walk. Computational statistics 2009 Random walk with absorbing barrier.
Ryan Kinworthy 2/26/20031 Chapter 7- Local Search part 1 Ryan Kinworthy CSCE Advanced Constraint Processing.
Algorithms in Exponential Time. Outline Backtracking Local Search Randomization: Reducing to a Polynomial-Time Case Randomization: Permuting the Evaluation.
1 Towards Efficient Sampling: Exploiting Random Walk Strategy Wei Wei, Jordan Erenrich, and Bart Selman.
Model Counting: A New Strategy for Obtaining Good Bounds Carla P. Gomes, Ashish Sabharwal, Bart Selman Cornell University AAAI Conference, 2006 Boston,
1 Sampling, Counting, and Probabilistic Inference Wei joint work with Bart Selman.
It’s all about the support: a new perspective on the satisfiability problem Danny Vilenchik.
AAAI00 Austin, Texas Generating Satisfiable Problem Instances Dimitris Achlioptas Microsoft Carla P. Gomes Cornell University Henry Kautz University of.
Short XORs for Model Counting: From Theory to Practice Carla P. Gomes, Joerg Hoffmann, Ashish Sabharwal, Bart Selman Cornell University & Univ. of Innsbruck.
Accelerating Random Walks Wei Dept. of Computer Science Cornell University (joint work with Bart Selman)
1 Backdoors To Typical Case Complexity Ryan Williams Carnegie Mellon University Joint work with: Carla Gomes and Bart Selman Cornell University.
CPSC 322, Lecture 17Slide 1 Planning: Representation and Forward Search Computer Science cpsc322, Lecture 17 (Textbook Chpt 8.1 (Skip )- 8.2) February,
Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Instance Hardness and Phase Transitions.
Accelerating Random Walks Wei Wei and Bart Selman Dept. of Computer Science Cornell University.
Stochastic greedy local search Chapter 7 ICS-275 Spring 2007.
Knowledge Representation II (Inference in Propositional Logic) CSE 473 Continued…
How to Stall a Motor: Information-Based Optimization for Safety Refutation of Hybrid Systems Todd W. Neller Knowledge Systems Laboratory Stanford University.
Lukas Kroc, Ashish Sabharwal, Bart Selman Cornell University, USA SAT 2010 Conference Edinburgh, July 2010 An Empirical Study of Optimal Noise and Runtime.
1 Combinatorial Problems in Cooperative Control: Complexity and Scalability Carla Gomes and Bart Selman Cornell University Muri Meeting March 2002.
1 Message Passing and Local Heuristics as Decimation Strategies for Satisfiability Lukas Kroc, Ashish Sabharwal, Bart Selman (presented by Sebastian Brand)
Relaxed DPLL Search for MaxSAT (short paper) Lukas Kroc, Ashish Sabharwal, Bart Selman Cornell University SAT-09 Conference Swansea, U.K. July 3, 2009.
Introduction to Monte Carlo Methods D.J.C. Mackay.
1 Exploiting Random Walk Strategies in Reasoning Wei.
Simulation of Random Walk How do we investigate this numerically? Choose the step length to be a=1 Use a computer to generate random numbers r i uniformly.
Monte Carlo Methods: Basics
Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.
1 MCMC Style Sampling / Counting for SAT Can we extend SAT/CSP techniques to solve harder counting/sampling problems? Such an extension would lead us to.
The Boolean Satisfiability Problem: Theory and Practice Bart Selman Cornell University Joint work with Carla Gomes.
Combination of Exact and Approximate Methods for SAT and MAX-SAT Problems Frédéric Lardeux, Frédéric Saubion and Jin-Kao Hao Metaheuristics and Combinatorial.
U NIFORM S OLUTION S AMPLING U SING A C ONSTRAINT S OLVER A S AN O RACLE Stefano Ermon Cornell University August 16, 2012 Joint work with Carla P. Gomes.
Stochastic greedy local search Chapter 7 ICS-275 Spring 2009.
/425 Declarative Methods - J. Eisner 1 Random 3-SAT  sample uniformly from space of all possible 3- clauses  n variables, l clauses Which are.
An Introduction to Markov Chain Monte Carlo Teg Grenager July 1, 2004.
SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.
Seminar on random walks on graphs Lecture No. 2 Mille Gandelsman,
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Markov Chain Monte Carlo for LDA C. Andrieu, N. D. Freitas, and A. Doucet, An Introduction to MCMC for Machine Learning, R. M. Neal, Probabilistic.
Lecture #9: Introduction to Markov Chain Monte Carlo, part 3
The Markov Chain Monte Carlo Method Isabelle Stanton May 8, 2008 Theory Lunch.
Patrick De Causmaecker Stefaan Haspeslagh Tommy Messelis.
Reducing MCMC Computational Cost With a Two Layered Bayesian Approach
Accelerating Random Walks Wei Wei and Bart Selman.
Iterative Improvement Search Including Hill Climbing, Simulated Annealing, WALKsat and more....
Presented by Dajiang Zhu 11/1/2011.  Introduction of Markov chains Definition One example  Two problems as examples 2-SAT Algorithm (simply introduce.
Inference in Propositional Logic (and Intro to SAT) CSE 473.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson0-1 Supplement 2: Comparing the two estimators of population variance by simulations.
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
Random Sampling Algorithms with Applications Kyomin Jung KAIST Aug ERC Workshop.
1 P NP P^#P PSPACE NP-complete: SAT, propositional reasoning, scheduling, graph coloring, puzzles, … PSPACE-complete: QBF, planning, chess (bounded), …
Simulation analysis of quantum walks Tomohiro YAMASAKI QCI, ERATO, JST / University of Tokyo.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Markov Chain Monte Carlo in R
Inference in Propositional Logic (and Intro to SAT)
Local Search Strategies: From N-Queens to Walksat
Watermarking of SAT Using Combinatorial Isolation Lemmas
Metaheuristic methods and their applications. Optimization Problems Strategies for Solving NP-hard Optimization Problems What is a Metaheuristic Method?
Joint work with Carla Gomes.
Computing the Density of States of Boolean Formulas
Junghoo “John” Cho UCLA
Local search characteristics of incomplete SAT procedures
Presentation transcript:

Sampling Combinatorial Space Using Biased Random Walks Jordan Erenrich, Wei Wei and Bart Selman Dept. of Computer Science Cornell University

Many forms of probabilistic reasoning can be effectively reduced to sampling satisfying assignments from a Boolean formula (an instance of SAT). Question: Can state-of-the-art local search procedures for SAT sample effectively from the solution space? (as an alternative to standard Monte Carlo Markov Chain methods)

Characteristics of Solution space: Solution Clustering Visualization with multi-dimensional scaling (MDS) –Solutions to specific 75 variable, 325 clause 3-SAT instances –75 dimensional solution projected to two dimensions –Distance between points approximates hamming distance

Solution Probability Consider a simple 2-SAT problem –x OR y Consider a simple SAT heuristic –Starts with a random bit assignment –Randomly flip a bit until a solution is found Consider the probability of finding each solution – xySolution?Solution Probability 00NoN/A 01Yes3/8 10Yes3/8 11Yes1/4

Solution Probability Using WalkSat Algorithm Empirically determined each solution’s probability (uf variable, 325 clause 3-SAT instance) WalkSat finds every solution, but with very large range of probabilities (1:10 4 ) Probability Clusters

Probability Ranges in Different Domains InstanceRunsHits Rarest Hits Common Common-to -Rare Ratio Random50* * *10 4 Logistics1* *

Improving the Uniformity of Sampling To reduce the range of probabilities, we propose a hybrid local search algorithm: –With probability p, the algorithm makes a biased random walk move –With probability 1-p, the algorithm makes a SA (simulated annealing) move In our experiment, we used –50% WalkSat + 50% SA at a fixed temperature Mixed sampling strategy

Results of the Hybrid Approach Our key figure.

Solution Clusters Results on a random 3-SAT instance (70 vars, 301 clauses, 2531 solutions).

Summary 1)WalkSAT does sample all solutions. 2)But, sampling can be highly biased. 3)Using a new hybrid strategy, we can obtain effective near-uniform sampling. Lesson: Hybrid of SA and biased walk, is a promising alternative to MCMC methods for sampling. Proposal: Use SAT solvers to sample solutions from a combinatorial space. Findings: