Planning as Satisfiability: Progress and Challenges Bart Selman Cornell University.

Slides:



Advertisements
Similar presentations
Propositional Satisfiability (SAT) Toby Walsh Cork Constraint Computation Centre University College Cork Ireland 4c.ucc.ie/~tw/sat/
Advertisements

Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Proofs from SAT Solvers Yeting Ge ACSys NYU Nov
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Mar, 4, 2015 Slide credit: some slides adapted from Stuart.
1 Backdoor Sets in SAT Instances Ryan Williams Carnegie Mellon University Joint work in IJCAI03 with: Carla Gomes and Bart Selman Cornell University.
The Theory of NP-Completeness
Tradeoffs in Backdoors: Inconsistency Detection, Dynamic Simplification, and Preprocessing Bistra Dilkina, Carla Gomes, Ashish Sabharwal Cornell University.
Phase Transitions of PP-Complete Satisfiability Problems D. Bailey, V. Dalmau, Ph.G. Kolaitis Computer Science Department UC Santa Cruz.
08/1 Foundations of AI 8. Satisfiability and Model Construction Davis-Putnam, Phase Transitions, GSAT Wolfram Burgard and Bernhard Nebel.
CSE 5731 Lecture 21 State-Space Search vs. Constraint- Based Planning CSE 573 Artificial Intelligence I Henry Kautz Fall 2001.
The Theory of NP-Completeness
Phase Transitions of PP-Complete Satisfiability Problems D. Bailey, V. Dalmau, Ph.G. Kolaitis Computer Science Department UC Santa Cruz.
Search in the semantic domain. Some definitions atomic formula: smallest formula possible (no sub- formulas) literal: atomic formula or negation of an.
1 Backdoors To Typical Case Complexity Ryan Williams Carnegie Mellon University Joint work with: Carla Gomes and Bart Selman Cornell University.
Last time Proof-system search ( ` ) Interpretation search ( ² ) Quantifiers Equality Decision procedures Induction Cross-cutting aspectsMain search strategy.
1 BLACKBOX: A New Paradigm for Planning Bart Selman Cornell University.
Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Instance Hardness and Phase Transitions.
Encoding Domain Knowledge in the Planning as Satisfiability Framework Bart Selman Cornell University.
Chapter 11: Limitations of Algorithmic Power
1 CS 4700: Foundations of Artificial Intelligence Carla P. Gomes Module: Satisfiability (Reading R&N: Chapter 7)
1 BLACKBOX: A New Approach to the Application of Theorem Proving to Problem Solving Bart Selman Cornell University Joint work with Henry Kautz AT&T Labs.
Knowledge Representation II (Inference in Propositional Logic) CSE 473 Continued…
1 Paul Beame University of Washington Phase Transitions in Proof Complexity and Satisfiability Search Dimitris Achlioptas Michael Molloy Microsoft Research.
1 Message Passing and Local Heuristics as Decimation Strategies for Satisfiability Lukas Kroc, Ashish Sabharwal, Bart Selman (presented by Sebastian Brand)
Why SAT Scales: Phase Transition Phenomena & Back Doors to Complexity slides courtesy of Bart Selman Cornell University.
SAT Solver Math Foundations of Computer Science. 2 Boolean Expressions  A Boolean expression is a Boolean function  Any Boolean function can be written.
Classical Planning Chapter 10.
Planning as Satisfiability CS Outline 0. Overview of Planning 1. Modeling and Solving Planning Problems as SAT - SATPLAN 2. Improved Encodings using.
The Role of Domain-Specific Knowledge in the Planning as Satisfiability Framework Henry Kautz AT&T Labs Bart Selman Cornell University.
Logical Foundations of AI Planning as Satisfiability Clause Learning Backdoors to Hardness Henry Kautz.
1 The Theory of NP-Completeness 2012/11/6 P: the class of problems which can be solved by a deterministic polynomial algorithm. NP : the class of decision.
Boolean Satisfiability and SAT Solvers
Jonathon Doran. The Planning Domain A domain describes the objects, facts, and actions in the universe. We may have a box and a table in our universe.
1 MCMC Style Sampling / Counting for SAT Can we extend SAT/CSP techniques to solve harder counting/sampling problems? Such an extension would lead us to.
Performing Bayesian Inference by Weighted Model Counting Tian Sang, Paul Beame, and Henry Kautz Department of Computer Science & Engineering University.
The Boolean Satisfiability Problem: Theory and Practice Bart Selman Cornell University Joint work with Carla Gomes.
SAT and SMT solvers Ayrat Khalimov (based on Georg Hofferek‘s slides) AKDV 2014.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE COS302 MICHAEL L. LITTMAN FALL 2001 Satisfiability.
Solvers for the Problem of Boolean Satisfiability (SAT) Will Klieber Aug 31, 2011 TexPoint fonts used in EMF. Read the TexPoint manual before you.
Open-Loop Planning as Satisfiability Henry Kautz AT&T Labs.
Planning as Propositional Satisfiabililty Brian C. Williams Oct. 30 th, J/6.834J GSAT, Graphplan and WalkSAT Based on slides from Bart Selman.
Survey Propagation. Outline Survey Propagation: an algorithm for satisfiability 1 – Warning Propagation – Belief Propagation – Survey Propagation Survey.
1 The Theory of NP-Completeness 2 Cook ’ s Theorem (1971) Prof. Cook Toronto U. Receiving Turing Award (1982) Discussing difficult problems: worst case.
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module Logic Representations.
Combining Component Caching and Clause Learning for Effective Model Counting Tian Sang University of Washington Fahiem Bacchus (U Toronto), Paul Beame.
On the Relation between SAT and BDDs for Equivalence Checking Sherief Reda Rolf Drechsler Alex Orailoglu Computer Science & Engineering Dept. University.
First-Order Logic and Inductive Logic Programming.
CS 5411 Compilation Approaches to AI Planning 1 José Luis Ambite* Some slides are taken from presentations by Kautz and Selman. Please visit their.
Boolean Satisfiability Present and Future
SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Oct, 30, 2015 Slide credit: some slides adapted from Stuart.
Accelerating Random Walks Wei Wei and Bart Selman.
Balance and Filtering in Structured Satisfiability Problems Henry Kautz University of Washington joint work with Yongshao Ruan (UW), Dimitris Achlioptas.
1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.
AAAI of 20 Deconstructing Planning as Satisfiability Henry Kautz University of Rochester in collaboration with Bart Selman and Jöerg Hoffmann.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Inference in Propositional Logic (and Intro to SAT) CSE 473.
Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View Basic Concepts and Background.
Proof Methods for Propositional Logic CIS 391 – Intro to Artificial Intelligence.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
1 P NP P^#P PSPACE NP-complete: SAT, propositional reasoning, scheduling, graph coloring, puzzles, … PSPACE-complete: QBF, planning, chess (bounded), …
Inference in Propositional Logic (and Intro to SAT)
Inference and search for the propositional satisfiability problem
CS 4700: Foundations of Artificial Intelligence
Planning as Satisfiability
Planning as Search State Space Plan Space Algorihtm Progression
First-Order Logic and Inductive Logic Programming
Emergence of Intelligent Machines: Challenges and Opportunities
ECE 667 Synthesis and Verification of Digital Circuits
Joint work with Carla Gomes.
Presentation transcript:

Planning as Satisfiability: Progress and Challenges Bart Selman Cornell University

2 What is Planning? Planning (AI): Find a (partially) ordered set of actions that transforms a given initial state to a specified goal state. Can cover most forms of “problem solving” Long history in AI, since mid sixties --- general problem solving architecture --- robotics (“planning a sequence of actions to achieve a set of goals”) Scheduling (OR): Fixed set of actions, “just” need to find optimal ordering --- Significantly easier than general planning --- E.g. linear programming relaxations are quite effective. --- Planning problems typically highly non-linear, require combinatorial search

Some Applications of Planning Supply chain management: combines planning and scheduling [main industrial interest] Autonomous systems Many NASA applications: Deep Space One Remote Agent Long-range mission planning Communication planning & scheduling [DOD / Darpa] Softbots - software agents Internet agents, program assistants AI “characters” in games, entertainment Natural language / dialogue understanding and generation Speech act: The goal of the speaker is to convey info to the listener. Dialogue can be formulated as a plan generation problem. Listener: plan recognition poblem. User modeling: Plan recognition Theorem proving: Proof planning

4 Core Computational Problem Focus: Classical state-space planning, extended with parallel actions Similar computational issues arise in other models Reactive plans Planning with uncertainty and utilities Continuous processes Metric time

5 State-space Planning Find a sequence of operators that transform an initial state to a goal state State = complete truth assignment to a set of variables (“fluents”) Goal = partial truth assignment (set of states) Operator = a partial function State  State specified by three sets of variables: precondition, add list, delete list

6 Parallelism Operators may be applied in parallel when all orderings are well defined and equivalent (Op1 || Op2)(s) = Op2(Op1(s)) = Op1(Op2(s)) A special form of non-linear plans Only allows parallel actions, not parallel action sequences Easy to serialize

7 Abdundance of Negative Complexity Results I.Domain-independent planning: PSPACE-complete or worse (Chapman 1987; Bylander 1991; Backstrom 1993) II. Domain-dependent planning: NP-complete or worse operators fixed (Chenoweth 1991; Gupta and Nau 1992) III. Approximate planning: NP-complete or worse (Selman 1994)

Planning Practice: Phase I Till mid-nineties: Domain-independent planning systems could generate plans of only a few (< 10) steps! Prodigy, Nonlin, UCPOP,... Limited practical use Research focus: minimize or eliminate search by employing complex search control rules, hand-tailored to the search engine and the particular search space (Sacerdoti 1975, Slaney 1996, Bacchus 1996) pre-compiling entire state-space to a reactive finite-state machine (Agre & Chapman 1997, Williams & Nayak 1997) Drawback: Highly domain dependent.

Planning Practice: Phase II Since mid-nineties, three key advances: --- GraphPlan (Blum and Furst 1996) planning as a specialized graph search problem --- SATPLAN (Kautz and Selman 1996; foundation in ’92) planning as Boolean satisfiabilty (constraint reasoning) --- FF planner (Hoffmann 2000) non-optimal planning as specialized heuristic search (A*) Till mid-nineties: < 10 step optimal plans Current: we can now synthesize optimal 500+ step plans non-optimal: steps [historical note: planning as first-order inference (Green ‘69)]

10 Planning as Satisfiability: SATPLAN axiom schemas instantiated propositional clauses satisfying model plan mapping length problem description SAT engine(s) instantiate interpret Note: To deal with PSPACE completeness, we limit the plan length. Gives NP-completeness. Analogous to bounded model checking.

blackbox version 9B command line: blackbox -o logistics.pddl -f logistics_prob_d_len.pddl -solver compact -l -then satz -cutoff 25 -restart Converting graph to wff 6151 variables clauses Invoking simplifier compact Variables undetermined: 4633 Non-unary clauses output: Invoking solver satz version satz-rand-2.1 Wff loaded [1] begin restart [1] reached cutoff back to root [2] begin restart [2] reached cutoff back to root [3] begin restart [3] reached cutoff back to root [4] begin restart [4] reached cutoff back to root [5] begin restart **** the instance is satisfiable ***** **** verification of solution is OK **** total elapsed seconds = Begin plan 1 drive-truck_ny-truck_ny-central_ny-po_ny Example run

100 step optimal plan

Biannual Planning competitions: --- All current competitors are derived from Graphplan, SATPLAN, or FF. Traditional planners no longer competitive Knowledge-intensive approaches require too much human effort Range of new techniques: Type-theoretic analysis of operators: can infer state invariants as extra constraints powerful, generally applicable pre-processor (Long et al. ‘01; Gerevini and Schubert ‘04) Compilation of more expressive languages (conditional effects) Recent extensions to Markov Decision Processes of FF (Geffner ‘02), Graphplan (Blum ‘02), SATPLAN (Littman ‘03) Deriving more complex plan constraints on the fly (Chen ’05)

14 Research Challenges [planning & constraint reasoning in general] a) Automated structure discovery to further improve large-scale constraint solving [capture structure in additional constraints; use statistical and other machine learning techniques] b) Adversarial reasoning: Quantified Boolean Formulas c) Probabilistic reasoning: Solution Counting and the integration of probabilistic and logical reasoning

15 Worst-case intractability but real-world has significant hidden tractable sub-structure. [aside: exponential behavior of SAT solvers easily demonstrated on synthetic instance classes. So, structure of naturally occurring problems is key.] Key notion: backdoor variables A small set of special variables that capture inherent combinatorics of a reasoning problem. In terms of human cognition, real-world reasoning problems can often be solved allow using clever shortcuts. Formal definition includes the notion of a “subsolver”: a polynomial simplification procedure with certain general characteristics found in current DPLL SAT solvers. I)Hidden Problem Structure Backdoors to tractability (Gomes et al. ’03. ’04)

Backdoors (wrt subsolver A; SAT case): Strong backdoors (wrt subsolver A; UNSAT case): Note: Notion of backdoor is related to but different from constraint-graph based notions such as cutsets.

Backdoors can be surprisingly small: Most recent: Other combinatorial domains. E.g. graphplan planning, near constant size backdoors (2 or 3 variables) and log(n) size in certain domains. (Hoffmann, Gomes, Selman ’04) Backdoors capture critical problem resources (bottlenecks).

Backdoors --- “seeing is believing” Logistics_b.cnf planning formula. 843 vars, 7,301 clauses, approx min backdoor 16 (backdoor set = reasoning shortcut) Constraint graph of reasoning problem. One node per variable: edge between two variables if they share a constraint. Visualization by Anand Kapur.

Logistics.b.cnf after setting 5 backdoor vars.

After setting just 12 (out of 800+) backdoor vars – problem almost solved.

MAP-6-7.cnf infeasible planning instances. Strong backdoor of size vars, 2,578 clauses. Another example

After setting 2 (out of 392) backdoor vars. --- reducing problem complexity in just a few steps!

Inductive inference problem --- ii16a1.cnf vars, 19,368 clauses. Backdoor size 40. Last example.

After setting 6 backdoor vars.

After setting 38 (out of 1600+) backdoor vars: Some other intermediate stages: So: Real-world structure hidden in the network. Can be exploited by automated reasoning engines.

(Williams, Gomes, and Selman ’04) Current solvers Size backdoor n = num. vars. k is a constant Restarts, clause learning and local focus (Malik ’08) help in backdoor discovery.

II) Adversarial Reasoning Quantified Boolean Formulas (QBF) extend Boolean logic by allowing quantification over variables (exists and forall) QBF is satisfiable iff there exists a way of setting the existential vars such that for every possible assigment to the universal vars the clauses are satisfied. Literally a “game played on the clauses”: Existential player tries hard to satisfy all clauses in the matrix. Universal player tries hard to “spoil” it for the existential player: i.e., break (“unsatisfy”) one or more clauses. the clausesQuantifiers prefix

Formally: Problem is PSPACE- complete. Range of new applications: Multi-agent reasoning, unbounded planning, unbounded model-checking (verification), and certain forms probabilistic reasoning and contingency planning. Can we transfer successful SAT techniques to QBF? Cautiously optimistic. But very sensitive to problem encodings. (Antsotegui, Gomes, and Selman ’05) Related work: Walsh ’03; Gent, Nightingale, and Stergiou ’05; Pan & Vardi 04; Giunchiglia et al. 04; Malik and Zhang ‘04; Alur ’04; and Williams ’05.

Original Search Space 2 N Search Space SAT Encoding 2 N+M Space Searched by SAT Solvers 2 N/C ; N log(N) ; Poly(N ) Original 2 N Search Space for SAT Approaches

Original Search Space 2 N Search Space QBF Encoding 2 N+M’ Space Searched by COND QBF Solvers with Streamlining Search Space of QBF Search Space Standard QBF Encoding 2 N+M’’ Original 2 N

III) Combining Probabilistic and Logical Reasoning Survey Propagation (SP) Mezard et al. (Science ’02). Exciting new reasoning / combinatorial search method. Applies probabilistic reasoning technique for solving combinatorial search problems. Basic idea: Let N be the total number of satisfying assignments. N_x+ the number of satisfying assigns with x set to True. N_x- with x set to False. Define: P_x+ = N_x+ / N and P_x- = N_x- / N. I.e., P_x+ is “the probability of seeing x assigned True when randomly sampling satisfying assignments.”

SP, cont. Consider the following “decimation” strategy: If P_x+ >= P_x- then set x to True else set x to False. Simplify instance and repeat, until a satisfying assignment is reached. But, only a physicist would think of such a strategy! (Why?) Since computing the probabilities is much harder (#P-complete) than finding a satisfying assignment (NP-complete)!

SP, surprise But, perhaps one can efficiently compute good approximations of P_x+ and P_x- Strategy is to iteratively solve a set of recursive equations. Linear time. The so-called SP equations are quite involved. They are a form of probabilistic reasoning called Belief Propagation. Intuitively, the idea is to consider the effect of adding a clause (constraint) to a set of clauses. Example: start with the empty set of clauses over two variables p and q. So, P_p+ = P_p- = ½ and P_q_+ = P_q- = ½. Now add a clause (p OR (NOT q)). What happens to P_p+ and P_q+? First should go up a bit and the other down a bit…

(p OR (NOT q)) is satisfied by (T, F), (T, T), and (F,F). So, P_p+ = 2/3 and P_p- = 1/3 and P_q+ = 1/3 and P_q- = 2/3. Now consider adding ((NOT p) OR q OR r). P_p+ should go down a bit. P_q+ and P_r+ up a bit. Etc. Brute force enumeration becomes infeasible but SP equations model the changes in these probabilities directly to capture the addition of clauses/constraints. Clauses and variables interact, so we will have to look for a fixed point of a set of coupled recursive equations.

The CNF: The “Factor” Graph: (Graphical Model. Bayesian Net) The equations:

SP is surprisingly effective on hard random k-SAT and graph coloring. 10M instances with 42M clauses can be solved in linear time (around one hour of cpu time; sets batches of variables, never backtracks, finds satisfying assignment!) Walksat, a biased random walk strategy, is the next best but would require 100+ hrs of cpu time. Our formal understanding of SP is limited. SP is still a bit “magical”. Some insights: Zecchina et al. 2004; Wainright et al. 2006; Kroc et al. 2007, 2008.

(Kroc et al. 2007) Hard random 3-SAT. 5,000 var; 21,000 clauses SP Marginal Probabilities True Marginal Probabilities SP: marginals in < 30 seconds 100+ hrs (direct computation)

38 Survey Propagation: final observations Next challenge for SP: use on structured (non-random) formulas. Significant potential. (preliminary results in Kroc et al. 2007b.) In general, probabilistic techniques for solving constraint problems provides an intriguing alternative to the existing two main paradigms: (1) complete, backtrack search, and (2) local search. Involves finding fixed points of coupled recursive equations over continuous variables. New perspective and potentially very fast.

P NP P^#P PSPACE NP-complete: SAT, scheduling, graph coloring, puzzles, … PSPACE-complete: QBF, adversarial planning, chess (bounded), … EXP-complete: games like Go, … P-complete: circuit-value, … Note: widely believed hierarchy; know P≠EXP for sure. In P: sorting, shortest path, … Complexity Hierarchy: Where We Are Easy PH EXP #P-complete/hard: #SAT, sampling, probabilistic inference, … SAT/SMT Solvers “Traditional” Algs. Extended SAT Solvers QBF Solvers Hard First Order Thm. Provers