Time-Space Tradeoffs in Proof Complexity: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame.

Slides:

Advertisements

Similar presentations

Exploiting SAT solvers in unbounded model checking

Advertisements

Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.

Hybrid BDD and All-SAT Method for Model Checking Orna Grumberg Joint work with Assaf Schuster and Avi Yadgar Technion – Israel Institute of Technology.

Comparative Succinctness of KR Formalisms Paolo Liberatore.

Time-Space Tradeoffs in Resolution: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell.

UIUC CS 497: Section EA Lecture #2 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004.

Proofs from SAT Solvers Yeting Ge ACSys NYU Nov

Methods of Proof Chapter 7, second half.. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound)

COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.

Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.

Time-Space Tradeoffs in Resolution: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View SAT.

1 Backdoor Sets in SAT Instances Ryan Williams Carnegie Mellon University Joint work in IJCAI03 with: Carla Gomes and Bart Selman Cornell University.

Bounds on Code Length Theorem: Let l ∗ 1, l ∗ 2,..., l ∗ m be optimal codeword lengths for a source distribution p and a D-ary alphabet, and let L ∗ be.

Properties of SLUR Formulae Ondřej Čepek, Petr Kučera, Václav Vlček Charles University in Prague SOFSEM 2012 January 23, 2012.

Best-First Search: Agendas

Reduction of Interpolants for Logic Synthesis John Backes Marc Riedel University of Minnesota Dept.

Beating Brute Force Search for Formula SAT and QBF SAT Rahul Santhanam University of Edinburgh.

Constraint Logic Programming Ryan Kinworthy. Overview Introduction Logic Programming LP as a constraint programming language Constraint Logic Programming.

Complexity 19-1 Complexity Andrei Bulatov More Probabilistic Algorithms.

Analysis of Algorithms CS 477/677

Search in the semantic domain. Some definitions atomic formula: smallest formula possible (no sub- formulas) literal: atomic formula or negation of an.

1 Backdoors To Typical Case Complexity Ryan Williams Carnegie Mellon University Joint work with: Carla Gomes and Bart Selman Cornell University.

A Compressed Breadth-First Search for Satisfiability DoRon B. Motter and Igor L. Markov University of Michigan, Ann Arbor.

Last time Proof-system search ( ` ) Interpretation search ( ² ) Quantifiers Equality Decision procedures Induction Cross-cutting aspectsMain search strategy.

Data Flow Analysis Compiler Design Nov. 8, 2005.

1 Understanding the Power of Clause Learning Ashish Sabharwal, Paul Beame, Henry Kautz University of Washington, Seattle IJCAI ConferenceAug 14, 2003.

Lecture 20: April 12 Introduction to Randomized Algorithms and the Probabilistic Method.

Complexity ©D.Moshkovitz 1 Paths On the Reasonability of Finding Paths in Graphs.

1 Paul Beame University of Washington Phase Transitions in Proof Complexity and Satisfiability Search Dimitris Achlioptas Michael Molloy Microsoft Research.

Hardness Results for Problems

1.1 Chapter 1: Introduction What is the course all about? Problems, instances and algorithms Running time v.s. computational complexity General description.

Copyright © Cengage Learning. All rights reserved. CHAPTER 11 ANALYSIS OF ALGORITHM EFFICIENCY ANALYSIS OF ALGORITHM EFFICIENCY.

1 Institute for Theoretical Computer Science, IIIS Tsinghua university, Beijing Iddo Tzameret Based on joint work with Sebastian Müller (Prague)

Logics for Data and Knowledge Representation Propositional Logic: Reasoning Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto.

On Bridging Simulation and Formal Verification Eugene Goldberg Cadence Research Labs (USA) VMCAI-2008, San Francisco, USA.

Boolean Satisfiability and SAT Solvers

Performing Bayesian Inference by Weighted Model Counting Tian Sang, Paul Beame, and Henry Kautz Department of Computer Science & Engineering University.

SAT and SMT solvers Ayrat Khalimov (based on Georg Hofferek‘s slides) AKDV 2014.

CHAPTERS 7, 8 Oliver Schulte Logical Inference: Through Proof to Truth.

INTRODUCTION TO ARTIFICIAL INTELLIGENCE COS302 MICHAEL L. LITTMAN FALL 2001 Satisfiability.

Techniques for Proving NP-Completeness Show that a special case of the problem you are interested in is NP- complete. For example: The problem of finding.

Lazy Annotation for Program Testing and Verification Speaker: Chen-Hsuan Adonis Lin Advisor: Jie-Hong Roland Jiang November 26,

1 The Theory of NP-Completeness 2 Cook ’ s Theorem (1971) Prof. Cook Toronto U. Receiving Turing Award (1982) Discussing difficult problems: worst case.

CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.

Combining Component Caching and Clause Learning for Effective Model Counting Tian Sang University of Washington Fahiem Bacchus (U Toronto), Paul Beame.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View BDDs.

Boolean Satisfiability Present and Future

SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.

Finding Models for Blocked 3-SAT Problems in Linear Time by Systematical Refinement of a Sub- Model Gábor Kusper Eszterházy Károly.

Accelerating Random Walks Wei Wei and Bart Selman.

Heuristics for Efficient SAT Solving As implemented in GRASP, Chaff and GSAT.

Secret Sharing Non-Shannon Information Inequalities Presented in: Theory of Cryptography Conference (TCC) 2009 Published in: IEEE Transactions on Information.

1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.

Exponential time algorithms Algorithms and networks.

SAT Solving As implemented in - DPLL solvers: GRASP, Chaff and

Inference in Propositional Logic (and Intro to SAT) CSE 473.

Proof Methods for Propositional Logic CIS 391 – Intro to Artificial Intelligence.

1 IAS, Princeton ASCR, Prague. The Problem How to solve it by hand ? Use the polynomial-ring axioms ! associativity, commutativity, distributivity, 0/1-elements.

Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.

Inference in Propositional Logic (and Intro to SAT)

Hybrid BDD and All-SAT Method for Model Checking

Introduction to Randomized Algorithms and the Probabilistic Method

Resolution over Linear Equations: (Partial) Survey & Open Problems

Decision Procedures An Algorithmic Point of View

Resolution Proofs for Combinational Equivalence

Switching Lemmas and Proof Complexity

Solving Non-clausal Formulas with DPLL search

Presentation transcript:

Time-Space Tradeoffs in Proof Complexity: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell Impagliazzo

SAT & SAT Solvers SAT is central to both theory and practice In the last ten years, there has been a revolution in practical SAT solving. Modern SAT solvers can sometimes solve practical instances with millions of variables. Best current solvers use a Backtracking approach pioneered by DPLL ’62, plus an idea called Clause Learning developed in Chaff ‘99.

SAT & SAT Solvers DPLL search requires very little memory Clause learning adds new clauses to the CNF every time the search backtracks – Uses lots of memory to try to beat DPLL. – In practice, must use heuristics to guess which clauses are “important” and store only those. Hard to do well! Memory becomes a bottleneck. Question: Is this inherent? Or can the right heuristics avoid the memory bottleneck?

SAT Solvers and Proofs All SAT algorithms find a satisfying assignment or a proof of unsatisfiability. – Important for applications, not simply academic. For “real” algorithms, these proofs take place in simple deductive proof systems, reflecting the underlying reasoning of the algorithm. – Proof can be thought of as a high level summary of the computation history. – Backtracking SAT Solvers correspond to Resolution

Resolution Proof System Proof lines are clauses, one simple proof step Proof is a sequence of clauses each of which is – an original clause, or – follows from previous clauses via resolution step A CNF is UNSAT iff can derive empty clause ⊥

Proof DAG General resolution: Arbitrary DAG For DPLL algorithm, DAG is a tree.

SAT Solvers and Proof Complexity

More recently, researchers want to investigate memory bottleneck for DPLL + Clause Learning Question: If Proof Size ≤ Time for Ideal SAT Solver, can we define Proof Space so that Proof Space ≤ Memory for Ideal SAT Solver, and then prove strong lower bounds for Space?

Space in Resolution … Must be in memory Informally: Clause Space of a proof = Number of clauses you need to hold in memory at once in order to carry out the proof.

Lower Bounds on Space? Generic Upper Bound: All UNSAT formulas on vars have DPLL refutation in space ≤. – Sharp lower bounds are known for explicit tautologies. [ET’99, ABRW’00, T’01, AD’03] So although we can get tight results for space, we can’t show superpolynomial space is needed this way – need to think about size-space tradeoffs. In this direction: [Ben-Sasson, Nordström ‘10] Pebbling formulas with proofs in Size O (n), Space O (n), but Space O (n/log n)  Size exp(n  (1) ). But, this is still only for sublinear space.

Size-Space Tradeoffs Eli Ben-Sasson asks formally: “Does there exist such that any CNF with a refutation of size T also has a refutation of size T in space O()?”

Tseitin Tautologies 10 0

When  odd, G connected, corresponding CNF is called a Tseitin tautology. [Tseitin ‘68] Specifics of  don’t matter, only total parity. The graph is what determines the hardness. Known to be hard with respect to Size and Space when G is a constant degree expander. [Urquhart ‘87, Torán ‘99] This work: Tradeoffs on × grid, ≫, and similar graphs, using isoperimetry.

Tseitin formula on Grid l n

l n

l n

l n

l n

Warmup Proof Our size/space lower bound draws on the ideas of one of the main size lower bound techniques. [Haken, Beame Pitassi ‘95]. To illustrate the ideas behind our result, we’ll first give the details of the Beame Pitassi result, then show how to build on it to get a size/space tradeoff.

Warmup Proof The plan is to show that any refutation of the 2x grid formula must contain many different wide clauses. First, we show that any refutation of the 1x grid formula must contain at least one wide clause. Then, we use a random restriction argument to “boost” this, showing that proofs of 2x grid contain many wide clauses.

Warmup Proof l n

Warmup Proof: One Wide Clause

Warmup Proof: Many Clauses A restriction is a partial assignment to the variables of a formula, resulting in some simplification. Consider choosing a random restriction for 2x grid which for each edge pair, randomly sets one to a random constant. l n Poof!

Warmup Proof: Many Clauses A restriction is a partial assignment to the variables of a formula, resulting in some simplification. Consider choosing a random restriction for 2x grid which for each edge pair, randomly sets one to a random constant. Then formula always simplifies to the 1x grid. l n

Warmup Proof: Many Clauses

Size Space Tradeoff

Complexity vs. Time Time Hi Med Low

Two Possibilities Time Hi Med Low

Two Possibilities Time Hi Med Low

Isoperimetry in the Grid n

n

n

n

n

Two Possibilities Time Hi Med Low

Full Result To get the full result in [BBI’12], don’t just subdivide into epochs once, do it recursively. Uses a more sophisticated case analysis on progress. The full result can also be extended to Polynomial Calculus Resolution, an algebraic proof system which manipulates polynomials rather than clauses. In [BNT’12], we combined the ideas of [BBI’12], [BGIP’01] to achieve this.

Open Questions More than quasi-polynomial separations? – For Tseitin formulas upper bound for small space is only a log n power of the unrestricted size – Candidate formulas? Are these even possible? Tight result for Tseitin? A connection with a pebbling result [Paul, Tarjan’79] may show how. Can we get tradeoffs for Cutting Planes? Monotone Circuits? Frege subsystems?

Thanks!

Analogy with Flows, Pebbling In any Resolution proof, can think of a truth assignment as following a path in the proof dag, stepping along falsified clauses. Path starts at empty clause, at the end of the proof. Branch according to resolved variable. If x = 1…

Analogy with Flows, Pebbling Then the random restriction argument can be viewed as a construction of a distribution on truth assignments following paths that are unlikely to hit complex clauses. Initial Clauses “Bottlenecks” (complex clauses)

Analogy with Flows, Pebbling Initial Points Middle Layer 1 Middle Layer 2

Analogy with Flows, Pebbling In a series of papers, [Paul, Tarjan ‘79], [Lengauer, Tarjan ’80?] an epoch subdivision argument appeared for pebblings which solved most open questions in graph pebbling. Their argument works for graphs formed from stacks of expanders, superconcentrators, etc. The arguments seem closely related. However, theirs scales up exponentially with # of stacks, ours scales up exponentially with log #stacks.

SAT Solvers Well-known connection between Resolution and SAT solvers based on Backtracking These algorithms are very powerful – sometimes can quickly handle CNF’s with millions of variables. On UNSAT formulas, computation history yields a Resolution proof. – Tree-like Resolution ≈ DPLL algorithm – General Resolution ≿ DPLL + “Clause Learning” Best current SAT solvers use this approach

Overview of Lower Bound To get a time space tradeoff, divide the proof into a large number of epochs and a case analysis involving the progress measure: – Either, progress is saved during the breakpoints between epochs (difficult with small space) – Or, progress happens within an epoch. (difficult if epochs are small) Simple arguments in restricted proof boost to almost tight bounds in unrestricted proof.

Overview of Lower Bound Suppose the space used by the proof is small. Divide the proof into epochs of equal sizes, and hit it with the random restriction. The number of epochs times the space bounds the number of clauses appearing at breakpoints between epochs. If their number is small, then with high probability, none of them has a “medium’ value of mu.

Overview of Lower Bound

Main technical step: Show that if an epoch contains few clauses, restriction is unlikely to have C_1 … C_k’ of superincreasing mu values. Need to do better than a union bound over all clauses, or result will be trivial. Main Idea: Show that any such C_1 … C_k’ have Omega(k’ n) variables collectively. If so, then by a union bound over k’ tuples, Pr[ E has k’ superincreasing] < (|E| 2^{-w})^k’

Overview of Lower Bound

Tseitin formula on Grid-like Graph l n

l n

l n

High Level Overview of Lower Bound Fundamental idea in Resolution size bounds is “bottleneck counting argument” [Haken]. Think of any truth assignment as following a path in the proof dag, stepping along falsified clauses. Path starts at empty clause, at the end of the proof. Branch according to resolved variable. If x = 1…

High Level Overview of Lower Bound Fundamental idea in Resolution lower bounds is “bottleneck counting argument” [Haken]. Given a distribution of assignments, get a distribution of paths through proof DAG. Haken’s idea: To show a formula is hard, find a large set of assignments such that in any sound proof, most assignments pass through a wide clause. Since only a small fraction of assignments can falsify a wide clause, this implies there are many wide clauses (bottlenecks in the flow of assignments).

High Level Overview of Lower Bound Initial Clauses Middle Layer (wide clauses)

High Level Overview of Lower Bound Our idea: If a proof is too short and uses too little space, flow will be too congested to route all paths. Need to consider multiple middle layers. Initial Clauses Middle Layer (wide clauses)

High Level Overview of Lower Bound Initial Clauses Middle Layer 1 Middle Layer 2

High Level Overview of Lower Bound

Extended Isoperimetric Inequality If the sets aren’t essentially blocks, we’re done. If they are blocks, reduce to the line:

Intervals on the line

Proof DAG

“Regular”: On every root to leaf path, no variable resolved more than once.

Tradeoffs for Regular Resolution Theorem : For any k, 4-CNF formulas (Tseitin formulas on long and skinny grid graphs) of size n with – Regular resolution refutations in size n k+1, Space n k. – But with Space only n k- , for any  > 0, any regular resolution refutation requires size at least n  log log n / log log log n.

Regular Resolution Can define partial information more precisely Complexity is monotonic wrt proof DAG edges. This part uses regularity assumption, simplifies arguments with complexity plot. Random Adversary selects random assignments based on proof – No random restrictions, conceptually clean and don’t lose constant factors here and there.

Size-Space Tradeoffs for Resolution

Warmup Proof: Many Clauses A restriction is a partial assignment to the variables of a formula, resulting in some simplification. l n

Techniques of Proof