The Space Efficiency of OSHL Swaha Miller David A. Plaisted UNC Chapel Hill.

Slides:



Advertisements
Similar presentations
Resolution Proof System for First Order Logic
Advertisements

Computer Science CPSC 322 Lecture 25 Top Down Proof Procedure (Ch 5.2.2)
The Model Evolution Calculus with Built-in Theories Peter Baumgartner MPI Informatik, Saarbrücken
Biointelligence Lab School of Computer Sci. & Eng.
UIUC CS 497: Section EA Lecture #2 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004.
Proofs from SAT Solvers Yeting Ge ACSys NYU Nov
Methods of Proof Chapter 7, second half.. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound)
Methods of Proof Chapter 7, Part II. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound) generation.
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Mar, 4, 2015 Slide credit: some slides adapted from Stuart.
Artificial Intelligence Chapter 14. Resolution in the Propositional Calculus Artificial Intelligence Chapter 14. Resolution in the Propositional Calculus.
1 Applied Computer Science II Resolution in FOL Luc De Raedt.
Performance of OSHL on Problems Requiring Definition Expansion Swaha Miller David A. Plaisted UNC Chapel Hill.
CPSC 322, Lecture 23Slide 1 Logic: TD as search, Datalog (variables) Computer Science cpsc322, Lecture 23 (Textbook Chpt 5.2 & some basic concepts from.
Outline Recap Knowledge Representation I Textbook: Chapters 6, 7, 9 and 10.
Proof methods Proof methods divide into (roughly) two kinds: –Application of inference rules Legitimate (sound) generation of new sentences from old Proof.
Willis Lemasters Grant Conklin. Searching a tree recursively one branch at a time, abandoning any branch which does not satisfy the search constraints.
The Model Evolution Calculus Peter Baumgartner, MPI Saarbruecken and U Koblenz Cesare Tinelli, U Iowa.
CPSC 322, Lecture 23Slide 1 Logic: TD as search, Datalog (variables) Computer Science cpsc322, Lecture 23 (Textbook Chpt 5.2 & some basic concepts from.
Knowledge Representation I (Propositional Logic) CSE 473.
Complexity 19-1 Complexity Andrei Bulatov More Probabilistic Algorithms.
Methods of Proof Chapter 7, second half.
SPASS: Combining Superposition, Sorts and Splitting Christoph Weidenbach Max-Planck-Institute for Computer Science Presented.
Last time Proof-system search ( ` ) Interpretation search ( ² ) Quantifiers Equality Decision procedures Induction Cross-cutting aspectsMain search strategy.
Chapter 3 Propositional Logic
Knowledge Representation II (Inference in Propositional Logic) CSE 473 Continued…
Artificial Intelligence Chapter 14 Resolution in the Propositional Calculus Artificial Intelligence Chapter 14 Resolution in the Propositional Calculus.
Logic - Part 2 CSE 573. © Daniel S. Weld 2 Reading Already assigned R&N ch 5, 7, 8, 11 thru 11.2 For next time R&N 9.1, 9.2, 11.4 [optional 11.5]
Notes for Chapter 12 Logic Programming The AI War Basic Concepts of Logic Programming Prolog Review questions.
Proof Systems KB |- Q iff there is a sequence of wffs D1,..., Dn such that Dn is Q and for each Di in the sequence: a) either Di is in KB or b) Di can.
Logics for Data and Knowledge Representation Propositional Logic: Reasoning Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto.
Boolean Satisfiability and SAT Solvers
Empirical Explorations with The Logical Theory Machine: A Case Study in Heuristics by Allen Newell, J. C. Shaw, & H. A. Simon by Allen Newell, J. C. Shaw,
CHAPTERS 7, 8 Oliver Schulte Logical Inference: Through Proof to Truth.
Logical Inference 2 rule based reasoning
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
S P Vimal, Department of CSIS, BITS, Pilani
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module Logic Representations.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 17 Wednesday, 01 October.
Resolution Strategies One common strategy for applying resolution is called level saturation. Here you try to resolve every pair of clauses from the original.
Logical Agents Chapter 7. Knowledge bases Knowledge base (KB): set of sentences in a formal language Inference: deriving new sentences from the KB. E.g.:
CS6133 Software Specification and Verification
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 18 of 41 Friday, 01 October.
1 The Wumpus Game StenchBreeze Stench Gold Breeze StenchBreeze Start  Breeze.
SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Oct, 30, 2015 Slide credit: some slides adapted from Stuart.
© Copyright 2008 STI INNSBRUCK Intelligent Systems Propositional Logic.
© Daniel S. Weld 1 Logistics Problem Set 2 Due Wed A few KR problems Robocode 1.Form teams of 2 people 2.Write design document.
Nikolaj Bjørner Microsoft Research DTU Winter course January 2 nd 2012 Organized by Flemming Nielson & Hanne Riis Nielson.
DEDUCTION PRINCIPLES AND STRATEGIES FOR SEMANTIC WEB Chain resolution and its fuzzyfication Dr. Hashim Habiballa University of Ostrava.
Backward chaining Assume the same representation of rules as in forward chaining, i.e. If is true, is true, … is true Then is true. Rule interpretation.
1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.
Logical Agents Chapter 7. Outline Knowledge-based agents Propositional (Boolean) logic Equivalence, validity, satisfiability Inference rules and theorem.
Inference in Propositional Logic (and Intro to SAT) CSE 473.
Proof Methods for Propositional Logic CIS 391 – Intro to Artificial Intelligence.
Knowledge Repn. & Reasoning Lecture #9: Propositional Logic UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2005.
Computing & Information Sciences Kansas State University Wednesday, 04 Oct 2006CIS 490 / 730: Artificial Intelligence Lecture 17 of 42 Wednesday, 04 October.
Computing & Information Sciences Kansas State University Friday, 13 Oct 2006CIS 490 / 730: Artificial Intelligence Lecture 21 of 42 Friday, 13 October.
Logical Agents. Outline Knowledge-based agents Logic in general - models and entailment Propositional (Boolean) logic Equivalence, validity, satisfiability.
CENG 424-Logic for CS Introduction Based on the Lecture Notes of Konstantin Korovin, Valentin Goranko, Russel and Norvig, and Michael Genesereth.
EA C461 Artificial Intelligence
Inference in Propositional Logic (and Intro to SAT)
EA C461 – Artificial Intelligence Logical Agent
Lecture 2 Propositional Logic
OSHL: A Propositional Prover with Semantics for First-Order Logic
Propositional Resolution
Artificial Intelligence: Agents and Propositional Logic.
The Relative Power of Semantics and Unification
Logic: Domain Modeling /Proofs + Computer Science cpsc322, Lecture 22
Methods of Proof Chapter 7, second half.
Presentation transcript:

The Space Efficiency of OSHL Swaha Miller David A. Plaisted UNC Chapel Hill

How do humans prove theorems? Semantics Case analysis Sequential search through space of possible structures Focus on the theorem

“Systematic methods can now routinely solve verification problems with thousands or tens of thousands of variables, while local search methods can solve hard random 3SAT problems with millions of variables.” (from a conference announcement)

DPLL Example {p,r},{  p,  q,r},{p,  r} {T,r},{  T,  q,r},{T,  r}{F,r},{  F,  q,r},{F,  r} p=Tp=F {  q,r}{r},{  r} {} SIMPLIFY

Hyper Linking

Eliminating Duplication with the Hyper- Linking Strategy, Shie-Jue Lee and David A. Plaisted, Journal of Automated Reasoning 9 (1992)

Later propositional strategies Billon’s disconnection calculus, derived from hyper-linking Disconnection calculus theorem prover (DCTP), derived from Billon’s work FDPLL

Performance of DCTP on TPTP, 2003 DCTP 1.3 first in EPS and EPR (largely propositional) DCTP 10.2p third in FNE (first-order, no equality) solving same number as best provers DCTP 10.2p fourth in FOF and FEQ (all first- order formulae, and formulae with equality) DCTP 1.3 is a single strategy prover.

Strategy Selection in E

Strategy Selection Schulz, Stephan, E-A Brainiac Theorem Prover, Journal of AI Communications 15(2/3): , 2002.

Strategy Selection The Vampire kernel provides a fairly large number of features for strategy selection. The most important ones are: Choice of the main saturation procedure : (i) OTTER loop, with or without the Limited Resource Strategy, (ii) DISCOUNT loop. A variety of optional simplifications. Parameterised reduction orderings. A number of built-in literal selection functions and different modes of comparing literals. Age-weight ratio that specifies how strongly lighter clauses are preferred for inference selection. Set-of-support strategy.

Strategy Selection The automatic mode of Vampire 7.0 is derived from extensive experimental data obtained on problems from TPTP v Input problems are classified taking into account simple syntactic properties, such as being Horn or non-Horn, presence of equality, etc. Additionally, we take into account the presence of some important kinds of axioms, such as set theory axioms, associativity and commutativity. Every class of problems is assigned a fixed schedule consisting of a number of kernel strategies called one by one with different time limits.

DCTP Strategy Selection DCTP 1.31 has been implemented as a monolithic system in the Bigloo dialect of the Scheme language. DCTP 1.31 is a single strategy prover. Individual strategies are started by DCTP 10.21p using the schedule based resource allocation scheme known from the E-SETHEO system. Of course, different schedules have been precomputed for the syntactic problem classes. The problem classes are more or less identical with the sub-classes of the competition organisers. In CASC-J2 DCTP 10.21p performed substantially better.

Goal of OSHL First-order logic Clause form Propositional efficiency Semantics Requires ground decidability

Structure of OSHL Goal sensitivity if semantics chosen properly Choose initial semantics to satisfy axioms Use of natural semantics For group theory problems, can specify a group Sequential search through possible interpretations Thus similar to Davis and Putnam’s method Propositional Efficiency Constructs a semantic tree

Ordered Semantic Hyperlinking (Oshl) Reduce first-order logic problem to propositional problem Imports propositional efficiency into first-order logic The algorithm Imposes an ordering on clauses Progresses by generating ground instances Di of input clauses and refining interpretations unsatisfiable I0 I1 I2 I3 … D0 D1 D2 T

Semantics Trivial semantics: Positive: Choose I 0 to falsify all atoms, first D is all positive. Forward chaining. Negative: Choose I 0 to satisfy all atoms, first D is all negative. Backward chaining. Natural semantics: I 0 chosen by user

Semantics Ordering < t a well founded ordering on atoms, extended to literals Extend < t to interpretations as follows: I and J agree on L if they interpret L the same Suppose I 0 is given I < t J if I and J are not identical, A is the minimal atom on which they disagree, and I agrees with I 0 on A

Rules of OSHL Start with empty sequence (C 1,C 2, …, C n ), D minimal ground instance of an input clause that contradicts I, I minimal model of sequence (C 1,C 2, …, C n,D) (C 1,C 2, …, C n, D), C n “out of order” (C 1,C 2, …, C n-1,D) (C 1,C 2, …, C n,D), max resolution possible (C 1,C 2, …, C n-1,res(C n,D,L)) Proof if empty clause derived

Propositional Example (  p I 0 p) () ({-p1, -p2, -p3}) I 0 [-p3] ({-p1, -p2, -p3}, {-p4, -p5, -p6}) I 0 [-p3,-p6] ({…}, {…}, {-p7}) I 0 [-p3,-p6,-p7] ({…}, {…}, {-p7}, {p3, p7}) ({…}, {-p4, -p5, -p6}, {p3}) ({-p1, -p2, -p3},{p3}) ({-p1, -p2 }) I 0 [-p2] ╨

U Rules Choose clauses instances to match existing literals. Look for a contradiction. Basic clauses and U clauses Basic clauses are used in three rules given Sequence can also have U clauses on the end U clauses have a selected literal In basic clauses the max. lit. is selected In U clauses other literals can be selected. Significant performance enhancement.

UR Resolution Example Given the sequence ({s(a), p(b) }, {t(a), q(b)}) and the clause {  p(X),  q(X), r(X)} create the sequence ({s(a), p(b)}, {t(a), q(b)}, {  p(b),  q(b), r(b)} ) X  b

Filtering Example Given the sequence ({s(a), p(b)}, {t(a), q(b)}) and the clause {  p(X),  q(X)} create the sequence ({s(a), p(b)}, {t(a), q(b)}, {  p(b),  q(b)} ) X  b

Case Analysis Example Given the sequence ({s(a), p(b)}, {t(a), q(b)}) and the clause {  q(X), r(X), s(X)} create the sequence ({s(a), p(b)}, {t(a), q(b)}, {  q(b), r(b), s(b)} ) X  b

Example Proof Using U Rules All positive semantics Clauses: A1.  X  Y,  Y  X, X=Y A2.  Z  X,  X  Y, Z  Y A3. g(X,Y)  X, X  Y A4.  g(X,Y)  Y, X  Y A5.  Z  X, Z  X  Y A6.  Z  Y, Z  X  Y A7.  Z  X  Y, Z  X, Z  Y T.  A  B = B  A

Example Proof Using U Rules 1. {  A  B = B  A} (T) 2. {  A  B  B  A,  B  A  A  B, A  B = B  A} (Case Analysis, A1) 3. {  g(A  B, B  A)  B  A, A  B  B  A} (UR resolution, A4) 4. {g(A  B, B  A)  B  A,  g(…)  B} (UR resolution, A5) 5. {g(A  B, B  A)  B  A,  g(…)  A} (UR resolution, A6) 6. {g(…)  B, g(…)  A,  g(…)  A  B} (UR resolution, A7) 7. {A  B  B  A, g(…)  A  B} (Filtering, A3)

Example Proof Using U Rules 1. {  A  B = B  A} 2. {  A  B  B  A,  B  A  A  B, A  B = B  A} (Case Analysis) 3. {  g(A  B, B  A)  B  A, A  B  B  A} (UR resolution) 4. {g(A  B, B  A)  B  A,  g(…)  B} (UR resolution) 5. {g(A  B, B  A)  B  A,  g(…)  A} (UR resolution) 8. {g(…)  B, g(…)  A, A  B  B  A,} (Resolution of 6. and 7.)

Example Proof Using U Rules 1. {  A  B = B  A} 2. {  A  B  B  A,  B  A  A  B, A  B = B  A} (Case Analysis) 3. {  g(A  B, B  A)  B  A, A  B  B  A} (UR resolution) 4. {g(A  B, B  A)  B  A,  g(…)  B} (UR resolution) 9. {g(A  B, B  A)  B  A, g(…)  B, A  B  B  A} (Resolution of 8. and 5.)

Example Proof Using U Rules 1. {  A  B = B  A} 2. {  A  B  B  A,  B  A  A  B, A  B = B  A} (Case Analysis) 3. {  g(A  B, B  A)  B  A, A  B  B  A} (UR resolution) 10. {g(A  B, B  A)  B  A} (Resolution of 9. and 4.)

Example Proof Using U Rules 1. {  A  B = B  A} 2. {  A  B  B  A,  B  A  A  B, A  B = B  A} (Case Analysis) 11. {A  B  B  A} (Resolution of 10. and 3.)

Example Proof Using U Rules 1. {  A  B = B  A} 12. {  B  A  A  B, A  B = B  A} (Resolution of 11 and 2) Now the other half of the proof will be done. Note that there is only one ascending sequence of clauses constructed by OSHL and we are only indicating part of it.

Implementation Results Slower implementation speed of OSHL Uniform strategy versus strategy selection The choice of Otter Influence of U rules on an earlier version: None: 233 proofs in 30 seconds on TPTP problems Using them: 900 proofs in 30 seconds All results for trivial semantics

Implementation Results OSHL has no special data structures. Implemented in OCaML No special equality methods Semantics was implemented but frequently only trivial semantics was used. Thus significant performance improvements are possible.

Various Provers PTTP solved 999 of 2200 tested problems. Otter proved leanCoP proved 745. Source: Jens Otten and Wolfgang Bibel. leanCoP: Lean Connection-Based Theorem Proving. Journal of Symbolic Computation, Volume 36, pages Elsevier Science, Vampire 6.0: 3286 refutations of 7267 problems, more solved

Total Number of Proofs #PROBS # Otter Proofs # OSHL-U Proofs AllHORNNon-HornAllHORNNon-Horn AllR=0R>0AllR=0R>0 All FLD SET R denotes the TPTP difficulty rating 30 second time limit on each problem with each prover

Implementation Results Shows that a prover working entirely at the ground level can come into the range of performance of a respectable resolution theorem prover. DCTP and FDPLL probably perform better than OSHL. DCTP and FDPLL do not work entirely at the ground level and do not use natural semantics.

Search space AllHornNon-HornR=0R>0 Non-Horn, R>0 Otter OSHL-U Number of clauses generated (in 1,000s) computed on 827 problems that were proved by both provers Ratio RatioAllHornNon-HornR=0R>0 Non-Horn, R>0 OSHL-U Otter Otter Ratio of number of clauses generated

Storage space AllHornNon-HornR=0R>0 Non-Horn, R>0 Otter OSHL-U Max. number of clauses stored (in 1,000s) computed on 827 problems that were proved by both provers Ratio RatioAllHornNon-HornR=0R>0 Non-Horn, R>0 OSHL-U Otter Otter Ratio of number of clauses stored

Implementation Results In a given number of inferences OSHL finds more proofs than Otter for non Horn problems