Practical Partition-Based Theorem Proving for Large Knowledge Bases Bill MacCartney (Stanford KSL) Sheila A. McIlraith (Stanford KSL) Eyal Amir (UC Berkeley)

Slides:



Advertisements
Similar presentations
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Advertisements

The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Big Ideas in Cmput366. Search Blind Search State space representation Iterative deepening Heuristic Search A*, f(n)=g(n)+h(n), admissible heuristics Local.
CSC411Artificial Intelligence 1 Chapter 3 Structures and Strategies For Space State Search Contents Graph Theory Strategies for Space State Search Using.
CS498-EA Reasoning in AI Lecture #15 Instructor: Eyal Amir Fall Semester 2011.
WIMS 2014, June 2-4Thessaloniki, Greece1 Optimized Backward Chaining Reasoning System for a Semantic Web Hui Shi, Kurt Maly, and Steven Zeil Contact:
Automatic Verification Book: Chapter 6. What is verification? Traditionally, verification means proof of correctness automatic: model checking deductive:
Optimizing Join Enumeration in Transformation-based Query Optimizers ANIL SHANBHAG, S. SUDARSHAN IIT BOMBAY VLDB 2014
UIUC CS 497: Section EA Lecture #2 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004.
Methods of Proof Chapter 7, second half.. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound)
Methods of Proof Chapter 7, Part II. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound) generation.
Resolution Theorem Proving
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Mar, 4, 2015 Slide credit: some slides adapted from Stuart.
Partition-Based Logical Reasoning Bill MacCartney (KSL), Sheila A. McIlraith (KSL), Eyal Amir (FRG/Berkeley), Tomas Uribe (SRI) Richard Fikes and John.
Proof methods Proof methods divide into (roughly) two kinds: –Application of inference rules Legitimate (sound) generation of new sentences from old Proof.
Revisiting a slide from the syllabus: CS 525 will cover Parallel and distributed computing architectures – Shared memory processors – Distributed memory.
CSE (c) S. Tanimoto, 2008 Propositional Logic
Sharing Aggregate Computation for Distributed Queries Ryan Huebsch, UC Berkeley Minos Garofalakis, Yahoo! Research † Joe Hellerstein, UC Berkeley Ion Stoica,
Recent Development on Elimination Ordering Group 1.
Methods of Proof Chapter 7, second half.
Resolution versus Search: Two Strategies for SAT Brad Dunbar Shamik Roy Chowdhury.
Automated Changes of Problem Representation Eugene Fink LTI Retreat 2007.
Ontology translation: two approaches Xiangkui Yao OntoMorph: A Translation System for Symbolic Knowledge By: Hans Chalupsky Ontology Translation on the.
Distributed Constraint Optimization * some slides courtesy of P. Modi
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
Graph Coalition Structure Generation Maria Polukarov University of Southampton Joint work with Tom Voice and Nick Jennings HUJI, 25 th September 2011.
The Electronic Geometry Textbook Project Xiaoyu Chen LMIB - Department of Mathematics Beihang University, China.
Knowledge representation
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
SOFTWARE DESIGN.
UIUC CS 497: Section EA Lecture #3 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004.
1 Structured Region Graphs: Morphing EP into GBP Max Welling Tom Minka Yee Whye Teh.
On Graph Query Optimization in Large Networks Alice Leung ICS 624 4/14/2011.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Knowledge Repn. & Reasoning Lec #11: Partitioning & Treewidth UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2004.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
An Introduction to Artificial Intelligence – CE Chapter 7- Logical Agents Ramin Halavati
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 13 of 41 Monday, 20 September.
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 17 Wednesday, 01 October.
Logical Agents Chapter 7. Knowledge bases Knowledge base (KB): set of sentences in a formal language Inference: deriving new sentences from the KB. E.g.:
Computing & Information Sciences Kansas State University Lecture 14 of 42 CIS 530 / 730 Artificial Intelligence Lecture 14 of 42 William H. Hsu Department.
Ch. 13 Ch. 131 jcmt CSE 3302 Programming Languages CSE3302 Programming Languages (notes?) Dr. Carter Tiernan.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 14 of 41 Wednesday, 22.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 18 of 41 Friday, 01 October.
CPSC 422, Lecture 21Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 21 Oct, 30, 2015 Slide credit: some slides adapted from Stuart.
Nikolaj Bjørner Microsoft Research DTU Winter course January 2 nd 2012 Organized by Flemming Nielson & Hanne Riis Nielson.
Accelerating Random Walks Wei Wei and Bart Selman.
DEDUCTION PRINCIPLES AND STRATEGIES FOR SEMANTIC WEB Chain resolution and its fuzzyfication Dr. Hashim Habiballa University of Ostrava.
Knowledge Repn. & Reasoning Lec. #5: First-Order Logic UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2004.
1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.
Logical Agents Chapter 7. Outline Knowledge-based agents Propositional (Boolean) logic Equivalence, validity, satisfiability Inference rules and theorem.
Some Thoughts to Consider 5 Take a look at some of the sophisticated toys being offered in stores, in catalogs, or in Sunday newspaper ads. Which ones.
Proof Methods for Propositional Logic CIS 391 – Intro to Artificial Intelligence.
Knowledge Repn. & Reasoning Lecture #9: Propositional Logic UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2005.
Computing & Information Sciences Kansas State University Wednesday, 04 Oct 2006CIS 490 / 730: Artificial Intelligence Lecture 17 of 42 Wednesday, 04 October.
Computing & Information Sciences Kansas State University Friday, 13 Oct 2006CIS 490 / 730: Artificial Intelligence Lecture 21 of 42 Friday, 13 October.
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
1 Ontological Foundations For SysML Henson Graves September 2010.
Computing & Information Sciences Kansas State University Monday, 18 Sep 2006CIS 490 / 730: Artificial Intelligence Lecture 11 of 42 Monday, 18 September.
Logical Agents. Outline Knowledge-based agents Logic in general - models and entailment Propositional (Boolean) logic Equivalence, validity, satisfiability.
Sub-fields of computer science. Sub-fields of computer science.
Knowledge Evolution Tools
Computability and Complexity
Consistency Methods for Temporal Reasoning
Complexity 6-1 The Class P Complexity Andrei Bulatov.
Compact Propositional Encoding of First Order Theories
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Presentation transcript:

Practical Partition-Based Theorem Proving for Large Knowledge Bases Bill MacCartney (Stanford KSL) Sheila A. McIlraith (Stanford KSL) Eyal Amir (UC Berkeley) Tomas Uribe (SRI) with thanks to Mark Stickel (SRI)

8/14/03Bill MacCartney, Stanford KSL 2 Motivation Goal: to enable automated reasoners to exploit the implicit structure of large knowledge bases Reasoners in big KBs face combinatorial explosion  Making headway often requires KB-specific manual tuning But, large commonsense KBs contain structure  Loosely-coupled clusters of domain knowledge Partitioning aims to speed reasoning by:  Decomposing graph structure of KB into a tree of partitions  Propagating results between partitions using message-passing  Thereby, focusing proof search and ignoring the irrelevant

8/14/03Bill MacCartney, Stanford KSL 3 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 4 The espresso machine theory (1)ok-pump  on-pump  water (2)man-fill  water (3)man-fill   on-pump (4)  man-fill  on-pump (5)water  ok-boiler  on-boiler  steam (6)  water   steam (7)  on-boiler   steam (8)  ok-boiler   steam (9)steam  coffee  hot-drink (10)steam  tea  hot-drink (11)coffee  tea A simple KB of propositional logic (we normally use first-order logic)

8/14/03Bill MacCartney, Stanford KSL 5 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 6 Automatic partitioning Step 1: construct symbol graph  Nodes are symbols in KB  Edges connect nodes which appear together in an axiom  Symbol graph captures structure of KB (1)ok-pump  on-pump  water (2)man-fill  water (3)man-fill   on-pump (4)  man-fill  on-pump (5)water  ok-boiler  on-boiler  steam (6)  water   steam (7)  on-boiler   steam (8)  ok-boiler   steam (9)steam  coffee  hot-drink (10)steam  tea  hot-drink (11)coffee  tea hot-drinktea coffee steam on-boiler ok-boiler water on-pump ok-pumpman-fill

8/14/03Bill MacCartney, Stanford KSL 7 Automatic partitioning Step 2: construct tree decomposition  Each node in tree decomposition corresponds to a tightly- connected cluster of symbols  a partition  [Amir 2001] gives algorithm which approximates the optimal decomposition by a factor O(log t) steam water hot-drinktea coffee steam on-boiler ok-boiler water on-pump ok-pumpman-fill water steam hot-drinktea coffee steam on-boiler ok-boiler water on-pump ok-pumpman-fill steam water

8/14/03Bill MacCartney, Stanford KSL 8 Automatic partitioning Step 3: generate partition graph  Allocate axioms to partitions according to vocabulary  “Link languages” are defined by shared vocabularies  Efficient reasoning depends on keeping link vocabularies small steam water hot-drinktea coffee steamon-boiler ok-boiler water on-pump ok-pumpman-fill water steam (1)  ok-pump   on-pump  water (2)  man-fill  water (3)  man-fill   on-pump (4)man-fill  on-pump (5)  water   ok-boiler   on-boiler  steam (6)water   steam (7)on-boiler   steam (8)ok-boiler   steam (9)  steam   coffee  hot-drink (10)  steam   tea  hot-drink (11)coffee  tea steam water

8/14/03Bill MacCartney, Stanford KSL 9 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 10 Start with a tree-structured partition graph Reasoning with MP MP Algorithm [Amir & McIlraith 2000]  Pass messages in L i toward goal Identify goal partition (based on matching vocabulary) Direct edges toward goal (fixing outbound link language L i for each partition) Concurrently, in each partition:  Generate consequences in L i

8/14/03Bill MacCartney, Stanford KSL 11 MP in action A simple propositional theory Theory {Q R S T U V W X Y Z} Partition 1 {Q R S T}Partition 2 {T U V W}Partition 3 {W X Y Z} {T}{T} {W}{W} Partition 1 {Q R S T}Partition 2 {T U V W}Partition 3 {W X Y Z} {T}{T} {W}{W} (1)  Q   R  T (2)  S  T (3)  S   R (4)S  R (5)  T   U   V  W (6)T   W (7)U   W (8)V   W (9)  W   X  Z (10)X  Y (11)  W   Y  Z (15)  Z (12)Q(13)U (14)V (16)  R  T (17) S  T (18) T (19)  U   V  W (20)  V  W (21)W (22)  W  Y  Z (23)  W  Z (24)Z (25)  Using partitioning, this query took just 10 resolution steps. Using set-of-support, the same query can take 28 steps. Query: Q  U  V  Z ?

8/14/03Bill MacCartney, Stanford KSL 12 Reasoning is performed locally in each partition Relevant results propagate toward goal partition Globally sound & complete … provided each local reasoner is sound & complete for L i -consequence finding [Amir & McIlraith 2000] Performance is worst-case exponential within partitions, but linear in tree structure Characteristics of MP Minimizes between-partition deduction Supports parallel processing Different reasoners in different partitions Focuses within-partition deduction

8/14/03Bill MacCartney, Stanford KSL 13 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 14 Experimental Evaluation of MP Do “real world” KBs exhibit inherent structure?  Do they have good tree decompositions (partition graphs)?  Can partition-based reasoning outperform other strategies? Experimental testbed  Theorem prover: SNARK  KB: Cyc –A subset on spatial relationships, ~750 axioms, ~150 symbols –We’re working on adding SUMO, others  Queries from outside source  Number of resolution steps used as chief performance metric  Normalized to number of steps required using no strategy

8/14/03Bill MacCartney, Stanford KSL 15 Comparison to conventional strategies Restriction strategies focus proof search  Disallow some resolution steps to speed search  Completeness issues are critical Set-of-support restriction  Place the negated query into a designated “set of support”  Allow only resolutions involving a clause from the set of support  Add newly-derived clauses to set of support Ordering restriction  Define a global ordering among predicates  Resolve on predicates in order from greatest to least  (SNARK provides a default ordering, which is arbitrary)

8/14/03Bill MacCartney, Stanford KSL 16 Experimental results: “vanilla” MP Queries

8/14/03Bill MacCartney, Stanford KSL 17 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 18 Motivation for PDO Ordered resolution can be highly efficient Voronkov: best modern resolution provers use ordering to reduce search space But success depends on having the right ordering Until now, successful orderings have been  Laboriously hand-crafted  Tailored to a specific KB  Poorly understood Insight: partitioning can induce a good ordering

8/14/03Bill MacCartney, Stanford KSL 19 How PDO works Generate a partition-derived ordering 1.Direct edges of partition graph toward goal partition 2.Perform topological sort on partitions 3.Beginning with partitions furthest from goal, progressively append symbols from each partition to ordering Use result as input for ordered resolution  (Partition graph can now be discarded)  Sound & complete PDO roughly simulates MP

8/14/03Bill MacCartney, Stanford KSL 20 Experimental results: PDO Queries

8/14/03Bill MacCartney, Stanford KSL 21 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 22 MP with focused support (MFS) Motivating intuition  Only results in the outbound link vocabulary can be propagated  So, focus within-partition reasoning on generating such results The “focused support” restriction  Initialize set S to contain any clause in the partition that includes a symbol in outbound link language.  Resolve two clauses only if one is in S and the resolved predicate is not in outbound link language. Add the resolvent to S. MFS is globally sound & complete [see paper for proof]

8/14/03Bill MacCartney, Stanford KSL 23 Experimental results: MFS Queries

8/14/03Bill MacCartney, Stanford KSL 24 Outline Background: partition-based reasoning  Algorithms for automatic partitioning of large KBs  The MP algorithm for reasoning with partitions Experimental evaluation of MP Partition-derived ordering (PDO)  Automatic alternative to hand-crafted symbol orderings MP with focused support (MFS)  Enhancing vanilla MP with a smart within-partition strategy Combinations of strategies  Can outperform set-of-support by 10x or more

8/14/03Bill MacCartney, Stanford KSL 25 Strategy combinations Combine MP, PDO, or MFS with set-of-support  Maintain a set of support at global level  Allow resolution between two clauses only if they are in the same partition and at least one of them is in the support Completeness  These combinations are in general not complete  Incompleteness sometimes revealed in practice Performance  However, combinations outperform any single strategy

8/14/03Bill MacCartney, Stanford KSL 26 Experimental results: strategy combos Queries

8/14/03Bill MacCartney, Stanford KSL 27 Experimental results: strategy combos Queries (same results, re-normalized vs. set-of-support)

8/14/03Bill MacCartney, Stanford KSL 28 Conclusions and Future Work Partitioning can speed up reasoning  Exploits implicit structure of large commonsense KBs  Reasoning becomes significantly more focused and efficient  MFS does even better by focusing reasoning within partitions Partition-derived ordering is surprisingly effective  Especially when combined with set-of-support  Automatic alternative to hand-crafted orderings Future work  Greater diversity of experimental results  Obstacle: scarcity of large KBs usable with generic FOL prover  Assessing the potential benefit of parallelization

8/14/03Bill MacCartney, Stanford KSL 29 Web Papers MacCartney, B., McIlraith, S., Amir, E. and Uribe, T., “Practical Partition-Based Theorem Proving for Large Knowledge Bases,” 18th International Joint Conference on Artificial Intelligence (IJCAI-03), Amir, E. and McIlraith, S., “Partition-Based Logical Reasoning for First-Order and Propositional Theories,” accepted for publication in Artificial Intelligence. McIlraith, S. and Amir, E., “Theorem Proving with Structured Theories,” 17th International Joint Conference on Artificial Intelligence (IJCAI-01), Amir, E., “Efficient Approximation for Triangulation of Minimum Treewidth,” 17th Conference on Uncertainty in Artificial Intelligence (UAI ’01), Amir, E. and McIlraith, S., “Solving Satisfiability using Decomposition and the Most Constrained Subproblem.” Proceedings of SAT 2001, Amir, E. and McIlraith, S., “Partition-Based Logical Reasoning,” 7 th International Conference on Principles of Knowledge Representation and Reasoning (KR ’2000), References

8/14/03Bill MacCartney, Stanford KSL 30 Thanks!

8/14/03Bill MacCartney, Stanford KSL 31 Results: automatic partitioning Partition graph is largely independent of query  But edges may need to be redirected We’re experimenting with multiple algorithms Alg 5Alg 6 Number of partitions12440 Max symbols/partition1619 Max symbols/link1417 Max axioms/partition8095 Max partitions/axiom2528 Axioms in multiple partitions152

8/14/03Bill MacCartney, Stanford KSL 32 Queries hd-q1If the pump is OK and the boiler is OK and the boiler is on, do we get a hot drink? cyc-p5If A and B are inside C, can C be inside A? cyc-p7If A and B are part of C and C is at D, where is A? cyc-p1Suppose that A is touching B and B is inside C and C is at D. Is A at D? cyc-v5A has parts B, C, and D. B has parts E, and F. Is F near A? cyc-p3If C is between A and B, and both A and B are inside D, and D is at E, is C at E? cyc-p4If C is between A and B, and both A and B are at D, is C also at D?

8/14/03Bill MacCartney, Stanford KSL 33 Automatic partitioning

8/14/03Bill MacCartney, Stanford KSL 34 MP in action Query: If the pump is OK and the boiler is OK and the boiler is on, do we get a hot drink? (1)  ok-pump   on-pump  water (2)  man-fill  water (3)  man-fill   on-pump (4)man-fill  on-pump (5)  water   ok-boiler   on-boiler  steam (6)water   steam (7)on-boiler   steam (8)ok-boiler   steam (9)  steam   coffee  hot-drink (10)  steam   tea  hot-drink (11)coffee  tea steam water (12)ok-pump (13)ok-boiler (14)on-boiler (15)  hot-drink

8/14/03Bill MacCartney, Stanford KSL 35 (1)  ok-pump   on-pump  water (2)  man-fill  water (3)  man-fill   on-pump (4)man-fill  on-pump (5)  water   ok-boiler   on-boiler  steam (6)water   steam (7)on-boiler   steam (8)ok-boiler   steam (9)  steam   coffee  hot-drink (10)  steam   tea  hot-drink (11)coffee  tea steam water (12)ok-pump (13)ok-boiler (14)on-boiler (15)  hot-drink MP in action (16)  on-pump  water (17) man-fill  water (18) water water steam (19)  ok-boiler   on-boiler  steam (20) steam (21)  steam  tea  hot-drink (22)  steam  hot-drink (23) hot-drink Using set-of-support, SNARK took 28 steps to prove this. Using partitioning, SNARK took just 11 steps. (24) 

8/14/03Bill MacCartney, Stanford KSL 36 Ongoing research Testing on more KBs  Finding good test data is a real challenge Characterizing the queries for which MP and its extensions work especially well Assessing the potential benefit of parallelization  Current implementation is serial  But reasoning within partitions can happen concurrently Distributed implementations  Demonstrating integration of heterogeneous reasoners

8/14/03Bill MacCartney, Stanford KSL 37 Recap: automatic partitioning Begin with a KB in PL or FOL Efficient reasoning depends on keeping partition sizes and link sizes small Construct symbol graph  Edges join symbols which appear together in an axiom Apply tree decomposition algorithm  We use an adaptation of min-fill Partition axioms correspondingly  Each partition has its own vocabulary  “Link languages” are defined by shared vocabulary