A Template-based Approach to Complete Predicate Refinement Tachio Terauchi (Nagoya University) Hiroshi Unno (University of Tsukuba) Naoki Kobayashi (University.

Slides:

Advertisements

Similar presentations

Model Checking Base on Interoplation

Advertisements

The behavior of SAT solvers in model checking applications K. L. McMillan Cadence Berkeley Labs.

Exploiting SAT solvers in unbounded model checking

A practical and complete approach to predicate abstraction Ranjit Jhala UCSD Ken McMillan Cadence Berkeley Labs.

Quantified Invariant Generation using an Interpolating Saturation Prover Ken McMillan Cadence Research Labs TexPoint fonts used in EMF: A A A A A.

Exploiting SAT solvers in unbounded model checking K. L. McMillan Cadence Berkeley Labs.

Automated Theorem Proving Lecture 1. Program verification is undecidable! Given program P and specification S, does P satisfy S?

Software Model Checking with SMT Ken McMillan Microsoft Research TexPoint fonts used in EMF: A A A A A.

Inference Rules Universal Instantiation Existential Generalization

SLD-resolution Introduction Most general unifiers SLD-resolution

Predicate Abstraction and Canonical Abstraction for Singly - linked Lists Roman Manevich Mooly Sagiv Tel Aviv University Eran Yahav G. Ramalingam IBM T.J.

Satisfiability Modulo Theories (An introduction)

UIUC CS 497: Section EA Lecture #2 Reasoning in Artificial Intelligence Professor: Eyal Amir Spring Semester 2004.

© Anvesh Komuravelli Quantified Invariants in Rich Domains using Model Checking and Abstract Interpretation Anvesh Komuravelli, CMU Joint work with Ken.

Automating Relatively Complete Verification of Higher-Order Functional Programs Hiroshi Unno (University of Tsukuba) Tachio Terauchi (Nagoya University)

Rigorous Software Development CSCI-GA Instructor: Thomas Wies Spring 2012 Lecture 13.

Program Analysis as Constraint Solving Sumit Gulwani (MSR Redmond) Ramarathnam Venkatesan (MSR Redmond) Saurabh Srivastava (Univ. of Maryland) TexPoint.

Relatively Complete Verification of Higher- Order Programs (via Automated Refinement Type Inference) Tachio Terauchi Nagoya University TexPoint fonts used.

SAT and Model Checking. Bounded Model Checking (BMC) A.I. Planning problems: can we reach a desired state in k steps? Verification of safety properties:

Probabilistic CEGAR* Björn Wachter Joint work with Holger Hermanns, Lijun Zhang TexPoint fonts used in EMF. Read the TexPoint manual before you delete.

Constraint Logic Programming Ryan Kinworthy. Overview Introduction Logic Programming LP as a constraint programming language Constraint Logic Programming.

Nikolaj Bjørner Microsoft Research Lecture 3. DayTopicsLab 1Overview of SMT and applications. SAT solving, Z3 Encoding combinatorial problems with Z3.

Interpolants [Craig 1957] G(y,z) F(x,y)

Counterexample-Guided Focus TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A A A AA A A Thomas Wies Institute of.

Solving Partial Order Constraints for LPO termination.

Inference and Resolution for Problem Solving

Search in the semantic domain. Some definitions atomic formula: smallest formula possible (no sub- formulas) literal: atomic formula or negation of an.

Formal Verification Group © Copyright IBM Corporation 2008 IBM Haifa Labs SAT-based unbounded model checking using interpolation Based on a paper “Interpolation.

Last time Proof-system search ( ` ) Interpretation search ( ² ) Quantifiers Equality Decision procedures Induction Cross-cutting aspectsMain search strategy.

Computing OverApproximations with Bounded Model Checking Daniel Kroening ETH Zürich.

1 Abstraction Refinement for Bounded Model Checking Anubhav Gupta, CMU Ofer Strichman, Technion Highly Jet Lagged.

272: Software Engineering Fall 2012 Instructor: Tevfik Bultan Lecture 4: SMT-based Bounded Model Checking of Concurrent Software.

C&O 355 Lecture 2 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.

Constraint-based Invariant Inference. Invariants Dictionary Meaning: A function, quantity, or property which remains unchanged Property (in our context):

Binary Decision Diagrams (BDDs)

Proof Systems KB |- Q iff there is a sequence of wffs D1,..., Dn such that Dn is Q and for each Di in the sequence: a) either Di is in KB or b) Di can.

On Bridging Simulation and Formal Verification Eugene Goldberg Cadence Research Labs (USA) VMCAI-2008, San Francisco, USA.

SAT and SMT solvers Ayrat Khalimov (based on Georg Hofferek‘s slides) AKDV 2014.

Refinement Type Inference via Horn Constraint Optimization Kodai Hashimoto and Hiroshi Unno (University of Tsukuba, Japan)

Solvers for the Problem of Boolean Satisfiability (SAT) Will Klieber Aug 31, 2011 TexPoint fonts used in EMF. Read the TexPoint manual before you.

Daniel Kroening and Ofer Strichman 1 Decision Procedures An Algorithmic Point of View BDDs.

CS Introduction to AI Tutorial 8 Resolution Tutorial 8 Resolution.

Symbolic Execution with Abstract Subsumption Checking Saswat Anand College of Computing, Georgia Institute of Technology Corina Păsăreanu QSS, NASA Ames.

SAT 2009 Ashish Sabharwal Backdoors in the Context of Learning (short paper) Bistra Dilkina, Carla P. Gomes, Ashish Sabharwal Cornell University SAT-09.

© Copyright 2008 STI INNSBRUCK Intelligent Systems Propositional Logic.

Automated discovery in math Machine learning techniques (GP, ILP, etc.) have been successfully applied in science Machine learning techniques (GP, ILP,

Heuristics for Efficient SAT Solving As implemented in GRASP, Chaff and GSAT.

1 Propositional Logic Limits The expressive power of propositional logic is limited. The assumption is that everything can be expressed by simple facts.

CS357 Lecture 13: Symbolic model checking without BDDs Alex Aiken David Dill 1.

Bounded Model Checking A. Biere, A. Cimatti, E. Clarke, Y. Zhu, Symbolic Model Checking without BDDs, TACAS’99 Presented by Daniel Choi Provable Software.

Logical Agents Chapter 7. Outline Knowledge-based agents Propositional (Boolean) logic Equivalence, validity, satisfiability Inference rules and theorem.

SAT Solving As implemented in - DPLL solvers: GRASP, Chaff and

Knowledge Repn. & Reasoning Lecture #9: Propositional Logic UIUC CS 498: Section EA Professor: Eyal Amir Fall Semester 2005.

© Anvesh Komuravelli Spacer Model Checking with Proofs and Counterexamples Anvesh Komuravelli Carnegie Mellon University Joint work with Arie Gurfinkel,

Satisfiability Modulo Theories and DPLL(T) Andrew Reynolds March 18, 2015.

Counterexample-Guided Abstraction Refinement By Edmund Clarke, Orna Grumberg, Somesh Jha, Yuan Lu, and Helmut Veith Presented by Yunho Kim Provable Software.

The software model checker BLAST Dirk Beyer, Thomas A. Henzinger, Ranjit Jhala, Rupak Majumdar Presented by Yunho Kim TexPoint fonts used in EMF. Read.

Presentation Title 2/4/2018 Software Verification using Predicate Abstraction and Iterative Refinement: Part Bug Catching: Automated Program Verification.

SMT-Based Verification of Parameterized Systems

Solving Linear Arithmetic with SAT-based MC

Automating Induction for Solving Horn Clauses

Introduction to Software Verification

MoCHi: Software Model Checker for a Higher-Order Functional Language

Relatively Complete Refinement Type System for Verification of Higher-Order Non-deterministic Programs Hiroshi Unno (University of Tsukuba) Yuki Satake.

Inferring Simple Solutions to Recursion-free Horn Clauses via Sampling

Over-Approximating Boolean Programs with Unbounded Thread Creation

Decision Procedures An Algorithmic Point of View

Compact Propositional Encoding of First Order Theories

Predicate Abstraction

The Satisfiability Problem

Presentation transcript:

A Template-based Approach to Complete Predicate Refinement Tachio Terauchi (Nagoya University) Hiroshi Unno (University of Tsukuba) Naoki Kobayashi (University of Tokyo) TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A AA A AA

Software Model Checking Automated Verification of Infinite State Systems Data : Infinite (e.g. Integers) Control : Finite, PDS (aka CFL reachability) – SLAM, BLAST, IMPACT, ARMC, Terminator, etc.

SMC Internals FOL predicate abstraction of infinite data – E.g. “x < y” = set of states ½ where ½ (x) < ½ (y) – Exploits advances in SAT/SMT solving CEGAR to automatically refine abstraction – Inference of appropriate FOL predicates Same design also used in “higher-order SMC”: Depcegar, MoCHi, HMC, etc.

Predicates: x = 0, y = 0, x = yPredicates: x = 0, y = 0 Example x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); > x= 0 Æ y = 0 > > > > ) x = y

Predicates: x = 0, y = 0, x = y x= 0 Æ y = 0 Example x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); > x = y

Problem A refinement can be any predicates that refute the c.ex. – Not unique in general We got lucky by choosing x = y – Could have chosen x = 1 instead And then choose x = 2, x = 3, … ad infinitum

Predicates: x = 0, y = 0, x = 1 Predicates: x = 0, y = 0 Example failing to converge x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); > x= 0 Æ y = 0 > > > > ) x = y

> Predicates: x = 0, y = 0, x = 1Predicates: x = 0, y = 0, x = 1, x = 2 Example failing to converge x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); > x= 0 Æ y = 0 > x = 1 > > ) x = y

Solution : Complete SMC Def: Let X be a FOL theory (e.g., X = QF_UFLRA). SMC is said to be complete wrt. X when 9 preds µ X. P ² preds safe, SMC(P) returns “safe”

Complete SMC in CEGAR (1/2) [Jhala,McMillan TACAS’06] Let X be some FOL theory – “theory” : set of (normalized) formulas Let L 0, L 1, … µ X s.t. – Each L i is finite – For each i, L i µ L i+1 –  i 2 ! L i = X E.g., – X = QF_UFLRA – L i = { µ 2 X| atomic terms in µ are of size · i }

Complete SMC in CEGAR (2/2) [Jhala,McMillan TACAS’06] Init L := some L i 2 {L 0, L 1, … } Repeat Run SMC but restricting refinements to L If proved safe, exit with “safe” If fail to prove, let ¼ = counterexample – Find L j s.t. L µ L j and L j contains a refinement for ¼ » Exit with “unsafe” if no such L j exists – Set L := L j and repeat

Challenges 1.Given L and c.ex. ¼, quickly find preds µ L s.t. ¼ ² preds safe 2.Find L j s.t. L µ L j Æ 9 preds µ L j. ¼ ² preds safe – This can be done by existing methods

Challenges 1.Given L and c.ex. ¼, quickly find preds µ L s.t. ¼ ² preds safe Problem is obviously decidable – Because L is finite – “quickly” is the issue Existing method [Jhala,McMillan TACAS’06] only handles limited theory (QF_UFDL)

Overview of c.ex. refinement Refinement reduces to inferring Ã (y) s.t. µ (x,y) ) Ã (y), Ã (y) ) Á (y,z), and µ (x,y), Á (y,z), Ã (y) 2 X µ (x,y) : “what is true about x,y at the program point” Á (y,z) : “what must hold true about y,z after the point to refute the c.ex.” Ã (y) : “sufficient fact about y at the point to refute the c.ex.” So, to do complete refinement Just restrict Ã (y) to the current L when doing this

A Template-based Approach (QF_LRA) Template T: QF_LRA formula with bounded coefficient variables – E.g. c 0 x + c 2 y + c 3 · 0 Æ c 4 x + c 5 y + c 6 · 0 Ç c 5 x + c 6 y + c 7 < 0 Each c is associated with bound B c µ fin Z Idea: Let L = the instances of T and use “increasingly larger” T’s for L 0 µ L 1 µ …

Searching for Refinements in T (1/3) Problem: Decide if 9 c 0 2 B 0,…,c n 2 B n. 8 x 0,…,x m.( µ ) T) Æ (T ) Á ) 9 c 0 2 B 0,…,c n 2 B n. 8 x 0,…,x m. ª (c 0,…,c n,x 0,…,x m ) ª is a non-linear arithmetic formula over rationals – linear on x’s with coefficients on c’s

Searching for Refinements in T (2/3) 9 c 0 2 B 0,…,c n 2 B n. 8 x 0,…,x m. ª (c 0,…,c n,x 0,…,x m ) ª is a non-linear arithmetic formula over rationals – linear on x’s with coefficients on c’s 1.Convert ª to cnf Æ j Ã j –Ã j of the form : (Ax · a Æ Bx < b) s.t. a,b,A,B are over c’s 2.Apply Motzkin’s transposition theorem to each Ã j Ax · a Æ Bx < b is unsatisfiable iff 9 r ¸ 0,p ¸ 0. rA + pB = 0 Æ (ra + pb < 0 Ç (p != 0 Æ ra + pb · 0))

Searching for Refinements in T (3/3) Now, the problem is of the form 9 c 0 2 B 0,…,c n 2 B n,r ¸ 0,p ¸ 0. © (r,p,c 0,…,c n ) Existential formula (i.e., got rid of 8 x 0,…x m ) © is non-linear arithmetic formula – linear on r and p’s with coefficients on c’s Prop: Let Á be a satisfiable QF_LIA formula with n vars, m literals, and coefficients bounded by k Then, there is a solution of Á bounded by 2 log(n+2) + m(log(m) + log(k)) Bit-blast and reduce ① to SAT ①

This is complete for QF_LRA Going beyond QF_LRA – QF_UFLRA – QF_AUFLRA

QF_UFLRA UF – Function symbols f 1, f 2, …, f k – For each f j of arity n 8 x 1 …x n,y 1 …y n. Æ i x i = y i ) f j (x 1 …x n ) = f j (y 1 …y n ) Useful for conservatively modeling operators like :: £

L-restricting UF 1.Incorporate UF terms in templates as follows c 0 f(c 1 x+c 2 y+c 3 +c 4 g(c 5 x + c 6 y+c 7 )) + … 2.Apply Ackermann expansion For each UF subterm f(t) 2 µ, let x f(t) be a fresh var. Let Á = Æ f(t1),f(t2) 2 µ ½ (t1) = ½ (t2) ) x f(t1) = x f(t2) ½ replaces f(t) by x f(t) Prop: QF_UFLRA ² µ iff QF_LRA ² Á ) ½ ( µ ) Idea from [Beyer et al. VMCAI’07]

QF_AUFLRA 8 a,e,i. rd(wr(a,i,e),i) = e 8 a,e,i,j. i != j ) rd(wr(a,i,e),j) = rd(a,j) 8 a,b. a != b ) rd(a,diff(a,b)) != rd(b,diff(a,b)) Useful for modeling pointers – QF_AUFLRA can be reduced to QF_UFLRA See, e.g., [Totla, Wies POPL’13]

This sounds too easy… Does it really scale?

No it doesn’t scale I was oversimplifying the problem – Infer Ã (y) s.t. µ (x,y) ) Ã (y) and Ã (y) ) Á (y,z) c.ex. refinement in reality: – Infer Ã 1 (y 1 ), Ã 2 (y 2 ), … Ã n (y n ) s.t. µ 1 (x 1,y 1 ) ) Ã 1 (y 1 ) Æ µ 2 (x 2,y 2 ) Æ Ã 1 (x 2 ) ) Ã 2 (y 2 ) Æ … µ n (x n,y n ) Æ Ã 1 (x n ) ) Á (y n,z)

So, to infer L-restricted refinement Need to restrict Ã 1 (y 1 ), Ã 2 (y 2 ), … Ã n (y n ) to L Lots of templates! T 1, T 2, … T n – Proportional to the size of c.ex. – Lots of non-linear terms in the constraints Doesn’t scale even on state-of-the-art SAT solver (or SMT solver for non-linear real arithmetic)

Solution (Informal) Key Observation: counterexample in SMC (and constraints solved to refute it) is always repetitions of a fixed set of patterns. – Use the observation to L-restrict only a few Ã ’s and still achieve complete refinement

Example (1/2) c.ex. are of the form µ init (x,y) ) Ã 1 (x 1,y 1 ) Æ Ã 1 (x 1, y 1 ) Æ µ loop (x 1,y 1,x 2,y 2 ) ) Ã 2 (x 2, y 2 ) Æ Ã 2 (x 2, y 2 ) Æ µ loop (x 2,y 2,x 3,y 3 ) ) Ã 3 (x 3, y 3 ) Æ … Ã n (x n,y n ) ) Á (x n,y n ) x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); µ init (x,y), x = 0 Æ y = 0 µ loop (x,y,x’,y’), x < 100 Æ x’ = x + 1 Æ y’ = y + 1 Á (x,y), x ¸ 100 Æ x = y

Example (2/2) Theorem: The following strategy is sufficient for complete predicate refinement: 1.Pick some constant k > 0 2.Infer L-restricted refinement for i £ k-th Ã ’s (i.e., Ã i £ k ) 3.Infer unrestricted refinement for other Ã ’s (e.g., via interpolation) This reduces to [Jhala,McMillan TACAS’06] when k = 1 Larger k -> less L-restriction – Proof: On board

Formalization Key Observation: Let P be a program. There exists a set of Horn-clause-like rules R s.t. for any c.ex. ¼ of SMC(P), the set of constraints solved to refute ¼ is an acyclic instance of R − P 1 (x) Æ … Æ P n (x) Æ µ (x,y) ) Q(y) − P 1 (x) Æ … Æ P n (x) Æ µ (x,y) ) Á (x,y) P, Q, … : predicate variables Copies of rules from R with fresh renaming of pred. vars s.t. there is no cycle P ) … ) P

Example x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); x = 0 Æ y = 0 ) P(x,y) P(x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q(x’,y’) Q(x,y) ) P(x,y) P(x,y) Æ x ¸ 100 ) x = y x = 0 Æ y = 0 ) P 1 (x,y) P 1 (x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q 1 (x’,y’) Q 1 (x,y) ) P 2 (x,y) P 2 (x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q 2 (x’,y’) Q 2 (x,y) ) P 3 (x,y) P 3 (x,y) Æ x ¸ 100 ) x = y = R E.g. consts( ¼ ) =

Example x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); x = 0 Æ y = 0 ) P(x,y) P(x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q(x’,y’) Q(x,y) ) P(x,y) P(x,y) Æ x ¸ 100 ) x = y = R The observation also holds for higher-order SMC (e.g., Depcegar, MoCHi, HMC), and SMC for concurrent programs (e.g., Threader) Somewhat more general than [Grebenshchikov et al. PLDI’12] – Only says that c.ex. are instances of the rules

Bounded Patterns Def: Set of bounded patterns A of R is a finite set of acyclic instances of R – Can view each element of A as a “combined” rule Def: Bounded patterns A of R is partitioning if for any acyclic instance G of R, there exists instance A’ of A s.t. G and A’ are isomorphic – E.g., R is a partitioning bounded patterns of R – So is any A [ R where A is a bounded pattern of R

Example x := 0; y := 0; while (x < 100) { x++; y++; } assert (x = y); x = 0 Æ y = 0 ) P(x,y) P(x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q(x’,y’) Q(x,y) ) P(x,y) P(x,y) Æ x ¸ 100 ) x = y = R A = R [ {{ P 0 (x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q 0 (x’,y’), Q 0 (x,y) ) P 1 (x,y), P 1 (x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q 1 (x’,y’), Q 1 (x,y) ) P 2 (x,y), P 2 (x,y) Æ x < 100 Æ x’ = x+1 Æ y’ = y+1 ) Q 2 (x’,y’), Q 2 (x,y) ) P 3 (x,y) }} A’ : On board

L-restriction at Boundaries Def: Let A’ be a partition of c.ex. G by A. Boundaries of partition A’ are predicate variables that appear in more than one element of A’ Theorem: L-restriction at boundaries is sufficient for complete predicate refinement – Proof: Preds at boundaries determine the preds at internal nodes. So, L-restr. at boundaries -> finite # of possible refinements for internals

How to pick bounded partitioning A simple strategy: View G as dag of P’s, L-restrict each i £ k-th P (i.e., P i £ k ) from a root where k is some constant and i = 1,2,3,… Reduces to [Jhala,McMillan TACAS’06] when k = 1 Larger k -> less L-restriction Theorem: above ensures bounded partitioning – Proof: Because there are only a finite # of dags generated by R of path lengths bounded by k

Conclusion Complete predicate refinement for the theory of QF_AUFLRA – Template-based Bounded coefficients allow reduction to SAT – Extends L-restricted refinement [Jhala,McMillan TACAS’06] Exploits the observation that c.ex. are repetitions of some patterns Only L-restrict predicate variables at boundaries of bounded patterns Horn-clause-like rules