Automatically Inferring Temporal Properties for Program Evolution. Jinlin Yang and David Evans. 15th IEEE International Symposium on Software Reliability Engineering.


Automatically Inferring Temporal Properties for Program Evolution
Jinlin Yang and David Evans
15th IEEE International Symposium on Software Reliability Engineering
5 November 2004, Saint-Malo, France
University of Virginia Computer Science

2 Temporal Properties
Constrain the ordering of events
–Essential for program correctness (file open before read)
Reveal important differences between programs: S → P → not a chef
But hard for humans to document correctly [Holzmann's FSE 2002 keynote]
Can we infer useful temporal properties automatically? This talk argues: Yes!

3 Dynamically Inferring Properties
Inherently unsound: guessing properties of all executions by looking at a small number
Value-based invariants
–Daikon [Ernst, TSE, Feb 01]
Temporal properties
–Mining specifications [Ammons, POPL 02]
–Extracting component interfaces [Whaley, ISSTA 02]
We focus on relationships between 2 or 3 events: automation, scalability

4 System Overview
Program Version 1 → Dynamic Inference → Inferred Properties 1
Program Version 2 → Dynamic Inference → Inferred Properties 2
Inferred Properties 1 + Inferred Properties 2 → Difference Analyzer → Shared Properties, Different Properties

5 Property Inference
The Dynamic Inference step expands into a pipeline:
Program → Instrumentation → Instrumented Program
Instrumented Program + Test Suite → Testing → Execution Traces
Execution Traces + Candidate Property Patterns → Inference → Inferred Properties

6 Candidate Property Patterns
Response pattern [Dwyer, ICSE '99]: [-P]* (P [-S]* S [-P]*)*
–Filtered version (all non-P/S events removed): S*(P+S+)*
SPPSPS ✓  SPSP ✗
Too weak for our purposes
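The filtered Response check above can be sketched as a regular-expression match; a minimal illustration in which each trace is reduced to a string over {P, S}:

```python
import re

# Filtered Response pattern from the slide: S*(P+S+)*
# (all events other than P and S removed from the trace).
response = re.compile(r"S*(P+S+)*")

# SPPSPS satisfies it even though two Ps occur before an S,
# which is why Response is too weak to capture alternation.
print(response.fullmatch("SPPSPS") is not None)  # True
# SPSP ends with a P that is never followed by an S.
print(response.fullmatch("SPSP") is not None)    # False
```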

7 Partial Order of Patterns
From strictest to weakest:
–Alternating (PS)*
–MultiEffect (PS+)*, EffectFirst S*(PS)*, MultiCause (P+S)*
–OneCause S*(PS+)*, OneEffect S*(P+S)*, CauseFirst (P+S+)*
–Response S*(P+S+)*
3 primitive patterns: OneCause, OneEffect, CauseFirst
4 derived patterns: Alternating, MultiEffect, EffectFirst, MultiCause (conjunctions of primitives)

8 Inferring Properties
For all pairs of events, P and S:
–Check which (if any) of OneCause, CauseFirst, and OneEffect are satisfied
–If more than one primitive is satisfied, infer the strictest property:
MultiEffect (PS+)* = CauseFirst ∧ OneCause
MultiCause (P+S)* = CauseFirst ∧ OneEffect
EffectFirst S*(PS)* = OneCause ∧ OneEffect
Alternating (PS)* = all three primitives

9 Traces Example
For any two events, determine the strictest pattern satisfied by all traces.
Trace 1: PSPS  Trace 2: PPS
Primitives checked: CauseFirst (P+S+)*, OneCause S*(PS+)*, OneEffect S*(P+S)*

10 Traces Example
For any two events, determine the strictest pattern satisfied by all traces.
Trace 1 (PSPS): CauseFirst ✓, OneCause ✓, OneEffect ✓
Trace 2 (PPS): OneCause ✗ (two Ps before an S)

11 Traces Example
For any two events, determine the strictest pattern satisfied by all traces.
Trace 1 (PSPS): CauseFirst ✓, OneCause ✓, OneEffect ✓
Trace 2 (PPS): CauseFirst ✓, OneCause ✗, OneEffect ✓
All traces: CauseFirst ∧ OneEffect ⇒ MultiCause

12 Implementation
Pipeline: Program → Instrumentation → Instrumented Program; + Test Suite → Testing → Execution Traces; + Candidate Property Patterns → Inference → Inferred Properties
Instrumentation:
–Automated for Java using JRat: method entry/exit events
–Manual for C code or program output (today's examples)
Prototype (described in paper): 900 lines of Perl
New implementation: 8K lines of Java
–Improved performance
–Analyses, ranking heuristics
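The method entry/exit instrumentation the slide mentions can be illustrated with a minimal sketch. The real tool instruments Java bytecode with JRat; this Python decorator and the `open_file` name are illustrative assumptions only.

```python
import functools

# Global event trace, in the style of the execution traces
# consumed by the inference step.
trace = []

def record_events(fn):
    """Record an entry event before and an exit event after each call."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        trace.append(f"ENTER {fn.__name__}")
        try:
            return fn(*args, **kwargs)
        finally:
            trace.append(f"EXIT {fn.__name__}")
    return wrapper

@record_events
def open_file():
    pass  # stand-in for real program logic

open_file()
print(trace)  # ['ENTER open_file', 'EXIT open_file']
```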

13 Experiments
Hypotheses:
–We can automatically extract interesting temporal properties
–Differences in inferred temporal properties among multiple versions can reveal interesting changes
Target programs:
–Tour bus simulator (8 student submissions)
–OpenSSL (versions 0.9.6 through 0.9.7d)

14 Tour Bus Simulator
The bus and each passenger are separate threads
Assignment in a Fall 2003 graduate-level course (before we started this project)
8 submissions from the instructor (all believed to be correct)

15 Testing
> cville_bus –N 2 –C 1 –T 2
Bus waiting for trip 1
Passenger 0 boards
Bus drives around Charlottesville
Passenger 0 exits
Bus waiting for trip 2
Passenger 1 boards
Bus drives around Charlottesville
Passenger 1 exits
Bus stops for the day
N: the number of people; C: the capacity of the bus; T: the number of trips
Executed each submission with 100 randomly generated inputs, where:
–20 < C ≤ 40
–C+1 ≤ N ≤ 2C
–1 ≤ T ≤ 10
Event traces extracted from program output (no need to instrument the program)
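The random input generation described above might be sketched as follows. The `cville_bus` flags mirror the example invocation on the slide; the generator itself is an assumption about how the 100 inputs were drawn.

```python
import random

def random_bus_input(rng):
    """Draw one test input within the slide's constraints."""
    c = rng.randint(21, 40)       # 20 < C <= 40
    n = rng.randint(c + 1, 2 * c) # C+1 <= N <= 2C (more people than one busload)
    t = rng.randint(1, 10)        # 1 <= T <= 10
    return f"cville_bus -N {n} -C {c} -T {t}"

rng = random.Random(42)
cmds = [random_bus_input(rng) for _ in range(100)]
print(cmds[0])
```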

16 Differences Reveal Problems
Inferred patterns, 7 correct versions vs. 1 faulty version:
–Alternating: wait → drives (correct versions only)
–MultiEffect: drives → boards, wait → exits, wait → boards (correct); wait → drives, wait → boards (faulty)
–MultiCause: boards → drives (both)
–CauseFirst: boards → exits (correct); boards → exits, drives → exits, wait → exits (faulty)
In the faulty version:
–wait → drives is not Alternating: the bus drives around before all passengers exit; bug in locking code (misplaced synchronization)
–drives → exits and wait → exits are not MultiEffect: the bus drives around twice before letting passengers exit; missing synchronization

17 OpenSSL
Widely used implementation of the Secure Sockets Layer protocol
6 versions (0.9.6, 0.9.7, 0.9.7a–d) released between Sept 2000 and March 2004
We focus on the handshake protocol
–38 different event types

18 [Diagram: SSL handshake between Client and Server]
Server handshake events: SR_CLNT_HELLO, SW_SRVR_HELLO, SW_CERT, SW_KEY_EXCH, SW_CERT_REQ, SW_SRVR_DONE, SW_FLUSH, SR_CERT, SR_KEY_EXCH, SR_CERT_VRFY, SR_FINISHED, SW_CHANGE, SW_FINISHED, SW_FLUSH
States BEFORE+ACCEPT and OK mark the start and end of the handshake

19 Testing
Manually instrumented the server to record handshake events
Executed each version of the server with 1000 randomly generated clients
–Client modified to advance to a randomly selected state with 5% probability
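The randomized client can be sketched as below: with 5% probability it jumps to a randomly selected handshake state instead of the correct next one. The `STATES` list is an illustrative subset, not the full 38-event alphabet, and the function is an assumption about the client's structure.

```python
import random

# Illustrative subset of handshake states, in protocol order.
STATES = ["CLNT_HELLO", "SRVR_HELLO", "CERT", "KEY_EXCH", "FINISHED"]

def next_state(current_index, rng):
    # With 5% probability, jump to a randomly selected state
    # (the faulty-client behaviour described on the slide).
    if rng.random() < 0.05:
        return rng.randrange(len(STATES))
    # Otherwise advance to the correct next handshake state.
    return min(current_index + 1, len(STATES) - 1)

rng = random.Random(0)
path = [0]
for _ in range(10):
    path.append(next_state(path[-1], rng))
print([STATES[i] for i in path])
```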

20 Inferred Alternating Patterns
Versions compared: 0.9.6, 0.9.7, 0.9.7a, 0.9.7b, 0.9.7c, 0.9.7d
–SR_KEY_EXCH → SR_CERT_VRFY: documented change (ignore unrequested client certificates)
–SW_CERT → SW_KEY_EXCH: fixed bug causing server crashes
–SW_SRVR_DONE → SR_CERT: race condition (present in all versions)
7 alternating patterns are the same across all versions

21 Partitioning Traces
All traces partition into:
–Correct Clients (never jump to a random state): follow the SSL specification
–Faulty Clients, No Server Error: missing Alternating patterns (server allows a handshake cycle)
–Faulty Clients, Server Error: segmentation fault (versions before 0.9.7d)

22 Performance
Prototype implementation (described in paper):
–Up to 3–4 minutes for these examples (Pentium 4, 3 GHz, 1 GB RAM)
–Too slow (several days) for larger programs
–Scales approximately as trace length × events²
New implementation:
–Scales approximately as trace length × events
–Analyzes these experiments in < 0.5 s
Bus: 102 events, 100 traces, 222 events/trace: 0.4 s
OpenSSL: 35 events, 1000 traces, 18 events/trace: 0.3 s
–Feasible to analyze large systems: thousands of different events, traces with millions of events
–Preliminary results from experiments with JBoss

23 Summary of Experiments
Useful in program evolution:
–Revealed interesting changes in OpenSSL
–Identified unexpected differences
Revealed bugs:
–Tour bus: identified the faulty implementation; multiple implementations are rare, but multiple representations are common (e.g. design, model, code)
–OpenSSL: differences between versions revealed changes; flaws found by partitioning traces

24 Future Work / Research Questions
Can we make this feasible in practice?
–Automatically identify interesting events
–Heuristics to identify important patterns
–Scalability and automation
Can we make it more useful?
–More expressive patterns: more events, combined with data flow
–Understand the impact of different testing strategies

25 Conclusion
Automatically inferring temporal properties is feasible
Even very simple property patterns reveal interesting program properties

26 Questions? This work is funded in part by the National Science Foundation. Thanks: Marty Humphrey for providing the student submissions, Chengdu Huang for help with OpenSSL, Joel Winstead, the anonymous reviewers, and anonymous CS650 students for writing buggy code.