Empirical Methods for Benchmarking High Dependability:
The Role of Defect Seeding

University of Southern California, Center for Software Engineering (USC-CSE)
CSE Research Review 2002, 3/11/2002
The Role of Defect Seeding in HDC Benchmarking

Traditional Defect Seeding
–Assumptions and challenges
Potential New Approaches
–Change histories as a source of seeded defects
–N-version development as a source of seeded defects
–Connectors and wrappers
–Randomized benchmarks
–Hybrids of new and traditional approaches
Issues for Discussion
–Mapping of approaches to dependability categories
–Alternative concepts and approaches
–Special application challenges
Traditional Defect Seeding

Insert N defects in the system under test (SUT): "be-bugging"
Run tests; find M seeded defects and K unseeded defects
Estimate the remaining unseeded defects as R = K(N - M)/M, derived from K/(K + R) = M/N
Example:
–Seed the SUT with N = 10 defects
–Tests find M = 6 of the 10 seeded defects and K = 3 unseeded defects
–The estimate is that R = 3(10 - 6)/6 = 2 unseeded defects remain
–The claim is that the unseeded defects found represent 60% of the 5 previously undetected defects in the SUT, since 3/(3 + 2) = 0.6
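The estimator is a one-line calculation; the following minimal sketch (Python, not part of the original slides) implements it and reproduces the example's numbers:

```python
# Seeded-defect estimator from the slide: R = K * (N - M) / M,
# derived from the assumption K / (K + R) = M / N, i.e. that seeded
# and unseeded defects are equally detectable.

def estimate_remaining_defects(n_seeded, m_seeded_found, k_unseeded_found):
    """Estimate the number of unseeded defects still in the SUT."""
    if m_seeded_found == 0:
        raise ValueError("no seeded defects found; estimator is undefined")
    return k_unseeded_found * (n_seeded - m_seeded_found) / m_seeded_found

# The slide's example: N = 10, M = 6, K = 3 -> R = 3 * 4 / 6 = 2,
# so the 3 unseeded defects found are 3 / (3 + 2) = 60% of the
# estimated 5 previously undetected defects.
print(estimate_remaining_defects(10, 6, 3))  # 2.0
```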
Assumptions of Defect Seeding

1. The seeded defects are representative of existing defects.
–Seeding is mostly done by developers, whose blind spots miss many sources of defects.
2. The test profile is representative of the operational profile.
–The developers' knowledge of actual usage patterns is generally highly imperfect.
3. The SUT is developed without knowledge of the seeding profile.
–If the seeded defects become well known, there are risks of consciously or unconsciously tailoring the tool to look good on the seeded-defect sample.
4. The source code is available for defect seeding.
–As systems become increasingly COTS-based, this difficulty increases.
Change Histories as a Source of Seeded Defects

Use fixes from earlier versions of the SUT as sources of seeded defects (a sketch follows below)
These are representative of existing defects, since they were once real defects
The problem is that they were also the defects most detectable by current techniques
Version changes may be complex combinations of defect fixes, patches, and general upgrades
–This makes preparing an appropriately seeded SUT more difficult
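As an illustration of mining a change history for seedable defects, here is a hedged sketch (Python driving git, not from the original slides); the "fix:" commit-message convention and the revert-based re-introduction of defects are assumptions about one plausible workflow:

```python
# Hypothetical sketch: harvest fix commits from a git history and
# re-introduce a fixed defect into the SUT as a seeded defect.
import subprocess

def candidate_fix_commits(repo_path):
    """List commits whose messages mark them as defect fixes
    (assumes the project tags fixes with 'fix:' in the subject)."""
    log = subprocess.run(
        ["git", "-C", repo_path, "log", "--oneline", "--grep=fix:"],
        capture_output=True, text=True, check=True,
    )
    return [line.split()[0] for line in log.stdout.splitlines()]

def seed_defect(repo_path, fix_commit):
    """Re-introduce a previously fixed defect by reverting its fix.

    'git revert --no-commit' applies the inverse of the fix patch,
    turning the historical defect back into a seeded defect.
    """
    subprocess.run(
        ["git", "-C", repo_path, "revert", "--no-commit", fix_commit],
        check=True,
    )
```

Complex version changes are exactly where this breaks down: when a fix commit also carries patches or upgrades, the reverse patch may not apply cleanly, which is the difficulty the slide notes.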
N-Version Development

Generate representative defects by giving the specs to different programmers and generating a family of SUT versions
–Use the versions as a sample space for the defect population
–Estimate non-seeded defects from the number of defects caused by seeds vs. non-seeds
–Calculate dependability estimators over samples with respect to the estimated population
–Comparative analysis of the defects found in the SUT versions can also generate estimates of the likely number of residual defects (see the sketch below)
Studies of N-version programming have shown that it is an imperfect source of independent implementations, and it can also be expensive, but it appears to be worth exploring
–Program mutations are a similar source of defect-seeding alternatives
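One concrete way to turn such comparative analysis into a residual-defect estimate, offered here as an assumption about the intended estimator rather than something the slide specifies, is two-sample capture-recapture (the Lincoln-Petersen estimator): treat the defect sets found by two independent testing activities as two samples and estimate the total population from their overlap.

```python
# Two-sample capture-recapture (Lincoln-Petersen) over defect sets.
# If independent activities find sets a and b, the total defect
# population is roughly |a| * |b| / |a & b|.

def lincoln_petersen(defects_a, defects_b):
    """Estimate total defects from two independent defect samples."""
    overlap = len(defects_a & defects_b)
    if overlap == 0:
        raise ValueError("no common defects; estimator is undefined")
    return len(defects_a) * len(defects_b) / overlap

# Hypothetical example: activity A finds 6 defects, activity B finds
# 6 defects, and 3 are common -> estimated population 6 * 6 / 3 = 12.
a = {"d1", "d2", "d3", "d4", "d5", "d6"}
b = {"d4", "d5", "d6", "d7", "d8", "d9"}
print(lincoln_petersen(a, b))  # 12.0
```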
Randomized Benchmarks

Seed defects according to a known distribution (e.g., uniform)
–Compare the sample distribution to the seed distribution
–Parameter estimators may be used to determine the actual distribution and to estimate dependability
–Randomization helps avoid gaming of the evaluation criteria
Non-parametric approach
–Permutations and combinations of defect seeds
–Jackknife and bootstrap methods to get population estimators from multiple runs (see the sketch below)
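A minimal sketch of the bootstrap idea (Python, not from the original slides): resample the per-run estimates from several randomized seeding runs to put a confidence interval around the remaining-defect estimate. The resampling scheme and the example numbers are assumptions.

```python
# Percentile bootstrap over remaining-defect estimates from multiple
# independently seeded benchmark runs.
import random

def bootstrap_ci(per_run_estimates, n_boot=10_000, alpha=0.05):
    """Return an approximate (1 - alpha) confidence interval for the
    mean remaining-defect estimate, via resampling with replacement."""
    means = []
    for _ in range(n_boot):
        sample = random.choices(per_run_estimates, k=len(per_run_estimates))
        means.append(sum(sample) / len(sample))
    means.sort()
    lo = means[int(alpha / 2 * n_boot)]
    hi = means[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi

# Hypothetical R estimates from five randomized seeding runs:
print(bootstrap_ci([2.0, 3.3, 1.5, 2.8, 2.2]))
```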
Connectors and Wrappers

Seed defects by intervening between normal interfaces and communications
Seeded defects can simulate potential real defects (see the wrapper sketch below)
–Data corruption (change data I/O through a wrapper interface)
–Communication failures (change timing, handshaking, responses, etc. through a connector)
Actual defect estimates can be made by comparing defects caused by seeds to non-seed defects
–Expected dependability can be measured through the system's response to seeds and non-seeds
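An illustrative wrapper that seeds data-corruption defects at a component interface, in the spirit described above; the component API (a read() method returning bytes) is hypothetical, and no source code of the wrapped component is needed:

```python
# Fault-injection wrapper: corrupts a fraction of a component's output
# bytes to simulate data-corruption defects at the interface.
import random

class CorruptingWrapper:
    """Wraps a component and corrupts some of its output bytes,
    seeding defects without modifying the component itself."""

    def __init__(self, component, corruption_rate=0.01, seed=42):
        self.component = component
        self.corruption_rate = corruption_rate
        self.rng = random.Random(seed)  # reproducible seeding profile

    def read(self):
        data = bytearray(self.component.read())
        for i in range(len(data)):
            if self.rng.random() < self.corruption_rate:
                data[i] ^= 0xFF  # flip every bit of the chosen byte
        return bytes(data)
```

A connector variant for communication failures would intervene in the same way, but delay, drop, or reorder messages instead of corrupting payloads.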
Hybrids of New and Traditional Approaches

One can combine randomized defect seeding with defect-distribution statistics to address the defect-representativeness issue
–Orthogonal Defect Classification (ODC) statistics are a good example (see the sketch below)
Combinatorial designs to reduce bias in defect seeds with respect to non-seeds
Game-theoretic approaches
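A short sketch of the hybrid idea (Python, not from the original slides): draw seeded-defect types according to an ODC-style distribution so the seeds match observed defect-type frequencies while placement stays randomized. The type names and proportions below are made up for illustration.

```python
# Sample seeded-defect types weighted by a (hypothetical) ODC
# defect-type distribution, keeping the representativeness of seeds
# while preserving randomization.
import random

ODC_DISTRIBUTION = {   # hypothetical defect type -> observed proportion
    "assignment": 0.30,
    "checking":   0.25,
    "interface":  0.20,
    "timing":     0.15,
    "algorithm":  0.10,
}

def sample_seed_types(n_seeds, seed=7):
    """Choose defect types for n_seeds seeds, weighted by ODC stats."""
    rng = random.Random(seed)
    types = list(ODC_DISTRIBUTION)
    weights = [ODC_DISTRIBUTION[t] for t in types]
    return rng.choices(types, weights=weights, k=n_seeds)

print(sample_seed_types(10))
```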
Issues for Discussion

Mapping of approaches to dependability categories
–"Dependability" attributes and tradeoffs
Alternative concepts and approaches
–Mutation testing as defect seeding
–Model-driven approaches
–Others
Special application challenges
–Scalability; test oracles (e.g., for agent-based systems of systems); Heisenbug effects; others
"Dependability" Attributes and Tradeoffs

Robustness: reliability, availability, survivability
Protection: security, safety
Quality of Service: accuracy, fidelity, performance assurance
Integrity: correctness, verifiability
The attributes are mostly compatible and synergistic
Some conflicts and tradeoffs remain:
–Spreading information: survivability vs. security
–Fail-safe behavior: safety vs. quality of service
–Graceful degradation: survivability vs. quality of service