Exam 1 Review: Scores Min 30, Max 96, Ave 63.9, Std Dev 14.5.

Q1 Calculations: Given the following table, compare the estimates of E(Q) obtained with the MFR estimate and with the subdomain formula. Assume that subdomain c is three times as likely as the other subdomains. You can simplify the table by “projections” on each axis. Show your work. State and justify any assumptions that you must make. Which formula do you think best estimates the actual E(Q), and why?

Q1 chart

subdomain  F1  F2  F3  Domain Size
a
b
c
d          0   4   2   10
e          0   4   4   20
f          0   10  0   20

Q2: Assume that a testing method, called X, requires that for each decision one test case is randomly picked from the subset that makes the decision false, one from the subset that makes the decision true, and one from the subset that makes the two sides equal (e.g., if the decision is “x < y”, that set is the points where the value of x equals the value of y). X testing of the whole program does this for each relational expression.
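As a rough illustration of method X applied to a single decision, the sketch below picks one random test point from each of the three subsets for “x < y”. The integer input range and the helper name `pick_x_tests` are assumptions made for this example; the question does not specify them. Note that the "false" subset includes the points where x equals y, so the subsets overlap.

```python
import random

def pick_x_tests(lo=-10, hi=10, rng=None):
    """Pick one (x, y) point per subset of the decision "x < y".

    Assumes integer inputs in [lo, hi]; the range and this helper are
    hypothetical, chosen only to illustrate the idea.
    """
    rng = rng or random.Random()
    points = [(x, y) for x in range(lo, hi + 1) for y in range(lo, hi + 1)]
    subsets = {
        "true": [p for p in points if p[0] < p[1]],         # decision is true
        "false": [p for p in points if not (p[0] < p[1])],  # decision is false (includes x == y)
        "equal": [p for p in points if p[0] == p[1]],       # the two sides are equal
    }
    # One randomly chosen test case from each subset.
    return {name: rng.choice(members) for name, members in subsets.items()}

tests = pick_x_tests(rng=random.Random(0))
```

Applying the same procedure to every relational expression in a program gives three test cases per decision.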

Q2 comments: There were 6 subdomains: three from the first decision and three from the second. The question does not say that they are mutually exclusive. Since the testing was by subdomains, the best choice was the product formula. A common mistake was to use only 2 or 3 subdomains.
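One common reading of the “product formula” is that, with one independently chosen test per subdomain, the chance that at least one test fails is one minus the product of the per-subdomain pass probabilities. The sketch below computes this under that assumption; the function name and the six failure rates are made up for illustration and are not values from the exam.

```python
from math import prod

def detection_probability(failure_rates):
    """P(at least one test fails) with one independent random test per subdomain.

    failure_rates[i] is the assumed probability that a random input drawn
    from subdomain i causes a failure.
    """
    return 1.0 - prod(1.0 - theta for theta in failure_rates)

# Hypothetical failure rates for the six subdomains (three per decision).
rates = [0.0, 0.1, 0.0, 0.2, 0.0, 0.05]
p_detect = detection_probability(rates)
```

Using only 2 or 3 of the subdomains would drop factors from the product and so change the estimate.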

Q3: Find the largest set of mutually exclusive subdomains that might be useful for thorough testing of the code. Justify why they would be useful. Express the sets with relational conditions on a and b, and draw a 2D map of the subdomains. Consider these three faults: 1) change “b > 3” to “b > 2”; 2) change “a > b” to “a >= b”; 3) change “a-2” to “a-3”. Can these subdomains reveal these faults?

Q4: Suppose that your company is considering buying a triangle program that identifies whether the 3 inputs are the sides of a scalene, isosceles, or equilateral triangle, are bad inputs, or are not a triangle. Your boss wants you to test the new program before he purchases the software, but he will allow you only 3 test cases. The company knows that misclassifying any triangle will cost the company X dollars, but classifying an equilateral triangle as scalene will cost an additional 2X dollars. How do you decide which tests to use? Can you use seeded faults to help select? If so, what faults would you seed?

Q4 comments: The question asks for a decision or a decision process. If you don’t have a decision about which tests to do (and a justification), you need a process. If you seed faults (and you would need to say which faults you would seed), how do you use that information? For example, “I would choose tests that eliminated the most faults”. Just saying “I would seed faults” or “I would pick high q” is not sufficient.
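To make the seeded-fault idea concrete, here is a minimal sketch: a reference classifier, two hypothetical seeded faults, and a tally of which faults each candidate test reveals. The classifier, the particular faults, and the candidate inputs are all assumptions for illustration (for instance, "bad inputs" is taken to mean non-positive sides); the exam does not prescribe them.

```python
def classify(a, b, c):
    """Reference triangle classifier (an assumed specification)."""
    if any(s <= 0 for s in (a, b, c)):
        return "bad inputs"
    x, y, z = sorted((a, b, c))
    if x + y <= z:
        return "not a triangle"
    if x == y == z:
        return "equilateral"
    if x == y or y == z:
        return "isosceles"
    return "scalene"

def mutant_equilateral_as_scalene(a, b, c):
    """Seeded fault: equilateral triangles are reported as scalene."""
    result = classify(a, b, c)
    return "scalene" if result == "equilateral" else result

def mutant_weak_inequality(a, b, c):
    """Seeded fault: degenerate triangles (x + y == z) are accepted."""
    if any(s <= 0 for s in (a, b, c)):
        return "bad inputs"
    x, y, z = sorted((a, b, c))
    if x + y < z:  # seeded fault: the reference uses <=
        return "not a triangle"
    if x == y == z:
        return "equilateral"
    if x == y or y == z:
        return "isosceles"
    return "scalene"

def faults_revealed(test, mutants):
    """Names of the mutants whose output differs from the reference on this test."""
    return [m.__name__ for m in mutants if m(*test) != classify(*test)]

mutants = [mutant_equilateral_as_scalene, mutant_weak_inequality]
candidates = [(3, 3, 3), (1, 2, 3), (3, 4, 5)]
revealed = {t: faults_revealed(t, mutants) for t in candidates}
```

A selection process could then rank candidate tests by the faults they reveal, weighting the equilateral-as-scalene fault more heavily because of its extra cost.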

Evaluation of Safety Critical Software, David L. Parnas, CACM, June 1990

Overview of Parnas’s article
- What was the main point?
- What did you learn?
- What did you find confusing?
- Has anything changed since 1990?

Initial Faults
- As a rule software systems do not work well until they have been used, and have failed repeatedly, in real applications. Generally, many uses and many failures are required before a product is considered reliable. Software products, including those that have become relatively reliable, behave like other products of evolution-like processes; they often fail, even years after they were built, when the operating conditions change.

Terms
- Safety critical
- Weak link behavior
- Silver bullet
- Clean room development
- Trustworthiness

Software Controllers
- It is important to recognize that, in theory, software implemented controllers can be described in exactly the same way as black box mathematical models. They can also be viewed as black boxes whose output is a mathematical function of the input. In practice, they are not viewed this way. One reason for the distinction is that their functions are more complex (i.e. harder to describe) than the functions that describe the behavior of conventional controllers. However, [4] and [17] provide ample evidence that requirements for real systems can be documented in this way.

Difficulties
- Why is software hard to test?
- Software Testing Concerns
- Software Reviewability Concerns

Necessary Reviews

Does OO change this?

Software Reliability
- Nonetheless, our practical experience is that software appears to exhibit stochastic properties. It is quite useful to associate reliability figures such as MTBF (Mean Time Between Failures) with an operating system or other software product. Some software experts attribute the apparently random behavior to our ignorance. They believe that all software failures would be predictable if we fully understood the software, but our failure to understand our own creations justifies the treatment of software failures as random.

Operational Profile?
- For systems that function correctly only in rare emergencies, we wish to measure the reliability in those situations where the system must take corrective action, and not include data from situations in which the system is not needed. The input sequence distributions used in reliability assessment should be those that one would encounter in emergency situations, and not those that characterize normal operation.

Error counts
- In other words, even if we could count the number of errors, reliability is not a function of the error count. If asked to evaluate a safety-critical software product, there is no point in attempting to estimate or predict the number of errors remaining in a program.

Table I
Table I shows that, if our design target was to have the probability of failure be less than 1 in 1000, performing between 4500 and 5000 tests (randomly chosen from the appropriate test case distribution) without failure would mean that the probability of an unacceptable product passing the test was less than 1 in a hundred.
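The figure quoted from Table I can be checked with a short calculation. Assuming the tests are independent and a marginally unacceptable product fails with probability exactly 1/h on each test, the chance that it passes N tests is (1 - 1/h)^N. The sketch below finds the smallest N that pushes this below 1 in 100 for h = 1000; the function names are my own and do not come from the paper.

```python
import math

def pass_probability(h, n):
    """Chance that a product with per-test failure probability 1/h passes n independent tests."""
    return (1.0 - 1.0 / h) ** n

def min_tests(h, m):
    """Smallest n for which pass_probability(h, n) drops below m."""
    return math.ceil(math.log(m) / math.log(1.0 - 1.0 / h))

n_needed = min_tests(1000, 0.01)
p_4500 = pass_probability(1000, 4500)  # still above 1 in 100
p_5000 = pass_probability(1000, 5000)  # below 1 in 100
```

Under these assumptions the threshold falls between 4500 and 5000 tests, consistent with the range stated on the slide.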

Table II

1 minute paper
- What issues/concerns/opinions/questions do you have about the Parnas paper?