Download presentation
Presentation is loading. Please wait.
Published byJoleen Carroll Modified over 9 years ago
1
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing
2
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Background Information n The manager of Pepperoni Pizza Restaurant has recently begun experimenting with a new method of baking its pepperoni pizzas. n He believes that the new method produces a better- tasting pizza, but he would like to base a decision on whether to switch from the old method to the new method on customer reactions. n Therefore he performs an experiment.
3
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 The Experiment n For 100 randomly selected customers who order a pepperoni pizza for home delivery, he includes both an old style and a free new style pizza in the order. n All he asks is that these customers rate the difference between pizzas on a -10 to +10 scale, where -10 means they strongly favor the old style, +10 means they strongly favor the new style, and 0 means they are indifferent between the two styles. n Once he gets the ratings from the customers, how should he proceed?
4
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Hypothesis Testing n This example’s goal is to explain hypothesis testing concepts. We are not implying that the manager would, or should, use a hypothesis testing procedure to decide whether to switch methods. n First, hypothesis testing does not take costs into account. In this example, if the new method is more costly it would be ignored by hypothesis testing. n Second, even if costs of the two pizza-making methods are equivalent, the manager might base his decision on a simple point estimate and possibly a confidence interval.
5
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Null and Alternative Hypotheses n Usually, the null hypothesis is labeled H o and the alternative hypothesis is labeled H a. n The null and alternative hypotheses divide all possibilities into two nonoverlapping sets, exactly one of which must be true. n Traditionally, hypotheses testing has been phrased as a decision-making problem, where an analyst decides either to accept the null hypothesis or reject it, based on the sample evidence.
6
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 One-Tailed Versus Two-Tailed Tests n The form of the alternative hypothesis can be either a one-tailed or two-tailed, depending on what the analyst is trying to prove. n A one-tailed hypothesis is one where the only sample results which can lead to rejection of the null hypothesis are those in a particular direction, namely, those where the sample mean rating is positive. n A two-tailed test is one where results in either of two directions can lead to rejection of the null hypothesis.
7
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 One-Tailed Versus Two-Tailed Tests -- continued n Once the hypotheses are set up, it is easy to detect whether the test is one-tailed or two-tailed. n One tailed alternatives are phrased in terms of “>” or “<“ whereas two tailed alternatives are phrased in terms of “ ” n The real question is whether to set up hypotheses for a particular problem as one-tailed or two-tailed. n There is no statistical answer to this question. It depends entirely on what we are trying to prove.
8
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Types of Errors n Whether or not one decides to accept or reject the null hypothesis, it might be the wrong decision. n One might reject the null hypothesis when it is true or incorrectly accept the null hypothesis when it is false. n These errors are called type I and type II errors. n In general we incorrectly reject a null hypothesis that is true. We commit a type II error when we incorrectly accept a null hypothesis that is false.
9
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Types of Errors -- continued n These ideas appear graphically below. n While these errors seem to be equally serious, actually type I errors have traditionally been regarded as the more serious of the two. n Therefore, the hypothesis-testing procedure factors caution in terms of rejecting the null hypothesis.
10
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Significance Level and Rejection Region n The real question is how strong the evidence in favor of the alternative hypothesis must be to reject the null hypothesis. n The analyst determines the probability of a type I error that he is willing to tolerate. The value is denoted by alpha and is most commonly equal to 0.05, although alpha=0.01 and alpha=0.10 are also frequently used. n The value of alpha is called the significance level of the test.
11
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Significance Level and Rejection Region -- continued n Then, given the value of alpha, we use statistical theory to determine the rejection region. n If the sample falls into this region we reject the null hypothesis; otherwise, we accept it. n Sample evidence that falls into the rejection region is called statistically significant at the alpha level.
12
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Significance from p-values n This approach is currently more popular than the significance level and rejected region approach. n This approach is to avoid the use of the alpha level and instead simply report “how significant” the sample evidence is. n We do this by means of the p-value.The p-value is the probability of seeing a random sample at least as extreme as the sample observes, given that the null hypothesis is true.
13
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Significance from p-values -- continued n Here “extreme” is relative to the null hypothesis. n In general smaller p-values indicate more evidence in support of the alternative hypothesis. If a p-value is sufficiently small, almost any decision maker will conclude that rejecting the null hypothesis is the more “reasonable” decision.
14
10.1a10.1a | 10.2 | 10.3 | 10.4 | 10.5 | 10.6 | 10.7 | 10.8 | 10.910.210.310.410.510.610.710.810.9 Significance from p-values -- continued n How small is a “small” p-value? This is largely a matter of semantics but if the –p-value is less than 0.01, it provides “convincing” evidence that the alternative hypothesis is true; –p-value is between 0.01 and 0.05, there is “strong” evidence in favor of the alternative hypothesis; –p-value is between 0.05 and 0.10, it is in a “gray area”; –p-values greater than 0.10 are interpreted as weak or no evidence in support of the alternative.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.