STATISTICS ELEMENTARY MARIO F. TRIOLA Chapter 7 Hypothesis Testing EIGHTH EDITION
Chapter 7 Hypothesis Testing 7-1 Overview 7-2 Fundamentals of Hypothesis Testing 7-3 Testing a Claim about a Mean: Large Samples 7-4 Testing a Claim about a Mean: Small Samples 7-5 Testing a Claim about a Proportion 7-6 Testing a Claim about a Standard Deviation
Definition 7-1 Overview Hypothesis in statistics, is a claim or statement about a property of a population page 366 of text Various examples are provided below definition box
Rare Event Rule for Inferential Statistics If, under a given assumption, the probability of a particular observed event is exceptionally small, we conclude that the assumption is probably not correct. Example on page 366-367 of text Introduce the word ‘significant’ in regard to hypothesis testing.
Fundamentals of Hypothesis Testing 7-2 Fundamentals of Hypothesis Testing
Figure 7-1 Central Limit Theorem The Expected Distribution of Sample Means Assuming that = 98.6 Likely sample means µx = 98.6
Figure 7-1 Central Limit Theorem The Expected Distribution of Sample Means Assuming that = 98.6 Likely sample means z = - 1.96 x = 98.48 or z = 1.96 x = 98.72 µx = 98.6
Figure 7-1 Central Limit Theorem The Expected Distribution of Sample Means Assuming that = 98.6 Sample data: z = - 6.64 x = 98.20 Likely sample means or z = - 1.96 x = 98.48 or z = 1.96 x = 98.72 µx = 98.6
Components of a Formal Hypothesis Test page 369 of text
Null Hypothesis: H0 Statement about value of population parameter Must contain condition of equality =, , or Test the Null Hypothesis directly Reject H0 or fail to reject H0 Give examples of different wording for and Š, such as ‘at least’, ‘at most’, ‘no more than’, etc.
Alternative Hypothesis: H1 Must be true if H0 is false , <, > ‘opposite’ of Null Give examples of different ways to word °,< and >, such as ‘is different from’, ‘fewer than’, ‘more than’, etc.
Note about Forming Your Own Claims (Hypotheses) If you are conducting a study and want to use a hypothesis test to support your claim, the claim must be worded so that it becomes the alternative hypothesis. By examining the flowchart for the Wording of the Final Conclusion, Figure 7-4, page 375, this requirement for support of a statement becomes clear.
Note about Testing the Validity of Someone Else’s Claim Someone else’s claim may become the null hypothesis (because it contains equality), and it sometimes becomes the alternative hypothesis (because it does not contain equality). This is important to emphasize. A claim is not always the null statement. Because of the wording, it may become the alternative. Whichever statement the claim becomes (null or alternative), the other statement will be the ‘opposite’. Some examples should be given for starting with a claim, and then set up the null and alternative. One example is on page 370 and exercises 9 - 16 are appropriate.
Test Statistic a value computed from the sample data that is used in making the decision about the rejection of the null hypothesis page 371 of text
For large samples, testing claims about population means Test Statistic a value computed from the sample data that is used in making the decision about the rejection of the null hypothesis For large samples, testing claims about population means page 371-372 of text Example on page 372 of text x - µx z = n
Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis page 372 of text
Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region
Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Region
Critical Region Set of all values of the test statistic that would cause a rejection of the null hypothesis Critical Regions
Significance Level denoted by the probability that the test statistic will fall in the critical region when the null hypothesis is actually true. common choices are 0.05, 0.01, and 0.10 This is the same introduced in Section 6-2, where we defined the degree of confidence for a confidence interval to be the probability 1 -
Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis page 372 of text
Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis The critical value depends on the type of test being conducted. The text will start with critical values that are z scores. Later tests will have critical values that are t scores and X2 values. Example on page 373 of text. Critical Value ( z score )
Critical Value Value or values that separate the critical region (where we reject the null hypothesis) from the values of the test statistics that do not lead to a rejection of the null hypothesis Reject H0 Fail to reject H0 The critical value separates the curve into areas where one would reject the null (the critical region), and where one would fail to reject the null (the rest of the curve). Critical Value ( z score )
Two-tailed,Right-tailed, Left-tailed Tests The tails in a distribution are the extreme regions bounded by critical values. page 373 of text
Two-tailed Test H0: µ = 100 H1: µ 100
is divided equally between the two tails of the critical Two-tailed Test H0: µ = 100 H1: µ 100 is divided equally between the two tails of the critical region
is divided equally between the two tails of the critical Two-tailed Test H0: µ = 100 H1: µ 100 is divided equally between the two tails of the critical region Means less than or greater than Analysis of what the symbol ° means which helps students to realize this is a two tailed test.
is divided equally between the two tails of the critical Two-tailed Test H0: µ = 100 H1: µ 100 is divided equally between the two tails of the critical region Means less than or greater than Reject H0 Fail to reject H0 Reject H0 100 Values that differ significantly from 100
Right-tailed Test H0: µ 100 H1: µ > 100
Right-tailed Test H0: µ 100 H1: µ > 100 Points Right
Right-tailed Test H0: µ 100 H1: µ > 100 Points Right Values that Fail to reject H0 Reject H0 Values that differ significantly from 100 100
Left-tailed Test H0: µ 100 H1: µ < 100
Left-tailed Test H0: µ 100 H1: µ < 100 Points Left
Left-tailed Test H0: µ 100 H1: µ < 100 Points Left Values that Reject H0 Fail to reject H0 Values that differ significantly from 100 100
Conclusions in Hypothesis Testing always test the null hypothesis 1. Reject the H0 2. Fail to reject the H0 need to formulate correct wording of final conclusion See Figure 7-4 page 374 of text. Examples at bottom of page and top of page 375
FIGURE 7-4 Wording of Final Conclusion Start Does the original claim contain the condition of equality “There is sufficient evidence to warrant rejection of the claim that. . . (original claim).” (This is the only case in which the original claim is rejected). Yes (Original claim contains equality and becomes H0) Do you reject H0?. Yes (Reject H0) No (Fail to reject H0) “There is not sufficient evidence to warrant rejection of the claim that. . . (original claim).” No (Original claim does not contain equality and becomes H1) (This is the only case in which the original claim is supported). page 375 of text Also found on Formula and Table insert provided with text. Many instructors allow students to use this flowchart when taking exams. The conclusion process indicted in this chart will be used with other statistical processes in other chapters. Discussion of the two results that support or reject conclusions, whereas the other two do not provide enough evidence for support or rejection. Do you reject H0? Yes (Reject H0) “The sample data supports the claim that . . . (original claim).” No (Fail to reject H0) “There is not sufficient evidence to support the claim that. . . (original claim).”
Accept versus Fail to Reject some texts use “accept the null hypothesis we are not proving the null hypothesis sample evidence is not strong enough to warrant rejection (such as not enough evidence to convict a suspect) page 374 of text The term ‘accept’ is somewhat misleading, implying incorrectly that the null has been proven. The phrase ‘fail to reject’ represents the result more correctly.
Type I Error The mistake of rejecting the null hypothesis when it is true. (alpha) is used to represent the probability of a type I error Example: Rejecting a claim that the mean body temperature is 98.6 degrees when the mean really does equal 98.6 Example on page 375 of text
Type II Error the mistake of failing to reject the null hypothesis when it is false. ß (beta) is used to represent the probability of a type II error Example: Failing to reject the claim that the mean body temperature is 98.6 degrees when the mean is really different from 98.6
Table 7-2 Type I and Type II Errors True State of Nature The null hypothesis is true The null hypothesis is false Type I error (rejecting a true null hypothesis) We decide to reject the null hypothesis Correct decision Decision Type II error (rejecting a false null hypothesis) page 376 of text We fail to reject the null hypothesis Correct decision
Controlling Type I and Type II Errors For any fixed , an increase in the sample size n will cause a decrease in For any fixed sample size n , a decrease in will cause an increase in . Conversely, an increase in will cause a decrease in . To decrease both and , increase the sample size. page 377 in text
Power of a Hypothesis Test Definition Power of a Hypothesis Test is the probability (1 - ) of rejecting a false null hypothesis, which is computed by using a particular significance level and a particular value of the mean that is an alternative to the value assumed true in the null hypothesis.