Download presentation
Presentation is loading. Please wait.
Published byDennis Weaver Modified over 9 years ago
1
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University
2
2 2 Slide © 2003 South-Western/Thomson Learning™ Chapter 17 Nonparametric Methods n Sign Test n Wilcoxon Signed-Rank Test n Mann-Whitney-Wilcoxon Test n Kruskal-Wallis Test n Rank Correlation
3
3 3 Slide © 2003 South-Western/Thomson Learning™ n Most of the statistical methods referred to as parametric require the use of interval- or ratio-scaled data. n Nonparametric methods are often the only way to analyze nominal or ordinal data and draw statistical conclusions. n Nonparametric methods require no assumptions about the population probability distributions. n Nonparametric methods are often called distribution- free methods. Nonparametric Methods
4
4 4 Slide © 2003 South-Western/Thomson Learning™ Nonparametric Methods n In general, for a statistical method to be classified as nonparametric, it must satisfy at least one of the following conditions. The method can be used with nominal data. The method can be used with nominal data. The method can be used with ordinal data. The method can be used with ordinal data. The method can be used with interval or ratio data when no assumption can be made about the population probability distribution. The method can be used with interval or ratio data when no assumption can be made about the population probability distribution.
5
5 5 Slide © 2003 South-Western/Thomson Learning™ Sign Test n A common application of the sign test involves using a sample of n potential customers to identify a preference for one of two brands of a product. n The objective is to determine whether there is a difference in preference between the two items being compared. n To record the preference data, we use a plus sign if the individual prefers one brand and a minus sign if the individual prefers the other brand. n Because the data are recorded as plus and minus signs, this test is called the sign test.
6
6 6 Slide © 2003 South-Western/Thomson Learning™ Example: Peanut Butter Taste Test n Sign Test: Large-Sample Case As part of a market research study, a sample of 36 consumers were asked to taste two brands of peanut butter and indicate a preference. Do the data shown below indicate a significant difference in the consumer preferences for the two brands? 18 preferred Hoppy Peanut Butter (+ sign recorded) 12 preferred Pokey Peanut Butter ( _ sign recorded) 6 had no preference 6 had no preference The analysis is based on a sample size of 18 + 12 = 30.
7
7 7 Slide © 2003 South-Western/Thomson Learning™ n Hypotheses H 0 : p =.50 No preference for one brand over the other exists H a : A preference for one brand over the other exists Example: Peanut Butter Taste Test
8
8 8 Slide © 2003 South-Western/Thomson Learning™ Example: Peanut Butter Taste Test n Sampling Distribution 2.74 = 15 =.5(30) = 15 =.5(30) Sampling distribution of the number of “+” values if there is no brand preference Sampling distribution of the number of “+” values if there is no brand preference
9
9 9 Slide © 2003 South-Western/Thomson Learning™ Example: Peanut Butter Taste Test n Rejection Rule Using.05 level of significance: Reject H 0 if z 1.96 Reject H 0 if p -value <.05 Reject H 0 if the number of plus signs is less than 10 or greater than 20 less than 10 or greater than 20 n Test Statistic z = (18 - 15)/2.74 = 3/2.74 = 1.095
10
10 Slide © 2003 South-Western/Thomson Learning™ Example: Peanut Butter Taste Test n Conclusion Do not reject H 0. There is insufficient evidence in the sample to conclude that a difference in preference exists for the two brands of peanut butter.
11
11 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Sign Test n Formula Worksheet Note: Rows 10-31 are not shown.
12
12 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Sign Test n Value Worksheet Note: Rows 10-31 are not shown.
13
13 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Sign Test n Using the p -Value The preceding value worksheet shows a two-tail p -value of.3616 The preceding value worksheet shows a two-tail p -value of.3616 The rejection rule is “Reject H 0 if p -value <.05” The rejection rule is “Reject H 0 if p -value <.05” Because.3616 >.05, we do not reject H 0 Because.3616 >.05, we do not reject H 0
14
14 Slide © 2003 South-Western/Thomson Learning™ Hypothesis Test About a Median n We can apply the sign test by: Using a plus sign whenever the data in the sample are above the hypothesized value of the median Using a plus sign whenever the data in the sample are above the hypothesized value of the median Using a minus sign whenever the data in the sample are below the hypothesized value of the median Using a minus sign whenever the data in the sample are below the hypothesized value of the median Discarding any data exactly equal to the hypothesized median Discarding any data exactly equal to the hypothesized median
15
15 Slide © 2003 South-Western/Thomson Learning™ Wilcoxon Signed-Rank Test n This test is the nonparametric alternative to the parametric matched-sample test presented in Chapter 10. n The methodology of the parametric matched-sample analysis requires: interval data, and interval data, and the assumption that the population of differences between the pairs of observations is normally distributed. the assumption that the population of differences between the pairs of observations is normally distributed. n If the assumption of normally distributed differences is not appropriate, the Wilcoxon signed-rank test can be used.
16
16 Slide © 2003 South-Western/Thomson Learning™ Example: Express Deliveries n Wilcoxon Signed-Rank Test A firm has decided to select one of two express delivery services to provide next-day deliveries to the district offices. To test the delivery times of the two services, the firm sends two reports to a sample of 10 district offices, with one report carried by one service and the other report carried by the second service. Do the data (delivery times in hours) on the next slide indicate a difference in the two services?
17
17 Slide © 2003 South-Western/Thomson Learning™ Example: Express Deliveries District Office OverNight NiteFlite District Office OverNight NiteFlite Seattle 32 hrs. 25 hrs. Los Angeles3024 Boston1915 Cleveland1615 New York1513 Houston1815 Atlanta1415 St. Louis108 Milwaukee79 Denver1611
18
18 Slide © 2003 South-Western/Thomson Learning™ Wilcoxon Signed-Rank Test n Preliminary Steps of the Test Compute the differences between the paired observations. Compute the differences between the paired observations. Discard any differences of zero. Discard any differences of zero. Rank the absolute value of the differences from lowest to highest. Tied differences are assigned the average ranking of their positions. Rank the absolute value of the differences from lowest to highest. Tied differences are assigned the average ranking of their positions. Give the ranks the sign of the original difference in the data. Give the ranks the sign of the original difference in the data. Sum the signed ranks. Sum the signed ranks.... next we will determine whether the sum is significantly different from zero.
19
19 Slide © 2003 South-Western/Thomson Learning™ Example: Express Deliveries District Office Differ. |Diff.| Rank Sign. Rank Seattle710+10 Los Angeles69+9 Boston47+7 Cleveland11.5+1.5 New York24+4 Houston36+6 Atlanta-11.5-1.5 St. Louis24+4 Milwaukee-24-4 Denver58+8 +44
20
20 Slide © 2003 South-Western/Thomson Learning™ n Hypotheses H 0 : The delivery times of the two services are the same; neither offers faster service than the other. H a : Delivery times differ between the two services; recommend the one with the smaller times. Example: Express Deliveries
21
21 Slide © 2003 South-Western/Thomson Learning™ Example: Express Deliveries n Sampling Distribution Sampling distribution of T if populations are identical Sampling distribution of T if populations are identical 19.62 T = 0 T
22
22 Slide © 2003 South-Western/Thomson Learning™ n Rejection Rule Using.05 level of significance, Reject H 0 if z 1.96 n Test Statistic z = ( T - T )/ T = (44 - 0)/19.62 = 2.24 n Conclusion Reject H 0. There is sufficient evidence in the sample to conclude that a difference exists in the delivery times provided by the two services. Recommend using the NiteFlite service. Example: Express Deliveries
23
23 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Formula Worksheet (showing data)
24
24 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Formula Worksheet Note: Columns A-D are not shown.
25
25 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Value Worksheet Note: Columns A-D are not shown.
26
26 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Formula Worksheet Note: Columns A-D and rows 1-13 are not shown.
27
27 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Value Worksheet Note: Columns A-D and rows 1-13 are not shown.
28
28 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Wilcoxon Signed-Rank Test n Using the p -Value The preceding value worksheet shows a two-tail p -value of.0249 The preceding value worksheet shows a two-tail p -value of.0249 The rejection rule is “Reject H 0 if p -value <.05” The rejection rule is “Reject H 0 if p -value <.05” Because.0249 <.05, we can reject H 0. There is sufficient evidence in the sample to conclude that a difference exists in the delivery times provided by the two services. Because.0249 <.05, we can reject H 0. There is sufficient evidence in the sample to conclude that a difference exists in the delivery times provided by the two services.
29
29 Slide © 2003 South-Western/Thomson Learning™ Mann-Whitney-Wilcoxon Test n This test is another nonparametric method for determining whether there is a difference between two populations. n This test, unlike the Wilcoxon signed-rank test, is not based on a matched sample. n This test does not require interval data or the assumption that both populations are normally distributed. n The only requirement is that the measurement scale for the data is at least ordinal.
30
30 Slide © 2003 South-Western/Thomson Learning™ Mann-Whitney-Wilcoxon Test n Instead of testing for the difference between the means of two populations, this method tests to determine whether the two populations are identical. n The hypotheses are: H 0 : The two populations are identical H a : The two populations are not identical
31
31 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers n Mann-Whitney-Wilcoxon Test (Large-Sample Case) Manufacturer labels indicate the annual energy cost associated with operating home appliances such as freezers. The energy costs for a sample of 10 Westin freezers and a sample of 10 Easton Freezers are shown on the next slide. Do the data indicate, using =.05, that a difference exists in the annual energy costs associated with the two brands of freezers?
32
32 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers Westin Freezers Easton Freezers Westin Freezers Easton Freezers $55.10 $56.10 54.50 54.70 54.50 54.70 53.20 54.40 53.20 54.40 53.00 55.40 53.00 55.40 55.50 54.10 55.50 54.10 54.90 56.00 54.90 56.00 55.80 55.50 55.80 55.50 54.00 55.00 54.00 55.00 54.20 54.30 54.20 54.30 55.20 57.00 55.20 57.00
33
33 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers n Hypotheses H 0 : Annual energy costs for Westin freezers and Easton freezers are the same. and Easton freezers are the same. H a : Annual energy costs differ for the two brands of freezers. two brands of freezers.
34
34 Slide © 2003 South-Western/Thomson Learning™ n First, rank the combined data from the lowest to the highest values, with tied values being assigned the average of the tied rankings. n Then, compute T, the sum of the ranks for the first sample. n Then, compare the observed value of T to the sampling distribution of T for identical populations. The value of the standardized test statistic z will provide the basis for deciding whether to reject H 0. Mann-Whitney-Wilcoxon Test: Large-Sample Case
35
35 Slide © 2003 South-Western/Thomson Learning™ n Sampling Distribution of T for Identical Populations Mean Mean T = 1 n 1 ( n 1 + n 2 + 1) T = 1 n 1 ( n 1 + n 2 + 1) Standard Deviation Standard Deviation Distribution Form Distribution Form Approximately normal, provided Approximately normal, provided n 1 > 10 and n 2 > 10 Mann-Whitney-Wilcoxon Test: Large-Sample Case
36
36 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers Westin Rank Easton Rank Westin Rank Easton Rank $55.10 12 $56.10 19 $55.10 12 $56.10 19 54.50 8 54.70 9 54.50 8 54.70 9 53.20 2 54.40 7 53.20 2 54.40 7 53.00 1 55.40 14 53.00 1 55.40 14 55.50 15.5 54.10 4 55.50 15.5 54.10 4 54.90 10 56.00 18 54.90 10 56.00 18 55.80 17 55.50 15.5 55.80 17 55.50 15.5 54.00 3 55.00 11 54.00 3 55.00 11 54.20 5 54.30 6 54.20 5 54.30 6 55.20 13 57.00 20 55.20 13 57.00 20 Sum of Ranks 86.5 Sum of Ranks 123.5
37
37 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers n Sampling Distribution 13.23 Sampling distribution of T if populations are identical Sampling distribution of T if populations are identical T = 105 =1/2(10)(21) T
38
38 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers n Rejection Rule With.05 level of significance, Using test statistic: Reject H 0 if z 1.96 Using p -value: Reject H 0 if p -value <.05 n Test Statistic z = ( T - T )/ T = (86.5 - 105)/13.23 = -1.40
39
39 Slide © 2003 South-Western/Thomson Learning™ Example: Westin Freezers n Conclusion Do not reject H 0. There is insufficient evidence in the sample data to conclude that there is a difference in the annual energy cost associated with the two brands of freezers.
40
40 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Formula Worksheet Note: Rows 12-21 are not shown.
41
41 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Formula Worksheet Note: Rows 2-11 are hidden.
42
42 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Value Worksheet Note: Rows 12-21 are not shown.
43
43 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Value Worksheet Note: Rows 2-11 are hidden.
44
44 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Formula Worksheet Note: Rows 1-22 are not shown.
45
45 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Value Worksheet Note: Rows 1-22 are not shown.
46
46 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Mann-Whitney- Wilcoxon Test: Large-Sample Case n Using the p -Value The preceding value worksheet shows a two-tail p -value of.1620 The preceding value worksheet shows a two-tail p -value of.1620 The rejection rule is “Reject H 0 if p -value <.05” The rejection rule is “Reject H 0 if p -value <.05” Because.1620 >.05, we cannot reject H 0. There is insufficient evidence in the sample data to conclude that there is a difference in the annual energy cost associated with the two brands of freezers. Because.1620 >.05, we cannot reject H 0. There is insufficient evidence in the sample data to conclude that there is a difference in the annual energy cost associated with the two brands of freezers.
47
47 Slide © 2003 South-Western/Thomson Learning™ Kruskal-Wallis Test n The Mann-Whitney-Wilcoxon test can be used to test whether two populations are identical. n The MWW test has been extended by Kruskal and Wallis for cases of three or more populations. n The Kruskal-Wallis test can be used with ordinal data as well as with interval or ratio data. n Also, the Kruskal-Wallis test does not require the assumption of normally distributed populations. n The hypotheses are: H 0 : All populations are identical H 0 : All populations are identical H a : Not all populations are identical H a : Not all populations are identical
48
48 Slide © 2003 South-Western/Thomson Learning™ Rank Correlation n The Pearson correlation coefficient, r, is a measure of the linear association between two variables for which interval or ratio data are available. n The Spearman rank-correlation coefficient, r s, is a measure of association between two variables when only ordinal data are available. n Values of r s can range from –1.0 to +1.0, where values near 1.0 indicate a strong positive association between the rankings, and values near 1.0 indicate a strong positive association between the rankings, and values near -1.0 indicate a strong negative association between the rankings. values near -1.0 indicate a strong negative association between the rankings.
49
49 Slide © 2003 South-Western/Thomson Learning™ Rank Correlation n Spearman Rank-Correlation Coefficient, r s where: n = number of items being ranked x i = rank of item i with respect to one variable x i = rank of item i with respect to one variable y i = rank of item i with respect to a second variable y i = rank of item i with respect to a second variable d i = x i - y i d i = x i - y i
50
50 Slide © 2003 South-Western/Thomson Learning™ Test for Significant Rank Correlation n We may want to use sample results to make an inference about the population rank correlation p s. n To do so, we must test the hypotheses: H 0 : p s = 0 H 0 : p s = 0 H a : p s = 0 H a : p s = 0
51
51 Slide © 2003 South-Western/Thomson Learning™ n Sampling Distribution of r s when p s = 0 Mean Mean Standard Deviation Standard Deviation Distribution Form Distribution Form Approximately normal, provided n > 10 Approximately normal, provided n > 10 Rank Correlation
52
52 Slide © 2003 South-Western/Thomson Learning™ Example: Connor Investors n Rank Correlation Connor Investors provides a portfolio management service for its clients. Two of Connor’s analysts rated ten investments from high (6) to low (1) risk as shown below. Use rank correlation, with =.10, to comment on the agreement of the two analysts’ ratings. Connor Investors provides a portfolio management service for its clients. Two of Connor’s analysts rated ten investments from high (6) to low (1) risk as shown below. Use rank correlation, with =.10, to comment on the agreement of the two analysts’ ratings. InvestmentABCDEF GHI J Analyst #1 1498635 7210 Analyst #2 1562973104 8
53
53 Slide © 2003 South-Western/Thomson Learning™ Analyst #1 Analyst #2 Analyst #1 Analyst #2 Investment Rating Rating Differ. (Differ.) 2 A1100 B45-11 C9639 D82636 E69-39 F37-416 G5324 H710-39 I24-24 J10824 Sum =92 Sum =92 Example: Connor Investors
54
54 Slide © 2003 South-Western/Thomson Learning™ Example: Connor Investors n Hypotheses H 0 : p s = 0 (No rank correlation exists.) H 0 : p s = 0 (No rank correlation exists.) H a : p s = 0 (Rank correlation exists.) H a : p s = 0 (Rank correlation exists.) n Sampling Distribution r = 0 rsrsrsrs Sampling distribution of r s under the assumption of no rank correlation Sampling distribution of r s under the assumption of no rank correlation
55
55 Slide © 2003 South-Western/Thomson Learning™ Example: Connor Investors n Rejection Rule With.10 level of significance, Using test statistic: Reject H 0 if z 1.645 Using p -value: Reject H 0 if p -value <.10 n Test Statistic z = ( r s - r )/ r = (.4424 - 0)/.3333 = 1.33
56
56 Slide © 2003 South-Western/Thomson Learning™ Example: Connor Investors n Conclusion Do not reject H 0. There is not a significant rank correlation. The two analysts are not showing agreement in their rating of the risk associated with the different investments.
57
57 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Test for Significant Rank Correlation n Formula Worksheet (showing data)
58
58 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Test for Significant Rank Correlation n Formula Worksheet
59
59 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Test for Significant Rank Correlation n Value Worksheet
60
60 Slide © 2003 South-Western/Thomson Learning™ Using Excel to Conduct a Test for Significant Rank Correlation n Using the p -Value The preceding value worksheet shows a two-tail p -value of.1844 The preceding value worksheet shows a two-tail p -value of.1844 The rejection rule is “Reject H 0 if p -value <.10” The rejection rule is “Reject H 0 if p -value <.10” Because.1844 >.10, we cannot reject H 0. There is not a significant rank correlation. Because.1844 >.10, we cannot reject H 0. There is not a significant rank correlation.
61
61 Slide © 2003 South-Western/Thomson Learning™ End of Chapter 17
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.