1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 18 日 第十三週:無母數方法.

Slides:



Advertisements
Similar presentations
Nonparametric Methods: Analysis of Ranked Data
Advertisements

Prepared by Lloyd R. Jaisingh
1 Chapter 20: Statistical Tests for Ordinal Data.
Nonparametric Methods
16- 1 Chapter Sixteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 16 Introduction to Nonparametric Statistics
Uji Tanda dan Peringkat Bertanda Wilcoxon Pertemuan 25 Matakuliah: Statistika Psikologi Tahun: 2008.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 Chapter 10 Comparisons Involving Means  1 =  2 ? ANOVA Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis.
1 1 Slide MA4704 Problem solving 4 Statistical Inference About Means and Proportions With Two Populations n Inferences About the Difference Between Two.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western /Thomson Learning.
Chapter 10 Comparisons Involving Means
Sampling Distribution of If and are normally distributed and samples 1 and 2 are independent, their difference is.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Statistical Inference About Means and Proportions With Two Populations
© 2003 Pearson Prentice Hall Statistics for Business and Economics Nonparametric Statistics Chapter 14.
Chapter 10b Hypothesis Tests About the Difference Between the Means of Two Populations: Independent Samples, Small-Sample CaseHypothesis Tests About the.
Chapter 14 Analysis of Categorical Data
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 4 日 第十二週:複迴歸.
Chapter 15 Nonparametric Statistics
© 2011 Pearson Education, Inc
Non-parametric Dr Azmi Mohd Tamil.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
11 Chapter Nonparametric Tests © 2012 Pearson Education, Inc.
Non-parametric Methods: Analysis of Ranked Data
Chapter 14: Nonparametric Statistics
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
14 Elements of Nonparametric Statistics
1 1 Slide Slides by John Loucks St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 10 Statistical Inference About Means and Proportions With Two Populations n Inferences About the Difference.
1 1 Slide © 2005 Thomson/South-Western AK/ECON 3480 M & N WINTER 2006 n Power Point Presentation n Professor Ying Kong School of Analytic Studies and Information.
Overview of non parametric methods
CHAPTER 14: Nonparametric Methods
Chapter 11 Nonparametric Tests.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics seventh edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Non-parametric: Analysis of Ranked Data Chapter 18.
© 2000 Prentice-Hall, Inc. Statistics Nonparametric Statistics Chapter 14.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Ordinally Scale Variables
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Nonparametric Statistical Techniques Chapter 17.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Non-parametric: Analysis of Ranked Data Chapter 18.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 23 日 第六週:配適度與獨立性檢定.
Nonparametric Methods: Analysis of Ranked Data
Sampling Distribution of If and are normally distributed and samples 1 and 2 are independent, their difference is.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
1 QNT 531 Advanced Problems in Statistics and Research Methods WORKSHOP 5 By Dr. Serhat Eren University OF PHOENIX.
Chapter 13 Understanding research results: statistical inference.
1 Nonparametric Statistical Techniques Chapter 18.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics fifth.
Non-parametric: Analysis of Ranked Data
統計學 Spring 2004 授課教師:統計系余清祥 日期:2004年3月16日 第五週:比較變異數.
Chapter 12 Chi-Square Tests and Nonparametric Tests
St. Edward’s University
Wilcoxon Matched Pairs Signed Ranks test
十二、Nonparametric Methods (Chapter 12)
St. Edward’s University
Nonparametric Statistics
Chapter Sixteen McGraw-Hill/Irwin
Presentation transcript:

1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 18 日 第十三週:無母數方法

2 2 Slide Chapter 19 Nonparametric Methods n Sign Test n Wilcoxon Signed-Rank Test n Mann-Whitney-Wilcoxon Test n Kruskal-Wallis Test n Rank Correlation

3 3 Slide n Most of the statistical methods referred to as parametric require the use of interval- or ratio-scaled data. n Nonparametric methods are often the only way to analyze nominal or ordinal data and draw statistical conclusions. n Nonparametric methods require no assumptions about the population probability distributions. n Nonparametric methods are often called distribution- free methods. Nonparametric Methods

4 4 Slide Nonparametric Methods n In general, for a statistical method to be classified as nonparametric, it must satisfy at least one of the following conditions. The method can be used with nominal data. The method can be used with nominal data. The method can be used with ordinal data. The method can be used with ordinal data. The method can be used with interval or ratio data when no assumption can be made about the population probability distribution. The method can be used with interval or ratio data when no assumption can be made about the population probability distribution.

5 5 Slide Sign Test n A common application of the sign test involves using a sample of n potential customers to identify a preference for one of two brands of a product. n The objective is to determine whether there is a difference in preference between the two items being compared. n To record the preference data, we use a plus sign if the individual prefers one brand and a minus sign if the individual prefers the other brand. n Because the data are recorded as plus and minus signs, this test is called the sign test.

6 6 Slide Example: Peanut Butter Taste Test n Sign Test: Large-Sample Case As part of a market research study, a sample of 36 consumers were asked to taste two brands of peanut butter and indicate a preference. Do the data shown below indicate a significant difference in the consumer preferences for the two brands? 18 preferred Hoppy Peanut Butter (+ sign recorded) 12 preferred Pokey Peanut Butter ( _ sign recorded) 6 had no preference 6 had no preference The analysis is based on a sample size of = 30.

7 7 Slide n Hypotheses H 0 : No preference for one brand over the other exists H a : A preference for one brand over the other exists n Sampling Distribution  2.74 Sampling distribution of the number of “+” values if there is no brand preference Sampling distribution of the number of “+” values if there is no brand preference  = 15 =.5(30)  = 15 =.5(30) Example: Peanut Butter Taste Test

8 8 Slide Example: Peanut Butter Taste Test n Rejection Rule Using.05 level of significance, Reject H 0 if z 1.96 n Test Statistic z = ( )/2.74 = 3/2.74 = n Conclusion Do not reject H 0. There is insufficient evidence in the sample to conclude that a difference in preference exists for the two brands of peanut butter. Fewer than 10 or more than 20 individuals would have to have a preference for a particular brand in order for us to reject H 0.

9 9 Slide Wilcoxon Signed-Rank Test n This test is the nonparametric alternative to the parametric matched-sample test presented in Chapter 10. n The methodology of the parametric matched-sample analysis requires: interval data, and interval data, and the assumption that the population of differences between the pairs of observations is normally distributed. the assumption that the population of differences between the pairs of observations is normally distributed. n If the assumption of normally distributed differences is not appropriate, the Wilcoxon signed-rank test can be used.

10 Slide Example: Express Deliveries n Wilcoxon Signed-Rank Test A firm has decided to select one of two express delivery services to provide next-day deliveries to the district offices. To test the delivery times of the two services, the firm sends two reports to a sample of 10 district offices, with one report carried by one service and the other report carried by the second service. Do the data (delivery times in hours) on the next slide indicate a difference in the two services?

11 Slide Example: Express Deliveries District Office Overnight NiteFlite District Office Overnight NiteFlite Seattle 32 hrs. 25 hrs. Los Angeles3024 Boston1915 Cleveland1615 New York1513 Houston1815 Atlanta1415 St. Louis108 Milwaukee79 Denver1611

12 Slide Wilcoxon Signed-Rank Test n Preliminary Steps of the Test Compute the differences between the paired observations. Compute the differences between the paired observations. Discard any differences of zero. Discard any differences of zero. Rank the absolute value of the differences from lowest to highest. Tied differences are assigned the average ranking of their positions. Rank the absolute value of the differences from lowest to highest. Tied differences are assigned the average ranking of their positions. Give the ranks the sign of the original difference in the data. Give the ranks the sign of the original difference in the data. Sum the signed ranks. Sum the signed ranks.... next we will determine whether the sum is significantly different from zero.

13 Slide Example: Express Deliveries District Office Differ. |Diff.| Rank Sign. Rank Seattle Los Angeles69+9 Boston47+7 Cleveland New York24+4 Houston36+6 Atlanta St. Louis24+4 Milwaukee-24-4 Denver

14 Slide n Hypotheses H 0 : The delivery times of the two services are the same; neither offers faster service than the other. H a : Delivery times differ between the two services; recommend the one with the smaller times. n Sampling Distribution Sampling distribution of T if populations are identical Sampling distribution of T if populations are identical     T = 0 T Example: Express Deliveries

15 Slide Example: Express Deliveries n Rejection Rule Using.05 level of significance, Reject H 0 if z 1.96 n Test Statistic z = ( T -  T )/  T = (44 - 0)/19.62 = 2.24 n Conclusion Reject H 0. There is sufficient evidence in the sample to conclude that a difference exists in the delivery times provided by the two services. Recommend using the NiteFlite service. Note: Large sample test statistic.

16 Slide Mann-Whitney-Wilcoxon Test n This test is another nonparametric method for determining whether there is a difference between two populations. n This test, unlike the Wilcoxon signed-rank test, is not based on a matched sample. n This test does not require interval data or the assumption that both populations are normally distributed. n The only requirement is that the measurement scale for the data is at least ordinal.

17 Slide Mann-Whitney-Wilcoxon Test n Instead of testing for the difference between the means of two populations, this method tests to determine whether the two populations are identical. n The hypotheses are: H 0 : The two populations are identical H a : The two populations are not identical

18 Slide Example: Westin Freezers n Mann-Whitney-Wilcoxon Test (Large-Sample Case) Manufacturer labels indicate the annual energy cost associated with operating home appliances such as freezers. The energy costs for a sample of 10 Westin freezers and a sample of 10 Brand-X Freezers are shown on the next slide. Do the data indicate, using  =.05, that a difference exists in the annual energy costs associated with the two brands of freezers?

19 Slide Example: Westin Freezers Westin Freezers Brand-X Freezers Westin Freezers Brand-X Freezers $55.10 $

20 Slide Example: Westin Freezers n Mann-Whitney-Wilcoxon Test (Large-Sample Case) Hypotheses Hypotheses H 0 : Annual energy costs for Westin freezers and Brand-X freezers are the same. and Brand-X freezers are the same. H a : Annual energy costs differ for the two brands of freezers. two brands of freezers.

21 Slide n First, rank the combined data from the lowest to the highest values, with tied values being assigned the average of the tied rankings. n Then, compute T, the sum of the ranks for the first sample. n Then, compare the observed value of T to the sampling distribution of T for identical populations. The value of the standardized test statistic z will provide the basis for deciding whether to reject H 0. Mann-Whitney-Wilcoxon Test: Large-Sample Case

22 Slide Mann-Whitney-Wilcoxon Test: Large-Sample Case n Sampling Distribution of T for Identical Populations Mean Mean  T = n 1 ( n 1 + n 2 + 1)/2  T = n 1 ( n 1 + n 2 + 1)/2 Standard Deviation Standard Deviation Distribution Form Distribution Form Approximately normal, provided Approximately normal, provided n 1 > 10 and n 2 > 10

23 Slide Example: Westin Freezers Westin Freezers Rank Brand-X Freezers Rank Westin Freezers Rank Brand-X Freezers Rank $ $ $ $ Sum of Ranks 86.5 Sum of Ranks 123.5

24 Slide Example: Westin Freezers n Mann-Whitney-Wilcoxon Test (Large-Sample Case) Sampling Distribution Sampling Distribution    Sampling distribution of T if populations are identical Sampling distribution of T if populations are identical  T = 105 =1/2(10)(21) T

25 Slide Example: Westin Freezers n Rejection Rule Using.05 level of significance, Reject H 0 if z 1.96 n Test Statistic z = ( T -  T )/  T = ( )/13.23 = n Conclusion Do not reject H 0. There is insufficient evidence in the sample data to conclude that there is a difference in the annual energy cost associated with the two brands of freezers.

26 Slide Kruskal-Wallis Test n The Mann-Whitney-Wilcoxon test can be used to test whether two populations are identical. n The MWW test has been extended by Kruskal and Wallis for cases of three or more populations. n The Kruskal-Wallis test can be used with ordinal data as well as with interval or ratio data. n Also, the Kruskal-Wallis test does not require the assumption of normally distributed populations. n The hypotheses are: H 0 : All populations are identical H 0 : All populations are identical H a : Not all populations are identical H a : Not all populations are identical

27 Slide n First, calculate the rank sums T 1, T 2, , T k for the k samples and calculate the test statistic where n i is the number of observations in sample i. where n i is the number of observations in sample i.  The greater the differences in location among the k population, the larger will be the value of the H statistic. n The H statistic will be approximately a chi-square distribution with (k  1) degrees of freedom.

28 Slide Rank Correlation n The Pearson correlation coefficient, r, is a measure of the linear association between two variables for which interval or ratio data are available. n The Spearman rank-correlation coefficient, r s, is a measure of association between two variables when only ordinal data are available. n Values of r s can range from –1.0 to +1.0, where values near 1.0 indicate a strong positive association between the rankings, and values near 1.0 indicate a strong positive association between the rankings, and values near -1.0 indicate a strong negative association between the rankings. values near -1.0 indicate a strong negative association between the rankings.

29 Slide Rank Correlation n Spearman Rank-Correlation Coefficient, r s where: n = number of items being ranked x i = rank of item i with respect to one variable x i = rank of item i with respect to one variable y i = rank of item i with respect to a second variable y i = rank of item i with respect to a second variable d i = x i - y i d i = x i - y i

30 Slide Test for Significant Rank Correlation n We may want to use sample results to make an inference about the population rank correlation p s. n To do so, we must test the hypotheses: H 0 : p s = 0 H 0 : p s = 0 H a : p s = 0 H a : p s = 0

31 Slide n Sampling Distribution of r s when p s = 0 Mean Mean Standard Deviation Standard Deviation Distribution Form Distribution Form Approximately normal, provided n > 10 Approximately normal, provided n > 10 Rank Correlation

32 Slide Example: Connor Investors n Rank Correlation Connor Investors provides a portfolio management service for its clients. Two of Connor’s analysts rated ten investments from high (6) to low (1) risk as shown below. Use rank correlation, with  =.10, to comment on the agreement of the two analysts’ ratings. Connor Investors provides a portfolio management service for its clients. Two of Connor’s analysts rated ten investments from high (6) to low (1) risk as shown below. Use rank correlation, with  =.10, to comment on the agreement of the two analysts’ ratings. InvestmentABCDEF GHI J Analyst # Analyst #

33 Slide Analyst #1 Analyst #2 Analyst #1 Analyst #2 Investment Rating Rating Differ. (Differ.) 2 A1100 B45-11 C9639 D82636 E69-39 F G5324 H I24-24 J10824 Sum =92 Sum =92 Example: Connor Investors

34 Slide Example: Connor Investors n Hypotheses H 0 : p s = 0 (No rank correlation exists.) H 0 : p s = 0 (No rank correlation exists.) H a : p s = 0 (Rank correlation exists.) H a : p s = 0 (Rank correlation exists.) n Sampling Distribution  r = 0 rsrsrsrs Sampling distribution of r s under the assumption of no rank correlation Sampling distribution of r s under the assumption of no rank correlation

35 Slide Example: Connor Investors n Rejection Rule Using.10 level of significance, Reject H 0 if z n Test Statistic z = ( r s -  r )/  r = ( )/.3333 = 1.33 n Conclusion Do no reject H 0. There is not a significant rank correlation. The two analysts are not showing agreement in their rating of the risk associated with the different investments.

36 Slide End of Chapter 19