Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 13 Nonparametric Statistics 13-1Review and Preview 13-2Sign Test 13-3Wilcoxon Signed-Ranks Test for Matched Pairs 13-4Wilcoxon Rank-Sum Test for Two Independent Samples 13-5Kruskal-Wallis Test 13-6Rank Correction 13-7Runs Test for Randomness
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Key Concept This section describes the nonparametric method of the rank correlation test, which uses paired data to test for an association between two variables. In Chapter 10 we used paired sample data to compute values for the linear correlation coefficient r, but in this section we use ranks as a the basis for computing the rank correlation coefficient r s.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Definition The rank correlation test (or Spearman’s rank correlation test) is a non-parametric test that uses ranks of sample data consisting of matched pairs. It is used to test for an association between two variables.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. 1.With rank correlation, we can analyze paired data that are ranks or can be converted to ranks. This method does not require a normal distribution for any population. 2.Rank correlation can be used to detect some (not all) relationships that are not linear. Advantages
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Objective Compute the rank correlation coefficient r s and use it to test for an association between two variables. Then we can test the following:
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Notation r s =rank correlation coefficient for sample paired data (r s is a sample statistic) ρ s =rank correlation coefficient for all the population data (ρ s is a population parameter) n=number of pairs of sample data d=difference between ranks for the two values within an individual pair
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Requirements The paired data are a simple random sample and the data are ranks or can be converted to ranks. Note: Unlike the parametric methods of Section 10-2, there is no requirement that the sample pairs of data have a bivariate normal distribution. There is no requirement of a normal distribution for any population.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Test Statistic First convert the data to ranks. Then calculate: No ties: After converting the data in each sample to ranks, if there are no ties among ranks for either variable, the exact value of the test statistic can be calculated using this formula:
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. P-values: Sometimes found using technology, provided the technology used has this function. Critical values: If n ≤ 30, critical values are found in Table A-9. If n > 30, use this formula, where the value of z corresponds to the significance level (for example α = 0.05, z = 1.96). P-Values and Critical Values
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Disadvantage A not very serious disadvantage of rank correlation is its efficiency rating of This efficiency rating shows that with all other circumstances being equal, the nonparametric approach of rank correlation requires 100 pairs of sample data to achieve the same results as only 91 pairs of sample observations analyzed through parametric methods, assuming that the stricter requirements of the parametric approach are met.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Rank Correlation Testing
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Rank Correlation Testing
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Table 13-7 lists quality rankings and prices of 37-inch LCD televisions. Find the value of the rank correlation coefficient and use it to determine whether or not there is a correlation between quality and price. Use a 0.05 significance level. Based on the result, does it appear that you can get better quality by spending more?
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Requirement Check: We assume we have a simple random sample. The hypotheses are: On the following slide, we convert to ranks.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example We have no ties, so the exact value of the test statistic can be calculated as shown on the next slide:
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Test Statistic: We refer to Table A-9 to find the critical values of ± Because the test statistic r s = is between the critical values, we fail to reject the null hypothesis.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued There is not sufficient evidence to support a claim of a correlation between quality and price. Based on the given sample data, it appears you don’t necessarily get better quality by paying more.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Rank correlation methods sometimes allow us to detect relationships that we cannot detect with the methods of Chapter 10. Below is a scatterplot that shows an S-shaped pattern: Example
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. The methods of Chapter 10 give r = and critical values of ±0.632, suggesting that there is not a linear relationship between x and y. If we use the methods of this section, we get r s = 1 and critical values of ±0.648, suggesting that there is a correlation between x and y. With rank correlations, we can sometimes detect relationships that are not linear. Example - Continued