The Rank-Sum Test Section 15.2.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
statistics NONPARAMETRIC TEST
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 9 Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
NONPARAMETRIC STATISTICS
Chapter 11 Nonparametric Tests.
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
Wilcoxon rank sum test (or the Mann-Whitney U test) In statistics, the Mann-Whitney U test (also called the Mann-Whitney-Wilcoxon (MWW), Wilcoxon rank-sum.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 Nonparametric Statistical Techniques Chapter 17.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Unit 8 Section 8-3 – Day : P-Value Method for Hypothesis Testing  Instead of giving an α value, some statistical situations might alternatively.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Nonparametric tests: Tests without population parameters (means and standard deviations)
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Copyright © Cengage Learning. All rights reserved. 15 Distribution-Free Procedures.
1 Nonparametric Statistical Techniques Chapter 18.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Copyright © Cengage Learning. All rights reserved. Hypothesis Testing 9.
Nonparametric Statistics
Hypothesis Testing – Two Population Variances
Chapter 12 Chi-Square Tests and Nonparametric Tests
NONPARAMETRIC STATISTICS
Testing the Difference between Means and Variances
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 12 ANALYSIS OF VARIANCE
Hypothesis Testing: Hypotheses
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
Hypothesis Tests for a Population Mean,
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Environmental Modeling Basic Testing Methods - Statistics
Hypothesis Tests for a Population Mean,
Hypothesis Theory examples.
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Lecture Slides Elementary Statistics Twelfth Edition
Nonparametric Tests BPS 7e Chapter 28 © 2015 W. H. Freeman and Company.
Lecture Slides Elementary Statistics Eleventh Edition
Inference about Two Means: Independent Samples
Hypothesis Tests for Proportions
INTRODUCTION TO HYPOTHESIS TESTING
CHAPTER 10 Comparing Two Populations or Groups
Hypothesis tests for the difference between two proportions
One-Way Analysis of Variance
Hypothesis Tests for Two Population Standard Deviations
HYPOTHESIS TESTS ABOUT THE MEAN AND PROPORTION
NONPARAMETRIC METHODS
Hypothesis Tests for a Standard Deviation
CHAPTER 10 Comparing Two Populations or Groups
Power Section 9.7.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Nonparametric Statistics
CHAPTER 10 Comparing Two Populations or Groups
Distribution-Free Procedures
Testing a Claim About a Mean:  Known
Presentation transcript:

The Rank-Sum Test Section 15.2

Objectives Assign ranks to a sample Test hypotheses using the rank-sum test

Assign ranks to a sample Objective 1 Assign ranks to a sample

Ranks To rank the values in a sample, we order them from smallest to largest. The smallest value is assigned a rank of 1, the next smallest is assigned a rank of 2, and so forth. For values in the data set whose ranks are tied, each value is assigned the average of the ranks. Example: The heights of ten fourth-graders, in inches, are 48, 46, 45, 54, 46, 52, 50, 46, 52, 49. Assign ranks to these values. Solution: We arrange the values in increasing order. We notice there are three values tied at 46, and two tied at 52. We assign the ranks 1 through 10 as preliminary ranks. The three values of 46 have preliminary ranks 2, 3, 4. We compute the average of these preliminary ranks: 2 + 3 + 4 3 = 9 3 =3. The rank assigned to each of these values is therefore 3. The two values of 52 have preliminary ranks 8 and 9. We assign each the rank 8 + 9 2 = 17 2 =8.5. Height 45 46 48 49 50 52 54 Preliminary Ranks 1 2 3 4 5 6 7 8 9 10 Ranks 8.5

Test hypotheses using the rank-sum test Objective 2 Test hypotheses using the rank-sum test

The Rank-Sum Test Wilcoxon’s rank-sum test, also known as the Mann-Whitney test, is a nonparametric test to determine whether two population medians are equal. It is a nonparametric alternative to the two-sample 𝑡-test. The test requires that the populations have approximately the same shape. When two populations have the same shape and their medians are equal, they have the same distribution. For this reason the rank-sum test is sometimes described as a test to determine whether two populations differ. The main advantage of the rank-sum test is that it does not require that the populations be approximately normal. The main disadvantage is that the null hypothesis can be falsely rejected when the population medians are equal, if the shapes of the two populations differ substantially.

The Rank-Sum Test The rank-sum test is used to test the null hypothesis that two population medians are equal. Independent samples are drawn from each population. The combined samples are then ranked. The smallest value in the two samples combined is assigned a rank of 1, the second smallest is assigned a rank of 2, and so forth. If the two populations are the same, the ranks for the values assigned to one sample should not be substantially larger or smaller than the ranks assigned to the other. Thus the rank-sum test rejects the null hypothesis when the sum of the ranks for one of the samples is exceptionally large or small. Null hypothesis: 𝐻 0 : 𝑚 1 = 𝑚 2 Alternate hypothesis: Left-tailed: 𝐻 1 : 𝑚 1 < 𝑚 2 Right-tailed: 𝐻 1 : 𝑚 1 > 𝑚 2 Two-tailed: 𝐻 1 : 𝑚 1 ≠ 𝑚 2

Test Statistic To construct the test statistic, let 𝑛 1 be the smaller of the two sample sizes and let 𝑛 2 be the larger. Let 𝑆 be the sum of the ranks for the smaller sample. If both samples are the same size, then 𝑛 1 and 𝑛 2 are both equal to that size, and 𝑆 may be the sum of the ranks of either sample. When 𝐻 0 is true, it can be shown that 𝑆 has mean 𝜇 𝑆 and standard deviation 𝜎 𝑆 given by 𝜇 𝑆 = 𝑛 1 𝑛 1 +𝑛+1 2 𝜎 𝑆 = 𝑛 1 𝑛 2 ( 𝑛 1 + 𝑛 2 +1) 12 We construct the test statistic as 𝑧= 𝑆− 𝜇 𝑆 𝜎 𝑆 . When both samples sizes are 10 or more, the test statistic has an approximately standard normal distribution, so P-values and critical values can be found using the normal curve (Table A.2 or technology). When one or both of the sample sizes is less than 10, special tables can be used. However, since most samples in practice consist of at least 10 values, we do not consider this situation here.

Example – Rank-Sum Test A sample of 14 students took a statistics class online, and another sample of 12 students took an equivalent class in a traditional classroom. Both classes were given the same final exam at the end of the course. Can you conclude that there is a difference between the median scores from the two classes? Use the 𝛼=0.05 level of significance. Solution: The null hypothesis is that there is no difference in the scores between the two teaching methods. The alternate hypothesis is that there is a difference. 𝐻 0 : 𝑚 1 = 𝑚 2 𝐻 1 : 𝑚 1 ≠ 𝑚 2 Online 78 82 83 87 75 63 60 94 62 98 90 97 81 Traditional 73 72 92 100 74 64 84 77 89 70

Example – Rank-Sum Test Solution (continued): We arrange the combined samples in increasing order and assign ranks. Score 60 62 63 64 70 72 73 74 75 77 78 Sample O T Rank 1 2 3 4.5 6 7 8 9 10 11 12.5 81 82 83 84 87 89 90 92 94 97 98 100 14 15 16 17 18 19 20.5 22 23 24 25 26 The sample sizes are 𝑛 1 =12 and 𝑛 2 =14. The smaller sample belongs to the traditional class, so 𝑆 is the sum of the ranks corresponding to that sample. We compute 𝑆=154.5, 𝜇 𝑆 =162, and 𝜎 𝑆 =19.442. The value of the test statistic is 𝑧= 𝑆− 𝜇 𝑆 𝜎 𝑆 = 154.5−162 19.442 =−0.39.

Example – Rank-Sum Test Solution (continued): Since this is a two-tailed test, we may find the P-value by summing the area to the left of 𝑧=−0.39 and to the right of 𝑧=0.39. From Table A.2, we find the P-value to be 0.6965. Technology yields a P-value of 0.6997. Since P > 0.05, we do not reject 𝐻 0 . We cannot conclude that there are differences in test scores between the two methods of instruction.

You Should Know… The primary advantage and disadvantage of the rank-sum test How to assign ranks How to perform the rank-sum test to test the median of a population