Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.

Slides:



Advertisements
Similar presentations
Chapter 16 Introduction to Nonparametric Statistics
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
BCOR 1020 Business Statistics
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Hypothesis Testing for the Mean and Variance of a Population Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 15 Nonparametric Statistics
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Copyright © Cengage Learning. All rights reserved. 11 Applications of Chi-Square.
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Involving One Population.
14 Elements of Nonparametric Statistics
NONPARAMETRIC STATISTICS
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Involving One Population.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
14 Elements of Nonparametric Statistics
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Wilcoxon rank sum test (or the Mann-Whitney U test) In statistics, the Mann-Whitney U test (also called the Mann-Whitney-Wilcoxon (MWW), Wilcoxon rank-sum.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Ordinally Scale Variables
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
1 Nonparametric Statistical Techniques Chapter 17.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 10.17:
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
© Copyright McGraw-Hill 2004
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
1 Nonparametric Statistical Techniques Chapter 18.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Copyright © Cengage Learning. All rights reserved. 10 Inferences about Differences.
NONPARAMETRIC STATISTICS
The Rank-Sum Test Section 15.2.
Copyright © Cengage Learning. All rights reserved.
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics

Copyright © Cengage Learning. All rights reserved The Mann–Whitney U Test

3 The Mann–Whitney U test is a nonparametric alternative to the t-test for the difference between two independent means. The usual two-sample situation occurs when the experimenter wants to see whether the difference between the two samples is sufficient to reject the null hypothesis that the two sampled populations are identical.

4 Hypothesis-Testing Procedure

5 Assumptions for inferences about two populations using the Mann–Whitney U test The two independent random samples are independent within each sample as well as between samples, and the random variables are ordinal or numerical. This test is often used in situations in which the two samples are drawn from the same population of subjects but different “treatments” are used on each set. We will demonstrate the procedure in the next example.

6 Example 6 – Two-tailed Hypothesis Test In a large lecture class, when a 1-hour exam is given, the instructor gives two “equivalent” examinations. It is reasonable to ask: Are these two different exams really equivalent? Students in even-numbered seats take exam A, and those in the odd-numbered seats take exam B. To test this “equivalent” hypothesis, two random samples were taken. Table 14.3 lists the exam scores of the two samples. Data on Exam Scores [TA14-03] Table 14.3

7 Example 6 – Two-tailed Hypothesis Test If we assume that the odd- or even-numbered seats had no effect, does the sample present sufficient evidence to reject the hypothesis “The exam forms yielded scores that had identical distributions”? Test using  = Solution: Step 1 a. Parameter of interest: The distribution of scores for each version of the exam b. Statement of hypotheses: H o : Exam A and exam B have test scores with identical distributions. cont’d

8 Example 6 – Solution H a : The two distributions are not the same. Step 2 a. Assumptions: The two samples are independent, and the random variable, exam score, is numerical. b. Test statistic: The Mann–Whitney U statistic c. Level of significance:  = 0.05 Step 3 a. Sample information: The sample data are listed in Table Data on Exam Scores [TA14-03] Table 14.3 cont’d

9 Example 6 – Solution b. Test statistic. The size of the individual samples will be called n a and n b ; actually, it makes no difference which way these are assigned. In our example they both have the value 10. The two samples are combined into one sample (all n a + n b ) and ordered from smallest to largest: cont’d

10 Example 6 – Solution Each is then assigned a rank number. The smallest (49) is assigned rank 1, the next smallest (52) is assigned rank 2, and so on, up to the largest, which is assigned rank n a + n b (20). Ties are handled by assigning to each of the tied observations the mean rank of those rank positions that they occupy. For example, in our example there are two 78s; they are the 10 th and 11 th. The mean rank for each is then = 10.5 cont’d

11 Example 6 – Solution In the case of the three 90s—the 16 th, 17 th, and 18 th data values—each is assigned 17 because = 17. The rankings are shown in Table cont’d Ranked Exam Score Data Table 14.4

12 Example 6 – Solution Figure 14.2 shows the relationship between the two sets of data, first by using the data values and second by comparing the rank numbers for the data. Comparing the Data of Two Samples Figure 14.2 cont’d

13 Example 6 – Solution The calculation of the test statistic U is a two-step procedure. We first determine the sum of the ranks for each of the two samples. Then, using the two sums of ranks, we calculate a U score for each sample. The smaller U score is the test statistic. The sum of ranks R a for sample A is computed as R a = = 86 The sum of ranks R b for sample B is R b = = 124 cont’d

14 Example 6 – Solution The U score for each sample is obtained by using the following pair of formulas: cont’d

15 Example 6 – Solution, the test statistic, is the smaller of U a and U b. For our example, we obtain Therefore = 31. cont’d

16 Example 6 – Solution Before we carry out the test for this example, let’s try to understand some of the underlying possibilities. We know that the null hypothesis is that the distributions are the same and that we will most likely want to conclude from this that the averages are approximately equal. Suppose for a moment that the distributions are indeed quite different; say, all of one sample comes before the smallest data value in the second sample when they are ranked together. cont’d

17 Example 6 – Solution This would certainly mean that we would want to reject the null hypothesis. What kind of a value can we expect for U in this case? Suppose that the 10 A values had ranks 1 through 10 and that the 10 B values had ranks 11 through 20. Then we would obtain R a = 55 and R b = 155 cont’d

18 Example 6 – Solution Therefore = 0 If this were the case, we certainly would want to reach the decision: Reject the null hypothesis. Suppose, on the other hand, that both samples were perfectly matched; that is, a score in each set is identical to one in the other A B A B A B A B cont’d

19 Example 6 – Solution Now what would happen? R a = R b = 105 Therefore, = 50. If this were the case, we certainly would want to reach the decision: Fail to reject the null hypothesis. Note The sum of the two U’s (U a + U b ) will always be equal to the product of the two sample sizes (n a  n b ). For this reason we need concern ourselves only with the smaller U value. cont’d

20 Example 6 – Solution Step 4 Probability Distribution: p-Value: a. Since the concern is for values related to “not the same,” the p-value is the probability of both tails. It will be found by finding the probability of the left tail and doubling: P = 2  P(U  31 for n 1 = 10 and n 2 = 10) cont’d

21 Example 6 – Solution To find the p-value, you have two options: 1. Use Table 13 in Appendix B to place bounds on the p-value: P > Use a computer or calculator to find the p-value: P = b. The p-value is not smaller than . cont’d

22 Example 6 – Solution Classical: a.The critical region is two-tailed because H a expresses concern for values related to “not the same.” Use Table 13A for two-tailed  = The critical value is at the intersection of column n 1 = 10 and row n 2 = 10:23.The critical region is U  23. cont’d

23 Example 6 – Solution b. U is not in the critical region, as shown in the figure. Step 5 a. Decision: Fail to reject H o. b. Conclusion: We do not have sufficient evidence to reject the “equivalent” hypothesis. cont’d

24 Hypothesis-Testing Procedure Calculating the p-Value when Using the Mann–Whitney Test Method 1: Use Table 13 in Appendix B to place bounds on the p-value. By inspecting Table 13A and B at the intersection of column n 1 = 10 and row n 2 = 10, you can determine that the p-value is greater than 0.10; the larger two-tailed value of  is 0.10 in Table 13B. Method 2: If you are doing the hypothesis test with the aid of a computer or graphing calculator, most likely it will calculate the p-value for you.

25 Normal Approximation

26 Normal Approximation If the samples are larger than size 20, we may make the test decision with the aid of the standard normal variable, z. This is possible because the distribution of U is approximately normal with a mean and a standard deviation (14.5) (14.6)

27 Normal Approximation The hypothesis test is then completed using the test statistic : The standard normal distribution may be used whenever n a and n b are both greater than 10. (14.7)

28 Example 7 – One-tailed Hypothesis Test A dog-obedience trainer is training 27 dogs to obey a certain command. The trainer is using two different training techniques: (I) the reward-and-encouragement method and (II) the no-reward method. Table 14.5 shows the numbers of obedience sessions that were necessary before the dogs would obey the command. Data on Dog Training [TA14-05] Table 14.5

29 Example 7 – One-tailed Hypothesis Test Does the trainer have sufficient evidence to claim that the reward method will, on average, require fewer obedience sessions (  = 0.05)? Solution: Step 1 a. Parameter of interest: The distribution of needed obedience sessions for each technique b. Statement of hypotheses: H o : The distributions of the needed obedience sessions are the same for both methods. H a : The reward method, on average, requires fewer sessions. cont’d

30 Example 7 – Solution Step 2 a. Assumptions: The two samples are independent, and the random variable, training time, is numerical. b. Test statistic: The Mann–Whitney U statistic c. Level of significance:  = 0.05 Step 3 a. Sample information: The sample data are listed in Table cont’d

31 Example 7 – Solution b. Test statistic: The two sets of data are ranked jointly, and ranks are assigned as shown in Table Rankings for Training Methods Table 14.6 cont’d

32 Example 7 – Solution The sums are: R I = … = R II = … = The U scores are found using formulas (14.3) and (14.4): cont’d

33 Example 7 – Solution Therefore, = 31. Now we use formulas (14.5), (14.6), and (14.7) to determine the z-statistic: cont’d

34 Example 7 – Solution Step 4 Probability Distribution: p-Value: a. Use the left-hand tail because H a expresses concern for values related to “fewer than.” P = P(z < –2.88) as shown in figure. cont’d

35 Example 7 – Solution To find the p-value, you have three options: 1. Use Table 3 (Appendix B) to calculate the p-value: P = Use Table 5 (Appendix B) to place bounds on the p-value: < P < Use a computer or calculator to find the p-value: P = b. The p-value is smaller than . cont’d

36 Example 7 – Solution Classical: a. The critical region is the left-hand tail because H a expresses concern for values related to “fewer than.” The critical value is obtained from Table 4A: b. is in the critical region, as shown in red in the figure. cont’d

37 Example 7 – Solution Step 5 a. Decision: Reject H o. b. Conclusion: At the 0.05 level of significance, the data show sufficient evidence to conclude that the reward method does, on average, require fewer training sessions. cont’d