NONPARAMETRIC STATISTICS

Slides:



Advertisements
Similar presentations
Elementary Statistics
Advertisements

1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 18 日 第十三週:無母數方法.
Nonparametric Methods
NONPARAMETRIC STATISTICS
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
© 2003 Pearson Prentice Hall Statistics for Business and Economics Nonparametric Statistics Chapter 14.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 14 Analysis of Categorical Data
Chapter 15 Nonparametric Statistics
11 Chapter Nonparametric Tests © 2012 Pearson Education, Inc.
Fundamentals of Hypothesis Testing: One-Sample Tests
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
NONPARAMETRIC STATISTICS
14 Elements of Nonparametric Statistics
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
CHAPTER 14: Nonparametric Methods
Chapter 11 Nonparametric Tests.
What are Nonparametric Statistics? In all of the preceding chapters we have focused on testing and estimating parameters associated with distributions.
CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics seventh edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
© 2000 Prentice-Hall, Inc. Statistics Nonparametric Statistics Chapter 14.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
1 Nonparametric Statistical Techniques Chapter 17.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Non-parametric: Analysis of Ranked Data Chapter 18.
CHAPTER 5 NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1.
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter Eight: Using Statistics to Answer Questions.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
1 Nonparametric Statistical Techniques Chapter 18.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 10 l Non-Parametric Statistics 10.1 The Chi-Square Tests: Goodness-of-fit Test.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics fifth.
Nonparametric Statistics
Non-parametric: Analysis of Ranked Data
Logic of Hypothesis Testing
Non-Parametric Statistics
T-test calculations.
Chapter 12 Chi-Square Tests and Nonparametric Tests
NONPARAMETRIC STATISTICS
5.1 INTRODUCTORY CHI-SQUARE TEST
CHAPTER 11 CHI-SQUARE TESTS
i) Two way ANOVA without replication
Inferential Statistics
Lecture Slides Elementary Statistics Twelfth Edition
SA3202 Statistical Methods for Social Sciences
Lecture Slides Elementary Statistics Twelfth Edition
Spearman Rank Order Correlation Example
Chapter 12 Nonparametric Methods
Inference on Categorical Data
The Rank-Sum Test Section 15.2.
NONPARAMETRIC METHODS
Inferences Between Two Variables
SIMPLE LINEAR REGRESSION
St. Edward’s University
Chapter Nine: Using Statistics to Answer Questions
Nonparametric Statistics
Chapter 9 Hypothesis Testing: Single Population
Chapter Sixteen McGraw-Hill/Irwin
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Inferential testing.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: The method is used on nominal data The method is used in ordinal data The method is used in interval scale or ratio scale data but there is no assumption regarding the probability distribution of the population where the sample is selected. Wilcoxon Signed Rank Test Spearman’s Rank Correlation Test Mann-Whitney Test Kruskal Wallis Test Sign Test

Wilcoxon Signed Rank Test When using this technique, those assumptions should be follow: 1. 2. 3.

Wilcoxon Signed Rank Test Can be applied to two types of sample: one sample or paired sample For one sample, this method tests whether the sample could have been drawn from a population having a hypothesized value as its median For paired sample, to test whether the two populations from which these samples are drawn identical. Important terms :- - difference of paired samples - modular of difference of paired samples R- ranks R( ) – signed-rank

The Wilcoxon Signed Rank test for one sample i. Null and alternative hypothesis: ii. Test procedure: For each of the observed values, find the difference between each value and the median; where median value that has been specified Ignoring the observation where , rank the values so the smallest will have a rank of 1. Where two or more differences have the same value find their mean rank, and use this. For observation where , list the rank as column and list the rank as column Case Rejection region Two tail Right tail Left tail

Then, sum the ranks of the positive differences, and sum the ranks of the negative differences . iii. The test statistic, W is the depends on the alternative hypothesis: For a two tailed test the test statistic For a one tailed test where the the test statistic, For a one tailed test where the the test statistic iv. Critical region: Compare the test statistic, W with the critical value in the tables; the null hypothesis is rejected if v. Make a decision

Example 5.1: An environmental activist believes her community’s drinking water contains at least the 40.0 parts per million (ppm) limit recommended by health officials for a certain metal. In response to her claim, the health department samples and analyzes drinking water from a sample of 11 households in the community. The results are as in the table below. At the 0.05 level of significance, by using Wilcoxon method can we conclude that the community’s drinking water might at least 40.0 ppm recommended limit? Household Observed concentration A 39 B 20.2 C 40 D 32.2 E 30.5 F 26.5 G 42.1 H 45.6

I 42.1 J 29.9 K 40.9

Observed concentration Solution: Household Observed concentration Rank, A 39 -1 1 2 B 20.2 -19.8 19.8 10 C 40 _ D 32.2 -7.8 7.8 6 E 30.5 -9.5 9.5 7 F 26.5 -13.5 13.5 9 G 42.1 2.1 3.5 H 45.6 5.6 5 I J 29.9 -10.1 10.1 8 K 40.9 0.9

1. Or (One tail test) 2. Based on the alternative hypothesis, the test statistic 3. From table of Wilcoxon signed rank for one tail test, 4. We will reject 5. Since , thus we failed to reject and conclude that the city’s water supply might have at least 40.0 ppm of the metal

Exercise 5.1: Student satisfaction surveys ask students to rate a particular course, on a scale of 1 (poor) to 10 (excellent). In previous years the replies have been symmetrically distributed about a median of 4. This year there has been a much greater on-line element to the course, and staff want to know how the rating of this version of the course compares with the previous one. 14 students, randomly selected, were asked to rate the new version of the course and their ratings were as follows: 1 3 6 4 8 2 3 6 5 2 3 4 1 2 Is there any evidence at the 5% level that students rate this version any differently?

The Wilcoxon Signed Rank test for paired sample i. Null and alternative hypothesis: ii. Test procedure: For each of the observed values, calculate Ignoring observation where , rank the values so the smallest will have a rank of 1. Where two or more differences have the same value find their mean rank, and use this. For observation where , list the rank as column and list the rank as column Case Rejection region Two tail Right tail Left tail

i. Null hypothesis

Then, sum the ranks of the positive differences, and sum the ranks of the negative differences iii. The test statistic, W is the depends on the alternative hypothesis: - For a two tailed test the test statistic - For a one tailed test where the the test statistic, - For a one tailed test where the the test statistic iv. Critical region: Compare the test statistic, W with the critical value in the tables; the null hypothesis is rejected if Remember! If there are zero in differences, the value for n must subtracts the number of rows which contain zero in differences. v. Make a decision

Example 5.2: Two computer software packages are being considered for use in the inventory control department of a small manufacturing firm. The firm has selected 12 different computing task that are typical of the kinds of jobs. The results are shown in the table below. At the 0.10 level, can we conclude that those two computer software packages are identical?

Time required for software packages Computing task Time required for software packages A 24 23.1 B 16.7 20.4 C 21.6 17.7 D 23.7 20.7 E 37.5 42.1 F 31.4 36.1 G 14.9 21.8 H 37.3 40.3 I 17.9 26 J 15.5 K 29 35.4 L 19.9 25.5

Time required for software packages Solution: Computing task Time required for software packages Rank, A 24 23.1 0.9 1 B 16.7 20.4 -3.7 3.7 4 C 21.6 17.7 3.9 5 D 23.7 20.7 3 2.5 E 37.5 42.1 -4.6 4.6 6 F 31.4 36.1 -4.7 4.7 7 G 14.9 21.8 -6.9 6.9 10 H 37.3 40.3 -3 I 17.9 26 -8.1 8.1 11 J 15.5 _ K 29 35.4 -6.4 6.4 9 L 19.9 25.5 -5.6 5.6 8

1. (two tail test) Or 2. Based on the alternative hypothesis, the test is 3. 4. From table of Wilcoxon signed rank for two tail test, We will reject 5. Since , thus we reject and conclude that the software packages are not equally rapid in handling computing tasks like those in the sample, or the population median for is not equal to zero and that package x is faster than package y in handling computing task like ones sample.

Exercise 5.2: The following data gives the number of industrial accidents in ten manufacturing plants for one month periods before and after an intensive promotion on safety: Do the data support the claim that the plant before and after an intensive promotion are identical? Test at . Plant 1 2 3 4 5 6 7 8 9 10 Before After

Exercise 5.3 An experiment was conducted to compare the densities of cakes prepared from two different cake mixes, A and B. Six cake pans received batter A and six received batter B. Expecting a variation in oven temperature, the experimenter placed an A and B cake side by side at six different locations in oven. Test the hypothesis of no difference in the median of population distributions of cake densities for two different cake batters. Test at . Cake Mixes A Cake Mixes B 0.135 0.129 0.102 0.120 0.098 0.112 0.141 0.152 0.131 0.144 0.163

Spearman’s Rank Correlation Test We have seen the correlation coefficient r measure the linear relationship between two continuous variable X and Y Spearman’s Rank Correlation Test is used to measure the strength and the direction of the relationship between two variables which are at least ordinal data. A measure of correlation for ranked data based on the definition of Pearson Correlation where there is no tie or few ties called Spearman rank Correlation Coefficient, denoted by where

A value of +1 or -1 indicated perfect association between X and Y The plus sign with value indicates strong positive correlation between the x and y, and indicates weak positive correlation between the x and y The minus sign with value indicates strong negative correlation between the x and y, and indicates weak negative correlation between the x and y When is zero or close to zero, we would conclude that the variable are uncorrelated

Spearman’s Rank Correlation Test 1. 2. Test statistic: 3 Spearman’s Rank Correlation Test 1. 2. Test statistic: 3. Critical value: 4. Rejection region: 5. Conclusion

Example 5.3: The data below show the effect of the mole ratio of sebacic acid on the intrinsic viscosity of copolyesters. Find the Spearman rank correlation coefficient to measure the relationship of mole ratio of sebacic acid and the viscosity of copolyesters. Solutions: X: mole ratio of sebacic Y: viscosity of copolyesters Mole ratio 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 Viscosity 0.45 0.20 0.34 0.58 0.70 0.57 0.55 0.44

Thus which shows a weak negative correlation between the mole ratio of sebacic acid and the viscosity of copolyesters Mole ratio Viscosity 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.45 0.20 0.34 0.58 0.70 0.57 0.55 0.44 8 7 6 5 4 3 2 1 -2 -4 -3 16 36 9 T = 110

Exercise 5.4: The following data were collected and rank during an experiment to determine the change in thrust efficiency, y as the divergence angle of a rocket nozzle, x changes: Find the Spearman rank correlation coefficient to measure the relationship between the divergence angle of a rocket nozzle and the change in thrust efficiency. Rank X 1 2 3 4 5 6 7 8 9 10 Rank Y

Exercise 5.5 Suppose eight elementary school science teachers have been ranked by a judge according to their teaching ability and all have taken a “national teachers’ examination.” Calculate the Spearman rank correlation coefficient to measure the relationship between Judge Rank ( ) and Test Rank ( ). Judge Rank ( ) Test Rank ( ) 7 1 4 5 2 3 6 8

To determine whether a difference exist between two populations Mann-Whiteny Test To determine whether a difference exist between two populations Sometimes called as Wilcoxon rank sum test Two independent random samples are required from each population. Let be the random samples of sizes and where from population X and Y respectively 1. Null and alternative hypothesis Two tail test Left tail test Right tail test Rejection area

Hypotheses to be tested are:

Example 5.4: Data below show the marks obtained by electrical engineering students in an examination: Can we conclude the achievements of male and female students identical at significance level Gender Marks Male Female 60 62 78 83 40 65 70 88 92

Solution: Gender Marks Rank Male Female 60 62 78 83 40 65 70 88 92 2 3 1. 2. Gender Marks Rank Male Female 60 62 78 83 40 65 70 88 92 2 3 6 7 1 4 5 8 9

From the table of Wilcoxon rank sum test for Reject Since , thus we fail to reject and conclude that the achievements of male and female are not significantly different.

Exercise 5.6: Using high school records, Johnson High school administrators selected a random sample of four high school students who attended Garfield Junior High and another random sample of five students who attended Mulbery Junior High. The ordinal class standings for the nine students are listed in the table below. Test using Mann-Whitney test at 0.05 level of significance. Garfield J. High Mulbery J. High Student Class standing Fields 8 Hart 70 Clark 52 Phipps 202 Jones 112 Kirwood 144 TIbbs 21 Abbott 175 Guest 146