Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Chapter 16 Introduction to Nonparametric Statistics
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Copyright © 2010 Pearson Education, Inc. Slide
Nonparametric Statistics Timothy C. Bates
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
PBHL 5313 Nonparametric Methods Module III Zoran Bursac Biostatistics UAMS.
statistics NONPARAMETRIC TEST
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
© 2003 Pearson Prentice Hall Statistics for Business and Economics Nonparametric Statistics Chapter 14.
Chapter 14 Analysis of Categorical Data
Final Review Session.
Chapter 11: Inference for Distributions
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 14: Non-parametric tests Marshall University Genomics.
Nonparametrics and goodness of fit Petter Mostad
Chapter 15 Nonparametric Statistics
Nonparametric or Distribution-free Tests
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Review I volunteer in my son’s 2nd grade class on library day. Each kid gets to check out one book. Here are the types of books they picked this week:
Non-parametric Dr Azmi Mohd Tamil.
Chapter 14: Nonparametric Statistics
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Correlation.
Chapter 15 Correlation and Regression
14 Elements of Nonparametric Statistics
NONPARAMETRIC STATISTICS
Previous Lecture: Categorical Data Methods. Nonparametric Methods This Lecture Judy Zhong Ph.D.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
Chapter 14 Nonparametric Tests Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.
© 2000 Prentice-Hall, Inc. Statistics Nonparametric Statistics Chapter 14.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Ordinally Scale Variables
Stats 2022n Non-Parametric Approaches to Data Chp 15.5 & Appendix E.
Nonparametric Tests IPS Chapter 15 © 2009 W.H. Freeman and Company.
1 Nonparametric Statistical Techniques Chapter 17.
Nonparametric Statistics
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
NON-PARAMETRIC STATISTICS
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
Nonparametric Statistics
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Nonparametric Statistics
Hypothesis Testing Procedures Many More Tests Exist!
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
Non-Parametric Tests.
Nonparametric Statistics
Presentation transcript:

Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods

Copyright © 2012 Pearson Education Many measurements are ordered information with no quantitative values or units. Surveys of opinion using categories such as “very satisfied,” “satisfied,” “neither satisfied nor dissatisfied,” “dissatisfied,” or “very dissatisfied” is a prime example of measurements without quantitative units. These scales were first developed by Rensis Likert and are known as Likert scales. These data can be quite useful but how do you discern the difference between “very satisfied” and “satisfied” or “dissatisfied” versus “very dissatisfied.” These data have no units, so how can we compare them?

Copyright © 2012 Pearson Education Rank When data has a clear order that cannot be quantified, rank is used to get a clearer understanding of the data. To rank the data, from two or more groups, order all the data and number them consecutively. For tied data levels, use the average of its consecutive numbers for the rank of each of the tied data values. Example: Two small classes take a test; rank their letter grades. Class 1A+C-BDA-C+ Class 1 ranks Class 2A-DBAB+C Class 2 ranks

Copyright © 2012 Pearson Education The Wilcoxon Rank-Sum/ Mann-Whitney Statistic The Wilcoxon rank-sum and the Mann-Whitney statistic use the sum of the ranks for each group to determine if the two groups are different. Because the sum of all the ranks is a known value only one of the group’s sums is needed to test the hypothesis. H 0 : The two groups are the same. H A : The two groups differ. We need a P-value to determine whether the rank sums found are out of the ordinary. For smaller groups all possible rank sum can be computed and used to determine the probability of having a rank sum as extreme as the one found given the null is not rejected. Tables are available of critical values for rank sums of smaller groups.

Copyright © 2012 Pearson Education The Wilcoxon Rank-Sum/ Mann-Whitney Statistic Using the Central Limit Theorem for Larger Groups To test larger groups we use the Central Limit Theorem. The means tend to be normally distributed as the sample size grows. Thus, we can expect the sums to be normally distributed since the sample size is constant. All we need to know is the expected value (the mean), and the standard deviation to use a normal distribution in order to find the P-value. For groups 1 and 2, with corresponding rank sums, T 1 and T 2,

Copyright © 2012 Pearson Education The Wilcoxon Rank-Sum/ Mann-Whitney Statistic When to Use the Wilcoxon/Mann-Whitney Test Use when data consists of grades or ordered categories These tests can also be used to replace numeric data with ranks when there is a possibility of extreme outliers, which would affect other tests. With quantitative data the Wilcoxon/Mann-Whitney only has about 95% of the power of a corresponding two sample t-test Using these tests simply to ignore outliers is a bad idea. Outliers are interesting in their own right and ignoring them may result in missing an important fact or instance.

Copyright © 2012 Pearson Education The Wilcoxon Rank-Sum/ Mann-Whitney Statistic Assumptions and Conditions Nonparametric methods do not require assumptions about distributions. However, we still have some assumptions about the structure of the data: Independence Assumption. The data values are mutually independent. Appropriate randomization in data collection is one way to accomplish this. Independent Groups Assumption. The two groups must be independent of each other.

Copyright © 2012 Pearson Education The Wilcoxon Rank-Sum/ Mann-Whitney Statistic Assumptions and Conditions The hypotheses may seem kind of vague. But it’s a feature of the method that it doesn’t test properties of any population parameter. That’s what makes it a nonparametric method.

Copyright © 2012 Pearson Education Kruskal-Wallace Test The Kruskal-Wallace test is a nonparametric method used in hypothesis testing. To perform this test first rank all the data values together, assigning average ranks to tied values. Then sum up these ranks for each group and calculate H with the following formula where the T i are the rank sums for each group, and N is the total number of values.

Copyright © 2012 Pearson Education Kruskal-Wallace Test The Kruskal-Wallace test uses the chi-square distribution denoted with degrees of freedom and significants level. The test says to reject the null hypothesis “H 0 : all populations are identical,” if

Copyright © 2012 Pearson Education Paired Data: The Wilcoxon Signed-Rank Test 1)Find the difference between each set of paired data. 2)Remove the tied data pairs (pairs with zero difference). 3)Using the new count of cases, rank the absolute values of the differences. 4)Sum the ranks of the positive differences (T + ) and the negative differences (T - ). 5)Use the smaller of the T-values as the test statistic. Compare this value with a table of critical values to determine if we can reject the null hypothesis that there is no difference between the two groups. The Wilcoxon Signed-Rank Test is a nonparametric method suitable for paired data. To use the test follow these steps:

Copyright © 2012 Pearson Education Paired Data: The Wilcoxon Signed-Rank Test a)A panel of consumers comparing the ratings of a new soda formula on a 5-point Likert scale to the ratings of a competitor's similar new formula on the same scale b)Judging whether the height of the water, at high tide, at a marina increases more or less consistently as the phases of the moon change Appropriate Not Appropriate Example: For which of the following situations would a Wilcoxon Signed Rank Test be appropriate?

Copyright © 2012 Pearson Education *Friedman Test for a Randomized Block Design 1)Rank the data separately within each block. 2)Sum the ranks for each treatment and denote the sums T i, where i is the index for each treatment. 3)Use the formula where b is the number of blocks and k is the number of treatment to determine the test statistic. The Friedman Test is a nonparametric alternative for ANOVA and is essentially a generalization of the Wilcoxon Signed-Rank test for block designs. To use this test complete the following:

Copyright © 2012 Pearson Education *Friedman Test for a Randomized Block Design The null hypothesis for a Friedman Test is H 0 : All the treatments are identical. With an alternative hypothesis of H a : at least one of the treatments is different. We reject the null hypothesis at the approximate significance level if: We use chi-squared ( ) distribution for an approximate result. There are tables of exact critical values available for a small sample size, but this approximation works well for most cases.

Copyright © 2012 Pearson Education Kendall’s Tau: Measuring Monotonicity A consistent trend is called monotone. Kendall’s Tau ( ) measures monotonicity directly by recording whether the slope between two paired points is positive, negative, or zero. In a monotone plot all the paired points will be positive or all will be negative. Tau is computed with the following formula and can take on values between and. n c is the number of points with a positive slope between them, n d is the number of points with a negative slope between them, and n is the total number of points.

Copyright © 2012 Pearson Education Spearman’s Rho Spearman’s Rho is a correlation coefficient that uses ranks to deal with violations of the Linearity Condition, or outliers and bends. When using this coefficient first rank both the x- and y-variable separately from lowest to highest. If a scatter plot of the original data shows a trend, ranking the data will show a similar trend if the original trend was linear. Spearman’s Rho must be between and. Because this is a correlation coefficient, we use the standard linear correlation coefficient formula with ranks to find rho ( ).

Copyright © 2012 Pearson Education Spearman’s Rho Spearman’s Rho has advantages over the correlation coefficient r in that it can be used when only the ranks of the data are known. Also, the Spearman’s Rho coefficient can be used to measure the consistency of a trend without insisting that the trend be linear. When ranking the data, linear trends remain linear and curved trends take on a linear form. Data containing outliers does not affect the Spearman’s Rho, since the values used are ranks.

Copyright © 2012 Pearson Education Spearman’s Rho a)A panel of teenage tasters and a panel of adult tasters comparing the sweetness of a diet drink (rated from 1 to 10) b)Estimating the association between the value of used cars and their age on a lot where there is known to be some high- end classic cars Not appropriate, this data set would not contain a trend. Appropriate, this data would have a clear nonlinear trend with possible outliers. Example: For which of the following situations would Spearman’s Rho be appropriate?

Copyright © 2012 Pearson Education When Should You Use Nonparametric Methods? Nonparametric methods are particularly valuable when your data contain only information about order. They are useful with quantitative data when the variables violate one or more of our assumptions and conditions. Transferring quantitative values to ranks protects us from the influence of outliers, multi-modal distributions, skewed distributions, and nonlinear relationships. Be careful using nonparametric methods when you don’t have to since they are less powerful than corresponding parametric methods. Also features such as outliers, multiple modes, skews, and nonlinear relationships are important features of the data, and should not be ignored.

Copyright © 2012 Pearson Education Don’t forget to check independence. The assumption that the cases are independent is especially important when P-values and critical values are determined by probability calculations, based on the assumption that all orderings are equally likely. Don’t degrade your data unnecessarily. If you have quantitative data that satisfy the assumptions and conditions of statistics methods such as t-tests, correlation, regression, and ANOVA, you are generally better off using those methods rather than the corresponding nonparametric methods.

Copyright © 2012 Pearson Education What Have We Learned? Recognize when nonparametric methods may be appropriate. Data recorded as ranks without units can usually be best analyzed with nonparametric methods. Data with unusual distributions or for which we can’t reasonably make the assumptions about the distribution, as required by parametric methods, call for distribution-free (nonparametric) methods. Situations in which the question being asked is best answered by a nonparametric method. The Wilcoxon Rank-Sum Statistic, also called the Mann-Whitney statistic, tests whether two independent groups have the same or different typical ranks for some variable.

Copyright © 2012 Pearson Education What Have We Learned? The Kruskal-Wallace Test compares three or more groups. The Wilcoxon Signed-Rank Test is appropriate for paired data. Kendall’s Tau measures the degree on monotonicity in the relationship between two variables. It can be applied to two quantitative variables if the question of interest is about monotonicity—and especially if we suspect, or know that the linearity assumption, required by regression, and Pearson correlation is not satisfied. It can be applied to rank data, where it is not appropriate to apply least squares methods at all.

Copyright © 2012 Pearson Education What Have We Learned? Spearman’s Rho measures the correlation between the ranks of two variables. It can be used even when the relationship is not linear, or when outliers are present in either variable. It can be used when one or both variables consist of ranks.