Nonparametric Statistics

Prepared by Lloyd R. Jaisingh

Nonparametric Statistics Lecture 9

Small Sample, Non-normal Population
If the sample is large, the Central Limit Theorem applies and we can test hypotheses about the mean. If the population is normal, the sampling distribution of the mean is exactly normal, whatever the sample size. But what do we do if the sample is small and the population is non-normal?
Nonparametric statistics is a sub-field of statistics that draws inferences about populations without assuming that they follow any particular distribution.

One-Sample Example
Suppose that a nurse has been instructed to perform a procedure in a new way. Researchers recorded the change in the number of minutes it took the nurse to perform the procedure.
The data are: 0.6, -0.5, 1.1, 2.4, 3.5, 2.0, -0.1, 1.0, 2.1, -0.6, -0.2
We would be hard pressed to say that these data even approximately follow a normal distribution.

Assumption of Normality for Small Sample Example
There are only 11 observations, and we might be uncomfortable claiming that this distribution looks normal. Instead, it looks more uniform.

The Sign Test - 5 Steps
Assumptions: Random, independent sample
Hypotheses:
  Null hypothesis: Median equals zero
  Alternative hypothesis: Median does not equal zero
Test statistic: p = 7/11, the sample proportion of observations that are greater than zero, which we compare with one-half.

The Sign Test - 5 Steps, cont.
P-value: Needs an exact calculation, since the CLT doesn’t apply with small samples.
95% CI for p with small samples: (0.308, 0.891)
Conclusion: Since 0.5 is included in the 95% confidence interval, we can’t say that the median is significantly different from zero at the 0.05 level. (We fail to reject the null hypothesis.)
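The sign test steps above can be sketched in a few lines of standard-library Python. This is an illustration, not the software used in the lecture: the function names are made up here, and the interval is computed with the Clopper-Pearson method on the assumption that this is the "exact" small-sample interval the slide refers to (it does reproduce the slide's limits to three decimals).

```python
from math import comb

# Changes in procedure time (minutes) from the one-sample example.
changes = [0.6, -0.5, 1.1, 2.4, 3.5, 2.0, -0.1, 1.0, 2.1, -0.6, -0.2]


def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def sign_test(values):
    """Two-sided exact sign test of H0: median = 0 (zeros are dropped)."""
    pos = sum(v > 0 for v in values)
    neg = sum(v < 0 for v in values)
    n = pos + neg
    # Under H0 the number of positive signs is Binomial(n, 1/2).
    tail = sum(binom_pmf(k, n, 0.5) for k in range(max(pos, neg), n + 1))
    return pos, n, min(1.0, 2 * tail)


def exact_ci(x, n, level=0.95):
    """Clopper-Pearson interval for a proportion, found by bisection."""
    alpha = 1 - level

    def root(f, target):
        # f is increasing in p; find p in (0, 1) with f(p) = target.
        lo, hi = 0.0, 1.0
        for _ in range(60):
            mid = (lo + hi) / 2
            if f(mid) < target:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2

    def at_least(p):  # P(X >= x), increasing in p
        return sum(binom_pmf(k, n, p) for k in range(x, n + 1))

    def more_than(p):  # P(X > x), increasing in p
        return sum(binom_pmf(k, n, p) for k in range(x + 1, n + 1))

    lower = 0.0 if x == 0 else root(at_least, alpha / 2)
    upper = 1.0 if x == n else root(more_than, 1 - alpha / 2)
    return lower, upper


positives, n, p_value = sign_test(changes)
ci_low, ci_high = exact_ci(positives, n)
print(positives, n, round(p_value, 4), round(ci_low, 3), round(ci_high, 3))
```

The two-sided p-value doubles the binomial tail beyond the larger sign count, so it is far above 0.05 here, in line with the slide's conclusion that the null hypothesis is not rejected.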

The Signed Rank Test - 5 Steps
Assumptions:
  The measurement is continuous
  Independent, random sample from the population
  Distribution is symmetric
Hypotheses:
  H0: Median of the distribution is 0
  HA: Median of the distribution is non-zero
Test statistic: Minimum of the rank sums
P-value: From the computer! For this example, p = 0.0439
Conclusion: As per usual. Here p = 0.0439 < 0.05, so we reject the null hypothesis at the 0.05 level.

Calculation of Signed Rank Test Statistic
Order the observations from smallest to largest in absolute value: |Y|(1) ≤ |Y|(2) ≤ … ≤ |Y|(n)
So from the example: |-0.1| < |-0.2| < |-0.5| < |-0.6| = 0.6 < 1.0 < 1.1 < 2.0 < 2.1 < 2.4 < 3.5
Assign ranks 1, 2, …, n to these absolute values. In the example the ranks run from 1 to 11, with the two tied values |-0.6| and 0.6 each receiving the average rank 4.5.

Signed Rank Test Statistic, cont.
Arrange the ranks into two groups: those belonging to observations smaller than zero and those belonging to observations larger than zero. Sum the ranks of the negative and the positive observations separately.
Here, for negative values, sum of ranks = 1 + 2 + 3 + 4.5 = 10.5
For positive values, sum of ranks = 4.5 + 6 + 7 + 8 + 9 + 10 + 11 = 55.5
Test statistic = smaller rank sum = 10.5
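The ranking and summing steps can be checked with a short sketch in standard-library Python. The helper name `average_ranks` is introduced here for illustration only; it gives tied values their average rank, as the lecture recommends.

```python
# Changes in procedure time (minutes) from the one-sample example.
changes = [0.6, -0.5, 1.1, 2.4, 3.5, 2.0, -0.1, 1.0, 2.1, -0.6, -0.2]


def average_ranks(values):
    """Rank values from smallest to largest; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


# Rank the absolute values, then split the ranks by the sign of the observation.
abs_ranks = average_ranks([abs(v) for v in changes])
w_minus = sum(r for v, r in zip(changes, abs_ranks) if v < 0)
w_plus = sum(r for v, r in zip(changes, abs_ranks) if v > 0)
statistic = min(w_minus, w_plus)
print(w_minus, w_plus, statistic)
```

As a sanity check, the two rank sums should always add up to n(n + 1)/2, which is 66 for n = 11.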

P-values for Signed Rank Test
For critical values and p-values, consult tables or computer-generated output.
This procedure is unavailable in the Student version of SPSS. It is available in SAS and the regular version of SPSS.
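When no table or statistics package is at hand, a small-sample p-value can be obtained by brute force. The sketch below assumes the usual exact approach: under the null hypothesis each rank is equally likely to carry a plus or a minus sign, so all 2^n sign patterns are enumerated. The ranks and the observed statistic are taken from the example; the function name is illustrative.

```python
from itertools import product

# Ranks of the absolute values in the example (the tie shares rank 4.5).
ranks = [1, 2, 3, 4.5, 4.5, 6, 7, 8, 9, 10, 11]
observed = 10.5  # smaller of the two rank sums


def exact_signed_rank_p(ranks, observed):
    """Two-sided exact p-value: share of sign patterns whose smaller
    rank sum is at most the observed one."""
    total = sum(ranks)
    hits = 0
    for signs in product((0, 1), repeat=len(ranks)):
        w_neg = sum(r for r, s in zip(ranks, signs) if s)
        if min(w_neg, total - w_neg) <= observed:
            hits += 1
    return hits / 2 ** len(ranks)


p_exact = exact_signed_rank_p(ranks, observed)
print(round(p_exact, 4))
```

Enumeration is only practical for small n (2^11 = 2048 patterns here); larger samples call for tables, a normal approximation, or software.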

Comments on Signed Rank Test
More “powerful” than the Sign Test, but requires more assumptions
One-sided tests are possible
Robust to outliers
Some books/programs use the sum of the ranks of the positive values as the test statistic; the p-values are the same either way
Nonparametric confidence intervals are also available from some software programs
For tied observations, use the average rank for each tied observation

Nonparametric Statistics for Small, Non-normal Samples
Paired Data
  The same as for univariate data, except that the test is performed on the differences rather than the raw data.
Two Independent Groups
  Mann-Whitney Rank Sum Test (Ch. 24)
  The procedure is similar to the Signed Rank test, except that instead of dividing observations according to whether they are positive or negative, we divide observations according to group membership.
  Assumptions include (1) independent, random samples, (2) independently selected groups, and (3) the shape and spread of the two distributions are the same.

Paired Differences Example
Wife 0.4 0.5 1.0 0.2 0.9 1.2 0.1 0.6 Husband 0.7 0.0 Difference -0.1 0.3 -0.2
Study Hypothesis: Men and women spend different amounts of time reading/watching the news.

The Signed Rank Test - 5 Steps
Assumptions:
  The measurement (difference) is continuous
  Independent, random sample from the population
  Distribution of the difference is symmetric
Hypotheses:
  H0: Median of the difference is 0
  HA: Median of the difference is non-zero
Test statistic: Minimum of the rank sums
P-value: From the computer! For this example,
Conclusion: As per usual.

Computer Outputs - Paired
Data for wives and husbands are in two separate columns, with matched observations in the same row.
Analyze > Nonparametric Tests > 2 Related Samples…
Wilcoxon Signed Ranks Test

Computer Outputs - Paired
Data for wives and husbands are in two separate columns, with matched observations in the same row.
Analyze > Nonparametric Tests > 2 Related Samples…
Sign Test

Two Independent Groups Example
Wife 0.4 0.5 1.0 0.2 0.9 1.2 0.1 0.6 Husband 0.7 0.0
Study Hypothesis: Men and women spend different amounts of time reading/watching the news.

The Mann-Whitney Test - 5 Steps
Assumptions:
  Independent, random samples
  Independently selected groups
  The shape and spread of the two distributions are the same
Hypotheses:
  H0: Group medians are the same
  HA: Group medians are different
Test statistic: Rank sums
P-value: From the table or computer! For this example,
Conclusion: As per usual.
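The rank-sum idea can be sketched as follows. The two samples below are hypothetical numbers invented for illustration (they are not the wife/husband data), and the helper names are likewise made up. The p-value uses the standard exact argument: under the null hypothesis every way of assigning the pooled ranks to the first group is equally likely.

```python
from itertools import combinations


def average_ranks(values):
    """Rank values from smallest to largest; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


# Hypothetical data, for illustration only.
group_a = [1.1, 2.3, 0.7, 3.0]
group_b = [0.5, 1.9, 0.2]

n_a, n_b = len(group_a), len(group_b)
ranks = average_ranks(group_a + group_b)  # rank the pooled observations
r_a = sum(ranks[:n_a])                    # rank sum of group A
r_b = sum(ranks[n_a:])                    # rank sum of group B
u_a = r_a - n_a * (n_a + 1) / 2           # Mann-Whitney U for group A
u_b = r_b - n_b * (n_b + 1) / 2           # Mann-Whitney U for group B

# Exact two-sided p-value: share of possible group-A rank sets whose sum is
# at least as far from its null expectation as the observed rank sum.
expected = n_a * (n_a + n_b + 1) / 2
splits = list(combinations(range(n_a + n_b), n_a))
extreme = sum(
    abs(sum(ranks[i] for i in idx) - expected) >= abs(r_a - expected)
    for idx in splits
)
p_exact = extreme / len(splits)
print(r_a, r_b, u_a, u_b, round(p_exact, 4))
```

The two U values always add up to n_a × n_b, which is a quick check on the arithmetic; which of the rank sum or U a program reports is one reason the test goes by several names.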

Computer Outputs - Independent
Data for wives and husbands are in the same column; a second column indicates whether each observation is for the wife or the husband*.
Analyze > Nonparametric Tests > 2 Independent Samples…
Mann-Whitney Test
*: The type of this variable must be Numeric in SPSS.

Comments on Nonparametric Test for 2 Independent Samples
Robust to outliers
One-sided tests are possible
Nonparametric confidence intervals are also available from some software programs
For tied observations, use the average rank for each tied observation
Possible names:
  Mann-Whitney Rank Sum Test
  Mann-Whitney Test
  Mann-Whitney U Test
  Wilcoxon Rank Sum Test

Testing for a Relationship between Categorical Variables
Large sample size:
  Chi-square test
Small sample size:
  Chi-square test with Yates’ continuity correction
  Fisher’s exact test
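To see how the chi-square test and its Yates-corrected version differ on a small table, here is a minimal sketch for a 2x2 table in standard-library Python. The counts (11, 10, 6, 0) are an assumption: they are one reading of the colonoscopy example that follows, chosen because one expected count falls well below 5. The function names are illustrative.

```python
from math import erfc, sqrt


def chi_square_2x2(a, b, c, d, yates=False):
    """Pearson chi-square test for the 2x2 table [[a, b], [c, d]].

    Returns (statistic, p_value); the p-value is the upper tail of a
    chi-square distribution with 1 degree of freedom.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Yates' continuity correction shrinks |ad - bc| by n/2 (not below 0).
        diff = max(0.0, diff - n / 2)
    stat = n * diff**2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return stat, erfc(sqrt(stat / 2))


def smallest_expected_count(a, b, c, d):
    """Smallest expected cell count under independence."""
    n = a + b + c + d
    rows, cols = (a + b, c + d), (a + c, b + d)
    return min(r * k / n for r in rows for k in cols)


# Assumed counts: rows = severe bleeding (no, yes), columns = treatment.
table = (11, 10, 6, 0)
plain_stat, plain_p = chi_square_2x2(*table)
yates_stat, yates_p = chi_square_2x2(*table, yates=True)
min_expected = smallest_expected_count(*table)
print(round(plain_stat, 3), round(yates_stat, 3), round(min_expected, 2))
```

The corrected statistic is noticeably smaller than the uncorrected one, and the small expected count is the usual signal to prefer Fisher's exact test.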

Urgent Colonoscopy for the Diagnosis and Treatment of Severe Diverticular Hemorrhage
New England Journal of Medicine 2000;342:78-82

Severe Bleeding   Medical and Surgical Treatment   Medical and Colonoscopic Treatment   Total
No                11                               10                                   21
Yes               6                                0                                    6
Total             17                               10                                   27

Research Hypothesis

Fisher’s Exact Test - 5 Steps
Assumptions:
  Independent, random sample from the population
  Two variables are categorical
Hypotheses:
  H0: Response and Predictor are Independent
  HA: Response and Predictor are Associated
Test statistic: (p-value)
P-value: From the computer! For this example, p = 0.057
Conclusion: As per usual.
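For a 2x2 table, the computation behind Fisher's exact test is short enough to write out. The sketch below assumes the common two-sided definition (sum the probabilities of all tables with the same margins that are no more likely than the observed one) and assumes the cell counts 11, 10, 6, 0 for the colonoscopy example; with those counts it reproduces the slide's p = 0.057. The function name is illustrative.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, col1, n = a + b, a + c, a + b + c + d

    def weight(k):
        # Number of tables with the observed margins and k in the top-left cell
        # (the hypergeometric numerator); integers keep comparisons exact.
        return comb(col1, k) * comb(n - col1, row1 - k)

    k_min = max(0, row1 - (n - col1))
    k_max = min(row1, col1)
    observed = weight(a)
    extreme = sum(
        weight(k) for k in range(k_min, k_max + 1) if weight(k) <= observed
    )
    return extreme / comb(n, row1)


# Assumed counts: rows = severe bleeding (no, yes),
# columns = (medical and surgical, medical and colonoscopic).
p_fisher = fisher_exact_two_sided(11, 10, 6, 0)
print(round(p_fisher, 3))
```

Statistical packages may define the two-sided p-value slightly differently, so small discrepancies with software output are possible on other tables.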

Data Entry
Weight the cases by the variable count.
Data > Weight Cases…

Computer Outputs - FET
Crosstabs: Perform FET (or the Chi-square test if the sample size is large)
Analyze > Descriptive Statistics > Crosstabs…
Assign “bleeding” for “Row(s)”, “treat” for “Column(s)”
Click “Statistics” to check “Chi-square”

The Inexact Use of Fisher’s Exact Test in Six Major Medical Journals
JAMA 1989;261:3430-3433
Table 1. Specification of Use of Fisher’s Exact Test by Journal

Journal                                           No. of Articles That Specified / No. of Articles Reviewed
New England Journal of Medicine                   8 / 9
Annals of Internal Medicine                       2 / 4
British Medical Journal                           3 / 6
The Journal of the American Medical Association   6 / 16
Lancet                                            4 / 14
American Journal of Medicine                      0 / 7

Homework
To be posted, not graded
Solutions will be posted on Monday
Read Chapters 24, 25, 27