Nonparametric tests and ANOVAs: What you need to know.

Slides:

Advertisements

Similar presentations

BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.

Advertisements

Chapter 16 Introduction to Nonparametric Statistics

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.

2  How to compare the difference on >2 groups on one or more variables  If it is only one variable, we could compare three groups with multiple ttests:

Design of Experiments and Analysis of Variance

PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.

Testing Differences Among Several Sample Means Multiple t Tests vs. Analysis of Variance.

Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.

Independent Sample T-test Formula

Nonparametric Statistics Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.

The Normal Distribution. n = 20,290  =  = Population.

Analysis of Variance: Inferences about 2 or More Means

Lesson #23 Analysis of Variance. In Analysis of Variance (ANOVA), we have: H 0 :  1 =  2 =  3 = … =  k H 1 : at least one  i does not equal the others.

Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.

Homework Chapter 11: 13 Chapter 12: 1, 2, 14, 16.

PSY 307 – Statistics for the Behavioral Sciences

Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.

Final Review Session.

Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.

One-way Between Groups Analysis of Variance

Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.

Student’s t statistic Use Test for equality of two means

© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.

Today Concepts underlying inferential statistics

Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,

Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.

Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.

Chapter 15 Nonparametric Statistics

Chapter 12 Inferential Statistics Gay, Mills, and Airasian

Nonparametric or Distribution-free Tests

Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.

1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.

T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.

STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.

NONPARAMETRIC STATISTICS

Mid-Term Review Final Review Statistical for Business (1)(2)

Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.

t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.

Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:

Testing means, part II The paired t-test. Outline of lecture Options in statistics –sometimes there is more than one option One-sample t-test: review.

Analysis of variance Petter Mostad Comparing more than two groups Up to now we have studied situations with –One observation per object One.

PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)

Ordinally Scale Variables

Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.

Comparing Three or More Means ANOVA (One-Way Analysis of Variance)

Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)

Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.

Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.

1 Always be mindful of the kindness and not the faults of others.

Experimental Design and Statistics. Scientific Method

Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.

Experimental Psychology PSY 433 Appendix B Statistics.

1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.

Kruskal-Wallis H TestThe Kruskal-Wallis H Test is a nonparametric procedure that can be used to compare more than two populations in a completely randomized.

Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.

Friedman F r TestThe Friedman F r Test is the nonparametric equivalent of the randomized block design with k treatments and b blocks. All k measurements.

CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.

Kin 304 Inferential Statistics Probability Level for Acceptance Type I and II Errors One and Two-Tailed tests Critical value of the test statistic “Statistics.

Testing Differences in Means (t-tests) Dr. Richard Jackson © Mercer University 2005 All Rights Reserved.

HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.

Session 9: k Samples (Zar, Chapter 10). (1) General Setup: Group 1Group 2…Group k x 11 x 21 x k1 x 12 x 22 x k2 x 13 x 23 x k3 x 1n 1 x 2n 2 x kn k H.

ENGR 610 Applied Statistics Fall Week 8 Marshall University CITE Jack Smith.

 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.

1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.

Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.

Comparing Three or More Means

Basic Practice of Statistics - 5th Edition

Part Three. Data Analysis

One way ANALYSIS OF VARIANCE (ANOVA)

ANOVA: Analysis of Variance

Presentation transcript:

Nonparametric tests and ANOVAs: What you need to know

Nonparametric tests Nonparametric tests are usually based on ranks There are nonparametric versions of most parametric tests

Parametric One-sample and Paired t-test Two-sample t-test Sign test Mann-Whitney U-test Nonparametric

Quick Reference Summary: Sign Test What is it for? A non-parametric test to compare the medians of a group to some constant What does it assume? Random samples Formula: Identical to a binomial test with p o = 0.5. Uses the number of subjects with values greater than and less than a hypothesized median as the test statistic. P(x) = probability of a total of x successes p = probability of success in each trial n = total number of trials P = 2 * Pr[x  X]

Sample Null hypothesis Median = m o Null distribution Binomial n, 0.5 compare How unusual is this test statistic? P < 0.05 P > 0.05 Reject H o Fail to reject H o Sign test Test statistic x = number of values greater than m o

Quick Reference Summary: Mann-Whitney U Test What is it for? A non-parametric test to compare the central tendencies of two groups What does it assume? Random samples Test statistic: U Distribution under H o : U distribution, with sample sizes n 1 and n 2 Formulae: n 1 = sample size of group 1 n 2 = sample size of group 2 R 1 = sum of ranks of group 1 Use the larger of U1 or U2 for a two-tailed test

Sample Null hypothesis The two groups Have the same median Null distribution U with n 1, n 2 compare How unusual is this test statistic? P < 0.05 P > 0.05 Reject H o Fail to reject H o Mann-Whitney U test Test statistic U 1 or U 2 (use the largest)

Mann-Whitney U test Large-sample approximation: Use this when n 1 & n 2 are both > 10 Compare to the standard normal distribution

Mann-Whitney U Test If you have ties: –Rank them anyway, pretending they were slightly different –Find the average of the ranks for the identical values, and give them all that rank –Carry on as if all the whole-number ranks have been used up

Example Data

Example Sorted Data Data

Example Sorted Data Data TIES

Example Sorted Data Data TIES Rank them anyway, pretending they were slightly different

Example Rank A Sorted Data

Example Rank A Sorted Data Find the average of the ranks for the identical values, and give them all that rank

Example Rank A Sorted Data Average = 1.5 Average = 6

Example Rank A Sorted Data Rank

Example Rank A Sorted Data Rank These can now be used for the Mann-Whitney U test

Benefits and Costs of Nonparametric Tests Main benefit: –Make fewer assumptions about your data –E.g. only assume random sample Main cost: –Reduce statistical power –Increased chance of Type II error

When Should I Use Nonparametric Tests? When you have reason to suspect the assumptions of your test are violated –Non-normal distribution –No transformation makes the distribution normal –Different variances for two groups

Quick Reference Summary: ANOVA (analysis of variance) What is it for? Testing the difference among k means simultaneously What does it assume? The variable is normally distributed with equal standard deviations (and variances) in all k populations; each sample is a random sample Test statistic: F Distribution under H o : F distribution with k-1 and N-k degrees of freedom

Formulae: Quick Reference Summary: ANOVA (analysis of variance) = mean of group i = overall mean n i = size of sample i N = total sample size

k Samples Null distribution F with k-1, N-k df compare How unusual is this test statistic? P < 0.05 P > 0.05 Reject H o Fail to reject H o ANOVA Test statistic Null hypothesis All groups have the same mean

Formulae: Quick Reference Summary: ANOVA (analysis of variance) = mean of group i = overall mean n i = size of sample i N = total sample size There are a LOT of equations here, and this is the simplest possible ANOVA

df group = k-1 df error = N-k

df group = k-1 df error = N-k Sum of Squares df Mean SquaresF-ratio

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment Error Total

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment Error Total

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment k-1 Error N-k Total N-1

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment k-1 Error N-k Total N-1

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment k-1 Error N-k Total N-1

ANOVA Tables Source of variation Sum of squares dfMean Squares F ratioP Treatment k-1 Error N-k Total N-1 *

ANOVA Table: Example Source of variation Sum of squares dfMean Squares F ratioP Treatment Error Total

ANOVA Table: Example Source of variation Sum of squares dfMean Squares F ratioP Treatment Error Total

Additions to ANOVA R 2 value: how much variance is explained? Comparisons of groups: planned and unplanned Fixed vs. random effects Repeatability

Two-Factor ANOVA Often we manipulate more than one thing at a time Multiple categorical explanitory variables Example: sex and nationality

Two-factor ANOVA Don’t worry about the equations for this Use an ANOVA table

Two-factor ANOVA Testing three things: 1.Means don’t differ among treatment 1 2.Means don’t differ among treatment 2 3.There is no interaction between the two treatments

Two-factor ANOVA Table Source of variation Sum of Squares dfMean SquareF ratioP Treatment 1SS 1 k 1 - 1SS 1 k MS 1 MSE Treatment 2SS 2 k 2 - 1SS 2 k MS 2 MSE Treatment 1 * Treatment 2 SS 1*2 (k 1 - 1)*(k 2 - 1)SS 1*2 (k 1 - 1)*(k 2 - 1) MS 1*2 MSE ErrorSS error XXXSS error XXX TotalSS total N-1