Assume as previously that we have k samples on as many treatments

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

1 Chapter 20: Statistical Tests for Ordinal Data.
Kruskal Wallis and the Friedman Test.
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Computing the ranks of data is only one of several possible so- called scoring methods that are in use... Section 2.7 reviews three of them – we’ll look.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
Nonparametric and Resampling Statistics. Wilcoxon Rank-Sum Test To compare two independent samples Null is that the two populations are identical The.
Nonparametric or Distribution-free Tests
Lecture 4 Ttests STAT 3120 Statistical Methods I.
What if the population standard deviation, , is unknown? We could estimate it by the sample standard deviation s which is the square root of the sample.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
1 Experimental Statistics - week 4 Chapter 8: 1-factor ANOVA models Using SAS.
Introduction to SAS Essentials Mastering SAS for Data Analytics
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 3 – Slide 1 of 19 Chapter 3 Section 3 Measures of Central Tendency and Dispersion.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
NON-PARAMETRIC STATISTICS
Experimental Statistics - week 3
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Computing the ranks of data is only one of several possible so-called scoring methods that are in use... Section 2.7 reviews three of them – we’ll look.
Use the SET statement to: –create an exact copy of a SAS dataset –modify an existing SAS dataset by creating new variables, subsetting (using a subsetting.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
95% CI and Width for Mean # of hrs watching TV for Three Different Sample Sizes Sample SizeConfidence Interval Interval WidthStand DevStand Error N=
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
10 Chapter Chi-Square Tests and the F-Distribution Chapter 10
Inferential Statistics
March 28 Analyses of binary outcomes 2 x 2 tables
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Effect Sizes (continued)
Chapter 13 Nonlinear and Multiple Regression
Social Science Research Design and Statistics, 2/e Alfred P
Hypothesis Testing Using the Chi Square (χ2) Distribution
5-5 Inference on the Ratio of Variances of Two Normal Populations
Lecture 16 Nonparametric Two-Sample Tests
Statistics 103 Monday, July 10, 2017.
Hypothesis Tests for a Population Mean,
Section 6-4 – Confidence Intervals for the Population Variance and Standard Deviation Estimating Population Parameters.
SA3202 Statistical Methods for Social Sciences
Chapter 9 Hypothesis Testing.
We’ll now consider 2x2 contingency tables, a table which has only 2 rows and 2 columns along with a special way to analyze it called Fisher’s Exact Test.
Signed-rank statistic
Consider this table: The Χ2 Test of Independence
Chi Square Two-way Tables
What about ties?? There are two methods mentioned on p.155ff:
Introduction to ANOVA.
Association, correlation and regression in biomedical research
Model Diagnostics and Tests
Confidence Intervals for a Standard Deviation
Test to See if Samples Come From Same Population
Chi2 (A.K.A X2).
Non-parametric Analysis of the Variance in SAS®
CHAPTER 14 MULTIPLE REGRESSION
Hypothesis Tests for a Standard Deviation
NONPARAMETRIC STATISTICS
Exact Test Fisher’s Statistics
Nonparametric Statistics
1-Way Analysis of Variance - Completely Randomized Design
Experimental Statistics - week 8
Common Statistical Analyses Theory behind them
pairing data values (before-after, method1 vs
InferentIal StatIstIcs
K-Sample Methods Assume X1j from treatment 1 (sample of size n1) and and so forth… Xkj from treatment k (sample of size nk) for a total of n1+n2+ … +nk.
See Table and let’s do it in R…
Analysis of two-way tables
Introduction to SAS Essentials Mastering SAS for Data Analytics
Wilcoxon Rank-Sum Test
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Presentation transcript:

Assume as previously that we have k samples on as many treatments Assume as previously that we have k samples on as many treatments. We’ll let Rij denote the rank of the sample value Xij and as before N = the total number of sample values (see the Table 3.2.1 on page 86 for the complete notations…). Then the Kruskal – Wallis statistic is the rank-based equivalent to the F-statistic given by the formula Note that the sum in the KW formula is actually the treatment sum of squares (remember, (N+1)/2 is the mean of all the N ranks). The constant coefficient of the sum is a “scaling factor” which makes the KW statistic have approximately a chi-square distribution with k-1 degrees of freedom. Thus p-values may be obtained from the chi-square tables, for large enough sample size; or we may use Table A6 in the Appendix (for small sample sizes and for k=3 or 4; or we may use a permutation test based on this KW statistic…

The permutation test based on the KW statistic is done in a similar manner to the others we’ve done except we’ll have to compute KW after each “shuffle” (sample)… try it now on Example 3.2.1 on page 87… see R#7 handout. For SAS, use PROC NPAR1WAY WILCOXON; but be careful about using the EXACT WILCOXON; statement in the k-sample case – it can take several minutes to actually compute the exact probabilities… Try this on the data from Table 3.2.2 on page 87 In the case of ties in the data, use mid-ranks to compute the ranks and make one of two adjustments (see p. 88 and 89):

It is also possible to create a “KW-like” statistic for general scores (not just ranks or mid-ranks) such as van der Waerden scores. See the statistic GS on page 91 and go over it carefully… HW: Finish reading this section 3.2; make sure you can calculate the KW statistic in R and SAS and understand the output. Midterm HW: Apply the Kruskal-Wallis permutation test to the data in problem #2 on page 105.

/*use the following to calculate the tied KW statistics*/ /*note that proc npar1way does the tied KW on p.88*/ dm log 'clear'; dm output 'clear'; options ls=80; data table3_2_3; input food_group $ salt_score @@; datalines; pr1 4 pr1 5 pr1 3 pr1 4 pr1 5 pr1 5 pr1 2 pr2 3 pr2 4 pr2 5 pr2 2 pr2 3 pr2 1 pr2 1 pr2 2 pr3 2 pr3 1 pr3 1 pr3 2 pr3 1 pr3 3 ; proc print; run; proc sort; by food_group; run; proc rank; run; *to get the mid-ranks; proc means; by food_group; run; *to get the means of the ranks for each group; proc means; run; *to get the mean and s.d. for all the ranks combined; proc npar1way wilcoxon data=table3_2_3; class food_group; var salt_score; run; quit;