Lecture 14 The Sign Test and the Rank Test

Slides:



Advertisements
Similar presentations
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
Advertisements

McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
EPI 809 / Spring 2008 Chapter 9 Nonparametric Statistics.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Elementary hypothesis testing
Statistics 07 Nonparametric Hypothesis Testing. Parametric testing such as Z test, t test and F test is suitable for the test of range variables or ratio.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
5-3 Inference on the Means of Two Populations, Variances Unknown
7.1 Lecture 10/29.
Chapter 15 Nonparametric Statistics
Statistical Methods II
Nonparametric or Distribution-free Tests
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
NONPARAMETRIC STATISTICS
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
What are Nonparametric Statistics? In all of the preceding chapters we have focused on testing and estimating parameters associated with distributions.
Previous Lecture: Categorical Data Methods. Nonparametric Methods This Lecture Judy Zhong Ph.D.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Lecture 9 TWO GROUP MEANS TESTS EPSY 640 Texas A&M University.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
NON-PARAMETRIC STATISTICS
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
Two-Sample-Means-1 Two Independent Populations (Chapter 6) Develop a confidence interval for the difference in means between two independent normal populations.
1 Nonparametric Statistical Techniques Chapter 18.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Virtual University of Pakistan
Nonparametric Statistics
Virtual University of Pakistan
NONPARAMETRIC STATISTICS
STATISTICAL INFERENCE
3. The X and Y samples are independent of one another.
Chapter 4. Inference about Process Quality
Math 4030 – 12a Correlation.
Lecture 16 Nonparametric Two-Sample Tests
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
Environmental Modeling Basic Testing Methods - Statistics
Nonparametric Tests (1)
SA3202 Statistical Methods for Social Sciences
Lecture 13 The Quantile Test
Lecture 11 Nonparametric Statistics Introduction
Nonparametric Statistics
Lecture 15 Wilcoxon Tests
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Signed-rank statistic
Lecture 7 The Odds/ Log Odds Ratios
Lecture Slides Elementary Statistics Twelfth Edition
What about ties?? There are two methods mentioned on p.155ff:
Lecture 5, Goodness of Fit Test
Lecture 12 Binomial Tests and Quantiles
marketing research with Spss
The Rank-Sum Test Section 15.2.
Lecture 10/24/ Tests of Significance
Confidence Intervals.
Lecture # 2 MATHEMATICAL STATISTICS
Nonparametric Statistics
Distribution-Free Procedures
pairing data values (before-after, method1 vs
STA 291 Spring 2008 Lecture 22 Dustin Lueker.
See Table and let’s do it in R…
For a permutation test, we have H0: F1(x) = F2(x) vs
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Presentation transcript:

Lecture 14 The Sign Test and the Rank Test Outline of Today The Sign Test The Rank Test 1/16/2019 SA3202, Lecture 14

The Problem Let X~ F (density f) Y~G (density g) We say that X and Y differ in location if X has the same distribution as Y shifted by an amount a X~ Y+theta or f(x)=g(x-theta) or F(x)=G(x-theta) Usually theta is unknown, and is referred to as the shift parameter or the location parameter. The Location Test Problem Given a sample (Xi, Yi), i=1,2, …,n, we wish to test H0: theta=0 (i.e. X~Y) against H1: theta ~=0 (i.e X~Y+theta) or H1: theta>0 or H1: theta<0 1/16/2019 SA3202, Lecture 14

The Sign Test The usual parametric test of this hypothesis is the paired t-test. A simple nonparametric test may be based on the sign of the difference D=X-Y By noting that if X~Y, then by symmetry, P(X>Y)=P(Y>X)=1/2 1/16/2019 SA3202, Lecture 14

Let p=P(D>0)=Pr(X>Y). Then under H0: X~Y, we have p= Let p=P(D>0)=Pr(X>Y). Then under H0: X~Y, we have p=.5 whereas under H1, we have p>.5 when theta>0 <.5 when theta <0 ~=.5 when theta~=0 Thus, we can use a binomial test, based on the sign of the difference Di=Xi-Yi, I=1,2, …,n The test statistic is M=#{i |Di>0}=# of the times that Xi>Yi 1/16/2019 SA3202, Lecture 14

Example Remark : Due to continuity, under H0, Pr(D=0)=Pr(X=Y)=0. That is, the probability of a tie is 0. In practice, however, ties do occur and the usual practice is to discard the ties and base the sign test on the remaining observations. In this case, M=#{i| Di>0}~Binom (n’, .5), n’ =# of no tie pairs Example The number of defective electrical fuse proceeding from each of two production lines, A and B, was recorded for a period of 10 days. The results are as follows: Day 1 2 3 4 5 6 7 8 9 10 A 172 165 206 184 174 142 190 169 161 200 B 201 179 159 192 177 170 182 179 169 210 1/16/2019 SA3202, Lecture 14

We wish to test the hypothesis that the two distributions of defectives produced by A and B are identical, against the alternative that either production line produces more defectives than the other one. n=10, p=.5 Binomial Table ======================================================================= U<=0 U<=1 U<=2 U<=3 U<=4 U<=5 U<=6 U<=7 U<=8 U<=9 U<=10 0.0009 0.0107 0.0547 0.1719 0.3770 0.6231 0.8281 0.9453 0.9893 0.9990 1.0000 ====================================================================== The test statistic is M=# of the times that A exceeds B. Under H0, M~Binom(10,.5). From the Binomial table for n=10, a test of approximate 10% is to reject H0 if Now the observed M is so we reject H0. That is, there is a significance difference between A and B. 1/16/2019 SA3202, Lecture 14

Remark Clearly, the sign test may be also used to test other hypotheses about the shift parameter. For example to test hypothesis H0: theta=theta0, we look at the signs of the differences Di=Xi-Yi-theta0. Note that if X~Y+theta, then the shift parameter theta is the median of difference D=X-Y. Therefore confidence intervals for theta may be obtained, based on the values of differences, as discussed earlier. 1/16/2019 SA3202, Lecture 14

Ranks Definition Given a sample X1, X2, …, Xn ~ F, continuous The rank of the i-th observation, Ri, is the order of the i-th observation when the observations are arranged in increasing order. For example, for the following sample of size 5, i 1 2 3 4 5 Xi 3.1 1.8 5.2 10.1 2.6 The order statistics are X(i) 1.8 2.6 3.1 5.2 10.1 The ranks are Ri 3 1 4 5 2 1/16/2019 SA3202, Lecture 14

The order statistics are X(i) 1.8 3.1 3.1 5.2 10.1 The Tie Rank If two or more observations are equal, referred to as tie., we assign the average rank of these observations to each one. For example, i 1 2 3 4 5 Xi 3.1 1.8 5.2 10.1 3.1 The order statistics are X(i) 1.8 3.1 3.1 5.2 10.1 The ranks are Ri 2.5 1 4 5 2.5=(2+3)/2 This procedure will be used to deal with ties throughout the course. Theoretically, however, the problem of ties can be ignored if a continuous distribution function is considered. 1/16/2019 SA3202, Lecture 14

Rank Distribution Note that R=(R1, R2, …,Rn) is a permutation of the numbers (1,2,…, n) (ignoring ties). By symmetry, all the possible permutations are equally likely: Pr( R = any given permutation of (1,2,…,n))= Also, by symmetry, for each k, we have Pr(Rk=r)= , k=1,2,…,n Pr(Rk=r, Rl=s)= , k, l=1,2, ….,n 1/16/2019 SA3202, Lecture 14

Use these results, we have E(Rk)= Var(Rk)= Cov(Rk, Rl)= 1/16/2019 SA3202, Lecture 14