Lecture 13 The Quantile Test

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

Chapter 16 Introduction to Nonparametric Statistics
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
5-3 Inference on the Means of Two Populations, Variances Unknown
7.1 Lecture 10/29.
Chapter 15 Nonparametric Statistics
Nonparametric or Distribution-free Tests
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Inference for a Single Population Proportion (p).
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
NON-PARAMETRIC STATISTICS
13 Nonparametric Methods Introduction So far the underlying probability distribution functions (pdf) are assumed to be known, such as SND, t-distribution,
Chapter 6 Normal Approximation to Binomial Lecture 4 Section: 6.6.
Virtual University of Pakistan
Inference for a Single Population Proportion (p)
Virtual University of Pakistan
Making inferences from collected data involve two possible tasks:
NONPARAMETRIC STATISTICS
Two-Sample Hypothesis Testing
STATISTICAL INFERENCE
3. The X and Y samples are independent of one another.
Chapter 4. Inference about Process Quality
STA 291 Spring 2010 Lecture 21 Dustin Lueker.
Hypothesis Tests for 1-Sample Proportion
Unit 7 Today we will look at: Normal distributions
Lecture 6 Comparing Proportions (II)
Hypothesis Testing: Hypotheses
Lecture 16 Nonparametric Two-Sample Tests
Lecture 17 Rank Correlation Coefficient
SA3202 Statistical Methods for Social Sciences
Chapter 9 Hypothesis Testing.
Lecture 11 Nonparametric Statistics Introduction
Lecture 15 Wilcoxon Tests
CONCEPTS OF ESTIMATION
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Lecture 18 The Run Test Outline of Today The Definition
Lecture 7 The Odds/ Log Odds Ratios
Problems: Q&A chapter 6, problems Chapter 6:
Lecture Slides Elementary Statistics Twelfth Edition
What about ties?? There are two methods mentioned on p.155ff:
Lecture 5, Goodness of Fit Test
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Lecture 12 Binomial Tests and Quantiles
Lecture 14 The Sign Test and the Rank Test
Tutorial 9 Suppose that a random sample of size 10 is drawn from a normal distribution with mean 10 and variance 4. Find the following probabilities:
The Rank-Sum Test Section 15.2.
NONPARAMETRIC METHODS
Inference on the Mean of a Population -Variance Known
Nonparametric Statistics
Last Update 12th May 2011 SESSION 41 & 42 Hypothesis Testing.
Solution U ~Binom(15,.75) 90% CI for Q3=[X(8), X(14)] with exact coverage (90.25%) (based on the data) which is [63.3, 73.3). U~Binom(20,
Tests of inference about 2 population means
Chapter 18 The Binomial Test
Distribution-Free Procedures
STA 291 Spring 2008 Lecture 22 Dustin Lueker.
See Table and let’s do it in R…
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Chapter 5: Sampling Distributions
Presentation transcript:

Lecture 13 The Quantile Test Outline of Today The Quantile Test CI for Quantiles The Quantile Test for Large Samples The Sign Test for Median 11/19/2018 SA3202, Lecture 13

The Quantile Test Definition The quantile test is a test about whether a quantile is equal to, or smaller, or larger than a given value given a random sample from a continuous but unknown distribution function. 11/19/2018 SA3202, Lecture 13

Testing Procedure The quantile test can be converted to hypothesis test about a binomial parameter, and therefore, can be tested using a binomial test. Let U be the number of observations which are less than or equal to a, U=#{Xi | Xi<= a} Then considering the event “ the observation is less than or equal to a” as “ Success”, it follows that U~Binom(n, p), p= Now, clearly, we have the following relationship: 11/19/2018 SA3202, Lecture 13

Remark Note that the apparent reversal in the direction of the equality. This follows from the fact that Thus, the hypotheses about the quantile are equivalent to hypotheses about the parameter of a binomial distribution U~Binom(n,p), and may be tested in a usual manner. For example, testing ” H0: xp0=a against H1: xp0<a “ is equivalent to testing “H0: p=p0 against H1: p>p0 “ Thus, H0 is rejected when U is too large. 11/19/2018 SA3202, Lecture 13

Example Assume we have the following sample: 34 12 19 67 54 18 88 36 17 58 66 48 33 75 17 73 67 22 61 23 And consider testing H0: =45 (the 30th percentile is 45) against H1: <45 (the 30th percentile is less than 45) The H0 is rejected if U=# {Xi| Xi<=45} is too large. Under H0, U~Binom(20, .3). n=20, p=.3 ============================================================================== Event U<= 0 U<=1 U<=2 U<=3 U<=4 U<=5 U<=6 U<=7 U<=8 U<=9 U<=10 Prob. .0008 .0076 .0355 .1071 .2375 .4164 .6080 .7723 .8867 .9520 .9828 Event U<=11 U<=12 U<=13 U<=14 U<=15 U<=16 U<=17 U<=18 U<=19 U<=20 Prob. .9949 .9987 .9997 .9999 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 ============================================================================= 11/19/2018 SA3202, Lecture 13

From the Binomial table, we have P(U<=9)=.952 So for a 5% level test, we reject H0 if U>=10. Since the observed U=10, we reject H0. Remark Theoretically, in the quantile test, we can ignore the possibility that an observation exactly equals a (usually called a “tie”), because for a continuous distribution this event has a zero probability. In practice, however, “ties” do occur, and the usual practice is to discard these observations and to base the test on the remaining observations. 11/19/2018 SA3202, Lecture 13

CI for Quantiles Feature Nonparametric CI for the p-th quantile is obtained using order statistics of the sample as confidence limits. Procedure The procedure is to find r and s such that P(X(r )<=xp<X(s)) is equal to the nominal confidence coefficient . Let U=#{Xi | Xi<=xp}. Then Pr(X(r )<=xp< X(s) )=Pr( r<=U<s). 11/19/2018 SA3202, Lecture 13

Remarks Remark 1 By the continuity of F, the events X(r )=xp and X(s)=xp have zero probabilities. Therefore Pr(X(r )<=xp<=X(s))=Pr(X(r )<=xp<X(s)) =Pr(r<=U<s) Thus, we may use the following CI for xp: X(r )<=xp <= X(s) With a given confidence coefficient. 11/19/2018 SA3202, Lecture 13

Remark 2 Since tables of the binomial usually give cumulative probabilities of the form Pr(U<=r) or Pr(U>=r), we use the following formulas: Pr(X(r )<=xp<=X(s))=Pr(X(r )<=xp<X(s)) (by continuity) =Pr(r<=U<s) =Pr(U<=s-1)-Pr(U<=r-1) ={1-Pr(U>s-1)}-{1-Pr(U>r-1)} =Pr(U>=r)-Pr(U>=s) 11/19/2018 SA3202, Lecture 13

Example To obtain a CI for the median, on the basis of a sample of size n=10, note that from the tables of the binomial distribution Binom(10,.5), we have Event U=0 U<=1 U<=2 U<=3 U<=4 U<=5 U<=6 U<=7 U<=8 U<=9 Probability .001 .011 .055 .172 .377 .623 .828 .945 .989 .999 Therefore Pr(X(1)<=x <=X(10))=Pr(1<=U<10)=Pr(U<=9)-Pr(U<=0)=.999-.001=.9980 Pr(1<=U<9)=Pr(U<=8)-Pr(U<=0)=.989-.001=.988 Pr(3<=U<8)=Pr(U<=7)-Pr(U<=2)=.945-.055=.89 It follows that a nominal 99% CI for the median is X(1)<x <X(9) a nominal 90% CI for the median is X(3)<x <X(8). 11/19/2018 SA3202, Lecture 13

The Quantile Test for Large Samples For large n, we can use the normal approximation to the Binomial distribution to find r and s. The key idea here is that U ~AN(np, np(1-p)) . Use a continuity correction, we have Pr(r<=U<s)=Pr(r-.5<=U<=s-.5) Then the approximate CI is r=np+.5-table* (np(1-p))^(1/2) s=np+.5-table*(np(1-p))^(1/2) 11/19/2018 SA3202, Lecture 13

Example For n=100, the CI limits for a 95% CI for the median is Thus, the approximation 95% CI for the median is [X(41), X(60)] 11/19/2018 SA3202, Lecture 13

The Sign Test for the Median Recall that to test H0: =a (the median is a), we may use the statistic U which is “the number of observations which are less or equal to a”. Note that this statistic is simply the number of non-positive terms in the sequence of the following differences X1-a, X2-a, …, Xn-a Let M=# of the positive terms among all nonzero terms Then M~Binom(n’, .5) , where n’ is the number of nonzero differences. The M here is known as a sign statistic. The test based on M is called a sign test. M is usually less than U. They are equal when there are no observations equal to a. 11/19/2018 SA3202, Lecture 13