1/23 Ch10 Nonparametric Tests. 2/23 Outline Introduction The sign test Rank-sum tests Tests of randomness The Kolmogorov-Smirnov and Anderson- Darling.

Slides:



Advertisements
Similar presentations
Ch8 Inference concerning variance
Advertisements

COMPLETE BUSINESS STATISTICS
Chapter 16 Introduction to Nonparametric Statistics
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
statistics NONPARAMETRIC TEST
© 2010 Pearson Prentice Hall. All rights reserved The Chi-Square Goodness-of-Fit Test.
Chapter 14 Analysis of Categorical Data
Chapter 12 Chi-Square Tests and Nonparametric Tests
Bivariate Statistics GTECH 201 Lecture 17. Overview of Today’s Topic Two-Sample Difference of Means Test Matched Pairs (Dependent Sample) Tests Chi-Square.
Experimental Evaluation
Inferences About Process Quality
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
The Kruskal-Wallis Test The Kruskal-Wallis test is a nonparametric test that can be used to determine whether three or more independent samples were.
5-3 Inference on the Means of Two Populations, Variances Unknown
Nonparametrics and goodness of fit Petter Mostad
Chapter 15 Nonparametric Statistics
Nonparametric or Distribution-free Tests
Statistical Inference for Two Samples
Chi-Square Test Dr Kishor Bhanushali. Chi-Square Test Chi-square, symbolically written as χ2 (Pronounced as Ki-square), is a statistical measure used.
AM Recitation 2/10/11.
Marketing Research, 2 nd Edition Alan T. Shao Copyright © 2002 by South-Western PPT-1 CHAPTER 17 BIVARIATE STATISTICS: NONPARAMETRIC TESTS.
11 Chapter Nonparametric Tests © 2012 Pearson Education, Inc.
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 14: Nonparametric Statistics
NONPARAMETRIC STATISTICS
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
NONPARAMETRIC STATISTICS
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
Wilcoxon rank sum test (or the Mann-Whitney U test) In statistics, the Mann-Whitney U test (also called the Mann-Whitney-Wilcoxon (MWW), Wilcoxon rank-sum.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
1 Nonparametric Statistical Techniques Chapter 17.
Nonparametric Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
Kruskal-Wallis H TestThe Kruskal-Wallis H Test is a nonparametric procedure that can be used to compare more than two populations in a completely randomized.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
Chapter 14: Nonparametric Statistics
NON-PARAMETRIC STATISTICS
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
Nonparametric Statistics
© Copyright McGraw-Hill 2004
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
Lesson Test to See if Samples Come From Same Population.
 Kolmogor-Smirnov test  Mann-Whitney U test  Wilcoxon test  Kruskal-Wallis  Friedman test  Cochran Q test.
1 Nonparametric Statistical Techniques Chapter 18.
Non-parametric Tests Research II MSW PT Class 8. Key Terms Power of a test refers to the probability of rejecting a false null hypothesis (or detect a.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
CONCEPTS OF HYPOTHESIS TESTING
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Some Nonparametric Methods
The Rank-Sum Test Section 15.2.
Nonparametric Statistics
Presentation transcript:

1/23 Ch10 Nonparametric Tests

2/23 Outline Introduction The sign test Rank-sum tests Tests of randomness The Kolmogorov-Smirnov and Anderson- Darling Tests

3/23 Introduction Previous methods of inference that we have studied are based on the assumption that the observation come from normal population. However, since there are many situation where it is doubtful whether the assumption of normality can be met. Alternative techniques based on less stringent assumptions – nonparametric tests.

4/ The sign test The sign test: when we sample a continuous symmetrical population, the probability of getting a sample value less than the mean and the probability of getting a sample value greater than the mean are both ½. We can formulate the hypotheses in terms of the population median.

5/23 Conducting a sign test The following data constitute a random sample of 15 measurements of the octane rating a certain kind of gasoline: Test the null hypothesis against the alternative hypothesis at the 0.01 level of significance.

6/23 Solution 1. Null hypothesis: Alternative hypothesis 2. Level of significance: Criterion: based on the number of plus signs or the number of minus signs. Using the number of plus signs, denoted by x, reject the null hypothesis if the probability of getting x or more plus is less than or equal to 0.01.

7/23 Solution 5. Since is less than 0.01, the null hypothesis must be rejected. We conclude tha the median exceeds Calculation: replacing each value greater than 98.0 with a plus sign and each value less than 98.0 with a minus sign, the 14 sample values yield Thus x=12, and from the binomial distribution of n=14, p=0.5, we get

8/ Rank-sum Tests Rank sums: the U test and the H test. The U test will be presented as a nonparametric alternative to the two- sample t test. The H test will be presented as a nonparametric alternative to the one-way analysis of variance.

9/23 The U test The U test (also called Wilcoxon test or the MannWhitney test) Suppose in a study of sedimentary rocks, the following diameters (in millimeters) were obtained fro two kinds of sand: sand I: sand II: The problem is to decide whether the two populations are the same of if one is more likely to produce larger observations than the other.

10/23 Let X 1 be a random variable having the first distribution and X 2 be a random variable having the second distribution. If for all a, with strict inequality for some a, we say that the second population is stochastically larger than the first population. The U Test: ranking the data jointly, as if they comprise one sample, in an increasing order of magnitude, and for our data we get I I I I II I I II I I I I II I I II II II I I II II II II II II II II I

11/23 The values of the first sample (Sand I): 1, 2, 3, 4, 6, 7, 9, 10,11,12,14,15,19,20, and 29. If there were tie among values, we would assign to each of the tied observations the mean of the ranks which they jointly occupy. For instance, the third and the fourth are identical, we would assign each the rank (3+4)/2 = 3.5. The sums of the ranks are Statistics :

12/23 Under the null hypothesis that the two samples come from identical populations, it can be shown that the mean and the variance of the sampling distribution of U 1 are If there are ties in rank, these formulas provide only approximations, but if the number of ties is small, these approximations will generally be good. Is a random variable having approximately the standard normal distribution.

13/23 Solution of the example 1.Null hypothesis: Populations are identical. Alternative hypothesis: The populations are not identical. 2. Level of significance: Criterion: Reject the null hypothesis if Z Calculations: since n 1 =15 and n 2 =14, we have 5. The null hypothesis must be rejected. There is a difference in the populations of grain size.

14/23 The H test (Kruskal-Wallis test) The H test is a generalization of the U test in that it enables us to test the null hypothesis that k independent random samples come from identical populations. If R i is the sum of the ranks occupied by the n i observations of the i-th sample, and the test is based on the statistic When and the null hypothesis is true, the sampling distribution of the H statistic is well approximated by the chi-square distribution with k-1 degrees of freedom.

15/23 EX. An experiment designed to compare three preventive methods against corrosion yielded the following maximum depths of pits in pieces of wire subjected to the respective treatments: Method A: Method B: Method C: Use the 0.05 level of significance to test the null hypothesis that the three samples come from the identical populations.

16/23 Solution of the example 1.Null hypothesis: Populations are identical. Alternative hypothesis: The populations are not identical. 2. Level of significance: Criterion: Reject the null hypothesis if H> Calculations: 5. The null hypothesis must be rejected. 6. The P-value is = < 0.05

17/ Tests of Randomness Remind that in Ch6, we have discussed some assurance that a sample taken will be random. Provide a technique for testing whether a sample may be looked upon as random after it has actually been obtained. It is based on the number of runs exhibited in the sample results. EX. 8 runs TT HH TT HHH T HHH TTTT HHH

18/23 If a sequence contains n 1 symbols of one kind and n 2 of another kind (and neither n 1 nor n 2 is less than 10), the sampling distribution of the total number of runs, u, can be approximated closely by a normal distribution with Thus the test of the null hypothesis is that the arrangement of the symbols is random can be based on the statistic which has approximately the standard normal distribution.

19/23 EX The following is the arrangement of defective, d, and nondefective, n, pieces produced in the given order by a certain machine: nnnnn dddd nnnnnnnnnn dd nn dddd Test for randomness at the 0.01 level of significance.

20/23 Solution of the example 1.Null hypothesis: Arrangement is random. Alternative hypothesis: Arrangement is not random. 2. Level of significance: Criterion: Reject the null hypothesis if Z Calculations: since n 1 =10, n 2 =17, and u=6, we have 5. The null hypothesis must be rejected. The arrange is not random.

21/ The Kolmogorov-Smirnov and Anderson-Darling Tests The Kolmogorov-Smirnov tests are nonparametric tests for differences between cumulative distributions. The Kolmogorov-Smirnov one-sample test is generally more efficient than the chi-square tests for goodness of fit for small samples, and it can be used for very small samples where the chi-square test does not apply.

22/23 Solution of the example 1.Null hypothesis: Alternative hypothesis: are not uniformly distributed. 2. Level of significance: Criterion: Reject the null hypothesis if D>0.410, where D is the maximum difference between the empirical cumulative distribution and the cumulative distribution assumed under the null hypothesis. 4. Calculations: The difference is greatest at x= The null hypothesis cannot be rejected.

23/23 Anderson-Darling test Difference in the tails can be easier to detect if the difference between the empirical cumulative distribution F n and F is divided by. In particular it is based on the statistic or where