Lecture 11 Nonparametric Statistics Introduction

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Introduction to Nonparametric Statistics
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
EPI 809 / Spring 2008 Chapter 9 Nonparametric Statistics.
Chapter 9 Hypothesis Testing Testing Hypothesis about µ, when the s.t of population is known.
Statistics 07 Nonparametric Hypothesis Testing. Parametric testing such as Z test, t test and F test is suitable for the test of range variables or ratio.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 15 Nonparametric Statistics
Nonparametric or Distribution-free Tests
Chapter 8 Introduction to Hypothesis Testing
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
14 Elements of Nonparametric Statistics
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Chapter 8 McGrew Elements of Inferential Statistics Dave Muenkel Geog 3000.
CHAPTER 14: Nonparametric Methods
Chapter 14 Nonparametric Statistics. 2 Introduction: Distribution-Free Tests Distribution-free tests – statistical tests that don’t rely on assumptions.
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
What are Nonparametric Statistics? In all of the preceding chapters we have focused on testing and estimating parameters associated with distributions.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
1 Nonparametric Statistical Techniques Chapter 17.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
Kruskal-Wallis H TestThe Kruskal-Wallis H Test is a nonparametric procedure that can be used to compare more than two populations in a completely randomized.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
NONPARAMETRIC STATISTICS In general, a statistical technique is categorized as NPS if it has at least one of the following characteristics: 1. The method.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Lesson Test to See if Samples Come From Same Population.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
1 Nonparametric Statistical Techniques Chapter 18.
Nonparametric Statistics
Sampling and Sampling Distributions
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
NONPARAMETRIC STATISTICS
Lecture Nine - Twelve Tests of Significance.
STATISTICAL INFERENCE
Part Four ANALYSIS AND PRESENTATION OF DATA
One-Sample Inference for Proportions
3. The X and Y samples are independent of one another.
P-values.
Chapter 9 Hypothesis Testing
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Lecture 6 Comparing Proportions (II)
CORRELATION ANALYSIS.
Lecture 16 Nonparametric Two-Sample Tests
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
Environmental Modeling Basic Testing Methods - Statistics
Chapter 9 Hypothesis Testing.
Lecture 13 The Quantile Test
Hypothesis Theory examples.
Nonparametric Statistics Overview
Lecture 15 Wilcoxon Tests
Lecture 18 The Run Test Outline of Today The Definition
十二、Nonparametric Methods (Chapter 12)
Some Nonparametric Methods
Lecture Slides Elementary Statistics Eleventh Edition
Discrete Event Simulation - 4
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Lecture 12 Binomial Tests and Quantiles
Tutorial 7 Consider the example discussed in the lecture, concerning the comparison of two teaching methods A and B, and let W denote the sum of the.
Lecture 14 The Sign Test and the Rank Test
One-Way Analysis of Variance
Test to See if Samples Come From Same Population
The Rank-Sum Test Section 15.2.
NONPARAMETRIC METHODS
Nonparametric Statistics
Distribution-Free Procedures
Exposing Students to Statistical Methods Based on Counts and Ranks
Presentation transcript:

Lecture 11 Nonparametric Statistics Introduction Outline of Today Rank Sum Test Order Statistics 11/20/2018 SA3202, Lecture 11

Introduction Problem of Interest Classical methods of statistical inference rely heavily on detailed distributional assumptions about the underlying population The validity of such procedures may be seriously affected when these assumptions are not satisfied. Nonparametric/Distribution-free Methods To overcome the above problem, methods of statistical analysis have been developed which do not require detailed assumptions about the underlying distribution of the data Distribution-free the methods do not require exact form of the population distribution Nonparametric the analysis is not in terms of the parameters of the underlying distribution. 11/20/2018 SA3202, Lecture 11

Example Suppose we want to compare the effectiveness of two teaching methods A and B. Suppose Use Method A, n1=3 children taught Method B, n2=4 children taught Their performances are assessed after a period of time. The null hypothesis is that H0: The performances of the children are not dependent on the teaching methods That is, essentially, there is no difference between these two methods. A parametric testing procedure: Two-sample t-test A nonparametric testing procedure: Wilcoxon-Mann-Whitney Test 11/20/2018 SA3202, Lecture 11

The Wilcoxon-Mann-Whitney Rank Sum Test Procedure We rank the performances of all the 7 children as 1 (the best) 2 3 …….. 7 (the worst) and record the ranks of the children in Group A. 11/20/2018 SA3202, Lecture 11

Key Idea Under H0, the ranks of the children in Group A are simply a random sample from the numbers 1,2,…7. Thus the rank sum statistic W=Sum of ranks of the children in Group A can be used to test H0 since when Method A is better than B, W will be “too small” (the smallest value is ) when Method A is worse than B, W will be “too large”. (the largest value is ) Thus Reject H0 when W is “too small” or “too large” To conduct the test, we need the null distribution of W. 11/20/2018 SA3202, Lecture 11

The Null Distribution of the Rank Sum Test Basic Fact Under H0, the ranks of the children in Group A form a random sample of size 3 from 1,2,…7 (without replacement). There are 35 such samples, all equally likely. The Sampling Distribution: W Ranks of Children in Group A Probability 6 (1,2,3) 1/35=.0286 7 (1,2,4) 1/35 8 (1,2,5), (1,3,4) 2/35 9 10 11 12 13 14 15 16 17 (4,6,7) 1/35 18 (5,6,7) 1/35=.0286 11/20/2018 SA3202, Lecture 11

Based on this table, we can conduct the Rank Sum Test Based on this table, we can conduct the Rank Sum Test. For example, for the 5% significance test, we use the critical region W<=6 or W>=18. The exact size of the test is .0286+.0286=.0572. 11/20/2018 SA3202, Lecture 11

Remarks The rank sum test is a typical of a wide class of distribution free tests. Some features to note are Simplicity of ideas and assumptions: only classical probability theory, based on the notion of equal-likelihood has been used. Simplicity of Calculations: the calculations needed to obtain the null distribution of the test statistic are very simple in nature. (For large sample sizes, calculation is done using a computer) Also tables of critical values are available for many tests. Applicability to rank data. Note that in the above example, all we needed was the ranks of the children. This is a useful feature of many distribution-free tests, that can be applied to ordinal (rank) data. Drawback: low power ( the capability to detect the violation of the null hypothesis is low) 11/20/2018 SA3202, Lecture 11

Order Statistics Definition Let X1, X2, … Xn be a sample from a continuous distribution with cdf F and pdf f. The order statistics X(1), X(2), …., X(n) are defined as follows: X(1)= the smallest of X1, X2, … Xn X(2)=the second smallest of X1, X2, … Xn ………………………………….. X(n) = the largest of X1, X2, …. Xn For example, for a sample of size 4 x1=9, x2=3, x3=1, x4=5 The order statistics are 11/20/2018 SA3202, Lecture 11

Distribution of the Order Statistics Let Gr(x) and gr(x) denote the cdf and pdf of X(r ) respectively. Then Gn(x)=Pr(X(n)<x) G1(x)=Pr(X(1)<x) 11/20/2018 SA3202, Lecture 11

General Cases Note that we have the following important relationship: X(r ) <=x equivalent to #{Xi| Xi<=x}>= r That is, X(r )<=x if and only if at least r observations are less than or equal to x. Thus Gr(x)=Pr(X(r )<=x)=Pr(U>= r), U=# {Xi | Xi<= x} 11/20/2018 SA3202, Lecture 11

Now, regarding the event “ the observation is less than or equal to x” as “Success”, it follows that U has a Binomial distribution with success probability p=Pr(Xi<=x)=F(x) Thus Gr(x)=Pr(U>=r)= 11/20/2018 SA3202, Lecture 11

The pdf gr(x) of X(r ) is then obtained by differentiating the cdf Gr(x): 11/20/2018 SA3202, Lecture 11