Statistical Data Analysis 2011/2012 M. de Gunst Lecture 6.

Slides:



Advertisements
Similar presentations
Happiness comes not from material wealth but less desire. 1.
Advertisements

CHOOSING A STATISTICAL TEST © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Nonparametric tests I Back to basics. Lecture Outline What is a nonparametric test? Rank tests, distribution free tests and nonparametric tests Which.
Sampling: Final and Initial Sample Size Determination
Economics 105: Statistics Go over GH 11 & 12 GH 13 & 14 due Thursday.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
EPI 809 / Spring 2008 Chapter 9 Nonparametric Statistics.
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
Tests Jean-Yves Le Boudec. Contents 1.The Neyman Pearson framework 2.Likelihood Ratio Tests 3.ANOVA 4.Asymptotic Results 5.Other Tests 1.
Lesson #25 Nonparametric Tests for a Single Population.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Nonparametric Tests & Course Summary.
1 Distribution-free testing If the data are normally distributed, we may apply a z- test or t-test when the parameter of interest is . But what if this.
Tests Jean-Yves Le Boudec. Contents 1.The Neyman Pearson framework 2.Likelihood Ratio Tests 3.ANOVA 4.Asymptotic Results 5.Other Tests 1.
Chapter 11: Inference for Distributions
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 14: Non-parametric tests Marshall University Genomics.
Nonparametrics and goodness of fit Petter Mostad
Quiz 12  Nonparametric statistics. 1. Which condition is not required to perform a non- parametric test? a) random sampling of population b) data are.
Chapter 13: Inference in Regression
Chapter 14: Nonparametric Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
NONPARAMETRIC STATISTICS
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
CHAPTER 14: Nonparametric Methods
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Nonparametric Statistical Methods: Overview and Examples ETM 568 ISE 468 Spring 2015 Dr. Joan Burtner.
CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics seventh edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Nonparametric Hypothesis tests The approach to explore the small-sized sample and the unspecified population.
Confidence intervals and hypothesis testing Petter Mostad
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Uses both direction (sign) and magnitude. Applies to the case of symmetric continuous distributions: Mean equals median. Wilcoxon Signed-Rank Test.
Nonparametric Statistics
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Statistical Data Analysis 2010/2011 M. de Gunst Lecture 10.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 7.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 2.
Value Stream Management for Lean Healthcare ISE 491 Fall 2009 Data Analysis - Lecture 7.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Inference ConceptsSlide #1 1-sample Z-test H o :  =  o (where  o = specific value) Statistic: Test Statistic: Assume: –  is known – n is “large” (so.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 3.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 4.
ENGR 610 Applied Statistics Fall Week 7 Marshall University CITE Jack Smith.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 5.
What parameter is being tested? Categorical  proportionNumeric  mean.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Two-Sample-Means-1 Two Independent Populations (Chapter 6) Develop a confidence interval for the difference in means between two independent normal populations.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics fifth.
Nonparametric Statistics
Chapter 4. Inference about Process Quality
SA3202 Statistical Methods for Social Sciences
Some Nonparametric Methods
Lecture Slides Elementary Statistics Eleventh Edition
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Presentation transcript:

Statistical Data Analysis 2011/2012 M. de Gunst Lecture 6

Statistical Data Analysis 2 Statistical Data Analysis: Introduction Topics Summarizing data Exploring distributions Bootstrap Robust methods Nonparametric tests Analysis of categorical data Multiple linear regression

Statistical Data Analysis 3 Today’s topic: Nonparametric methods for one sample problems (Chapter 6: 6.1, 6.2) 6. Nonparametric methods 6.1. One sample: nonparametric tests for location Sign test (Wilcoxon) signed rank test 6.2. Aymptotic efficiency

Statistical Data Analysis 4 Nonparametric methods: Introduction Nonparametric tests No assumption of parametric family for underlying distribution of data For problems with large class of distributions belonging to H 0 Distribution of test statistic same under every distribution that belongs to H 0 Why these tests? Robust w.r.t. confidence level: conf level α for large class of distributions More efficient than tests with more assumptions when these assumptions do not hold: fewer observations necessary for same power (= onderscheidend vermogen)

Statistical Data Analysis One-sample problem Situation realizations of, independent, unknown distr. F Location of F? Which test do you know for this situation?

Statistical Data Analysis 6 One-sample problem: t-test t-test Assumptions: F ~ with unknown. composite which class? Test statistic: ~ t n-1 rejected if P (T ≤ t) ≤ ½ α or P (T ≥ t) ≤ ½ α, t observed value of T Class of all normal distributions with expectation m 0 If σ 2 would be known, say σ 2 =1, H 0 would contain only the N(m 0, 1) distribution and thus would be simple. In that case, no need for t-test, but use Gauss test!

Statistical Data Analysis One-sample problem: sign test sign test Assumptions: F has unique median m composite: large class of families which class? Test statistic: ~ binom(n,0.5) rejected if P (T ≤ t) ≤ ½ α or P (T ≥ t) ≤ ½ α, t observed value of T Same for all distributions under : nonparametric test If k observations = m 0 : delete them and use conditional test binom(n-k,0.5))

Statistical Data Analysis One-sample problem: (Wilcoxon) signed rank test (1) Signed rank test or rank sign test or symmetry test of Wilcoxon Assumptions: F continuous and symmetric with symmetry point m composite: large class of families which class? Test statistic: ~ R i rank of X i in ordered sequence of | | random permutations of 1, …, n random variables with Same for all distributions under : nonparametric test

Statistical Data Analysis 9 One-sample problem: (Wilcoxon) signed rank test (2) (Wilcoxon) signed rank test Test statistic: ~ Why? R i rank of X i in ordered sequence of | |, random permutations of 1, …, n random variables with

Statistical Data Analysis 10 One-sample problem: (Wilcoxon) signed rank test (3) (Wilcoxon) signed rank test Assumptions: F continuous and symmetric with symmetry point m Test statistic: ~ R i rank of X i in ordered sequence of | | n large: normal approximation Groups of equal values: adjust ranks R uses equivalent test

Statistical Data Analysis 11 One-sample problem: tests and confidence intervals Tests correspond to confidence intervals for location parameter of F What is the correspondence? Confidence intervals based on a test contain values of parameter for which H 0 is not rejected if they would have been in H 0 In R: for t-test and (Wilcoxon) signed rank test: confidence interval also in output for sign test: compute separately

Statistical Data Analysis 12 Illustration of one sample tests with R (1) Example Data of Example 6.1

Statistical Data Analysis 13 Illustration of one sample tests with R (2) Example Data of Example 6.1

Statistical Data Analysis 14 Illustration of one sample tests with R (3) Example Data of Example 6.1

Statistical Data Analysis 15 Illustration of one sample tests with R (4) Data of Example 6.1 Example

Statistical Data Analysis 16 Illustration of one sample tests with R (5) Data of Example 6.1 Example

Statistical Data Analysis 17 Illustration of one sample tests with R (6) Data of Example 6.1 Example

Statistical Data Analysis 18 Illustration of one sample tests with R (7) Data of Example 6.1 Example

Statistical Data Analysis Asymptotic efficiency On blackboard

Statistical Data Analysis 20 Illustration of investigating efficiency with R (1) Example

Statistical Data Analysis 21 Illustration of investigating efficiency with R (2) Example

Statistical Data Analysis 22 Illustration of investigating efficiency with R (3) Example

Statistical Data Analysis 23 Illustration of investigating efficiency with R (4) Results last sheet: Example

Statistical Data Analysis 24 Recap 6. Nonparametric methods 6.1. One sample: nonparametric tests for location Sign test (Wilcoxon) signed rank test 6.2. Aymptotic efficiency

Statistical Data Analysis 25 Nonparametric methods for one sample problems The end