Session 8: Paired Samples (Zar, Chapters 9 and 24)

General: One population of subjects, x1, x2, …, xn, but a pair of data points on each.
Examples:
Before and after treatment
Left and right
Evaluator 1 and Evaluator 2 on the same subject
Method 1 vs. Method 2

Paired t-test:

If Two-Sided Test: One-Sided Tests:
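For reference, the standard paired-sample t statistic and its rejection rules can be written as below. The symbols d_i, \bar{d}, s_d and \nu are assumed notation for this sketch, not taken from the slides, and the critical values t_{\alpha(2),\nu} and t_{\alpha(1),\nu} are the usual two-tailed and one-tailed values from a t table.

```latex
\begin{aligned}
d_i &= x_{1i} - x_{2i}, \qquad
\bar{d} = \frac{1}{n}\sum_{i=1}^{n} d_i, \qquad
s_d^2 = \frac{\sum_{i=1}^{n} (d_i - \bar{d})^2}{n-1}, \\[4pt]
t &= \frac{\bar{d}}{s_d / \sqrt{n}}, \qquad \nu = n - 1. \\[6pt]
\text{Two-sided } (H_A\colon \mu_d \neq 0)&:\ \text{reject } H_0 \text{ if } |t| \ge t_{\alpha(2),\nu}. \\
\text{One-sided } (H_A\colon \mu_d > 0)&:\ \text{reject } H_0 \text{ if } t \ge t_{\alpha(1),\nu}. \\
\text{One-sided } (H_A\colon \mu_d < 0)&:\ \text{reject } H_0 \text{ if } t \le -t_{\alpha(1),\nu}.
\end{aligned}
```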

Example 9.1: H0: Hindleg length = Foreleg length. HA: Hindleg length ≠ Foreleg length.
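A minimal sketch of the paired t computation, using only the Python standard library. The function name `paired_t` and the measurements are hypothetical and chosen for illustration; they are not the data of Example 9.1. The p-value is left to a t table, since the standard library has no t distribution.

```python
import math
import statistics


def paired_t(x1, x2):
    """Return (t, df) for the paired-sample t statistic on d = x1 - x2."""
    if len(x1) != len(x2):
        raise ValueError("paired samples must have the same length")
    d = [a - b for a, b in zip(x1, x2)]   # one difference per subject
    n = len(d)
    d_bar = statistics.mean(d)            # mean difference
    s_d = statistics.stdev(d)             # sample SD of the differences (n - 1)
    se = s_d / math.sqrt(n)               # standard error of the mean difference
    return d_bar / se, n - 1


# Hypothetical paired measurements on 10 subjects (illustration only).
x1 = [24, 25, 17, 24, 21, 25, 27, 23, 26, 21]
x2 = [20, 21, 20, 19, 22, 20, 21, 18, 20, 19]

t, df = paired_t(x1, x2)
print(f"t = {t:.3f} with {df} d.f.")
```

The resulting t is then compared with the two-tailed or one-tailed critical value for n - 1 degrees of freedom.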

Sign Test (Zar, Section 24.7):
H0: same number of increases as decreases
HA: increases ≠ decreases
HA1: increases < decreases
HA2: increases > decreases
Form S+ = number of positive signs and S- = number of negative signs.

For H0: increases = decreases vs. HA: increases ≠ decreases:
If Min{S+, S-} ≤ B.27[α(2), n*], reject H0.
Note: n* = # pos + # neg = S+ + S-. Do not include zeros!
The sign test is the same as the binomial test of H0: P+ = 0.5 vs. HA: P+ ≠ 0.5, where P+ = true proportion of positives.

One-Sided Tests: If the relevant S (S+ for HA1, S- for HA2) ≤ B.27[α(1), n*], reject H0.
Note: n* = # pos + # neg = S+ + S-. Do not include zeros! Some statisticians would include zeros in a one-sided test, as the zeros represent non-support for the alternative.

Example: The sign test for the paired-sample data of Examples 9.1 and 9.3

H0: No difference between hindleg and foreleg length. HA: Difference between hindleg and foreleg length.
n* = 10; S+ = 8; S- = 2; B.27[α(2), n*] = B.27[0.05(2), 10] = 1.
Since Min{S+, S-} = 2 > 1, do not reject H0.
Using Table B.26b for n = 10 and p = 0.5: since the probability is greater than 0.05, do not reject H0.
1-Sided Test:
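The table lookup above can be checked with an exact binomial calculation. This is a minimal sketch using the standard library; the function name `sign_test_two_sided` is hypothetical, and the inputs are the counts S+ = 8 and S- = 2 from the example.

```python
from math import comb


def sign_test_two_sided(s_plus, s_minus):
    """Exact two-sided sign test p-value; zeros are assumed already dropped."""
    n_star = s_plus + s_minus            # n* = number of non-zero differences
    k = min(s_plus, s_minus)             # the smaller count is the test statistic
    # P(X <= k) for X ~ Binomial(n*, 0.5), doubled for the two-sided test.
    tail = sum(comb(n_star, i) for i in range(k + 1)) / 2 ** n_star
    return min(1.0, 2 * tail)


p = sign_test_two_sided(8, 2)
print(f"two-sided p = {p:.4f}")          # about 0.109, greater than 0.05
```

The p-value exceeds 0.05, which agrees with the decision not to reject H0.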

Wilcoxon Signed-Rank Test:
H0: Ranks of decreases = Ranks of increases
HA: decreases ≠ increases
HA1: decreases < increases
HA2: decreases > increases

Rank the data (the di's) without regard to sign, from smallest to largest, handling ties as in the Mann-Whitney test. Form T+ = sum of the + ranks and T- = sum of the - ranks.
If n* ≤ 100, use Table B.12 (App 101), where n* = number of non-zero differences (n* = n+ + n-).
For HA: decreases ≠ increases: if Min{T+, T-} ≤ B.12[α(2), n*], reject H0.

Example 9.3: The Wilcoxon paired-sample test applied to the data of Example 9.1. H0: Deer hindleg length is the same as foreleg length. HA: Deer hindleg length is not the same as foreleg length.

n = 10; T+ = 51; T- = 4; Min{4, 51} = 4.
From Table B.12: T0.05(2),10 = 8. Since T- < T0.05(2),10, H0 is rejected. 0.01 < P(T- or T+ ≤ 4) < 0.02.
H0: ranks + = ranks -. HA: ranks + ≠ ranks -. If Min{T+, T-} ≤ Table B.12[α(2), n*] = Tα(2),n*, reject H0.
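The ranking procedure can be sketched as follows. The function name `signed_rank_sums` is hypothetical, and the differences are illustrative values chosen so that they reproduce the reported sums (T+ = 51, T- = 4); they are an assumption and are not taken from the slides.

```python
def signed_rank_sums(diffs):
    """Return (T_plus, T_minus, n_star) for the Wilcoxon signed-rank test."""
    nonzero = [d for d in diffs if d != 0]      # zeros are dropped
    ordered = sorted(nonzero, key=abs)          # rank by absolute value
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        # Find the run of tied absolute values starting at position i.
        j = i
        while j + 1 < len(ordered) and abs(ordered[j + 1]) == abs(ordered[i]):
            j += 1
        average_rank = (i + 1 + j + 1) / 2      # mean of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    t_plus = sum(r for d, r in zip(ordered, ranks) if d > 0)
    t_minus = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return t_plus, t_minus, len(ordered)


# Illustrative differences (assumption, not source data).
d = [4, 4, -3, 5, -1, 5, 6, 5, 6, 2]
t_plus, t_minus, n_star = signed_rank_sums(d)
print(t_plus, t_minus, n_star)
```

Min{T+, T-} is then compared with the critical value from Table B.12 for n*.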

Note: T- = n*(n*+1)/2 - T+ and T+ = n*(n*+1)/2 - T-.
One-Sided Tests: If we use x1 - x2 = d: for one-tailed testing we use one-tailed critical values from Table B.12 and either T+ or T- as follows.

For the hypotheses H0: Measurements in reading 1 ≤ measurements in reading 2 and HA2: Measurements in reading 1 > measurements in reading 2 (a decrease from reading 1 to reading 2), reject H0 if T- ≤ Tα(1),n*.
For the opposite hypotheses, H0: Measurements in reading 1 ≥ measurements in reading 2 and HA1: Measurements in reading 1 < measurements in reading 2 (an increase from reading 1 to reading 2), reject H0 if T+ ≤ Tα(1),n*.

If we use d = x2 - x1, the roles of T+ and T- are reversed.
Normal Approximation: No Ties:

diff = x1 - x2. For HA: 1 ≠ 2, use either T- or T+ for T. If diff = x2 - x1, reverse the sides. If Z > K[α(sides)], reject H0 (Table B.2).

For Ties: For Zero adjustment: (m = #Zeros)
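For reference, the usual textbook forms of the normal approximation are sketched below. The notation is assumed: T is either T+ or T-, t_i is the number of differences tied at the i-th tied absolute value, and in the zero-adjusted form n counts all differences including the m zeros (zeros are ranked and then dropped). The exact form used on the slides may differ.

```latex
\begin{aligned}
\text{No ties:}\quad
\mu_T &= \frac{n^*(n^*+1)}{4}, \qquad
\sigma_T = \sqrt{\frac{n^*(n^*+1)(2n^*+1)}{24}}, \qquad
Z = \frac{|T - \mu_T|}{\sigma_T}. \\[6pt]
\text{Ties:}\quad
\sigma_T &= \sqrt{\frac{n^*(n^*+1)(2n^*+1) - \tfrac{1}{2}\sum_i (t_i^3 - t_i)}{24}}. \\[6pt]
\text{Zero adjustment:}\quad
\mu_T &= \frac{n(n+1) - m(m+1)}{4}, \\
\sigma_T &= \sqrt{\frac{n(n+1)(2n+1) - m(m+1)(2m+1) - \tfrac{1}{2}\sum_i (t_i^3 - t_i)}{24}}.
\end{aligned}
```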

McNemar’s Test: Analysis of preference tests, or “which do you like better, Coke or Pepsi?” Many product tests use this technique.
Example 9.4, Comparison of Lotions: H0: The proportion of persons experiencing relief is the same with both lotions. HA: The proportion of persons experiencing relief is not the same with both lotions.

Principle: (Relief, Relief) and (No Relief, No Relief) give no information as to which is better! Only the discordant counts f12 and f21 are used. Under H0, f12 and f21 estimate the same quantity, (f12 + f21)/2.
Observed: f12, f21; Total: f12 + f21.

Degrees of freedom: 1 + 1 = 2 cells, minus 1 estimated value, = 1.
Test: If chi-square > χ²[α, 1], reject H0.
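A minimal sketch of the McNemar chi-square on the two discordant counts. The function name `mcnemar` and the counts are hypothetical; the continuity-corrected form is shown alongside the uncorrected one as a common variant, and the p-value uses the identity P(χ² with 1 d.f. > x) = erfc(sqrt(x/2)).

```python
import math


def mcnemar(f12, f21, corrected=False):
    """Return (chi_square, p_value) for McNemar's test with 1 d.f."""
    total = f12 + f21
    if total == 0:
        raise ValueError("no discordant pairs; the test is undefined")
    diff = abs(f12 - f21)
    if corrected:
        diff = max(diff - 1, 0)          # continuity correction
    chi_square = diff ** 2 / total
    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi_square / 2))
    return chi_square, p_value


# Hypothetical discordant counts (illustration only).
chi2, p = mcnemar(10, 6)
print(f"chi-square = {chi2:.3f}, p = {p:.3f}")
chi2_c, p_c = mcnemar(10, 6, corrected=True)
print(f"corrected chi-square = {chi2_c:.4f}, p = {p_c:.3f}")
```

The concordant cells never enter the calculation, which is the principle stated above.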

Biomedical Applications 1) Examiner vs Examiner

Comparing Against Truth: (The diagnostic test) Ex: Test: X-ray, MR, CT, CEA, PSA, TGF, … Truth: Pathology (Biopsy, FNA, Surgical section), Time and observation, Panel of experts -- (The Gold Standard)

Other names and parameters:
True Positive fraction = TPF = sensitivity
True Negative fraction = TNF = specificity
False Positive fraction = FPF = 1 - TNF
False Negative fraction = FNF = 1 - TPF
Positive Predictive Value = positive accuracy
Negative Predictive Value = negative accuracy
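These quantities follow directly from the four cells of a test-versus-truth 2 x 2 table. A minimal sketch, with a hypothetical function name `diagnostic_summary` and hypothetical counts:

```python
def diagnostic_summary(tp, fn, fp, tn):
    """Summaries of a diagnostic test against the gold standard.

    tp, fn: truly abnormal cases called abnormal / normal by the test.
    fp, tn: truly normal cases called abnormal / normal by the test.
    """
    return {
        "sensitivity (TPF)": tp / (tp + fn),
        "specificity (TNF)": tn / (tn + fp),
        "FPF": fp / (fp + tn),            # equals 1 - TNF
        "FNF": fn / (fn + tp),            # equals 1 - TPF
        "PPV": tp / (tp + fp),            # positive predictive value
        "NPV": tn / (tn + fn),            # negative predictive value
    }


# Hypothetical counts (illustration only).
summary = diagnostic_summary(tp=45, fn=5, fp=10, tn=40)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity condition on the truth, while the predictive values condition on the test result, so the latter depend on how common the abnormal condition is in the sample.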

Comparisons:
Two sensitivities from two different studies:
Two sensitivities from the same study: compare to each other; select only the True Positives or the True Negatives:

Higher Order Tables:

In Summary:
1) Individual McNemar chi-squares
2) Above versus Below -- 1 d.f.
3) Heterogeneity chi-square = sum of individual chi-squares minus above-vs-below chi-square, (1) - (2); d.f. = number of individual chi-squares - 1

Ex: Mildness Study. Exam 1: after cleaning. Exam 2: one month later.

Table columns: Hypothesis, Chi-Square, D.F.
Rows: H0: f12 = f21; H0: f13 = f31; H0: f23 = f32; Total; H0: below = above; Heterogeneity.
Conclusions: f12 ≠ f21. Above versus Below not significantly different. No Heterogeneity -- Homogeneous.
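The three-part analysis of a k x k table can be sketched as follows. The function name `mcnemar_heterogeneity` and the 3 x 3 table are hypothetical and are not the mildness-study data; rows are taken as Exam 1 categories and columns as Exam 2 categories, which is an assumption.

```python
from itertools import combinations


def mcnemar_heterogeneity(table):
    """Pairwise, pooled (above vs. below) and heterogeneity chi-squares."""
    k = len(table)
    pairwise = {}
    above = below = 0
    for i, j in combinations(range(k), 2):
        a, b = table[i][j], table[j][i]       # cell above / below the diagonal
        above += a
        below += b
        # Individual McNemar chi-square for H0: f_ij = f_ji (1 d.f. each).
        pairwise[(i + 1, j + 1)] = (a - b) ** 2 / (a + b) if a + b else 0.0
    total = sum(pairwise.values())            # d.f. = number of pairs
    pooled = (above - below) ** 2 / (above + below) if above + below else 0.0
    return {
        "pairwise": pairwise,
        "total": total,
        "above_vs_below": pooled,             # 1 d.f.
        "heterogeneity": total - pooled,
        "heterogeneity_df": len(pairwise) - 1,
    }


# Hypothetical 3 x 3 table of paired ratings (illustration only).
table = [
    [20, 8, 3],
    [4, 15, 6],
    [1, 2, 10],
]
result = mcnemar_heterogeneity(table)
print(result)
```

Each chi-square is compared with the critical value for its own degrees of freedom; a small heterogeneity chi-square suggests the pairwise shifts all point the same way.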

5) Rating Scale Data
Comparison of rater to “truth”. Examples:
a) Diagnostic Radiology Systems
1) Diagnostic value of MR, Xerox, and screen film in detection of breast cancer
b) Pathology
1) Comparison of staining systems to predict relapse (early vs. late)
2) Monoclonal stains or micro-satellite probes to predict stage of cancer
c) Laboratory Medicine
1) Comparison of machine classification of cells
d) Training
1) Comparison of novice to standard diagnosis

a) Often created from raters looking “blinded” at packets of cases.
b) Easy to set up but requires “truth” from (1) another method, (2) a gold standard, or (3) a team of raters.

ROC (Receiver Operating Characteristic) Analysis
(a) Calculate 2 x 2 tables, one per cut point (columns: Decide Abnormal, Decide Normal):
1: Make the cut point after “Very Likely”.
And so on to get:

(b) Plot the following points to create an ROC curve: (0,0), 1: (FP1, TP1), 2: (FP2, TP2), 3: (FP3, TP3), 4: (FP4, TP4), (1,1)
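A minimal sketch of how the ROC points arise from rating-scale counts. The function name `roc_points`, the five rating categories and the counts are hypothetical; categories are assumed to be ordered from most to least suspicious, so that cut point i calls the first i categories "abnormal".

```python
from itertools import accumulate


def roc_points(abnormal_counts, normal_counts):
    """Return the ROC points (FP fraction, TP fraction), from (0,0) to (1,1).

    abnormal_counts[i]: truly abnormal cases given rating category i.
    normal_counts[i]:   truly normal cases given rating category i.
    """
    n_abnormal = sum(abnormal_counts)
    n_normal = sum(normal_counts)
    # Cumulative counts give one 2 x 2 table per cut point.
    tp = [c / n_abnormal for c in accumulate(abnormal_counts)]
    fp = [c / n_normal for c in accumulate(normal_counts)]
    # The last cumulative point is (1, 1): every case called abnormal.
    return [(0.0, 0.0)] + list(zip(fp, tp))


# Hypothetical counts for 5 rating categories (illustration only).
abnormal = [30, 10, 5, 3, 2]
normal = [2, 3, 5, 10, 30]
for fp_frac, tp_frac in roc_points(abnormal, normal):
    print(f"FP = {fp_frac:.2f}, TP = {tp_frac:.2f}")
```

With five rating categories there are four cut points, which gives the four interior points between (0,0) and (1,1).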

Summary Chapter Paired Tests

2 × 2 and k × k tables. McNemar tests on Likert scales: (1) Pairwise (2) Pooled (above vs. below) (3) Heterogeneity chi-square