Exact probability test (Fisher’s Test) Oh Hun Kwon.

Slides:



Advertisements
Similar presentations
Contingency Tables Prepared by Yu-Fen Li.
Advertisements

Chapter 13: The Chi-Square Test
Chapter Goals After completing this chapter, you should be able to:
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
8-3 Testing a Claim about a Proportion
Chapter 14 Tests of Hypotheses Based on Count Data
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Presentation 12 Chi-Square test.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Statistical Analysis Statistical Analysis
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Hypothesis Testing CSCE 587.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
CHI SQUARE TESTS.
Copyright © 2010 Pearson Education, Inc. Slide
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
© 2002 Thomson / South-Western Slide 5-1 Chapter 5 Discrete Probability Distributions.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
© Copyright McGraw-Hill 2004
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Comparing Counts Chi Square Tests Independence.
I. ANOVA revisited & reviewed
Chapter 12 Chi-Square Tests and Nonparametric Tests
CHAPTER 26 Comparing Counts.
Chi-Square hypothesis testing
Chapter 9: Non-parametric Tests
Chapter 11 Chi-Square Tests.
Association between two categorical variables
Sampling Distributions
Hypothesis testing. Chi-square test
CHAPTER 11 Inference for Distributions of Categorical Data
Qualitative data – tests of association
Sampling Distributions
AP STATISTICS REVIEW INFERENCE
Data Analysis for Two-Way Tables
Hypothesis Testing and Comparing Two Proportions
Hypothesis testing. Chi-square test
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Contingency Tables: Independence and Homogeneity
Overview and Chi-Square
Chapter 11 Chi-Square Tests.
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 11 Chi-Square Tests.
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 11 Lecture 2 Section: 11.3.
Presentation transcript:

Exact probability test (Fisher’s Test) Oh Hun Kwon

Exact probability test (Fisher’s exact test) One of a class of exact tests, so called because the significance of the deviation from a null hypothesis can be calculated exactly, rather than relying on an approximation that becomes exact in the limit as the sample size grows to infinity, as with many statistical tests. Fisher's exact test is a statistical significance test used in the analysis of contingency tables where sample sizes are small. It is named after its inventor, R. A. Fisher.

Purpose and scope The test is useful for categorical data that result from classifying objects in two different ways; it is used to examine the significance of the association (contingency) between the two kinds of classification. For hand calculations, the test is only feasible in the case of a 2 × 2 contingency table. However the principle of the test can be extended to the general case of an m × n. Fisher's exact test assumes that the row and column totals are known in ad vance.

In our case NL vs AD  group difference (Thickness,FA,MD,Sulcal depth). Control age,gender if differentiate between two group significantly. –Age : independent two sample t-test –Gender : independent two sample t-test ?????  Chi square test, Fisher’s exact test NLADTotal Male(0) Female (1) Total Calculating statistical power

Assumption Random sampling –The data must be randomly selected from, at least representative of, a larger sample. The data must be form a contingency table –The value must be the number of subjects actually observed, not percentages or rates. Small sample size –Total sample size is less than about 20~40. Independent observations –Each subject must be independently selected from the population. Independent samples –No pairing or matching.

Circumstance (example) Some soldiers are being trained as parachutists. One rather windy afternoon 55 practice jumps take place at two localities, dropping zone A and dropping zone B. Of 15 men who jump at dropping zone A, five suffer sprained ankles, and of 40 who jump at dropping zone B, two suffer this injury. The casualty rate at dropping zone A seems unduly high, so the medical officer in charge decides to investigate the disparity.

Null Hypothesis Is it a difference that might be expected by chance? If not it deserves deeper study. The null hypothesis is that there is no difference in the probability of injury generating the proportion of injured men at each dropping zone.

Possible table The number of possible tables with these marginal totals is eight, that is, th e smallest marginal total plus one.

Calculating Probability InjuredUninjurTotal Zone A a b a+b Zone B c d c+d Total a+c b+dn ※ Fisher showed that the probability of obtaining any such set of values was given by the hypergeometric distribution....

Hypergeometric distribution The hypergeometric distribution is a discrete probability distribution that describes the number of successes in a sequence of n draws from a finite population without replacement, just as the binomial distribution describes the number of successes for draws with replacement. A random variable X follows the hypergeometric distribution with parameters N, m and n if the probability is given by

Hypergeometric Distribution

Calculating P-value The observed set has a probabi lity of The P value is the probability of getting the observed set, or one more extreme. A one tailed P value ⇒ = The two tailed value ⇒ P = for the conventional(double th e one tailed result) ⇒ P = for the mid P approach.

One tailed or Two tailed Observed table More extreme case than observed table

Conclusion In either case, the P value is less than the conventional 5% level The medical officer can conclude that there is a problem in dropping zone A. The way to present the results is: Injury rate in dropping zone A was 33%, in dropping zone B 5%; difference 28% (95% confidence interval 3.5 to 53.1% (from )), P = (Fisher's Exact test mid P).

Practice 9.1 Of 30 men employed in a small workshop 18 worked in one department and 12 in another department. In one year five of the 18 reported sick with septic hands, and of the 12 men in the other department one did so.  Is there a difference in the departments and how would you report this result?

Answer SickNot sick Total Dep A Dep B Total P P P P P P P

Answer P-value = = (one-tailed) P-value = = (two-tailed) Significantly different P is calculated from 2 x (0.5 x ).