Contingency Table Analysis Mary Whiteside, Ph.D..

Slides:



Advertisements
Similar presentations
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Advertisements

Chapter 18: The Chi-Square Statistic
Inference about the Difference Between the
Chapter 13: The Chi-Square Test
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Chapter 14 Analysis of Categorical Data
Session 7.1 Bivariate Data Analysis
Previous Lecture: Analysis of Variance
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Inferential Statistics
Presentation 12 Chi-Square test.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics, A First Course 4 th Edition.
Cross Tabulation Statistical Analysis of Categorical Variables.
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Statistical test for Non continuous variables. Dr L.M.M. Nunn.
Chi- square test x 2. Chi Square test Symbolized by Greek x 2 pronounced “Ki square” A Test of STATISTICAL SIGNIFICANCE for TABLE data.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Copyright © 2010 Pearson Education, Inc. Slide
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
© Copyright McGraw-Hill CHAPTER 11 Other Chi-Square Tests.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Cross Tabulation with Chi Square
Test of independence: Contingency Table
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chi-Square hypothesis testing
Chi-square Basics.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
Chapter 11 Chi-Square Tests.
8. Association between Categorical Variables
Qualitative data – tests of association
Summarising and presenting data - Bivariate analysis
SA3202 Statistical Methods for Social Sciences
Statistical Analysis of Categorical Variables
M248: Analyzing data Block D.
The Chi-Square Distribution and Test for Independence
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 11: Inference for Distributions of Categorical Data
Chapter 10 Analyzing the Association Between Categorical Variables
Chapter 11 Chi-Square Tests.
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Analyzing the Association Between Categorical Variables
Statistics II: An Overview of Statistics
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Statistical Analysis of Categorical Variables
BUSINESS MARKET RESEARCH
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Chapter Outline Goodness of Fit test Test of Independence.
Chapter 11 Chi-Square Tests.
Presentation transcript:

Contingency Table Analysis Mary Whiteside, Ph.D.

Overview Hypotheses of equal proportions Hypotheses of independence Exact distributions and Fishers test The Chi squared approximation Median test Measures of dependence The Chi squared goodness-of-fit test Cochrans test

Contingency Table Examples Countries - religion by government States – dominant political party by geographic region Mutual funds - style by family Companies - industry by location of headquarters

More examples - Countries - government by GDP categories States - divorce laws by divorce rate categories Mutual funds - family by Morning Star rankings Companies - industry by price earnings ratio category

Statistical Inference hypothesis of equal proportions H 0 : all probabilities (estimated by proportions, relative frequencies) in the same column are equal, H 1 :at least two of the probabilities in the same column are not equal Here, for an r x c contingency table, r populations are sampled with fixed row totals, n 1, n 2, … n r.

Hypothesis of independence H 0 : no association i.e. row and column variable are independent, H 1 : an association, i.e. row and column variable are not independent Here, one populations is sampled with sample size N. Row totals are random variables.

Exact distribution for 2 x 2 tables: hypothesis of equal proportions; n 1 = n 2 =

Fishers Exact Test For 2 x 2 tables assuming fixed row and column totals r, N-r, c, N-c: Test statistic = x, the frequency of cell 11 Probability = hyper-geometric probability of x successes in a sample of size r from a population of size N with c successes

Large sample approximation for either test Chi squared = Observed - Expected] 2 /Expected Observed frequency for cell ij comes from cross-tabulation of data Expected frequency for cell ij = Probability Cell ij * N Degrees of freedom (r-1)*(c-1)

Computing Cell Probabilities Assumes independence or equal probabilities (the null hypothesis) Probability Cell ij = Probability Row i * Probability Column j = (R i /N) * (C j /N) Expected frequency ij = (R/N)*(C/N)*N = R*C/N.

Distribution of the Sum Chi Square with (r-1)*(c-1) degrees of freedom Assumes Observed - Expected] 2 /Expected is standard normal squared

Implies Observed - Expected] / Square root[Expected ] is standard normal Implies and Observed is a Poisson RV Poisson is approximately normal if > 5, traditional guideline Conovers relaxed guideline page 201

Measures of Strength: Categorical Variables Phi 2x2 Cramer's V for rxc Pearson's Contingency Coefficient Tschuprow's T

Measures of Strength: Ordinal Variables Lambda A.. Rows dependent Lambda B.. Columns dependent Symmetric Lambda Kendall's tau-B Kendall's tau-C Gamma

Steps of Statistical Analysis Significance - Strength 1- Test for significance of the observed association 2 - If significant, measure the strength of the association

Consider the correlation coefficient a measure of association (linear relationship between two quantitative variables ) significant but not strong significant and strong not significant but strong not significant and not strong

r and Prob (p-value) r =.20 p-value <.05 r =.90 p-value <.05 r =.90 p-value >.05 r =.20 p-value >.05

Concepts Predictive associations must be both significant and strong In a particular application, an association may be important even if it is not predictive (I.e. strong)

More concepts Highly significant, weak associations result from large samples Insignificant strong associations result from small samples - they may prove to be either predictive or weak with larger samples

Examples Heart attack Outcomes by Anticoagulant Treatment Admission Decisions by Gender

Summary Is there an association? –Investigate with Chi square p-value If so, how strong is it? –Select the appropriate measure of strength of association Where does it occur? –Examine cell contributions