Categorical Data Analysis

Slides:



Advertisements
Similar presentations
Contingency Table Analysis Mary Whiteside, Ph.D..
Advertisements

Chapter 2 Describing Contingency Tables Reported by Liu Qi.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Inference about the Difference Between the
Statistical Inference for Frequency Data Chapter 16.
Loglinear Models for Contingency Tables. Consider an IxJ contingency table that cross- classifies a multinomial sample of n subjects on two categorical.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
1 STA 517 – Introduction: Distribution and Inference 1.5 STATISTICAL INFERENCE FOR MULTINOMIAL PARAMETERS  Recall multi(n, =( 1,  2, …,  c ))  Suppose.
1 1 Slide © 2009 Econ-2030-Applied Statistics-Dr. Tadesse. Chapter 11: Comparisons Involving Proportions and a Test of Independence n Inferences About.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Crosstabs and Chi Squares Computer Applications in Psychology.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Previous Lecture: Analysis of Variance
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Chapter 13 Chi-Square Tests. The chi-square test for Goodness of Fit allows us to determine whether a specified population distribution seems valid. The.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Chapter 10 Analyzing the Association Between Categorical Variables
How Can We Test whether Categorical Variables are Independent?
AS 737 Categorical Data Analysis For Multivariate
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
A random sample of 300 doctoral degree
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2005 Thomson/South-Western Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial Population Goodness of.
Multinomial Distribution
Chapter 26 Chi-Square Testing
Chapter 13: Categorical Data Analysis Statistics.
Discrete Multivariate Analysis Analysis of Multivariate Categorical Data.
1 Always be mindful of the kindness and not the faults of others.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
1 1 Slide Chapter 11 Comparisons Involving Proportions n Inference about the Difference Between the Proportions of Two Populations Proportions of Two Populations.
Introduction Many experiments result in measurements that are qualitative or categorical rather than quantitative. Humans classified by ethnic origin Hair.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Contingency Tables 1.Explain  2 Test of Independence 2.Measure of Association.
Statistical Significance for a two-way table Inference for a two-way table We often gather data and arrange them in a two-way table to see if two categorical.
Copyright © 2010 Pearson Education, Inc. Slide
Inference for Distributions of Categorical Variables (C26 BVD)
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
March 30 More examples of case-control studies General I x J table Chi-square tests.
Chapter Outline Goodness of Fit test Test of Independence.
Dan Piett STAT West Virginia University Lecture 12.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
1 Always be mindful of the kindness and not the faults of others.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.3 Other Ways of Comparing Means and Comparing Proportions.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Categorical Analysis STAT120C 1. Review of Tests Learned in STAT120C Which test(s) should be used to answer the following questions? – Is husband’s BMI.
Log-linear Models Please read Chapter Two. We are interested in relationships between variables White VictimBlack Victim White Prisoner151 (151/160=0.94)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.1 Independence.
Data Analysis for Two-Way Tables
If we can reduce our desire,
Chapter 10 Analyzing the Association Between Categorical Variables
Analyzing the Association Between Categorical Variables
Joyful mood is a meritorious deed that cheers up people around you
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Modeling Ordinal Associations Bin Hu
Presentation transcript:

Categorical Data Analysis Chapter 3: Inference for Contingency Tables

Estimation of Association Parameters Proportion difference point and interval estimators Relative risk Odds ratio Example

IxJ Contingency Tables Inference on difference, RR, odds ratio Are X and Y independent? Ho : independence

Measure the Lack of Independence Pearson chi-square statistic: Likelihood ratio (LR) test statistic: If the statistic is too large, then we have a strong evidence against independence.

Tests for Independence For Pearson or LR test, the df of the chi-square test is the dimension of the whole parameter space (Q) – the dimension of the hypothesized parameter space (Q0), i.e. df = dim(Q) – dim(Q0)

Poisson sampling: df = IJ-(I+J-1) =(I-1)(J-1) Single multinomial sampling: df = (IJ-1)-(I+J-2) = (I-1)(J-1) Independent multinomial sampling: df = I(J-1)-(J-1)

Example: Oral Contraceptive vs. Heart Attack Case-Control study: Retrospective sampling; Column totals were fixed Oral Contra-ceptives Heart attack Yes No Used 23 34 Never used 35 132 Total 58 166

Follow-up Chi-squared Tests Pearson and standardized residuals Partitioning Chi-squared

Residuals Pearson residual: Standardized Pearson residual:

Partitioning Chi-squared Describing association in IxJ table Partition a IxJ table to (I-1)(J-1) sub 2x2 tables Chi-squared= the sum of independent (I-1)(J-1) chi-squareds

Rules for Independent Partitioning S df for the subtables = (I-1)(J-1) Each cell count nij must appear in one and only one subtable Each marginal total (ni+ or n+j) must be a marginal total for one and only one subtable

Example: Aspirin vs. Heart Attack Prospective sampling; Row totals were fixed Fatal H.A. Non-fatal H.A. No H.A. Placebo 18 171 10845 Aspirin 5 99 10933

Ordinary X: Trend Tests Test for Linear Trend alternative: M^2 Choice of scores Example: Table 2.8. 1996 General Social Survey Job Satisfaction Income Very dissatisfied Little dissatisfied Moderately satisfied Very satisfied <15K 1 3 10 6 15K-25K 2 7 25K-40K 14 12 >40K 9 11

Exact Test for Independence The Chi-squared tests are for large samples The Chi-squared tests are valid only when The sample size is large enough so that expected frequencies are greater than or equal to 5 for 80% or more of the categories

Fisher’s Exact Test Consider a 2x2 table Under the three sampling methods, what is the distribution of n11 conditional on n1+, n2+, n+1, n+2? Example: Table 3.8