Lecture 2 The analysis of cross-tabulations. Cross-tabulations Tables of countable entities or frequencies Made to analyze the association, relationship,

Slides:



Advertisements
Similar presentations
Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis.
Advertisements

What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
CHI-SQUARE(X2) DISTRIBUTION
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
The Chi-Square Test for Association
The analysis of cross-tabulations. Today Chi-square test Fisher’s exact test Odds and odds ratios McNemars test.
Statistical Inference for Frequency Data Chapter 16.
Chapter 13: The Chi-Square Test
Chi Square Analyses: Comparing Frequency Distributions.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
1 The Chi squared table provides critical value for various right hand tail probabilities ‘A’. The form of the probabilities that appear in the Chi-Squared.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
THE CONCEPT OF STATISTICAL SIGNIFICANCE:
1 SOC 3811 Basic Social Statistics. 2 Reminder  Hand in your assignment 5  Remember to pick up your previous homework  Final exam: May 12 th (Saturday),
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
Categorical Data Prof. Andy Field.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Amsterdam Rehabilitation Research Center | Reade Testing significance - categorical data Martin van der Esch, PhD.
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
CJ 526 Statistical Analysis in Criminal Justice
1 G Lect 6b G Lecture 6b Generalizing from tests of quantitative variables to tests of categorical variables Testing a hypothesis about a.
For testing significance of patterns in qualitative data Test statistic is based on counts that represent the number of items that fall in each category.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Repeated measures ANOVA in SPSS Cross tabulations Survival analysis.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
The binomial applied: absolute and relative risks, chi-square.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chi Square Classifying yourself as studious or not. YesNoTotal Are they significantly different? YesNoTotal Read ahead Yes.
Chi square analysis Just when you thought statistics was over!!
Section 10.2 Independence. Section 10.2 Objectives Use a chi-square distribution to test whether two variables are independent Use a contingency table.
Reasoning in Psychology Using Statistics Psychology
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 12. The Chi-Square Test.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chi-square test Hypothesis testing. Null hypothesis- The status quo– AKA no change/difference. Alternative hypothesis- What you want to prove/show with.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Section 10.2 Objectives Use a contingency table to find expected frequencies Use a chi-square distribution to test whether two variables are independent.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 9 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Cross Tabulation with Chi Square
Statistical Inferences for Population Variances
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Hypothesis Testing Review
Community &family medicine
Qualitative data – tests of association
The Chi-Square Distribution and Test for Independence
Econ 3790: Business and Economics Statistics
Chapter 10 Analyzing the Association Between Categorical Variables
Statistical Process Control
Statistical Analysis Chi-Square.
Inference for Two Way Tables
Chapter 14.1 Goodness of Fit Test.
Quadrat sampling & the Chi-squared test
Hypothesis Testing - Chi Square
Quadrat sampling & the Chi-squared test
Mortality Analysis.
Presentation transcript:

Lecture 2 The analysis of cross-tabulations

Cross-tabulations Tables of countable entities or frequencies Made to analyze the association, relationship, or connection between two variables This association is difficult to describe statistically Null- Hypothesis: “There is no association between the two variables” can be tested Analysis of cross-tabulations with larges samples

Delivery and housing tenure Housing tenurePretermTermTotal Owner-occupier Council tentant Private tentant Lives with parents66672 Other33639 Total

Delivery and housing tenure Expected number without any association between delivery and housing tenure Housing tenurePreTermTotal Owner-occupier899 Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 899/1443 = 62.3% are house owners. 62.3% of the Pre-terms should be house owners: 99*899/1443 = 61.7 Housing tenurePreTermTotal Owner-occupier899 Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 899/1443 = 62.3% are house owners. 62.3% of the ‘Term’s should be house owners: 1344*899/1443 = Housing tenurePreTermTotal Owner-occupier Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 258/1443 = 17.9% are council tenant. 17.9% of the ‘preterm’s should be council tenant: 99*258/1443 = 17.7 Housing tenurePreTermTotal Owner-occupier Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true In general Housing tenurePreTermTotal Owner-occupier Council tenant Private tenant Lives with parents Other Total row total * column total grand total

Delivery and housing tenure If the null-hypothesis is true In general Housing tenurePreTermTotal Owner-occupier50(61.7)849(837.3)899 Council tenant29(17.7)229(240.3)258 Private tenant11(12.0)164(163.0)175 Lives with parents6(4.9)66(67.1)72 Other3(2.7)36(36.3)39 Total row total * column total grand total

Delivery and housing tenure If the null-hypothesis is true Housing tenurePreTermTotal Owner-occupier50(61.7)849(837.3)899 Council tenant29(17.7)229(240.3)258 Private tenant11(12.0)164(163.0)175 Lives with parents6(4.9)66(67.1)72 Other3(2.7)36(36.3)39 Total

Delivery and housing tenure test for association If the numbers are large this will be chi- square distributed. The degree of freedom is (r-1)(c-1) = 4 From Table 13.3 there is a 1 - 5% probability that delivery and housing tenure is not associated

Chi Squared Table

Delivery and housing tenure If the null-hypothesis is true It is difficult to say anything about the nature of the association. Housing tenurePreTermTotal Owner-occupier50(61.7)849(837.3)899 Council tenant29(17.7)229(240.3)258 Private tenant11(12.0)164(163.0)175 Lives with parents6(4.9)66(67.1)72 Other3(2.7)36(36.3)39 Total

2 by 2 tables BronchitisNo bronchitisTotal Cough No Cough Total

2 by 2 tables BronchitisNo bronchitisTotal Cough26 (14.49)44 (55.51)70 No Cough247 (258.51)1002 (990.49)1249 Total

Chi Squared Table

Chi-squared test for small samples Expected valued – > 80% >5 –All >1 StreptomycinControlTotal Improvement13 (8.4)5 (9.6)18 Deterioration2 (4.2)7 (4.8)9 Death0 (2.3)5 (2.7)5 Total151732

Chi-squared test for small samples Expected valued – > 80% >5 –All >1 StreptomycinControlTotal Improvement13 (8.4)5 (9.6)18 Deterioration and death 2 (6.6)12 (7.4)14 Total151732

Fisher’s exact test An example SDT A314 B SDT A404 B SDT A134 B SDT A224 B

Fisher’s exact test Survivers: –a, b, c, d, e Deaths: –f, g, h Table 1 can be made in 5 ways Table 2: 30 Table 3: 30 Table 4: 5 70 ways in total SDT A314 B SDT A404 B SDT A134 B SDT A224 B

Fisher’s exact test Survivers: –a, b, c, d, e Deaths: –f, g, h Table 1 can be made in 5 ways Table 2: 30 Table 3: 30 Table 4: 5 70 ways in total The properties of finding table 2 or a more extreme is:

Fisher’s exact test SDT A f 11 f 12 r1r1 B f 21 f 22 r2r2 c1c1 c2c2 n SDT A314 B SDT A f 11 f 12 r1r1 B f 21 f 22 r2r2 c1c1 c2c2 n SDT A404 B

Yates’ correction for 2x2 Yates correction: StreptomycinControlTotal Improvement13 (8.4)5 (9.6)18 Deterioration and death 2 (6.6)12 (7.4)14 Total151732

Chi Squared Table

Yates’ correction for 2x2 Table 13.7 –Fisher: p = –‘Two-sided’p = –χ 2: p = –Yates’p =

Odds and odds ratios Odds, p is the probability of an event Log odds / logit

Odds The probability of coughs in kids with history of bronchitis. p = 26/273 = o = 26/247 = The probability of coughs in kids with history without bronchitis. p = 44/1046 = o = 44/1002 = BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total

Odds ratio The odds ratio; the ratio of odds for experiencing coughs in kids with and kids without a history of bronchitis. BronchitisNo bronchitisTotal Cough26; (a)44; (b)70 No Cough247; 9.50 (c)1002; 22.8 (d)1249 Total

Is the odds ratio different form 1? BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total We could take ln to the odds ratio. Is ln(or) different from zero? 95% confidence (assumuing normailty)

Confidence interval of the Odds ratio ln (or) ± 1.96*SE(ln(or)) = 0.37 to 1.38 Returning to the odds ratio itself: e to e = 1.45 to 3.97 The interval does not contain 1, indicating a statistically significant difference BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total

Chi-square for goodness of fit df = = 2