Biostat 200 Lecture 8 1. The test statistics follow a theoretical distribution (t stat follows the t distribution, F statistic follows the F distribution,

Slides:



Advertisements
Similar presentations
Contingency Tables Prepared by Yu-Fen Li.
Advertisements

1 Contingency Tables: Tests for independence and homogeneity (§10.5) How to test hypotheses of independence (association) and homogeneity (similarity)
Sociology 601 Class 13: October 13, 2009 Measures of association for tables (8.4) –Difference of proportions –Ratios of proportions –the odds ratio Measures.
Categorical Data. To identify any association between two categorical data. Example: 1,073 subjects of both genders were recruited for a study where the.
Analysis of frequency counts with Chi square
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
Chapter Seventeen HYPOTHESIS TESTING
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Sociology 601 Class12: October 8, 2009 The Chi-Squared Test (8.2) – expected frequencies – calculating Chi-square – finding p When (not) to use Chi-squared.
Inferences About Process Quality
Nemours Biomedical Research Statistics March 26, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Student’s t statistic Use Test for equality of two means
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Presentation 12 Chi-Square test.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Biostat 200 Lecture 8 1. Hypothesis testing recap Hypothesis testing – Choose a null hypothesis, one-sided or two sided test – Set , significance level,
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Inferential Statistics: SPSS
Variance-Test-1 Inferences about Variances (Chapter 7) Develop point estimates for the population variance Construct confidence intervals for the population.
Analysis of Categorical Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Bandit Thinkhamrop, PhD. (Statistics) Department of Biostatistics and Demography Faculty of Public Health Khon Kaen University, THAILAND.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Nonparametric Statistics
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
CHI SQUARE TESTS.
Copyright © 2010 Pearson Education, Inc. Slide
Biostat 200 Lecture 8 1. Where are we Types of variables Descriptive statistics and graphs Probability Confidence intervals for means and proportions.
Biostat 200 Lecture 7 1. Outline for today Hypothesis tests so far – One mean, one proportion, 2 means, 2 proportions Comparison of means of multiple.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
More Contingency Tables & Paired Categorical Data Lecture 8.
Types of Categorical Data Qualitative/Categorical Data Nominal CategoriesOrdinal Categories.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 9
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Bandit Thinkhamrop, PhD. (Statistics) Department of Biostatistics and Demography Faculty of Public Health Khon Kaen University, THAILAND.
I. ANOVA revisited & reviewed
Chapter 9: Non-parametric Tests
Categorical Data Aims Loglinear models Categorical data
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Applied Statistics Using SPSS
Applied Statistics Using SPSS
Discussion Week 1 (4/1/13 – 4/5/13)
Presentation transcript:

Biostat 200 Lecture 8 1

The test statistics follow a theoretical distribution (t stat follows the t distribution, F statistic follows the F distribution, z stat follows the Standard Normal) if certain assumptions are met. These assumptions are: – For t-test and ANOVA, the underlying distribution of the random variable being measured (X) should be approximately normal In reality the t-test is rather robust, so with large enough sample size and without very large outliers, it is ok to use the t-test – For the ANOVA, the variance of the subgroups should be approximately equal – For the Wilcoxon Rank Sum Test and the Kruskal-Wallis the underlying distributions must have the same basic shape 2 Review

Categorical outcomes With the exception of the proportion test, all the previous tests were for comparing numerical outcomes and categorical predictors – E.g., CD4 count by alcohol consumption – BMI by sex We often have dichotomous outcomes and predictors – E.g. Had at least one cold in the prior 3 months by sex 3

We can make tables of the number of observations falling into each category These are called contingency tables E.g. At least one cold by sex. tab coldany sex | sex coldany | Male Female | Total | | | | Total | | 535 4

Contingency tables Often summaries of counts of disease versus no disease and exposed versus not exposed Frequently 2x2 but can generalize to n x k – n rows, k columns Note that Stata sorts on the numeric value, so for 0-1 variables the disease state will be the 2 nd row Exposure +-Total Disease+aba+b -cdc+d Totala+cb+dn=a+b+c+d 5

Contingency tables Contingency tables are usually summaries of data that originally looked like this. Example of data set Obs.Exposure (1=yes; 0=no) Disease (1=yes; 0=no) ……… n00 6

. list coldany sex | coldany sex | | | 1. | yes male | 2. | no male | 3. | yes female | 4. | yes female | 5. | no male | | | 6. | no male | 7. | no male | 8. | yes male | 9. | yes male | 10. | yes male | | | 11. | no female | 12. | yes male | 13. | no male | 14. | yes female | 15. | no female | | | 16. | yes female |. list coldany sex, nolabel | coldany sex | | | 1. | 1 0 | 2. | 0 0 | 3. | 1 1 | 4. | 1 1 | 5. | 0 0 | | | 6. | 0 0 | 7. | 0 0 | 8. | 1 0 | 9. | 1 0 | 10. | 1 0 | | | 11. | 0 1 | 12. | 1 0 | 13. | 0 0 | 14. | 1 1 | 15. | 0 1 | | | 16. | 1 1 | 7

We want to know whether the incidence of colds varies by gender. We could test the null hypothesis that the cumulative incidence of ≥1 cold in males equals that of females. The cumulative incidence is a proportion. H 0 : p males = p females H A : p males ≠ p females 8

. prtest coldany, by(sex) Two-sample test of proportion Male: Number of obs = 295 Female: Number of obs = Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] Male | Female | diff | | under Ho: diff = prop(Male) - prop(Female) z = Ho: diff = 0 Ha: diff 0 Pr(Z z) =

There are other methods to do this (chi- square test) Why? – These methods are more general – can be used when you have more than 2 levels in either variable We will start with the 2x2 example however 10

Overall, the cumulative incidence of least one cold in the prior 3 months is 304/535=.568. This is the marginal probability of having a cold There were 295 males and 240 females Under the null hypothesis, the expected cumulative incidence in each group is the overall cumulative incidence So we would expect 295*.568=167.6 with at least one cold in the males, and 240*.568=136.3 with at least one cold in the females 11. tab coldany sex | sex coldany | Male Female | Total | | | | Total | | 535

We can also calculate the expected number with no colds under the null hypothesis of no difference – Males: 295*(1-.568) = – Females: 240*(1-.568) = We can make a table of the expected counts 12 Observed data. tab coldany sex | sex coldany | Male Female | Total | | | | Total | | 535 EXPECTED COUNTS UNDER THE NULL HYPOTHESIS | sex coldany | Male Female | Total | | | | Total | | 535

Generically 13 Expected counts Exposure +-Total Disease+(a+b)(a+c)/n(a+b)(b+d)/na+b -(c+d)(a+c)/n(c+d)(b+d)/nc+d Totala+cb+dn=a+b+c+d

The Chi-square test compares the observed frequency (O) in each cell with the expected frequency (E) under the null hypothesis of no difference The differences O-E are squared, divided by E, and added up over all the cells The sum of this is the test statistic and follows a chi-square distribution 14

Chi-square test of independence The chi-square test statistic (for the test of independence in contingency tables) for a 2x2 table (dichotomous outcome, dichotomous exposure) i is the index for the cells in the table – there are 4 cells This test statistic is compared to the chi-square distribution with 1 degree of freedom 15

Chi-square test of independence The chi-square test statistic for the test of independence in an nxk contingency table is This test statistic is compared to the chi-square distribution The degrees of freedom for the this test are (n-1)*(k-1), so for a 2x2 there is 1 degree of freedom – n=the number of rows; k=the number of columns in the nxk table – The chi-square distribution with 1 degree of freedom is actually the square of a standard normal distribution Expected cell sizes should all be >1 and <20% should be <5 The Chi-square test is for two sided hypotheses 16

Chi-square distribution 17

Chi-square distribution 18 Mean = degrees of freedom Variance = 2*degrees of freedom

Chi-square test of independence For the example, the chi-square statistic for our 2x2 is ( ) 2 / ( ) 2 / ( ) 2 / ( ) 2 /136.3 =.405 There is 1 degree of freedom Probability of observing a chi-square value with 1 degree of freedom of.405 is.525. di chi2tail(1,.405) Fail to reject the null hypothesis of independence 19

. tab coldany sex, chi | sex coldany | Male Female | Total | | | | Total | | 535 Pearson chi2(1) = Pr = Test statistic (df) p-value

If you want to see the row or column percentages, use row or col options. tab coldany sex, row col chi expected | Key | | | | frequency | | expected frequency | | row percentage | | column percentage | | sex coldany | Male Female | Total | | 231 | | | | | | | | 304 | | | | | | Total | | 535 | | | | | | Pearson chi2(1) = Pr =

Because we using discrete cell counts to approximate a chi-squared distribution, for 2x2 tables some use the Yates correction Not computed in Stata 22

Lexicon When we talk about the chi-square test, we are saying it is a test of independence of two variables, usually exposure and disease. We also say we are testing the “association” between the two variables. If the test is statistically significant (p<0.05 if  =0.05), we often say that the two variables are “not independent” or they are “associated”. 23

Test of independence For small cell sizes in 2x2 tables, use the Fisher exact test It is based on a discrete distribution called the hypergeometric distribution For 2x2 tables, you can choose a one-sided or two-sided test. tab coldany sex, chi exact | sex coldany | Male Female | Total | | | | Total | | 535 Pearson chi2(1) = Pr = Fisher's exact = sided Fisher's exact =

Comparison to test of two proportions. prtest coldany, by(sex) Two-sample test of proportion Male: Number of obs = 295 Female: Number of obs = Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] Male | Female | diff | | under Ho: diff = prop(Male) - prop(Female) z = Ho: diff = 0 Ha: diff 0 Pr(Z z) = For 2x2 tables the chi-square statistic is equal to the z statistic squared. di.6364^

Chi-square test of independence The chi-square test can be used for more than 2 levels of exposure (with a dichotomous outcome) – The null hypothesis is p 1 = p 2 =... = p k – The alternative hypothesis is that not all the proportions are the same Note that, like ANOVA, a statistically significant result does not tell you which level differed from the others Also when you have more than 2 groups, all tests are 2-sided The degrees of freedom for the test are k-1 26

Chi-square test of independence. tab coldany racegrp, chi col exact | Key | | | | frequency | | column percentage | Enumerating sample-space combinations: stage 3: enumerations = 1 stage 2: enumerations = 4 stage 1: enumerations = 0 | racegrp coldany | White, Ca Asian/PI Other | Total | | 233 | | | | 304 | | Total | | 537 | | Pearson chi2(2) = Pr = Fisher's exact =

Another way to state the null hypothesis for the chi-square test: – Factor A is not associated with Factor B The alternative is – Factor A is associated with Factor B For more than 2 levels of the outcome variable this would make the most sense The degrees of freedom are (r-1)*(c-1) (r=rows, c=columns) 28

... tab cold3grp racegrp, chi col exact | Key | | | | frequency | | column percentage | | racegrp cold3grp | White, Ca Asian/PI Other | Total No colds | | 233 | | One cold | | 191 | | >1 cold | | 113 | | Total | | 537 | | Pearson chi2(4) = Pr = Fisher's exact = Note that this is a 3x3 table, so the chi-square test has 2x2=4 degrees of freedom

Paired dichotomous data Matched pairs – Matched case-control study – Before and after data You cannot just put each individual into an exposure and disease box, because then you would lose the benefits of pairing (and the observations would not be independent!) Instead you have a table that tabulates each of the 4 possible states for each pair 30

Paired dichotomous data For a 1:1 matched case/control study, in all pairs, 1 has the disease (case) and 1 does not (control). The table then counts the number of pairs in which – 1. Both were exposed – 2. Neither were exposed – 3. The case was exposed, the control was not – 4. The case was not exposed, the control was exposed 31

Case-control study HIV positives on ART in Uganda 32 ControlsTotal Cases Alcohol consumption prior 3 months?Yes (exposed) No (not exposed) Yes (exposed)4913 No (not exposed)31114 Total72027 The study question was: Is alcohol consumption associated with treatment failure? The null hypothesis is that alcohol consumption is not associated with treatment failure Cases: Treatment failure: HIV viral load after 6 months of ART >400 Controls: HIV viral load <400 Matched on sex, duration on treatment, and treatment regimen class

The test statistic is r and s are the number of discordant pairs – Concordant pairs provide no information Under the null hypothesis, r and s would be equal This statistic has an approximate chi-square distribution with 1 degree of freedom The test is called McNemar’s test – The -1 is a continuity correction, not all versions of the test use this, some use.5 33

r=9, s=3 Test statistic = (6-1)^2/12 = di chi2tail(1,2.083) Test statistic = (6)^2/12 = 3 (Not using the continuity correction) di chi2tail(1,3)

In Stata, use mcc for Matched Case Control mcc case_exposed control_exposed. mcc lastalc_case lasttime_alc_3mos | Controls | Cases | Exposed Unexposed | Total Exposed | 4 9 | 13 Unexposed | 3 11 | Total | 7 20 | 27 McNemar's chi2(1) = 3.00 Prob > chi2 = Exact McNemar significance probability = Proportion with factor Cases Controls [95% Conf. Interval] difference ratio rel. diff odds ratio (exact) 35

Use mcci if you only have the table, not the raw data mcci #both_exposed #case_exposed_only #control_exposed_only #neither_exposed. mcci | Controls | Cases | Exposed Unexposed | Total Exposed | 4 9 | 13 Unexposed | 3 11 | Total | 7 20 | 27 McNemar's chi2(1) = 3.00 Prob > chi2 = Exact McNemar significance probability = Proportion with factor Cases Controls [95% Conf. Interval] difference ratio rel. diff odds ratio (exact) 36

Note that the McNemar test is only for MATCHED case/control data!!! It is quite possible to collect unmatched case control data. Then you analyze using the chi- square methods presented earlier. 37

Paired dichotomous data For before and after data, the pairs are the individual participant, and the four outcomes might be: 1. “Yes” before + “Yes” after (no change) 2. “No” before + “No” after (no change) 3. “Yes” before + “No” after 4. “No” before + “Yes” after E.g. Reporting alcohol consumption before and after being consented to a study in which blood and urine will be tested for an alcohol biomarker 38

Self-reported alcohol consumption in Uganda McNemar’s test for paired data 39 Null hypothesis: The groups change their self- reported alcohol consumption equally After measureBefore measureTotal Alcohol consumption prior 3 monthsYesNo Yes No037 Total125062

Matched case-control study command. mcci | Controls | Cases | Exposed Unexposed | Total Exposed | | 25 Unexposed | 0 37 | Total | | 62 McNemar's chi2(1) = Prob > chi2 = Exact McNemar significance probability = Proportion with factor Cases Controls [95% Conf. Interval] difference ratio rel. diff odds ratio (exact) 40

Statistical hypothesis tests Data and comparison type Alternative hypothesesParametric test Stata command Non-parametric test Stata command Numerical; One meanH a : μ≠ μ a (two-sided) H a : μ>μ a or μ<μ a (one-sided) Z or t-test ttest var1=hypoth val.* Dichotomous; One proportion H a : p≠ p a (two-sided) H a : p>p a or p<p a (one-sided) Proportion test prtest var1=hypoth value* Numerical; Two means, paired data H a : μ 1 ≠ μ 2 (two-sided) H a : μ 1 >μ 2 or μ<μ a (one-sided) Paired t-test ttest var1=var2* Sign test (signtest var1=var2) or Wilcoxon Signed-Rank (signrank var1=var2) Numerical; Two means, independent data H a : μ 1 ≠ μ 2 (two-sided) H a : μ 1 >μ 2 or μ<μ a (one-sided) T-test (equal or unequal variance) ttest var1, by(byvar) unequal (as needed) Wilcoxon rank-sum test ranksum var1, by(byvar) Dichotomous; two proportions H a : p 1 ≠ p 2 (two-sided) H a : p 1 >p 2 (one-sided) Proportion test (z-test) prtest var1, by(byvar) Chi-square test tab var1 var2, chi exact (McNemar’s for paired data: mcc) Numerical, Two or more means, independent data H a : μ 1 ≠ μ 2 or μ 1 ≠ μ 3 or μ 2 ≠ μ 3 etc.ANOVA oneway var1 byvar Kruskal Wallis test kwallis var1, by(byvar) 41

Comparison of disease frequencies across groups The chi-square test and McNemar’s test are tests of independence They does not give us an estimate of how much the two groups differ, i.e. how much the disease outcome varies by the exposure variable We use odds ratios (OR) and relative risks (RR) as measures of ratios of disease outcome (given exposure or lack of exposure) The odds ratio and the relative risk are just two examples of “measures of association” 42

Comparison of disease frequencies – relative risk Exposure Disease+-Total +aba+b -cdc+d Totala+cb+dn=a+b+c+d  Risk ratio (or relative risk or relative rate) = P (disease | exposed) / P(disease | unexposed) = R e / R u = a/(a+c) / b/(b+d) 43

Comparison of disease frequencies – relative risk  Note that you cannot calculate this entity when you have chosen your sample based on disease status I.e. Case-control study – you have fixed a prior the probability of disease! Relative risk is a NO GO! You can calculate it but it won’t have any meaning… Exposure Disease+-Total +aba+b -cdc+d Totala+cb+dn=a+b+c+d 44

Odds If an event occurs with probability p, the odds of the event are p/(1-p) to 1 If an event has probability.5, the odds are 1:1 Conversely, if the odds of an event are a:b, the probability of a occurring is a/(a+b) – The odds of horse A winning over horse B winning are 2:1  the probability of horse A winning is

Odds ratio  Odds of disease among the exposed persons = P(disease | exposed) / (1-P(disease | exposed)) = [ a / (a + c) ] / [ c / (a + c) ] = a/c  Odds of disease among the unexposed persons = P(disease | unexposed) / (1-P(disease | unexposed)) = [ b / (b + d) ] / [ d / (b + d) ] = b/d  Odds ratio = a/c / b/d = ad/bc Exposure Disease+-Total +aba+b -cdc+d Totala+cb+dn=a+b+c+d 46

Odds ratio note Note that the odds ratio is also equal to [ P(exposed | disease)/(1-P(exposed |disease) ] / [ P(exposed | no disease)/(1-P(exposed | no disease) ] This is needed for case-control studies in which the proportion with disease is fixed (so you can’t calculate the odds of disease) 47

Interpretation of ORs and RRs If the OR or RR equal 1, then there is no effect of exposure on disease. If the OR or RR >1 then disease is increased in the presence of exposure. (Risk factor) If the OR or RR <1 then disease is decreased in the presence of exposure. (Protective factor) 48

Comparison of measures of association  When a disease is rare, i.e. the risk is <10%, the odds ratio approximates the risk ratio  The odds ratio overestimates the risk ratio  Why use it? – statistical properties, usefulness in case- control studies 49

The association of having at least one cold with gender tab coldany sex | sex coldany | Male Female | Total | | | | Total | | 535 What is the (estimated) odds ratio? 50

95% Confidence interval for an odds ratio Remember the 95% confidence interval for a mean µ Lower Confidence Limit: Upper Confidence Limit: The odds ratio is not normally distributed (it ranges from 0 to infinity) – But the natural log (ln) of the odds ratio is approximately normal – The estimate of the standard error of the estimated ln OR is 51

95% Confidence interval for an odds ratio We calculate the 95% confidence interval for the log odds Then exponentiate back to obtain the 95% confidence interval for the OR 52

Calculating an odds ratio and 95% confidence interval in Stata using tabodds command Tabodds outcomevar exposurevar, or. tabodds coldany sex, or sex | Odds Ratio chi2 P>chi2 [95% Conf. Interval] Male | Female | Test of homogeneity (equal odds): chi2(1) = 0.40 Pr>chi2 = Score test for trend of odds: chi2(1) = 0.40 Pr>chi2 =

Calculating an odds ratio and 95% confidence interval in Stata using cc command. cc coldany sex Proportion | Exposed Unexposed | Total Exposed Cases | | Controls | | Total | | | | | Point estimate | [95% Conf. Interval] | Odds ratio | | (exact) Attr. frac. ex. | | (exact) Attr. frac. pop | | chi2(1) = 0.40 Pr>chi2 = Exact confidence intervals use the hypergeometric distribution

Odds ratio for matched pairs The odds ratio is r/s The standard error of ln(OR) is So the 95% confidence interval for the estimated OR is 55

For next time Read Pagano and Gauvreau – Pagano and Gauvreau Chapter 15 (review) – Pagano and Gauvreau Chapter 17