1
2 Two-samples tests, X 2 Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University
3 Z-test (two independent proportions) P1= proportion in the first group P2= proportion in the second group n1= first sample size n2= second sample size
4 Critical z = 1.96 at 5% level of significance 2.58 at 1% level of significance
5 Researchers wished to know if urban and rural adult residents of a developing country differ with respect to prevalence of a certain eye disease. A survey revealed the following information Total Eye disease Residence NoYes Rural Urban Test at 5% level of significance, the difference in the prevalence of eye disease in the 2 groups Example
6 P1 = 24/300 = 0.08 p2 = 15/500 = > Z* The difference is statistically significant Answer
7 t-Test (two independent means) = mean of the first group = mean of the second group S 2 p = pooled variance
8 Critical t from table is detected at degree of freedom = n 1 + n level of significance 1% or 5%
9 Sample of size 25 was selected from healthy population, their mean SBP =125 mm Hg with SD of 10 mm Hg. Another sample of size 17 was selected from the population of diabetics, their mean SBP was 132 mmHg, with SD of 12 mm Hg. Test whether there is a significant difference in mean SBP of diabetics and healthy individual at 1% level of significance Example
10 S 1 = 12 S 2 =11 State H0 H 0 : 1 = 2 State H1 H 1 : 1 2 Choose α α = 0.01 Answer
11 Critical t at df = 40 & 1% level of significance = 2.58 Decision: Since the computed t is smaller than critical t so there is no significant difference between mean SBP of healthy and diabetic samples at 1 %. Answer
12 Degrees of freedom Probability (p value)
13 Paired t- test (t- difference) Uses: To compare the means of two paired samples. Example, mean SBP before and after intake of drug.
14 di = difference (after-before) Sd = standard deviation of difference n = sample size Critical t from table at df = n-1
15 The following data represents the reading of SBP before and after administration of certain drug. Test whether the drug has an effect on SBP at 1% level of significance. Example
16 SBP (After) SBP (Before) Serial No
17 di 2 di After-Before BP After BP Before Serial No Total ∑ di2∑di Answer
18 Answer
19 Critical t at df = 6-1 = 5 and 1% level of significance = Decision: Since t is < critical t so there is no significant difference between mean SBP before and after administration of drug at 1% Level. Answer
20 Degrees of freedom Probability (p value)
21 Chi-Square test It tests the association between variables... The data is qualitative. It is performed mainly on frequencies. It determines whether the observed frequencies differ significantly from expected frequencies.
22 Where E = expected frequency O = observed frequency
23 Critical X 2 at df = (R-1) ( C -1) Where R = raw C = column I f 2 x 2 table X 2* = 3.84 at 5 % level of significance X 2* = 6.63 at 1 % level of significance
24
25 In a study to determine the effect of heredity in a certain disease, a sample of cases and controls was taken: Total Disease Family history ControlCases Positive Negative Total Using 5% level of significance, test whether family history has an effect on disease Example
26 X 2 = (80-88) 2 /88 + ( ) 2 /112 + ( ) 2 /132 + ( ) 2 /168 = < 3.84 Association between the disease and family history is not significant Total Disease Family history ControlCases positive O E Negative O E Total Answer
27 The odds ratio was developed to quantify exposure – disease relations using case- control data Once you have selected cases and controls ascertain exposure Then, cross-tabulate data to form a 2-by-2 table of counts
28 2-by-2 Crosstab Notation Disease +Disease -Total Exposed +ABA+B Exposed -CDC+D TotalA+CB+DA+B+C+D Disease status A+C = no. of cases B+D = no. of non-cases Exposure status A+B = no. of exposed individuals C+D = no. of non-exposed individuals
29 Disease +Disease - Exposed +AB Exposed -CD Cross-product ratio
30 Exposure variable = Smoking Disease variable = Hypertension D+D- E+3071 E-122 Total3193 Example
31 Interpretation of the Odds Ratio Odds ratios are relative risk estimates Relative risk are risk multipliers The odds ratio of 9.3 implies 9.3× risk with exposure
32 No association OR < 1 OR = 1 OR > 1 Positive association Higher risk Negative association Lower risk (Protective)
33 In the previous example OR = % CI is 1.20 – 72.14
34 Multiple Levels of Exposure Smoking levelCasesControls Heavy smokers Moderate smokers61147 Light smokers1482 Non-smokers8115 Total296618
35 Multiple Levels of Exposure k levels of exposure break up data into (k – 1) 2 2 tables Compare each exposure level to non-exposed e.g., heavy smokers vs. non-smokers CasesControls Heavy smokers Non-smokers8115
36 Multiple Levels of Exposure Smoking levelCasesControls Heavy smokers213274OR 3 =(213)(115)/(274)(8)=11.2 Moderate smokers61147OR 2 =(61)(115)/(147)(8) = 6.0 Light smokers1482OR 1 =(14)(115)/(82)(8) = 2.5 Non-smokers8115 Total Notice the trend in OR (dose-response relationship)
37 Small Sample Size Formula For the Odds Ratio It is recommend to add ½ to each cell before calculating the odds ratio when some cells are zeros D+D- E+3171 E-022 Total3193 OR Small Sample = (A+0.5)(D+0.5) (B+0.5)(C+0.5) OR Small Sample = (31+0.5)(22+0.5) =19.8 (71+0.5)(0+0.5)
38