Download presentation
Presentation is loading. Please wait.
Published byGabriella Fisher Modified over 9 years ago
1
1 Introduction to Modeling Beyond the Basics (Chapter 7)
2
2 Content Simple and multiple linear regression Simple logistic regression –The logistic function –Estimation of parameters –Interpretation of coefficients Multiple logistic regression –Interpretation of coefficients –Coding of variables
3
3 How can we analyse these data? Table 1 Age and systolic blood pressure (SBP) among 33 adult women
4
4 SBP (mm Hg) Age (years) adapted from Colton T. Statistics in Medicine. Boston: Little Brown, 1974
5
5 Simple linear regression Relation between 2 continuous variables (SBP and age) Regression coefficient 1 –Measures association between y and x –Amount by which y changes on average when x changes by one unit –Least squares method y x Slope
6
6 Multiple linear regression Relation between a continuous variable and a set of i continuous or categorical variables Partial regression coefficients i –Amount by which y changes on average when x i changes by one unit and all the other x i s remain constant –Measures association between x i and y adjusted for all other x i Example –SBP versus age, weight, height, etc
7
7 Multiple linear regression Dependent Independent variables Predicted Predictor variables Response variable Explanatory variables Outcome variable Covariables
8
8 Multivariate analysis Model Outcome Linear regression continous Poisson regression counts Cox model survival Logistic regression binomial...... Choice of the tool according to study, objectives, and the variables –Control of confounding –Model building, prediction
9
9 Logistic regression Models the relationship between a set of variables x i –dichotomous (eat : yes/no) –categorical (social class,... ) –continuous (age,...) and –dichotomous variable Y Dichotomous (binary) outcome most common situation in biology and epidemiology
10
10 How can we analyse these data? Table 2 Age and signes of Coronary Heart Disease (CHD), 33 women CHD
11
11 How can we analyse these data? Comparison of the mean age of diseased and non-diseased women –Non-diseased: 38.6 years –Diseased: 58.7 years (p<0.0001) Linear regression?
12
12 Dot-plot: Data from Table 2
13
13 NO YES Y = -0.527 + 0.20 x AGE
14
14 Table 3 - Prevalence (%) of signs of CHD according to age group
15
15 20-2930-3940-4950-5960-6970-7980-89 Dot-plot: Data from Table 3
16
16 Dot-plot: Data from Table 3 Diseased % Age (years) P 1-P
17
17 Dot-plot: Data from Table 3 Diseased % Age (years)
18
18 The logistic function (2) logit of P(y|x) {
19
19 The logistic function (1) Probability of disease x
20
20 The logistic function (2) logit of P(y|x) {
21
21 The logistic function (3) Advantages of the logit –Simple transformation of P(y|x) –Linear relationship with x –Can be continuous (Logit between - to + ) –Known binomial distribution (P between 0 and 1) –Directly related to the notion of odds of disease
22
22 Interpretation of (1)
23
23 Practice 1. MI and Hyperhomocysteinemia? Hyper Homocysteinemia noyesTotal control622183 case424183 Total10462166
24
24 Practice 1 Normal Homocysteine High Homocysteine MI (%)40.3866.13 Odds0.681.95 Ln(Odds)-0.390.67
25
25 Normal HC X = 0 ln(Odds)= + x 0 = ln(Odds) ……. = -0.39 High HC X=1 ln(Odds)= + x 1 = ln(Odds)- ……. = 0.67 - (-0.39) = 1.06 OR ? = e = 2.88 SE = 0.33 How can you interpret /OR?
26
26 Interpretation of (2) = increase in log-odds for a one unit increase in x Test of the hypothesis that =0 (Wald test) Interval testing OR
27
27 If you run Linear Regression … Y =.04 + 0.257 x High HC % MI in High HC = 66.13 % MI in Normal HC = 40.38 Diff = 25.7 % 1 What is your interpretation about 1 ?
28
28 Example Age (<55 and 55+ years) and risk of developing coronary heart disease (CD)
29
29 Results of fitting Logistic Regression Model
30
30 Interpretation of (1)
31
31 Multiple logistic regression More than one independent variable –Dichotomous, ordinal, nominal, continuous … Interpretation of i –Increase in log-odds for a one unit increase in x i with all the other x i s constant –Measures association between x i and log-odds adjusted for all other x i
32
32 Multiple logistic regression Effect modification –Can be modelled by including interaction terms
33
33 Reference Hosmer DW, Lemeshow S. Applied logistic regression.Wiley & Sons, New York, 1989
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.