1 Introduction to Modeling Beyond the Basics (Chapter 7)

Slides:



Advertisements
Similar presentations
Introduction to Logistic Regression Rachid Salmi, Jean-Claude Desenclos, Thomas Grein, Alain Moren.
Advertisements

1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Logistic Regression Psy 524 Ainsworth.
Third training Module, EpiSouth: Multivariate analysis, 15 th to 19 th June 20091/29 Multivariate analysis: Introduction Third training Module EpiSouth.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Logistic Regression.
Departments of Medicine and Biostatistics
Overview of Logistics Regression and its SAS implementation
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Julius Center.nl Julius Center.nl Health Sciences and Primary Care Estimating additive interaction between continuous determinants M.J. Knol, I. van der.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Analysis of Complex Survey Data Day 3: Regression.
Generalized Linear Models
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Regression and Correlation
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Logistic Regression. Outline Review of simple and multiple regressionReview of simple and multiple regression Simple Logistic RegressionSimple Logistic.
Education 795 Class Notes Applied Research Logistic Regression Note set 10.
Simple Linear Regression
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Introduction to Logistic Regression Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
Linear correlation and linear regression + summary of tests
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
1 GCRC Data Analysis with SPSS Workshop Session 5 Follow Up on FEV data Binary and Categorical Outcomes 2x2 tables 2xK tables JxK tables Logistic Regression.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Chapter 12: Correlation and Linear Regression 1.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
Logistic Regression Analysis Gerrit Rooks
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Nonparametric Statistics
Analysis of matched data Analysis of matched data.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
BINARY LOGISTIC REGRESSION
Logistic Regression APKC – STATS AFAC (2016).
Advanced Quantitative Techniques
Advanced Quantitative Techniques
Generalized Linear Models
Analysis of Data Graphics Quantitative data
Logistic regression One of the most common types of modeling in the biomedical literature Especially case-control studies Used when the outcome is binary.
Introduction to logistic regression a.k.a. Varbrul
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Introduction to Logistic Regression
Logistic Regression.
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Önder Ergönül, MD, MPH Koç University, School of Medicine
Presentation transcript:

1 Introduction to Modeling Beyond the Basics (Chapter 7)

2 Content Simple and multiple linear regression Simple logistic regression –The logistic function –Estimation of parameters –Interpretation of coefficients Multiple logistic regression –Interpretation of coefficients –Coding of variables

3 How can we analyse these data? Table 1 Age and systolic blood pressure (SBP) among 33 adult women

4 SBP (mm Hg) Age (years) adapted from Colton T. Statistics in Medicine. Boston: Little Brown, 1974

5 Simple linear regression Relation between 2 continuous variables (SBP and age) Regression coefficient  1 –Measures association between y and x –Amount by which y changes on average when x changes by one unit –Least squares method y x Slope

6 Multiple linear regression Relation between a continuous variable and a set of i continuous or categorical variables Partial regression coefficients  i –Amount by which y changes on average when x i changes by one unit and all the other x i s remain constant –Measures association between x i and y adjusted for all other x i Example –SBP versus age, weight, height, etc

7 Multiple linear regression Dependent Independent variables Predicted Predictor variables Response variable Explanatory variables Outcome variable Covariables

8 Multivariate analysis Model Outcome Linear regression continous Poisson regression counts Cox model survival Logistic regression binomial Choice of the tool according to study, objectives, and the variables –Control of confounding –Model building, prediction

9 Logistic regression Models the relationship between a set of variables x i –dichotomous (eat : yes/no) –categorical (social class,... ) –continuous (age,...) and –dichotomous variable Y Dichotomous (binary) outcome most common situation in biology and epidemiology

10 How can we analyse these data? Table 2 Age and signes of Coronary Heart Disease (CHD), 33 women CHD

11 How can we analyse these data? Comparison of the mean age of diseased and non-diseased women –Non-diseased: 38.6 years –Diseased: 58.7 years (p<0.0001) Linear regression?

12 Dot-plot: Data from Table 2

13 NO YES Y = x AGE

14 Table 3 - Prevalence (%) of signs of CHD according to age group

Dot-plot: Data from Table 3

16 Dot-plot: Data from Table 3 Diseased % Age (years) P 1-P

17 Dot-plot: Data from Table 3 Diseased % Age (years)

18 The logistic function (2) logit of P(y|x) {

19 The logistic function (1) Probability of disease x

20 The logistic function (2) logit of P(y|x) {

21 The logistic function (3) Advantages of the logit –Simple transformation of P(y|x) –Linear relationship with x –Can be continuous (Logit between -  to +  ) –Known binomial distribution (P between 0 and 1) –Directly related to the notion of odds of disease

22 Interpretation of  (1)

23 Practice 1. MI and Hyperhomocysteinemia? Hyper Homocysteinemia noyesTotal control case Total

24 Practice 1 Normal Homocysteine High Homocysteine MI (%) Odds Ln(Odds)

25 Normal HC  X = 0  ln(Odds)=  +  x 0   = ln(Odds) …….  = High HC  X=1  ln(Odds)=  +  x 1   = ln(Odds)-  …….  = (-0.39) = 1.06 OR ? = e  = 2.88 SE  = 0.33 How can you interpret  /OR?

26 Interpretation of  (2)  = increase in log-odds for a one unit increase in x Test of the hypothesis that  =0 (Wald test) Interval testing OR

27 If you run Linear Regression … Y = x High HC % MI in High HC = % MI in Normal HC = Diff = 25.7 %  1 What is your interpretation about  1 ?

28 Example Age (<55 and 55+ years) and risk of developing coronary heart disease (CD)

29 Results of fitting Logistic Regression Model

30 Interpretation of  (1)

31 Multiple logistic regression More than one independent variable –Dichotomous, ordinal, nominal, continuous … Interpretation of  i –Increase in log-odds for a one unit increase in x i with all the other x i s constant –Measures association between x i and log-odds adjusted for all other x i

32 Multiple logistic regression Effect modification –Can be modelled by including interaction terms

33 Reference Hosmer DW, Lemeshow S. Applied logistic regression.Wiley & Sons, New York, 1989