1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.

Slides:



Advertisements
Similar presentations
Brief introduction on Logistic Regression
Advertisements

Logistic Regression Psy 524 Ainsworth.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Logistic Regression.
Chi Square Tests Chapter 17.
Simple Logistic Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Introduction to Categorical Data Analysis
Log-linear Analysis - Analysing Categorical Data
Logistic Regression STA302 F 2014 See last slide for copyright information 1.
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Analyzing quantitative data – section III Week 10 Lecture 1.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Logistic Regression Chapter 8.
Logistic Regression In logistic regression the outcome variable is binary, and the purpose of the analysis is to assess the effects of multiple explanatory.
Regression and Correlation
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
 Catalogue No: BS-338  Credit Hours: 3  Text Book: Advanced Engineering Mathematics by E.Kreyszig  Reference Books  Probability and Statistics by.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
CHAPTER 14 MULTIPLE REGRESSION
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
Logistic Regression STA2101/442 F 2014 See last slide for copyright information.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Lecture 12: Cox Proportional Hazards Model
Multiple Correlation and Regression
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Lecture 4 Ways to get data into SAS Some practice programming
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Nonparametric Statistics
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
Analysis of matched data Analysis of matched data.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Stats 242.3(02) Statistical Theory and Methodology.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
Logistic Regression APKC – STATS AFAC (2016).
Chapter 13 Nonlinear and Multiple Regression
Basic Statistics Overview
Multiple logistic regression
Nonparametric Statistics
Introduction to Logistic Regression
Logistic Regression.
Presentation transcript:

1 Chapter 16 logistic Regression Analysis

2 Content Logistic regression Conditional logistic regression Application

3 Purpose: Work out the equations for logistic regression which are used to estimate the dependent variable (outcome factor) from the independent variables (risk factors). Logistic regression is a kind of nonlinear regression. Data: 1.The dependent variable is a binary categorical variable that has two values such as "yes" and "no“. 2.All of the independent variables, at least, most of which should be categories. Of course, some of them can be numerical variable. The categorical variable should be quantified.

4 Implication: Logistic regression can be used to study the quantitative relations between the happening of some diseases or phenomena and many risk factors. There are some demerits to use test (or u test ): 1. can only study one risk factor. 2. can only educe the qualitative conclusion.

5 Category: 1.Between-subjects (non-conditional) logistic regression equation 2. Paired (conditional) logistic regression equation

6 § 1 logistic regression ( non-conditional logistic regression )

7 I Basic Conception The probability of positive outcome under the function of m independent variables can be marked like this:

8 If: Regression model Probability: P : 0 ~ 1 , logitP :- ∞ ~ ∞ 。 Scale:

9 Figure 16-1 the figure of logistic function

10 The meaning of model parameter By constant we mean the natural logarithm of likelihood ratio between happening and non-happening when exposure dose is zero. By regression coefficient we mean the change of logitP when the independent variable changes by one unit.

11 The statistical indicator--odds ratio which is used to measure the function of risk factor in the epidemiology,the formula of computation is: Odds ratio (OR)

12 The relationship with logistic P :

13

14 II the parametric estimation of logistic regression model 1. parametric estimation Theory : the estimation of likelihood

15 2.Estimation of OR It can show the OR of two different levels ( c 1 , c 0 ) of one factor.

16 e.g.: 16-1 Table 16-1 is a case-control data which is used to study the relations among smoking 、 drinking and esophagus cancer, please try running logistic regression analysis. Definite every variable’s code

17 Table16-1 the case-control data of the relation between smoking and esophagus cancer

18 Results: 95  confidence interval of The OR of smoking and nonsmoking : The OR of drinking and no drinking

19 III the hypothesis test of logistic regression model 1. Likelihood test 2. Wald test comparing the estimations of parameters with zero, the control is its standard error, statistics are: Both of are more than 3.84, that is to say that esophagus cancer 、 smoking and drinking have relations with each other. The conclusion is same as above.

20 methods : forward selection 、 backward elimination and stepwise regression. Test statistics : it is not F statistic , but one of likelihood 、 Wald test and score test statistics. IV variable selection e.g.: 16-2 In order to discuss the risk factors that relate to coronary heart disease, to take case-control study on 26 coronary heart disease patients and 28 controllers, table 16-2 and table 16-3 show the definition of all factors and the data. Please try using logistic stepwise regression to select the risk factors.

21 Table 16-2 eight probable risk factors of coronary heart disease and valuation

22 Table 16-3 the case-control data of heart disease’s risk factors

23 Table 16-4 e.g.16-2 the independent variables which are entering equation and estimations of related parameters Learn how to see the results !

24

25 Content Logistic regression Conditional logistic regression Application

26 I Principle §2 conditional logistic regression

27 Table 16-5 the data format of 1: M conditional logistic regression * t = 0 is the case and the others are the control.

28 The model of conditional logistic

29 II applied example

30 Table 16-7 the data table of 1:2 paired case-control study about larynx cancer P344:

31 Table16-8 e.g.16-3 The Estimation of independent variables and related parameters which have entered the equation Using stepwise Six risk factors variable selection four factors enter equation , Table16-9 shows the results 。

32 Content Logistic regression Conditional logistic regression Application

33 I the application of logistic regression 1 . The analysis of epidemiologic risk factors One feature of logistic regression is that the meaning of parameter is clear, so logistic regression is suitable for epidemiologic study. § 3 the application of logistic regression and the notice

34 2 . Analysis of clinical experiment The goal of clinical experiment is to assess the effect of some drugs or cure methods, if there are some confounding factors, and they are not balance among teams, the final results will be wrong. So it is necessary to adjust these factors during the process of analysis. when dependent variable is binary, we can use logistic regression to analyze and get the adjusted results.

35 3 . Analyze dose–response of drugs or poisons In the studies about dose–response of some drugs or poisons, if the date is the logarithm of dose,the Probability distribution close to normal. The distribution of normal function is very similar to logistic regression, then we can express their relation through the following model. (While P is the positive rate; X is dose.)

36 4 . Forecast and discrimination logistic regression is a model of probability , so we can use it to predict the probability of something. For example in clinical we can discriminate the probability of some diseases under some index. please refer to the chapter 18 about discrimination.

37 II the notice of application of logistic regression

38 summary : Purpose: Work out the equations for logistic regression which are used to estimate the dependent variable (outcome factor) from the independent variable (risk factor). Logistic regression belong to probability type and nonlinear regression. Data: 1.The dependent variable is a binary categorical variable that has two values such as "yes" and "no“. 2.All of the independent variables, at least, most of which should be categories. Of course, some of them can be numerical variable. The categories variable should be measure by number.

39 Implication: Logistic regression can be used to study the quantitative relations between the happening of some disease or phenomena and many risk factors Category: 1.Between-subjects (non-conditional) logistic regression equation 2. Paired (conditional) logistic regression equation

40 Thinking : In order to analysis the influent factors of the rescue of AMI patients, a hospital collected five years’ data of AMI patients (there are many related factors,this case only lists three ones for the limited space), which has 200 cases in total, the data has been shown in the following table, P=0 means successful rescue , P=1 means death ; X 1 =1 means shock before rescue , X 1 =0 means no shock before rescue ; X 2 =1 means heart failure before rescue , X 2 =0 means no heart failure before rescue ; X 3 =1 means that it has been more than 12 hours from the beginning of AMI symptom to rescue , X 3 =0 means the time has not passed 12 hours. which analysis method is the best one? why? which analysis method is the best one? why? which result can we got ? which result can we got ?

41 The data of the rescue risk factor of the AMI patients The data of the rescue risk factor of the AMI patients P=0(successfully rescued) P=1(death) X1X2X3N X1X2X3N