Logistic Regression Applications Hu Lunchao. 2 Contents 1 1 What Is Logistic Regression? 2 2 Modeling Categorical Responses 3 3 Modeling Ordinal Variables.

Slides:



Advertisements
Similar presentations
Dummy Dependent variable Models
Advertisements

Continued Psy 524 Ainsworth
Statistical Analysis SC504/HS927 Spring Term 2008
Qualitative predictor variables
Lecture 11 (Chapter 9).
Logistic Regression Psy 524 Ainsworth.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Multiple Regression Models: Interactions and Indicator Variables
FIN822 Li11 Binary independent and dependent variables.
Logistic Regression.
Simple Logistic Regression
1 9. Logistic Regression ECON 251 Research Methods.
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
EPI 809/Spring Multiple Logistic Regression.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
An Introduction to Logistic Regression
Generalized Linear Models
Logistic Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
STAT E-150 Statistical Methods
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Unit 4b: Fitting the Logistic Model to Data © Andrew Ho, Harvard Graduate School of EducationUnit 4b – Slide 1
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Logistic Regression. Outline Review of simple and multiple regressionReview of simple and multiple regression Simple Logistic RegressionSimple Logistic.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Logistic Regression- Dichotomous Dependent Variables March 21 & 23, 2011.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Introduction to Logistic Regression Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein.
Applied Epidemiologic Analysis - P8400 Fall 2002
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Logistic and Nonlinear Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
1 Chapter 1: Stratified Data Analysis 1.1 Introduction 1.2 Examining Associations among Variables 1.3 Recursive Partitioning 1.4 Introduction to Logistic.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Chapter 13 Multiple Regression
1 Chapter 2: Logistic Regression and Correspondence Analysis 2.1 Fitting Ordinal Logistic Regression Models 2.2 Fitting Nominal Logistic Regression Models.
Logistic Regression. Linear Regression Purchases vs. Income.
Warsaw Summer School 2015, OSU Study Abroad Program Advanced Topics: Interaction Logistic Regression.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
© Department of Statistics 2012 STATS 330 Lecture 19: Slide 1 Stats 330: Lecture 19.
1 Everyday is a new beginning in life. Every moment is a time for self vigilance.
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Logistic Regression Categorical Data Analysis.
Logistic Regression and Odds Ratios Psych DeShon.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.1 Using Several Variables to Predict a Response.
Analysis of matched data Analysis of matched data.
Logistic Regression Logistic Regression - Binary Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Chapter 8 Multivariate Regression Analysis 8.3 Multiple Regression with K Independent Variables 8.4 Significance tests of Parameters.
Logistic Regression When and why do we use logistic regression?
Logistic Regression.
Notes on Logistic Regression
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Multiple logistic regression
Logistic Regression.
Introduction to Logistic Regression
Chapter 13 Multiple Regression
Regression and Categorical Predictors
Logistic Regression.
Presentation transcript:

Logistic Regression Applications Hu Lunchao

2 Contents 1 1 What Is Logistic Regression? 2 2 Modeling Categorical Responses 3 3 Modeling Ordinal Variables 4 4 Matched Case-Control Studies

An Example 3

4

5

6 What Is Logistic Regression? Logistic Function: Logit transformation: The variance:

7 Modeling Categorical Responses Logistic regression model for binary response variables. For instance, logistic regression can model A voter's choice in a presidential election, with predictor variables political ideology, annual income, education level, and religious affiliation. Whether a person uses illegal drugs (yes or no), with predictors education level, whether employed, religiosity, marital status, and annual income.

Modeling Categorical Responses 8 Income and Having Travel Credit Cards

Modeling Categorical Responses 9 Let x = annual income and y = whether have a travel credit card (1 = yes, 0 = no). Software provides results shown in Table Fit of Logistic Regression Model for Italian Credit Card Data betaS.E.Exp(beta) income Constant

Modeling Categorical Responses 10 The logistic prediction equation is

Modeling Categorical Responses 11 Interpretation for beta: The sign of beta tells us whether it is increasing or decreasing as x increases ; The curve has slope beta*P(y = 1)[1-P(y = 1)] , where P(y = 1) is the probability at that point.

12 This exponential relationship implies that every unit increase in x has a multiplicative effect of e beta on the odds. Interpretation Using the Odds and Odds Ratio

Modeling for Ordinal Variables 13 Many applications have a categorical response variable with more than two categories. For instance, the General Social Survey recently asked subjects whether government spending on the environment should increase, remain the same, or decrease

Cumulative Probabilities 14 Let y denote an ordinal response variable. Let P(y <=j) denote the probability that the response falls in category j or below. With four categories, for example, the cumulative probabilities are

Cumulative Logits 15 A c-category response has c cumulative probabilities With c = 4, for example, the logits are

Cumulative Logit Models for an Ordinal Response 16 The model requires a separate intercept parameter j for each cumulative probability. Since the cumulative probabilities increase as j increases, so do. The effect of an explanatory variable on all the cumulative probabilities for y should be the same, so beta keeps unchanged.

17 Software estimates the parameters using all the cumulative probabilities at once.

Do Republicans tend to be more conservative than Democrats? 18 Let x be a dummy variable for party affiliation, with x = 1 for Democrats and x = 0 for Republicans.

19 Democrats tending to be more liberal than Republicans

Matched Case-Control Studies 20 Case-control studies are used to identify factors that may contribute to a medical condition by comparing subjects who have that condition (the 'cases') with patients who do not have the condition but are otherwise similar. Within each stratum, samples of cases(y=1) and controls(y=0) are chosen. The number of cases and controls need not be constant across strata, but the most common matched designs include one case and form one to five controls per stratum and are thus referred to as 1-M matched studies.

21 In a case–control study, the logistic regression outcome variable Y is binary; here Y = 1 for cases and Y = 0 for controls. Then, it is written as:

ML Approaches to Estimate beta: unconditional and conditional 22 The unconditional likelihood is where n is the total number of individuals.

ML Approaches to Estimate beta: unconditional and conditional 23 The conditional likelihood is where m is the number of cases, and the sum in the denominator is over the C m n possibilities of dividing the numbers from 1 to n into one group {1, 2,...,m} of size m and its complement {l m+1,...,l n }.

What Factors Attribute to Low-birth-weight 24 Dataset : 189 women, of whom 59 had low-birth -weight babies and 130 had normal-weight babies Risk factors: weight in pounds at the last menstrual period (LWT), presence of hypertension (HT), smoking status during pregnancy (Smoke), presence of uterine irritability (UI). The woman’s age (Age),used as the matching variable.

Data Input 25 The variable Low is used to determine whether the subject is a case (Low=1, low-birth-weight baby) or a control (Low=0, normal-weight baby). The dummy time variable Time takes the value 1 for cases and 2 for controls

Conditional logistic regression Result 26 The variable Time is the response, and Low is the censoring variable. The matching variable Age is used in the STRATA statement so that each unique age value defines a stratum.

27

28 Results indicate that women were more likely to have low-birth- weight babies if they were underweight in the last menstrual cycle, were hypertensive, smoked during pregnancy, or suffered uterine irritability

29 THANKS FOR YOUR ATTENTION!