Logistic Regression. Conceptual Framework - LR Dependent variable: two categories with underlying propensity (yes/no) (absent/present) Independent variables:

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Continued Psy 524 Ainsworth
Sociology 680 Multivariate Analysis Logistic Regression.
Brief introduction on Logistic Regression
Logistic Regression Psy 524 Ainsworth.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Logistic Regression.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Multipe and non-linear regression. What is what? Regression: One variable is considered dependent on the other(s) Correlation: No variables are considered.
N-way ANOVA. 3-way ANOVA 2 H 0 : The mean respiratory rate is the same for all species H 0 : The mean respiratory rate is the same for all temperatures.
QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Social Research Methods
An Introduction to Logistic Regression
Logistic Regression Source
Data Analysis Statistics. Inferential statistics.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Multinomial Logistic Regression Basic Relationships
Logistic Regression Chapter 8.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Logistic Regression.
Wednesday PM  Presentation of AM results  Multiple linear regression Simultaneous Simultaneous Stepwise Stepwise Hierarchical Hierarchical  Logistic.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Hierarchical Binary Logistic Regression
Multinomial Logistic Regression Basic Relationships
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Logistic Regression Database Marketing Instructor: N. Kumar.
Logistic regression A quick intro. Why Logistic Regression? Big idea: dependent variable is a dichotomy (though can use for more than 2 categories i.e.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
AN INTRODUCTION TO LOGISTIC REGRESSION ENI SUMARMININGSIH, SSI, MM PROGRAM STUDI STATISTIKA JURUSAN MATEMATIKA UNIVERSITAS BRAWIJAYA.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
Logistic (regression) single and multiple. Overview  Defined: A model for predicting one variable from other variable(s).  Variables:IV(s) is continuous/categorical,
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
I271B The t distribution and the independent sample t-test.
Lecture 12: Cox Proportional Hazards Model
Logistic Regression. Linear Regression Purchases vs. Income.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Logistic Regression Analysis Gerrit Rooks
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Nonparametric Statistics
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Logistic Regression: Regression with a Binary Dependent Variable.
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
Logistic Regression When and why do we use logistic regression?
Logistic Regression APKC – STATS AFAC (2016).
Political Science 30: Political Inquiry
CH 5: Multivariate Methods
Social Research Methods
Multiple logistic regression
Nonparametric Statistics
Logistic Regression.
Regression Part II.
Presentation transcript:

Logistic Regression

Conceptual Framework - LR Dependent variable: two categories with underlying propensity (yes/no) (absent/present) Independent variables: Continuous Categorical (Treat the same way as linear regression)

Maximum Likelihood Equations to difficult to solve Have to “plug in” numbers until you minimize the error Minimizing error is the same thing as maximizing the likelihood Can solve equations that cannot be algebraically solved, but have some kind of best solution in theory.

Two Categories Natural Categories vs. Underlying Propensity Categories and Attenuation Logistic infers the variance that is removed by shrinking the categories to two

The Odds Ratio

The Logistic Model

Logistic Equation

Coefficients Same as linear regression: 1.Make the best weight for each variable. We call these B coefficients 2.Figure out how good your guess for B is based on the data. We call this the standard error. Different from linear regression: 1.Interpretation of B: need to take the exp(B) [because we are using a different relationship to Y. 2.We are asking whether exp(B) is different than 1. (And we have to pick between the likelihood ratio test and Wald)

Test of Coefficients Wald Test: Just using the Maximum Likelihoood standard error to ask questions about the parameters. Very analogous to the tests used on coefficients in linear regression Likelihood ratio test This is a generalized way of testing an arbitrary hypothesis in maximum likelihood using a chi-squared distribution. These two tests are asymptotically equivalent (i.e. as you sample size goes to infinity). Both are really just testing if exp(B) is different from 1 and they usually agree

Interpretation of LR Omnibus test: Chi-square “Do these variables, collectively, do a better job than the mean only model?” Explain variance: Nagelkerke R 2 Maximum likelihood version of R-squared: “What percentage of the model misfit have I explained?” Model fit: Hosmer & Lemeshow test “Is this model correctly classifying the number of cases you would expect?” Classification: Percentage Correct This will make sense to a lot of people, but might not be the statistical measure you are looking for.

Interpretation of Odds Ratio Odds Ratio (OR) or Exp(B) This is the odds of the event occurring divided by the odds of the event not occurring: For example, and OR of 1.2 means that for every unit increase in the independent variable, the dependent variable is 1.2 times more likely to occur. (or 20% more likely to occur) An OR more than one means the DV is more likely to occur with increases in the IV An OR less than one means the DV is less likely to occur with increases in the IV

Some Variations Binary LR: DV has 2 attributes IVs can be categorical or continuous Multinomial LR: DV has 3 or more attributes (not ordered) IVs can be categorical or continuous Ordinal LR: DV has 3 or more ordered attributes IVs can be categorical or continuous

Example of LR Presentation Research question: What is the effect of goal attainment, social integration, academic integration, and sex on extrinsic motivation?

Introduction Extrinsic motivation – going to college to receive external rewards, such as increased finances & status (high=1; low=2). Goal attainment – the development of educational goals, such as grade attainment (scale 1-7). Academic integration – college integration related to academic success, such as doing homework, studying, asking professors questions (scale 1-7). Social integration – college integration related to social success, such as making friends or ‘hanging out’ at school (scale 1-7).

Binary Logistic Regression Binary Logistic Regression of Goal Attainment, Social & Academic Integration, & Sex on Extrinsic Motivation VariableBWaldExp(B) Goal Social * Academic ** Sex *** (M=1; F=0) Constant4.142 χ 2 = 50.65, df = 4, p <.001; n=337; Nagelkerke R 2 =.213 High Extrinsic Motivation = 1; Low Extrinsic Motivation = 2; p<.001; *p<.05; **p<.01; ***p<.001

Discussion Multivariate analyses suggests that college students who: were socially integrated were academically integrated and male were more likely of being highly extrinsically motivated