Logit & Probit Regression

Slides:



Advertisements
Similar presentations
Prof. Navneet Goyal CS & IS BITS, Pilani
Advertisements

Brief introduction on Logistic Regression
Logistic Regression Psy 524 Ainsworth.
Simple Logistic Regression
L OGISTIC R EGRESSION Now with multinomial support!
Multinomial Logit Sociology 8811 Lecture 11 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
1Prof. Dr. Rainer Stachuletz Limited Dependent Variables P(y = 1|x) = G(  0 + x  ) y* =  0 + x  + u, y = max(0,y*)
In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions.

QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Multinomial Logistic Regression
January 6, afternoon session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
FIN357 Li1 Binary Dependent Variables Chapter 12 P(y = 1|x) = G(  0 + x  )
Ordinal Logistic Regression
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Log-linear and logistic models
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
In previous lecture, we dealt with the unboundedness problem of LPM using the logit model. In this lecture, we will consider another alternative, i.e.
An Introduction to Logistic Regression
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Generalized Linear Models
Ordinal Logistic Regression “Good, better, best; never let it rest till your good is better and your better is best” (Anonymous)
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Methods Workshop (3/10/07) Topic: Event Count Models.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Session 10. Applied Regression -- Prof. Juran2 Outline Binary Logistic Regression Why? –Theoretical and practical difficulties in using regular (continuous)
9-1 MGMG 522 : Session #9 Binary Regression (Ch. 13)
November 5, 2008 Logistic and Poisson Regression: Modeling Binary and Count Data LISA Short Course Series Mark Seiss, Dept. of Statistics.
AN INTRODUCTION TO LOGISTIC REGRESSION ENI SUMARMININGSIH, SSI, MM PROGRAM STUDI STATISTIKA JURUSAN MATEMATIKA UNIVERSITAS BRAWIJAYA.
Logistic (regression) single and multiple. Overview  Defined: A model for predicting one variable from other variable(s).  Variables:IV(s) is continuous/categorical,
Linear vs. Logistic Regression Log has a slightly better ability to represent the data Dichotomous Prefer Don’t Prefer Linear vs. Logistic Regression.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Logistic Regression. Linear Regression Purchases vs. Income.
Multiple Logistic Regression STAT E-150 Statistical Methods.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Logistic Regression Analysis Gerrit Rooks
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Logistic Regression Categorical Data Analysis.
Logistic Regression and Odds Ratios Psych DeShon.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
1 Assessment and Interpretation: MBA Program Admission Policy The dean of a large university wants to raise the admission standards to the popular MBA.
Before the class starts: Login to a computer Read the Data analysis assignment 1 on MyCourses If you use Stata: Start Stata Start a new do file Open the.
Logistic Regression: Regression with a Binary Dependent Variable.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
Probit Regression Psych DeShon.
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Logistic Regression When and why do we use logistic regression?
EHS Lecture 14: Linear and logistic regression, task-based assessment
Logistic Regression APKC – STATS AFAC (2016).
Advanced Quantitative Techniques
Notes on Logistic Regression
Advanced Quantitative Techniques
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Logistic Regression.
Introduction to Logistic Regression
Limited Dependent Variables
Presentation transcript:

Logit & Probit Regression

Read the data Describe the data use http://www.ats.ucla.edu/stat/stata/dae/logit.dta, clear Describe the data Codebook Summarize Tabulate Make graphs

Description of the Data Dependent variable admit (0/1) Independent variables gre, student's Graduate Record Exam score; gpa, student's grade point average topnotch, a 0/1 variable where 1 indicates that the undergraduate institution was "top notch" and 0 indicates that it is not.

Some Possible Estimation Strategies OLS regression: this analysis is problematic because the assumptions of OLS are violated when using a binary response variable. Logit or Probit model  Probit analysis will produce results similar logit regression. The choice of probit versus logit regression depends largely on individual preferences.

Using Logit Model Before running logit, check to see if any cells (created by the crosstab of our categorical and response variables) are empty or particularly small.  If this occurs, there may be difficulty to run the logit model.

None of the cells are too small or empty (has no cases), so we will run our logit model.

The likelihood ratio chi-square (21.85) with a p-value (0.0001) The log likelihood (-239.06481) can be used in comparisons of nested models.  Observations (400) fewer observations would have been used if any of our variables had missing values.  The likelihood ratio chi-square (21.85) with a p-value (0.0001) our model as a whole fits significantly better than an empty model.

The interpretation of the coefficients can be awkward The interpretation of the coefficients can be awkward. For example, for a one unit increase in gpa, the log odds of being admitted to graduate school (vs. not being admitted) increases by .667. For this reason, many researchers prefer to exponentiate the coefficients and interpret them as odds-ratios. Stata will do this computation for you if you use the or option, illustrated below.

Now we can say that for a one unit increase in gpa, the odds of being admitted to graduate school (vs. not being admitted) increased by a factor of 1.94. Since GRE scores increase only in units of 10, we can take the odds ratio and raise it to the 10th power, e.g. 1.00248 ^ 10 = 1.0250786, and say for a 10 unit increase in GRE score, the odds of admission to graduate school increased by a factor of 1.025.

We will start with prtab. (downloaded from internet) This command can be used with either a categorical variable or a continuous variable and shows the predicted probability of the outcome being 1 for all levels of the specified predictor.  The predicted probability of being accepted into the graduate program is 0.29 if the undergraduate institution was not "top notch" and .39 if it was, while gre and gpa are held constant at their mean value.

Below we can see that the predicted probability of getting accepted is only .15 if one's GRE score is 220 and increases to .429 if one's GRE score is 800 Notice that gpa and topnotch are held constant at their mean, indicated at the end of the output.

We can use the prvalue command to obtain the predicted probabilities when gpa is set to specific values: 2, 3 and 4.  When one's GPA is 2, the predicted probability of being accepted is only .149, and .85 of not being accepted.  When GPA is 3, the probability of being accepted increases to .255 When one's GPA is 4, the predicted probability of being accepted is .40.

Self-Assessment Probit regression ( regress admit1 on topnotch, gre, gpa) Probit regression (reporting odd ratio) Obtain the predicted probability for different topnotch values Obtain the predicted probabilities when gpa is set to specific values: 2, 3 and 4.