Lecture 122007Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.

Slides:



Advertisements
Similar presentations
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Advertisements

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Brief introduction on Logistic Regression
Hypothesis Testing Steps in Hypothesis Testing:
The Art of Model Building and Statistical Tests. 2 Outline The art of model building Using Software output The t-statistic The likelihood ratio test The.
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Logit & Probit Regression
Limited Dependent Variables
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Multiple Regression [ Cross-Sectional Data ]
April 25, 2006Lecture 14 Slide #1 Logits and Factor Analysis Homework Review Logit Analysis –Logit Interpretation –Logit diagnostics Factor analysis-in-brief.
Model Estimation and Comparison Gamma and Lognormal Distributions 2015 Washington, D.C. Rock ‘n’ Roll Marathon Velocities.
Multiple Linear Regression Model
Binary Response Lecture 22 Lecture 22.
Feb 21, 2006Lecture 6Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Maximum Likelihood We have studied the OLS estimator. It only applies under certain assumptions In particular,  ~ N(0, 2 ) But what if the sampling distribution.
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
An Introduction to Logistic Regression JohnWhitehead Department of Economics Appalachian State University.
Lecture 23 Multiple Regression (Sections )
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression Analysis
Topic 3: Regression.
1 4. Multiple Regression I ECON 251 Research Methods.
Simple Linear Regression Analysis
Multiple Linear Regression Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 13: Inference in Regression
Hypothesis Testing in Linear Regression Analysis
Problems with Incorrect Functional Form You cannot compare R 2 between two different functional forms. ▫ Why? TSS will be different. One should also remember.
Chapter 12 Multiple Regression and Model Building.
2-1 MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4)
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
9-1 MGMG 522 : Session #9 Binary Regression (Ch. 13)
AN INTRODUCTION TO LOGISTIC REGRESSION ENI SUMARMININGSIH, SSI, MM PROGRAM STUDI STATISTIKA JURUSAN MATEMATIKA UNIVERSITAS BRAWIJAYA.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
11 Chapter 12 Quantitative Data Analysis: Hypothesis Testing © 2009 John Wiley & Sons Ltd.
Chapter 13 Multiple Regression
Week 5Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing Residuals.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Environmental Modeling Basic Testing Methods - Statistics III.
April 4, 2006Lecture 11Slide #1 Diagnostics for Multivariate Regression Analysis Homeworks Assumptions of OLS Revisited Visual Diagnostic Techniques Non-linearity.
Chapter 6 Introduction to Multiple Regression. 2 Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
© Department of Statistics 2012 STATS 330 Lecture 22: Slide 1 Stats 330: Lecture 22.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Logistic Regression Analysis Gerrit Rooks
Logistic regression (when you have a binary response variable)
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Nonparametric Statistics
The Probit Model Alexander Spermann University of Freiburg SS 2008.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
BPS - 5th Ed. Chapter 231 Inference for Regression.
Chapter 20 Linear and Multiple Regression
Basic Estimation Techniques
Simple Linear Regression
Tutorial 1: Misspecification
Maximum Likelihood We have studied the OLS estimator. It only applies under certain assumptions In particular,  ~ N(0, 2 ) But what if the sampling distribution.
Presentation transcript:

Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages –Types –Estimating variable influence

Lecture Slide #2 Logit Assumptions and Qualifiers The model is correctly specified –True conditional probabilities are logistic function of the X’s –No important X’s omitted; no extraneous X’s included –No significant measurement error The cases are independent No X is a linear function of other X’s –Multicollinearity leads to imprecision Influential cases can bias estimates Sample size: n-K should exceed 100 –Independent covariation is critical

Lecture Slide #3 Logit Hypothesis Tests Nested Model Tests (like F-Tests in OLS) –Is a more complex model a better fit? Test to see if parameters for omitted variables are statistically indistinguishable from zero: Where the Chi-square table uses K degrees of freedom. If p < 0.05, the complex model fits significantly better

Lecture Slide #4 More Logit Hypothesis Tests To test for the overall hypothesis that all b’s are equal to zero (like an overall F-test): –Compare the final log-likelihood with the initial one, using the same formula: Initial log likelihood = Final log likelihood = Difference = = 46.85, df=K-1; p-value > (see Hamilton p. 354)

Lecture Slide #5 Still More Logit Hypothesis Tests z-statistic: –Similar to the t-stat in OLS –Compares the estimated coefficient to the estimated standard error –P-value is derived from the Chi-Square distribution Attached to each estimated coefficient –The p-value shows probability that the null hypothesis is correct, given the data

Lecture Slide #6 Interpreting Logits Logits can be used to directly calculate odds: Logits can be reversed to obtain the predicted probabilities:

Lecture Slide #7 Interpreting Logits, Continued How would you calculate the effect of a particular independent variable, X i, on the probability of Y = 1? Set all X j ’s at their mean, then calculate With X i at it’s minimum and maximum. Then calculate the difference.

Lecture Slide #8 Reversing Logits in Stata: Illustrating the Effect of Certainty First: calculate mean values for the “control” variables (ideology and sex) sum ideology sex Use the mean values to generate L generate L1= _b[_cons] + _b[ideology]* _b[sex]* _b[DR_cert]*DR_cert Next: calculate the anti-log of L generate Phat1=1/(1 + exp(-L1)) Now graph the relationship graph twoway mspline Phat1 DR_cert, bands(50)

Lecture Slide #9 Estimated Probability Effects

Lecture Slide #10 Interpreting Logits, Continued How would you calculate the effect of a particular combination of independent variables on the probability of Y=1? Set all X j ’s at the appropriate values, then calculate (e= ) The result is the average probability for that “type” of respondents

Lecture Slide #11 Example: Effect of ideology, gender on probability of choosing the Linear model for standard setting Model: choice (DR_standard) as a function of: –Ideology, gender and certainty Types –A=conservative male; B=liberal female –Set certainty at the average –A: conservative, male, average level of certainty Ideology=7, gender = 1, certainty=6.289 –B: liberal, female, average level of certainty Ideology=1, gender = 0, certainty= =chose threshold, 1=choose linear

Lecture Slide #12 Logit Model Results L= *DR_cert *ideology *sex

Lecture Slide #13 Analyzing Types L = ( *(ideology)) + ( *(sex)) + ( *(certainty)) L Probability Conservative Males: (indep. vars.: 7; 1; ) Liberal Females: (indep. vars.: 1; 0; ) Hint: Use a spreadsheet to calculate L and P. In Excel, the formula for probability would be: P = 1/(1+EXP(-L)) Example from Scientist data

Lecture Slide #14 Estimates of Coefficient Strength In Excel, calculate the difference in probability for each X at its min and max, holding all other variables constant:

Lecture Slide #15 Estimated Logit Probabilities

Lecture Slide #16 Logit Diagnostics The most useful diagnostics are to match “influence” (case-wise dfbetas) with predicted probabilities:

Lecture Slide #17 Logit Outliers and Influence In this instance, the high influence cases are those are uncertain, liberal males, OR certain conservative females. These “bundles” of attributes make them harder to predict.

Lecture Slide #18 Coming Up... Factor Analysis Readings in the course e-reserves –See the link on the class web page