July 1, 2011 Bamberg (German Stata User Group Meeting)

Slides:



Advertisements
Similar presentations
Titel og undertitel skal stå med store bogstaver (Versaler) og skal holdes indenfor de to vandrette grå linjer. Titlen kan stå i farve, eller der kan vælges.
Advertisements

Probit The two most common error specifications yield the logit and probit models. The probit model results if the are distributed as normal variates,
Richard Williams (with assistance from Cheng Wang) Notre Dame Sociology August 2012 Annual Meetings of the.
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Logistic Regression Psy 524 Ainsworth.
Quantitative Data Analysis: Hypothesis Testing
Nguyen Ngoc Anh Nguyen Ha Trang
Models with Discrete Dependent Variables
Multiple Linear Regression Model
Binary Response Lecture 22 Lecture 22.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Clustered or Multilevel Data
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Ulf H. Olsson Professor of Statistics
Quantitative Business Analysis for Decision Making Simple Linear Regression.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
GRA 6020 Multivariate Statistics Probit and Logit Models Ulf H. Olsson Professor of Statistics.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Titel og undertitel skal stå med store bogstaver (Versaler) og skal holdes indenfor de to vandrette grå linjer. Der kan frit vælges farver fra farvepaletterne.
1 Regression Models with Binary Response Regression: “Regression is a process in which we estimate one variable on the basis of one or more other variables.”
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
1 Selection bias and auditing policies on insurance claims Leuven, July 20, 2006 Jean Pinquet, Montserrat Guillén & Mercedes Ayuso.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Trends in Inequality of Educational Opportunity in the Netherlands : The Effect of Missing Data Maarten L. Buis & Harry B.G. Ganzeboom Department.
Managerial Economics Demand Estimation & Forecasting.
11 Chapter 12 Quantitative Data Analysis: Hypothesis Testing © 2009 John Wiley & Sons Ltd.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
6. Ordered Choice Models. Ordered Choices Ordered Discrete Outcomes E.g.: Taste test, credit rating, course grade, preference scale Underlying random.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
CFA with Categorical Outcomes Psych DeShon.
More Necessary and Less Sufficient: Age-Period-Cohort Approach to Overeducation in a Comparative Perspective Eyal Bar-Haim, Anne Hartung and Louis Chauvel.
Sampling and Sampling Distributions
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard)   Week 5 Multiple Regression  
The simple linear regression model and parameter estimation
Logistic Regression APKC – STATS AFAC (2016).
Advanced Quantitative Techniques
Multiple Imputation using SOLAS for Missing Data Analysis
Assumptions For testing a claim about the mean of a single population
Basic Estimation Techniques
M.Sc. in Economics Econometrics Module I
William Greene Stern School of Business New York University
General Linear Model & Classical Inference
Financial Econometrics Lecture Notes 4
International Conference
Basic Estimation Techniques
Comparing Logit and Probit Coefficients Across Groups: Problems, Solutions, and Problems with the Solutions Richard Williams Notre Dame Sociology
Methods of Economic Investigation Lecture 12
LAMAS Working Group 7-8 December 2015
Gerald Dyer, Jr., MPH October 20, 2016
Confidence Interval Estimation
What is Regression Analysis?
LIMITED DEPENDENT VARIABLE REGRESSION MODELS
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
CHAPTER 6 Statistical Inference & Hypothesis Testing
Regression Assumptions
Understanding Statistical Inferences
MPHIL AdvancedEconometrics
Hypothesis Testing S.M.JOSHI COLLEGE ,HADAPSAR
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Chapter 13: Using Statistics
Regression Assumptions
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Introduction to Econometrics, 5th edition
Presentation transcript:

A Correlation Metric for Cross-Sample Comparisons Using Logit and Probit July 1, 2011 Bamberg (German Stata User Group Meeting) KRISTIAN BERNT KARLSON w/ Richard Breen and Anders Holm SFI – The Danish National Centre of Social Research Department of Education, Aarhus University Titel og undertitel står med store bogstaver (Versaler) og skal holdes indenfor de to vandrette grå linjer. Titlen kan stå i farve, eller der kan vælges en baggrundsfarve fra en af farvepaletterne til feltet mellem de to vandrette linjer.

CONTENTS An issue! A solution? An example: Trends in IEO in the US A conclusion Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

ISSUE: INTERACTION TERMS Interaction effects in logit/probit models not identified Allison (1999): Differences in true effects conflated by differences in conditional error variance (i.e., heteroskedasticity) Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

ISSUE: INTERACTION TERMS Assume: binary y, manifestation of latent y*. Following standard econometrics, a logit coefficient identifies: Beta = effect from underlying linear reg. model of y* on x s = (function of) latent error standard deviation, sd(y*|x) Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

ISSUE: INTERACTION TERMS Allison noted problem when comparing effects across groups: We cannot identify difference of interest: Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

SOLUTION: A REINTERPRETATION OF THE LOGIT COEFFICIENT Interaction terms = identification issue not easily resolved! We suggest a new strategy. Shift of focus from differences in effects (not identified) to differences in correlations (identified). = possible solution to problem identified by Allison (1999) in some situations met in real applications Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

SOLUTION: A REINTERPRETATION OF THE LOGIT COEFFICIENT We show how to derive, from a logit/probit model, the correlation between an observed predictor, x, and the latent variable, y*, assumed to underlie the binary variable, y: where b is a logit/probit coefficient and var(ω) the variance of a standard logistic/normal variable (π2/3 for logit, 1 for probit). Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

SOLUTION: A REINTERPRETATION OF THE LOGIT COEFFICIENT It follows that: Thus: Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

SOLUTION: A REINTERPRETATION OF THE LOGIT COEFFICIENT Uses of the correlation metric for comparisons: + interest in the relative positions of individuals (or other units of analysis) within a group, e.g., countries, regions, cohorts. - interest in the absolute positions of individuals within groups - interest in group-differences in effects, but not the within- group relative positions (e.g., gender, ethnicity). Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Thanks to Uli Kohler, -nlcorr- implements the new metric. EXAMPLE: Did IEO decline across cohorts born in 20th century? GSS DATA * Five 10-year birth cohorts, 1920 to 1969. * Outcome: high school graduation (y=0/1, y* = educ. propensity) * Predictor: Parental SES (papres80) Corrrelation of interest = corr(SES, y*), over cohorts! Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Previous research, argument for using logit coefficients: ‘differences in [social] background effects … cannot result from changing marginal distributions of either independent or dependent variables because such changes do not affect [the parameter estimates]’ (Mare 1981: 74, parentheses added). But given our reexpression of the logit coefficent, differences in logit effects across groups (cohorts) will also reflect differences in sd(x). Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Trends with logit coefficients 1920-1929 1930-1939 1940-1949 1950-1959 1960-1969 Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Trends with correlations Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Trends with correlations, decomposed Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

EXAMPLE: TRENDS IN IEO IN THE US Trends with correlations, contrasts, statistical tests Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018

CONCLUSION Correlation metric to be preferred in some situations -- a solution to the issue identified by Allison (1999) Example: Evidence on trends in IEO different when correlation metric used (compared to logit coefficients). WP: http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1857431 A Reinterpretation of Coefficients from Logit, Probit, and Other Non-Linear Probability Models: Consequences for Comparative Sociological Research Overskrift og underoverskrift i versaler. Correlation Metric / K.B. Karlson / Bamberg / July 1, 2011 26-12-2018