Estimation in Marginal Models (GEE and Robust Estimation)

Slides:



Advertisements
Similar presentations
SJS SDI_21 Design of Statistical Investigations Stephen Senn 2 Background Stats.
Advertisements

Probit The two most common error specifications yield the logit and probit models. The probit model results if the are distributed as normal variates,
Use of Estimating Equations and Quadratic Inference Functions in Complex Surveys Leigh Ann Harrod and Virginia Lesser Department of Statistics Oregon State.
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Copula Regression By Rahul A. Parsa Drake University &
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Nguyen Ngoc Anh Nguyen Ha Trang
Multivariate linear models for regression and classification Outline: 1) multivariate linear regression 2) linear classification (perceptron) 3) logistic.
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Regression with a Binary Dependent Variable
Different chi-squares Ulf H. Olsson Professor of Statistics.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.

QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Generalized Regression Model Based on Greene’s Note 15 (Chapter 8)
GRA 6020 Multivariate Statistics The Structural Equation Model Ulf H. Olsson Professor of Statistics.
Linear beta pricing models: cross-sectional regression tests FINA790C Spring 2006 HKUST.
Different chi-squares Ulf H. Olsson Professor of Statistics.
Chapter 15 Panel Data Analysis.
Basic Mathematics for Portfolio Management. Statistics Variables x, y, z Constants a, b Observations {x n, y n |n=1,…N} Mean.
Linear and generalised linear models
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Economics Prof. Buckles
The General LISREL MODEL and Non-normality Ulf H. Olsson Professor of Statistics.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Review of Lecture Two Linear Regression Normal Equation
GEE and Generalized Linear Mixed Models
9. Binary Dependent Variables 9.1 Homogeneous models –Logit, probit models –Inference –Tax preparers 9.2 Random effects models 9.3 Fixed effects models.
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
13.7 – Graphing Linear Inequalities Are the ordered pairs a solution to the problem?
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
HSRP 734: Advanced Statistical Methods June 19, 2008.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
Machine Learning Recitation 6 Sep 30, 2009 Oznur Tastan.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
Estimation Kline Chapter 7 (skip , appendices)
Part 4A: GMM-MDE[ 1/33] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Computacion Inteligente Least-Square Methods for System Identification.
Estimation Econometría. ADE.. Estimation We assume we have a sample of size T of: – The dependent variable (y) – The explanatory variables (x 1,x 2, x.
Introduction We consider the data of ~1800 phenotype measurements Each mouse has a given probability distribution of descending from one of 8 possible.
Estimating standard error using bootstrap
Estimator Properties and Linear Least Squares
Regression Models for Linkage: Merlin Regress
Probability Theory and Parameter Estimation I
Example: Solve the equation. Multiply both sides by 5. Simplify both sides. Add –3y to both sides. Simplify both sides. Add –30 to both sides. Simplify.
CH 5: Multivariate Methods
Regression with a Binary Dependent Variable.  Linear Probability Model  Probit and Logit Regression Probit Model Logit Regression  Estimation and Inference.
Distributions and Concepts in Probability Theory
Linear regression Fitting a straight line to observations.
OVERVIEW OF LINEAR MODELS
The Multivariate Normal Distribution, Part 2
OVERVIEW OF LINEAR MODELS
#21 Marginalize vs. Condition Uninteresting Fitted Parameters
Multivariate Methods Berlin Chen, 2005 References:
Econometric Analysis of Panel Data
Causal Relationships with measurement error in the data
Topic 11: Matrix Approach to Linear Regression
Factor Analysis.
Regression and Correlation of Data
Probabilistic Surrogate Models
Presentation transcript:

Estimation in Marginal Models (GEE and Robust Estimation)

GEE Since there is no convenient specification of the joint multivariate distribution of Y for marginal models when the responses are discrete, we require an alternative to MLE GEE is based on the concept of “estimating equations” and provides a very general approach for analyzing correlated responses that can be discrete or continuous

GEE The essential idea behind GEE is to generalize and extend the usual likelihood equations for a GLM with a univariate response by incorporating the covariance matrix of the vector of responses Y For the case of linear models, the GLS estimator (also called Generalized Least Square estimator) for the vector of regression coefficients is a special case of the GEE approach

What we need to specify for implementing GEE Model for the mean Known variance function Working correlation matrix: model for the pariwise correlations among the responses

Working covariance matrix V is called the working covariance matrix to distinguish for the true underlying covariance of Y

GEE minimize GEE equations Solution of the GEE equation

Properties of GEE estimates The GEE estimator is consistent whether or not the within subject associations/correlations have been correctly modelled That is, for GEE estimator to provide a valid estimate of the true beta, we only require that the model for the mean response has been correctly specified

Asymptotic distribution of GEE estimator In large sample, the GEE estimator is multivariate normal True covariance matrix

Sandwich estimate of bread meat Consistent estimate of the true Covariance matrix of Y

Link to stata command xtgee for continuous data substitute into GEE equations, got xtgee,identity link, corr(exch) Use Weighted Least Square for

xtgee, identity link, corr(exch), robust Use Sandwich Estimator for

Link to stata commands xtgee for binary data Substitute into GEE equation, but no closed-form solution, need iteration. Difference between using robust or not analogous to continuous data xtgee,logit link, corr(exch) xtgee, logit link, corr(exch), robust