Lecture 5. Linear Models for Correlated Data: Inference.

Slides:



Advertisements
Similar presentations
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.16 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Advertisements

From Anova to Regression: analyzing the effect on consumption of no. of persons in family Family consumption data family.dta E/Albert/Courses/cdas/appstat00/From.
Heteroskedasticity The Problem:
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
1 Nonlinear Regression Functions (SW Chapter 8). 2 The TestScore – STR relation looks linear (maybe)…
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Lecture 4 This week’s reading: Ch. 1 Today:
Sociology 601 Class 19: November 3, 2008 Review of correlation and standardized coefficients Statistical inference for the slope (9.5) Violations of Model.
Adaptive expectations and partial adjustment Presented by: Monika Tarsalewska Piotrek Jeżak Justyna Koper Magdalena Prędota.
Multiple Regression Spring Gore Likeability Example Suppose: –Gore’s* likeability is a function of Clinton’s likeability and not directly.
Sociology 601 Class 25: November 24, 2009 Homework 9 Review –dummy variable example from ASR (finish) –regression results for dummy variables Quadratic.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
1 Multiple Regression EPP 245/298 Statistical Analysis of Laboratory Data.
BCOR 1020 Business Statistics Lecture 28 – May 1, 2008.
Regression Example Using Pop Quiz Data. Second Pop Quiz At my former school (Irvine), I gave a “pop quiz” to my econometrics students. The quiz consisted.
Introduction to Regression Analysis Straight lines, fitted values, residual values, sums of squares, relation to the analysis of variance.
Addressing Alternative Explanations: Multiple Regression Spring 2007.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
1 Michigan.do. 2. * construct new variables;. gen mi=state==26;. * michigan dummy;. gen hike=month>=33;. * treatment period dummy;. gen treatment=hike*mi;
A trial of incentives to attend adult literacy classes Carole Torgerson, Greg Brooks, Jeremy Miles, David Torgerson Classes randomised to incentive or.
Interpreting Bi-variate OLS Regression
1 Zinc Data EPP 245 Statistical Analysis of Laboratory Data.
1 Regression and Calibration EPP 245 Statistical Analysis of Laboratory Data.
Sociology 601 Class 26: December 1, 2009 (partial) Review –curvilinear regression results –cubic polynomial Interaction effects –example: earnings on married.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
TOBIT ANALYSIS Sometimes the dependent variable in a regression model is subject to a lower limit or an upper limit, or both. Suppose that in the absence.
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
Returning to Consumption
Country Gini IndexCountryGini IndexCountryGini IndexCountryGini Index Albania28.2Georgia40.4Mozambique39.6Turkey38 Algeria35.3Germany28.3Nepal47.2Turkmenistan40.8.
Serial Correlation and the Housing price function Aka “Autocorrelation”
1 Estimation of constant-CV regression models Alan H. Feiveson NASA – Johnson Space Center Houston, TX SNASUG 2008 Chicago, IL.
How do Lawyers Set fees?. Learning Objectives 1.Model i.e. “Story” or question 2.Multiple regression review 3.Omitted variables (our first failure of.
Addressing Alternative Explanations: Multiple Regression
MultiCollinearity. The Nature of the Problem OLS requires that the explanatory variables are independent of error term But they may not always be independent.
EDUC 200C Section 3 October 12, Goals Review correlation prediction formula Calculate z y ’ = r xy z x for a new data set Use formula to predict.
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
MULTIPLE REGRESSION WITH TWO EXPLANATORY VARIABLES: EXAMPLE 1 This sequence provides a geometrical interpretation of a multiple regression model with two.
Wiener Institut für Internationale Wirtschaftsvergleiche The Vienna Institute for International Economic Studies Structural change, productivity.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 3: Basic techniques for innovation data analysis. Part II: Introducing regression.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Panel Data. Assembling the Data insheet using marriage-data.csv, c d u "background-data", clear d u "experience-data", clear u "wage-data", clear d reshape.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: exercise 4.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
(1)Combine the correlated variables. 1 In this sequence, we look at four possible indirect methods for alleviating a problem of multicollinearity. POSSIBLE.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
STAT E100 Section Week 12- Regression. Course Review - Project due Dec 17 th, your TA. - Exam 2 make-up is Dec 5 th, practice tests have been updated.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
SEMILOGARITHMIC MODELS 1 This sequence introduces the semilogarithmic model and shows how it may be applied to an earnings function. The dependent variable.
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
1 In the Monte Carlo experiment in the previous sequence we used the rate of unemployment, U, as an instrument for w in the price inflation equation. SIMULTANEOUS.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
1 COMPARING LINEAR AND LOGARITHMIC SPECIFICATIONS When alternative specifications of a regression model have the same dependent variable, R 2 can be used.
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE In this sequence we will investigate the consequences of including an irrelevant variable.
QM222 Class 9 Section A1 Coefficient statistics
The slope, explained variance, residuals
QM222 Your regressions and the test
Auto Accidents: What’s responsible?
QM222 Class 15 Section D1 Review for test Multicollinearity
Covariance x – x > 0 x (x,y) y – y > 0 y x and y axes.
EPP 245 Statistical Analysis of Laboratory Data
Presentation transcript:

Lecture 5

Linear Models for Correlated Data: Inference

Inference Estimation Methods –Weighted Least Squares (WLS) (V i known) –Maximum Likelihood (V i unknown) –Restricted Maximum Likelihood (V i unknown) –Robust Estimation (V i unknown) Hypothesis Testing Example: Growth of Sitka Trees

Weighted-Least Squares Estimation

Weighted-Least Squares Estimation (cont’d)

Estimation of Mean Model: Weighted Least Squares

Estimation of Mean Model: Weighted Least Squares (cont’d)

Note that we can re-write the WRRS as:

What does this equation say? Examples…

Examples: V diagonal

Examples: V diagonal (cont’d)

Examples: V not diagonal

Examples: AR-1 (V not diagonal)

Examples: AR-1 (V not diagonal) (cont’d)

Weighted Least Squares Estimation: Summary

Pigs – “WLS” Fit “WLS” Model results

Pigs – “WLS” Fit

Pigs – OLS fit. regress weight time Source | SS df MS Number of obs = F( 1, 430) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = weight | Coef. Std. Err. t P>|t| [95% Conf. Interval] time | _cons | OLS results

Pigs – “WLS” Fit

“WLS” Model results

Pigs – OLS fit. regress weight time Source | SS df MS Number of obs = F( 1, 430) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = weight | Coef. Std. Err. t P>|t| [95% Conf. Interval] time | _cons | OLS results

Efficiency

Efficiency (cont’d)

Example

Example (cont’d)

When can we use OLS and ignore V? 1.Uniform Correlation Model 2.Balanced Data

When can we use OLS and ignore V? (cont’d) 1.(Uniform Correlation) With a common correlation between any two equally- spaced measurements on the same unit, there is no reason to weight measurements differently. 2. (Balanced Data) This would not be true if the number of measurements varied between units because, with >0, units with more measurements would then convey more information per unit than units with fewer measurements.

When can we use OLS and ignore V? (cont’d) In many circumstances where there is a balanced design, the OLS estimator is perfectly satisfactory for point estimation.

Example: Two-treatment crossover design

Example: Two-treatment crossover design (cont’d)

(Recall slide) Inference Estimation Methods –Weighted Least Squares (WLS) (V i known) –Maximum Likelihood (V i unknown) –Restricted Maximum Likelihood (V i unknown) –Robust Estimation (V i unknown) Hypothesis Testing Example: Growth of Sitka Trees

Maximum Likelihood Estimation under a Gaussian Assumption

Maximum Likelihood Estimation under a Gaussian Assumption (cont’d)

(Recall slide) Inference Estimation Methods –Weighted Least Squares (WLS) (V i known) –Maximum Likelihood (V i unknown) –Restricted Maximum Likelihood (V i unknown) –Robust Estimation (V i unknown) Hypothesis Testing Example: Growth of Sitka Trees

Restricted Maximum Likelihood Estimation

(Recall slide) Inference Estimation Methods –Weighted Least Squares (WLS) (V i known) –Maximum Likelihood (V i unknown) –Restricted Maximum Likelihood (V i unknown) –Robust Estimation (V i unknown) Hypothesis Testing Example: Growth of Sitka Trees

Generalized Least Square Estimator Robust Estimation (unstructured covariance matrix)

Robust Estimation of V under a saturated model

Robust Estimation of V “restricted ML” – makes estimates unbiased

Example

Robust Estimation vs. A Parametric Approach

Maximum Likelihood Estimation of V

Example: Growth of sitka trees

Figure 1. Observed data and mean response profiles in each of the four growth chambers for the treatment and control.

Figure 2. Observed mean response in each of the four chambers. Season 1 Season 2

Example: Growth of sitka trees (cont’d)

We first consider the 1998 data.

Example: Growth of sitka trees (cont’d) Unstructured covariance matrix

Example: Growth of sitka trees (cont’d)

Sitka spruce data: Estimated covariance matrix for 1988

Sitka spruce data: Estimated covariance matrix for 1989

Summary: Unstructured Covariance Matrix

Summary: Parametric Models for Covariance Reasons for parametric modelling:

Summary: Parametric Models for Covariance (cont’d) Reasons for parametric modelling (cont’d):

Summary: Unstructured vs. Parametric Covariance

Overall Summary