1
Panel data analysis
Econometrics course organized by the COMESA Monetary Institute (CMI), 9-13 February 2015, Kampala, Uganda
Esman M. Nyamongo, Central Bank of Kenya
2
Panel data: where we are coming from
There are 3 types of data:
Cross-sectional data
Time series data
Panel data
Cross-section and time series data are the primary building blocks of panel data.
3
Time series data
A time series is a set of observations on the values that a variable takes at different times. Such data may be collected at regular time intervals:
Minutely and hourly: collected literally continuously (the so-called real-time quote)
Daily: e.g., financial time series (stock prices, exchange rates); weather reports (rainfall, temperature)
Weekly: e.g., money supply
Monthly: e.g., consumer price index
Quarterly: e.g., GDP
Semi-annually: e.g., fiscal data
Annually: e.g., fiscal data
Quinquennially (every 5 years): e.g., manufacturing survey
Decennially (every 10 years): e.g., population census data
4
Illustration of time series
5
Consumption and Income (annual)
The model set-up: yt = α + βxt + ut, where t = time series.
6
Cross-sectional data
Cross-section data are data on one or more variables collected at a particular point in time:
Survey data: a questionnaire is designed to capture all the variables a researcher is looking for.
Macro data relating to different economic entities (e.g., countries, banks) at a particular point in time.
Other data.
7
Summary data from income survey
8
Cross-country data
The model set-up: yi = α + βxi + ui, where i = cross-section unit.
9
Panel data
Panel data are a combination of time series and cross-section data. A specialized type is longitudinal or micropanel data, where a cross-sectional unit (say, an individual, family or firm) is surveyed over time. Surveying the same individual over time provides useful information on the dynamics of individual/household/firm behavior.
10
Example
11
Pooled data
12
Some Advantages of panel data
Controlling for individual heterogeneity: panel data allow for the fact that individuals, firms, states or countries are heterogeneous.
Panel data give more informative data, more variability, less collinearity among the variables, more degrees of freedom and more efficiency.
Panel data are better able to study the dynamics of adjustment.
Panel data are better able to identify and measure effects that are simply not detectable in pure cross-section or pure time-series data. Consider a country where the labor participation rate is 50%: it is possible that each individual has a 50% chance of participating, or that 50% of individuals work 100% of the time and the other 50% work 0% of the time, either of which yields 50% participation. A single cross-section cannot distinguish these cases; a panel can.
13
Panel data model set-up: time series + cross-section
yit = α + βXit + uit, where i = cross-section and t = time series. How then do we estimate this model? There are many approaches; a small synthetic example of such a data set is sketched below.
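To make the set-up concrete, the following is a minimal Python (pandas) sketch that builds a small synthetic balanced panel in long format; the number of units, time periods and coefficients are invented purely for illustration, and the same kind of data set is reused in the sketches that follow.

```python
import numpy as np
import pandas as pd

# A small synthetic balanced panel: N units observed over T periods (illustrative values)
rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)            # i = 0,...,N-1, each repeated T times
time = np.tile(np.arange(T), N)              # t = 0,...,T-1 for every unit
alpha_i = rng.normal(0, 2, N)                # unobserved unit-specific effects
x = rng.normal(size=N * T) + alpha_i[unit]   # regressor correlated with the effects
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)

panel = pd.DataFrame({"unit": unit, "time": time, "x": x, "y": y})
print(panel.head(10))                        # long format: one row per (i, t) pair
```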
14
Part I
15
a. Pooled estimator
Emphasizes the joint estimation of coefficients and ignores the panel structure of the data: yit = α + βXit + uit, where y = dependent variable and the Xs = regressors, for all i and t. That is:
For a given cross-section, observations are serially uncorrelated.
Across cross-sections and time, the errors are homoscedastic.
16
But how realistic is it to ignore the panel structure of the data?
These assumptions are the classical assumptions, and therefore suggest we estimate the equation by OLS.
Pooling increases the degrees of freedom, potentially lowering the standard errors on the coefficients. It involves stacking the cross-sections in the data set.
This form assumes the same intercept and the same slope coefficients for all cross-section units (see the sketch below).
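A minimal Python sketch of the pooled estimator on a synthetic panel of the kind shown earlier (the data-generating numbers are illustrative): all N×T observations are stacked and a single intercept and slope are fitted by OLS, exactly as if the data were one big cross-section.

```python
import numpy as np

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)                 # unit effects that the pooled model ignores
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)

# Pooled OLS: stack all N*T observations, one common intercept and one common slope
X = np.column_stack([np.ones(N * T), x])
beta_pooled, *_ = np.linalg.lstsq(X, y, rcond=None)
print("pooled intercept and slope:", beta_pooled)
# Because x is correlated with the ignored unit effects, the pooled slope is biased
# away from the 0.5 used to generate the data.
```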
17
Short practical session
Preparation of data for use in panel estimation. Estimation of the pooled model using EViews.
18
Pooled regression results
The same slope and intercept are assumed: homogeneity of the cross-section units.
19
b. Error component model
20
B. The error-components model
Recall yit = α + βXit + uit. In this model u may follow 3 different schemes. Scheme (i) consists of 3 individual shocks, each assumed to be independent of the others: uit = λt + vi + εit, where
λt is a cross-section-invariant shock (time effect)
vi is a time-invariant shock (individual effect)
εit is the error term with the usual properties, uncorrelated with Xit.
21
Scheme (i) is the TWO-WAY-ERROR COMPONENT MODEL
Scheme (ii) drops the time effect (λt = 0) to yield uit = vi + εit: the cross-section fixed effects model. Scheme (iii) drops the individual effect (vi = 0) to yield uit = λt + εit: the time fixed effects model. Schemes (ii) and (iii) are referred to as ONE-WAY ERROR COMPONENT MODELS; scheme (i) is the TWO-WAY ERROR COMPONENT MODEL.
22
i. One-way error component models
Relaxes the restrictions imposed by the pooled model, i.e. a joint intercept and slope for i = 1, 2, ..., N and t = 1, 2, ..., T. The one-way error component model allows cross-section heterogeneity in the error term: the error term (uit) becomes the sum of an individual-specific effect (vi, time invariant) and a 'well-behaved' disturbance (εit).
23
In this formulation we have:
The first part (vi) varies over cross-section units but is constant across time.
The second part (εit) varies unsystematically (independently) across time and individuals.
24
Estimation method
There are two ways to estimate a regression model with error terms that are assumed to consist of several error components:
Fixed effects: (1) the constant/intercept in each equation is a separate parameter; (2) the values of vi are potentially correlated with the other regressors.
Random effects: (1) differences in the vi are randomly distributed between units; (2) the values of vi are uncorrelated with the other regressors.
25
1. Fixed effects model
Recall that in the fixed-effects model: (1) the constant in each equation is a separate parameter; (2) the values of vi are potentially correlated with the other regressors.
26
The main question is whether Xit is correlated with uit:
If no, then we have a seemingly unrelated regression.
If yes, then we have a multi-equation system with common coefficients and endogenous regressors.
How then do we account for this endogeneity? In time series we use instrumental variable estimation methods (2SLS, 3SLS, etc.). In panels, however, we can handle this, under certain assumptions, without using instruments. How?
27
Some approaches
There are 3 approaches to doing this in panels:
The least squares dummy variables (LSDV) estimator
The within-group estimator
The first-difference estimator
Each is discussed in turn.
28
1. Least Squares Dummy Variables (LSDV)
The LSDV method applies OLS to the levels with group-specific dummies added to the list of regressors, which explains why the estimator is called the Least-Squares Dummy Variables (LSDV) estimator. Consider the general model yit = α + βXit + vi + εit. Stack the observations over t for each i to obtain yi = α1T + Xiβ + vi1T + εi, where 1T is a (T×1) vector of ones.
29
The pooled regression is then:
y = α1NT + Xβ + (IN ⊗ 1T)v + ε, where ⊗ denotes the Kronecker product. Since the dummies IN ⊗ 1T enter as regressors alongside the intercept, the full set of N dummies is exactly collinear with the intercept, so we expect to include only N - 1 of them (the omitted cross-section is captured by the intercept).
30
Too many parameters (especially with large N)!
This model is appealing, but consider the number of parameters to be estimated: K + 1 + (N - 1) = K + N, where
K = parameters for the original X-regressors
1 = parameter for the intercept
N - 1 = parameters for the cross-section fixed effects (the omitted cross-section is captured by the intercept).
That is too many parameters, especially when N is large (see the LSDV sketch below).
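A minimal Python sketch of the LSDV estimator on the same kind of synthetic panel (illustrative numbers): OLS in levels with one dummy per cross-section unit.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)

# LSDV: one dummy per unit and no separate common intercept, which avoids the
# dummy-variable trap; K + N = 1 + 10 parameters are estimated here
D = pd.get_dummies(unit).to_numpy(dtype=float)   # (N*T x N) matrix of unit dummies
X = np.column_stack([x, D])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print("LSDV slope:", coef[0])                    # close to the 0.5 used above
print("unit intercepts:", coef[1:].round(2))     # common intercept and unit effects, absorbed together
```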
31
There are many parameters (the βs and the dummy coefficients) being estimated.
But β can still be estimated using OLS, following the Frisch-Waugh-Lovell (FWL) theorem on partitioned regressions (see the digression on FWL and partitioned regressions in the supplementary information).
32
Based on this result it can be shown that the fixed-effects estimator is the partitioned OLS estimator of β in the pooled regression above, with D = IN ⊗ 1T collecting the unit dummies. It then follows that βFE = (X'MDX)^(-1)X'MDy, where MD = INT - D(D'D)^(-1)D' is the annihilator associated with D. Therefore MD replaces each observation by its deviation from the corresponding cross-section mean.
33
Whence it follows that:
βFE is basically a pooled OLS estimator on transformed (demeaned) data, MDy regressed on MDX (a numerical check of the FWL result is sketched below).
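A numerical check of the FWL result, in Python with illustrative synthetic data: partialling the unit dummies out of y and x and regressing residuals on residuals reproduces the LSDV slope.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)
D = pd.get_dummies(unit).to_numpy(dtype=float)   # unit dummies

def residuals(v, Z):
    """Residuals from regressing v on Z, i.e. M_Z v with M_Z = I - Z(Z'Z)^(-1)Z'."""
    coef, *_ = np.linalg.lstsq(Z, v, rcond=None)
    return v - Z @ coef

# FWL: partial the dummies out of both y and x, then regress residual on residual
y_tilde = residuals(y, D)
x_tilde = residuals(x, D)
beta_fwl = (x_tilde @ y_tilde) / (x_tilde @ x_tilde)

# Same slope as the full LSDV regression
beta_lsdv = np.linalg.lstsq(np.column_stack([x, D]), y, rcond=None)[0][0]
print(beta_fwl, beta_lsdv)   # numerically identical
```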
34
Least Squares Dummy Variable est.
35
2. Within (Q) estimation
This still assumes individual effects, although we no longer directly estimate them. We demean the data, wiping out the individual effects, so as to estimate only β. How do we wipe out the individual effects? We define a Q matrix such that Qyi contains the deviations of individual i's observations from their time mean, i.e. QT = IT - (1/T)1T1T'.
36
Consider a simple regression:
The simple regression is yit = α + βxit + vi + εit. The trick is to remove the fixed effects, vi. How? Step 1: average over time t for each i, giving ȳi = α + βx̄i + vi + ε̄i.
37
Step 2: The transformed regression:
Subtracting the time-averaged equation from the original removes α and vi: (yit - ȳi) = β(xit - x̄i) + (εit - ε̄i). Then, stack by observation for t = 1, ..., T, resulting in (yi - 1Tȳi) = (Xi - 1Tx̄i)β + (εi - 1Tε̄i).
38
Degrees of freedom of FE estimator
The within-group fixed-effects estimator is pooled OLS on the transformed regression that has been stacked by observations (see the sketch below). Degrees of freedom of the FE estimator: nT - k - n = n(T - 1) - k. Why? We lose one degree of freedom for each of the n fixed effects, and there are k βs to be estimated as well.
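A minimal Python sketch of the within estimator on the same kind of synthetic panel (illustrative numbers): subtract each unit's time means and run pooled OLS on the demeaned data.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)
df = pd.DataFrame({"unit": unit, "x": x, "y": y})

# Within transformation: subtract each unit's time mean from its observations
demeaned = df[["x", "y"]] - df.groupby("unit")[["x", "y"]].transform("mean")

# Pooled OLS on the demeaned data (no intercept: demeaning removes it)
x_w, y_w = demeaned["x"].to_numpy(), demeaned["y"].to_numpy()
beta_within = (x_w @ y_w) / (x_w @ x_w)

k = 1                             # number of slope coefficients
dof = N * T - k - N               # = N*(T - 1) - k degrees of freedom
print("within slope:", beta_within, " degrees of freedom:", dof)
```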
39
General case: stack an individual i's observations for t = 1, ..., T, giving yi = Xiβ + vi1T + εi, for i = 1, 2, ..., n, where 1T is a (T×1) vector of ones.
40
The fixed-effects estimator is applied to a T-equation system transformed from the original system above. The matrix used for the transformation is the so-called annihilator associated with 1T, namely QT = IT - 1T(1T'1T)^(-1)1T' = IT - (1/T)1T1T', where QT is a T x T matrix.
41
The matrices QT and PT are such that:
PT = (1/T)1T1T', QT = IT - PT, QT + PT = IT, QTPT = 0, and both are symmetric and idempotent. What does this mean? PTyi replaces each of individual i's observations with its time average, while QTyi gives the transformed (demeaned) y.
42
The transformed error-components model is then:
QTyi = QTXiβ + QTεi, since QT1Tvi = 0. The pooled regression is stated as Qy = QXβ + Qε, where Q = IN ⊗ QT.
43
The fixed-effects estimator is again obtained by pooled OLS on the transformed system:
βFE = (X'QX)^(-1)X'Qy. Why? QT is idempotent, i.e. QTQT = QT. In other words, (QX)'(QX) = X'QX and (QX)'(Qy) = X'Qy, so transforming once is enough (a numerical check of the QT and PT properties is sketched below).
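A small numerical check, in Python, of the properties of QT and PT claimed above (T is an arbitrary illustrative value):

```python
import numpy as np

T = 5
ones = np.ones((T, 1))
P = ones @ ones.T / T                 # P_T: replaces each entry with the time average
Q = np.eye(T) - P                     # Q_T: annihilator of 1_T, deviations from the time average

print(np.allclose(Q @ Q, Q))          # Q_T is idempotent
print(np.allclose(P @ P, P))          # P_T is idempotent
print(np.allclose(Q + P, np.eye(T)))  # Q_T + P_T = I_T
print(np.allclose(Q @ P, 0))          # Q_T P_T = 0
print(np.allclose(Q @ ones, 0))       # Q_T wipes out the constant vector 1_T
```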
44
Qy is an NT×1 vector with 'stacked' deviations.
P is the matrix that averages across time for each individual cross-section. Thus pre-multiplying the regression by Q obtains deviations from the means WITHIN each cross-section, and Qy is an NT×1 vector with these 'stacked' deviations. The OLS estimator is therefore βW = (X'QX)^(-1)X'Qy. Demeaning the data will not change the estimates of β; it is similar to running a regression with the line of best fit passing through the origin.
45
Thus the ‘WITHIN’ model becomes a simple regression:
The 'WITHIN' regression is (yit - ȳi) = β(xit - x̄i) + (εit - ε̄i). The individual effects can then be solved for (not estimated). But we need an identifying assumption, for example that the effects sum to zero (equivalently, that the regression passes through the overall means). Solving then gives vi = ȳi - α - βx̄i, with α = ȳ - βx̄ computed from the overall means.
46
Fixed Effects: Within estimation
Notice that the fixed effects are not estimated; they are computed instead (see the sketch below).
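A Python sketch, on synthetic illustrative data, of how the fixed effects are computed rather than estimated once the within slope is available, using the normalization that the effects average to zero:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)
df = pd.DataFrame({"unit": unit, "x": x, "y": y})

# Within slope from the demeaned data
dm = df[["x", "y"]] - df.groupby("unit")[["x", "y"]].transform("mean")
beta = (dm["x"] @ dm["y"]) / (dm["x"] @ dm["x"])

# Recover the effects from the group means (normalization: effects average to zero)
means = df.groupby("unit")[["x", "y"]].mean()
intercept = df["y"].mean() - beta * df["x"].mean()                      # common intercept
v_i = (means["y"] - df["y"].mean()) - beta * (means["x"] - df["x"].mean())
print("slope:", beta, " intercept:", intercept)
print("computed fixed effects:\n", v_i.round(2))
```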
47
Within, for clickers: the computed FE are indicated.
But what are these FE?
48
Disadvantages of the method
Demeaning the data means that X-regressors which are themselves time-invariant dummy variables (sex, religion, etc.) cannot be used: the transformation wipes them out.
49
Which one do we work with: pooled or FE?
Both coefficients are positive and significant, and therefore consistent with theory. Which one do we choose?
50
Testing the joint validity of fixed effects
The null hypothesis is H0: v1 = v2 = ... = vN-1 = 0; the alternative HA: not all equal to 0. We test the null hypothesis of no individual effects with an applied Chow or F-test, combining the residual sums of squares for the regression with the constraints imposed (under the null) and without them (under the alternative). The recipe:
RRSS: OLS on the pooled model (common intercept)
URSS: OLS on the LSDV model.
51
The F-statistic is stated as:
F = [(RRSS - URSS)/(N - 1)] / [URSS/(NT - N - K)], which is distributed as F(N - 1, N(T - 1) - K) under the null. If N is 'large enough' we can use the 'WITHIN' estimation instead of LSDV to obtain the unrestricted RSS. The decision rule:
p-value < 0.05: we reject the null hypothesis => the FE are not redundant.
p-value > 0.05: we fail to reject the null hypothesis => the FE are redundant, suggesting the pooled model is valid.
(A sketch of this test is given below.)
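A Python sketch of the redundant fixed effects F-test on synthetic, illustrative data: the restricted RSS comes from the pooled model and the unrestricted RSS from the LSDV model.

```python
import numpy as np
import pandas as pd
from scipy import stats

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
alpha_i = rng.normal(0, 2, N)
x = rng.normal(size=N * T) + alpha_i[unit]
y = 1.0 + 0.5 * x + alpha_i[unit] + rng.normal(scale=0.5, size=N * T)

def rss(y, X):
    """Residual sum of squares from OLS of y on X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ coef
    return e @ e

k = 1
rrss = rss(y, np.column_stack([np.ones(N * T), x]))                              # restricted: pooled
urss = rss(y, np.column_stack([x, pd.get_dummies(unit).to_numpy(dtype=float)]))  # unrestricted: LSDV

F = ((rrss - urss) / (N - 1)) / (urss / (N * T - N - k))
p_value = stats.f.sf(F, N - 1, N * T - N - k)
print("F =", F, " p-value =", p_value)
# p-value < 0.05: reject the null, so the fixed effects are not redundant
```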
52
Part II
53
II. Two-way error component model
Recall yit = α + βXit + uit with uit = vi + λt + εit, for i = 1, ..., N and t = 1, ..., T, where
vi = unobservable individual effect
λt = unobservable time effect
εit = stochastic disturbance.
In matrix form the effects enter through Zv, a selector matrix of ones and zeros for the individual effects, and Zλ, the matrix of time dummies.
54
1. Estimating fixed effects
Here we still assume vi and λt are fixed parameters to be estimated, and εit is the usual well-behaved disturbance. Estimation with LSDV requires the estimation of (N - 1) + (T - 1) dummies. This can introduce a rather severe loss of degrees of freedom, so once again, to avoid this problem, we perform the 'WITHIN' transformation (similar to the one-way model). Now, however, we must demean across both dimensions.
55
Here we work with the following:
Q = (IN - (1/N)JN) ⊗ (IT - (1/T)JT), where JN is a matrix of ones of dimension N and JT of dimension T. The first factor produces deviations across i and the second deviations across t. Transforming with Q sweeps out both the time and the individual effects.
56
Here we have a simple regression (one-x regressor):
(yit - ȳi. - ȳ.t + ȳ..) = β(xit - x̄i. - x̄.t + x̄..) + (εit - ε̄i. - ε̄.t + ε̄..). We now need 2 constraints to capture the individual and time effects, for example Σi vi = 0 and Σt λt = 0. Then we can compute the intercepts using the group and time means: vi = (ȳi. - ȳ..) - β(x̄i. - x̄..) and λt = (ȳ.t - ȳ..) - β(x̄.t - x̄..). Again, as in the one-way model, we cannot use time-invariant or individual-invariant (dummy) regressors, as Q wipes them out (see the sketch below).
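A Python sketch of the two-way within transformation on synthetic, illustrative data: subtract unit means and time means, add back the grand mean, and run OLS on the transformed data.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 10, 8
unit = np.repeat(np.arange(N), T)
time = np.tile(np.arange(T), N)
alpha_i = rng.normal(0, 2, N)                # individual effects
lam_t = rng.normal(0, 1, T)                  # time effects
x = rng.normal(size=N * T) + alpha_i[unit] + lam_t[time]
y = 1.0 + 0.5 * x + alpha_i[unit] + lam_t[time] + rng.normal(scale=0.5, size=N * T)
df = pd.DataFrame({"unit": unit, "time": time, "x": x, "y": y})

# Two-way within transformation: y_it - ybar_i. - ybar_.t + ybar_..
def two_way_demean(s):
    return (s
            - s.groupby(df["unit"]).transform("mean")
            - s.groupby(df["time"]).transform("mean")
            + s.mean())

x_w = two_way_demean(df["x"])
y_w = two_way_demean(df["y"])
beta_2way = (x_w @ y_w) / (x_w @ x_w)
print("two-way within slope:", beta_2way)    # should be close to the 0.5 used above
```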
57
Two-way fixed effects estimation results
58
For clickers: how do we interpret these results?
59
Practical session
60
Supplementary information
61
Panel data set
62
Longitudinal/Micropanel data
63
Pooled regression Same results
64
1. Kronecker product
65
LSDV- For clickers
66
2. FWL and partitioned regressions
Consider the partitioned regression equation y = X1β1 + X2β2 + u, where X1 has k1 columns, X2 has k2 columns and K = k1 + k2. The least-squares estimators of β1 and β2 can be expressed as β1 = (X1'M2X1)^(-1)X1'M2y and β2 = (X2'M1X2)^(-1)X2'M1y, where Mj = I - Xj(Xj'Xj)^(-1)Xj' is the annihilator (residual-maker) matrix associated with Xj.
67
3. Annihilator matrix: the residual maker and the hat matrix
Some useful matrices. We know that the OLS estimator is b = (X'X)^(-1)X'y, meaning the residuals are e = y - Xb = [I - X(X'X)^(-1)X']y = My, where M = I - X(X'X)^(-1)X' is called the residual maker, since it makes residuals out of y. A matrix M is idempotent if M² = MM = M.
68
M is a square, symmetric and idempotent matrix (show this).
The matrix has the following properties as well: MX = 0, so that My = Mu in the model y = Xβ + u. The hat matrix H = X(X'X)^(-1)X' makes ŷ out of y: ŷ = Xb = Hy, and M = I - H (see the numerical check below).
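A small numerical check, in Python with arbitrary illustrative data, of the residual maker M and the hat matrix H described above:

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 20, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])  # design matrix with intercept
y = rng.normal(size=n)

H = X @ np.linalg.inv(X.T @ X) @ X.T   # hat matrix: H y = X b = fitted values ("y hat")
M = np.eye(n) - H                      # residual maker: M y = OLS residuals

b = np.linalg.lstsq(X, y, rcond=None)[0]
print(np.allclose(H @ y, X @ b))       # H makes y hat out of y
print(np.allclose(M @ y, y - X @ b))   # M makes residuals out of y
print(np.allclose(M @ M, M))           # M is idempotent
print(np.allclose(M @ X, 0))           # M annihilates the columns of X
```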