Download presentation
Presentation is loading. Please wait.
Published byBrent Pierce Modified over 9 years ago
2
Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can be used to estimate two sets of coefficients and intercepts for the independent variable. For example, if natural gas usage is not affected by temperature when the temperature rises above 60 degrees, we could have: Gas usage = b 0 + b 1 (GT60) + b 2 (Temp) + b 3 (GT60)(Temp) where GT60 is a dummy equal to 1 when the temperature is above 60 degrees
3
Non-continuous Relationships Note that at temperatures above 60 degrees the net effect of a 1 degree increase in temperature on gas usage is -0.056 (-.866+.810) and the estimated intercept is 6.4 (53.0-46.6) Coefficients Standard Errort StatP-value Intercept53.0022.41521.957.48E-18 GT60-46.62316.682-2.790.0098 Temp-0.8660.0595-14.561.02E-13 (GT60)(Temp)0.8100.2553.180.0039
4
Interaction Terms You can try to control for interactions between two variables by including a variable that is the product of two independent variables. For example, assume we were estimating the salaries of baseball players. If there was a premium paid to players that were both good fielders and good hitters, we might want to include an interaction term for hitting and fielding in the model.
5
Standardized Coefficients When the regression model is estimated after standardizing the values of the dependent and independent variables.
6
Standardized Coefficients Unstandardized StandardizedCoefficients BStd. Err.BetatSig. (Constant)-14.4854.038 -3.587.000 Weight-.007.000-.706-14.177.000 Year.761.050.36015.262.000 Cylinders-.074.232-.016-.320.749 a Dependent Variable: MPG Standardized coefficients can be used to compare the magnitude of the effects of the independent variables.
7
Standardized Residuals Where s is the standard error of estimate and h i is the leverage of observation i. Leverage is determined by the difference between the value of the independent variables and their means.
8
Standardized Residuals The random deviation in the value of y, , is assumed to be normally distributed. Looking at the standardized residuals gives some indication if that is true. Values should lie within 2 standard deviations of 0. Values greater than 2 may indicate the presence of outliers.
9
Residuals for MPG Example MPGPredictedResidual Standardized Residual 14.0014.63-0.63-0.18 15.0013.641.360.40 14.0018.04-4.04-1.18 24.0023.001.000.29 22.0019.842.160.63 18.0020.23-2.23-0.65 21.0021.45-0.45-0.13 27.0024.582.420.71 26.0026.50-0.50-0.15 25.0021.043.961.15...………
10
Distribution of Standardized Residuals
11
Summary of Issues in Building Models Check for multicollinearity Look for outliers Check residuals for nonlinear relationships Check residuals for hetroscedasticy Were all relevant variables included?
12
Time Series Data Observations on a variable measured over successive periods of time.
13
Components of a Time Series Trend – The long-run movement of a time series Seasonal component – The component of a time series that shows a periodic pattern over a year Cyclical component – The component of a time series related to the business cycle Irregular component – The random variation in a time series not accounted for by the other components
14
Smoothing a Time Series Smoothing removes the irregular component of a time series. It can be used to forecast if the variable has no significant trend, cyclical, or seasonal effects. It can also be used to investigate whether there is a trend in the data.
15
Smoothing Techniques Moving average: Weighted moving average: A moving average in which more recent values are given heavier weights Exponential smoothing: When the weight on an observation decreases exponentially as time passes
16
Example, Moving Averages Year, Quarter Sales 2008, 110 2008,214 2008, 38 2008, 416 2009, 16 2009, 218 2009, 38 Where the weights are.1,.2,.3, and.4; oldest to most recent Moving average Weighted moving average* 12 12.6 11 10.2 12 13 12 11.4
17
Exponential Smoothing F t+1 = Y t + (1 – )F t F t+1 = Forecast for period t+1 Smoothing constant Y t = Value of the time series in period t F t = Forecast for time t
18
Example, Moving Averages Year, QuarterSalesExponentially smoothed forecast 2008, 110 2008,214 10.00 2008, 38 12.40 2008, 416 9.76 2009, 16 13.50 2009, 218 9.00 2009, 38 14.40 2009, 4 10.56 Assumes equals 0.6
19
Estimating the Trend T t = b 0 + b 1 t T t = trend value of time series in period t b 0 = intercept in trend line b 1 = slope of trend line t = time
20
Example, Trend Year, Quarter TimeSalesTrend line 2008, 10 100 104 2008,21 120 114.5 2008, 32 115 125 2008, 43 150 135.5 2009, 14 140 146 2009, 25 165 156.5 2009, 36 160 167 T t = 104 + 10.5(t)
21
Time Series and Trend
22
Time Series and Nonlinear Trends
23
Nonlinear Trends Make the trend a nonlinear function of time, such as: y = b 0 + b 1 t + b 2 t 2 If there is a constant percentage change in y over time use the natural log of y as the dependent variable: ln(y) = b 0 + b 1 t Which corresponds to: y = b 0 (t b1 )
24
Cyclical Variation If there is cyclical variation there will be evidence of a correlation between the variable and GDP (remember to adjust for inflation). When forecasting you would need to include a leading indicator into the model. Y t = b 0 + b 1 t +b 2 (housing starts in t-1)
25
Seasonal Variation Seasonal index – Assumes there is a constant percentage difference from the trend in a given season
26
Seasonal Index Interpret the index as the ratio of the average value for that season to the average for the year. How would you interpret an index value of 0.9 for the first quarter? On average values in the first quarter are 90% of the annual average. What if the index for the 2 nd quarter was 1.20? On average values in the 2 nd quarter are 20% above the annual average
28
Examples Assume the following quarterly indices: Q1 0.8 Q2 1.1 Q3 0.7 Q4 1.4 Deseasonalize the following data: Q1 Q2 Q3 Q4 400660560700 500600800500
29
Examples Assume t equals 1 in the first quarter of 2002 and the estimated trend for sales is T = 100 + 50t. What is the forecast for the 4 th quarter of 2009? T = 100 + 50(32) = 1700 What is the seasonalized forecast? 1700(1.4) = 2380
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.