1 Regression-based Approach for Calculating CBL Dr. Sunil Maheshwari Dominion Virginia Power.

Slides:



Advertisements
Similar presentations
Sociology 601 Class 24: November 19, 2009 (partial) Review –regression results for spurious & intervening effects –care with sample sizes for comparing.
Advertisements

Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.16 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: interactive explanatory variables Original citation: Dougherty, C. (2012)
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
Sociology 601, Class17: October 27, 2009 Linear relationships. A & F, chapter 9.1 Least squares estimation. A & F 9.2 The linear regression model (9.3)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
EC220 - Introduction to econometrics (chapter 2)
Adaptive expectations and partial adjustment Presented by: Monika Tarsalewska Piotrek Jeżak Justyna Koper Magdalena Prędota.
Valuation 4: Econometrics Why econometrics? What are the tasks? Specification and estimation Hypotheses testing Example study.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
1 Multiple Regression EPP 245/298 Statistical Analysis of Laboratory Data.
Introduction to Regression Analysis Straight lines, fitted values, residual values, sums of squares, relation to the analysis of variance.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Sociology 601 Class 23: November 17, 2009 Homework #8 Review –spurious, intervening, & interactions effects –stata regression commands & output F-tests.
A trial of incentives to attend adult literacy classes Carole Torgerson, Greg Brooks, Jeremy Miles, David Torgerson Classes randomised to incentive or.
Interpreting Bi-variate OLS Regression
1 Zinc Data EPP 245 Statistical Analysis of Laboratory Data.
Sociology 601 Class 26: December 1, 2009 (partial) Review –curvilinear regression results –cubic polynomial Interaction effects –example: earnings on married.
1 INTERPRETATION OF A REGRESSION EQUATION The scatter diagram shows hourly earnings in 2002 plotted against years of schooling, defined as highest grade.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: precision of the multiple regression coefficients Original citation:
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy variable classification with two categories Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: the effects of changing the reference category Original citation: Dougherty,
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
1 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. This sequence provides an example.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
EXERCISE 5.5 The Stata output shows the result of a semilogarithmic regression of earnings on highest educational qualification obtained, work experience,
Returning to Consumption
Country Gini IndexCountryGini IndexCountryGini IndexCountryGini Index Albania28.2Georgia40.4Mozambique39.6Turkey38 Algeria35.3Germany28.3Nepal47.2Turkmenistan40.8.
MultiCollinearity. The Nature of the Problem OLS requires that the explanatory variables are independent of error term But they may not always be independent.
EDUC 200C Section 3 October 12, Goals Review correlation prediction formula Calculate z y ’ = r xy z x for a new data set Use formula to predict.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
F TEST OF GOODNESS OF FIT FOR THE WHOLE EQUATION 1 This sequence describes two F tests of goodness of fit in a multiple regression model. The first relates.
MULTIPLE REGRESSION WITH TWO EXPLANATORY VARIABLES: EXAMPLE 1 This sequence provides a geometrical interpretation of a multiple regression model with two.
Chapter 12: Linear Regression 1. Introduction Regression analysis and Analysis of variance are the two most widely used statistical procedures. Regression.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Wiener Institut für Internationale Wirtschaftsvergleiche The Vienna Institute for International Economic Studies Structural change, productivity.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Econ 314: Project 1 Answers and Questions Examining the Growth Data Trends, Cycles, and Turning Points.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
(1)Combine the correlated variables. 1 In this sequence, we look at four possible indirect methods for alleviating a problem of multicollinearity. POSSIBLE.
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Lecture 5. Linear Models for Correlated Data: Inference.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
STAT E100 Section Week 12- Regression. Course Review - Project due Dec 17 th, your TA. - Exam 2 make-up is Dec 5 th, practice tests have been updated.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
1 CHANGES IN THE UNITS OF MEASUREMENT Suppose that the units of measurement of Y or X are changed. How will this affect the regression results? Intuitively,
SEMILOGARITHMIC MODELS 1 This sequence introduces the semilogarithmic model and shows how it may be applied to an earnings function. The dependent variable.
GRAPHING A RELATIONSHIP IN A MULTIPLE REGRESSION MODEL The output above shows the result of regressing EARNINGS, hourly earnings in dollars, on S, years.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance.
VARIABLE MISSPECIFICATION I: OMISSION OF A RELEVANT VARIABLE In this sequence and the next we will investigate the consequences of misspecifying the regression.
QM222 Class 9 Section A1 Coefficient statistics
QM222 Class 10 Section D1 1. Goodness of fit -- review 2
QM222 Class 11 Section A1 Multiple Regression
The slope, explained variance, residuals
QM222 Your regressions and the test
QM222 Class 15 Section D1 Review for test Multicollinearity
Covariance x – x > 0 x (x,y) y – y > 0 y x and y axes.
EPP 245 Statistical Analysis of Laboratory Data
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

1 Regression-based Approach for Calculating CBL Dr. Sunil Maheshwari Dominion Virginia Power

2 Benefits of Regression Approach This approach can be used to calculate CBL for both weather-sensitive and non-weather- sensitive loads. The science of regression theory is well developed. Most statistical packages such as SAS, STATA, SPSS, etc. can perform regression analysis. Regression equations can be easily updated on a periodic basis (perhaps annually).

3 Description of Regression Approach The idea is to treat load as a function of explanatory factors such as weather, time of day, day of the week, etc. Estimate the relationship between load and explanatory variables using a variety of functional forms. Pick the functional form that gives the highest R-sq adjusted or the lowest Root Mean Squared Error (RMSE)

4 Functional Forms for Weather-Sensitive Loads Form 1: Load = a + b*CDD + c*HDD Form 2: Load = a + Σ (b i * CDD i ) + Σ (c j * HDD j ); Temperature breakpoints to be established based on Regression Analysis Form 3: Load = a + Σ (b i * CDD i ) + Σ (c j * HDD j ) + Σ (d k * hour k ); In addition to weather, each hour impacts the load as well.

5 Functional Forms for Non-Weather Sensitive Loads The following forms may do a good job of estimating CBL for Industrial loads: Form 4: Load = a + Σ (b k * hour k ) + Σ (c j * month j ); Form 5: Load = a + b * TimeTrend + Σ (c k * hour k ) + Σ (d j * month j );

6 Applying Theory into Practice… For one of our DSR participants (a Building Complex), we estimated the relationship between 2006 hourly Load and Weather using Functional Form 3: Load = a + Σ (b i * CDD i ) + Σ (c j * HDD j ) + Σ (d k * hour k ); Following temperature breakpoints were used: – For Heating Degree Days (HDD) – 65, 55, 40, 25 – For Cooling Degree Days (CDD) – 65, 80, 90, 100

7 We further sliced the data by: –Day type Weekdays Weekends and Holidays –Season Winter – December - March Summer – June - September Shoulder – April, May, October, November Applying Theory into Practice…

8 Partial Regression Output (Summer, Weekday) regress load cdd_65to80 cdd_80to90 cdd_90to100 cdd_over100 hdd hddsq hour2-hour24 if year==2006 & weekdayflag==1 & holiday==0 & season=="Summer" Source | SS df MS Number of obs = F( 28, 2011) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = load | Coef. Std. Err. t P>|t| [95% Conf. Interval] cdd_65to80 | cdd_80to90 | cdd_90to100 | cdd_over100 | (dropped) hdd | hddsq | hour2 | hour3 | hour4 |

9 Predicted Load (CBL) based on 2006 data applied to 2007 data Using regression parameters from previous slide, predict the load for Compare predicted load (CBL) to actual load. Absolute average % deviation between Actual and Predicted Load was less than 5%. Regression Equations will be re-estimated every year

10 Actual vs Predicted (CBL)

11 Actual Load vs Predicted (CBL) - Summer, Weekday (Absolute Average % Deviation = 3%)

12 Actual Load vs Predicted (CBL) - Winter, Weekday (Absolute Average % Deviation = 3.6%)

13 Actual Load vs Predicted (CBL) - Shoulder, Weekday (Absolute Average % Deviation = 3.7%)

14 Actual Load vs Predicted (CBL) - Weekend/Holiday (Absolute Average % Deviation = 4.7%)

15 Conclusions 4 Equations with single variable hourly temperature –Summer, Weekday –Winter, Weekday –Shoulder, Weekday –Weekends / Holidays Good fit (R-sq adjusted > 78% in all cases). Simplified calculations, and the regression equations can be easily updated.