A Critical Examination of Hedonic Analysis of a Regression Model (HARM) and META-ANALYSIS Albert R. Wilson BSSE, MBA, CRE (Ret) 1.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Simple Linear Regression and Correlation
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 12 Simple Regression
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Statistics for Business and Economics
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 13: Inference in Regression
Chapter 11 Simple Regression
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 23 Multiple Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Extension to Multiple Regression. Simple regression With simple regression, we have a single predictor and outcome, and in general things are straightforward.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter 16 Data Analysis: Testing for Associations.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Robust Estimators.
1 The Decomposition of a House Price index into Land and Structures Components: A Hedonic Regression Approach by W. Erwin Diewert, Jan de Haan and Rens.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Correlation & Regression Analysis
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Regression Analysis Presentation 13. Regression In Chapter 15, we looked at associations between two categorical variables. We will now focus on relationships.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Reconciling the Value Estimates Basic Real Estate Appraisal: Principles & Procedures – 9 th Edition © 2015 OnCourse Learning Chapter 15.
Stats Methods at IC Lecture 3: Regression.
Chapter 13 Simple Linear Regression
More Multiple Regression
Chapter 11 Simple Regression
Correlation and Simple Linear Regression
Linear Regression/Correlation
Chapter 10 Correlation and Regression
More Multiple Regression
Correlation and Simple Linear Regression
More Multiple Regression
Product moment correlation
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

A Critical Examination of Hedonic Analysis of a Regression Model (HARM) and META-ANALYSIS Albert R. Wilson BSSE, MBA, CRE (Ret) 1

Regression Model A model intended to allow an exploration of the hypothetical relationship between possible explanatory variables and the sales price 2

Regression Model Reflection of reality The touchstone of that reality? Actual market participants 3

“Estimated” versus “Predicted” Estimated = Sale IN database Predicted = Sale NOT IN database 4

Predicted Sales Prices At the mean predicted sales price variance is larger than estimated variance by σ 2 (variance in the data) 5

Mean Confidence Intervals (MCI) Estimated and Predicted MCI FOR PREDICTED 4.38 TIMES MCI FOR ESTIMATED 6

DATABASE EDITING GARBAGE IN => GARBAGE OUT (GIGO) 7

Case Example Influence on the Removal of “Flipping Transactions” on the Predicted Prices for 33 Properties PREDICTED SALES PRICES PROPERTY NO.AS PRESENTEDFLIPS REMOVED% CHANGE SUM5,069,2394,018,112(1,051,127) n Adj. R-squared

Editing and Confirmation of Data STEP 1: Edit to identify obvious issues (the desk edit) Case Example Assessor’s Data4,325Removed % R-Squared MLS Data1,888Removed % 9

Editing and Confirmation of Data STEP 2: Identify sales that are not appropriate to the analysis 10

Editing and Confirmation of Data STEP 3: Sales confirmation A values-neutral interview of sale participants OBJECT: to elicit the primary factors motivating the conclusion of the sale price MUST NOT INTRODUCE ANALYST OPINION THIS IS THE ONLY MEANS OF IDENTIFYING/CONFIRMING THE REASONS FOR A CONCLUDED PRICE 11

Regression Model Considerations Faithfully represent: Identified concerns of actual market participants Restrictions imposed by the data Estimates of prices the ONLY VERIFIABLE OUTPUT 12

Coefficient Calculation Result of iterative calculations designed to provide the most accurate estimates of sales prices in database 13

Coefficient Calculation Goodness of Fit Measures of the Goodness of Fit apply only to the relationship between the estimated and actual sales prices in the database They do not apply to the coefficients 14

Most commonly-cited Goodness-of-Fit Measure R-Squared (Coefficient of Determination) 15

R-Squared Generally-applied interpretation: –R-Squared is the amount of variance “explained” by the model 16

Low R-Squared Models Mathematically, as the R-Squared approaches 0.30, it becomes more likely that the model is only measuring random effects 17

The Omitted and Additional Variable Problem Omitting generally increases magnitude and statistical significance of the remaining coefficients Adding generally decreases the magnitude and statistical significance of the remaining variable coefficients 18

Illustration of Omitting or Adding a Variable Base ModelAdded Variable–APNOmitted Variable–Pool VariableCoeff.t-statCoeff.t-stat% ChangeCoeff.t-stat % Change Intercept67, , %66, % APN Fixtures2, , %2, % NoPatio(12,801)-7.77(5,036) %(13,451) % SqFt % % Pool8, , % Garage19, , %19, % Middle Ring(16,141)-11.24(11,230) %(15,276) % Inner Ring(8,875)-4.52(7,114) %(8,012) % , % % 2001(2,017) %(2,028) % 2002(719)-0.253, %(615) % 20037, , %7, % , , %40, % , , %131, % , , %159, % R-Squared

Consequences of Variable Selection Including the Assessor’s Parcel Number APN Coefficient Value0.023 t-statistic8.98 Mean Value30,834,360 R-Squared0.83 Mean Sale Price$211,000 Results in an incremental increase in the sales price of x 30, = $709,190 (APN Coef.)x(Mean Value)=(Incremental Increase) 20

Consequences of Variable Selection Omission of a Variable: Removal of “Pool”; present in 38% of properties –SQFT Cofficient changed from $40.79 to $41.79 –Approximately the same t-statistic Removal of “Fixtures”; present in 100% of properties –SQFT Coefficient changed from $40.79 to $46.50 –T-statistic =

Coefficients Coefficients are simply multipliers for the explanatory variable 22

Causation in Real Estate From the Real Estate Appraiser’s perspective: 1.Causation demonstrated through sales confirmation interviews. 2.Causation NEVER proven through a regression. 23

Strengths and Weaknesses Can never be better than the data Requires significant amount of data: five to 15 or more sales Upper limit to the amount of data: too much may be worse than too little Guide: Are the sales competitive to the subject? Estimate of sales prices most accurate at the mean value of the data Variance of a predicted sales price larger than variance of estimated Thousands of possible regression models 24

Further Considerations Absent standards, the “Rubber Ruler” may apply When recognized and published standards are not used, author must demonstrate the accuracy and reliability of his/her work 25

Hedonic Analysis

The Hedonic Assumption The coefficient accurately and only represents the contribution of the declared meaning of the explanatory variable to the sale price 27

Hedonic Analysis The validity of the hedonic assumption must be demonstrated 28

“Revealed Preference” Idea cannot be supported for real estate

Supporting Literature Not a single paper demonstrated the validity of the hedonic assumption PLUS NO indication of confirmation of raw data NO indication of adherence to any recognized / published standards NO indication of confirmation of results with the normal or typical market participant THE RUBBER RULER EFFECT IS MUCH IN EVIDENCE. 30

Regression Model Accuracy If the regression model is inaccurate, then there is no reason to expect the coefficients to be accurate or meaningful. Therefore the HARM cannot be accurate. 31

CASE EXAMPLE TO POOL OR NOT TO POOL Using the data from the previous case. Does a pool influence value? By how much? The Hedonic Approach, the coefficient is the marginal contribution to value. 32

COMBINED POOL AND NO POOLS COMBINED POOL AND NO POOLS, POOL COEFFICIENT SET TO ZERO Variable COEFFICIEN T MEAN VALUES EXPECTED VALUES COEFFICIEN T MEAN VALUES EXPECTED VALUES Intercept54, ,09054, ,090 ORIG_FIXTURES2, ,4912, ,491 ORIG_NOPATIO-14, ,800-14, ,800 ORIG_POOL9, ,4829, ORIG_SQF , ,815 ORIG_X_3GARA GE 16, ,48516, ,485 SY20005, ,9805, ,980 EXPECTED MEAN SALE PRICE 184, ,061 Adj R

TO POOL OR NOT TO POOL (CONT.) What are the coefficients if there is no pool? 34

COMBINED WITH NO POOL VARIABLE VariableCOEFFICIENTMEAN VALUESEXPECTED VALUES Intercept ,788 ORIG_FIXTURES3, ,957 ORIG_NOPATIO-14, ,006 ORIG_SQF ,822 ORIG_X_3GARAGE16, ,770 SY20005, ,728 EXPECTED MEAN SALE PRICE 184,059 Adj R

Comparision Orig Fixt 2,805 3,088 Orig-nopatio -14, ,725 Orig-no pool 9,162 NA Orig-sqf Orig-garage 16,21316,925 SY2000 5,980 5,728 ESP $184,513 $184,059 R-sq

POOL OR NOT TO POOL (CONT.) WHAT HAPPENS IF WE CONSIDER A DATABASE WITH POOLS, AND SEPARATELY A DATABASE WITHOUT POOLS? 37

WITH POOL ON PROPERTYWITHOUT POOL ON PROPERTY Variable COEFFICIEN T MEAN VALUES EXPECTED VALUES COEFFICIENT MEAN VALUES EXPECTED VALUES Intercept65, ,95854, ,994 ORIG_FIXTURES2, ,1792, ,719 ORIG_NOPATIO-15, ,391-14, ,084 ORIG_POOL ORIG_SQF41.632, , , ,956 ORIG_X_3GARA GE 15, ,30816, ,056 SY20004, ,2117, ,210 EXPECTED MEAN SALE PRICE 204, ,850 Adj R

POOLS AND NO POOLS SEPARATELY ESTIMATED SALE PRICE WITH POOL $204,954 – R-SQUARED0.87 ESTIMATED SALE PRICE W/O POOL $170,805 – R-SQUARED

The Coefficient – What Counts? ALL THAT STATISTICAL SIGNIFICANCE CAN TELL US IS THAT FOR THIS MODEL AND DATABASE THE COEFFICIENT IS A SIGNIFICANT (OR INSIGNIFICANT) MULTIPLIER FOR THE EXPLANATORY VARIABLE. NOTHING MORE. 40

The Appropriate Standard: Economic Significance For us, economic significance is determined by what the normal or typical participant considers important to the conclusion of the transaction. 41

A Criticality: NOT ONE hedonic analysis encountered to date has actually asked this question: “What was important to you in concluding your transaction?” 42

Hedonic Analysis of a Regression Model (HARM) is: Highly inaccurate and unreliable method Not appropriate for appraisal work Observations apply to hedonic analysis NOT regression models! 43