Accounting for Spatial Variation of Land Prices in Hedonic Imputation House Price Indexes: A Semi- Parametric Approach Jan de Haan * and Yunlong Gong **

Slides:



Advertisements
Similar presentations
Dummy variables Hill et al chapter 9. Parameters that vary between observations Assumption MR1 The parameters are the same for all observations. k= the.
Advertisements

1 A House Price Index Based on the SPAR Method Paul de Vries (OTB Research Institute) Jan de Haan (Statistics Netherlands) Gust Mariën (OTB Research Institute)
Multiple Regression and Model Building
Chapter 9: Regression Analysis
Turun kauppakorkeakoulu  Turku School of Economics REGIONAL DIFFERENCES IN HOUSING PRICE DYNAMICS: PANEL DATA EVIDENCE European Real Estate Society 19th.
Integrating Land Use in a Hedonic Price Model Using GIS URISA 2001 Yan Kestens Marius Thériault François Des Rosiers Centre de Recherche en Aménagement.
Relationship between volatility and spatial autocorrelation in real estate prices Lo Y.F. Daniel Department of Real Estate and Construction The University.
Multiple Regression [ Cross-Sectional Data ]
Chapter 13 Multiple Regression
Chapter 12 Multiple Regression
Lecture 25 Regression diagnostics for the multiple linear regression model Dealing with influential observations for multiple linear regression Interaction.
Econ 140 Lecture 181 Multiple Regression Applications III Lecture 18.
QA-3 FRM-GARP Sep-2001 Zvi Wiener Quantitative Analysis 3.
Econ 140 Lecture 171 Multiple Regression Applications II &III Lecture 17.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
FRM Zvi Wiener Following P. Jorion, Financial Risk Manager Handbook Financial Risk Management.
Topic 3: Regression.
An Introduction to Logistic Regression
Ch. 14: The Multiple Regression Model building
1 An Introduction to Nonparametric Regression Ning Li March 15 th, 2004 Biostatistics 277.
Empirical Estimation Review EconS 451: Lecture # 8 Describe in general terms what we are attempting to solve with empirical estimation. Understand why.
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Chapter 14 Introduction to Multiple Regression Sections 1, 2, 3, 4, 6.
Purpose of Regression Analysis Regression analysis is used primarily to model causality and provide prediction –Predicts the value of a dependent (response)
Using administrative data sources to develop real estate price statistics: The case of Portugal Rui Evangelista, Statistics Portugal European conference.
MEASURING DWELLING PRICE CHANGES IN POLAND WITH THE APPLICATION OF THE HEDONIC METHOD.
Chapter 14 Introduction to Multiple Regression
Predicting House Prices with Spatial Dependence: A Comparison of Alternative Methods Steven C. Bourassa University of Louisville (USA) and Bordeaux Management.
Improved price index for condominiums by Han-Suck Song a and Mats Wilhelmsson b a) Department of Real Estate Economics, Royal Institute of Technology (KTH),
Stat 112 Notes 15 Today: –Outliers and influential points. Homework 4 due on Thursday.
The Land Leverage Hypothesis Land leverage reflects the proportion of the total property value embodied in the value of the land (as distinct from improvements),
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Managerial Economics Demand Estimation & Forecasting.
Handbook on Residential Property Price Indices Chapter 5: Methods Jan de Haan UNECE/ILO Meeting, May 2010.
An Index Number Formula Problem: the Aggregation of Broadly Comparable Items Mick Silver International Monetary Fund Paper presented at the 2008 World.
Calibrated imputation of numerical data under linear edit restrictions Jeroen Pannekoek Natalie Shlomo Ton de Waal.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 The Decomposition of a House Price index into Land and Structures Components: A Hedonic Regression Approach by W. Erwin Diewert, Jan de Haan and Rens.
A Spatial Hedonic Analysis of the Value of the Greenbelt in the City of Vienna, Austria Shanaka Herath, Johanna Choumert, Gunther Maier.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Tutorial I: Missing Value Analysis
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
1 Chapter 4: Elements for a Conceptual Framework Workshop on Residential Property Price Indices: Statistics Netherlands, The Hague, February 10-11, 2011.
Chapter 14 Introduction to Multiple Regression
Luciano Gutierrez*, Maria Sassi**
Workshop on Residential Property Price Indices
Multiple Regression Analysis and Model Building
Fundamentals of regression analysis
Chapter 9: Decomposing an RPPI into Land and Structures Components
Prepared by Lee Revere and John Large
Multiple Regression Analysis: Further Issues
Workshop on Residential Property Price Indices
Compiling a National House Price Index for Ireland
Regression III.
Alternative Approaches for Resale Housing Price Indexes
Chapter 7: Demand Estimation and Forecasting
Chapter 7: Demand Estimation and Forecasting
Chapter 9 Dummy Variables Undergraduated Econometrics Page 1
Workshop on Residential Property Price Indices
Presentation transcript:

Accounting for Spatial Variation of Land Prices in Hedonic Imputation House Price Indexes: A Semi- Parametric Approach Jan de Haan * and Yunlong Gong ** * Statistics Netherlands / Delft University of Technology ** Delft University of Technology SEM Conference 2015, Paris

2 Outline Background A simplification of the ‘builder’s model’ –Basic ideas –Adding characteristics and linearizing the model Location and spatial variation of land prices –Two models –Semi-parametric estimation: the MWGR method Hedonic imputation price indexes Empirical results Conclusions

3 Background Uniqueness of properties mainly due to location Location usually included in hedonic models at some aggregate level (postcode dummies), not at individual property level - spatial autocorrelation - location bias Land-structure split Aim of this paper Show how to account for spatial variation of land prices in hedonic house price indexes using geospatial information (longitude/latitude)

4 A simplification of the ‘builder’s model’ Builder’s model (Diewert, de Haan and Hendriks, 2015): value of property is sum of value of land and value of structure: : plot size in square meters : living space in square meters : price of land per square meter : price of structure (living space) per square meter

5 A simplification of the ‘builder’s model’ Potential problems No intercept (Multi)collinearity between plot size and structure size Heteroskedasticity Net depreciation Diewert, de Haan and Hendriks (2015): straight-line depreciation; adjusted value of structure : approximate age of structure in decades : depreciation rate

6 A simplification of the ‘builder’s model’ Writing in linear form, using (multiplicative) dummies for age category, and reparameterizing: No restrictions on parameters Functional form is neither continuous nor smooth Adding structure characteristics (e.g. number or rooms, type of house) Only categorical variables; dummies Ignoring interaction terms and reparameterizing

7 A simplification of the ‘builder’s model’ Fully linear model: Normalizing (dividing by structure size): : “normalized” property price : ratio of plot size to structure size Straightforward estimating equation (including intercept)

8 Location and spatial variation of land prices Assumption: location is capitalized into price of land, not into price of structures 1) Price of land varies across postcode areas k: : price per square meter of land for area k : multiplicative dummy for k 2) Price of land varies across individual properties:

9 Semi-parametric estimation Mixed Geographically Weighted Regression (MGWR) Parametric regression for estimating parameters for structure characteristics Non-parametric part (GWR) for estimating property-specific land prices Moving kernel window approach: - weighted regression on data of i and neighboring properties - decreasing function of distance to i (bi-square function) - bias-variance trade-off: choice of bandwidth using cross validation statistics

10 Hedonic imputation price indexes Hedonic double imputation Laspeyres, Paasche and Fisher house price indexes, e.g. (defined on base period sample) Predicted prices: Estimated quality-adjusted prices:

11 Hedonic imputation price indexes Estimated value shares for land and structures, and, sum to 1 due to double imputation E.g. Laspeyres price index for land: Big influence of properties with relatively large value shares (large plot sizes and high land prices)

12 Empirical results Data set City of “A” in northeastern part of the Netherlands (population around 60,000) Annual data for Total of 6,397 sales, excluding apartments and condominiums Geocoded by Statistics Netherlands Many characteristics but we only used plot size, living space, building period, type of house 57 observations removed (missing characteristics, outliers)

13 Empirical results Three models estimated, separately for each year: 1) No variation in land prices (“OLS”) 2) Variation across postcodes (“OLSD”) 3) Variation across individual properties (“MGWR”) [60 neighboring properties used in MGWR estimations] According to (corrected) AICc as well as RMSE: OLSD performs better than OLS MGWR performs slightly better than OLSD for each year

14 Parameter estimates for structures characteristics, 2007 OLSOLSDMGWR Intercept *** (46.93) *** (53.71) *** (57.51) Building period: *** (25.94) *** (36.67) *** (41.75) Building period: *** (23.36) *** (33.96) *** (41.69) Building period: *** (23.37) *** (32.59) *** (42.87) Building period: *** (21.64) * (26.55) *** (37.26) Terrace *** (35.17) *** (35.24) *** (37.32) Corner *** (31.77) *** (31.18) *** (34.07) Semidetached ** (47.96) *** (47.57) ** (48.73) Duplex *** (30.60) *** (30.17) *** (31.03)

15 Empirical results Intercept measures price per square meter of living space for detached houses built after 2000 (approx. EUR 1400 in 2007) Structures become less expensive as they get older Detached houses are more expensive than other types of houses All but 2 coefficients differ significantly from zero at 1% level

16 (Average) estimated land prices per square meter OLSOLSDMGWR

17 Estimated price of land per square meter, MGWR, 2007

Hedonic imputation Laspeyres house price index Hardly any difference between OLSD and MGWR OLS has downward bias

19 Hedonic imputation Paasche house price index OLS index upward biased

20 Hedonic imputation Fisher house price index Fisher index insensitive to choice of hedonic model Official (nationwide) SPAR index rises much faster

21 Hedonic imputation Fisher price indexes for land OLSD and MGWR similar but OLS very different Very volatile indexes

Hedonic imputation Fisher price indexes for structures Differences much smaller than for land, as expected Official construction cost index much flatter

23 Empirical results Are the trends of the indexes for land and structures plausible? No benchmark available for land For structures: official (nationwide) construction cost index flattens during second half of sample period; price indexes for structures keep rising house prices were still rapidly rising while construction cost index increased by only 4.9% during (CPI: 5.8%) quality change bias in construction cost index?

24 Empirical results Potential causes of volatility of the land and structure indexes Small number of observations Multicollinearity Maybe not: land and structure price changes do not consistently show opposite signs; VIF for ratio of plot size to structure size is low Heteroskedasticity Yes (Breusch-Pagan test for OLS and OLSD) Limited adjustment for quality-mix changes

25 Estimated value shares of land and structures, OLSD Also volatile Share of land on average 33%

26 Empirical results Are the (average) estimated share of land and land price per square meter plausible? Francke and Van de Minne (2015): share of land for ‘s- Hertogenbosch – a more prosperous city than “A” – on average 0.50 ( ) De Groot et al. (2015): substantial cross-city differences in the price of land in 2005, e.g. EUR 717 for Amsterdam, EUR 308 for ‘s-Hertogenbosch, EUR 184 for Leeuwarden Our estimate for “A” in 2005 of EUR 206 seems plausible

27 Conclusions The linearization and ‘normalization’ of the builder’s model is useful for estimating overall house price indexes Double imputation Fisher house price index is insensitive to choice of hedonic model Postcode dummies will suffice; no need to use geocode data; see also Hill and Scholz (2014) Land and structure price indexes are volatile (which is not unexpected as we did not use any exogenous information on price change of structures) Average level of land prices seems plausible Some doubts about official SPAR house price index and construction cost index

28 Future work Price per square meter of land depends on plot size: need to model nonlinear relation of plot size to structure size ratio Data for a bigger, less ‘homogeneous’ city, and longer sample period ( ) More structure characteristics Smoothing?