Bootstrapping – the neglected approach to uncertainty European Real Estate Society Conference Eindhoven, Nederlands, 15-18 June 2011 Paul Kershaw University.

Slides:



Advertisements
Similar presentations
Managerial Economics in a Global Economy
Advertisements

Hypothesis testing and confidence intervals by resampling by J. Kárász.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Sampling Distributions (§ )
Today: Quizz 11: review. Last quizz! Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions DEC 8 – 9am FINAL.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Estimation A major purpose of statistics is to estimate some characteristics of a population. Take a sample from the population under study and Compute.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Bootstrapping LING 572 Fei Xia 1/31/06.
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
Standard error of estimate & Confidence interval.
STAT 572: Bootstrap Project Group Members: Cindy Bothwell Erik Barry Erhardt Nina Greenberg Casey Richardson Zachary Taylor.
Bootstrapping applied to t-tests
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
Correlation and Linear Regression
Review of normal distribution. Exercise Solution.
Linear Regression and Correlation
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Biostatistics IV An introduction to bootstrap. 2 Getting something from nothing? In Rudolph Erich Raspe's tale, Baron Munchausen had, in one of his many.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
PARAMETRIC STATISTICAL INFERENCE
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Active Learning Lecture Slides For use with Classroom Response Systems Statistical Inference: Confidence Intervals.
Estimation: Confidence Intervals Based in part on Chapter 6 General Business 704.
Resampling techniques
Sampling And Resampling Risk Analysis for Water Resources Planning and Management Institute for Water Resources May 2007.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Bootstraps and Jackknives Hal Whitehead BIOL4062/5062.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Analysis of Chromium Emissions Data Nagaraj Neerchal and Justin Newcomer, UMBC and OIAA/OEI and Mohamed Seregeldin, Office of Air Quality Planning and.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 8 Interval Estimation Population Mean:  Known Population Mean:  Known Population.
An analysis of Time On Market and Advertised to Sale Price Differences OVER TIME European Real Estate Society Conference Milano, Italy, June 2010.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
Nonparametric Methods II 1 Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Estimating a Population Mean. Student’s t-Distribution.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Capital Management: Do You Have the Right Strategy? Casualty Actuarial Society 1999 Special Interest Seminar Dynamic Financial Analysis Chicago, IL, July.
Quantifying Uncertainty
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Application of the Bootstrap Estimating a Population Mean
Inference: Conclusion with Confidence
Quantifying uncertainty using the bootstrap
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Ch13 Empirical Methods.
Interval Estimation and Hypothesis Testing
Techniques for the Computing-Capable Statistician
Fractional-Random-Weight Bootstrap
Bootstrapping and Bootstrapping Regression Models
Introductory Statistics
Presentation transcript:

Bootstrapping – the neglected approach to uncertainty European Real Estate Society Conference Eindhoven, Nederlands, June 2011 Paul Kershaw University of South Australia

Bootstrapping – the neglected approach to uncertainty Slide 2 Overview The history of confidence intervals Pedagogical predilection to a parametric view Real estate research is NOT normal Do not provide a measure of probability Enter the Jackknife Monte Carlo simulation & Bootstrapping The basic algorithms Real World Applications A better mousetrap

Bootstrapping – the neglected approach to uncertainty Slide 3 Introduction The origins of hypothesis tests is 1279 Confidence intervals were derived in 1937 A confidence interval estimates the uncertainty about the true value of some population parameter 50-year lag before medical journals for example advocated their use The lazy approach is to assume a normal distribution

Bootstrapping – the neglected approach to uncertainty Slide 4 Not Normal Very little about real estate can be considered to follow a Normal distribution including: Prices, Land area, building area, age, number of bedrooms, location, physical condition, construction, tenant’s covenant, heating, etc. Linear regression techniques are regularly applied, averages, standard errors and parametric confidence intervals proffered. Why? Is it because we are taught to do it that way – or because we teach it that way – gloss over the ignored assumptions – just give me a number from the printout.

Bootstrapping – the neglected approach to uncertainty Slide 5 Not a measure of Probability This begs the question “what is the confidence interval of a correlation coefficient?” and leads to the second question “why is it so rarely reported?” What is a realistic confidence interval for a computer generated valuation using a linear regression model? Most proprietary AVMs provide their own, often ill defined, assessment of accuracy that is usually somewhat nebulous.

Bootstrapping – the neglected approach to uncertainty Slide 6 Enter the Jackknife Early efforts in the 1950s revolved around the Jackknife (Quenouille, M). The Jackknife provides a technique for estimating the bias and standard error of an estimate irrespective of the shape of the underlying distribution. The following example is based upon the work of Efron, B; The datapoints are LSAT, the average score for the class on a national law test, and GPA, the average undergraduate grade-point average for the class.

Bootstrapping – the neglected approach to uncertainty Slide 7 Sample Data

Bootstrapping – the neglected approach to uncertainty Slide 8 The basic algorithms Compute sample statistics on n separate samples of size n-1. Each sample is the original data with a single observation omitted. Jackknife Heuristic: Remove one data point only and calculate the statistic of interest to give estimate 1 Repeat for each data points to give estimates 2, 3, 4 …n Calculate the percentiles of interest to obtain the confidence interval

Bootstrapping – the neglected approach to uncertainty Slide 9 Jackknife Calculations

Bootstrapping – the neglected approach to uncertainty Slide 10 Monte Carlo & Bootstrapping Monte Carlo simulation caught the imagination of practitioners and researchers following Hertz, David; 1964, Harvard Business Review Monte Carlo simulation uses repeated sampling to determine the properties of some result of interest The re-sampling is carried out with replacement If we apply this technique to the previous Jackknife data we would be Bootstrapping [Adventures of Baron Munchausen] Bootstrapping is repeatedly re-sampling with replacement, calculating the statistic of interest and recording its distribution.

Bootstrapping – the neglected approach to uncertainty Slide 11 Bootstrap Algorithm Remark: to calculate the dispersion of the mean DataArray() = n data points MeanResults(1000) For i = 1 to 1000 Sum=0 For j = 1 to n Sum = Sum + DataArray(RandomBetween(1,n)) Next j MeanResults(i) = Sum / n Next i

Bootstrapping – the neglected approach to uncertainty Slide 12 Real World Application 1 What annoys me most – residential price change reporting and hot spotting Below are sale prices for Q and Q for Detached houses in Aberfoyle Park, South Australia Median 382,500 Average 409,932 Median 385,000 Average 391,946 ChangeMedian0.65% ChangeAverage-4.39%

Bootstrapping – the neglected approach to uncertainty Slide 13 Bootstrap Results 1000 iterations The degree of uncertainty is clearly illustrated. The median has a 95% “confidence interval” of ….

Bootstrapping – the neglected approach to uncertainty Slide 14 A better mousetrap The traditional approach is to select n from n with replacement and calculate statistic of interest and repeat m times This is inefficient for most statistics of interest including the mean, median, standard deviation or correlation coefficient For example the mean is sum/n If for each iteration we remove just one random element and replace it with another random element we can adjust the sum by subtracting the value of the removed element and adding the value of the ingoing element If n is say 50 we save 48 mathematical operations

Bootstrapping – the neglected approach to uncertainty Slide 15 Summary The bootstrap is simple to implement The results are meaningful and easy to interpret No specious assumptions regarding underlying distributions are required Widely accepted It should be embraced by all researchers and practitioners

Bootstrapping – the neglected approach to uncertainty Slide 16 Yesteryear’s Joys Bootstrap Methods: Another Look at the Jackknife B. Efron Source: Annals of Statistics Volume 7, Number 1 (1979), 1-26.