A Review of Benchmarking Methods G Brown, N Parkin, and N Stuttard, ONS.

Slides:



Advertisements
Similar presentations
Some thoughts on density surface updating 1.Major Updates every X years: refitting models (perhaps new kinds of models) to accumulated data over large.
Advertisements

Spreadsheet Modeling & Decision Analysis
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Lecture Notes #11 Curves and Surfaces II
Use of Kalman filters in time and frequency analysis John Davis 1st May 2011.
Weight in IIP Calculation Introduction Selection of weights updating of weights Fixed weight vs. chain index ( In terms of weights updating )
Year 4 Objectives: Number 1
Exponents and Scientific Notation.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Data mining and statistical learning - lecture 6
Basis Expansion and Regularization Presenter: Hongliang Fei Brian Quanz Brian Quanz Date: July 03, 2008.
Chapter 18 Interpolation The Islamic University of Gaza
Mixture Designs Simplex Lattice Simplex Centroid
Exponential Smoothing Methods
Moving Averages Ft(1) is average of last m observations
Jim Ramsay McGill University Basis Basics. Overview  What are basis functions?  What properties should they have?  How are they usually constructed?
Chapter 5 Time Series Analysis
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
1 Spreadsheet Modeling & Decision Analysis: A Practical Introduction to Management Science, 3e by Cliff Ragsdale.
Project  Now it is time to think about the project  It is a team work Each team will consist of 2 people  It is better to consider a project of your.
Chapter 13 Forecasting.
United Nations Statistics Division
Unido.org/statistics Analysis of Divergence of quarterly and Annual Index of Industrial Production Shyam Upadhyaya, Shohreh Mirzaei Yeganeh United Nations.
X-12 ARIMA Eurostat, Luxembourg Seasonal Adjustment.
Spreadsheet Problem Solving
8/7/2015Slide 1 Simple linear regression is an appropriate model of the relationship between two quantitative variables provided: the data satisfies the.
1 BABS 502 Moving Averages, Decomposition and Exponential Smoothing Revised March 11, 2011.
International Workshop on Industrial Statistics Dalian, China June 2010 Shyam Upadhyaya UNIDO Benchmarking of monthly/quarterly.
Constant process Separate signal & noise Smooth the data: Backward smoother: At any give T, replace the observation yt by a combination of observations.
Diane Stockton Trend analysis. Introduction Why do we want to look at trends over time? –To see how things have changed What is the information used for?
Economic Indexes Indexes in statistics Indexes are indicators of size comparison of any socio-economic process. Index number measures how much a variable.
LSS Black Belt Training Forecasting. Forecasting Models Forecasting Techniques Qualitative Models Delphi Method Jury of Executive Opinion Sales Force.
Graph Exponential Growth Functions
Estimating Potentials and Forecasting Sales
IB Chemistry Chapter 11, Measurement & Data Processing Mr. Pruett
Machine Learning Week 4 Lecture 1. Hand In Data Is coming online later today. I keep test set with approx test images That will be your real test.
Interactive Graphics Lecture 9: Slide 1 Interactive Graphics Lecture 9: Introduction to Spline Curves.
Exponential Functions Exponential functions Geometric Sequences.
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
1 Spreadsheet Modeling & Decision Analysis: A Practical Introduction to Management Science, 3e by Cliff Ragsdale.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved. 3-2 Business Forecasting with Accompanying Excel-Based ForecastX™
LINKING PSYCHOMETRIC RISK TOLERANCE WITH CHOICE BEHAVIOUR FUR Conference – July 2008 Peter Brooks, Greg B. Davies and Daniel P. Egan.
DSc 3120 Generalized Modeling Techniques with Applications Part II. Forecasting.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Time Series Forecasting Chapter 13.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 27 Time Series.
1 DSCI 3023 Forecasting Plays an important role in many industries –marketing –financial planning –production control Forecasts are not to be thought of.
Time Series Analysis and Forecasting
1 CSC 427: Data Structures and Algorithm Analysis Fall 2008 Algorithm analysis, searching and sorting  best vs. average vs. worst case analysis  big-Oh.
Binomial Distribution Derivation of the Estimating Formula for u an d ESTIMATING u AND d.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
10B11PD311 Economics. Process of predicting a future event on the basis of past as well as present knowledge and experience Underlying basis of all business.
1 Estimating the Term Structure of Interest Rates for Thai Government Bonds: A B-Spline Approach Kant Thamchamrassri February 5, 2006 Nonparametric Econometrics.
ISCG8025 Machine Learning for Intelligent Data and Information Processing Week 3 Practical Notes Application Advice *Courtesy of Associate Professor Andrew.
Series – Mortgage & Savings Question - You decide to buy a house and take out a mortgage for £60,000, at a fixed interest rate of 5% This 5% interest is.
A comparison of established and newly developed benchmarking methods Jennifer Davies, Duncan Elliott (ONS), Homesh Sayal, John Aston (University of Cambridge)
1 An experimental ISP for ICT for France 28 June 2006 STESWP Task Force on Services.
Benchmarking of Korean Quarterly GDP Benchmarking of Korean Quarterly GDP International Workshop on Consumption and Full Accounting of the Household Sector.
1 BABS 502 Moving Averages, Decomposition and Exponential Smoothing Revised March 14, 2010.
MGS3100_03.ppt/Feb 11, 2016/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Time Series Forecasting Feb 11, 2016.
Managerial Decision Modeling 6 th edition Cliff T. Ragsdale.
Statistics for Business and Economics Module 2: Regression and time series analysis Spring 2010 Lecture 7: Time Series Analysis and Forecasting 1 Priyantha.
Chapter 15 Forecasting. Forecasting Methods n Forecasting methods can be classified as qualitative or quantitative. n Such methods are appropriate when.
GCSE Additional Mathematics Information Seminar Dunadry Hotel 19 th November 2008.
How do you assign an error to a measurement?
ECE3340 Numerical Fitting, Interpolation and Approximation
Curve-Fitting Spline Interpolation
Some examples Noughts and crosses exercise
Basis Expansions and Generalized Additive Models (2)
Splines There are cases where polynomial interpolation is bad
Presentation transcript:

A Review of Benchmarking Methods G Brown, N Parkin, and N Stuttard, ONS

Overview Introduction What is benchmarking? What we did and why Some methods for benchmarking Some quality measures Comparison of methods Summary 2

Introduction Purpose: to recommend a method for benchmarking to ONS and wider GSS Benchmarking combines two time series of same phenomenon, measured at different frequencies Result: benchmarked series is higher quality Work funded from Quality Improvement Fund 3

What we did and why Identified appropriate benchmarking methods Tested using several hundred ONS time series Used range of quality measures to rank methods Made judgment to combine results from different quality measures Recommended a benchmarking method Update of ONS computer systems prompted examination of methods 4

Benchmarking Want good estimates of levels and growth Have two series measuring same phenomenon Different frequencies Higher frequency more timely, accurate growths o Indicator series Lower frequency delayed, more accurate levels o Benchmark series 5

Benchmarking Resulting high frequency series o Benchmarked series Has good estimates of growth combined with good estimates of level 6

Benchmarking Two types of relation between indicator and benchmark: o Point in time o Average 7

Benchmarking, point in time Example: unemployment monthly and quarterly Benchmarks apply to the third month in each quarter Third monthly estimate in each quarter is forced to equal benchmark 8

9

10

Benchmarking, average Example: turnover monthly and quarterly Benchmarks apply to each month in each quarter Average turnover of three months in each quarter is forced to equal benchmark 11

12

13

Non-negativity Most indicator series must be non-negative In those cases the benchmarked series must be non-negative too Process of benchmarking can produce negative benchmarked series 14

15

What we did and why Identified appropriate benchmarking methods Tested using several hundred ONS time series Used range of quality measures to rank methods Made judgment to combine results from different quality measures Recommended a benchmarking method Update of ONS computer systems prompted examination of methods 16

Benchmarking methods Methods suggested by ONS, variants with different splines o proc Expand (in SAS) o INTER o Kruger Denton Cholette-Dagum Constrained versions of the above for non- negativity 17

Benchmarking methods Methods suggested by ONS, variants with different splines o proc Expand (in SAS) o INTER o Kruger Denton Cholette-Dagum Constrained versions of the above for non- negativity 18

Benchmarking methods Methods suggested by ONS, variants with different splines o proc Expand (in SAS) o INTER o Kruger Denton Cholette-Dagum Constrained versions of the above for non- negativity 19

Benchmarking methods Methods suggested by ONS, variants with different splines o proc Expand (in SAS) o INTER o Kruger Denton Cholette-Dagum Constrained versions of the above for non- negativity 20

ONS methods (and variants) Summary: fits smooth curve through knots 1.Aggregate indicator series 2.Calculate ratio of aggregated to benchmark 3.Augment with fore/backcasts using X-12-ARIMA 4.Interpolate to frequency of indicator 5.Multiply indicator by interpolated series 6.Iterate 1 to 5 Variants use different ways to interpolate 21

Interpolation Three types of cubic spline 1.Proc Expand (point in time/average) 2.INTER (average) 3.Kruger (point in time) Progressively less prone to produce negative values 22

Denton type Summary: try to preserve movements in indicator Minimise a penalty function of differences or relative differences between indicator and benchmark Minimisation using either special methods or off-the-shelf methods for quadratic minimisation Denton usually set up to minimise first differences or proportionate first differences 23

Denton and Cholette-Dagum For indicator points with no benchmark: Denton carries forward the most recent difference between benchmark and indicator Cholette-Dagum assumes the difference decays to zero in a defined way Flexible in the way this is modelled We assume: o Decay is geometric o Rate of decay fixed in advance for all series 24

25

Non-negativity ONS suggestion: o Benchmark on log scale o Exponentiate o Distribute residual differences Optimisation approach for Denton type: o Set up basic method as a matrix problem o Add constraints as part of matrix setup o Solve using off-the-shelf optimiser in SAS 26

What we did and why Identified appropriate benchmarking methods Tested using several hundred ONS time series Used range of quality measures to rank methods Made judgment to combine results from different quality measures Recommended a benchmarking method Update of ONS computer systems prompted examination of methods 27

Time series used for testing Mixture of: o Monthly to quarterly o Quarterly to annual o Average and point in time Different lengths Included some awkward series (to test non- negativity) 28

What we did and why Identified appropriate benchmarking methods Tested using several hundred ONS time series Used range of quality measures to rank methods Made judgment to combine results from different quality measures Recommended a benchmarking method Update of ONS computer systems prompted examination of methods 29

How the methods were compared 1.Failures 2.Verification of benchmarking constraint 3.Preserving change 4.Revisions 5.Smoothness 6.Closeness 30

How the methods were compared 1.Failures – program fails to benchmark 2.Verification of benchmarking constraint 3.Preserving change 4.Revisions 5.Smoothness 6.Closeness 31

How the methods were compared 1.Failures 2.Verification of benchmarking constraint - benchmarked not equal to benchmark 3.Preserving change 4.Revisions 5.Smoothness 6.Closeness 32

How the methods were compared 1.Failures 2.Verification of benchmarking constraint 3.Preserving change – size and direction 4.Revisions 5.Smoothness 6.Closeness 33

How the methods were compared 1.Failures 2.Verification of benchmarking constraint 3.Preserving change 4.Revisions – size & bias when perturbing or adding benchmark 5.Smoothness 6.Closeness 34

How the methods were compared 1.Failures 2.Verification of benchmarking constraint 3.Preserving change 4.Revisions 5.Smoothness – relative variance of indicator and benchmarked 6.Closeness 35

How the methods were compared 1.Failures 2.Verification of benchmarking constraint 3.Preserving change 4.Revisions 5.Smoothness 6.Closeness – between indicator and benchmarked 36

How the methods were compared For each one of preserving change, revisions, smoothness and closeness, calculate: o For each method, for each time series, for different lengths of the series o Rank methods for each series and length o Average the ranks over all series o Plot and compare average ranks by length 37

38

39

Recommended method Around 100 plots compared Judgment made on overall best performing method Based on good performance and lack of bad performance Recommended method: Cholette-Dagum (0.8) 40

Summary Aim: recommend method for benchmarking to ONS and wider GSS Update of ONS computer systems prompted examination of methods Used several quality measures to rank methods Made judgment to combine results from different quality measures Recommended: Cholette-Dagum (0.8) 41

Any questions? 42