Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis.

Slides:



Advertisements
Similar presentations
Katherine Jenny Thompson
Advertisements

Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
UNDERSTANDING AND ACCESSING FINANCIAL MARKET Nia Christina
Using Business Taxation Data as Auxiliary Variables and as Substitution Variables in the Australian Bureau of Statistics Frank Yu, Robert Clark and Gabriele.
Research on Improvements to Current SIPP Imputation Methods ASA-SRM SIPP Working Group September 16, 2008 Martha Stinson.
Objectives 10.1 Simple linear regression
Weighting Methodology for the Private Landlords Survey Robert Bucknall, ONS.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Examining the use of administrative data for annual business statistics Joanna Woods, Ria Sanderson, Tracy Jones, Daniel Lewis.
Econometric Details -- the market model Assume that asset returns are jointly multivariate normal and independently and identically distributed through.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
Multiple Linear Regression Model
Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.
Concept of Reliability and Validity. Learning Objectives  Discuss the fundamentals of measurement  Understand the relationship between Reliability and.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
1 Methods for detecting errors in VAT Turnover data Phil Lewis Processing, Editing and Imputation branch Business Statistics Methods-Survey Methodology.
Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Improving Quality in the Office for National Statistics’ Annual Earnings Statistics Pete Brodie & Kevin Moore UK Office for National Statistics.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)
Chapter 11 Simple Regression
Measuring the quality of regional estimates from the ABS Jennie Davies and Daniel Ayoubkhani.
The Edit Anders Norberg, Statistics Sweden (SCB) Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Consultants: Jillian Lyon and Mary Ehlers The Impact of Atypical Antipsychotic Use on Obstructive Sleep Apnea.
GDP measurement issues Graeme Walker Head of National Accounts Productivity Puzzle Seminar: 16 October 2012.
2-1 MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4)
Sébastien CHAMI 5 May, 2010 Reengineering French structural business statistics An extended use of administrative data.
Integrating administrative and survey data in the new Italian system for SBS: quality issues O. Luzi, F. Oropallo, A. Puggioni, M. Di Zio, R. Sanzo Nurnberg,
Lecture 22 Dustin Lueker.  The sample mean of the difference scores is an estimator for the difference between the population means  We can now use.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
A generic tool to assess impact of changing edit rules in a business survey – SNOWDON-X Pedro Luis do Nascimento Silva Robert Bucknall Ping Zong Alaa Al-Hamad.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Keynesian Hospitals: Public Employment and Political Pressure Andrew E. Clark (Paris School of Economics and IZA) Carine Milcent (Paris School of Economics)
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
Investigating improvements in quality of survey estimates by updating auxiliary information in the sampling frame using returned and modelled data Alan.
NYGEAR UP State Evaluation State Technical Workshop February 23, 2015.
ESSnet AdminData Methods of estimation for business statistics variables that cannot be obtained from administrative data sources (WP3) Duncan Elliott.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
Chapter 16 Data Analysis: Testing for Associations.
1 Selective data editing Development & implementation Q 2010 Helsinki Jörgen Svensson Process Owner Statistics Sweden (SCB)
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
© The McGraw-Hill Companies, Inc., 2002 McGraw-Hill/Irwin Slide Cost Estimation.
M Thomas Is the IPO Pricing Process Efficient? Michelle Lowry G William Schwert (Latest Draft February 2003)
Topic (iii): Macro Editing Methods Paula Mason and Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Click to edit Master title style Midterm 3 Wednesday, June 10, 1:10pm.
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Network analysis as a method of evaluating support of enterprise networks in ERDF projects Tamás Lahdelma (Urban Research TA, Finland)
Psychology 202a Advanced Psychological Statistics November 3, 2015.
Looking for statistical twins
to see if program cost causes more in savings. $1.74
Correlation, Bivariate Regression, and Multiple Regression
Notes on Logistic Regression
Improving the efficiency of editing in ONS business surveys
VAT data in Business Register and Business Statistics
Statistical Office of the Republic of Slovenia
STA 291 Summer 2008 Lecture 23 Dustin Lueker.
STA 291 Spring 2008 Lecture 23 Dustin Lueker.
Chapter 6 Logistic Regression: Regression with a Binary Dependent Variable Copyright © 2010 Pearson Education, Inc., publishing as Prentice-Hall.
Clinical prediction models
Chapter 13 Excel Extension: Now You Try!
Chapter 15 Excel Extension: Now You Try!
Presentation transcript:

Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis

Overview Background Creating predictors Using predictors to improve current edit rules Using predictors to improve selective editing Conclusions

Background ONS desire to improve efficiency of business statistics through greater use of admin data Investigate possibility of improving editing by incorporating admin data Use VAT data to improve editing for Annual Business Survey (ABS) Create predictors for key ABS variables based on VAT Turnover and Expenditure data Use predictors to test possible improvements to current ABS micro editing and new selective editing approach

Creating predictors Created predictors for 8 key variables Turnover, Purchases, Total taxes, Opening and Closing stocks, Net Capex, GVA, Employment costs Used multivariate linear regression to predict 8 variables based on VAT Turnover and Expenditure Additional explanatory variables from business register – Industry, Size and Region Tried combining with logistic regression to deal with zeros Developed models using previous period data and applied coefficients to current VAT data

Using predictors in current ABS editing Range of edit rules currently used for ABS, defined by industrial sector – Catering, Production Mostly based on relationships between variables Developed new rules based on VAT predictors: Fine-tuned x to get similar failures to current rules and evaluated effect on estimated bias Also tested similar rules using previous survey response as predicted value

Results for current micro editing In both sectors it is possible to greatly improve edit rule efficiency for all variables except Net Capex For Catering sector, similar improvements are possible using previous response instead of VAT data For Production sector, using VAT predictors leads to much greater improvements than simply using previous response

Using predictors in selective editing for ABS Currently investigating Selekt (Stats Sweden) selective editing method for ABS Try to improve study results for Catering and Production sectors using VAT predictors Selekt selective editing score: Try using VAT predictors as expected values in Impact part of score and test variables in Suspicion In place of previous value where available, class median where not

Results for selective editing For Catering, the VAT predictors perform less well than previous survey values in terms of estimated bias introduced (for same level of savings) For Production, it was possible to achieve the same results using VAT predictors rather than previous values, but there was no improvement May be possible to use VAT data to improve Selekt implementation in some other way as our understanding of the software deepens

Conclusions It is possible to make large improvements to current ABS micro edits by re-defining the rules For one sector tested, it is necessary to use VAT data to get the biggest benefits We have not been able to improve the application of Selekt to ABS using the predictors, but there may still be a role in Selekt for VAT data Results are for two out of seven ABS sectors – improvements may be possible in other sectors