Improvements in editing methods and processes for use of Value Added Tax data in UK National Accounts Martina Portanti and Robert Breton Office for National.

Slides:



Advertisements
Similar presentations
Chapter 9. Performance Management Enterprise wide endeavor Research and ascertain all performance problems – not just DBMS Five factors influence DB performance.
Advertisements

Improvements to the Quality of Tax Data in the Context of their Use in Business Surveys at Statistics Canada François Brisebois, Martin Beaulieu, Richard.
Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
1 Editing Administrative Data and Combined Data Sources Introduction.
Editing of mixed source data for turnover statistics Jeffrey Hoogland (SN) Work Session on Statistical Data Editing (Ljubljana, Slovenia, 9-11 May 2011)
08/08/2015 Statistics Canada Statistique Canada Paradata Collection Research for Social Surveys at Statistics Canada François Laflamme International Total.
Eurostat Statistical Data Editing and Imputation.
Measuring the quality of regional estimates from the ABS Jennie Davies and Daniel Ayoubkhani.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
1 Presentation to OG6 Canberra, Australia May 2011 Statistical Uses of Administrative Data in Canada.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
New sources – administrative registers Genovefa RUŽIĆ.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
Workshop on Price Index Compilation Issues February 23-27, 2015 Data Collection Issues Gefinor Rotana Hotel, Beirut, Lebanon.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Profiling in Switzerland Costs and benefits David Ackermann 28th September 2010.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
Economic Statistic Directorate Producer Price Index in Palestine Ashraf Samara 2016.
Expanding the Role of Synthetic Data at the U.S. Census Bureau 59 th ISI World Statistics Congress August 28 th, 2013 By Ron S. Jarmin U.S. Census Bureau.
PFTAC GDP Compilation and Forecasting Workshop Use of administrative data Suva, Fiji October 17-21, 2016.
Integrating economic statistics in the Netherlands
Theme (v): Managing change
Oslo City Group Clean Technology Satellite Account
Redesigning French structural business statistics, using more administrative data ICESIII, Montréal, june 2007.
The Development of Statistical Business Registers in
Presentation by Eurostat
<Name of Product>Pilot Closeout Meeting <Customer Name>
DEFECT PREDICTION : USING MACHINE LEARNING
Kevin Moore Head of Platforms Development and Support Branch
Integra Telecommunications – Prophix story
Ivo Havinga United Nations Statistics Division
MTM Tools key to running
ESSnet on Consistency Workshop
Regional Workshop on Short-term Economic Indicators and Service Statistics September 2017 Chiba, Japan Alick Nyasulu SIAP.
Re-engineering the French Business Register ( )
Increased Efficiency and Effectiveness
Quality Aspects and Approaches in Business Statistics
Survey phases, survey errors and quality control system
Profiling in Switzerland Costs and benefits
Improving the efficiency of editing in ONS business surveys
The main results of the project and plans for the future - surviving from the tax reform and rationalization of the editing process Ms. Suvi Kiema and.
YTY − an integrated production system for business statistics
National Accounts Metadata
Automated Testing and Integration with CI Tool
Survey phases, survey errors and quality control system
Electronic Data Collection at Statistics Canada
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Validation at Statistics Sweden
A new fantastic source for updating the Statistical Business Register
VAT data in Business Register and Business Statistics
Issues in Administrative Data
Improving Sampling and Processing of UK Business Register Units Mark Pont, Gary Brown & John Perry Office for National Statistics, UK.
The compilation of turnover and wage and salary indices
PRODCOM SURVEY IN THE UNITED KINGDOM
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS
Use of monthly tax return data
User Views on Quality Reporting
Education and Training Statistics Working Group – 2-3 June 2016
Jeroen Pannekoek, Sander Scholtus and Mark van der Loo
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS
SMP Slovakia: Main recommendations
Assurance of Quality of German National Accounts by Management
Hanna Gembarzewska, Monika Grabani
Managing Supply-Use Benchmarking
LAMAS Working Group June 2018
Presentation transcript:

Improvements in editing methods and processes for use of Value Added Tax data in UK National Accounts Martina Portanti and Robert Breton Office for National Statistics, UK

Overview ONS use of VAT turnover data Data challenges Current methods micro editing and processing Recommended methods improvements System Timeline

ONS use of VAT data Historically VAT used as a source for the Business register 2013 ESSNet on Administrative data provided opportunity to explore methods to use VAT turnover data for producing business statistics Ad-hoc system built to process data for further research/exploration Since then, attempt to re-use system for statistical production purposes Limitations in both methods and system

Data challenges Data needs to be calendarised to a monthly series Data needs to be linked and apportioned to Reporting Unit level to ensure correct industrial classification and identify business employment sizeband 1. Businesses can file VAT returns on any of 16 patterns (monthly; 3 x quarterly; 12 x annual). 2. VAT returns are available at VAT reporter level while ONS business statistics are compiled from information collected at reporting unit (RU) level.

Current process Three editing stages Take on After apportionment 1 Three editing stages Take on After apportionment After calendarisation Structural validation at stage 1 2 Content validation at stage 2 and 3 Thousands pounds rule Quarterly pattern rule Suspicious turnover rule 3

Issues Position of editing modules Validation methods Cleaning methods Macro-editing stage Position of editing module. Only structural validation takes place at the VAT unit level; recommendation to include a content validation module earlier in the process Validation method – less problematic as current rules set to pick up systematic errors where the error mechanism is known; the suspicious turnover rule could be reviewed as necessary to ensure that is still the most efficient way of identifying large errors Over-reliance upon automatic cleaning. The suspicious turnover rule, in particular, is likely to cause overediting. A business that fail the suspicious turnover rule will have its current return discard and replaced by an imputed value using ratio imputation. The macro-editing stage (not captured in the diagram above) is quite resource intense. While this can be expected because of the new source, inefficiencies in the micro-editing stage also likely drive the amount of resources required at macro-level

Target process Key changes: 1 Key changes: Content validation module at VAT unit level Selective editing stage No editing after calendarisation 2 2 3

Recommended improvements/1 1. Content validation module at VAT unit level “The early, the better” £000 rule and quarterly pattern rule Error detection Error correction may still be required at RU level

Recommended improvements/2 2. Selective editing stage Clerical investigation of influential cases earlier in the process Accept Impute Manual construction Decrease macro-editing effort Identify genuine business changes Improve knowledge of new data source Improve micro-editing methods

Recommended improvements/3 3. No editing after calendarisation Ensure impact of editing vs calendarisation can be easily separated Allow more sophisticated calendarisation methods to be applied Streamline data processing system

Current system SAS-macro based system utilising 20 main macros Around 200 files created to process each data period 1 day processing Lack of interactive features to query and clean data Lack of flexibility to test different methods/sequencing

Ongoing system development Distributing computing technology SPARK Jupyter notebook Oozie Processing time down to 5 minutes to process 4 years of data Agile approach with collaborative work across methodologists, data scientists and IT developers Spark (coding language to take advantage of distributing power to split processing across nodes –py spark Oozie orchestration tool to put together sparkjobs in a workflow Jupyter notebook development tool to have code and graphs in the same place; interactive tool to quickly develop, explore data and communicate with business areas Then code can be lifted and put into a spark module for production processes

Jupyter notebook/1

Jupyter notebook/2

Jupyter notebook/3

Oozie workflow

Timeline July 2017 – delivery of a VAT turnover processing system July 2017-December 2017 – National Accounts testing within NA systems End of 2017 – publication of Q3 quarterly national accounts estimates including VAT data for the first time

Thank you Contact details Martina Portanti martina.portanti@ons.gov.uk Head of Processing, Editing and Imputation (Business surveys) Survey Methodology and Statistical Computing division Robert Breton robert.breton@ons.gov.uk Head of Data Sources Data Science Economic Statistics Data Sources