The Use of Administrative Sources for Statistical Purposes Common Problems and Solutions.

Slides:



Advertisements
Similar presentations
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
Advertisements

Paul Smith Office for National Statistics
Goods for Processing / Toll Processing … a pragmatic approach What is toll processing? Why is toll processing used? What is the problem? How has ONS dealt.
United Nations Statistics Division/DESA
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Session 3: Summary of Business Profiling Wiesbaden Group Paris 24 – 27 November 2008.
United Nations Statistics Division Principles and concepts of classifications.
Chapter 3 – Evaluation of Performance
Regional Workshop for African Countries on Compilation of Basic Economic Statistics Pretoria, July 2007 Administrative Data and their Use in Economic.
Results and next steps from the ESSnet Admin Data Alison Pritchard Business Outputs & Developments, Office for National Statistics, UK 4 December 2012.
The Dutch Censuses of 1960, 1971 and 2001 Producing public use files in the IPUMS project Wijnand Advokaat Statistics Netherlands Division Social and Spatial.
Changes in the Structure of the Business Population Business Demography Jillian Delaney 29 September 2011.
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
The Use of Administrative Sources for Economic Statistics An Overview Steven Vale Office for National Statistics UK.
ISO 9001: Supporting Statistical Systems Prices Division, Office for National Statistics Derek Bird May2014.
Quality assuring the UK business register Andrew Allen.
United Nations Statistics Division Recoding the business register to ISIC Rev.4.
© 2007 Hay Group. All rights reserved. Salary Survey Report January 30, 2007 State of Kansas.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Statistik.atSeite 1 Implementation of NACE Rev. 2 at the national level Norbert Rainer.
1 BUSINESS REGISTER CBS-ISRAEL. 2 LEGAL FRAME WORK in 1997 two inter-governmental committees issued: 1. LEGAL ASPECTS 2. PRACTICAL & TECHNICAL ASPECTS.
Carmela Pascucci – Istat - Italy Meeting of the Working Party on International Trade in Goods and Trade in Services Statistics (WPTGS) Linking business.
Electronic reporting in Poland 27th Voorburg Group Meeting Warsaw, Poland October 1st to October 5th, 2012 Central Statistical Office of Poland.
The Use of Administrative Sources for Statistical Purposes Steven Vale United Nations Economic Commission for Europe.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Classification of Holding Companies and Head Offices according to NACE Rev.2.
Session 1 – Use of profiling for public administration Linda Scott Head of Business Register Operations UK.
Improvements in stratification in the UK's Office for National Statistics Pete Brodie, Martina Portanti & Emily Carless UK Office for National Statistics.
A Checklist for Business Profiling John Perry, UK Office for National Statistics.
1 Selected Issues with Implementation of 2008 SNA (continued) Training Workshop on System of National Accounts for ECO Member Countries October 2012,
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
IMPUTING MISSING ADMINISTRATIVE DATA FOR SHORT-TERM ENTERPRISE STATISTICS Pieter Vlag – Statistics Netherlands Joint work with DESTATIS, Statistics Estonia,
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
United Nations Statistics Division/DESA International Recommendations for the Index of Industrial Production (IIP)
Frameworks for the Access and Use of Administrative Data, With the Example of Current Practice in the UK Steven Vale Office for National Statistics UK.
Statistics New Zealand’s End-to-End Metadata Life-Cycle ”Creating a New Business Model for a National Statistical Office if the 21 st Century” Gary Dunnet.
Valentina Stoevska ILO Department of Statistics Workshop on MDG Data Reconciliation: Employment Indicators, Beirut, July
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
12 October 2009 EU-OECD Workshop Introducing NACE rev 2 in EU Short-term business Statistics Brian Newson Head of STS unit Eurostat.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
New sources – administrative registers Genovefa RUŽIĆ.
Compilation of Distributive Trade Statistics in African Countries Workshop for African countries on the implementation of International Recommendations.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
1 QUALITY IMPROVEMENTS IN CROATIAN BUSINESS REGISTER AND IMPLICATIONS OF REVISION OF NACE Prepared by Zrinka Pavlović and Dubravka Celić Geneva,
Preparing for A Strategy for Change Based on Previous Experiences Steve Vale Office for National Statistics, UK.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
Business Registers Recommendations Manual Presentation by Eurostat
Data Dissemination Conditions in the European Statistical System (ESS) UNECE, Warschau May 2009.
13-Jul-07 State of the art of the ISCO-08 implementation.
Workshop on MDG, Bangkok, Jan.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector National and global data.
How official statistics is produced Alan Vask
ESDS Seminar Apr The EU Labour Force Survey Arturo de la Fuente, Estat-F2 “Labour Market Statistics”
Administrative Data and Official Statistics Administrative Data and Official Statistics Principles and good practices Quality in Statistics: Administrative.
M O N T E N E G R O Negotiating Team for Accession of Montenegro to the European Union Working Group for Chapter 18 – Statistics Bilateral screening: Chapter.
Statistical Business Register Enterprise Groups in Latvia Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical.
Implementation of Quality indicators for administrative data
Sharne Bailey, Tony Byrne UK, Office for National Statistics
Quality Aspects and Approaches in Business Statistics
Business Register Quality Improvement
A new fantastic source for updating the Statistical Business Register
6.1 Quality improvement Regional Course on
Issues in Administrative Data
Administrative Data and their Use in Economic Statistics
Albania 2021 Population and Housing Census - Plans
Wiesbaden, 24 October, 2007 Svetlana Shutova Statistics Estonia
Mapping Data Production Processes to the GSBPM
Use of administrative data for statistical purposes
Preparing for A Strategy for Change Based on Previous Experiences Steve Vale Office for National Statistics, UK.
Presentation transcript:

The Use of Administrative Sources for Statistical Purposes Common Problems and Solutions

Public Opinion The level of public concern about government departments sharing data varies from country to country There is usually some suspicion of the motives for data sharing Sometimes public opinion favours data sharing

Adopt and publish a code of practice following international standards Clearly stated limits and rules may help reduce concerns The principle of the “one-way flow” of sensitive data must be understood by all Solutions

Publish cost-benefit analyses of the use of different sources It may be possible to claim that data are more secure – No questionnaires sent by post – Fewer clerical staff, so fewer people with access to data

Public Profile Direct contact with the public via surveys helps raise the profile of the statistical office The use of administrative data can reduce contact with the public and awareness of the work of the statistical office

Effective ‘marketing’ of the statistical office and data outputs Greater involvement with education institutions, business groups, and other target customers Solutions

Units Administrative units may be different to statistical units: – Job / person – Tax unit / enterprise – Dwelling / household They may need to be converted to meet statistical requirements

Group Exercise Statistical and Administrative Units – How Many Enterprises?

Enterprise Definition “the smallest combination of legal units that is an organisational unit producing goods or services, which benefits from a certain degree of autonomy in decision- making.... An enterprise may be a sole legal unit.” Source: EU Regulation 696/93 on statistical units

Examples Taken from “The Impact of Diverging Interpretations of the Enterprise Concept” - a study prepared for Eurostat by Statistics Netherlands with input from Denmark and UK

Example 1 Two legal units in an enterprise group have different 4 digit NACE codes; both are selling mainly to third parties outside the group. They share buildings, management, purchases and employees.

Answers NL: Combine into one enterprise –Intensity of shared production factors UK: Combine into one enterprise –Intensity of shared production factors DK: Two separate enterprises –Both sell more than 50% outside the group

Four Legal Units : A and B have different activities, no combined purchases, but share buildings. C and D share buildings, employees, and purchases. All four present themselves as one firm. Example 2

Answers NL/DK: A and B are separate enterprises. Combine C and D into one enterprise –because A and B operate on market terms, whilst C and D share production factors UK: All four in one enterprise –because they present themselves as one firm

Three legal units: All produce mainly for external customers, they share management and purchases, and represent themselves as one firm. A and B share a building. B and C have the same activity, share employees and capital goods and can not supply separate data. Example 3

Answers NL: Combine into one enterprise –All share management and purchases, and represent themselves as one firm UK/DK: Combine B and C into one enterprise, A is a separate enterprise –Because B and C are horizontally integrated, and data are only available for these two together

Twelve legal units form an enterprise group. Only one is active, the others have no employees. Example 4

Answers NL: One enterprise which only consists of the active unit –Because units which are not active are not part of an enterprise UK: One enterprise which consists of all units –Because there is no point having separate enterprises for non-active units DK: Each unit is a separate enterprise –There are no strong ties between the units

Solutions Automatic rules for simple cases –These must be clear and consistent Statistical “adjustments” –E.g. the statistical unit is persons. The administrative unit is jobs. We know from a survey that working people have, on average, 1.15 jobs. This adjustment factor can therefore be used to estimate persons in employment from jobs Profiling

Profiling Definition Profiling is a method to analyse the legal, operational and accounting structure of an enterprise group at national and world level, in order to establish the statistical units within that group, their links, and the most efficient structures for the collection of statistical data. Source: Eurostat Business Registers Recommendations Manual, Chapter 19

Profiling Gives a better understanding of complex unit structures It is expensive and time consuming It needs trained staff It is a compromise based on a trade- off between quality, quantity and the resources available

Quality QuantityResources

Business Profiling in the UK 14 Staff Approx cases per year – Including 100 public sector Mix of desk and visit profiling – Approx 200 visits per year Should profilers also collect data from key businesses?

Definitions of Variables Administrative data are collected according to administrative concepts and definitions Administrative and statistical priorities are often different, so definitions are often different

Unemployment Statistical definition (ILO) – Out of work – Available for work – Actively seeking work Administrative definitions are often based on those claiming unemployment benefits

Solutions Know and document the differences and their impact Use other variables to derive or estimate the impact of the difference Statistical adjustments during data processing

Classifications Two scenarios: 1. Same classification system 2. Different classification systems

Same Classification Used for different purposes May not be a priority variable for the administrative source Different classification rules Different emphasis, e.g. specific activity rather than main activity

Solutions Understand how classification data are collected and what they are used for Provide coding expertise, tools and training to administrative data suppliers

Different Classifications (or different versions of the same classification) Not always a 1 to 1 correlation between codes Tools are needed to convert codes from one classification to another

Solutions (1) Stress the advantages of using a common classification Offer expertise to help re-classify administrative sources Give early notice of classification changes and help implement them across government

Solutions (2) Use text descriptions to re-code administrative data Use probabilistic conversion matrices to convert codes –This results in individual unit classifications not always being correct, but aggregate data should be OK

Example of a conversion matrix (Approx. 22% probability of correct code!)

Missing Data Impute where possible Many different imputation methods are used. Two common methods are: – Deductive Imputation – Hot-deck Imputation

Case Study Eurostat have a project to develop enterprise demography They want to estimate the impact of enterprise births Employment of new enterprises is used, but this variable is often missing or unreliable for new units

Solutions Calculate turnover per head ratios to impute missing variables Ratios based on “similar” units by classification and size Problems with outliers therefore trimming used, e.g. x% or mean of inter-quartile range

Possible Distributions

Turnover per head ratios in practice ISICTPH A business has ISIC class 45.12, turnover is 200, employment is missing. What is the imputed employment value?

Imputed employment is: 200 / 68 = 2.94 = 3

Ratios such as turnover per head are also very useful for validating updates, matching and detecting errors!

Timeliness Two Issues Data arrive too late Data relate to a different time period

Data arrive too late Data from annual tax returns are often only available several months after the end of the tax year, so they are unsuitable for monthly or quarterly statistics Lags in registering “real world” events

UK VAT Birth Lags (1)

UK VAT Birth Lags (2) 2/3 of businesses are on the register within 2 months of start-up Mean lag = 4 months due to “outliers” Median = Approx. 40 days Some pre-register - negative lags

Solutions Understand the length and impact of lags Adjust data accordingly Look for ways to reduce lags where possible

Different Time Periods Administrative reference period (e.g. Financial/tax year) may not be the same as the statistical reference period Monthly average versus point in time (e.g. employment data)

Different Time Periods

Solutions Statistical corrections or estimations using data from other reference periods Be aware of possible biases when using point in time reference dates

Using data from different sources Data from different sources may not agree This may be due to: – Different definitions, classifications, time periods,.... – Errors

Using data from different sources Group Exercise

Solutions Data validation checks Benchmarking against other sources Priority rules for updating from different sources Knowledge of source quality

Benchmarking The map compares UK business register and “Yellow Pages” coverage for England and Wales Key: Blue = More businesses on register Red = More businesses in “Yellow Pages”

Priority Rules Different sources can be given different priorities for different variables To stop a “low priority” source overwriting a “high priority” one – Use source codes – Use priority / quality markers – Store dates with variables – Load data in reverse priority order

Resistance to Change Statisticians may resist the use of administrative data because they: –Do not trust data unless they collect them themselves; –Focus on negative quality aspects; –Have an over-optimistic view of the quality of survey data; –Assume survey respondents comply with statistical norms.

Solutions Education (courses like this!) Take a wider view of all the dimensions of quality, and focus on the impact on users Determine the real relative quality of survey and administrative data Identify how cost savings can be used to improve quality and / or increase outputs

Change Management Risk of changes in: – Government / administrative policy – Thresholds – Definitions – Coverage – Systems

High-Risk Times Immediately after an election Change of minister Change of government policy Change in EU legislation and…. When you least expect it!

Solutions Manage the Risk by: –Legal provisions –Contractual agreements –Regular contact with administrative colleagues –Anticipating changes –Contingency plans

Questions: Do you have contingency plans? Have you ever needed contingency plans?

Better to be proactive beforehand than have to react after the event !

Summary There are many problems to overcome when using administrative sources Most can be reduced by effective planning and management The benefits are still often greater than the costs

Group Discussion What is the main problem with using administrative sources in your area of statistics?