Integrating administrative and survey data in the new Italian system for SBS: quality issues O. Luzi, F. Oropallo, A. Puggioni, M. Di Zio, R. Sanzo Nurnberg,

Slides:



Advertisements
Similar presentations
A Statistical Architecture for Economic Statistics Ron McKenzie ICES III.
Advertisements

GDP by Income Approach and Accounts of Household Sector For Qatar Experience Prepared by : Aisha Al-Mansoori Statistical Researcher Population & Social.
Using Business Taxation Data as Auxiliary Variables and as Substitution Variables in the Australian Bureau of Statistics Frank Yu, Robert Clark and Gabriele.
Some considerations on developing a DWH for SBS estimates Orietta Luzi – Mauro Masselli Istat - Italy march 2013.
GENEralised software for Sampling Estimates and Errors in Surveys (GENESEES V. 3.0) Piero Demetrio Falorsi - Salvatore Filiberti Istat Structural Business.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
Results and next steps from the ESSnet Admin Data Alison Pritchard Business Outputs & Developments, Office for National Statistics, UK 4 December 2012.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
1 Overview of Manufacturing Statistics in Africa UNECA Andry Andriantseheno Workshop on manufacturing Statistics Lusaka 4-7 May 2009.
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
Regional GDP Workshop. Purpose of the Project October Regional GDP Workshop Regional GDP Scope Annual Current price (nominal) GDP By region.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Korean SME Characteristics & Proposed Developments for Data Linking Presenter : Sunghee Han.
Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)
Carmela Pascucci – Istat - Italy Meeting of the Working Party on International Trade in Goods and Trade in Services Statistics (WPTGS) Linking business.
Quality in the Swedish Business Database The Quality Survey 2004 Round Table Beijing 2004 Swedish presentation, session 5, 18 th Round Table, Beijing –
The Canadian Integrated Approach to Economic Surveys Marie Brodeur, Peter Koumanakos, Jean Leduc, Éric Rancourt, Karen Wilson Statistics Canada International.
1 NATIONAL STATISTICAL COORDINATION BOARD International Workshop on Energy Statistics, Beijing, China, September 2012 Vivian R. Ilarina, Philippines.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
12th Meeting of the Group of Experts on Business Registers
1 Presentation to OG6 Canberra, Australia May 2011 Statistical Uses of Administrative Data in Canada.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
1 11 th Joint UNECE/Eurostat/OECD Seminar on Business Registers Valentín Llorente García (INE - Spain) Session 3: Business Register as a source for further.
Copyright 2010, The World Bank Group. All Rights Reserved. Business statistics surveys 1. Design 1 Business statistics and registers.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
GENERAL GOVENMENT SECTOR OF THE REPUBLIC OF ARMENIA Workshop on implementation of the 2008 SNA in EECCA countries and linkages with BPM6 and GFSM 2014.
Handling inconsistencies in integrated business data Bonn September 2006 * Jeffrey Hoogland Ilona Verburg.
1 1 Workshop on Improving Statistics on SME's and Entrepreneurship, Paris, September 2003 Draft Conclusions and Recommendations.
The new multiple-source system for Italian Structural Business Statistics based on administrative and survey data Orietta Luzi, Ugo Guarnera, Paolo Righi.
ESSnet on Datawarehousing - the business register Pieter Vlag – Statistics Netherlands.
Copyright 2010, The World Bank Group. All Rights Reserved. 1 GOVERNMENT FINANCE STATISTICS INTRODUCTION TO GOVERNMENT FINANCE STATISTICS Part 2 This lecture.
Cristina Casciano, Viviana De Giorgi, Filippo Oropallo Istat Division for Structural Business Statistics, Agriculture, Foreign Trade and Consumer Prices.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
ESSnet AdminData Methods of estimation for business statistics variables that cannot be obtained from administrative data sources (WP3) Duncan Elliott.
-1- After Crisis: Policy Reforms for Effective Management of State Owned Enterprises in Latvia Mr. Daniels Pavļuts Minister of Economics 18 January 2012,
Data warehouse approach to statistical data management and the prospect of its use for scanner data Antonio Laureti Palma Workshop scanner.
1 For a Population Statistical Register Characteristics and Potentials for the Official Statistics Central department for administrative data and archives.
Performance Indicators Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May.
1 M Gouws – Stats SA Business Register (UNSD regional workshop) Presentation to UNSD regional workshop for African countries – Basic economic statistics.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
1 Overview of Economic Statistics in Africa UNECA Andry Andriantseheno Regional Workshop on Basic Economic Statistics Addis-Ababa October 2007.
Fulvia Cerroni - Viviana De Giorgi (Istat) Session 11: Use of administrative data in the statistical system 15 October 2008 The Tax Authority Source as.
The use of administrative data for the production of official economic statistics in Brazil - current situation and challenges for the future Shanghai,
COMBINING SURVEY AND ADMINISTRATIVE DATA IN THE ITALIAN EU-SILC EXPERIENCE: POSITIVE AND CRITICAL ASPECTS National Institute of Statistics - Italy Claudio.
Session topic (i) – Editing Administrative and Census data Discussants Orietta Luzi and Heather Wagstaff UNECE Worksession on Statistical Data Editing.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
Implementation of Quality indicators for administrative data
Compilation and Dissemination of Distributive Trade Statistics
Lecture 08.
“The infrastructure for the SBS-Frame production in ISTAT”
Guidelines on Integrated Economic Statistics
Prague EU-SILC Best Practice Workshop, 14th and 15th September 2017
ADMINISTRATIVE DATA IN ANNUAL BUSINESS STATISTICS OF LATVIA
Chapter 8 Profitability
Italian situation in the following areas:
Guidelines on Integrated Economic Statistics
DEVELOPMENT OF IMPUTATION MODEL FOR SMALL ENTERPRISES
CARICOM TRADE IN SERVICES STATISTICS (CTIS)PROJECT June-December 2016
Guidelines on Integrated Economic Statistics
IMPROVEMENT OF SBS DATA QUALITY
RENATA TUMĖNIENĖ Chief specialist Enterprises statistics division
Parallel Session: BR maintenance Quality in maintenance of a BR:
L 9 - Chapter 8 Profitability
Presentation transcript:

Integrating administrative and survey data in the new Italian system for SBS: quality issues O. Luzi, F. Oropallo, A. Puggioni, M. Di Zio, R. Sanzo Nurnberg, 9-11 September European Establishment Statistics Workshop

The new Italian system for SBS The sources Quality issues Future activities Outline 2

New production process of a statistical database (frame) containing complete and coherent unit-level information to estimate key annual SBS (production value, turnover, intermediate costs, value added, wages, labor cost, profits) based on the integrated use of administrative and fiscal data, as primary source of information,… ….complemented by direct survey data (non covered sub-populations, or variables which are not directly available from external sources) Release: October 2013 The new Italian system for SBS 2

The overall strategy 7

5 FS: Financial Statements from Chambers of Commerce (about 700,000 units). SS: Sector Studies survey (includes each year about 3.5 million enterprises) Unico: Tax data, from the Ministry of Economy and Finance, based on a unified model of tax declarations by legal form Social Security data, from the National Security Institute (INPS), which includes firm level data and individual (employees) data on wages and labor cost. The administrative and fiscal data sources The Business Register (BR): about 4.5 million enterprises. Central role as reference list of the SBS target population The Business Surveys (BS) SME - Sample survey on Small/Medium Enterprises (1-99 persons employed) LE - Total survey on Large Enterprises (100 and more persons employed ) The statistical sources

The micro-data matrix X* 7

6 Sources’ coverage Administrative SourceUnits (non overlapping)% Financial statements718, Fiscal Purpose Survey2,931, Tax Return Data585, No source208, Total 4,443,

(also based on recommendations from Essnet admin data and BLUE-ETS) Coverage completeness of the source in terms of target population Completeness degree to which the source includes information to estimate the target statistics Consistency how much admin definitions of variables/units are close to the SBS ones Accuracy: statistical adequacy of admin items for estimating target parameters Punctuality: relates to the delivery of source data along time Integrability: extent to which the source is capable of being integrated Assessing sources’ quality and usability 8

Let X* be the target SBS and A i the related information from the source i For each enterprise, some of the X* variables may exist in more than one sources in different combinations, according to the dimension, the social security rules, the fiscal status etc. The variables Ai in source i may coincide or may approximate the corresponding X* Objective of the analyses Establish a hierarchy between the sources Identify the possible deterministic “corrections” from Ai to X* Identify the appropriate multivariate models so that x * ij = f(a ij …..) Assessing sources’ quality and usability: consistency and accuracy 8

(i)Assessing consistency: closeness between the A i definitions in source i with respect to the X* definitions →variables harmonization (ii) Assessing accuracy: analysis (on overlapping admin and survey data) of A i and X* distributions to assess possible “biases” at micro-data and distribution level at estimation level and selective editing →further harmonization; data modeling Assessing sources’ quality and usability 8

Assessing accuracy at micro-data and distribution level Example: quality Indicators for the main SBS variables (survey vs SS) 8 Main economic variablesKS* ± 5% (%units) ± 5% (%value)%diff. value diff. (000€) median diff. (000€) IQR diff. (000€) CV diff. Current earnings excl. VAT, inclusive of indirect taxes Increase in fixed assets for internal works Change in work in progress Changes in inventories (finished goods, raw materials and goods for resale) Other revenues and income (non-financial, non-overtime) Purchases of goods (a) Purchases of services (b) Purchases of goods and services (a+b) Tenure Leasehold Other operating expenses Cost of labor Depreciation and amortization Value Added EBITDA Net Operating Margin

Assessing accuracy at estimation level Example: Value added: source and sampling effects 8 %diff Estimates (Yframe-YSME)/Yframe Source effect Sampling effect

Selective editing allows for the identification of influential units at a given domain level based on pre-defined influence criteria (score functions) Identification of possible lacks Domains characterized by the highest amount of influential data lacks in the harmonization process Legal/administrative constraints enterprises’ behavior related to the specific source objectives →systematic discrepancies in sources’ information Other sources of discrepancies Measurement errors Data variability →random discrepancies in sources’ information Assessing accuracy: selective editing 8

Absolute and % number of units with influential discrepancies between SS and SME data which need correction, by source. k=0.05 Assessing accuracy: identify “anomalies” in administrative data 8 Variable Source RevenuesPurchasesValue Added FS n. units44273 %units SS n. units171, %units

 Improve the selective editing approach  Improve multivariate robust data modeling for  Imputation of non covered units  Modeling of either non covered or not consistent source’s variables  Further analyses of data and economic indicators to improve accuracy for specific SME sub-populations  Improving relationships with data providers for ensuring sources punctuality and quality Process documentation for future standardization Current activities 8

Reduction of statistical burden and costs (medium term) Data available at an extremely refined level of detail Higher accuracy and coherence within and across statistical domains Some drawbacks Stability over time – data can be changed for internal decision of the owner administration Initial costs for assessing sources’ quality and for ensuring sources’ usability Expected benefits 3