ESSnet on Datawarehousing - the business register Pieter Vlag – Statistics Netherlands.

Slides:



Advertisements
Similar presentations
Annual growth rates derived from short term statistics and annual business statistics Dr. Pieter A. Vlag, Dr. K. van Bemmel Department of Business Statistics,
Advertisements

ESSnet DataWareHousing Statistical Business Register, its role in a DWH Hub Gilissen + Pieter Vlag Statistics Netherlands.
Some considerations on developing a DWH for SBS estimates Orietta Luzi – Mauro Masselli Istat - Italy march 2013.
March 2013 ESSnet DWH - Workshop IV DATA LINKING ASPECTS OF COMBINING DATA INCLUDING OPTIONS FOR VARIOUS HIERARCHIES (S-DWH CONTEXT)
Results and next steps from the ESSnet Admin Data Alison Pritchard Business Outputs & Developments, Office for National Statistics, UK 4 December 2012.
Statistics on enterprise groups – the EGR potential European Commission – Eurostat Directorate G: Global business statistics.
Pieter Vlag ESSnet DWH: business register. Outline Central role of the  statistical units,  population frame, which includes number of enterprises,
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Business register as a basis for survey frames Natalia Dorontsova 2 september 2013.
Quality assuring the UK business register Andrew Allen.
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
1 The Business Register: Introduction and Overview Ronald H. Lee
14-15 September 2011 STATISTICAL BUSINESS REGISTERS AS BACKBONE FOR BUSINESS STATISTICS Joint UNECE/OECD/Eurostat Business Registers expert meeting
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Regional GDP Workshop. Purpose of the Project October Regional GDP Workshop Regional GDP Scope Annual Current price (nominal) GDP By region.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Eurostat Results from the TEST EGR with respect to Inward and Outward FATS populations (2011)
TOWARDS INTEROPERABLE STATISTICAL BUSINESS REGISTERS Harrie van der Ven Project manager ESSnet EGR January 2014 Valencia.
Eurostat 09. Statistical registers and frames 1. Presented by Ágnes Andics, Ildikó Györki HCSO Hungarian Central Statistical Office 2.
Eurostat Q2014 – Session 35 Quality assurance for Business Statistics in Europe through the ESS.VIP.ESBRs project D. Francoz Eurostat.
The Statistical Business Register of Macao SAR Government of Macao SAR Statistics and Census Service.
Interactive session: Mapping the BPM-Notation on a SDWH layered architecture Discussion on Vision in sub groups.
ESSnet DataWareHousing Stocktaking Pieter Vlag, Viviana di Giorgi, Sonia Queresma.
Integrating administrative and survey data in the new Italian system for SBS: quality issues O. Luzi, F. Oropallo, A. Puggioni, M. Di Zio, R. Sanzo Nurnberg,
ESSnet on Consistency of Concepts and applied Methods of Business and Trade-related Statistics: Statistical Units P.Schuhl & S.Mabile, INSEE - France ________________________________________________________.
ESS NET ON C ONSISTENCY OF C ONCEPTS AND A PPLIED M ETHODS OF B USINESS AND T RADE -R ELATED S TATISTICS W ORK P ACKAGE 2: 2011 PROJECT ON TARGET POPULATION,
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
The Adoption of METIS GSBPM in Statistics Denmark.
IMPUTING MISSING ADMINISTRATIVE DATA FOR SHORT-TERM ENTERPRISE STATISTICS Pieter Vlag – Statistics Netherlands Joint work with DESTATIS, Statistics Estonia,
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Copyright 2010, The World Bank Group. All Rights Reserved. Business registration, part 2 Administrative and statistical business registers 1 Business statistics.
May 2012 ESSnet DWH - Workshop III BUSINESS REGISTER IN STATISTICS LITHUANIA Jurga Rukšėnaitė Chief specialist.
ESS-net DWH ESSnet DWH - Metadata in the S-DWH Harry Goossens – Statistics Netherlands Head Data Service Centre / ESSnet Coordinator
Explaining the statistical data warehouse (S-DWH)
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Compilation of Distributive Trade Statistics in African Countries Workshop for African countries on the implementation of International Recommendations.
ESSnet ON MICRO DATA LINKING AND DATA WAREHOUSING IN STATISTICAL PRODUCTION RESULTS OF STOCKTAKING, CONCLUSIONS OF FIRST YEAR * Pieter Vlag Senior Statistical.
© Federal Statistical Office Germany, Division IB, Institute for Research and Development in Federal Statistics Sheet 1 Surveys, administrative data or.
Statistics Portugal Methodology and Information Systems Department Information Infrastructure Unit Isabel Farinha and Jorge Magalhães « 21th Meeting of.
The use of VAT for monthly and quarterly turnover estimates A case study between NL and UK Pieter Vlag, Henk van de Velden Nino Mushkudiani all at: Statistics.
Work packages SGA II ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service.
Data sources of the EuroGroups Register Presentation by Eurostat
STS Compilation with Multiple Data Sources Anu Peltola Economic Statistics Section, UNECE UNECE Workshop on Short-Term Statistics (STS) and Seasonal Adjustment.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service Centre.
Beijing, October 19, th International Roundtable on Business Survey Frames Co-ordinating role of the Business Register in Economic Statistics Results.
Trade & Business Statistics Geert Bruinooge Statistics Netherlands.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production.
14-Sept-11 The EGR version 2: an improved way of sharing information on multinational enterprise groups.
Profiling procedure Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical Bureau of Latvia May 19-23, 2014.
Use of the Statistics New Zealand Business Register for the agriculture industry and the not for profit sector Geoff Mead
Kiev 2 nd – 5 th October 2012 Mrs Vibeke Skov Møller The Danish Statistical Business Register: Units, variables and extracts.
Statistical Business Register Enterprise Groups in Latvia Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
ESSnet workshop Köln Pieter Vlag Some discussion points
Guidelines on the use of estimation methods for the integration of administrative sources DIME/ITDG meeting 2018/02/22.
Sample surveys versus business register evaluations:
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
1 What is EGR? ESTP course on EGR 6-7 September 2016.
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
6.1 Quality improvement Regional Course on
Italian situation in the following areas:
22nd October 2007 Ivo Beuken Statistics Netherlands
Session 7 – Eurostat 2017 SBR User Survey
ESSnet DataWareHousing
Mapping Data Production Processes to the GSBPM
ESSnet on Consistency Workshop
Wiesbaden Group Neuchatel 24 – 27 September 2018
ESTP course on EuroGroups Register
Presentation transcript:

ESSnet on Datawarehousing - the business register Pieter Vlag – Statistics Netherlands

1 Outline of the presentation DataWareHouse and importance population frame relationship population frame - business register -(default) target population, statistical units other crucial datasources: “backbones” -turnover + employment datalinking : the statistical unit base conflicting information between datasources - when correcting in statistical DWH - when correcting in backbones - when feedback to business register ESSnet DWH – business register

2 Definition of a statistical Datawarehouse (according to the FPA) ESSnet DWH – business register The broad definition of a data warehouse to be used in this ESSnet is therefore: ‘A common conceptual model for managing all available data of interest, enabling the NSI to (re)use this data to create new data/new outputs, to produce the necessary information and perform reporting and analysis, regardless of the data’s source.’

3 A DataWarehouse: the general idea ESSnet DWH – business register As staging area is “core business” for NSIs, term statistical DWH is used for staging area + WareHouse

4 The statistical DataWarehouse: architecture and layers ESSnet DWH – business register

5 The statistical DataWareHouse: processing steps the GSBPM model ESSnet DWH – business register process input DWH / int. data 5.1a: link data 5.1b: integrate data see presentation Fursova Calculate aggregates

Titel van de presentatie 6 datasource 1datasource 2datasource 3 Output 1Output 2 Output 3 Linking Processing (integration layer) Integrated data p.analyse 4 GSBPM -step

7 A datawarehouse without population frame ESSnet DWH – business register Datasource I: Admin data Datasource I: Survey 1 Datasource I: Survey 2 Datasource I: BIG DATA different sources cover different enterprises -> information about ? timing of availability sources differs -> when complete desc. available ?

8 A Datawarehouse with a population frame ESSnet DWH – business register Population. Datasource 1: admin data 1 Datasource 2: BIG DATA Datasource 3: survey 1 Datasource 4: survey 2 ADVANTAGE: the coverage of DWH is known (e.g. which enterprises are included in a DWH)

9 Units and target population The population should be known for the datawarehouse;e.g. “about which enterprises info” its preparation phase ;e.g. when linking data sources Challenges are: units may differ between the data sources - decision: which unit used for linking what is the reference population -decision: how is the default target population defined ESSnet DWH – business register

10 Proposals Only statistical unit (=enterprise) is used -for data-linking -in processing phase of the statistical - DWH -justification: most obvious, ESSnet on Consistency, maintenance Default target population : all enterprises with economic activity in reference period (e.g. year) -justification: SBS-regulation -widest definition of enterprises from which flexible outputs for subpopulations can be derived -term default is used: as subpopulations do have a target population, too ESSnet DWH – business register

Titel van de presentatie 11 4 GSBPM -step Linked data Integrated data Processing on stat. unit + default target population only flexible datasources with different populations and units Weighting to flexible pop. flexible output for different populations, and units

12 Population frame and the Business Register Determination of the default target population in SDWH in 2 steps: the population frame, i.e. a list of enterprises with a certain kind of activity during a period. confirmation which enterprises of the list really performed economic activities during a period The business register provides information for the population frame. Therefore, the statistical Business Register is an indirect datasource for the statistical-DWH ESSnet DWH – business register

13 Information needed from stat.business register Recommended information for the population frame : the frame reference year the statistical enterprises unit, including national ID and EGR ID the name and address of the enterprise the national identification number (ID) of the enterprise the date in population (mm/yr) the date out of population (mm/yr) the NACE-code the institutional sector code a size class ESSnet DWH – business register

14 Other backbones ESSnet AdminData: VAT and social security admin almost complete for quarter and annual can be used for high-quality estimates for turnover + employment respectively. ESSnet DWH: VAT and social security data are crucial to confirm the activity status of enterprises implictly to determine the default target population to integrate data suitable for flexible outputs measurement errors are reduced of sample survey (or data about subpopulation) if weighting to pop.numbers + VAT-turnover + employment Proposal: to include these admin data as backbones in a stat-DWH ESSnet DWH – business register

15 Source layer Int. + Analyses layer Access layer Integration layer SBR Pop-frame data 1data 2 VATempl. GSBPM 5.1: link & integrate GSBPM : “process” GSBPM : calculate aggregates Check processing GSBPM 6: analyse / “DATAWAREHOUSE” GSBPM 7-9: disseminate Backbones in a statistical-DWH Backbones are crucial for data-linking and data-integration; -> need to be checked/cleaned by source in the source layer

16 Source layer Int. + Analyses layerIntegration layer SBR Pop-frame data 1data 2 VAT empl. GSBPM 5.1: link & integrate GSBPM : “process” GSBPM : calculate aggregates Check processing GSBPM 6: analyse GSBPM 7-9: disseminate Observed: admin data incorporated in BR When choosing this option, - important part of linking process outside the S-DWH - unless S-DWH integral part of S-DWH (maintenance ?)

17 Determining default target population ESSnet DWH – business register If statistical-DWH covers annual statistics only relatively straightforward - derive population frame from business register at the end of reference year t -determine active or non-active as soon as VAT and/or employment data become available If STS included in statistical-DWH more complicated: -updating necessary !

18 Updating population ESSnet DWH – business register

19 SBR Pop-frame data 1 data 2 VATempl. GSBPM 5.1: link & integrate GSBPM : “process” GSBPM : calculate aggregates Check processing “DATAWAREHOUSE” The largest enterprises L.E. output 1 output 2 output 3 If a team within a NSI produces consistent microdata for largest enterprises -> consider this source as backbone

20 Units: ideal situation ESSnet DWH – business register enterprise has a unique ID enterprise group has a unique ID enterprise and enterprise group correspond with statistical definitions are used in all data sources In practice more complex situations do exist (especially when using more admin data)

Titel van de presentatie 21 4 GSBPM -step Linked data Integrated data processing on one unit + one population only flexible datasources with different population and units Flexible output for different populations, and units Key question: how to manage these different in- and output units and their relationships to the statistical unit

ESSnet DWH – business register 22 ENTERPRISE (=statistical unit) ENTERPRISE GROUP Legal unit “Accountìng” unit “VAT-unit” other units “other tax” units enterprise Enterprise Local unit LKAU KAU Enterprise group INPUTIN S-DWH processing OUTPUT

23 The unit base ESSnet DWH – business register Some remarks: Complexity of unit base depends on - scope of statistical-DWH -national legislation (practices) with respect to enterprise units Unit base closely related to Business Register. Main motivation to place this base outside the Business registers - more flexible in case of new in- and outputs - more transparent in case of linking errors

24 SBR Pop-frame VATempl. GSBPM 5.1: link & integrate GSBPM : “process” GSBPM : calculate aggregates Check processing “DATAWAREHOUSE” Position of Business Register in stat -DWH L.E. output 1 output 2 output 3 survey units tax BIG DATA other

25 Feedback to Business Register ESSnet DWH – business register In case of conflicting information between datasources and conclusion is influential error in backbones (and indirectly SBR) When incorporating corrections in statistical DWH ? When incorporating corrections in backbones ? When incorporating corrections in SBR?

26 SBR Pop-frame survey other units VATempl. GSBPM 5.1: link & integrate GSBPM : “process” GSBPM : calculate aggregates Check processing L.E. Correction of information In SDWH: corrections at 5.6 In backbones themselves: timing most important revisions In SBR: after end of year (for consistency) – exception major impact “DATAWAREHOUSE” output 1 output 2 output 3

27 Conclusions ESSnet DWH – business register Requirements for statistical-DWH Population well defined Use of one unit in processing Backbones desired for populations, VAT-turnover, admin data employment, large enterprises Business Register is indirect input for statistical DWH population frame, unit base, survey Timing of corrections errors (backbone information) in DWH: before weighting in backbone: when revising in Business Register: end of year