Session D12: Multisource statistics New sources: new modelling approaches Author: Gras Fabrice, Eurostat, unit B1, Methodology and corporate architecture.

Slides:



Advertisements
Similar presentations
Paul Smith Office for National Statistics
Advertisements

Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
1 Editing Administrative Data and Combined Data Sources Introduction.
Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic The use of administrative data sources (experience and challenges)
TYPOLOGY OF PRODUCTS IN OFFICIAL STATISTICS Thomas Burg Marcus Hudec.
1 The system aspect of statistical quality Q2014 european conference on quality in official statistics Special session: Consistency of Concepts and Applied.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
Probabilistic Mechanism Analysis. Outline Uncertainty in mechanisms Why consider uncertainty Basics of uncertainty Probabilistic mechanism analysis Examples.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
Assessing Quality for Integration Based Data M. Denk, W. Grossmann Institute for Scientific Computing.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
for statistics based on multiple sources
Oslo, 24–26 September 2012 Work Session on Statistical Data Editing APPLICATION OF THE DEVELOPED SAS MACRO FOR EDITING AND IMPUTATION AT.
Statistik.atSeite 1 Norbert Rainer Quality Reporting and Quality Indicators for Statistical Business Registers European Conference on Quality in Official.
Addressing the challenge of producing European comparable data using administrative data Mihaela AGAFIŢEI Sorina VÂJU UNECE Seminar on Statistical Data.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
Eurostat – Unit D5 Key indicators for European policies Third International Seminar on Early Warning and Business Cycle Indicators Annotated outline of.
ESSnet(s) Big Data I + II Item 8 of the agenda Joint DIME-ITDG Plenary Luxembourg, 24 Feb 2015.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
QUALITY ASSESSMENT OF THE REGISTER-BASED SLOVENIAN CENSUS 2011 Rudi Seljak, Apolonija Flander Oblak Statistical Office of the Republic of Slovenia.
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Quality at a Glance: Documentation of Quality Indicators at Statistics Austria European Conference on Quality in Official Statistics Rome, 8-11 July 2008.
Slide 1DSS Board 1-2 December 2014 Eurostat ESS.VIP ADMIN project on use of administrative data sources Agenda point 7 DSS Board 1-2 December 2014.
1 General Recommendations of the DIME Task Force on Accuracy WG on HBS, Luxembourg, 13 May 2011.
Eurostat – Unit D1 Key indicators for the European policies Euro-indicators Working Group Luxembourg, 4 th & 5 th December 2008.
Workshop on Implementing Standards for Statistical Modernisation 2016 Geneva, September 2016 Complementing the GSBPM with Quality Indicators for.
KOMUSO - ESSnet on quality of multisource statistics
Session D7: Big Data Analysis from Classification to Dimensional reduction The curse of dimensionality in official statistics Emanuele Baldacci,
Methods for Data-Integration
UNECE Data Integration Project
Implementation of Quality indicators for administrative data
Theme (v): Managing change
Theme (i): New and emerging methods
Conference of European Statistics Stakeholders October 2016
Trade performance of the EU economies: Inter-country input-output tables (IOTs) as a necessary tool Conference of European Statistics Stakeholders, Budapest.
Towards more flexibility in responding to users’ needs
Innovation in statistical processes and products: a European view
Henri Luomaranta, Statistics Finland
Estimation methods for the integration of administrative sources
Estimation methods for the integration of administrative sources
Guidelines on the use of estimation methods for the integration of administrative sources DIME/ITDG meeting 2018/02/22.
DIME Plenary Thomas Burg –Statistics Austria
Generic Statistical Business Process Model (GSBPM)
KOMUSO Information for the Big Data society in official statistics
Sampling Distribution
Sampling Distribution
Progress of the ESS.VIP ADMIN Special focus on the ESSnet on quality of multiple sources statistics. DIME/ITDG SG, Fabrice Gras, unit B1.
Guidelines on the use of estimation methods for the integration of administrative sources WG Methodology 2018/05/03.
6.1 Quality improvement Regional Course on
Item 8 Cost assessment survey of production of statistics in the ESS
3.2 ESS VIP Administrative Data
Working Group on Standards June 2018 Jean-Marc MUSEUX, Unit B1
Item 3 of the draft agenda ESS.VIP ADMIN: progress report
ITDG meeting of of October 2011
Cost accounting in the ESS
3.4 Modernisation of Social Statistics
ESS.VIP ADMIN EssNet on Quality in Multi-source Statistics, progress report 19TH WORKING GROUP ON QUALITY IN STATISTICS, 6 December 2016 Fabrice Gras,
ESS.VIP Validation Item 5.1
The new quality strategy in the modernised Italian National Statistical Institute Giovanna Brancato Giorgia Simeoni, Antonia Boggia,
A modest attempt at measuring and communicating about quality
Quality of Multisource Statistics
Metadata on quality of statistical information
2.7 Annex 3 – Quality reports
Policy Group on Statistical Cooperation October 2014, Antalya
Presentation transcript:

Session D12: Multisource statistics New sources: new modelling approaches Author: Gras Fabrice, Eurostat, unit B1, Methodology and corporate architecture Conference of European Statistics Stakeholders Budapest, 20–21 October 2016

Outline: New sources: multiple usages with a tendency towards more and more multi-sources statistics. Integration of new sources in the "official statistics" universe: increasing use of various modelling techniques in addition of surveys. Quality assessment of multi-sources statistics? Measuring uncertainty of multi-sources statistics Eurostat activities

Possible usages of new sources Direct 1. Direct Tabulation 2. Substitution and supplementation Indirect 1. Creation and update of registers 2. Editing and imputation 3. Estimation 4. Data validation/ confrontation

Integration of new sources Statistical toolbox: Editing techniques and outliers detection. Data linkage/matching methods: probabilistic or not. Modelling: calibration, state-space models, temporal disaggregation, small area models, stone model, regression techniques, etc …

Quality assessment of new sources: Input quality: Eurostat quality dimensions applicable (timeliness, relevancy, accuracy, comparability, consistency, clarity, sustainability) Process quality: total quality management Output quality: Eurostat quality dimensions Main issue accuracy measurement (bias +measurement error) Bias = comparability

Sources of uncertainty Input Sources n: Bias + Measurement error = B + e In Data linkage/matching for source n: false positive/true positive (p1n, p2n) Estimation/imputation: Y = f(X) + h (normally should remove the bias) Main issue: estimation of the parameters above

Measurement of uncertainty: Survey for estimating parameters of underlying distributions. Model outputs Qualitative assessment of parameters. Bias: need of several sources, availability of auxiliary variable, qualitative assesment

Output accuracy Aggregation of the different sources of errors for the different used sources at the different steps of the statistical process: Existence of an analytical expression. Simulation. Main issues: Computational cost. Model specification errors not taken into account. Cost and update of the estimated parameters.

Example: Input measurement error transmission during the linkage/matching process: Xi N (m, s2), i = 1 … N X= S Xi E(X) = N (1- p1n+ p2n) m Var (X) = Var (N (1- p1n+ p2n) s2) = N2 (1- p1n+ p2n)2 s2 To be inserted during the estimation/imputation phase: Y = f(., X) + h

Eurostat activities: ESS VIP.ADMIN: Working package 2: Estimation methods Review of relevant estimations methods and provision of guidelines (2016-2018) Working package 3: Quality measures for statistics using administrative data Consortium of NSIs led by Denmark dealing with input, process and output quality (2015-2019). BIG-DATA: Assessment of the quality of Big-Data sources (including big-data selectivity). Big-data econometrics

Conclusion: Multi-sources statistics: Increasing use of estimation methods Input of uncertainty other than sampling error at different steps of the statistical production process. Parameters necessary to the estimation of the uncertainty could be obtained through surveys or qualitative assessment. Output accuracy: aggregation of the uncertainty coming fron various sources along the production process. Use of simulation methods.

Thank you for your attention Questions welcome References: Zhang, L-C. (2012). Topics of statistical theory for register-based statistics and data integration. Statistica Neerlandica, vol. 66, pp. 41-63. ESS.VIP.ADMIN http://ec.europa.eu/eurostat/cros/content/essvip-admin-administrative-data-sources_en BIG-DATA http://ec.europa.eu/eurostat/cros/content/big-data_en