Quality reporting Twinning project Ukraine – workshop October 2015 Karin Blix, Quality coordinator Statistics Denmark

Slides:



Advertisements
Similar presentations
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Advertisements

Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
Mogens Grosen Nielsen Statistics Denmark
Implementation of GSBPM, DDI and SDMX reference metadata at Statistics Denmark UNECE workshop 5-7 May 2015 Mogens Grosen Nielsen
The use and convergence of quality assurance frameworks for international and supranational organisations compiling statistics The European Conference.
Summary of workshop Workshop on Writing Metadata for Development Indicators Lusaka, Zambia 30 July – 1 August 2012.
Giovanna Brancato, Marina Signore Istat Work Session on Statistical Metadata (METIS) Metadata and Quality Indicators Reuse for Quality reporting Geneva,
European Conference on Quality in Official Statistics (Q2010) 4-6 May 2010, Helsinki, Finland Brancato G., Carbini R., Murgia M., Simeoni G. Istat, Italian.
Quality assurance activities at EUROSTAT CCSA Conference Helsinki, 6-7 May 2010 Martina Hahn, Eurostat.
REFERENCE METADATA FOR DATA TEMPLATE Ales Capek EUROSTAT.
Implementation of Eurostat Quality Declarations with Cost- Effective Use of Standards Q European conference on quality in statistics Vienna 2-5 June.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Overview of quality work in Statistics Denmark Kirsten Wismer.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
BAIGORRI Antonio – Eurostat, Unit B1: Quality; Classifications Q2010 EUROPEAN CONFERENCE ON QUALITY IN STATISTICS Terminology relating to the Implementation.
Statistical Metadata System in the State Statistical Committee Baku, Azerbaijan, 2013 State Statistical Committee of the Republic of Azerbaijan 1.
> 1 Eurostat meeting 5-7 June 2013 Draft ´ Mogens Grosen (Statistics Denmark)
ESSnet on microdata linking and data warehousing in statistical production: Metadata Quality in the Statistical Data Warehouse.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
1 Integration of the Eurostat and ESS Metadata Systems A. Götzfried Head of Unit B6 Eurostat.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
General Recommendations on STS Carsten Boldsen Hansen Economic Statistics Section, UNECE UNECE Workshop on Short-Term Statistics (STS) and Seasonal Adjustment.
1 Quality reporting within the Eurostat and the ESS metadata systems August Götzfried and Håkan Linden Eurostat Unit B6: Reference databases and metadata.
14-Sept-11 The EGR version 2: an improved way of sharing information on multinational enterprise groups.
Reference metadata: a step towards greater accessibility and clarity of statistical data European conference on quality in official statistics 2-5 June.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 1 Quality management Produced in Collaboration between.
13 November, 2014 Seminar on Quality Reports QUALITY REPORTS EXPERIENCE OF STATISTICS LITHUANIA Nadiežda Alejeva Head, Price Statistics.
Quality Reports Ukraine November Regulation 223/2009 on European Statistics European Statistics shall be produced  on the basis of uniform standards.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
United Nations Economic Commission for Europe Statistical Division GSBPM in Documentation, Metadata and Quality Management Steven Vale UNECE
1 Recent developments in quality related matters in the ESS High level seminar for Eastern Europe, Caucasus and Central Asia countries Claudia Junker,
>> Metadata What is it, and what could it be? EU Twinning Project Activity E.2 26 May 2013.
Eurostat Quality reporting on energy statistics Framework and experience at EU level United Nations Oslo Group on Energy Statistics Aguascalientes (Mexico),
Quality declarations Study visit from Ukraine 19. March 2015
Short Training Course on Agricultural Cost of Production Statistics
Implementation of Quality indicators for administrative data
Development of Strategies for Census Data Dissemination
Towards more flexibility in responding to users’ needs
The ESS vision, ESSnets and SDMX
5b. SDMX and reference metadata: guideline examples
Quality assurance in official statistics
Exchanging Reference Metadata using SDMX
4.1. Data Quality 1.
Documentation of statistics
Measuring Data Quality and Compilation of Metadata
Rolling Review of Education Statistics
The new metadata structure & Country Specific Notes
Statistics Denmark’s presentation of metadata
Introduction to DDI and Colectica at Statistics Denmark
Statistical Information Technology
ESTP Course Balance of Payments – Introductory course Paris, May 2014 Quality issues.
Sub-Regional Workshop on International Merchandise Trade Statistics Compilation and Export and Import Unit Value Indices 21 – 25 November Guam.
The European Statistics Code of Practice - a Basis for Eurostat’s Quality Assurance Framework Marie Bohatá Deputy Director General, Eurostat ... Strategic.
Mapping Data Production Processes to the GSBPM
Quality Reporting in CBS
Education and Training Statistics Working Group, May 2011
The role of metadata in census data dissemination
Karin Blix, Statistics Denmark,
Metadata on quality of statistical information
Prodcom Working Group Item Quality reporting and indicators
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Business architecture
Annegrete Wulff Statistics Denmark
2.7 Annex 3 – Quality reports
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Petr Elias Czech Statistical Office
Introduction to reference metadata and quality reporting
ESS conceptual standards for quality reporting
Presentation transcript:

Quality reporting Twinning project Ukraine – workshop October 2015 Karin Blix, Quality coordinator Statistics Denmark

 …or metadata at all?  Statistics have great potential to generate knowledge  A common reference for society  Informed decision making  But the quality of statistics depends very much of the users’ need  We need to inform the users of how we have produced the statistics  Quality = fitness for use Why do we need quality reporting? 2

 Regulation 223/2009 on European Statistics  European Statistics shall be produced  on the basis of uniform standards  using harmonized methods  Users shall have access to metadata describing the quality of statistical output, in order to interpret and use the statistics correctly …and 3

 Metadata is – in short – data about data  Why is metadata important?  The processes used to collect data and to produce statistics can have an impact the usefulness of data for the individual user  Metadata provides information that enables the user to assess whether the statistics suit their purpose.  Reference-metadata describes content. Quality reports can be an example of this.  Structural metadata describes data – variables and values. Short about metadata 4

Integration of metadata 5 Hvad betyder Quality declaration Variable/dataset Concept Variable database Klassifikationsdatabase Classifications Methods/ ”Survey” StatBankMethods papers Class database Concepts database

6 Metadata users

 SDMX - Statistical Data and Metadata Exchange  ESMS - Euro-SDMX Metadata Structure  User-oriented format for quality reporting  ESQRS ESS Standard for Quality Reports Structure  Producer-oriented format for quality reporting  SIMS - Single Integrated Metadata Structure  All statistical concepts of ESMS and ESQRS have been included and streamlined in Some abbreviations 7

 Producing statistics is about describing phenomena in the society  Not just anything – but some important phenomena in the society – something that some users seek information about  Population  Education  The economy  Etc.  An illustration is given by the Swede Bo Sundgren in Statistical systems: Some fundamentals Producing statistics 8

”Objective” reality (Ideal) statistical characteristics – what we are seeking information about (ideal) population (ideal) variables (true) values Conceptualised reality Reality as it is conceived and operationalised by designers Statistical target characteristics Target population Variables that can be measured Values that can be measures Reality as presented by data Reality as it is perceived by respondents and represented by data Observed object characteristics Observed objects Observed variables Observed values Statistics about reality as interpreted by users Reality as it is understood by users when interpreting data Interpreted statistics 9 Discrepancies -Coverage (first kind) -Sampling -Operational def. of variables deviate from ideal definitions Discrepancies -Coverage (second kind) -Respondents cannot be found -Respondents refuse to answer -Respondents misinterpret -Respondents make mistakes (conscious or unconscious) Discrepancies -Frames/references differ between users, designers and respondents -Understanding of statistical methods From Bo Sundgren: Statistical systems – some fundamentals (2004)

 Help for the user to understand the statistics – giving the user information about the frame we have worked within  Explain the content of the statistics  History  Purpose of the statistics  Content – population, variables etc.  Quality = Fitness for use  Quality of contents:  Relevance, Accuracy & reliability, Timeliness and punctuality, Coherence & comparability, Accessibility and clearness Solution: quality declarations 10

 Re-organisation 2014 following the ESS handbook  Three levels 1. “Front page” to appear at the webpage of Statistics Denmark, with a short description of the 9 headlines in the Structure. From the front page one can open around 100 specified topics (SIMS) 2. SIMS topics cover the more detailed quality report (see guidelines in Annex 2). From level 2 one can open annexes for further description 3. Annexes  The idea is in one product to cover all customers (national, international, EU).  Prepared in Danish and English Quality reports in Statistics Denmark 11

 Starting point is Code of Practise and ESS Quality assurance framework  Indicator 4.3 – reporting of quality  Indicator 15.5 – metadata are documented according to standardised metadata systems  Standards:  SIMS  ESQR  ESMS  GSBPM Starting point for quality reports 12

 Existing work-processes and metadata  Fragmented and non-standardised work-processes  Metadata linked to final data and no reuse  Presentation of metadata fragmented and incomplete  Concepts database incomplete  Classifications and code-lists in many places  Introduction of standards  Generic statistical business Process Model (GSBPM)  SIMS, SDMX (ESQRS and ESMS) from Eurostat  DDI and DDI-tools to ensure integrated metadata Challenges on fulfilling user-needs in a cost-effective way 13

Customized presentation at dst.dk -Must support users business processes -Many users use the quality declarations

How to write  Short easy to understand sentences. We say – have a ”high school graduate” in mind.  Say one thing at the time.  Use simple words and explain difficult words.  Be short.  Be explicit- call a spade a spade. 15

 Once for all purposes reporting  Each concept is only reported upon once and is re-usable  Integrated and consistent quality and metadata  Reporting framework where the reports are stored in the same database  A flexible and up to date system  Where future extensions are possible by adding new concepts,  Single Integrated Metadata Structure” (SIMS)  A dynamic and unique inventory of ESS quality and metadata statistical concepts has been created  In this structure, all statistical concepts of the two existing ESS report structures (ESMS and ESQRS) have been included and streamlined, by assuring that all concepts appear and are therefore reported upon only once Streamlining and harmonising metadata and quality reporting 16

METADATA IN COLECTICA Enter SIMS fields Publish at Send Quality report to EU Publish at the Intranet 17

18

ESMSSIMSESQRS 14Accuracy and reliabilityS.15Accuracy and reliabilityV 14.1Overall accuracyS.15.1Overall accuracyV.1Overall accuracy 14.2Sampling errorS.15.2Sampling error and A1. Sampling errors - indicators for UV.2Sampling error S A1. Sampling errors - indicators for PV.2.1Sampling errors - indicators 14.3Non-sampling errorS.15.3 Non-sampling error and A4. Unit non-response - rate for U and A5. Item non-response - rate for U V.3Non-sampling error Coverage errorV.3.1Coverage error S A2. Over-coverage - rateV.3.1.1Over-coverage - rate S A3. Common units - proportion S Measurement errorV.3.2Measurement error S Non response errorV.3.3Non response error S A4. Unit non-response - rate for PV.3.3.1Unit non-response - rate S A5. Item non-response - rate for PV.3.3.2Item non-response - rate S Processing errorV.3.4Processing error V.3.4.1Imputation - rate V.3.4.2Common units - proportion S Model assumption errorV.3.5Model assumption error V.3.7Seasonal adjustment Example: Accuracy and reliability 19

Process model 20

Handbook is addressed to  NSO for their own internal assessment of process and output quality  NSO as the starting point for preparing user-oriented quality reports  NSO for producer-oriented quality reports to Eurostat Single metadata structure should promote  Both user-oriented and producer-oriented should be derived from the same source  Maximum re-use of information in the metadata system  Reduction and simplification of documents  The user-oriented quality reports should be improved ESS handbook - Purpose 21

1.Introduction 2.Relevance, assessment of user needs 3.Accuracy and reliability 4.Timeliness and punctuality 5.Accessibility and clarity 6.Coherence and comparability 7.Cost and burden 8.Confidentiality 9.Statistical processing 2-6: output components7-9: process components ESS Handbook - Structure 22

The ESS Handbook applies to the following statistical processes: 1.Sample survey 2.Census 3.Statistical process using administrative source(s) 4.Statistical process involving multiple data sources 5.Price or other economic index process 6.Statistical compilation assembling a variety of primary sources (e.g. National Accounts) Statistical Processes 23

 For all 9 headlines in the structure (Relevance, Accuracy …)  For all statistical process on the whole and where relevant  For the 6 types of statistical processes (Sample Survey, Census …) Guidelines for preparing detailed quality reports 24

1 Introduction (s2): 2 Statistical presentation (s4): 2.1 Data description (s4.1): 2.2 Classification system (s4.2): 2.3 Sector coverage (s4.3): 2.4 Statistical concepts and definitions (s4.4): 2.5 Statistical unit (s4.5): 2.6 Statistical population (s4.6): 2.7 Reference area (s4.7): 2.8 Time coverage (s4.8): 2.9 Base period (s4.9): 2.10 Unit of measure (s5): 2.11 Reference period (s6): 2.12 Frequency of dissemination (s10): 2.13 Legal acts and other agreements (s7.1): 2.14 Cost and burden (s19): 2.15 Comment (s22): Content of the quality reports in DK 25

3 Statistical processing (s21): 3.1 Source data (s21.1): 3.2 Frequency of data collection (s21.2): 3.3 Data collection (s21.3): 3.4 Data validation (s21.4): 3.5 Data compilation (s21.5): 3.6 Adjustment (s21.6): 4 Relevance (s14): 4.1 User Needs (s14.1): 4.2 User Satisfaction (s14.2): 4.3 Data completeness rate (s14.3): Content (cont.) 26

5 Accuracy and reliability (s15): 5.1 Overall accuracy (s15.1): 5.2 Sampling error (s15.2): 5.3 Non-sampling error 5.4 Quality management (s13): 5.5 Quality assurance (s13.1): 5.6 Quality assessment (s13.2): 5.7 Data revision - policy (s20.1): 5.8 Data revision practice (s20.2): 6 Timeliness and punctuality (s16): 6.1 Timeliness and time lag - final results (s16.1): 6.2 Punctuality (s16.2): Contents (cont.) 27

7 Comparability (s17): 7.1 Comparability - geographical (s17.1): 7.2 Comparability over time (s17.2): 7.3 Coherence - cross domain (s18.1): 7.4 Coherence - internal (s18.2): 8 Accessibility and clarity (s11): 8.1 Release calendar (s9.1): 8.2 Release calendar access (s9.2): 8.3 User access (s9.3): 8.4 News release (s11.1): 8.5 Publications (s11.2): 8.6 On-line database (s11.3): 8.7 Micro-data access (s11.4): 8.8 Other (s11.5): 8.9 Confidentiality - policy (s8.1): 8.10 Confidentiality - data treatment (s8.2): 8.11 Documentation on methodology (s12.1): 8.12 Quality documentation (s12.2): Contents (cont.) 28

9 Contact (s1): 9.1 Contact organisation (s1.1): 9.2 Contact organisation unit (s1.2): 9.3 Contact name (s1.3): 9.4 Contact person function (s1.4): 9.5 Contact mail address (s1.5): 9.6 Contact -address (s1.6): 9.7 Contact phone number (s1.7): 9.8 Contact fax number (s1.8): Contents (cont.) 29

16 quality indicators to be reported in the ESS Quality Reports to Eurostat  RelevanceR1  Accuracy and ReliabilityA1 – A7  Timeliness and PunctualityTP1 – TP3  Coherence and ComparabilityCC1 – CC2  Accessibility and ClarityAC1 – AC3 (cover the output components in slide 7) Quality and Performance Indicators 30

U ( user ) and P ( producer ) in SIMS  P-fields are producer-oriented and are normally just the figure(s) asked for  U-fields are user-oriented and are an annotated version of the figure(s) asked for 31 Indicator U SIMS ID P SIMS ID Indicator U SIMS ID P SIMS ID Indicator U SIMS ID P SIMS ID A A CC A TP AC A TP AC A CC AC A R A TP

”Objective” reality (Ideal) statistical characteristics – what we are seeking information about (ideal) population (ideal) variables (true) values Conceptualised reality Reality as it is conceived and operationalised by designers Statistical target characteristics Target population Variables that can be measured Values that can be measures Reality as presented by data Reality as it is perceived by respondents and represented by data Observed object characteristics Observed objects Observed variables Observed values Statistics about reality as interpreted by users Reality as it is understood by users when interpreting data Interpreted statistics 32 Discrepancies -Coverage (first kind) -Sampling -Operational def. of variables deviate from ideal definitions From Bo Sundgren: Statistical systems – some fundamentals (2004)

A1Sampling error 33

A2Over-coverage 34

”Objective” reality (Ideal) statistical characteristics – what we are seeking information about (ideal) population (ideal) variables (true) values Conceptualised reality Reality as it is conceived and operationalised by designers Statistical target characteristics Target population Variables that can be measured Values that can be measures Reality as presented by data Reality as it is perceived by respondents and represented by data Observed object characteristics Observed objects Observed variables Observed values Statistics about reality as interpreted by users Reality as it is understood by users when interpreting data Interpreted statistics 35 Discrepancies -Coverage (second kind) -Respondents cannot be found -Respondents refuse to answer -Respondents misinterpret -Respondents make mistakes (conscious or unconscious) From Bo Sundgren: Statistical systems – some fundamentals (2004)

Mixed statistical processes where some variables or data for some units come from survey data and others from administrative sources  Measure for agreement between different sources  Definition: Units in both the survey and the administrative source Units in the survey A3Common units 36

A4Unit non-response 37

A5Item non-response 38

A6Data revision 39

A7Imputation 40

 Examples  News release - NYTNYT  Quality declaration – Consumer Price IndexConsumer Price Index 41

you will find: ESS handbook for quality reports 2014 ESS Quality and Performance Indicators 2014 Single Integrated Metadata Structure and its Technical Manual ESS Quality Glossary Handbook on Data Quality - Assessment Methods and Tools Handbook on improving quality by analysis of process variables ESS Quality Reporting 42

43