Data collection and processing, data quality Management seminar on global assessment Yalta, 23 - 25 September 2009 Session 10 1 Heinrich Brüngger 10.

Slides:



Advertisements
Similar presentations
Improving Statistical Systems: Advice from the UIS Technical Guide National training workshop Amman, Jordan October 2010.
Advertisements

1 WHO Conflicts of Interest Policy for WHO Experts.
Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Professional user requirements of statistical dissemination Comments on the papers to Session 1 of the 91 st DGINS Conference, Copenhagen, 26 to 27 May.
Copyright 2010, The World Bank Group. All Rights Reserved. Statistical organization Principles and models Part 1 Strengthening Statistics Produced in Collaboration.
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
1 Human resources management in NSOs Training workshop for SADC member states. Luanda, 2-6 Dec 2006 Olav Ljones, Deputy Director General, Statistics Norway.
POLICIES AND PROCEDURES FOR ARCHIVING DATA IN BURUNDI.
Chapter 7 Database Auditing Models
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Institutional arrangements and legal framework for energy statistics United Nations Statistics Division International Workshop on Energy Statistics
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
United Nations Statistics Division
United NationsUnited Nations Economic Commission for Europe Statistical Division Results of survey on organization of data collection in National Statistical.
Topic 4 How organisations promote quality care Codes of Practice
1 The system aspect of statistical quality Q2014 european conference on quality in official statistics Special session: Consistency of Concepts and Applied.
1 Hsin Chu, August 2012 Regulatory Impact Assessment Charles-Henri Montin, Senior Regulatory Expert, Ministry of economy and finance, Paris
Copyright 2010, The World Bank Group. All Rights Reserved. Planning and programming Planning and prioritizing Part 1 Strengthening Statistics Produced.
Identifying Data Needs: Workshop on Household Surveys and Measurement of Labour Force with Focus on Informal Economy Maseru, Lesotho, April 2008.
S7: Audit Planning. Session Objectives To explain the need for planning To explain the need for planning To outline the essential elements of planning.
Population Census carried out in Armenia in 2011 as an example of the Generic Statistical Business Process Model Anahit Safyan Member of the State Council.
Transition from traditional census to sample survey? (Experience from Population and Housing Census 2011) Group of Experts on Population and Housing Censuses,
By: Ada van Krimpen Director International Statistical Institute (ISI) 8 October 2014.
GLOBAL ASSESSMENT OF STATISTICAL SYSTEM OF KAZAKHSTAN ZHASLAN OMAROV DEPUTY CHAIRMAN, STATISTICS AGENCY OF REPUBLIC OF KAZAKHSTAN. 4.3.
Audit Planning. Session Objectives To explain the need for planning To outline the essential elements of planning process To finalise the audit approach.
Database Security and Auditing: Protecting Data Integrity and Accessibility Chapter 7 Database Auditing Models.
BAIGORRI Antonio – Eurostat, Unit B1: Quality; Classifications Q2010 EUROPEAN CONFERENCE ON QUALITY IN STATISTICS Terminology relating to the Implementation.
AADAPT Workshop South Asia Goa, December 17-21, 2009 Maria Isabel Beltran 1.
Monitoring public satisfaction through user satisfaction surveys Committee for the Coordination of Statistical Activities Helsinki 6-7 May 2010 Steve.
Data sources and collection methods Ken Mease Cairo, June 2009.
1 ESCWA/UNSD Expert Group Meeting on National Accounts May 2009, Cairo, Egypt Gulab Singh UN Statistics Division Exhaustive Measurement of Economy.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
New sources – administrative registers Genovefa RUŽIĆ.
Copyright 2010, The World Bank Group. All Rights Reserved. The Statistical System Features and characteristics of statistical systems Part 2 Strengthening.
Legal and institutional foundation of economic statistics Overview of international experience Regional Workshop for African Countries on Compilation of.
United Nations Statistics Division Work Programme on Economic Census Vladimir Markhonko, Chief Trade Statistics Branch, UNSD Youlia Antonova, Senior Statistician,
Implementation of the European Statistics Code of Practice Yalta September 2009 Pieter Everaers, Eurostat.
Statistics Netherlands’ modernization programme: the use of administrative data, lessons learned and the way ahead. Geert Bruinooge Assistant Director.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Institutional and legal framework of the national statistical system: the national system of official statistics Management seminar on global assessment.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
STRATEGY FOR DEVELOPMENT OF ISIS AND IT STRATEGY IN THE NSI-BULGARIA Main principles, components, requirements.
United Nations Workshop on Principles and Recommendations for a Vital Statistics System, Revision 3, for African English-speaking countries Addis Ababa,
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
1 Performance Auditing ICAS & IRAS Officers NAAA 21 Jan 2016.
10 August Inter-Regional Workshop on the Production of Gender Statistics New Delhi, India, 6-10 August 2007 Strengthening National Gender Statistics.
14-Sept-11 The EGR version 2: an improved way of sharing information on multinational enterprise groups.
Interstate Statistical Committee of the Commonwealth of Independent States (CIS-STAT) CES seminar “Challenges for future population and housing censuses.
1 This project is supported by the European Union 3 rd MEDREG-IMME Seminar Reform and Opening of Maghreb Electricity Markets September 2013 MRA (Malta)
United Nations Economic Commission for Europe Statistical Division WHAT MAKES AN EFFECTIVE AND EFFICIENT STATISTICAL SYSTEM Lidia Bratanova, Statistical.
1 European Statistics Code of Practice. I.Institutional Environment Principle II.Statistical processes Principle III.Statistical Output Principle.
Administrative Data and Official Statistics Administrative Data and Official Statistics Principles and good practices Quality in Statistics: Administrative.
Introduction to Statistics Estonia Study visit of the State Statistical Service of Ukraine on Dissemination of Statistical Information and related themes.
Hallgrímur Snorrason Management seminar on global assessment Session 6: Institutional and legal framework of the national statistical system Yalta
Dissemination of sensitive group-specific results
Session 2: Institutional arrangements for energy statistics
Quality assurance in official statistics
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
DELINEATION OF A NATIONAL STATISTICAL SYSTEM (NSS)
OECD Chief Statistician and Director, Statistics Directorate
Generic Statistical Business Process Model (GSBPM)
Ethical questions on the use of big data in official statistics
Sub-regional workshop on integration of administrative data, big data
Quality, efficiency and productivity: a challenge for official statistics EFTA/CROSTAT/EUROSTAT Strategic Management Seminar, Split, November 2007.
On data accessibility and confidentiality……..
Heinrich Brüngger, Director
Jan Byfuglien Statistics Norway
Transformation of the National Statistical System: Experience
General Challenges and Drawbacks.
Presentation transcript:

Data collection and processing, data quality Management seminar on global assessment Yalta, September 2009 Session 10 1 Heinrich Brüngger 10

Part I: Data collection and processing Part II: Confidentiality (with emphasis on data collection and processing) Part III: Data quality (quality management) 2

Nature of decisions on methods of data collection and processing Under professional independence (FP 2), therefore within the statistical system without outside interference once the decision about the “what” to measure in broad terms have been made Decisions about what official statistics should measure may involve the political level, and have to involve discussions with a broad range of users 3

Professional independence The “how” part of statistical activities has to focus exclusively at obtaining the most reliable representation of the phenomenon of the real world with the given resources Scientific methods, international standards, and empirically established good practices are the best guide for professional independence The statistical law may require the adoption by government in the case of statistical surveys because of explicit legitimacy in the interaction with respondents and the burden caused to respondents, but governments should not interfere in methodological issues 4

Main data sources (FP 5) Official statistics has to have the right to use three types of sources: ◦ Statistical surveys: data collection from individuals, households, corporate and unicorporated businesses, and public entities outside the own government structure, for the exclusive purpose of statistics (most of which official statistics) ◦ Administrative records: secondary use of data collected primarily for administrative purposes about individuals or private businesses by other parts of the same government structure 5

Main data sources (ctd.) ◦ Secondary use of environment and territorial observation and monitoring data collected by specialised government agencies For statistical surveys, the statistical law applies to all phases of the statistical activity; for the other two sources, primary data collection has to be based on other laws, and the statistical legislation becomes only applicable once data are handed over to a producer of official statistics 6

Statistical programmes If the law requires input-oriented programmes, a clear distinction has to be made between statistical surveys, where the statistical law and the statistical programmes are the legal basis for data collection, and the second and third data source, where the primary data collection is for another purpose and has therefore to have a different legal basis 7

Minor data sources for national official statistics Official statistics from a partner country or from an international organisation (especially for international flows) Registers in the hand of specialised private companies (as one of the sources used to keep statistical registers updated) 8

Statistical surveys Exhaustive vs. sample surveys (mixed forms possible) Mail, face to face interview, telephone interviews, internet (mixed forms possible) Compulsory participation of the respondent vs. right to decline participation (mixed forms possible) Recall and follow-up policy for non-response, depending on type of survey 9

Response burden Response burden has to be assessed in advance and taken into account: ◦ all new surveys have to be tested first (pilots) ◦ all respondents have to be informed about the purpose and legal basis of the survey, and especially about the confidentiality measures ◦ forms and questions have to be understandable, non-intrusive, and make answers possible for him/her from memory or from existing material (businesses) 10

Response burden (ctd.) ◦ first reminders have to be proportionate ◦ response rates have to be closely monitored The NSI has also to act as custodian for the respondents; users will tend to minimise the issue of response burden 11

Surveys (ctd.) Information given by a respondent in statistical surveys have no legal implications for him; they can (and should) therefore be checked and when necessary altered (edited) by the NSI In many cases, the characteristics of interest to users cannot be taken over directly into questionnaires, but have to be compiled as derived characteristics using several questions 12

Non-response (valid for all data sources) The NSI has to develop a policy for treating general and item non-response, which has to be subsequently fine-tuned and documented for each survey Imputations for missing (or unusable) information is legitimate, but a careful balance has to be found between “not enough” and “too much” (risk of introducing biases) 13

Administrative sources The statistical office must have the legal and de facto right to receive, for its tasks, regularly and on an ad hoc basis, microdata sets from other ministries and public entities taken from their administrative sources This does not mean that direct identifiers have to be included in all cases, but the possibility should not be entirely excluded 14

Administrative sources (ctd.) Statistical producers have the right to alter (edit an impute) the administrative data received from other ministries to improve compatibility with statistical definitions and classifications Data received in this way should never be given back to the data owner or transferred by the statistical producer to a third party for administrative purposes 15

Relationship between data sources Before considering introducing a new survey, or expanding an existing one, for new information needs, other possible data sources (especially administrative) have to be examined for their potential The possibility of combining questions about different areas in the same survey is limited: the survey system has to be structured carefully 16

Relationship between data sources (ctd.) Administrative sources are in constant change, and some risk of disappearing completely The final decision-making about administrative sources is outside the statistical system. Therefore, the NSI has the responsibility to constantly monitor (if possible anticipate) and react to planned changes that may affect the use of administrative sources for official statistics positively or negatively 17

Relationship between data sources (ctd.) NSIs have to avoid considering data sources, and especially surveys, as stove-pipe type of parallel operations, but rather as a system of interrelated operations with n:m relationship between data sources and results Very often, the best estimates are based on a judicious combination of sources, combining the individual strengths and reducing the individual weaknesses Statistical registers of good quality are a cornerstone for a good survey system, and for matching different sources 18

Relationship between data sources (ctd.) When results on the same issue are published from different sources separately (i.e. without prior integration), an accounting relationship between the results should be established, with the components quantified if possible Otherwise, only the most complete aggregate should be released as results of official statistics in absolute terms 19

Confidentiality (FP 6) Statistical confidentiality aims at protecting the privacy of individual units about which data are collected and processed. It has two components: ◦ Producers of official statistics do not disclose, either directly or indirectly, characteristics about protected units to any third party in such a way that any user might derive additional information about a protected unit 20

FP 6 (ctd.) Producers of official statistics use data about protected individual units only for statistical purposes (official statistics in the first place) The statistical system, and in the first instance the NSI, have to make sure in an effective a credible way that the two elements of the confidentiality principle are strictly followed in all parts of the production and dissemination processes, and that confidential data are stored in a safe way 21

Confidentiality in data collection ◦ Legal situation in countries may differ with respect to:  Protected units: physical persons, private households, private businesses (whether legal persons or unincorporated enterprises) should be fully protected; government units as institutions cannot invoke statistical confidentiality; public enterprises in a competitive market are a borderline case  All characteristics treated alike, or some considered to be free (e.g. for legal persons), or particularly sensitive (higher degree of protection, e.g. about health, crime, ethnicity) 22

Confidentiality in data collection (ctd.) ◦ Data collection in statistical surveys: replies of respondents should go back as directly as possible to the statistical producer. No other government agency should be involved as intermediary (regional level!) ◦ Data processing: except in the context of statistical registers, identifiers of units should be separated from the context variables by the statistical producer at an as early stage as possible, and either stored separately or destroyed 23

Confidentiality in data collection (ctd.) ◦ Data security: final (edited) microdata sets have to be stored safely, and access limited to those within the NSO who use the data regularly. No direct access from outside the NSO should be possible. For files with identifiers (other than statistical registers), each access has to be documented. ◦ Data security, update and access regulations for statistical registers have to be worked out carefully in a separate book of rules. 24

Quality management: transparency as prerequisite All decision on methods (collection/processing/dissemination) have to be documented and made accessible for the public (FP 3) Quality management is addressed by several principles of the EU Code of Practice Quality management refers to products, processes and staff 25

Quality management: products Products to be regularly assessed with pre-defined quality criteria (and rejected if minimal quality level for fitness of use not met): ◦ Final micro-data sets from a data collection instance ◦ Results of official statistics prior to release Products to be assessed occasionally ◦ Questionnaires for surveys ◦ Dissemination products 26

Products to be released An important element of professional independence is the right of the NSI not to release official results if pre-defined minimal quality criteria are not met; this has to be a decision by the chief statistician This does not preclude the use and release of such results as one input among others into a broader (e.g. geographical) aggregate 27

Quality management: processes Processes should be defined in terms of who decides on what and when within the statistical system, and who has to be consulted and informed; they have to be documented for mainly internal use Processes should be defined for each phase in a generic statistical business model (from design to archiving) 28

Processes (ctd.) It is important to define the system of processes and responsibilities in such a way that the tendency for creating “black boxes” by responsible organisational units is offset. Stove-pipes (same organisational unit responsible for all phases of the production process for a given activity such as survey) are problematic also from this point of view 29

Processes (ctd.) The process definitions will be different between on-going activities (with room for small improvements), and major redesigns/innovations, and between production activities ending in the release of results of official statistics and infrastructure activities such as dissemination or training 30

Cross-cutting evaluations An overall evaluation of the portfolio of activities and the extent to which the needs of users are met is necessary when the preparation for the next multi-annual programme starts Extensive user consultations are necessary, and external expertise (such as in a GA) may help in identifying and prioritising gaps to be addressed in the next programme 31

Tendency in organisational structures of NSI First-level division by broad subject areas used to be predominant Recent shift to functional break-down at first level (data collection; processing; dissemination) for various reasons, among others improved quality management Senior management has to provide incentives for quality management leading to measurable improvements 32

Staff management Quality management has to become an almost automatic part of the activity for each staff member Regular training on management issues should include a substantial part on quality management Regular assessment of the individual performance of each staff member can be very instrumental, if not implemented as a bureaucratic routine, but as open dialogue towards improvements 33

Thank you for your attention 34