Imputation in UNECE Statistical Databases: Principles and Practices

Imputation in UNECE Statistical Databases: Principles and Practices Steven Vale and Heinrich Brüngger, UNECE Statistical Division

Contents
- The ECOSOC view of statistical imputation
- Current practices
- Basic principles
- Step-by-step implementation
- Conclusions and open questions

14 November 2018

ECOSOC views
- Resolution 2006/6 on strengthening statistical capacity
- Sets limits for the use of imputation ...
- ... but also implicitly endorses it as a statistical technique
- Statistical agencies need to review their practices to ensure compliance

Defining imputation
- “A procedure for entering a value for a specific data item where the response is missing or unusable”
- Boundary issues:
  - Imputing and editing
  - Imputing and forecasting

Current practice in UNECE
- Very limited ad-hoc imputation
- Four cases:
  - Account identities
  - Regional aggregates
  - Poor quality national data with little impact on region totals
  - Re-classification
- Using imputations from others
  - Sufficient transparency in source metadata?

Basic principles (1)
- Imputed national data are not published
  - Avoids the need for consultation
- Only official sources used for imputation
  - Preference for data from same country
- Clear distinction between “real” and imputed data
- Transparency – imputed data clearly flagged, and methods documented

Basic principles (2)
- Aggregates must contain > 90% “real” data, covering > 50% of countries
- Imputed data are re-calculated periodically to adjust for revisions
- Method used defined at the level of the variable and stored as an attribute
- Decisions on the use of imputation to be taken with regard to the quality framework
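The aggregate threshold above can be sketched as a simple check. This is an illustration only, not UNECE's implementation: the `CountryValue` record, its `method` attribute and the function name are hypothetical, and it assumes that "> 90% real data" refers to the share of the aggregate's total value and "> 50% of countries" to the share of contributing countries with non-imputed values.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CountryValue:
    """One country's contribution to a regional aggregate (hypothetical record)."""
    country: str
    value: float
    imputed: bool
    method: Optional[str] = None  # assumed attribute naming the imputation method


def aggregate_is_publishable(
    values: List[CountryValue],
    min_real_share: float = 0.90,
    min_country_share: float = 0.50,
) -> bool:
    """Return True if the aggregate meets both thresholds.

    Assumption: the 90% rule is measured on the summed values and the
    50% rule on the count of countries.
    """
    total = sum(v.value for v in values)
    if not values or total <= 0:
        return False
    real = [v for v in values if not v.imputed]
    real_share = sum(v.value for v in real) / total
    country_share = len(real) / len(values)
    return real_share > min_real_share and country_share > min_country_share


example = [
    CountryValue("A", 60.0, imputed=False),
    CountryValue("B", 35.0, imputed=False),
    CountryValue("C", 5.0, imputed=True, method="linear trend"),
]
print(aggregate_is_publishable(example))
```

Keeping the `imputed` flag and the method on each record also reflects the transparency principle: imputed values stay distinguishable from "real" ones wherever they are used.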

Step-by-step application
- Automatic imputation routines to extend imputation towards the boundaries set by the ECOSOC Resolution
- One step at a time, with pause and review to consider quality and cost / benefit
- “Dashboard” to allow statisticians to choose the most appropriate method
- Implemented in the context of re-engineering of statistical database system

First step
- Use a linear trend to impute missing values
- Requirements:
  - Sufficient time series observations (at least 3 out of previous 5 periods)
  - Closeness of fit of linear trend (R² close to 1)
- Constraints:
  - Validity of R² for few observations
  - Forward imputation only

[Figure: timeline for 2000 to 2007 marking, for each year, whether data are available (Y = Yes, N = No) and whether the value is imputed (Yes or No).]
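The first-step rule can be sketched as follows. This is a minimal illustration under stated assumptions, not the routine used in the UNECE database: the function name `impute_next` is hypothetical, and the cut-off `min_r2=0.95` is an assumed stand-in for "R² close to 1", which the slides do not quantify. The function fits an ordinary least-squares line to the observed values in the previous five periods and extrapolates one period forward only.

```python
from typing import Optional, Sequence


def impute_next(
    window: Sequence[Optional[float]],
    min_obs: int = 3,
    min_r2: float = 0.95,  # assumed threshold; the source only says "close to 1"
) -> Optional[float]:
    """Impute the period following `window` from a linear trend.

    `window` holds the previous periods, oldest first, with None where
    no observation exists. Returns None when the requirements are not met.
    """
    points = [(i, y) for i, y in enumerate(window) if y is not None]
    n = len(points)
    if n < max(min_obs, 2):  # a line needs at least two points
        return None

    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    syy = sum((y - mean_y) ** 2 for _, y in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    # For a simple linear fit, R^2 equals the squared correlation.
    # A constant series (syy == 0) is fitted exactly.
    r2 = 1.0 if syy == 0 else (sxy * sxy) / (sxx * syy)
    if r2 < min_r2:
        return None

    # Forward imputation only: extrapolate to the period after the window.
    return intercept + slope * len(window)


print(impute_next([10.0, None, 14.0, 16.0, 18.0]))  # observed points lie on y = 10 + 2x
print(impute_next([None, None, 1.0, None, 2.0]))    # too few observations
```

With only three to five points, a high R² is easy to reach by chance, which is the "validity of R² for few observations" constraint noted on the slide; the threshold should be treated as a screening device rather than a test of fit.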

Next steps
- More flexibility:
  - Longer time series
  - Imputing values at start and in middle of time series
  - Non-linear trends?
- Cross-country imputation in strictly limited cases?

Conclusions
- Strong links between imputation and quality
- Trade-off between accessibility and accuracy
- Step-by-step, pause and review approach seems appropriate
- Transparency is essential
- Standardization of practices between international organizations would help

Open questions
- Are other organizations interested in defining a common policy on the use of imputation, in response to the ECOSOC Resolution?
- Could we go further and consider harmonization of methods and tools?
- How should this be done? Is a specific forum needed, or can this be dealt with in combination with work on data quality?
- Have other organizations modified their policies on imputation in the light of the ECOSOC Resolution, and if so, how?