Workshop on Price Index Compilation Issues February 23-27, 2015 Data Collection Issues Gefinor Rotana Hotel, Beirut, Lebanon.

Slides:



Advertisements
Similar presentations
Katherine Jenny Thompson
Advertisements

Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
United Nations Statistics Division/DESA
©2014 IDBS, Confidential Statistical Process Control Workshop An Introduction to the Principles behind SPC Ilca Croufer.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
Deliverable 2.8: Outliers Gary Brown Office for National Statistics UK.
1 Editing Administrative Data and Combined Data Sources Introduction.
Palestinian Central Bureau of Statistics (PCBS) Palestine Poverty Maps 2009 March
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
Multiple Indicator Cluster Surveys Survey Design Workshop Data Analysis and Reporting MICS Survey Design Workshop.
United Nations Statistics Division/DESA International Recommendations for the Index of Industrial Production (IIP)
Copyright © 2007 Pearson Education Canada 1 Chapter 12: Audit Sampling Concepts.
France : Improving checks in customs data OCDE – 7 November 2011.
Ragin’s comparative method. Characteristics comparative method Combinations of conditions are attributed causal value Cases are studied as unique combinations.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
Workshop on Price Index Compilation Issues February 23-27, 2015 Publication and Analysis Gefinor Rotana Hotel, Beirut, Lebanon.
Editing and Validation Prices and PPPs* ICP-Western Asia Regional Meeting Beirut, Lebanon May 26, 2005 Yonas Biru *Presentation adopted.
Methods of Media Research Communication covers a broad range of topics. Also it draws heavily from other fields like sociology, psychology, anthropology,
INTERNATIONAL LABOUR ORGANIZATION Conditions of Work and Employment Programme (TRAVAIL) 2012 Module 13: Assessing Maternity Protection in practice Maternity.
Workshop on Price Index Compilation Issues February 23-27, 2015 Sample Design, Selection, and Maintenance Gefinor Rotana Hotel, Beirut, Lebanon.
ILO Department of Statistics1 ILO experience in quickly estimating the impact of financial crisis on the global labour market International Seminar on.
Workshop on Price Statistics Compilation Issues February 23-27, 2015 Compilation of Elementary Indices Gefinor Rotana Hotel, Beirut, Lebanon.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing processes Core business of the NSO Part 2 Strengthening Statistics Produced in Collaboration.
Workshop on Price Index Compilation Issues February 23-27, 2015 Classification Systems Gefinor Rotana Hotel, Beirut, Lebanon.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 19 Process of Quantitative Data Analysis and Interpretation.
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
1 Calculation of unit value indices at Eurostat Training course on Trade Indices Beirut, December 2009 European Commission, DG Eurostat Unit G3 International.
Workshop on Price Index Compilation Issues February 23-27, 2015 Market Basket Items and Weights Gefinor Rotana Hotel, Beirut, Lebanon.
Handbook on Residential Property Price Indices Chapter 5: Methods Jan de Haan UNECE/ILO Meeting, May 2010.
UNECE Workshop on Consumer Price Indices Session 2: Sampling of Outlets and Products Presentation by Cengiz Erdoğan, TurkStat October Istanbul, Turkey.
UNECE Workshop on Consumer Price Indices Session 4: Practical Experiences with Calculating Elementary Indices and Treatment of Missing Prices Presentation.
Workshop on Price Index Compilation Issues February 23-27, 2015 Imputation of Missing Values, Seasonal Products and Quality Changes Gefinor Rotana Hotel,
Understanding Sampling
9. Applying scientific tools & techniques for improving customer satisfaction.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
A Key Component for ICP 2004 A well-established product list  Compare „ like with like “  Collect prices for relevant products  Establish reliable PPPs.
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
1.  Interpretation refers to the task of drawing inferences from the collected facts after an analytical and/or experimental study.  The task of interpretation.
STATISTICAL METHODS AND DATA MANAGEMENT TOOLS FOR OUTLIER DETECTION IN TRI DATA Dr. Nagaraj K. Neerchal and Justin Newcomer Department of Mathematics and.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Asunción,
1 Outline  Introduction  Methodology for compiling CPI  Data Collection and Compilation.
Study of Editing and Imputation Practices at Statistics Finland Janika Konnu and Pauli Ollila Statistics Finland Q2010: Editing session Wednesday 5 th.
DETECTION OF OUTLIERS IN THE CANADIAN CONSUMER PRICE INDEX (CPI) DETECTION OF OUTLIERS IN THE CANADIAN CONSUMER PRICE INDEX (CPI) ABDELNASSER SAÏDI AND.
The Role of the Tukey algorithm in validation procedures for prices data in a consumer price index: the UK experience on this and more general aspects.
1 ICP 2004 Workshop for Regional Coordinators Eurostat, Luxembourg March 24 to 28, 2003 Chapter 4 Price concepts and quality.
E-PRTR incompleteness check Irene Olivares Industrial Pollution Group Air and Climate Change Programme Eionet NRC workshop on Industrial Pollution Copenhagen.
Slide 7.1 Saunders, Lewis and Thornhill, Research Methods for Business Students, 5 th Edition, © Mark Saunders, Philip Lewis and Adrian Thornhill 2009.
CPI/HICP price collection Validation of prices Twinning project: Development of new statistical methodologies and indicators in selected areas of statistics.
Nagraj Rao Statistician Asian Development Bank CROP CUTTING: AN INTRODUCTION.
Producer Price Indices in Denmark - Producer price index for commodities (PPI) March 20 th 2015.
FDI - Imputation. Overview Introduction Overview of Imputation Methods Overview of Outliering methods Overview of Estimation methods Aggregation Disclosure.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 2 Measurement issues Business Statistics and Registers 1.
Comparison of Estimation Methods for Agricultural Productivity Yu Sheng ABARES the Superlative vs. the Quantity- based Index Approach August 2015.
Nagraj Rao Statistician Asian Development Bank CROP CUTTING: AN INTRODUCTION.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
1 IT system and data validation process in Latvian CPI/HICP Prepared by Oskars Alksnis, Central Statistical Bureau of Latvia EU Twinning Project Forwarding.
Workshop on Price Index Compilation Issues February 23-27, 2015 Price Specifications Gefinor Rotana Hotel, Beirut, Lebanon.
Regional Coordinators Meeting
PRODUCTION PROCESS AND FLOW
Regional Workshop on Short-term Economic Indicators and Service Statistics September 2017 Chiba, Japan Alick Nyasulu SIAP.
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
WinTIM, Indices methodology and tool Wiking Althoff, CESD Communautaire External trade experts meeting on the CARDS Programme, Luxembourg, May.
The Norwegian CPI Data Validation and Editing
Annual routines with chaining
Data validation handbook
Presentation transcript:

Workshop on Price Index Compilation Issues February 23-27, 2015 Data Collection Issues Gefinor Rotana Hotel, Beirut, Lebanon

2 Data Collection Methods What methods can be used to collect data? What are the advantages of each method? What are the disadvantages? What is the most effective method? How do we decide which method to use for the CPI? What factors? What is ideal method – CPI vs PPI? One size fits all?? February 23-27, 2015METAC and IMF Statistics Dept.

Period to Period Price Collection Who do we collect data from? What types of data should be collected and why? How often? How do we prepare for price collection? Which data collection method is best? What are the challenges? February 23-27, 2015METAC and IMF Statistics Dept. 3

Data Collection Forms design Complete specifications Prices (including historic?) Reasons for price change New specifications All data collected returned to HQ February 23-27, 2015METAC and IMF Statistics Dept. 4

Data Validation Includes Data editing Non-statistical & statistical checking Outliers Missing prices Credibility checking February 23-27, 2015METAC and IMF Statistics Dept. 5

The Challenge “CPI estimates are subject to errors that may arise from a variety of sources” –ILO Because prices in shops can change and the CPI production timetable can be very tight, editing needs to be done in a short period of time. Precision matters! February 23-27, 2015METAC and IMF Statistics Dept. 6

The Challenge “CPI estimates are subject to errors that may arise from a variety of sources” –ILO Because prices in shops can change and the CPI production timetable can be very tight, editing needs to be done in a short period of time. Precision matters! Validation feeds into long-term methodological development February 23-27, 2015METAC and IMF Statistics Dept. 7

The Challenge “CPI estimates are subject to errors that may arise from a variety of sources” –ILO Because prices in shops can change and the CPI production timetable can be very tight, editing needs to be done in a short period of time. Precision matters! Validation feeds into long-term methodological development February 23-27, 2015METAC and IMF Statistics Dept. 8

Data Validation in the field Facilitated by electronic data capture What are the appropriate checks? February 23-27, 2015METAC and IMF Statistics Dept. 9

Data Validation in the field Facilitated by electronic data capture What are the appropriate checks? February 23-27, 2015METAC and IMF Statistics Dept. 10

Data Validation at the Office Two steps: Detection of possible errors Verification and correction All collected prices should be examined. But not all with the same degree of scrutiny. Most time and price editing effort should focus in areas where. Elementary aggregates have relatively small price samples. Elementary aggregates with high expenditure weights. Elementary aggregates with large weights but few quotes. February 23-27, 2015METAC and IMF Statistics Dept. 11

Data Editing Two tools available for checking for errors and outliers: Non-statistical checking Statistical checking The following techniques work best with large volumes of data. But still applicable to small samples. February 23-27, 2015METAC and IMF Statistics Dept. 12

Data Editing – non-statistical checking Non-statistical checking Manual inspection of data Includes price change and min-max tests Comparisons can be expanded to include comparison with prices of similar products collected from similar outlets by different collectors in the same region (i.e. movements/levels compared with price data collected in parallel from other outlets). Refined over time. Should not automatically result in deletion February 23-27, 2015METAC and IMF Statistics Dept. 13

Data editing - Statistical checking The main conceptual difference with non-statistical checking is that statistical checking techniques automatically calculate the limits for acceptable movement. Automatic adjustment of acceptable limits as new price data is received. Data intensive for reliable results. Best suited for use at the head and regional offices. Compares each price change with changes in the other items from a given price sample. The price sample is the one to which the item being checked belongs. The sample for testing may be a combination of price samples for similar products. February 23-27, 2015METAC and IMF Statistics Dept. 14

Statistical checking - methods The various methods Median and quartile values The acceptable limits of price change are set as a predefined multiple of the range between the median and the quartiles. Assumes normal distribution Modified median and quartile values Transforms distances from the median to increase effectiveness of check Tukey algorithm Applies adjustment to the inter quartile range February 23-27, 2015METAC and IMF Statistics Dept. 15

A Key Benefit of Statistical Checking The limits are set by the data and they can change over time As further data is received As price levels change But these methods are data intensive. Remember that the objective is not to flag observations for deletion but to identify those that need closer scrutiny. February 23-27, 2015METAC and IMF Statistics Dept. 16

Dealing with Outliers Outliers are defined as price movements that are exceptionally large compared with the majority of movements. The strategy adopted by most statistical institutes is to reduce the impact of exceptional observations rather than exclude them from the index. The tests for outliers are the same as those for identifying potential errors, as described (described above). If outliers are to be modified, they are usually modified to lie on the pre-defined boundaries of acceptable movement or to be imputed by the movement of a suitable sample of prices. Price collectors & supervisors are responsible for providing information about the reasons for extreme price movements or levels and why price quotes have been accepted as valid. February 23-27, 2015METAC and IMF Statistics Dept. 17

Missing prices – minimising the number when in the field There are a number of ways of minimising the occurrence of missing observations. For example. Maintaining the relevance of the sample of items priced. Early contact with respondents that are usually late returning forms. Substitution and quality change/adjustment. Separate issue. February 23-27, 2015METAC and IMF Statistics Dept. 18

Credibility checking Other types of errors are more demanding Exceeding pre-defined movement limits Other errors may require checking with the shopkeeper. Problems caused by unusual prices and price movements can be avoided by training price collectors to recognise these situations and to collect relevant explanatory information during the initial price collecting visit. February 23-27, 2015METAC and IMF Statistics Dept. 19