Quality in Italian consumer price survey: optimal allocation of resources and indicators to monitor the data collection process Federico Polidoro, Rosabel.

Slides:



Advertisements
Similar presentations
Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Advertisements

The Bulgarian CPI And The Index Of A Small Basket Of Goods And Services Joint UNECE/ILO Meeting on Consumer Price Indices Geneva, May 2006 National.
Methodological approaches to recording certain types of services in the Consumer Price Index in Belarus Ekaterina Grikhanova, National Statistical Committee.
1 Third Workshop on ICP Western Asia Beirut, October 2004 Design of ICP price survey Sultan Ahmad, Consultant Based on Keith.
Introduction Simple Random Sampling Stratified Random Sampling
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
Maintaining high quality surveys with optimized interviewers replacements : the new French sample monitoring strategy Sébastien Faivre, INSEE, Head of.
UNECE Workshop on Consumer Price Indices Session 1: The Concept, Scope and Coverage of the CPI Presentation by Cengiz Erdoğan, TurkStat October Istanbul,
Introduction to Statistics
Sampling.
Chapter 17 Additional Topics in Sampling
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
Q USE OF PROCESS DATA TO DETERMINE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY Annica Isaksson Linköping University, Sweden Peter Lundquist.
~ Draft version ~ 1 HOW TO CHOOSE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY IN THE PRESENCE OF NONRESPONSE AND MEASUREMENT ERRORS Annica Isaksson.
Sébastien FAIVRE INSEE Workshop on scanner data, Stockholm, /06/2012 Would scanner data improve the French CPI?
1 Price index of pharmaceutical products, other medical products and appliances in the HICP/CPI Pia Skare Rønnevik.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (13-15 May 2008) Sample results expected accuracy in the Italian Population and Housing.
UNECE Workshop on Consumer Price Indices Session 3: Calculation Expenditure Weight Presentation by Cengiz Erdoğan, TurkStat October Istanbul, Turkey.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Using survey data collection as a tool for improving the survey process Silvia Biffignandi, Antonio Laureti Giulio Perani University of Bergamo Istat Istat.
Aim: How do we find confidence interval? HW#9: complete question on last slide on loose leaf (DO NOT ME THE HW IT WILL NOT BE ACCEPTED)
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 13.
United Nations Economic Commission for Europe Statistical Division Calculation of the consumer price index - Recommendations of the CPI Manual Joint EFTA/UNECE/SSCU.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
Sampling: Theory and Methods
Regional Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics Dr. Cem BAŞ September 2013 Ankara - Turkey.
The Statistical Business Register of Macao SAR Government of Macao SAR Statistics and Census Service.
9 th Workshop on Labour Force Survey Methodology – Rome, May 2014 The Italian LFS sampling design: recent and future developments 9 th Workshop on.
Workshop on Price Index Compilation Issues February 23-27, 2015 Sample Design, Selection, and Maintenance Gefinor Rotana Hotel, Beirut, Lebanon.
Sampling Methods. Definition  Sample: A sample is a group of people who have been selected from a larger population to provide data to researcher. 
Poverty measurement: experience of the Republic of Moldova UNECE, Measuring poverty, 4 May 2015.
Luigi Biggeri 1, Rita De Carli 2 and Tiziana Laureti 3 1 Istat and University of Firenze, Italy; 2 Istat, Italy; 3 University of Tuscia, Italy Geneva,
1 MODERNIZATION OF BELARUSIAN STATISTICS _________________________________________________ IMPLEMENTATION OF THE PROCESS APPROACH IN ORGANIZING THE STATISTICAL.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
1 Biggeri L*., Brunetti* A. and. Laureti° T *Italian National Statistical Institute (Istat), Rome, Italy; ° University of Tuscia, Viterbo, Italy Geneva,
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
Workshop on Price Index Compilation Issues February 23-27, 2015 Market Basket Items and Weights Gefinor Rotana Hotel, Beirut, Lebanon.
ECE/ILO Meeting on Consumer Price Indices, May 10-12, 2006, Geneva1 E-COMMERCE IN THE ISRAELI CPI Yoel Finkel Merav Yiftach Central Bureau of Statistics,
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
United Nations Economic Commission for Europe Statistical Division UNECE Workshop on Consumer Price Indices Istanbul, Turkey,10-13 October 2011 Session.
Using administrative registers in sample surveys European Conference on Quality in Official Statistics 3-–6 May 2010 Kaja Sõstra Statistics Estonia.
UNECE Workshop on Consumer Price Indices Session 2: Sampling of Outlets and Products Presentation by Cengiz Erdoğan, TurkStat October Istanbul, Turkey.
Impact of updating weights on tracking performance and volatility: Industry survey G. Bruno, L. Crosilla, P. Margani, A. Righi EU Workshop on Recent Developments.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
The use of registers in sampling of outlets and routines for drawing and inclusion of new outlets CPI workshop Istanbul, 10 – 13 October 2011.
Meeting of the Group of Experts on Consumer Price Indices. Geneva May 2014, UNECE Workshop 7: Seasonal products. Palais des Nations, Geneva - Meeting.
Sampling design issues in Italian experience on scanner data and the possible integration with microdata coming from traditional data collection Claudia.
The Swedish experience with scanner data From sampling to index calculation Paulina Jonéus.
Towards an improvement of current migration estimates for Italy Domenico Gabrielli, Maria Pia Sorvillo Istat - Italy Joint UNECE-Eurostat Work session.
Lecture VI Statistics. Lecture questions Mathematical statistics Sampling Statistical population and sample Descriptive statistics.
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
1 Outline  Introduction  Methodology for compiling CPI  Data Collection and Compilation.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic Business Register in the Czech Statistical Office =DISSEMINATION.
United Nations Economic Commission for Europe Statistical Division UNECE Workshop on Consumer Price Indices Istanbul, Turkey,10-13 October 2011 Session.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Wiesbaden Group on Business Registers International Roundtable on Business Survey Frames Tallinn, 27 – 30 September 2010 Simonetta Cozzi,ISTAT, Italy Wiesbaden.
Realisation of Outlets Register for the implementation of new probability sample strategy on the Consumer Price Index Survey Wiesbaden Group on Business.
Price Indices for External Trade of Goods Eleonora Baghy 26/05/2013
Stratified Sampling for Data Mining on the Deep Web
Annual routines with chaining
Sampling and estimation
Wir bewegen I n f o r m a t i o n e n
Carsten Olsson Eurostat
Presentation transcript:

Quality in Italian consumer price survey: optimal allocation of resources and indicators to monitor the data collection process Federico Polidoro, Rosabel Ricci, Anna Maria Sgamba ( Istat - Italy )

introduction quality in Consumer Price Survey two research topics 1. the optimal allocation of the available resources (minimizing sample error + burden and cost) 2. the definition of a system of indicators to monitor data collection process (minimizing non sample error)

the calculation of a consumer price index (CPI) requires a large amount of resources the optimal allocation of the available resources introduction allocating these resources in the most efficient way (quality: burden and cost) the aim the issue

indicators to monitor data collection process introduction improving data quality (quality: accuracy) the definition of a system of indicators to monitor data collection process the issue the aim

1. the optimal allocation of the available resources

Approach description Italian background Approach to variance estimation Cost function Case study and results 1. the optimal allocation of the available resources

identifying the optimal sample sizes either in terms of outlets or in terms of elementary items observed in order to minimize sample error measured by sample variance the objective of this research 1. the optimal allocation of the available resources

the optimal allocation approach 1. the optimal allocation of the available resources derive optimal sample sizes minimizing variance of the estimates for a given cost  a variance function  a cost function 2 pillars in order to

Italian background consumer price index sampling structure Sampling of geographical areas Sampling of outlets Sampling of products Sampling of elementary items in each outlet 1. the optimal allocation of the available resources

consumer price index sampling design non-probability sampling consumer price index sampling methods Italian background 1. the optimal allocation of the available resources

Italian background consumer price index sampling methods Sampling of geographical areas the selection of geographical areas is established by Italian laws (No 222/1927 and 621/1975) in 2007 prices were collected in 85 county chief towns (Municipal Offices of Statistics, MOS) all over the national territory 1. the optimal allocation of the available resources

Italian background Sampling of outlets within each county chief towns, the selection of outlets is carried out by MOS sample is drawn by outlet list of the Chamber of commerce, statistical business register (ASIA), census data and other local sources the outlets with the highest total sales are chosen (mix of cut-off and quota sampling) in 2007 prices are collected in about outlets all over the national territory consumer price index sampling methods 1. the optimal allocation of the available resources

Sampling of products in 2007, 540 products are included in the CPI’s consumer price index sampling methods the selection of products is carried out by National Institute of Statistics (Istat) the selection of the products - a list (basket) of products types with product type specifications - is based on sales data (cut-off sampling) Italian background 1. the optimal allocation of the available resources

Sampling of elementary items in each outlet within each outlet, the selection of elementary items is carried out by MOS’s price collector the most sold elementary items is chosen (the representative item method) in the 2007 about price quotations are collected all over the national territory consumer price index sampling methods Italian background 1. the optimal allocation of the available resources

sample update yearly base revision consumer price index sampling methods optimum sample allocation current sizes of samples for elementary items are not optimal Italian background 1. the optimal allocation of the available resources

the approach to variance estimation 1. the optimal allocation of the available resources The Swedish approach has been used to estimate the variance of CPI (Dalén, Ohlsson, 1995) the sample is considered drawn from a two- dimensional population of products and outlets a cross-classified sample (CCS)

the approach to variance estimation 1. the optimal allocation of the available resources  representative products – as rows (i)  outlets – as columns (j)  stratification into categories of products – stratum (g)  stratification into outlet groups – stratum (h)  the crossing of strata - cell (g,h)  the parameter (index) = I  parameter estimator (index) = Î

the approach to variance estimation 1. the optimal allocation of the available resources the general index (target parameter) V gh = weight for cell turnover for the category of products g traded in the outlets of group h where the cell index is I gh = index cell

the approach to variance estimation 1. the optimal allocation of the available resources w i = weight for representative product i w h = weight for outlet j l ij = 1 if representative product i is traded in outlet j l ij = 0 otherwise the cell index where

the approach to variance estimation 1. the optimal allocation of the available resources the estimated general index the estimated cell index

1. the optimal allocation of the available resources in CCS assumption the variance estimator can be decomposed into: V PRO = variance between representative products V OUT = variance between outlets V INT = outlet and representative product interaction variance V(Î) tot ~ V PRO + V OUT + V INT where the approach to variance estimation

1. the optimal allocation of the available resources formulas for variance estimation the approach to variance estimation

1. the optimal allocation of the available resources with the following formula for variance estimation the approach to variance estimation where

Case study 1. the optimal allocation of the available resources one geographical area Udine county chief town (Resident population: ) one COICOP division (two-digit level) “Food and non alcoholic beverages” reference period December 2007

Case study 1. the optimal allocation of the available resources Outlets are divided into 12 strata according a commercial distribution type (reduced to 5 types for Food and non alcoholic beverages) Representative products are divided into 52 strata according to the national nomenclature (categories of products) Currently for outlets and products purposive sampling is used but a probability sampling has been postulated for both the approach to variance estimation

Case study 1. the optimal allocation of the available resources Inclusion probabilities for representative products (π gi ) Inclusion probabilities for outlets (π hj ) Imputation by brands information in each strata Imputation by the amount of representative products collected in each outlet the approach to variance estimation

Case study 1. the optimal allocation of the available resources main numerical results the approach to variance estimation Sample size = Î (index) = Food and non alcoholic beverages Division V PRO = V OUT = V INT = V TOT = % confidence interval

1. the optimal allocation of the available resources the cost function one data collection method Thus the following function cost is used interviewers collect prices each month by visiting each outlet

1. the optimal allocation of the available resources the approach to cost function estimation C 0 = fixed cost (i.e. for administration and other) n h = the number of outlets into stratum h m g = the number of products into stratum g a h = fixed cost per outlet into stratum h (i.e. for travel time) b h = cost to measuring one product in the outlets of stratum h r gh = average relative frequency of products in stratum g sold in outlets of stratum h where

1. the optimal allocation of the available resources the allocation problem County chief town: Udine Resident population: Reference time: December 2007 Food and non alcoholic beverages price quotes: Food and non alcoholic beverages outlets: 43 C 0 = not considered a h = we consider the average travel time  h b h = we consider the average collecting time  h Estimate C TOT = 182 h. Case study

Conclusion 1. the optimal allocation of the available resources Developing the contents of the paper solving the problem of nonlinear optimization deriving from the Cost and Variance formula Important news: preliminary attempt to estimate Italian CPI variance Enhancing effort to move towards a probability approach to CPI sampling

2. indicators to monitor data collection process

Data collection: the net design Istat CPI Office Data server DB Oracle server FTP server Web server Firewall intranet Data collector PSTN or UMTS Data collector PSTN or UMTS Data collector

2. indicators to monitor data collection process 8 Different steps of data check and data quality indicators 1.Data collection software 2.UMTS data transmission for each outlet or data collection tour: first check and first data quality set of indicators on the web server (possible real time data in the outlet)

2. indicators to monitor data collection process 8 3.Second check on the total amount of monthly elementary data and second data quality set of indicators (MOS) 4.Final check (the third one) on total amount of elementary data coming from all the chief towns (Istat) and third set of data quality indicators 5.Quarterly check concerning sampling Different steps of data check and data quality indicators

2. indicators to monitor data collection process 8 A completely integrated data production process where each event that will be stressed by the system of indicators will produce consequences in order to remove mistakes or their possible causes Different steps of data check and data quality indicators

Thank you for your attention Federico Polidoro (Istat - Italy, Rosabel Ricci (Istat - Italy, Anna Maria Sgamba (Istat - Italy,