WORKSHOP ON SCANNER DATA Geneva 10 May 2010 Joint presentation by Ragnhild Nygaard (Statistics Norway) and Heymerik van der Grient (Statistics Netherlands)

Slides:



Advertisements
Similar presentations
Possible integration of the PPP sample into the Czech CPI basket: advantages and limitations Jiří Mrázek Czech Statistical Office Eurostat Conference Reviewing.
Advertisements

The Bulgarian CPI And The Index Of A Small Basket Of Goods And Services Joint UNECE/ILO Meeting on Consumer Price Indices Geneva, May 2006 National.
Methodological approaches to recording certain types of services in the Consumer Price Index in Belarus Ekaterina Grikhanova, National Statistical Committee.
1 1 Treatment of clothing and footwear – new Regulation 1. Definitions 2. Minimum standards 3. Impact on Norwegian HICP.
Scanner Data – next steps ahead Berthold Feldmann – Eurostat Price statistics; purchasing power parities; housing statistics.
United Nations Statistics Division/DESA
Some Implementation Issues of Scanner Data Muhanad Sammar, Anders Norberg & Can Tongur.
A Matched-Model Geometric Mean Price Index for Supermarket Scanner Data Nicolette de Bruijn Peter Hein van Mulligen Jan de Haan Sixth EMG Workshop, 13.
Chapter 10 Index analysis. The concept and classification of index A statistical indicator providing a representation of the value of the securities which.
Sébastien FAIVRE INSEE Workshop on scanner data, Stockholm, /06/2012 Would scanner data improve the French CPI?
1 Price index of pharmaceutical products, other medical products and appliances in the HICP/CPI Pia Skare Rønnevik.
Scanner Data – Discussion on the Treatment of Discounts in the CPI Muhanad Sammar, Anders Norberg & Can Tongur Statistics Sweden.
The Re-design of the Dutch CPI Jan de Haan Statistics Netherlands Geneva, May 2006.
Miroslav Jankovic (Statistical Office of Serbia and Montenegro) Dragi Stojiljkovic Tatjana Stanojevic-Miladinovic (Statistical Office of the Republic of.
UST. What is the Consumer Price Index (CPI)? the average change over time in the prices of: consumer items. goods services that people buy for day – to.
IAOS Conference – “Meeting the demands of a changing world” DaNang 2014 Theme 2: Exploiting new technologies & new data sources Topic: Price and economic.
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Scanner data in the Swiss CPI An alternative to price collection in the field Reto.
Operations and Supply Chain Management
United Nations Economic Commission for Europe Statistical Division Calculation of the consumer price index - Recommendations of the CPI Manual Joint EFTA/UNECE/SSCU.
Section 3B- Modules 14/15- Inflation and the Business Cycle.
Methodology for Core Inflation Calculation Olga Kalabukha Director of the Price Statistics Department of the SSSU.
Regional Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics Dr. Cem BAŞ September 2013 Ankara - Turkey.
Consumer Behaviour in UK Price Indices Joe Winton, Robert O’Neill & Duncan Elliott Office For National Statistics 1.
Nobel Prize - Economics Three Amigos Eugene Fama - U. Chicago Lars Peter Hansen - U. Chicago Robert Shiller - Yale Financial Economics American.
Workshop on Price Statistics Compilation Issues February 23-27, 2015 Compilation of Elementary Indices Gefinor Rotana Hotel, Beirut, Lebanon.
CHAPTER 17: Index Numbers to accompany Introduction to Business Statistics third edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
Composite Price Index  Unweighted Aggregative method  Unweighted Average of Relatives method  Weighted Aggregative Method  Paasche Index  Laspeyres.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning 
Price-updating of weights in the CPI Why do we price-update expenditure weights? - Conceptual issues - Practical consequences What does the CPI Manual.
Interstate Statistical Committee of the Commonwealth of Independent States (CIS STAT) Methodological approaches to measuring core inflation in the CIS.
Workshop on Price Index Compilation Issues February 23-27, 2015 Market Basket Items and Weights Gefinor Rotana Hotel, Beirut, Lebanon.
ECE/ILO Meeting on Consumer Price Indices, May 10-12, 2006, Geneva1 E-COMMERCE IN THE ISRAELI CPI Yoel Finkel Merav Yiftach Central Bureau of Statistics,
United Nations Economic Commission for Europe Statistical Division UNECE Workshop on Consumer Price Indices Istanbul, Turkey,10-13 October 2011 Session.
United Nations Economic Commission for Europe Statistical Division Producer Price Indices - Recommendations of international manuals Joint EFTA/UNECE/SSCU.
UNECE Workshop on Consumer Price Indices Session 4: Practical Experiences with Calculating Elementary Indices and Treatment of Missing Prices Presentation.
Workshop on Price Index Compilation Issues February 23-27, 2015 Imputation of Missing Values, Seasonal Products and Quality Changes Gefinor Rotana Hotel,
A Key Component for ICP 2004 A well-established product list  Compare „ like with like “  Collect prices for relevant products  Establish reliable PPPs.
Workshop on Price Index Compilation Issues February 23-27, 2015 Data Collection Issues Gefinor Rotana Hotel, Beirut, Lebanon.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Index Numbers Chapter 15.
Meeting of the Group of Experts on Consumer Price Indices. Geneva May 2014, UNECE Workshop 7: Seasonal products. Palais des Nations, Geneva - Meeting.
Sampling design issues in Italian experience on scanner data and the possible integration with microdata coming from traditional data collection Claudia.
1 A framework for international comparisons of volume and prices in health care Interim Report Manfred Huber 7 th Meeting of HA Experts and Correspondents.
1 The use of scanner data on non-food products Statistics Norway.
Scanner data in the Luxembourg HICP/CPI Moving towards implementation Claude Lamboray Vanda Guerreiro Scanner Data Workshop ISTAT 1-2 October 2015.
Revising the CPI for the Eastern Caribbean Currency Union ECE/ILO CPI Meeting Geneva, Switzerland May , 2010.
The Swedish experience with scanner data From sampling to index calculation Paulina Jonéus.
The Use of Scanner Data for Price Statistics in the Netherlands Huib van de Stadt Head of Unit Prices and Business Cycle Statistics Netherlands 15 October.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 1 Introduction Business Statistics and Registers 1.
Index numbers Value-, price-, quantity indices. Measuring changes in time Indices: relative change in price, quantity and value of products or services.
 Definition  Unweighted and Weighted Index Numbers ( Simple Index Numbers, Laspeyre’s, Paasche’s Index, Fisher’s “Ideal” Index)  CPI ( Consumer Price.
Investment Analysis Lecture 7 Industry Analysis.
Index Numbers Chapter 15 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Outline  Introduction  Methodology for compiling CPI  Data Collection and Compilation.
CPI Measurement Problems The Case of Malawi By Charles Machinjili National Statistical Office Malawi.
Isabelle LEONARD Project manager for statistical issues Insee 2015/10/01 Real sale prices vs. displayed prices : an issue to consider before expanding.
5/25/2016Indices and Price Analyses Unit 1 The Household Expenditure Survey (HES) Presented by: The Indices and Price Analyses Unit Statistical.
Exploiting new technologies and new data sources – the opportunities and challenges associated with scanner data: building on the experiences so far David.
CPI/HICP price collection Validation of prices Twinning project: Development of new statistical methodologies and indicators in selected areas of statistics.
Copyright 2010, The World Bank Group. All Rights Reserved. Producer prices, part 2 Measurement issues Business Statistics and Registers 1.
INDEX NUMBERS. Index Number 2  An index number is a statistical measure designed to show changes in variables or a group of related variables with respect.
Index of Industrial Production: Methods of Compilation
ECON 201 Indices Week
Value-, price-, quantity indices
Session 7: Adjustment for quality changes
Some properties of the RGEKS index for scanner data
Price Indices for External Trade of Goods Eleonora Baghy 26/05/2013
Scanner Data – Discussion on the Treatment of Discounts in the CPI
Some Implementation Issues of Scanner Data
Antonio Chessa Statistics Netherlands, Consumer Prices
”Evaluating unit-value price indices in a dynamic item universe” 16th Meeting of the Ottawa Group, 8-10 May 2019 Rio de Janeiro, Brazil Li-Chun Zhang.
Presentation transcript:

WORKSHOP ON SCANNER DATA Geneva 10 May 2010 Joint presentation by Ragnhild Nygaard (Statistics Norway) and Heymerik van der Grient (Statistics Netherlands)

Historical overview – NL Supermarkets  Mid 90s: first contacts with chain(s)  2002: first implementation: 1/2 chain(s) Yearly Laspeyres (labour intensive)  Construction of yearly basket of items  Manual linking of items to COICOP-groups  Manual replacement of disappearing items Reduction of ca monthly price quotes in field survey

Historical overview – NL, cont Supermarkets  2010: extension: 6 chains Monthly chained Jevons (efficient process)  No manual linking of items  No explicit replacements Extra reduction of ca monthly price quotes in field survey

Historical overview – N  1997: first contact with one chain Gradually contact with more chains Implementation in the CPI  only price information of specific representative items  2002: scanner data from all the chains (no questionnaires - big incentive)  Aug 2005: expanded use for COICOP 01 price and quantity information for all items in representative outlets

Questions to be answered when dealing with scanner data  How/Where require scanner data?  Which statistical method?  How to link items to COICOP?  How to deal with all kind of particularities in data?  Development of new computer system?

Source of scanner data  Market research companies Cleaned data (very) expensive Two-stage delivery chain (timeliness)  Companies/Chains Raw data Cheap (NL/N do not pay) Direct contact with original supplier

Negotiations with companies  Time consuming process Negotiations can take up to a year or more including meetings, sending test data, analysing data etc. Be aware of some company establishing costs e.g. preparing the data extractions  Can company provide what you want/need? E.g. information to link items to COICOP automatically

Negotiations with companies, cont.  Focus on advantages for companies Minor costs once established (just a copy of their sales administration) No questionnaires or monthly visits of price collectors  Other incentives for companies? Money – not likely Information  E.g. company price development compared to overall price development

Negotiations with companies, cont.  Establishing good routines with the companies are essential Strict time schedules No changes in formats when implemented

Pre - production work  Take your time analyzing the data Enormous amount of data  N: Over price observations each month divided into about items Build shadow system (prototype)  Compare the new price indexes based on scanner data with the old method for a certain period of time before implementation  Discover possible problems in advance Unexpected situations will arise for sure

Pre - production work  Ideas for analysing the data: Is same EAN always same item? Extreme price changes Specific price development at beginning or end life cycle EAN structurally  Risk of bias! All kind of dynamics in data Missing prices Do properties of data change over time Etc

Methodology / IT-system  Find methodology that: Delivers good indexes (e.g. no bias) Can deal with all particularities in data  Build IT-system that supports the chosen methodology  Learn from experiences other countries using scanner data

Properties of data C onsequences for methodology NL and N  High attrition rate of items

Properties of data, cont. C onsequences for methodology NL and N  How to deal with high attrition rate of items NL: monthly chained index N: monthly chained index

Properties of data, cont. C onsequences for methodology NL and N Sales: low prices combined with enormous increase in quantities sold

Properties of data, cont. C onsequences for methodology NL and N  Consequences of sales: Single observations can have extremely high influence on elementary index Risk of bias applying monthly chaining and explicit weights

Properties of data, cont. C onsequences for methodology NL and N  Bias not just theoretically! Example for detergents FormulaWeekly index I(200835; =100) Monthly index I(200808; =100) Laspeyres Paasche Fisher Törnqvist Jevons Walsh

Properties of data, cont. C onsequences for methodology NL and N  How to deal with sales? NLcrude weighting on item level: w=0 or 1 NManual checks of price ratios that contribute most to elementary results: “critical observations”

Properties of data, cont. C onsequences for methodology NL and N  Implausible price changes NLprice changes (p t /p t-1 ) of more than a factor 4 are deleted Changes of +5000% and -99% do actually occur Nprice changes (p t /p t-1 ) of more than a factor 3 are deleted

Properties of data, cont. C onsequences for methodology NL and N  Temporarily missing prices

Properties of data, cont. C onsequences for methodology NL and N  How to deal with temporarily missing prices: NL:imputation of missing prices N :no adjustments, but imputing prices is considered for the near future

Properties of data, cont. C onsequences for methodology NL and N  Quality differences Items with same EAN are considered to be identical Items with different EAN are treated as different items (no matching)  How to deal with quality differences: NLOnly adjustment in exceptional cases: manual interference NNo adjustment

Actual method - NL  Data received: For each item each week:  EAN  Short description  (Chain specific) product group Used to link items to COICOP automatically  Expenditures  Quantities sold

Actual method – NL, cont.  Price of item: Unit value based on first three weeks of month  Unweighted price index elementary level: Monthly chained Jevons on selection of items  Weighted price index higher aggregates: Yearly chained Laspeyres Weights based on scanner data of all 52 weeks of previous year

Actual method – NL, cont.  Item selection at elementary level Items with low expenditures: w=0 Other items: w=1  Threshold of low (average) expenditure share:  Example: threshold =1% for χ=2 and N=50

Actual method – NL, cont.  Determination of threshold value  Simulations lead to: Optimal value: χ=1.25  Ca 50% of items is excluded (on average)  Elementary index based on 80 à 85% of total expenditures  Elementary level (chain dependent) comparable with COICOP6

Actual method – NL, cont.  Refinements: Extreme price changes are excluded (factor 4) Missing prices are imputed Dump prices at end lifecycle item are excluded (see paper)

Actual method – NL. What advantages were achieved?  Indexes are of higher quality Compared with old method scanner data Compared with field survey  Response burden for companies is lower No price collection in the shops  Efficiency gains? Yes: more or less automatic production process Investment costs (IT-system) were (very) high

Illustrations  Price indexes based on five supermarkets

Illustrations  Price indexes based on five supermarkets

Actual method - N  Data received: For each item in the midweek of the month:  EAN/PLU  Short description  (Chain specific) product group  Calculated average price  Quantity sold  Expenditure

Actual method – N, cont.  Sample of representative outlets Stratified by chain and concept  Matching EAN/PLU with COICOP6  Weighted Jevons price index on elementary level with expenditures shares of current and base period; Monthly chained Törnqvist index  Scanner data weights between the COICOP6 groups

 Higher aggregates: Yearly chained Laspeyres Weights from HES (NR as of 2011)  Exclude strongly seasonal items only available for a certain period of the year  Manual control and possibly exclusion of extreme contributions to elementary results Actual method – N, cont.

Actual method – N What advantages were achieved?  Indexes of higher quality? New methodology led to reduction of e.g sampling and measurement errors, but also to new biases Much more data – more detailed price indexes Considering both prices and quantities Many indexes have improved, others have not  Low response burden for companies No questionnaires  Efficiency gains? Automatic production process which requires some manual interference  Resources demanded not much higher than before High investment costs (IT-system)

New methodology  Newly developed index ( Ivancic, Diewert, Fox ) Rolling year GEKS price index  Source: GEKS-algorithm of purchasing power parities (International Comparison Programme) GEKS index transitive by construction  chained index equals direct index  no chain drift A geometric mean of direct superlative price indexes

New methodology, cont. bilateral indexes (Törnqvist or Fisher) between entities j and l (l=1..M) and between entities k and l, respectively Purchasing power parities: entity is country Scanner data : entity is month

New methodology, cont.  Expanding time period leads to revising all previous GEKS indexes  Solution: rolling version (chaining) etc

RYGEKS and NL  RYGEKS specifically developed for Statistics Netherlands as remedy for not-weighting at elementary level Not (yet) applied in practice Used as benchmark  Finding optimal value threshold Current method (NL) resembles RYGEKS quite well (on average)  No bias found

RYGEKS and NL: Illustrations

RYGEKS and NL, cont.  Plans for near future: Shadow system based on RYGEKS indexes Continuous benchmark for current method Implementation when RYGEKS is widely accepted?  More (international) analysis needed

RYGEKS and N  RYGEKS indexes tested on Norwegian scanner data on different levels; EAN, elementary and aggregated COICOP levels  For COICOP 01 compared a monthly chained Törnqvist index with a monthly chained RYGEKS index  The results indicate some bias in the Törnqvist index

RYGEKS and N, cont.  Small deviations for many COICOP aggregates Milk, Cheese and eggs, Oils and fats, Vegetables, Fish

RYGEKS and N, cont.  While others show more deviations Meat, Sugar, jam and chocolate

RYGEKS and N, cont.

 Causing bias; Missing prices Seasonal items (not excluded) Price and quantity oscillating over time  Shadow system for calculating RYGEKS indexes on monthly basis established Too early to be implemented

Scanner data in other branches?  NL: Expanding to other branches desirable Data available (e.g. durables) Problem of quality changes Analysis needed  N: Continuously working to expand scanner data  Increasing pressure from chains and outlets Data available for pharmaceutical products, wine and spirits (state monopoly) and petrol  Mostly price information implemented Have tried to cover clothing, but matched item model unsuccessful