Processing bulk data from the Internet Calculating clothing price indices for the CPI Karlijn Bakker
Internet data collection Practices Results Contents Goals of the project Internet data collection Practices Results Processing bulk data from the Internet
Develop a price index for clothing based on internet prices Goals of the project Efficiency Develop a price index for clothing based on internet prices Compare internet data with scannerdata Processing bulk data from the Internet
Internet data collection Daily price collection for 2 online clothing retailers The robots collect all items it can find on (a part of) the site The robots collect prices and characteristics of the goods Processing bulk data from the Internet
Computerized processing required Average price approach Processing bulk data Computerized processing required Average price approach Dividing items into groups Processing bulk data from the Internet
Aggregation Processing bulk data from the Internet
Use hidden information Use the location on the website Making groups Use the article name Use hidden information Use the location on the website This depends on the site. Processing bulk data from the Internet
Example Processing bulk data from the Internet
Results look promising Risk: sites can change Research continues Conclusion Results look promising Risk: sites can change Research continues Processing bulk data from the Internet