Eurostat WebDataNet Conference 2015 Salamanca, 26th – 28th May 2015 Fernando Reis, Big Data Task-Force European Commission (Eurostat) Web activity evidence.

Presentation transcript:

Eurostat WebDataNet Conference 2015, Salamanca, 26th – 28th May 2015
Fernando Reis, Big Data Task-Force, European Commission (Eurostat)
Web activity evidence to increase timeliness of official statistics

Official statistics
Census-taking relief (“Altar of Domitius Ahenobarbus”), Rome, Italy, ca. 100 B.C.E.

What is the role of official statistics today?
'... To provide an indispensable element in the information system of a democratic society, serving the government, the economy and the public with data about the economic, demographic, social and environmental situation ...' [Fundamental Principles of Official Statistics; principle 1 on Relevance, impartiality and equal access]

My definition of big data
- Data deluge
  - Larger, faster, more (a.k.a. Volume, Velocity, Variety)
- Everything is data
  - Text, sound, images, video
- Analytics
  - Predictive analytics
  - Ex: Google Translate, voice recognition, suggestion systems, health applications
  - The new data product par excellence
  - Official statistics example: chances of getting a new job
- An emergent market

Past experiences
- 2005: Association between web activity and unemployment identified
- 2006: Google Trends
- 2008: Google Flu Trends (GFT)
- 2009: GFT underestimated official figures; 1st revision of GFT model
- 2013: GFT overestimated flu peak values; 2nd revision of GFT model
- 2014: Backlash against big data

Eurostat Data Source: Google Trends (

Weekly influenza-like illness (ILI) surveillance and Google Flu Trends (GFT) search query estimates, June 2003–March 2013
Source: Olson DR, Konty KJ, Paladini M, Viboud C, et al. (2013) Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales. PLoS Comput Biol 9(10). License: Creative Commons CC0 public domain dedication.

Eurostat Source: Financial Times Magazine (2014).

Lessons from GFT
- Premature release of a statistical product can harm its reputation
- Avoid big data hubris
- Frequent changes to Google's search algorithms affect the validity of models
- We need transparency and replicability
  - The GFT search terms are unknown
  - Google Trends (GT) is based on a sample whose sampling methodology is unknown

Other sources of web activity
- Wikipedia page views
  - Flu
- Twitter
  - International and internal migration flows
  - Possibly other
- Visits to particular websites

How to introduce web activity data in official flash estimates?
Launch a larger-scale balanced study
- Negative results are normally not published
- Purpose: guide decisions on investment

How to introduce web activity data in official flash estimates?
Diversification and assessment of the web activity data sources
- National statistical institutes (NSIs) lack control of the source
  - Black box
  - Inability to guarantee that there was no manipulation
- Breaks in series
  - Lack of continuity
- Diversify the sources
- Revision of prediction models
- Accreditation and certification

How to introduce web activity data in official flash estimates?
Integration of web activity data with traditional official statistics sources
- Official statistics should not simply reproduce what others can do, but should instead make use of their specific comparative advantages
- We are the original producers of the statistics and know their details
- Use more detail than what is published
- Traditional methods (surveys)
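
As an illustration of what integrating the two kinds of source could look like, here is a minimal sketch, not the method used by Eurostat. It assumes a simple linear model in which the flash estimate for period t combines the previous official figure with the current web activity index, fitted by ordinary least squares. The function names (`fit_nowcast`, `nowcast`), the model form and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def fit_nowcast(official, web_index):
    """Fit y_t = a + b * y_{t-1} + c * w_t by ordinary least squares.

    official:  past official figures y_0..y_{n-1} (hypothetical series)
    web_index: web activity index w_0..w_{n-1} for the same periods
    """
    y = np.asarray(official, dtype=float)
    w = np.asarray(web_index, dtype=float)
    # Regressors: intercept, previous official value, current web index.
    X = np.column_stack([np.ones(len(y) - 1), y[:-1], w[1:]])
    coef, *_ = np.linalg.lstsq(X, y[1:], rcond=None)
    return coef

def nowcast(coef, last_official, current_web):
    """Flash estimate for the current period from the fitted coefficients."""
    a, b, c = coef
    return a + b * last_official + c * current_web

# Synthetic, noise-free data for illustration only (not real statistics).
rng = np.random.default_rng(0)
n = 60
web = rng.normal(size=n)
official = np.empty(n)
official[0] = 10.0
for t in range(1, n):
    official[t] = 2.0 + 0.8 * official[t - 1] + 0.5 * web[t]

coef = fit_nowcast(official, web)
estimate = nowcast(coef, official[-1], current_web=0.3)
```

The lagged official figure stands in for the producer's own detailed knowledge of the series. In practice an NSI could add unpublished detail or survey-based regressors to the same design matrix.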

How to introduce web activity data in official flash estimates?
Research on the relation between web activity and the phenomena being predicted
- Remember the lesson from GFT
- Do not confuse web activity with the phenomenon itself
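
One simple starting point for such research is to check how strongly, and at which lag, a web activity series moves with the target series. The sketch below is an assumption-laden illustration rather than anything from the talk. The function name `lagged_correlations` and the synthetic data are hypothetical.

```python
import numpy as np

def lagged_correlations(web, target, max_lag):
    """Pearson correlation between web[t - lag] and target[t], lag = 0..max_lag."""
    web = np.asarray(web, dtype=float)
    target = np.asarray(target, dtype=float)
    result = {}
    for lag in range(max_lag + 1):
        w = web[: len(web) - lag]   # web activity, shifted back by `lag` periods
        y = target[lag:]            # target aligned with the shifted web series
        result[lag] = float(np.corrcoef(w, y)[0, 1])
    return result

# Synthetic example: the target follows web activity with a 2-period delay.
rng = np.random.default_rng(1)
n = 200
web = rng.normal(size=n)
noise = 0.1 * rng.normal(size=n)
target = noise.copy()
target[2:] += web[:-2]

corrs = lagged_correlations(web, target, max_lag=4)
best_lag = max(corrs, key=corrs.get)
```

A high correlation at some lag only shows that the two series moved together in the sample. It does not show that web activity measures the phenomenon, which is exactly the confusion the slide warns against.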

How to introduce web activity data in official flash estimates?
Joint effort on the development of appropriate prediction models
- Learn from each other
- Transparency
- International comparability

Thank you for your attention
Fernando Reis, Eurostat Task Force on Big Data