Big Data in Official Statistics: Generalities

Slides:



Advertisements
Similar presentations
Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.
Advertisements

Will ‘big data’ transform official statistics?
Barteld Braaksma and Kees Zeelenberg “Re-make / Re-model”: Should big data change the modelling paradigm in official statistics?
Frank Yu Australian Bureau of Statistics Unstructured Data 1.
EGM – Population & Housing Censuses Eurostat / UNECE - Geneva - 24/25 May 2012 Beyond 2011 The future of population statistics (England & Wales) Alistair.
United Nations Economic Commission for Europe Statistical Division NTTS 2015 – Satellite Workshop on Big Data March 9, 2015 The Big Data Project – The.
Big Data at Eurostat and the ESS
ONS Big Data Project. Plan for today Introduce the ONS Big Data Project Provide a overview of our work to date Provide information about our future plans.
United Nations Economic Commission for Europe Statistical Division Big Data International Cooperation Steven Vale UNECE
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
The Future of Statistical Data Collection? Challenges and Opportunities Johan Erikson (Statistics Sweden) Gustav Haraldsen (Statistics Norway) Ger Snijkers.
Changing the perspective: users’involvement in prioritising needs in the Italian National Statistical Programme Monica Attias, ISTAT Q2014 Vienna, June.
Globalisation processes in the field of statistics Discussion DGINS, Budapest, 2007 Irena Križman Director-General of the Statistical Office of the Republic.
1 Ma Jiantang Commissioner National Bureau of Statistics of China September 3, 2012 Ma Jiantang Commissioner National Bureau of Statistics of China September.
Big Data Activities at Eurostat Workshop on Statistical Data Collection, 29 Apr – 1 May 2015, Washington D.C, USA
Looking Backward, Guessing Forward Robert M. Groves US Census Bureau October 28, 2011.
African Centre for Statistics United Nations Economic Commission for Africa Addressing Data Discrepancies in MDG Monitoring: The Role of UN Regional Commissions.
Research and Analytical Priorities post-Hong Kong: Trade in Services By: Abdoulaye Ndiaye ResearcherILEAP-JEICP Hong Kong Trade and.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
1 Innovation in Services Business Service Design and Innovation Fostering the Economic and Legal Framework for Innovation Performance and Development of.
High-Level Forum on Strategic Planning for Statistics in Central Asia Countries Bishkek, Kyrgyz Republic, May 2006 Oleg Kara, Deputy Director General,
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
Acceptance of Social Media Marketing in the Sanitary Market Marcus Diedrich 6. November 2015.
Big Data for Measuring the Information Society INTERNATIONAL TELECOMMUNICATION UNION BIG DATA PROJECT - INNOVATIVE WAYS TO UTILIZE BIG DATA AS A NEW DATA.
Big Data and Official Statistics: Philippine Context Erniel B. Barrios.
Data Science in Official Statistics: The Big Data Team
The Development of Statistical Business Registers in
Short Training Course on Agricultural Cost of Production Statistics
UNECE Data Integration Project
Development of Strategies for Census Data Dissemination
Martine Durand Chief Statistician, OECD
Cost analysis of key statistical products
IPEC.
UNECE Work Session on Gender Statistics, Belgrade,
"Development of Strategies for Census Data Dissemination".
United Nations Development Account 10th Tranche Statistics and Data
Big Data ESSNet: Web Scraping for Job Vacancy Statistics Nigel Swier UK Office for National Statistics.
The importance of administrative data in the era of SDGs
Generic Statistical Business Process Model (GSBPM)
Conference on ICT For Sustainable Development
New ways to get the data Multiple mode and big data
Modernisation of European Enterprise and Trade Statistics - MEETS OECD-Working Party on International Trade in Goods and Trade in Services Statistics.
International experience in economic statistics programmes
Organization of efficient Economic Surveys
Scanning the environment: The global perspective on the integration of non-traditional data sources, administrative data and geospatial information Sub-regional.
Sub-regional workshop on integration of administrative data, big data
Applying the Generic Statistical Business Process Model to Business Register Maintenance Steven Vale UNECE
Director General of the National Accounts
Information Society Statistics
ESS Vision 2020.
Item 3 of the draft agenda ESS.VIP ADMIN: progress report
High level seminar on the implementation of the
The Golden Age of Survey Research
Draft Methodology for impact analysis of ESS.VIP Projects
Meeting the emerging needs for statistics on migrants:
MICRO: Enhancing Competitiveness of Micro-enterprises in Rural Areas
International Statistics
Heinrich Brüngger, Director
Addressing the challenge of water scarcity and droughts
3.4 Modernisation of Social Statistics
ISIbalo Young African Statisticians Association-Uganda Chapter
The thorough revision of the Urban Audit project
Streamlining statistical production
Access to Big Data for Statistical Purposes
Future Work Steven Vale UNECE
ESS Vision 2020.
Use of administrative data for statistical purposes
Privacy and personal data protection
Case Study: HLG Big Data Sandbox
Presentation transcript:

Big Data in Official Statistics: Generalities Antonino Virgillito

Big Data in Official Statistics: the International Setting - 1 Strategic vision of HLG, June 2011: «We are in a changeover from a society with little or no data available to one that has an abundance of data… … Another important point is that nowadays it is much easier to get data that cover more than the traditional national statistics users would need. We do not, however, have the mechanisms in place to make full use of these data»

Big Data in Official Statistics: the International Setting - 2 HLG Working paper 2013/6, January 2013: « Apart from generating new commercial opportunities in the private sector, Big data is also potentially very interesting as an input for official statistics; either for use on its own, or in combination with more traditional data sources such as sample surveys and administrative registers»

Big Data in Official Statistics: the International Setting - 3 Scheveningen Memorandum, September 2013 «Acknowledge that Big Data represent new opportunities and challenges for Official Statistics, and therefore encourage the European Statistical System and its partners to effectively examine the potential of Big Data sources in that regard.

Big Data for OS: The Concept Big Data can be an input for official statistics: either for use on its own or in combination with more traditional data sources such as sample surveys and administrative registers

Big Data Sources UNECE Classification Social Networks (human-sourced information) Traditional Business systems (process-mediated data) Internet of Things (machine-generated data)

Social Networks (human-sourced information) Interactions with news media and social media, job posting Humans interacting with devices (also mobile) produce data

Social Networks (human-sourced information) Interactions with news media and social media, job posting Humans interacting with devices (also mobile) produce data Example: Blog posts Twitter messages

Social Networks (human-sourced information) Interactions with news media and social media, job posting Humans interacting with devices (also mobile) produce data Example: Blog posts Twitter messages User-generated maps

Traditional Business systems (process-mediated data) Data collected by traditional systems in a passive mode Example: Web search logs

Traditional Business systems (process-mediated data) Data collected by traditional systems in a passive mode Example: Web search logs Medical records

Traditional Business systems (process-mediated data) Data collected by traditional systems in a passive mode Example: Web search logs Medical records Commercial transactions Banking/stock records

Internet of Things (machine-generated data) Sensors and machines used to measure and record the events and situations in the physical world.  Example: Traffic sensors

Internet of Things (machine-generated data) Sensors and machines used to measure and record the events and situations in the physical world.  Example: Traffic sensors Enviornmental Sensors

How can Big Data be included in official statistics? As the only source (replacement/new statistics) Traffic intensity statistics (NL) and ‘Billion Prices’ project (MIT) As the main source with survey/admin. data as benchmark Google trends like approaches, (regular) benchmarking needed As an additional source for a survey/admin. data based statistics for example to enable small area estimation As ‘supplier’ of missing data for example use data on level of education from the internet to fill gaps in education register But also for nowcasting and to increase timeliness! Don’t use it

Why do Big Data look so appealing to NSIs? Competitive pressure Private sector may take advantage of Big Data and produce more and more statistics that attempt to beat official statistics on timeliness and relevance The “Official Statistics” trademark could slowly lose reputation and relevance unless NSIs get on board Funding constraints Economic crisis (2009-20??) urges organizations to look for ways to increase efficiency and cut costs Being traditional data collection so cost-intensive, interest in alternative data sources and Big Data is growing

Why do Big Data look so appealing to NSIs? Improving quality of traditional statistics Providing new auxiliary information that NSIs could exploit to - Build and maintain better sampling frames - Design better samples - Build better Calibration estimators - Soften nonresponse bias further Reducing respondents’ burden Potential for discovering new knowledge New well-being indicators Agriculture and environment statistics New measures of consumers’ confidence Consumer behavior beyond HBS

Issues and Challenges Legislative, regulating access to data Privacy Possible diffrrent legislation country by country Privacy Possible privacy-by-design strategies Financial Private providers for Big data Management Including Training

Issues and Challenges – Statistical methodology Representativeness Difficult to define target population, survey population and survey frame Linking methods of Big Data with statistical units (individuals, families, enterprises,…) Estimation procedures Quality of the results

Issues and Challenges – Collecting Big Data Big Data originated from the need to manage data that grew inside organizations as a consequence of their business No collection involved In statistical offices we do not have such a situation because our “input” data is always generated from the collection phase Big Data too have to be gathered from external sources Most common sources of Big Data for statistical purposes datasets from external providers data extracted from Internet

International Initiatives UN Global Working Group UNECE-HLG Big Data Sandbox ESSNet project on Big Data