WP7 – COMBINING BIG DATA - STATISTICAL DOMAINS

Slides:



Advertisements
Similar presentations
Big Data Quality, Partnerships and Privacy Teams.
Advertisements

Quality Management (WP5) Roman CHIRCA Agency for Innovation and Technological Transfer TecTNet ………... This project has been funded with support from the.
1 Women Entrepreneurs in Rural Tourism Evaluation Indicators Bristol, November 2010 RG EVANS ASSOCIATES November 2010.
Introduction 1. Purpose of the Chapter 2. Institutional arrangements Country Practices 3. Legal framework Country Practices 4. Preliminary conclusions.
United Nations Economic Commission for Europe UNECE Transport Division 1 TRANS-EUROPEAN RAILWAY (TER) PROJECT 2 nd Expert Group Meeting (Budapest, 23 September.
WP 9: 1 st Planning meeting summary Clarification between WP members of common objectives: Workshop planning and logistics with time- line Planning for.
Work packages SGA II ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service.
Work Package 6 L2C Kick-off meeting Fontainebleau, March 7th 2006.
ESS-VIP ICT Project Work Package III Task Force Meeting, Luxembourg, 5 March 2013.
ESSnet(s) Big Data I + II Item 8 of the agenda Joint DIME-ITDG Plenary Luxembourg, 24 Feb 2015.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service Centre.
The FDES revision process: progress so far, state of the art, the way forward United Nations Statistics Division.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
EVALUATION OF THE SEE SARMa Project. Content Project management structure Internal evaluation External evaluation Evaluation report.
Slide 1DSS Board 1-2 December 2014 Eurostat ESS.VIP ADMIN project on use of administrative data sources Agenda point 7 DSS Board 1-2 December 2014.
Thomas Gutberlet HZB User Coordination NMI3-II Neutron scattering and Muon spectroscopy Integrated Initiative WP5 Integrated User Access.
Statistical Business Register Enterprise Groups in Latvia Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical.
Research Design
University of Macedonia © University of Macedonia Co-financed by the European Regional Development Fund (ERDF) (75%) and the Greek National Funds (25%)
Stages of Research and Development
KOMUSO - ESSnet on quality of multisource statistics
Introduction to Marketing Research
UEmploy Consultancy for Employment Inclusion
Part Two.
Updating the Regulation for the JINR Programme Advisory Committees
Capital Project / Infrastructure Renewal – Making the Business Case
General Meeting cern, 10-12/10/2017 CREATIONS Demonstrators
ESSNet Pilot: Web Scraping for Job Vacancy Statistics
Investigating population movements
Unit 6 Research Project in HSC Unit 6 Research Project in Health and Social Care Aim This unit aims to develop learners’ skills of independent enquiry.
WP4 Measurements & social indicators.
Istituto Nazionale di Statistica – Istat
Task force on statistical units: survey of current practices
WP7 MULTI DOMAINS.
Steering Group Admin Project, 12 May 2016
Implementing the ESS Vision 2020
DIME Plenary Thomas Burg –Statistics Austria
Software life cycle models
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
Dissemination Workshop ESSnet Big Data Sofia, February 2017
ESSNet Pilot: Web Scraping for Job Vacancy Statistics
Progress of the ESS.VIP ADMIN Special focus on the ESSnet on quality of multiple sources statistics. DIME/ITDG SG, Fabrice Gras, unit B1.
ESSnet on Quality of multisource statistics
Module 8- Stages in the Evaluation Process
ESS.VIP ADMIN Sorina Vâju.
ESS.VIP ADMIN Sorina Vâju.
Use of Web scraping for Enterprises Characteristics
Item 3 of the draft agenda ESS.VIP ADMIN: progress report
Results of the XBRL Pilot Project
Albania 2021 Population and Housing Census - Plans
CVTS 2015 – Draft Commission Regulation amending Regulation (EC) No 198/2006 Agenda item 2.2 DSS Meeting 3-4 April 2014.
Draft Methodology for impact analysis of ESS.VIP Projects
Big Data ESSNet WP 1: Web scraping / Job Vacancies Pilot
Culture Statistics: what next?
WP 6 Combining big data: early estimates
Item 7 - Roadmap and mandate for the Task Force on UOE Education Expenditure Data Eurostat Education and Training Statistics Working Group - Luxembourg,
MEETING OF THE WORKING GROUP ON CULTURE STATISTICS 23-24/11/2016
Activity on WFD and agriculture
Wiesbaden, 24 October, 2007 Svetlana Shutova Statistics Estonia
Workshop on Health Statistics Context, Objectives & Organisation
ESS.VIP ADMIN EssNet on Quality in Multi-source Statistics, progress report 19TH WORKING GROUP ON QUALITY IN STATISTICS, 6 December 2016 Fabrice Gras,
Task Force 3, Cultural Industries Kutt Kommel
Rail transport developments Agenda point 7.2
Morbidity statistics Item 10 of the agenda
European Statistical System Network on Culture (ESSnet Culture)
Agenda item 4.2 Task Force on migrants’ health
European Statistical System Network on Culture (ESSnet Culture)
ESS.VIP ADMIN – Status report Item 4.1 of the draft agenda
Meeting Of The European Directors of Social Statistics
- Kick-off meeting - ERANET Cofund BlueBio WP4 (Leader: AEI)
Presentation transcript:

WP7 – COMBINING BIG DATA - STATISTICAL DOMAINS The meeting to prepare SGA-1 by ESSnet BIG DATA 7-8 January 2016

Agenda Introduction TASK 1. Data availability/Data inventory TASK 2. Data feasibility TASK 3. Data combination TASK 4. Summary plus future perspectives

‘Population’, ‘Tourism/border crossings’ and ‘Agriculture’. Introduction Aim of this workpackage is to find out how a combination of: big data sources, administrative data statistical data may enrich statistical output in domains: ‘Population’, ‘Tourism/border crossings’ and ‘Agriculture’. In many cases, one data source will not suffice for producing official statistics. In these cases, one has to combine different data sources. This package has a scientific nature. From the methodological, qualitative and technical point of view it is required to work with professional independence. According to EUROSTAT: „Perform a conceptual investigation of the potential of combining multiple sources. This investigation should be organized by statistical domain. The following domains should be targeted: demography and migration, tourism, agriculture”

Introduction TASK 4 Summary plus future perspectives From the methodological, qualitative and technical point of view it is required to work with professional independence. However, WP 7 should use the experience of other workpackages especially sources-oriented. It is a reason that some WP7` tasks should take place after the tasks of other workpackages.

TASK 1. Data availability/Data inventory Identify big data sources taking into account sustainability and availability in several countries. Establishing an inventory of these sources by: brainstorming - a review of potential sources Preparation of the questionnaire with questions about the sources used by the project participants. Sending the questionnaire to participants Gathering answers and preparation for analysis Assessment of the possibility of using sources for big data analysis in the domains of population, tourism/border crossings, agriculture Build the list of potential sources

TASK 1. Data availability/Data inventory Identify which results or new products from the source-oriented pilots may contribute to these domains. Match the sources from the list of potential sources to following domains: Population Tourism/border crossings Agriculture Preliminary analysis of possibility for using sources to each domain - including: Consideration of the legal aspects Consideration of availability The preliminary analysis of the methodological aspects Consideration of the quality issues Preparation of initial technical requirements Build the list of exploitable sources for each domain   Describe the added value of delivered linkage between these sources to current statistics. Analyze the list of exploitable sources for each domain Prepare the map of linkages between Big Data sources (e.g which aspect of one data source can be used in several domains) Describe the added value for each domain.

TASK 1. Data availability/Data inventory Milestone 1. List of availability big data sources in the domain(s); by M6

TASK 2. Data feasibility Carry out explorative analyses on two or three big data sources in the domain of population, tourism / border crossings or agriculture. Selection the most value big data sources for each domain. Evaluation of the legal aspects; Evaluation of availability; Evaluation of methodology; Evaluation of the quality; Evaluation of technical requirements. Analyzing results. Preliminary assessment of the usefulness - developing the assessment factors Selection and recommendation two or three big data sources for using in the domain of population, tourism / border crossings, agriculture. Preparing the SWOT analysis (positive and negative factors of using several sources) Recommendation the most important and useful sources.

TASK 2. Data feasibility Milestone 2. Recommendation for using two or three big data sources in the domain(s); by M13 Recommendation for using two or three big data sources in the domain(s) – investigation whether that sources could be useful;

the partial report for each domain containing basic information on: SGA-1 SUMMARY Milestone 1. List of availability big data sources in the domain(s); by M6 Milestone 2. Recommendation for using two or three big data sources in the domain(s); by M13 DELIVERABLE the partial report for each domain containing basic information on: The data access (with legal and privacy aspects) The data quality issues The methodology (focus also on combining data) The technical aspects by M14

TASK 3. Data combination The experimental work (if practical work would be possible or if not it would be theoretical considerations including consultation with practice ex. sandbox ) Data collection Data preparation Data analysis Describe practical, technical and methodological aspects when combining big data outputs in the statistical system. For example, differences in definition, populations and volatility etc. Provide first answers on quality issues when combining big data with traditional outputs. Provide answers on the question whether micro-data have to be used when combining big data estimates with traditional outputs or data at aggregated level can be considered. Analysis advantage and disadvantages of combining data Preparing the list of criteria for combining data

TASK 3. Data combination Milestone 3. Combining data analyze; by M19

TASK 4. Summary plus future perspectives Suggest pilots and domains with successful implementation potential for further elaboration in the second wave of pilots in 2018. Recommendation on legal aspects; Recommendation on availability; Recommendation on methodology; Recommendation on quality; Recommendation on technical requirements. Conclusion

TASK 4. Summary plus future perspectives Milestone 4. List of potential pilots and domains with successful implementation potential - for further elaboration in the second wave of pilots in 2018. by M23 Deliverable (by M24) - THE GENERAL REPORT FOR DOMAINS including: The data access (with legal and privacy aspects) The data quality issues The methodology (focus also on combining data) The technical aspects

SGA-2 SUMMARY Deliverable THE GENERAL REPORT FOR DOMAINS including: Milestone 3. Combining data analyze; by M19 Milestone 4. List of pilots and domains with successful implementation potential for further elaboration in the second wave of pilots in 2018; by M23 Deliverable THE GENERAL REPORT FOR DOMAINS including: The data access (with legal and privacy aspects) The data quality issues The methodology (focus also on combining data) The technical aspects by M24

TIMETABLE FOR WP 7 COMBINING BIG DATA - STATISTICAL DOMAINS INVENTORY PREPARATION SELECTION COMBINING REPORT AND RECOMMEND Milestones 1. List of availability big data sources in the domain(s); by M6 Milestones 2. Recommendation for using two or three big data sources in the domain(s); by M13 DELIVERABLE the partial report for each domain by M14 Milestones 3. Combining data analyze; by M19 Milestones 4. List of pilots and domains with successful implementation potential for further elaboration in the second wave of pilots in 2018; by M23 Deliverable THE GENERAL REPORT FOR DOMAINS by M24

a.nowicka@stat.gov.pl w.jazwinska@stat.gov.pl