Session 7 – Data aggregation and visualisation

Slides:

Advertisements

Similar presentations

United Nations Statistics Division Principles and concepts of classifications.

Advertisements

9/6/2001Database Management – Fall 2000 – R. Larson Information Systems Planning and the Database Design Process University of California, Berkeley School.

Quantifying Data.

Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.

Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.

Census/NeSS Roadshows March 2003 Better Information Initiatives.

Monitoring at the Household Level Methods, Problems, and Use of Critical Information.

Handbook on Residential Property Price Indices Chapter 5: Methods Jan de Haan UNECE/ILO Meeting, May 2010.

Statistik.atSeite 1 Norbert Rainer Quality Reporting and Quality Indicators for Statistical Business Registers European Conference on Quality in Official.

The Geographic Information System of the European Commission (GISCO) By Albrecht Wirthmann, GISCO, Eurostat ESPON.

Expert Group Meeting on MDG, Astana, 5-8 Oct.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector Sources of discrepancies between.

S T A T I S T I K A U S T R I A Quality Assessment of register-based Statistics A Quality Framework Manuela LENK Directorate.

Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.

1 Data.gov Initiative Implementation Acceleration Discussion Architecture and Infrastructure Committee Meeting March 19, 2009 Mike Carleton and Sonny Bhagowalia.

INSTITUTO NACIONAL DE ESTATÍSTICA Census 2011 Mapping Portuguese Process United Nations EGM on Contemporary Practices in Census Mapping and Use of GIS.

1 1 Geographic characteristics Proposal for 2020 CES recommendations Group of Experts on Population and Housing Censuses Geneva 23 – 26 September 2014.

Technical Assistance Office TCP Projects 2005 Contractual and Financial Management Administrative and Financial Handbook Prepared by IA, 14/12/2001 SOCRATES.

Session 2: Developing a Comprehensive M&E Work Plan.

1 Early Warning and Business Cycle Indicators in Analytical Frameworks International Seminar on Early Warning and Business Cycle Indicators 14 – 16 December.

European Commission’s project “Mapping of Broadband Services in Europe” IETF 96 Meeting, Berlin.

Quality declarations Study visit from Ukraine 19. March 2015

Session 4 – Data collection

Session 2 – Objective of workshop and status quo of project

Introduction To DBMS.

Survey on distributed natural gas quality

Session 5 – Data safety / security

Monitoring and Evaluation Systems for NARS Organisations in Papua New Guinea Day 3. Session 9. Periodic data collection methods.

Data collection process Mapping of Broadband Services in Europe First Stakeholder Consultation Workshop 7th June 2016 Eric Delannoy Mission France.

Land Cover Side Event: A new path forward for generating products

UNECE Seminar on New Frontiers for Statistical Data Collection, Geneva

Benefits expected from data providers

Objective ITY-ADQ ESSIP Plan 2015 Ana Paula FRANGOLHO DPS/PEPR

Inventory of cross-border and transnational data

Session 3 – Presentation of alpha version, tools and benefits

Haleh Kootval, Samuel Muchemi Public Weather Services Programme

Template library tool and Kestrel training

CMS HIPAA Transaction Implementation Status Checklist

ICP 7-th Regional Coordinators Meeting World Bank, Washington D.C.

Nettest An implementation of BEREC’s recommendations

Improving public accessibility and user engagement

Generic Statistical Business Process Model (GSBPM)

Identifying Inquiry and Stating the Problem

Improving and Using Family Survey Data

Disseminating regional and urban statistics The new visualisation tool of Eurostat Teodora Brandmüller Unit E4 Regional statistics and geographical information.

Indicator structure and common elements for information flow

ECONOMIC and SOCIAL CLASSIFICATIONS Introductory course Day 2 – second afternoon session PRODCOM Marie-Madeleine Fuger INSEE – France Hans Van Hooff CBS.

Discussion Group Meeting on Regional Statistics – 11th October 2011

Integrated management of LAU based territorial classifications

Brian Gong Center for Assessment

6.1 Quality improvement Regional Course on

Country report - Finland

ESTP COURSE ON ECONOMIC AND SOCIAL CLASSIFICATIONS Introductory course Day 2 – second afternoon session PRODCOM Marie-Madeleine Fuger INSEE – France.

Metadata The metadata contains

Zsófia Ercsey - KSH – Hungary Marie-Madeleine Fuger - INSEE – France

Research Problem: The research problem starts with clearly identifying the problem you want to study and considering what possible methods will affect.

WISE - State of the art --- WISE - in the context of SEIS

Urban Statistics on a national scale in the Netherlands

Country report - Denmark

Ass. Prof. Dr. Mogeeb Mosleh

Education and Training Statistics Working Group, May 2011

GSBPM AND ISO AS QUALITY MANAGEMENT SYSTEM TOOLS: AZERBAIJAN EXPERIENCE Yusif Yusifov, Deputy Chairman of the State Statistical Committee of the Republic.

Metadata on quality of statistical information

Indicator 3.05 Interpret marketing information to test hypotheses and/or to resolve issues.

How can DTM Multi-Sectoral Location Assessment be useful for Partners?

Decision-Making Tree for Clusters/partners for using DTM Location Assessment to collect data This visual does not indicate that DTM coordinator should.

DTM Field Companion for Location Assessment Sectoral Questions

How can DTM Multi-Sectoral Location Assessment be useful for

Zsófia Ercsey - KSH – Hungary Marie-Madeleine Fuger - INSEE – France

Geo-enabling the SDG indicators – experiences from the UN Global Geospatial Management and the GEOSTAT 3 project Agenda item 12 Ekkehard PETRI – Eurostat,

Stephanie Hirner ESTP ”Administrative data and censuses

Presentation transcript:

Session 7 – Data aggregation and visualisation

TUV + external speakers Agenda Data aggregation and visualisation TUV + external speakers 09:00 – 10:30 Presentation of data aggregation approach and visualisation approach TUV Practice report on challenges regarding “Quality of Experience” data Mr. Manner, Netradar (Finland) Mrs. Teixeira, INRIA (France) Mr. Rood, Stratix (Netherlands) Survey and discussion, inter alia: Aggregation an visualisation rules Representativeness of data All

Step 1: The process of data aggregation and visualisation Selection and processing of raw data Step 2: Aggregation Step 3: Conversion to data model Step 4: Data visualisation

Step 1: Selection and processing of raw data What is raw data and which choices have to be made? What is raw data Choices to be made Best case: Information on QoS at a specific location / geo-coordinate Other cases: Information on QoS linked to a geographical area What should be the purpose / expressiveness of my aggregated data? Which information do I want to share? Which single QoS data sets should be used to build up the aggregated data sets? NOTE: IP-addresses for single measurements are „personal data“ and cannot be collected within this project.  Data transformation into geo-coordinates or aggregation to address or grid level before data provision !

Step 1: Selection and processing of raw data Selection / elimination of single data based on Location accuracy Plausability of values Multiple measurements from same user in same area - 10 Mbit/s Description in meta data on what has been done

Step 1: Selection and processing of raw data Discussion on who should carry this out? Option 1: The initiative as owner of data Option 2: The contractor Requirement: close cooperation with data suppliers / giving instructions on selection / elimination choices + - + - Every data supplier carries out individual measures numerous approaches on processing raw data Knows context (focus and intention) of collected data best Data supplier has sovereignty of interpretation Contractor does not know the context of collected data  Risk of misinterpretation and data privacy concerns More homogenous approach on process raw data as only carried out by contractor

Step 2: Aggregation What is aggregated data and which choices have to be made? What is aggregated data? Choices to be made Combination of all values of QoS data sets that derive from the same region and have the same content to an aggregated value (e.g. min, max etc.) Which aggregated values should be provided? / Which quality criteria have to be fulfilled to provide a value for a region? Which significance should the values have?

Description in meta data on what has been done Step 2: Aggregation Definition of aggregation rules and information on which values will be supplied for which regions: Sample sizes Spatial distribution of samples Type of additional information needs be displayed in order to explain values Rural Urban 8 samples 1,300 samples Description in meta data on what has been done

Option 1: The initiative as owner of data Option 2: The contractor Step 2: Aggregation Discussion on who should carry this out? Option 1: The initiative as owner of data Option 2: The contractor Requirement: close cooperation with data suppliers / giving instructions on selection / elimination choices Every data supplier carries out individual measures numerous different aggregation rules - Data supplier has sovereignty of interpretation; knows the intended significance of the values, is able to assess which values are crucial + Contractor does not know the context of collected data  Risk of misinterpretation - Homogeneous approach to aggregation rules as only carried out by contractor +

Step 3: Conversion to data model Possible issues that could complicate transfer of data into the data model? Data provided on spatial resolution level which is not compliant to templates Divergent value categories

Step 3: Conversion to data model – Possible issues Data provided on spatial resolution level which is not compliant to templates By means of statistical data (e.g. population or number of households) aggregation can be carried out by contractor for data which is supplied in non-grid format If this statistical data is missing aggregation cannot be carried out by contractor Missing statistical data Data is provided for postal codes without geo-reference codes to NUTS levels Missing geo-data

Step 3: Conversion to data model Divergent value categories Different technologies clustered into technology group Different speeds clustered into speed group Statistical value differences Max Min Average etc. Median >35Mbit/s >50Mbit/s 16-24Mbit/s etc. <1Gbit/s etc. NGA Overall fixed Mobile Wired Wireless

Step 4: Data visualisation General approach Data will be visualised strictly according to data suppliers‘ intention: Without modification, or only modified in coordination with data supplier According to agreement in Memorandum of Understanding Publication on public version Publication on expert version Publication via data feed Specifications given in meta data will be displayed

Step 4: Data visualisation Visualisation of data sets from different initiatives Same value combination / different areas Same value combination / same areas X = X X = X ≠ = If collection approaches are similar, data from different data suppliers (can be visualised in one layer („national initiatives“) Data suppliers decide This decision can be indicated in data model „Solitary visualisation“ or „Simultaneous visualisation“ As collection approaches are very heterogeneous, it is not possible to visualise several initiatives on one layer Challenge to ensure that each data set is represented according to data suppliers‘ intention