Warehouse approach: reusing data

Slides:



Advertisements
Similar presentations
Comparable statistics in the EU: ESS, an example of an effective regional statistical system Claudia Junker, Eurostat, head of unit "Statistical cooperation"
Advertisements

International Seminar on Modernizing Official Statistics:
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic The use of administrative data sources (experience and challenges)
Quality assurance activities at EUROSTAT CCSA Conference Helsinki, 6-7 May 2010 Martina Hahn, Eurostat.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.
The ESS.VIP Programme: a response to the challenges facing the ESS Mariana Kotzeva, ESS VIP Programme Coordinator Advisor Hors Classe ESTAT.
BAIGORRI Antonio – Eurostat, Unit B1: Quality; Classifications Q2010 EUROPEAN CONFERENCE ON QUALITY IN STATISTICS Terminology relating to the Implementation.
ORGANISATION FOR ECONOMIC CO-OPERATION AND DEVELOPMENT ORGANISATION DE COOPÉRATION ET DE DEVELOPMENT ÉCONOMIQUES OECDOCDE Workshop on improving statistics.
Monitoring public satisfaction through user satisfaction surveys Committee for the Coordination of Statistical Activities Helsinki 6-7 May 2010 Steve.
1 1 Expert Group on Energy Statistics. New York 2 – 5 Nov ESCN and the Oslo Group Olav Ljones Chair of the Oslo Group Statistics Norway
Developments in European Statistics Challenges in Official Statistics Visit to NSO Romania July 2009 Walter Radermacher, Chief Statistician of the.
Metadata Working Group Jean HELLER EUROSTAT Directorate A: Statistical Information System Unit A-3: Reference data bases.
SDMX and Metadata SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
ESTAT/EFTA/UNECE high level seminar on streamlining statistical production, 7-8 July 2011 The European Statistical System’s Vision Claudia Junker, Tomasz.
1 The GSBPM and ESS statistical business process metadata Session 4 H. Linden, Unit B6 Eurostat Workshop on Statistical Metadata (METIS) (Geneva, 5-7 October.
1 Enhancing data quality by using harmonised structural metadata within the European Statistical System A. Götzfried Head of Unit B6 Eurostat.
1 Quality reporting within the Eurostat and the ESS metadata systems August Götzfried and Håkan Linden Eurostat Unit B6: Reference databases and metadata.
Global Assessments and Integrated Economic Statistics Global Assessments and Integrated Economic Statistics from products towards processes United Nations.
Relationship between Short-term Economic Statistics Expert Group Meeting on Short-Term Statistics February 2016 Amman, Jordan.
Development Account: 6th Tranche Strengthening the capacity of National Statistical Offices (NSOs) in the Caribbean Small Island Developing States to fulfill.
Scientific, technical and socio-economic aspects of impacts of, and vulnerability and adaptation to, climate change SBSTA in-session workshop on Scientific,
Quality declarations Study visit from Ukraine 19. March 2015
Theme (iv): Standards and international collaboration
Short Training Course on Agricultural Cost of Production Statistics
UNECE-CES Work session on Statistical Data Editing
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Towards more flexibility in responding to users’ needs
Session 2: Institutional arrangements for energy statistics
The ESS vision, ESSnets and SDMX
SOFTWARE DESIGN AND ARCHITECTURE
Towards connecting geospatial information and statistical standards in statistical production: two cases from Statistics Finland Workshop on Integrating.
Modernization Maturity Model and Roadmap
Methodology – Monitoring and Tuning the Operational System
Modernisation of European social statistics
Introducing Statistical Standards -GAMSO
ESS-VIP ICT Project Task Force Meeting, Luxembourg, 5 March 2013.
Implementing the ESS Vision 2020
10-13 September 2013, Ankara, Turkey
ESS VIP-ESBRs European System of interoperable statistical Business Registers 2012 ESSnet Workshop, Rome  3-4 December 2012 Eurostat.
11. The future of SDMX Introducing the SDMX Roadmap 2020
Overview of working draft v. 29 January 2018
Developments in European Statistics
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
The ESS VIP programme: a response to the challenges facing the ESS
ESSnet Projects Pascal JACQUES Unit/B5 Methodology and research
2. An overview of SDMX (What is SDMX? Part I)
Working Party on Regional Statistics 1-2 October 2012
2. An overview of SDMX (What is SDMX? Part I)
Innovative sources and tools for living conditions surveys
EGR and BR ESS work and plans
Methodology – Monitoring and Tuning the Operational System
High level seminar on the implementation of the
Sub-Regional Workshop on International Merchandise Trade Statistics Compilation and Export and Import Unit Value Indices 21 – 25 November Guam.
3.4 Modernisation of Social Statistics
The thorough revision of the Urban Audit project
ESS.VIP Validation Item 5.1
Streamlining statistical production
Implementing the “Vision” within the ESS
M. Henrard, B5 N. Buysse and H. Linden, B6 Eurostat
Business architecture
1. SDMX: Background and purpose
ESS.VIP.SERV Shared Services
Meeting of the EHIS Technical Group Luxembourg January 2012
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Presentation of Project Joint meeting of the ESS.VIP.BUS ICT Project
Working towards a central Register : Simple, Complete and Widely Accessible September 29, 2010 Session no 5 - Register quality as a common task : Cooperation.
Implementing the “Vision” within ESS
Interoperability of metadata systems: Follow-up actions
Presentation transcript:

Warehouse approach: reusing data ESSNet Data Warehouse Tallinn workshop 20-21 March 2013 Allan Randlepp

Stovepipe model The stovepipe model is the outcome of a historic process in which statistics in individual domains have developed independently. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Advantages the production processes are best adapted to the corresponding products; it is flexible in that it can adapt quickly to relatively minor changes in the underlying phenomena that the data describe; it is under the control of the domain manager and it results in a low-risk business architecture, as a problem in one of the production processes should normally not affect the rest of the production. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) It has a number of advantages: the production processes are best adapted to the corresponding products; it is flexible in that it can adapt quickly to relatively minor changes in the underlying phenomena that the data describe; it is under the control of the domain manager and it results in a low-risk business architecture, as a problem in one of the production processes should normally not affect the rest of the production. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Disadvantages First, it may impose an unnecessary burden on respondents when the collection of data is conducted in an uncoordinated manner and respondents are asked for the same information more than once. Second, the stovepipe model is not well adapted to collect data on phenomena that cover multiple dimensions, such as globalisation, sustainability or climate change. Last but not least, this way of production is inefficient and costly, as it does not make use of standardisation between areas and collaboration between Member States. Redundancies and duplication of work, be it in development, in production or in dissemination processes are unavoidable in the stovepipe model. These inefficiencies and costs for the production of national data are further amplified when it comes to collecting and integrating regional data, which are indispensible for the design, monitoring and evaluation of some EU policies. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) However, the stovepipe model also has a number of disadvantages. First, it may impose an unnecessary burden on respondents when the collection of data is conducted in an uncoordinated manner and respondents are asked for the same information more than once. Second, the stovepipe model is not well adapted to collect data on phenomena that cover multiple dimensions, such as globalisation, sustainability or climate change. Last but not least, this way of production is inefficient and costly, as it does not make use of standardisation between areas and collaboration between Member States. Redundancies and duplication of work, be it in development, in production or in dissemination processes are unavoidable in the stovepipe model. These inefficiencies and costs for the production of national data are further amplified when it comes to collecting and integrating regional data, which are indispensible for the design, monitoring and evaluation of some EU policies. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Integrated model “Innovative way of producing statistics based on the combination of various data sources in order to streamline the production process. This integration is twofold: horizontal integration across statistical domains at the level of National Statistical Institutes and Eurostat. Horizontal integration means that European statistics are no longer produced domain by domain and source by source but in an integrated fashion, combining the individual characteristics of different domains/sources in the process of compiling statistics at an early stage, for example households or business surveys.   vertical integration covering both the national and EU levels. Vertical integration should be understood as the smooth and synchronized operation of information flows at national and ESS levels, free of obstacles from the sources (respondents or administration) to the final product (data or metadata). Vertical integration consists of two elements: joint structures, tools and processes and the so-called European approach to statistics (see this entry).” (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Advantages By integrating data sets and combining data from different sources (including administrative sources) the various disadvantages of the stovepipe model could be avoided. This new approach would improve efficiency by elimination of unnecessary variation and duplication of work and create free capacities for upcoming information needs. (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Warehouse approach “The warehouse approach provides the means to store data once, but use it for multiple purposes. A data warehouse treats information as a reusable asset. Its underlying data model is not specific to a particular reporting or analytic requirement. Instead of focusing on a process-oriented design, the underlying repository design is modelled based on data inter-relationships that are fundamental to the organisation across processes.” (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) “Based on this approach statistics for specific domains should not be produced independently from each other, but as integrated parts of comprehensive production systems, called data warehouses. A data warehouse can be defined as a central repository (or "storehouse") for data collected via various channels.” (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics)   “Data validation, integration, and provisioning into an dissemination database typically accounts for a large part of the statistical production costs, especially in the statistical fields where a lot of metadata would need to be harmonised or fine-tuned (for instance, statistical units, concepts and classifications). But by sourcing data into a reusable information asset in the form of a well-designed warehouse, these costs are incurred just once for the organisation, rather than multiple times in the case of multi-stovepipe deployments.” (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) (Terminology Relating To The Implementation Of The Vision On The Production Method Of Eu Statistics) 27/11/2018

Integrated warehouse model 27/11/2018

Variables in stovepipe model 27/11/2018

Standardising variables in warehouse

Integrated warehouse

New surveys and variables

Designing surveys