Session 8 Data Processing Estonian case study

Presentation transcript:

Session 8 Data Processing Estonian case study United Nations Technical Meeting on Use of Technology in Population and Housing Censuses 28th - 1st December 2016 Statistics Estonia Diana Beltadze

Data arrangement I stage

Data processing started on 31.12.2011 and lasted until 31.10.2012.

Issues after data collection
Identification of persons (missing personal codes)
Identification of addresses
Elimination of duplicates
Encoding
The majority of the identification and encoding exercises were completed by the end of April.
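The duplicate-elimination step listed above can be illustrated with a minimal sketch. It is not the procedure Statistics Estonia used: the record fields (`personal_code`, `name`, `birth_date`), the fallback key for records without a personal code, and the sample values are all assumptions made for illustration.

```python
def dedup_key(record):
    """Build a matching key (hypothetical rule, not from the slides).

    Use the personal code when it is present; otherwise fall back to
    the normalised name plus the birth date.
    """
    code = record.get("personal_code")
    if code:
        return ("code", code)
    return ("fallback", record["name"].strip().lower(), record["birth_date"])


def eliminate_duplicates(records):
    """Keep the first record seen for each key and collect the rest."""
    first_seen = {}
    duplicates = []
    for record in records:
        key = dedup_key(record)
        if key in first_seen:
            duplicates.append(record)
        else:
            first_seen[key] = record
    return list(first_seen.values()), duplicates


# Invented sample records; the codes are placeholders, not real ID codes.
records = [
    {"personal_code": "P-001", "name": "Mari Tamm", "birth_date": "1980-05-01"},
    {"personal_code": "P-001", "name": "Mari Tamm", "birth_date": "1980-05-01"},
    {"personal_code": None, "name": "Jaan Kask", "birth_date": "1975-11-23"},
    {"personal_code": None, "name": " jaan kask ", "birth_date": "1975-11-23"},
    {"personal_code": "P-002", "name": "Anna Saar", "birth_date": "1990-02-14"},
]

unique, duplicates = eliminate_duplicates(records)
print(len(unique), "unique,", len(duplicates), "duplicates")
```

A record without a personal code never matches a record that has one in this sketch, which is one reason identifying persons would come before removing duplicates.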

Address operator map application

Primary data correction team
Data correction manager
Operator for ID codes
Operator for addresses
Operator for duplicates
Coders (coder, senior coder)

Encoding of text answers

Variable       Manual %   Automatic %
Occupation         98.1           1.9
EMTAK              91.9           8.1
Religion           68.0          32.0
VS-RTK             50.4          49.6
Citizenship        71.4          28.6
Language            0.4          99.6
Nationality        73.3          26.7
Dialect            52.9          47.1
Address           100.0           0.0
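The split between automatic and manual encoding can be sketched as a dictionary lookup with a manual fallback. This is only an illustration under stated assumptions: the slides do not describe the matching method, and the codebook entries, codes, and sample answers below are invented.

```python
def encode_answers(answers, codebook):
    """Code each free-text answer automatically when the codebook has it.

    Answers with no exact match (after lower-casing and whitespace
    normalisation) are queued for a human coder.
    """
    coded, manual_queue = [], []
    for text in answers:
        key = " ".join(text.lower().split())
        code = codebook.get(key)
        if code is None:
            manual_queue.append(text)
        else:
            coded.append((text, code))
    return coded, manual_queue


def shares(coded, manual_queue):
    """Return (manual %, automatic %) rounded to one decimal place."""
    total = len(coded) + len(manual_queue)
    if total == 0:
        return 0.0, 0.0
    automatic = round(100 * len(coded) / total, 1)
    return round(100 - automatic, 1), automatic


# Hypothetical codebook and answers for a "Language" variable.
codebook = {"estonian": "et", "russian": "ru", "finnish": "fi"}
answers = ["Estonian", "  russian ", "Võro kiil", "ESTONIAN", "Finnish"]

coded, manual_queue = encode_answers(answers, codebook)
manual_pct, automatic_pct = shares(coded, manual_queue)
print(f"manual {manual_pct} %, automatic {automatic_pct} %")
```

In this sketch every answer that fails the lookup ends up in the manual queue, so the two shares always sum to 100, as they do in each row of the table.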

Data processing workflow (diagram)
Source registers: Population register; Buildings register, Land Board; Education register
Stage I: VVIS (data collection system, primary data correction)
Stage II: VAIS (data correction system, imputations, controls, etc.)

Data processing, stage 2
VAIS is a collection of tools and technologies aimed at automating data processing (Phase 5 in GSBPM). In essence, checking, cleaning, and transforming statistical activity data means taking the raw data from one or more sources and transforming it into the data warehouse (observation registry) for statistical analysis.

Metadata-driven, template-based tool
The template-driven approach provides a universal solution for three main goals:
Create an easy-to-use statistical data processing tool that requires minimal programming skills for creating transformation packages.
Create a metadata-driven, process-oriented and automated statistical data processing tool.
Create an extendable data transformation tool.

Data processing with VAIS includes:
Automating and speeding up data transformation
Raw data, transformation metadata and source data audit trails
Metadata-driven, template-based tool
Balancing automation and manual intervention

Balancing automation and manual intervention (diagram)
Elements: raw data; automated data processing; metadata (validation and transformation rules); decision "OK?"; manual data processing; data warehouse
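One way to read the diagram elements above is that rules held as metadata drive the automated step, and records that fail the "OK?" check go to manual processing. The sketch below follows that reading as an assumption; it does not reproduce VAIS, and the rule format, field names, and sample records are hypothetical.

```python
# Validation and transformation rules kept as data (metadata), not code.
VALIDATION_RULES = [
    {"field": "age", "check": "range", "min": 0, "max": 120},
    {"field": "sex", "check": "in_set", "values": {"M", "F"}},
]
TRANSFORMATION_RULES = [
    {"field": "municipality", "op": "strip_upper"},
]


def validate(record, rules):
    """Return a list of error messages; an empty list means 'OK'."""
    errors = []
    for rule in rules:
        value = record.get(rule["field"])
        if rule["check"] == "range":
            ok = isinstance(value, (int, float)) and rule["min"] <= value <= rule["max"]
        elif rule["check"] == "in_set":
            ok = value in rule["values"]
        else:
            raise ValueError(f"unknown check: {rule['check']}")
        if not ok:
            errors.append(f"{rule['field']}={value!r} fails {rule['check']}")
    return errors


def transform(record, rules):
    """Apply the transformation rules to a copy of the record."""
    out = dict(record)
    for rule in rules:
        if rule["op"] != "strip_upper":
            raise ValueError(f"unknown op: {rule['op']}")
        if rule["field"] in out:
            out[rule["field"]] = str(out[rule["field"]]).strip().upper()
    return out


def process(raw_records):
    """Route valid records to the warehouse and the rest to manual work."""
    warehouse, manual_queue = [], []
    for record in raw_records:
        errors = validate(record, VALIDATION_RULES)
        if errors:
            manual_queue.append((record, errors))
        else:
            warehouse.append(transform(record, TRANSFORMATION_RULES))
    return warehouse, manual_queue


raw = [
    {"age": 34, "sex": "F", "municipality": " tallinn "},
    {"age": 150, "sex": "M", "municipality": "Tartu"},
    {"age": 12, "sex": "?", "municipality": "Narva"},
]
warehouse, manual_queue = process(raw)
print(len(warehouse), "loaded,", len(manual_queue), "sent to manual processing")
```

Because the rules are plain data, adding a check means adding a dictionary entry rather than changing the processing code, which is the general idea behind a metadata-driven tool.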

Implementation
VAIS development: 05.2010-10.2011
Data processing of the Population and Housing Census 2011: 01.11.2011-30.10.2012
Reuse of administrative data (2012): development of the data collection system for administrative data (ADAM) and of eSTAT, for prefilling questionnaires in eSTAT with administrative data (annual bookkeeping report) (31.08.2011). VAIS is used for converting administrative data into the statistical data format (for data collection in 2012, i.e. for reference year 2011).
Data processing of other statistical activities (first pilots 2012)
Data processing of the next, register-based Population and Housing Census (pilot 2014)

VAIS budget
Investments: 0.7 million EUR
Support: 0.1-0.2 million EUR (11.2011-12.2012)
VAIS development took 18 months

Conclusion: different ways to avoid and correct mistakes
Using data from the population register
Controls at the time of filling in the questionnaire
Using classifications in the questionnaire
Identifying persons and addresses
Coding
Removing duplicates
Additional controls
Imputation

Thank You!