Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.

Slides:



Advertisements
Similar presentations
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
Advertisements

Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
Application for presenting census results in the context of statistical data confidentiality in Poland Amelia Wardzińska-Sharif Central Statistical Office.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
The use and convergence of quality assurance frameworks for international and supranational organisations compiling statistics The European Conference.
New technologies used in 2010 Census Round – Polish case study Janusz Dygaszewicz Director of Central Census Bureau Geneva, 30 September – 03 October 2013.
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Population Census carried out in Armenia in 2011 as an example of the Generic Statistical Business Process Model Anahit Safyan Member of the State Council.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
NATIONAL STATISTICAL COMMITTEE OF THE KYRGYZ REPUBLIC: METADATA AND DATABASE ARCHIVE CREATION L. Tekeeva Deputy Chairman of the National Statistical Committee.
Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic 1 Subsystem QUALITY in Statistical Information System Czech.
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
Multi-modal of data collection for the 2010 Population and Housing Census National Statistical Office, Thailand (Daejeon, Republic of Korea, April.
MODERN CENSUS in POLAND Janusz Dygaszewicz Central Statistical Office in Poland Group of Experts on Population and Housing Census Geneva, October.
October 28-30, 2009 UNECE Geneva Quality Assessment of 2008 Integrated Census - Israel Pnina ZADKA Central Bureau of Statistics Israel.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Statistik.atSeite 1 Norbert Rainer Quality Reporting and Quality Indicators for Statistical Business Registers European Conference on Quality in Official.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Electronic data collection System in CSB of Latvia By Karlis Zeila, Vice President, CSB of Latvia IT DG meeting, October , Eurostat.
Editing of linked micro files for statistics and research.
1 C. ARRIBAS, D. LORCA, A. SALINERO & A. COLMENERO Measuring statistical quality at the Spanish National Statistical Institute.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Metadata Working Group Jean HELLER EUROSTAT Directorate A: Statistical Information System Unit A-3: Reference data bases.
© Statistisches Bundesamt, I/A Case study Federal Statistical Office Germany (Destatis) Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
7b. SDMX practical use case: Census Hub
QUALITY ASSESSMENT OF THE REGISTER-BASED SLOVENIAN CENSUS 2011 Rudi Seljak, Apolonija Flander Oblak Statistical Office of the Republic of Slovenia.
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
First meeting of the Technical Cooperation Group for the Population and Housing Censuses in South East Europe Vienna, March 2010 POST-ENUMERATION.
How official statistics is produced Alan Vask
A Training Course for the Analysis and Reporting of Data from Education Management Information Systems (EMIS)
Metadata models to support the statistical cycle: IMDB
Implementation of Quality indicators for administrative data
Development of Strategies for Census Data Dissemination
Multimode census enumeration in Poland Session 6
Guidelines for planning the costs of statistical surveys and other work implemented by the organisational units of official statistics services.
Generic Statistical Business Process Model (GSBPM)
Quality Assurance Plans of Turkey for 2021PHC
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
2. An overview of SDMX (What is SDMX? Part I)
6.1 Quality improvement Regional Course on
SDMX in the S-DWH Layered Architecture
Albania 2021 Population and Housing Census - Plans
Quality Assurance in the European Statistical System
The change of data sources in the Spanish SILC
Mapping Data Production Processes to the GSBPM
Metadata used throughout statistics production
The role of metadata in census data dissemination
Metadata on quality of statistical information
2.7 Annex 3 – Quality reports
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Joint UNECE/Eurostat/OECD
New technologies of data collections supporting transformation from traditional to combined and register-based censuses (PL) Janusz Dygaszewicz Central.
Presentation transcript:

Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND

2 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Data processing infrastructure XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Key elements of census process in terms of census quality Census planning - scope of census, Data sources, Data collecting, Data storing, Data processing, Development of census results, Dissemination of census results, Census Metadata System. Census Quality 3

CENSUS PLANNING 4

Census planning Quality aspects: relevance, accuracy, costs including the burden on respondents, information security Determining the data scope defined in Act including: Compliance with needs of domestic and EU users, Quality of data source, Coherence and comparability of results from census 2011 and 2002, Census Quality 5

DATA ACQUISITION 6

7 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Data acquisition XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Files format: Flat files, XML files, Local Databases XML files integration, Data acquisition 8

Data acquisition - Portal 9

Datasources Quality aspects: accuracy, timeliness and punctuality, comparability and coherence, costs including the burden on respondents, information security Assessment of data sources quality for census: analyses of methodological compliance of concepts definitions from registers with those adopted in statistics and the UNECE and EUROSTAT Recommendations for the 2010 Censuses on Population and Housing, developing methodology for compliance analyses, constructing the IT system PiK for describing, comparing and assessing coherence level, Census Quality – data acquisition 10

Registers developing methodology for assessing the quality: dimensions, quality indicators, evaluation and description of sources quality, MATRIX that represents the possibility of obtaining the values for the census from registers: census variable compliance indicators (methodology compliance indicator), register suitability indicators (population coverage indicator for data from the register), Census Quality – data acquisition 11

Data sets developing methodology for assessing the quality, evaluation and description of data sets quality, developing methodology for improving source data sets quality – rules for: standardization, normalization, de- duplication, editing, imputation, calibration Census Quality – data acquisition 12

CENSUS FRAME PREPARATION 13

Citizens, buildings and dwelling list preparing, Citizens, buildings and dwelling list and statistical data integration, Census Frame preparing. Census Frame preparation 14 Goal Frame Preparation, Random Sample preparation,

Quality of Census Frame 15 Census frame pre-census revision - checking in field by enumerators Census frame preparation – validation and updating in counties,

Enumerator tracking

18

19

20

21

22

Census Completeness Monitoring

24

TRANSFORMATION TO STATISTICAL REGISTER 25

26 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Source data collection and preparation XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Registers loading into data laboratory envroiment,Denormalization,Standarization,Deduplication,Validation,Data completion,Vocabulary validation and automatic correction,Statistical files (register) generation, Source data collection and preparation 27

Collecting data Quality aspects: accuracy, costs including the burden on respondents, information security Collecting data from information systems Central registers, Distributed registers, format / file structure (XSD schemas), data transfer platform, application for encrypted data transfer, application for validation and data set control Census Quality – collection and preparation 28

Data loading to Operational Microdatabase,ValidationManual and automatic correction (cleaning),Deduplication,Variables calculating, Source data loading and correction 29

30 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI CAxI XML Files Statistical Files Golden Record Metadata SDMX Questionaries

CAII - Computer Assisted Internet Interview, CAPI - Computer Assisted Personal Interview, CATI - Computer Assisted Telephone Interviewing. CAxI 31 CAXI

Collecting data from respondents: CAII, CAPI, CATI; CAxI input validation: Numerical data validation (answers within boundaries) Cross question arithmetical validation Hints and automatic answer completion Dictionaries and drop down menus CAxI logical validation: Answers determined by questions Cross question logical validation Data collection logical paths Census Quality – data collection by electronic questionare 32

Data storing Quality aspects: information security Data storing in Operational Microdata Base, Notification of Operational Microdata Base to registration by General Inspector for Protection of Personal Data, Census Quality 33

GOLDEN RECORD, 34

35 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Golden Record generation XML Files Statistical Files Golden Record Metadata SDMX Questionaries

36 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Export to Analitycal Microdata Base XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Integration with Census Frame and CAxI data,Validation,Correction,Operational Imputation,Transfer proper values to Golden Record, Golden Record generation 37 Registers 1..n CAxI Golden Record OMB Layers

Transition Tables Preparing,Golden Records anonymisation,Transfer to Analitycal Microdatabase, Export to Analitycal Microdata Base 38

Data processing Quality aspects: accuracy Developing quality indicators for data sets at each stage of data processing and the procedures for calculating their value, Developing procedures for bringing data from administrative sources to full compliance or minimum discrepancy with appropriate methodology adopted in statistics, Developing procedures for normalization, editing of data sets from the administrative systems, including the imputation of data (administrative data sets), Developing procedures for synchronization of data from administrative systems, Developing rules for linking data from different administrative systems, Developing rules for linking data from administrative systems with data from CAII, CAPI, CATI, Developing rules for calculation of Golden Record census variables, Developing rules for anonymisation of Golden Record census data. Census Quality 39

ANALITYCAL MICRODATABASE 40

41 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Analitycal Microdata Base XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Analitycal Microdata Base - process 42 Process data Load data and metadata Integrate data Classify and code data Edit and validate data Impute Derive new variables Wage Aggregate Create files Analyse Produce preliminary statistics Check quality Analyse Prepare statistics for Dissemination Approve Disseminate Prepare and modify dissemiation systems Prepare products Manage products Promote Monitor Archive Manage metainformation Manage quality

Functionality 43 Administration Information Security Management Data Processing Information Analisys Requirement and Product Management Dissemination Metadata Quality Management Analitycal Microdatabase

Development of census results Quality aspects: relevance, accuracy, comparability and coherence Developing rules for missing data completion - imputation and calibration, Developing rules for creating derived objects - creation of new objects (households, families), Developing a model / method of data estimation with the use of the data from administrative systems and sample surveys, Developing rules for calculating data outputs. Census Quality 44

DISEMINATION 45

Dissemination of census results Quality aspects: relevance, timeliness and punctuality, accessibility and clarity, comparability and coherence, information security Designing Analitycal Microdata Base features including compliance with users needs, accessibility and clarity of census data. Census Quality - disemination 46

METAINFORMATION MANAGEMENT 47

48 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Metadata server XML Files Statistical Files Golden Record Metadata SDMX Questionaries

Metainformation management 49 Metainformation Definition Bussines ReferencialConceptualMethodicalQualityStructural Technical System Postprocessi ng

Census Metadata System Quality aspects: accessibility and clarity Developing quality indicators at each stage of census and the procedures for calculating their value. Census Quality – metainformation 50

51 POLAND