Presentation is loading. Please wait.

Presentation is loading. Please wait.

Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.

Similar presentations


Presentation on theme: "Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND."— Presentation transcript:

1 Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND

2 2 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Data processing infrastructure XML Files Statistical Files Golden Record Metadata SDMX Questionaries

3 Key elements of census process in terms of census quality Census planning - scope of census, Data sources, Data collecting, Data storing, Data processing, Development of census results, Dissemination of census results, Census Metadata System. Census Quality 3

4 CENSUS PLANNING 4

5 Census planning Quality aspects: relevance, accuracy, costs including the burden on respondents, information security Determining the data scope defined in Act including: Compliance with needs of domestic and EU users, Quality of data source, Coherence and comparability of results from census 2011 and 2002, Census Quality 5

6 DATA ACQUISITION 6

7 7 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Data acquisition XML Files Statistical Files Golden Record Metadata SDMX Questionaries

8 Files format: Flat files, XML files, Local Databases XML files integration, Data acquisition 8

9 Data acquisition - Portal 9

10 Datasources Quality aspects: accuracy, timeliness and punctuality, comparability and coherence, costs including the burden on respondents, information security Assessment of data sources quality for census: analyses of methodological compliance of concepts definitions from registers with those adopted in statistics and the UNECE and EUROSTAT Recommendations for the 2010 Censuses on Population and Housing, developing methodology for compliance analyses, constructing the IT system PiK for describing, comparing and assessing coherence level, Census Quality – data acquisition 10

11 Registers developing methodology for assessing the quality: dimensions, quality indicators, evaluation and description of sources quality, MATRIX that represents the possibility of obtaining the values for the census from registers: census variable compliance indicators (methodology compliance indicator), register suitability indicators (population coverage indicator for data from the register), Census Quality – data acquisition 11

12 Data sets developing methodology for assessing the quality, evaluation and description of data sets quality, developing methodology for improving source data sets quality – rules for: standardization, normalization, de- duplication, editing, imputation, calibration Census Quality – data acquisition 12

13 CENSUS FRAME PREPARATION 13

14 Citizens, buildings and dwelling list preparing, Citizens, buildings and dwelling list and statistical data integration, Census Frame preparing. Census Frame preparation 14 Goal Frame Preparation, Random Sample preparation,

15 Quality of Census Frame 15 Census frame pre-census revision - checking in field by enumerators Census frame preparation – validation and updating in counties,

16 Enumerator tracking

17

18 18

19 19

20 20

21 21

22 22

23 Census Completeness Monitoring

24 24

25 TRANSFORMATION TO STATISTICAL REGISTER 25

26 26 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Source data collection and preparation XML Files Statistical Files Golden Record Metadata SDMX Questionaries

27 Registers loading into data laboratory envroiment,Denormalization,Standarization,Deduplication,Validation,Data completion,Vocabulary validation and automatic correction,Statistical files (register) generation, Source data collection and preparation 27

28 Collecting data Quality aspects: accuracy, costs including the burden on respondents, information security Collecting data from information systems Central registers, Distributed registers, format / file structure (XSD schemas), data transfer platform, application for encrypted data transfer, application for validation and data set control Census Quality – collection and preparation 28

29 Data loading to Operational Microdatabase,ValidationManual and automatic correction (cleaning),Deduplication,Variables calculating, Source data loading and correction 29

30 30 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI CAxI XML Files Statistical Files Golden Record Metadata SDMX Questionaries

31 CAII - Computer Assisted Internet Interview, CAPI - Computer Assisted Personal Interview, CATI - Computer Assisted Telephone Interviewing. CAxI 31 CAXI

32 Collecting data from respondents: CAII, CAPI, CATI; CAxI input validation: Numerical data validation (answers within boundaries) Cross question arithmetical validation Hints and automatic answer completion Dictionaries and drop down menus CAxI logical validation: Answers determined by questions Cross question logical validation Data collection logical paths Census Quality – data collection by electronic questionare 32

33 Data storing Quality aspects: information security Data storing in Operational Microdata Base, Notification of Operational Microdata Base to registration by General Inspector for Protection of Personal Data, Census Quality 33

34 GOLDEN RECORD, 34

35 35 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Golden Record generation XML Files Statistical Files Golden Record Metadata SDMX Questionaries

36 36 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Export to Analitycal Microdata Base XML Files Statistical Files Golden Record Metadata SDMX Questionaries

37 Integration with Census Frame and CAxI data,Validation,Correction,Operational Imputation,Transfer proper values to Golden Record, Golden Record generation 37 Registers 1..n CAxI Golden Record OMB Layers

38 Transition Tables Preparing,Golden Records anonymisation,Transfer to Analitycal Microdatabase, Export to Analitycal Microdata Base 38

39 Data processing Quality aspects: accuracy Developing quality indicators for data sets at each stage of data processing and the procedures for calculating their value, Developing procedures for bringing data from administrative sources to full compliance or minimum discrepancy with appropriate methodology adopted in statistics, Developing procedures for normalization, editing of data sets from the administrative systems, including the imputation of data (administrative data sets), Developing procedures for synchronization of data from administrative systems, Developing rules for linking data from different administrative systems, Developing rules for linking data from administrative systems with data from CAII, CAPI, CATI, Developing rules for calculation of Golden Record census variables, Developing rules for anonymisation of Golden Record census data. Census Quality 39

40 ANALITYCAL MICRODATABASE 40

41 41 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Analitycal Microdata Base XML Files Statistical Files Golden Record Metadata SDMX Questionaries

42 Analitycal Microdata Base - process 42 Process data Load data and metadata Integrate data Classify and code data Edit and validate data Impute Derive new variables Wage Aggregate Create files Analyse Produce preliminary statistics Check quality Analyse Prepare statistics for Dissemination Approve Disseminate Prepare and modify dissemiation systems Prepare products Manage products Promote Monitor Archive Manage metainformation Manage quality

43 Functionality 43 Administration Information Security Management Data Processing Information Analisys Requirement and Product Management Dissemination Metadata Quality Management Analitycal Microdatabase

44 Development of census results Quality aspects: relevance, accuracy, comparability and coherence Developing rules for missing data completion - imputation and calibration, Developing rules for creating derived objects - creation of new objects (households, families), Developing a model / method of data estimation with the use of the data from administrative systems and sample surveys, Developing rules for calculating data outputs. Census Quality 44

45 DISEMINATION 45

46 Dissemination of census results Quality aspects: relevance, timeliness and punctuality, accessibility and clarity, comparability and coherence, information security Designing Analitycal Microdata Base features including compliance with users needs, accessibility and clarity of census data. Census Quality - disemination 46

47 METAINFORMATION MANAGEMENT 47

48 48 XML TXT Registry 1 Metadata server Operational Microdata Base Registry 2 Registry n Analitycal Microdata Base ETL Tools Portal CAXI Metadata server XML Files Statistical Files Golden Record Metadata SDMX Questionaries

49 Metainformation management 49 Metainformation Definition Bussines ReferencialConceptualMethodicalQualityStructural Technical System Postprocessi ng

50 Census Metadata System Quality aspects: accessibility and clarity Developing quality indicators at each stage of census and the procedures for calculating their value. Census Quality – metainformation 50

51 51 POLAND


Download ppt "Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND."

Similar presentations


Ads by Google