Published by Sheena Daniels. Modified over 9 years ago.
Copyright 2010, The World Bank Group. All Rights Reserved.

PROCESSING, Part 1
Data capture, editing, imputation and tabulation
Quality assurance for census

COMPONENTS OF PROCESSING
Data Capture
Editing
Imputation
Tabulation

DATA CAPTURE
Key Entry
Scanning
Direct Entry

KEY ENTRY
Key entry as a data capture technique has advantages and disadvantages.
Pro
  – Relatively inexpensive
  – Skills readily available
  – Creates employment

KEY ENTRY (cont’d)
Con
  – Time consuming
  – Requires many workstations
  – Error prone

SCANNING
Scanning is a process similar to photocopying, but current technology has advanced far beyond simple photocopying. There are three levels of scanning:
1. OMR: Optical Mark Recognition
2. OCR: Optical Character Recognition
3. ICR: Intelligent Character Recognition

SCANNING (cont’d)
Pro
  – Fast processing
  – Reliable

SCANNING (cont’d)
Con
  – Expensive upfront costs for equipment, software and training
  – Very precise requirements for paper, printing and processing

DIRECT ENTRY
Most recent innovations
  – Enumerators use hand-held computers
  – Where internet access is common, self-enumeration through the internet

DIRECT ENTRY
Pro
More efficient
  – Saves a step (immediate data capture)
Improves data quality
  – Editing at respondent level
  – Timeliness
Reduces some costs
  – Printing questionnaires

DIRECT ENTRY
Con
Requires better trained enumerators
Riskier
  – Hardware failure
  – PDA loss
  – Requires electricity
Expensive

EDITING
The editing process
  – Identifies errors
  – Identifies non-response
  – Identifies logical inconsistencies
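
The three kinds of problem listed on the editing slide can be told apart in code. The sketch below is illustrative only: the field names, the code list for sex, the age range and the minimum marriage age are all assumptions made for the example, not rules taken from the slides or from any census specification.

```python
# Hypothetical code list and thresholds, chosen only for illustration.
VALID_SEX = {"M", "F"}
MAX_AGE = 120
MIN_MARRIAGE_AGE = 12


def edit_record(record: dict) -> list:
    """Return a list of edit flags for one person record."""
    flags = []

    # Non-response: a required field is missing or blank.
    for field in ("age", "sex", "marital_status"):
        if record.get(field) in (None, ""):
            flags.append(f"non-response:{field}")

    # Error: a reported value falls outside its valid range or code list.
    age = record.get("age")
    if isinstance(age, int) and not 0 <= age <= MAX_AGE:
        flags.append("error:age_out_of_range")
    sex = record.get("sex")
    if sex not in (None, "") and sex not in VALID_SEX:
        flags.append("error:sex_invalid_code")

    # Logical inconsistency: each value is valid on its own,
    # but the combination is not plausible.
    if (
        isinstance(age, int)
        and age < MIN_MARRIAGE_AGE
        and record.get("marital_status") == "married"
    ):
        flags.append("inconsistency:age_vs_marital_status")

    return flags
```

Editing in this sense only identifies problems. Deciding what to do with a flagged record is a separate step.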

DEALING WITH ERRORS
Replacement (Imputation)
Weighting

IMPUTATION
Two classes of imputation
  – Deterministic
  – Stochastic

DETERMINISTIC IMPUTATION
Will yield the same answer each time
  – Missing data may be calculable from other values, e.g. Citizenship can be calculated from Place of Birth
  – Missing data is imputed by a sequential donor technique or any other method that will yield identical results
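
A sequential donor technique can be sketched as follows, assuming the simplest form in which a missing value is replaced by the most recent valid value met in file order. The function name, the `initial_donor` parameter and the `_imputed` flag field are hypothetical. A production system would normally also restrict donors to records in the same imputation class, which this sketch leaves out.

```python
def sequential_donor_impute(records, field, initial_donor):
    """Fill missing `field` values with the last valid value seen in file order.

    `initial_donor` is an assumed starting value, used only until the first
    record with a valid value has been read.
    """
    last_valid = initial_donor
    result = []
    for rec in records:
        rec = dict(rec)  # work on a copy so the input is left untouched
        if rec.get(field) is None:
            rec[field] = last_valid
            rec[field + "_imputed"] = True  # keep an audit trail
        else:
            last_valid = rec[field]
        result.append(rec)
    return result
```

Because the donor depends only on the order of the file, rerunning the function on the same file returns the same imputed values, which is what makes the method deterministic.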

DETERMINISTIC IMPUTATION
Six main types
  – Deductive or Logical
  – Mean Value
  – Ratio/Regression
  – Sequential Hot Deck
  – Sequential Cold Deck
  – Nearest-Neighbor
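
Two of the listed types, mean value and ratio imputation, are short enough to sketch directly. The function names are hypothetical, missing values are represented as `None`, and the ratio version assumes an auxiliary variable `x` that is known for every record. This is a minimal illustration, not a production routine.

```python
from statistics import mean


def mean_impute(values):
    """Replace each missing value with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def ratio_impute(y, x):
    """Replace each missing y with R * x, where R = sum(y) / sum(x)
    computed over the records in which y was observed."""
    pairs = [(yi, xi) for yi, xi in zip(y, x) if yi is not None]
    ratio = sum(yi for yi, _ in pairs) / sum(xi for _, xi in pairs)
    return [ratio * xi if yi is None else yi for yi, xi in zip(y, x)]
```

Both functions give the same result every time they are run on the same data, so they belong to the deterministic class.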

STOCHASTIC IMPUTATION
Can yield different results if the process is rerun
  – Use of a random donor or other randomized approach
  – Use of randomized residuals to create realistic data
Most deterministic methods have a stochastic counterpart
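
A random donor approach might look like the sketch below. The function name and the `seed` parameter are assumptions for the example. Each missing value is replaced by a value drawn at random from the observed ones, so two runs can disagree unless the random seed is fixed.

```python
import random


def random_donor_impute(values, seed=None):
    """Replace each missing value with a randomly chosen observed value.

    Passing a seed makes a run reproducible; leaving it as None lets
    repeated runs produce different imputations.
    """
    rng = random.Random(seed)
    donors = [v for v in values if v is not None]
    return [rng.choice(donors) if v is None else v for v in values]


ages = [30, None, 45, None, 62]
run_a = random_donor_impute(ages, seed=1)
run_b = random_donor_impute(ages, seed=1)  # same seed, same result as run_a
run_c = random_donor_impute(ages)          # unseeded, may differ between runs
```

Recording the seed alongside the output is one way to keep a stochastic imputation auditable.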

IMPUTATION
Which inconsistency to change?
The general rule is to change as few values as possible.
Which values to impute?
  – For a 10-year-old married university graduate, do we change the age, or the education and marital status?
  – Changing only the age can make the record consistent; therefore, change the age
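
The "change as few values as possible" rule can be illustrated with a brute-force search over sets of fields, smallest sets first. Everything in this sketch is assumed for the example: the code lists in `DOMAINS`, the two consistency rules in `passes_edits` and their age thresholds. Real edit systems use far more efficient methods; the point here is only to show the rule at work on the slide's example.

```python
from itertools import combinations, product

# Hypothetical code lists, for illustration only.
DOMAINS = {
    "age": range(0, 100),
    "marital_status": ["never married", "married"],
    "education": ["none", "primary", "secondary", "university"],
}


def passes_edits(rec: dict) -> bool:
    """Assumed consistency rules: minimum ages for marriage and for a degree."""
    if rec["marital_status"] == "married" and rec["age"] < 15:
        return False
    if rec["education"] == "university" and rec["age"] < 20:
        return False
    return True


def minimal_change_fields(rec: dict) -> tuple:
    """Return a smallest set of fields whose values can be changed so that
    the record passes every edit (empty tuple if it already passes)."""
    fields = list(DOMAINS)
    for size in range(len(fields) + 1):          # try 0 fields, then 1, ...
        for subset in combinations(fields, size):
            for values in product(*(DOMAINS[f] for f in subset)):
                candidate = {**rec, **dict(zip(subset, values))}
                if passes_edits(candidate):
                    return subset
    return tuple(fields)


record = {"age": 10, "marital_status": "married", "education": "university"}
fields_to_impute = minimal_change_fields(record)  # ("age",)
```

For the 10-year-old married university graduate, no single change to marital status or education is enough, while a change to age alone is, so the search returns only the age field. When several sets of the same size would work, this sketch simply returns the first one in field order, which is an arbitrary tie-break.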

VALIDATION
After imputation, the data are consistent, but they may or may not be correct.
Validation is the final step before certification and release.

VALIDATION
May be subject to bias:
  – Design
  – Training
  – Enumerator
  – Respondent
  – Processing

CERTIFICATION
Final step before data release
NSO’s expression of confidence in the data