Data Capture Process Stages


Data Capture Process Stages UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation and archiving Bangkok, Thailand, 15-19 September 2008

Overview
- Objective
- Major Process Stages
  - Document Scanning operations
  - Recognizing operations
  - Verifying operations
  - Coding Assistance
- Factors/Considerations

Objective
To provide an overview of the major process stages associated with optical data capture and quality assurance considerations.

Major Process Stages
- Document Scanning: scanner speeds depend on the process chosen
- Recognizing: depends on the sophistication of the recognition engine
- Verifying: automatic electronic verification; non-successful electronic verification
- Coding Assistance: prepare data in a form suitable for entry into the computer

Document Scanning Stage
Key feature: scanning speed
Scanning speed will be determined by:
- Quality of the scanner machines
- Size of non-drop-out color
- Paper quality, cleanliness and weight

Recognizing Stage
The recognizing process interprets the scanned images.
Accuracy of interpretation will be determined by:
- Recognition engine/memory dictionary
- Configuration threshold
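The role of a configuration threshold in the recognizing stage can be illustrated with a minimal sketch. It assumes, hypothetically, that the recognition engine returns a confidence score between 0 and 1 for each interpreted field; the names `RecognizedField` and `route_fields` and the 0.90 threshold are illustrative only and do not come from the presentation or from any particular product.

```python
from dataclasses import dataclass


@dataclass
class RecognizedField:
    form_id: str
    field_name: str
    value: str
    confidence: float  # hypothetical engine score in [0, 1]


def route_fields(fields, threshold=0.90):
    """Split fields into auto-accepted ones and ones needing operator verification."""
    accepted, to_verify = [], []
    for field in fields:
        if field.confidence >= threshold:
            accepted.append(field)
        else:
            to_verify.append(field)
    return accepted, to_verify


# Illustrative data only.
fields = [
    RecognizedField("F001", "age", "34", 0.98),
    RecognizedField("F001", "sex", "2", 0.71),
    RecognizedField("F002", "age", "8?", 0.40),
]
accepted, to_verify = route_fields(fields, threshold=0.90)
```

Raising the threshold sends more fields to operators and lowers the risk of accepting a wrong interpretation; lowering it does the opposite. The right balance depends on the form and the engine in use.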

Verifying Stage
Processing can be in geographic order or in random order:
- Automatic electronic verification
- Non-successful electronic verification: the value of the interpreted image needs to be compared with the real image of the form
- Image manipulation

Verifying Stage (cont.)
Image manipulation: electronic questionnaires can be sent to specialist operators and then back to the original operator if necessary (in some cases, the same questionnaire can be worked on simultaneously by two or more persons).

Coding Assistance Stage
- Process in which census questionnaire entries are assigned numerical and/or alphanumeric values
- Objective is to prepare data in a form suitable for entry into the computer
- Done by setting up possible responses to each question in the census questionnaire
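As a rough illustration of coding assistance, the sketch below sets up possible responses to one question as a lookup table and assigns a code to each written entry. The codebook contents, the code values and the function names are hypothetical; a real census would use its national classification.

```python
# Hypothetical codebook for one question (occupation); the codes are made up.
OCCUPATION_CODES = {"farmer": "61", "teacher": "23", "nurse": "32"}


def normalize(text):
    """Lower-case the entry and collapse surrounding or repeated whitespace."""
    return " ".join(text.lower().split())


def assign_code(response, codebook):
    """Return the code for a response, or None if it must be coded manually."""
    return codebook.get(normalize(response))


responses = ["Farmer", "  teacher ", "fisherman"]
coded = [(r, assign_code(r, OCCUPATION_CODES)) for r in responses]
```

Entries that return `None` are the ones an operator would still have to code by hand.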

Factors to be considered
- Questionnaire Design & Preparation
- Data Collection & Processing Considerations
  - Field Operation
  - Staff Training

Thank You

Additional material

Questionnaire Design & Preparation
Form Design Advice
- Consider the number of items to be included in a form
- Pre-print codes near the boxes where ticks are to be placed
- Given the speed of the data capture process, it is advisable to use marks or "ticks" as much as possible
- Define the drop-out color properly; use registration marks (allows for quicker recognition)

Questionnaire Design & Preparation
Form Design Advice (cont.)
- Maintain a consistent pattern for where the information to be collected is located
- Do not obscure the ticks and marks with titles, labels or instructions
- Avoid placing the answer field for a question on a different page from the question
- Avoid using open-ended questions

Questionnaire Design & Preparation
How to Obtain Good Scanning Results
- Select adequate paper quality
- Select a reliable printing press
- Use appropriate ink, considering the drop-out color
- For the questionnaires, paper heavier than 80 grams per square meter can help avoid paper jams in the scanner

Data Collection & Processing Considerations
- Field Operation: field operators should have basic knowledge of the data capture process chosen
- Staff Training: setting up the required training for staff helps ensure the quality and effectiveness of the data captured

Field Operation Considerations
Reasons for OCR reading errors:
- Bad condition of the form (dirty, folded, crumpled, etc.)
- Unnecessary lines on characters, such as points, decorative strokes, hooks, etc.
Check the questionnaires for completeness and consistency.

Training for Processing Staff
- Installation, set-up and break-down of equipment (e.g. hardware and software)
- Basic software knowledge
- Scanner operating procedures
- Troubleshooting (e.g. solutions to common problems/issues)

Control steps
Control steps should be taken when the image information is partial or missing, to assure the quality of the generated files:
- Value checking steps: verify that the information captured is the same as on the questionnaire
- Control for blanks: if the information is blank, determine what type of control must be applied
- Missing questionnaires: make sure that all questionnaires are scanned completely, with none missing and none duplicated
Control procedures therefore include producing control tables to compare with the manual work.
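The controls for blanks and for missing or duplicated questionnaires can be sketched as follows. The sketch assumes that every questionnaire carries a unique identifier and that a manually prepared list of expected identifiers is available; the function names and sample data are illustrative, not part of the presentation.

```python
from collections import Counter


def control_table(expected_ids, scanned_ids):
    """Compare scanned questionnaire IDs with the manually prepared list."""
    counts = Counter(scanned_ids)
    expected = set(expected_ids)
    scanned = set(counts)
    return {
        "missing": sorted(expected - scanned),
        "duplicated": sorted(i for i, n in counts.items() if n > 1),
        "unexpected": sorted(scanned - expected),
    }


def blank_fields(record, required):
    """List required fields that are absent or contain only whitespace."""
    blanks = []
    for name in required:
        value = record.get(name)
        if value is None or not str(value).strip():
            blanks.append(name)
    return blanks


# Illustrative data only.
table = control_table(
    expected_ids=["Q001", "Q002", "Q003", "Q004"],
    scanned_ids=["Q001", "Q002", "Q002", "Q005"],
)
blanks = blank_fields({"age": "34", "sex": " "}, required=["age", "sex", "occupation"])
```

Any non-empty entry in the control table, or any blank required field, would be followed up against the paper forms.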