Maintaining data quality: fundamental steps


Agenda
- The whole process
- Questionnaire design
- Data collection
- Software design
- Data entry

The whole process

Questionnaire design
- Asking the right questions, in the right way
- Structure the questionnaire effectively
-> Pilot & back-translate

Data collection
- Veracity
- Quality of survey
- Quality of filling questionnaires
-> Back-checks & accompaniments

Software design; data entry and management
- Minimize data entry errors
- Organize data in an effective way
- Clean data
-> Double entry & error checking

Questionnaire design
- Clear skip patterns whenever needed. The software designer will then need to include those in the data entry software.
- Grids
- Single/multiple options
- Interviewer checkpoints
- When coding your questions, make sure that all options are included. For example, if there is even a small chance that people will say “I don’t know”, include the code “-999” in the question.

Pilot and translate survey
- Pilot: in non-research areas, but in a similar setting
  - Depending on how ready the questionnaire is, 30 to 40 pilots
  - Can also pilot some sections more intensively
- Translation: back-translation is MANDATORY

Data collection: surveyors
- Selection
- Training: before the survey, and ongoing
  - Before the survey:
    - Classroom and field
    - Questionnaire + field instructions + behavior in the field
    - Training on the issue of interest
    - Also, if you have time to write an instruction manual, it is useful
  - Ongoing: keep going to the field with them and do reminder trainings (e.g., when you notice they prompt too much)
- Maintain motivation: go out with them, bonuses, etc.
- STAY IN THE FIELD WITH THEM

Data collection: quality checks
- Team structure
  - One supervisor for five surveyors
  - A field monitor, if your team is big, to help you manage the team
- Monitoring in the field
  - Accompaniments by supervisor: all the time
  - Accompaniments by monitor: 75% of the time
  - Accompaniments by yourself: maybe 15% of the time
  - Back-checks by field monitor: 15% of questionnaires, some sections (mandatory!)
  - Do some back-checks yourself
  - Analyse the data from back-checks right away!
- If you use a survey company, you still need to do your own back-checks and some accompaniments

Questionnaire quality: scrutiny
- Scrutinize questionnaires
  - Have surveyors and supervisors do it
  - But also do it yourself!
  - If you have a project assistant, ask them to scrutinize 100%, but still scrutinize 50% or so yourself (at least the trickiest sections)
  - Examples of mistakes that only you can catch: codes for activity, logical consistency
- When scrutinizing, write all codes, even if not pre-coded: “-777” for missing, or “-999” for “I don’t know”
- If you find too much missing or inconsistent data, send surveyors back to the field

Data management: goals
- Quality
- Timing

Timing is important, and you need to monitor the Data Entry Officers (DEO) or the Data Entry (DE) company carefully to make sure they stick to timelines. But by no means should you sacrifice any step related to quality checks (if you save time on those steps, you’ll lose time later).

Data entry software
- Software
  - Need to think about it as soon as the questionnaire is close to final
  - Could be done by the survey company or outsourced to someone else (less expensive, or someone you trust more)
  - The goal is that the DEO should make as few mistakes as possible

Data entry software
- Software development: send the developer a detailed spreadsheet with instructions for each question (the range of acceptable values, logical checks, etc.). The more detailed it is, the more time you’ll save later.
- Software testing: once the software designer has built the software, test it yourself by entering a batch of questionnaires (e.g., pilot questionnaires, or invented responses; just make sure you test every part of the software).
- Check output: then look at the output carefully to make sure it looks fine, and send it to the professors you work with to make sure they are satisfied with it.
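To make the "range of acceptable values, logical checks" idea concrete, here is a minimal Python sketch of what such a per-question specification could look like once turned into checks. The field names, ranges, and the skip rule are invented for illustration; only the codes -999 ("I don't know") and -777 (missing) come from the slides.

```python
# Hypothetical per-question specification, mirroring the kind of spreadsheet
# sent to the developer. Field names and ranges are illustrative assumptions.
DONT_KNOW = -999  # code for "I don't know" (from the slides)
MISSING = -777    # code for missing (from the slides)

SPEC = {
    "age": {"min": 0, "max": 110},
    "has_children": {"allowed": {0, 1}},
    "num_children": {"min": 1, "max": 20},
}


def check_record(record):
    """Return a list of human-readable problems found in one questionnaire."""
    problems = []
    for field, rule in SPEC.items():
        value = record.get(field)
        # Blank fields and the special codes are exempt from range checks.
        if value is None or value in (DONT_KNOW, MISSING):
            continue
        if "allowed" in rule and value not in rule["allowed"]:
            problems.append(f"{field}: {value} is not an allowed code")
        if "min" in rule and not rule["min"] <= value <= rule["max"]:
            problems.append(f"{field}: {value} is out of range")
    # Logical (skip-pattern) check: num_children is asked only if has_children == 1.
    if record.get("has_children") == 0 and record.get("num_children") is not None:
        problems.append("num_children: should be skipped when has_children is 0")
    return problems
```

Running `check_record` on every pilot questionnaire you enter during software testing is one way to confirm that the data entry software enforces the same rules as the spreadsheet.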

Checking output
- When checking output, try to imagine yourself analysing the data!
- All fields need to be numerical (except text fields, like comments or “others – specify”). There is not much you can do with text fields when you analyse.
- One example: when a question allows multiple responses (say the question is “Where do you take your water from?” and there are 5 options: well, tap, etc.), it should be treated as 5 questions (1. Do you take your water from the well? Yes or no. 2. Do you take your water from the tap? Yes or no. Etc.). The response to each of these questions is a binary variable, i.e., either 1 (yes) or 0 (no).
- This becomes obvious if you put yourself in the shoes of the person who will analyse the data (among others, you!). If it is treated as a single question and the DEO enters “1, 2, 5” in the single response field, you cannot do anything with that data!
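The water-source example can be sketched in Python as follows. Only "well" and "tap" are named in the slides; the other three option labels and the `water_` variable prefix are assumptions made for illustration.

```python
# Option codes for "Where do you take your water from?".
# Codes 3 to 5 are hypothetical labels; the slides only name "well" and "tap".
OPTIONS = {1: "well", 2: "tap", 3: "river", 4: "vendor", 5: "other"}


def split_combined(raw):
    """Parse a combined entry such as '1, 2, 5' into a set of option codes."""
    return {int(part) for part in raw.split(",") if part.strip()}


def to_binary(selected_codes):
    """Turn a set of selected codes into one 0/1 variable per option."""
    return {
        f"water_{name}": int(code in selected_codes)
        for code, name in OPTIONS.items()
    }
```

For instance, `to_binary(split_combined("1, 2, 5"))` gives one yes/no variable per source, which is the shape the data entry software should produce directly rather than leaving it to be repaired at analysis time.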

Data entry
- Timing: data entry should start as soon as possible after data collection starts, and before collection is over!
- Double entry: mandatory. Must be written in the contract.
  - One output
  - Two outputs, reconciled
- Error checking: check the error rate on a regular basis (batches of 200 or 300 questionnaires), and before you do any cleaning.
- Payment to DE company: include a contract clause stating that the first payment will be made only after 200 or so questionnaires have been delivered to you, you have checked the error rate, and it is below 0.5%. Pay only after that.
- Get bad data re-entered entirely, whatever the nature of the errors.
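Reconciling a double entry amounts to comparing the two outputs field by field and listing every disagreement, so that each one can be resolved against the paper questionnaire. A minimal Python sketch, assuming each output has been loaded as a dictionary keyed by questionnaire ID (the data layout and function name are assumptions, not something the slides prescribe):

```python
def compare_entries(first, second):
    """List disagreements between two data entry outputs.

    first, second: dict mapping questionnaire ID -> dict of field -> value.
    Returns a list of (questionnaire_id, field, value_in_first, value_in_second).
    A questionnaire or field present in only one output shows up with None
    on the other side.
    """
    discrepancies = []
    for qid in sorted(set(first) | set(second)):
        rec1 = first.get(qid, {})
        rec2 = second.get(qid, {})
        for field in sorted(set(rec1) | set(rec2)):
            v1, v2 = rec1.get(field), rec2.get(field)
            if v1 != v2:
                discrepancies.append((qid, field, v1, v2))
    return discrepancies
```

Each tuple in the result points to one field that someone should look up on the physical questionnaire before the reconciled dataset is accepted.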

Error rate checking
- What is it?
  - For each batch, re-enter a sample of data fields and compare it with the data given by the company (for those fields)
  - Need approximately 3000 per batch
- How to do it?
  - Divide your data into sub-sections (of about 25 questions)
  - In some cases you will receive your data split into tabs; you can use those tabs as sub-sections if they are small enough
  - For each sub-section, randomly select 5% of the questionnaires in your batch
  - Enter the data from that section of the selected questionnaires (using an Excel spreadsheet, or the data entry software)
  - Compare your dataset with the original data (use Stata, Excel, or comparison software), and check on the physical questionnaire who made the mistake
  - Error rate: number of errors made by the company / number of fields (one error is one field with a mistake, not one question!)
  - Calculate the error rate for each section, and overall
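The procedure above can be sketched in Python. This is only an illustration under stated assumptions: the function names and data layout are hypothetical, and the re-entered values are assumed to have already been verified against the physical questionnaires, so every mismatch counts as a company error.

```python
import random


def sample_questionnaires(ids, share=0.05, seed=0):
    """Randomly select a share (5% by default) of questionnaire IDs.

    The seed is fixed only so that the selection can be reproduced.
    """
    rng = random.Random(seed)
    ordered = sorted(ids)
    k = max(1, round(len(ordered) * share))
    return sorted(rng.sample(ordered, k))


def error_rate(company, reentered, fields):
    """Errors made by the company divided by the number of fields checked.

    company, reentered: dict mapping questionnaire ID -> dict of field -> value.
    Only questionnaires present in `reentered` are checked, and one error is
    one field with a mistake.
    """
    checked = 0
    errors = 0
    for qid, own in reentered.items():
        for field in fields:
            checked += 1
            if company[qid].get(field) != own.get(field):
                errors += 1
    return errors / checked if checked else 0.0
```

Calling `error_rate` once per sub-section, and once over all fields, gives the per-section and overall rates; each can then be compared with the 0.5% threshold written into the contract.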

Data cleaning and organizing
- Clean your data in a different file
- Rename and label variables
- Check for logical errors
- Look at ranges and outliers
- Do basic data summaries
- Check for duplicate data
- Check for missing data
- Look at distribution of data by surveyors/teams
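A few of these checks (duplicates, ranges, missing data by surveyor) can be sketched in plain Python. The record layout, with `qid` and `surveyor` keys, is an assumption for illustration; the codes -777 and -999 are the ones used earlier in the slides.

```python
from collections import Counter

MISSING = -777    # code for missing
DONT_KNOW = -999  # code for "I don't know"


def find_duplicate_ids(records):
    """Return questionnaire IDs that appear more than once."""
    counts = Counter(r["qid"] for r in records)
    return sorted(qid for qid, n in counts.items() if n > 1)


def out_of_range(records, field, low, high):
    """Return IDs whose value for `field` falls outside [low, high].

    The special codes are not treated as outliers.
    """
    return [
        r["qid"]
        for r in records
        if r[field] not in (MISSING, DONT_KNOW) and not low <= r[field] <= high
    ]


def missing_share_by_surveyor(records, field):
    """Share of records coded as missing for `field`, per surveyor."""
    totals = Counter()
    missing = Counter()
    for r in records:
        totals[r["surveyor"]] += 1
        missing[r["surveyor"]] += int(r[field] == MISSING)
    return {s: missing[s] / totals[s] for s in totals}
```

A surveyor whose missing share stands well apart from the rest of the team is a natural candidate for an accompaniment or a reminder training.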