DATA ENTRY AND CLEANING Zerubabel Ogom Ojoo

Slides:



Advertisements
Similar presentations
MICS4 Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Data Entry Editing.
Advertisements

MICS DATA PROCESSING Data Entry Editing. REMEMBER AND REMIND YOUR FIELD STAFF: The best place to correct data is in the field where the respondent is.
Industry Issues: Dataset Preparation for Time to Event Analysis Davis Gates Schering Plough Research Institute.
Organisation Of Data (1) Database Theory
Pengolahan dan Analisa Data Indra Budi Fasilkom UI.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
System Design System Design - Mr. Ahmad Al-Ghoul System Analysis and Design.
The controlled assessment is worth 25% of the GCSE The project has three stages; 1. Planning 2. Collecting, processing and representing data 3. Interpreting.
Crime Section, Central Statistics Office..  The Crime Section would like to acknowledge the assistance provided by the Probation Service in this project.
© 2009, KAISER PERMANENTE CENTER FOR HEALTH RESEARCH Getting to Know Your Data Basic Data Cleaning Principles.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Präsentationstitel IAB-ITM Find the right tags in DDI IASSIST 2009, 27th-30th Mai 2009 IAB-ITM Finding the Right Tags in DDI 3.0: A Beginner's Experience.
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
Basic Concept of Data Coding Codes, Variables, and File Structures.
Database Structure Basics National Center for Immunization & Respiratory Diseases Influenza Division Pam Kennedy Analyst, McKing Consulting Regional Training.
HELPFUL HINTS FOR MAINTAINING PAYROLL DATA Session 9A 2014 Spring KASBO Conference Sheila Miller, Business Analyst Office of KIDS Kentucky Department of.
Presenter: Shant Mandossian EFFECTIVE TESTING OF HEALTHCARE SIMULATION SOFTWARE.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 9 Processing the Data.
Math 116 Chapter 12.
Power to the People Evidence from a Randomized Field Experiment on Community-Based Monitoring in Uganda Martina Björkman, IGIER, University of Bocconi,
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Data Quality Data Cleaning Beverly Musick, M.S. May 20, This module was recorded at the health informatics –training course— data management series.
© 1999, by Que Education and Training, Chapter 5, pages of Introduction to Computer Programming with Visual Basic 6: A Problem-Solving Approach.
Worksheet for a Service Business
How FACILITY CMIS and E-Portal are used within the organisation
EMIS-R Level 1 & 2 Reports Teresa Williams NWOCA/SSDT NWOCA Conference Fall 2012.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Sampling Methods  Sampling refers to how observations are “selected” from a probability distribution when the simulation is run. 1.
Tips for Researchers on Completing the Data Analysis Section of the IRB Application Don Allensworth-Davies, MSc Statistical Manager, Data Coordinating.
The Marketing Research Project. Purposes of the Project 1.Give you practical experience at conducting a marketing research project. 2.Examine some factors.
Public Expenditure Tracking Surveys(PETS) Zerubabel Ojoo Management systems and Economic Consultants Jan
North Carolina Program Integrity Sampling/Extrapolation Practicum Bradford Woodard, M.S. Senior Health Data Analyst.
System Analysis and Design
AADAPT Workshop South Asia Goa, December 17-21, 2009 Maria Isabel Beltran 1.
Control Chart Methodology for Evaluating CEMS Data
Using Data in the Goal-Setting Process Webinar September 30, 2015.
Data and information. Information and data By the end of this, you should be able to state the difference between DATE and INFORMAITON.
Systems Life Cycle. Know the elements of the system that are created Understand the need for thorough testing Be able to describe the different tests.
Post enumeration survey in the 2009 Pilot Census of Population, Households and Dwellings in Serbia Olga Melovski Trpinac.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
DATA PREPARATION: PROCESSING & MANAGEMENT Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
Nonparametric Statistics
PROCESSING OF DATA The collected data in research is processed and analyzed to come to some conclusions or to verify the hypothesis made. Processing of.
PROCESSING, ANALYSIS & INTERPRETATION OF DATA
The Software Development Process
INFORMATION MANAGEMENT Module INFORMATION MANAGEMENT Module
ESchoolPLUS District Data Coordinator September Training September 18, 2015.
Preparing Data for Quantitative Analysis Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
Excel part 5 Working with Excel Tables, PivotTables, and PivotCharts.
José Aponte Public Health Advisor Module 3: Adding Intelligence to Forms 12 June 2012 Epi Info™ 7 Introductory Training Office of Surveillance, Epidemiology,
Research Methodology Lecture No :32 (Revision Chapters 8,9,10,11,SPSS)
PAYROLL HELPFUL HINTS Sheila Miller, Business Analyst Office of KIDS Kentucky Department of Education.
Assistant Instructor Nian K. Ghafoor Feb Definition of Proposal Proposal is a plan for master’s thesis or doctoral dissertation which provides the.
Sociology. Sociology is a science because it uses the same techniques as other sciences Explaining social phenomena is what sociological theory is all.
A Training Course for the Analysis and Reporting of Data from Education Management Information Systems (EMIS)
WHAT IS A SAMPLING DISTRIBUTION? Textbook Section 7.1.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
WHO The World Health Survey Data Entry
Control Chart Methodology for Evaluating CEMS Data
Just the basics: Learning about the essential steps to do some simple things in SPSS Larkin Lamarche.
Power to the People Evidence from a Randomized Field Experiment on Community-Based Monitoring in Uganda Martina Björkman, IGIER, University of Bocconi,
Rusinga DSS DATA MANAGEMENT.
Spreadsheets, Modelling & Databases
Presentation transcript:

DATA ENTRY AND CLEANING Zerubabel Ogom Ojoo

DATA ENTRY AND CLEANING  Objective is to help prepare effective data entry programs that minimize the post cleaning period

DATA ENTRY AND CLEANING  Elements of clean data  Consistent and logical All variables are consistent. Expenditure of continuous variables realistic Consistency in coding All missing values justified and documented

DATA ENTRY AND CLEANING  Good questionnaire layout facilitates the data entry design and subsequent data entry and cleaning 1.Involve a Data Management Specialist from the beginning 2.Delineate the questionnaire into sections 3.Pre-code all variables directly on the questionnaires 4.Enumerate each variable clearly 5.Create entry boxes for response fields 6.Integrate logical skips and test it during pilot

DATA ENTRY AND CLEANING Quality Controls  Design an effective data entry program 1.Data entry screens must match questionnaire. 2.Concurrent controls (Real time controls) 3.Range checks.

DATA ENTRY AND CLEANING NoQuestionsCodesSkip to 1Did you receive any private funds last year? 1Yes 2No >>q3 2How much? 3Date of birth 4Year started teaching in this school 5Year started teaching 6What is your salary?Enter “9” for refuse to answer >>10 7Relationship with district1 Excellent 2 Good 3 Fair 4 Bad

DATA ENTRY AND CLEANING  Real time Controls  Checks are done at data entry time e.g. If S1Q1 = 2 then skip to S1Q3 Endif e.g If S1Q1 = 1 and S1Q3 <0 then reject (if s1q1 =1 then amount must exist) endif

DATA ENTRY AND CLEANING  Range checks  Limits all out of range values Most out of range values come from data entry miskeys and carelessness  Soft checks Warns data entry operators but can override  Hard check Cannot override

DATA ENTRY AND CLEANING  Use controls or reference tables whenever available  Sample codes 1, 24, 45, 60, 90, 154, 766, 980  Other identification variables

DATA ENTRY AND CLEANING  Simple consistency checks  Q1: when did you start teaching in this school = 2000  Q2: when did you start teaching = 2002 Inconsistent Consistency check: If S4Q1>S4Q2 then reject Message (S4Q1 must be >= S4Q2) Endif

DATA ENTRY AND CLEANING Conclusion  Data Entry screens must match questionnaire  Real time Controls  Range checks  Simple Consistency checks

Software The data analyst has to make a selection of the software to be used for statistical analysis e.g. SPSS, SAS, MS Access, MS Excel The analyst will be guided by the researchers on the sort of tables to be generated.

ANALYSIS OF PETS Primary analysis should focus on your PET objectives : Measure leakage of funds tracked on their way to schools and analyze the causes  Use simple parameters (simple average percentages and standard deviations) but complex relationship can be developed.  Analyze equity in leakage distribution(urban/rural divide etc)  Comparing resources disbursed at various levels: central, district, subdistrict, subcounty, schools against entitlements.  Calculating average differences between levels.  Determine how these differences vary over time and space.  What are the explanations.  What are the proposed interventions.

Amount actual received/Amount entitled Calculated by administrative level. Region,District, School. Explanations for the leakages. LEAKAGES

Analysis beyond leakages Leakage is the primary focus but there are many aspects of service delivery that are equally important. Textbooks procured and at school but not used or stored outside school. Teachers are hired and paid for but not teaching. Students enrolled but absent most of the time. Students at school but parents not providing for uniform and lunches. Health workers paid for but not attending to patients when drugs are out of stock. Rude nurses scare pregnant women who want to deliver at health center. Health centers opened in time but waiting time to see health worker too long. Most patients opt for traditional healers instead of public health centers. All these call for a combination of research methodologies and interventions to investigate and resolve the above.