Farm Household Surveys DATABASE ORGANISATION AND DATA CLEANING Glwadys Aymone GBETIBOUO C4ECOSOLUTIONS, CAPE TOWN Economics analyses of climate change.

Slides:



Advertisements
Similar presentations
How to write a study protocol Hanne-Merete Eriksen (based on Epiet 2004)
Advertisements

Multiple Indicator Cluster Surveys Data Entry and Processing.
MICS Data Processing Workshop
MICS Data Processing Workshop Overview. Data Processing Design Data processing is organized around clusters There is one set of data files for each cluster.
MICS4 Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Overview of Data Processing System.
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Entry and Processing.
Forthcoming events in MICS3 MICS3 Data Analysis and Report Writing Workshop.
Do Economic and Demographic Characteristics Differ between Web and Mail Respondents to the 2005 Census of Agriculture Content Test? By Nancy J. Dickey.
Preparing Data for Quantitative Analysis
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences CHAPTER.
1 QUANTITATIVE DESIGN AND ANALYSIS MARK 2048 Instructor: Armand Gervais
©2004, 2006, 2008 UIW Department of Instructional Technology Meat and Potatoes SPSS Presented by Terence Peak.
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
MICS Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Data Quality Tables.
Learning Objective Chapter 13 Data Processing, Basic Data Analysis, and Statistical Testing of Differences CHAPTER thirteen Data Processing, Basic Data.
Survey Methodology Survey data entry/cleaning EPID 626 Lecture 10.
Conducting a Job Analysis to Establish the Examination Content Domain Patricia M. Muenzen Associate Director of Research Programs Professional Examination.
Microdata sources for research on the Economics of Tobacco Control Lynn Woolfrey Economics of Tobacco Control Workshop University of Cape Town 25 June.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 15.
(5) Moderators - Coding. Overview General Information to keep in mind:  Coded variables - Objective – which are usually study characteristics that can.
STAT 3130 Statistical Methods II Missing Data and Imputation.
Organizing Your Data for Statistical Analysis in SPSS
Data Processing, Fundamental Data
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Analysis and Reporting.
Key steps in conducting survey research Decide if a survey is the best design to use Short time, economical, dispersed population, anonymity Report what.
Workshop on International Standards, Contemporary Technologies and Regional Cooperation, Noumea, New Caledonia, 04–08 February 2008 Results Generated from.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
MICS Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Overview of MICS Tools, Templates, Resources, Technical Assistance.
Chapter Thirteen Validation & Editing Coding Machine Cleaning of Data Tabulation & Statistical Analysis Data Entry Overview of the Data Analysis.
Research Methodology Lecture No : 21 Data Preparation and Data Entry.
Chapter 1:Statistics: The Art and Science of Learning from Data 1.1: How Can You Investigate Using Data? 1.2: We Learn about Populations Using Samples.
P REPARING FOR DATA ANALYSIS MBBS H ONOURS P ROGRAM Jenny Zhang Research Fellow School of Medicine The University of Queensland.
AADAPT Workshop South Asia Goa, December 17-21, 2009 Maria Isabel Beltran 1.
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
1 Sri Lanka Quarterly Labour Force Survey. Household surveys before 1990 Labour Force and Socio Economic Survey (LFSES) 1980/811985/86 After 5 years /70.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
PROCESSING, ANALYSIS & INTERPRETATION OF DATA
Data Management Seminar, 9-12th July 2007, Hamburg Entering Data Part 2.
Data Management Seminar, 8-11th July 2008, Hamburg 1 Survey Administration Receiving Material Data Submission Instrument Preparation Codebook Adaptation.
Data processing of the 1999 Vietnam Population Census.
Dr. Michael R. Hyman, NMSU Data Preparation. 2 File, Record, and Field.
13 Data Processing and Fundamental Data Analysis.
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-I Evaluation of editing and.
Preparing Data for Quantitative Analysis Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
DATA DESCRIPTION Research Methods College of Public and Community Services University of Massachusetts at Boston ©2012 William Holmes 1.
TIMOTHY SERVINSKY PROJECT MANAGER CENTER FOR SURVEY RESEARCH Data Preparation: An Introduction to Getting Data Ready for Analysis.
Calculators How to use yours! Use this document to note down appropriate commands for YOUR calculator in the spaces provided on page 7, Ch 2 in the Lecture.
Chapter 34 Organisation & Collection of Data. Primary & Secondary Data PRIMARY DATA is collected for a particular purpose. PRIMARY DATA is obtained from.
Data Processing, Fundamental Data Analysis, and the Statistical Testing of Differences Chapter Twelve.
Data Management Seminar, 9-12th July 2007, Hamburg 1 Survey Administration Receiving Material Data Submission Instrument Preparation Codebook Adaptation.
First meeting of the Technical Cooperation Group for the Population and Housing Censuses in South East Europe Vienna, March 2010 POST-ENUMERATION.
Data Preparation and Description Lecture 24 th. Recap If you intend to undertake quantitative analysis consider the following: type of data (scale of.
Coding Preparing The Research for Data Entry. Coding (defined) Coding is the process of converting questionnaire responses into a form that a computer.
Chapter Fourteen Copyright © 2004 John Wiley & Sons, Inc. Data Processing and Fundamental Data Analysis.
WG/UNICEF Child functioning module: Preliminary results from Samoa & Supporting documentation Mitchell Loeb National Center for Health Statistics/ Washington.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Quantitative Data Analysis and Interpretation
CHAPTER 13 Data Processing, Basic Data Analysis, and the Statistical Testing Of Differences Copyright © 2000 by John Wiley & Sons, Inc.
Databases.
Databases.
Rusinga DSS DATA MANAGEMENT.
Data Processing, Basic Data Analysis, and the
Data Preparation (Click icon for audio) Dr. Michael R. Hyman, NMSU.
MOON Data File Components
Presentation transcript:

Farm Household Surveys DATABASE ORGANISATION AND DATA CLEANING Glwadys Aymone GBETIBOUO C4ECOSOLUTIONS, CAPE TOWN Economics analyses of climate change impacts workshop Accra, Ghana

Database organisation and cleaning, or data management is generally seen as a set of tasks related to the tabulation phase of the survey, in other words, activities that are conducted towards the end of the survey project, that use computers in clean offices. Survey data management should begin concurrently with questionnaire design. Keys points to consider: – Nature and identification of the statistical units observed – Built-in redundancies – Length and complexity of the questionnaire – Sample size and design – Survey timing and scheduling

DATA ENTRY : “flat file”

codification of the statistical unit ADM0ADM1ADM2CADM0CADM1CADM2CODE South AfricaEastern CapeAberden

Household code 8 digits code HHCODE

DATA ENTRY SYSTEM A complex household survey typically contains hundreds of variables. For example household survey dataset 2003 GEF study : 1342 variables After the survey instrument has been finalized, you develop the data entry system and provide a protocol for data entry. Coding questionnaire Coding sheet Household data: 12 worksheets Climate data; soil data, runoff data

DATA ENTRY hhcodeTIBfarmtyperelheadhhsizegender1age1 HHCODETIB : : : : : : : : :

Data cleaning Generally data is subjected to control mechanisms: 1.range checks, 2.consistency checks and 3.typographical checks

Range checks Every variable in the survey contains only data within a limited domain of valid values. tab farmtype, missing farmtype | Freq. Percent Cum | | | | | Total | hhcode farmtype remark CHECK DATA FOR THIS OBS.

Consistency check Values from one question are consistent with values from another question.  Demographic consistency of the household  Consistency of age and other individual characteristics gen test=hhmales+hhfemales list hhcode hhsize hhmales hhfemales test remark if test!=hhsize, hhcode hhsize hhmales hhfemales test remark CHECK DATA FOR THIS OBS CHECK DATA FOR THIS OBS. tab age5 hhcode age5 remark CHECK DATA FOR THIS OBS.

Typographical checks Typographical error consists in the transposition of digits like entering : 41 rather than 14 This error can be check through the double data entry of all questionnaires -999 rather than.-99 in a numerical input foreach var of varlist _all { replace `var'=-99 if `var'==-999 replace `var'=. if `var'==-99 } Use the tab function to obtain frequency tables of the datafrequency tables of the data