Sample Issues and Field Work Session V Lusaka, January 20, 2003 Juan Munoz and Francesca Recanatini www.worldbank.org/wbi/governance.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Faculty of Allied Medical Science Biostatistics MLST-201
MKTG 3342 Fall 2008 Professor Edward Fox
Discussion Sampling Methods
Taejin Jung, Ph.D. Week 8: Sampling Messages and People
MISUNDERSTOOD AND MISUSED
Dr. Chris L. S. Coryn Spring 2012
Who and How And How to Mess It up
Beginning the Research Design
Sampling.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
CHAPTER twelve Basic Sampling Issues Copyright © 2002
Determining the Sample Plan
Formalizing the Concepts: Simple Random Sampling.
Sampling ADV 3500 Fall 2007 Chunsik Lee. A sample is some part of a larger body specifically selected to represent the whole. Sampling is the process.
Sampling Procedures and sample size determination.
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
Sampling Design.
Sampling Concepts Population: Population refers to any group of people or objects that form the subject of study in a particular survey and are similar.
Sampling Designs and Sampling Procedures
SAMPLING METHODS Chapter 5.
Sample Design.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Sampling: Theory and Methods
Sampling: What you don’t know can hurt you Juan Muñoz.
CHAPTER 12 – SAMPLING DESIGNS AND SAMPLING PROCEDURES Zikmund & Babin Essentials of Marketing Research – 5 th Edition © 2013 Cengage Learning. All Rights.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Sampling Methods. Definition  Sample: A sample is a group of people who have been selected from a larger population to provide data to researcher. 
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Selection of Elements Population Element the individual subject on which the measurement is taken; e.g., the population.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Basic Sampling Issues CHAPTER Ten.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
1 Hair, Babin, Money & Samouel, Essentials of Business Research, Wiley, Learning Objectives: 1.Understand the key principles in sampling. 2.Appreciate.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Population and sample. Population: are complete sets of people or objects or events that posses some common characteristic of interest to the researcher.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Assuring good field work Juan Muñoz. What happens when fieldwork is poor? A long and frustrating process of “data cleaning” becomes unavoidable The data.
Chapter 15 Sampling and Sample Size Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Basic Sampling Issues CHAPTER twelve.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Chapter Eleven Sampling: Design and Procedures Copyright © 2010 Pearson Education, Inc
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Chapter Ten Copyright © 2006 John Wiley & Sons, Inc. Basic Sampling Issues.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
1 Aspects of Sampling for Household Surveys Kathleen Beegle Workshop 17, Session 1c Designing and Implementing Household Surveys March 31, 2009.
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
1 Health Results-Based Financing Impact Evaluation Surveys Quality Assurance and Data Management Álvaro Canales, Beatriz Godoy, Juan Muñoz Sistemas Integrales.
Types of method Quantitative: – Questionnaires – Experimental designs Qualitative: – Interviews – Focus groups – Observation Triangulation.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
Sampling Design and Procedure
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
AC 1.2 present the survey methodology and sampling frame used
Sampling.
Sampling Designs and Sampling Procedures
Graduate School of Business Leadership
Meeting-6 SAMPLING DESIGN
Sampling: Design and Procedures
Sampling: Theory and Methods
Basic Sampling Issues.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
BUSINESS MARKET RESEARCH
Presentation transcript:

Sample Issues and Field Work Session V Lusaka, January 20, 2003 Juan Munoz and Francesca Recanatini

Motivation The team has defined: The purpose of the assessment The variables to study The empirical tool to use The process to employ But, who should be targeted?

Basic Definitions Sampling allows to measure characteristics of a population, when accessing the whole population is not possible because of economic, practical or physical considerations.  Sampling allows to select a subset of a population to study a specific issue in a meaningful way

Basic Definitions Population: the sum of all the observations within a specified set Target population: all statistical units of interest for the purposes of analysis Working population: all statistical units that can be surveyed

Probability sampling Also known as Scientific Sampling. Respondents are selected randomly. Each respondent in the population has a known, nonzero probability of being included in the sample.

Basic Sampling Techniques The three basic techniques of probability sampling: Simple Random Sampling Multi-stage Sampling Stratified Sampling Most household and firm surveys use a combination of these three techniques.

Probability sampling Permits establishing sampling errors and confidence intervals. Other sampling procedures (purposive sampling, convenience sampling, quota sampling, etc.) cannot do that. Other sampling procedures can also yield biased conclusions.

Simple Random Sampling Respondents are selected independently. Every respondents in the population has an equal chance or probability of being selected in the sample. This probability is: p = n/N where n=the size of the sample. N=the size of the study population.

Simple Random Sampling Simple random sampling is almost never the only technique used in practice, because: A Sampling Frame may not be available, or it would be very large (a Sampling Frame is a list of all units in a study population that can be used to select a sample from. Fieldwork may be difficult since the selected households would be too scattered.

Simple Random Sampling Simple random sampling is almost never the only technique used in practice, but it is useful to illustrate some basic facts about sampling: Sampling errors and confidence intervals. The relationship between sampling error and sample size. The relationship between sampling error and population size. Sampling errors vs. non-sampling errors.

Sampling error and sample size Sampling error e when estimating a proportion p with a sample of size n taken from an infinite population

Confidence intervals In a sample of 1,000 enterprises, 280 enterprises (28 percent) have been harassed by a predatory agency. Sampling error is 1.42 percent.

Confidence intervals In a sample of 1,000 enterprises, 280 enterprises (28 percent) have been harassed by a predatory agency. Sampling error is 1.42 percent. Sampling error 95 percent confidence interval:28 ± percent confidence interval: 28 ±

Sampling error and sample size Sampling error Sample size To halve sampling error......sample size must be quadrupled

Sample size and population size Sampling error e when estimating a proportion p with a sample of size n taken from a population of size N finite population correction

Sample size and population size Sample size needed for a given precision Population size

Sample size Error Non-sampling error Sampling error Total error Sampling vs. non-sampling errors

Two-stage Sampling The population is divided up into subgroups, or “ Primary Sampling Units (PSUs) ”, that represent aggregates of individual households. In the first stage, a sample of PSUs is selected. In the second stage, a sample of individual households is chosen in each of the selected PSUs.

Two-stage Sampling Solves the problems of Simple Random Sampling Provides an opportunity to link community- level factors to respondent behavior The sample can be made self-weighted if In the first stage, PSUs are selected with Probability Proportional to Size (PPS) In the second stage, a fixed number of respondents are chosen within the selected PSUs The price to pay is cluster effect

Cluster effect Sampling error grows when the sample of size n is drawn from k PSUs, with m households in each PSU (n=k m) Cluster effect Intra-cluster correlation coefficient

1.95 Cluster effects Intra-cluster correlation coefficient 0.05 Number of PSUs Number of households per PSU For a total sample size of 12,000 households

Cluster effects Intra-cluster correlation coefficient Number of PSUs Number of households per PSU For a total sample size of 12,000 households

Cluster effects Intra-cluster correlation coefficient Number of PSUs Number of households per PSU For a total sample size of 12,000 households

, Cluster effects Intra-cluster correlation coefficient Number of PSUs Number of households per PSU For a total sample size of 12,000 households

Stratified Sampling The population is divided up into subgroups or “ strata ”. A separate sample of households is then selected from each strata.

Stratified Sampling There are two primary reasons for using a stratified sampling design: To potentially reduce sampling error by gaining greater control over the composition of the sample. To ensure that particular groups within a population are adequately represented in the sample. The two objectives are generally contradictory in practice.

Stratified Sampling Stratification Variable: variable or variables by which a study population is divided up into strata (or groups) in order to select a stratified sample. Proportionate Stratified Sample: Stratified sample where the number of respondents selected from each strata is proportional to the number of units in each strata in the population. Disproportionate Stratified Sample: Stratified sample where the number of respondents selected from each strata is not proportional to the number of units in each strata in the population. Almost all national household surveys use Disproportionate Stratified Sampling. This implies that raising factors, or “ sampling weights ” need to be used to obtain national estimates from the sample.

Parts of the country may need to be excluded from the sample for security or other reasons Excluded strata

Measuring change Pros and cons of panel samples A panel can measure change more accurately A panel permits correlating change in the outcomes with change in other factors A panel approach may reduce the effort of the second and subsequent rounds Panels are harder to manage and entail long-term commitments between data users and producers Panels are subject to attrition (respondent fatigue, migration, disappearance from the market, etc.) A panel is more vulnerable to manipulation from the predatory agencies

Assuring good field work

What happens when fieldwork is poor? A long and frustrating process of “ data cleaning ” becomes unavoidable The data loose their policy-making relevance Data quality is not guaranteed The process converges (at best) to databases that are internally consistent The process entails a myriad of decisions, generally undocumented Users mistrust the data

Key factors Manage the survey as an integrated project Implement the team concept in the organization of field operations Integrate computer-based quality controls to field operations Establish strong supervision procedures Ensure sufficient training Work with a reduced staff over an extended period of data collection

Management levels Core staff Survey manager Field operations manager Data manager Tactical options for the organization of field teams Mobile teams with fixed data entry Mobile teams with integrated data entry Sometime in the future: the paperless interview

Mobile teams with fixed data entry Cote d’Ivoire (1984) Peru (1985) Ghana Pakistan Guinea-Conakry Mozambique

Composition of a field team SupervisorInterviewers Data entry operator

The team and its tools SupervisorInterviewers Data entry operator Antropo- metrist

Two PSUs visited in a four- week period Alama Bamako Regional Office

First week AlamaBamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week AlamaBamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week Alama Bamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week Alama Bamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week Alama Bamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week AlamaBamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama They complete first half of questionnaires in all selected households

First week AlamaBamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week AlamaBamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama

First week Alama Bamako Regional Office Operator remains in Regional Office Rest of the team travels to Alama and back

First week AlamaBamako Regional Office Supervisor gives Alama questionnaires to DEO Rest of the team travels to Alama and back

Second week Alama Bamako Regional Office Operator enters first week data from Alama Rest of the team travels to Bamako

Second week Alama Bamako Regional Office Operator enters first week data from Alama Rest of the team travels to Bamako

Second week Alama Bamako Regional Office Operator enters first week data from Alama Rest of the team travels to Bamako They complete first half of questionnaires in all selected households

Second week Alama Bamako Regional Office Operator enters first week data from Alama Rest of the team travels to Bamako and back

Second week Alama Bamako Regional Office Supervisor gives Bamako questionnaires to DEO. DEO gives back Alama questionnaires with flagged inconsistencies Rest of the team travels to Bamako and back

Third week Alama Bamako Regional Office Operator enters first week data from Bamako Team completes second half of questionnaires. They correct inconsistencies from first half

Fourth week AlamaBamako Regional Office Operator enters second week data from Alama. Corrects inconsistencies from first round Team completes second half of questionnaires. They correct inconsistencies from first half

Fourth week Regional Office The result is a clean data set on diskette, ready for analysis immediately after data collection

Mobile teams with integrated data entry Nepal (1992) Argentina Paraguay Bangladesh (2000)

Mobile teams with integrated data entry Regional Office Alama Bamako Cocody Team works with portable computers and printers

Mobile teams with integrated data entry Regional Office Alama Bamako Cocody Operator travels with the rest of the field team

Mobile teams with integrated data entry Regional Office Alama Bamako Cocody Data entry and validation almost immediate

Mobile teams with integrated data entry Regional Office Alama Bamako Cocody Reduced trips to and from Regional Office to selected PSUs

Mobile teams with integrated data entry Regional Office Alama Bamako Cocody

Benefits of integration Provides reliable and timely databases Provides immediate feedback on the performance of the field staff, allowing early detection of inadequate behaviors Ensures that all field staff applies uniform criteria throughout the full period of data collection Solves inconsistencies through direct verification of households reality, rather that through office guesswork Is consistent with the total quality culture

Supervision tasks Verification of questionnaires for completeness Random re-interviews of households Observation of interviews

Selecting and training field staff Why is it important How long does it take How is it organized

Example: Day 2 of interviewer training for household survey Definition of household (and dwelling, family, etc.) Pictorial of a sample household Slide with an empty roster (explain case conventions, encoding, skip patterns, etc.)

Example, cont. Fill the roster for the sample household (need for legible handwriting, recording of ages, use of a calendar of events, etc.) Role playing (trainer as a respondent, simulating borderline cases) Role playing (trainees interview each other)