Introduction to Sampling for the Implementation of PATs

Slides:



Advertisements
Similar presentations
4 Uses and Abuses of Statistics 4.1 Introduction to Statistics
Advertisements

MBF3C Lesson #1: Sampling Types and Techniques
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 22 Chapter 1 Section 2 Observational Studies, Experiments, and.
Chapter 10: Sampling and Sampling Distributions
Selection of Research Participants: Sampling Procedures
Friday, May 7, Descriptive Research Week 8 Lecture 2.
QBM117 Business Statistics Statistical Inference Sampling 1.
Chapter 7 Sampling Distributions
Taejin Jung, Ph.D. Week 8: Sampling Messages and People
Who and How And How to Mess It up
Sampling.
Sampling and Sample Size Determination
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
11 Populations and Samples.
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Sampling Methods.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
SINGLE VARIABLE DATA DEFINITIONS ETC. GENERAL STUFF STATISTICS IS THE PROCESS OF GATHERING, DISPLAYING, AND ANALYZING DATA. DATA CAN BE GATHERED BY CONDUCTING.
Sampling Moazzam Ali.
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Social Research Methods
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
Sample Design.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 13.
July, 2014 HQ Sampling. AGENDA 1. A brief Overview of Sampling 2. Types of Random Sampling Simple Random and Systematic Random 3. Types of Probability.
IB Business and Management
Sampling: Theory and Methods
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Shooting right Sampling methods FETP India. Competency to be gained from this lecture Select a sample from a population to generate precise and valid.
1 Sampling Techniques. 2 Introduction I l A unit is the entity that is of interest to us. l The population consists of all units of interest. l The population.
Under the Guidance of Dr. ADITHYA KUMARI H. Associate Professor DOS in Library and Information Science University of Mysore Mysore By Poornima Research.
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
Population and Sampling
MDM4U - Collecting Samples Chapter 5.2,5.3. Why Sampling? sampling is done because a census is too expensive or time consuming the challenge is being.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Population and sample. Population: are complete sets of people or objects or events that posses some common characteristic of interest to the researcher.
Slide 1 Copyright © 2004 Pearson Education, Inc. Misuses of Statistics  Bad Samples  Small Samples  Misleading Graphs  Pictographs  Distorted Percentages.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 1-1 A Population is the set of all items or individuals of interest.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Chapter 10 Sampling: Theories, Designs and Plans.
LIS 570 Selecting a Sample.
Chapter 7 Sampling Bryman: Social Research Methods: 3e Authored by Susie Scott.
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Ch. 11 SAMPLING. Sampling Sampling is the process of selecting a sufficient number of elements from the population.
SECTION B: METHODS OF INVESTIGATION (300 WORDS). Focus for Section B: Describe the Methods Used to Collect Data Questionnaires – What types of questions?
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 21 Chapter 1 Section 2 Observational Studies, Experiments,
CHAPTER FOUR SAMPLING PROCEDURES March 11, SAMPLING PROCEDURES –Population and Sampling –The Need for sampling –Characteristics of Good Sampling.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Statistics Definitions Part 2. Representative Sample For a sample to be representative of a population, it must possess the same characteristics as the.
Section 1.2 Random Samples. 2 Sampling Using a small group to represent the population Census includes the entire population usually not practical, and.
Sampling Design and Procedure
Sampling Dr Hidayathulla Shaikh. Contents At the end of lecture student should know  Why sampling is done  Terminologies involved  Different Sampling.
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Lecture 5.  It is done to ensure the questions asked would generate the data that would answer the research questions n research objectives  The respondents.
Social Research Methods
Chapter 8 SAMPLING and SAMPLING METHODS
Presentation transcript:

Introduction to Sampling for the Implementation of PATs Materials Developed by The IRIS Center at the University of Maryland

Advantages of Sampling In most cases, do not want to survey EVERYONE Why? Too costly Too time consuming Too many resources needed

Advantages of Sampling To make our work more cost-effective: Interview the minimum number needed Reduce: time cost human error

Survey Sampling According to sampling theory we can get valid results from studying only a fraction (a sample) of our clients, provided: the sample is REPRESENTATIVE of the qualities of our client POPULATION, and of sufficient SIZE to satisfy the assumptions of the statistical techniques used in our analysis

Simple Random Sampling For the sample to be representative, it must be obtained randomly. It is a simple random sample if each item in the population has an equal chance of being selected.

Types of Bias in Survey Process Poor randomization is not the only cause of biased samples. Bias and error are more often introduced by: poor group definition interviewer error inadequate records (incomplete or outdated client lists).

Longitudinal Design Longitudinal studies compare multiple clients at multiple points in time (at least two points in time). Often there is a baseline (when the client began the program) and an endline (two years later, for example).

Cross Sectional Design Cross sectional studies compare multiple clients in the program at one point in time. Ex: On October 1, 2005, program looks at: Incoming clients 2-year clients 4-year clients

Calculating Sample Size: How Big is Big Enough? Sample results are almost never identical to the entire population The larger the sample of clients, the greater the likelihood that the statistical analysis will yield “significant” results that closely resemble the entire client population.

Calculating Sample Size Different Views: Statistician – maximalist – at least 500 Field researcher – minimalist – at least 35 to 50 for each subgroup we want to analyze and compare USAID PAT – at least 300

Trade off: Larger sample is more accurate, but Trade off: Larger sample is more accurate, but costs more in time and money To make generalizations about entire population, need a total sample size of 200-400 (depending on total population and confidence level desired)

Sample Size Calculator Creative Research Systems: www.surveysystem.com/sscalc.htm  Population Size Confidence Level Confidence Interval Sample Size 1,000 5 95% 278 5,000 357 10,000 370 50,000 381 100,000 383 1,000,000 384

How to Sample Randomly? RANDOM = giving each client an equal chance to be selected This is done by: drawing numbers, as in a lottery numbering all clients and selecting numbers from a random number table systematically, by selecting every ‘nth’ case from a complete list of clients DANGER!!! The list may be biased by: who is left out—Is the list up-to-date?

Steps in Taking a Simple Random Sample Number a copy of the complete client list, and note the total number of clients (the last number) Decide on your sample size Create a list of random numbers Use Excel or a random number table to select the sample, matching the numbers from the table with those on your numbered client list.

Cluster Sampling To focus on specific subgroups, first classify the population into several subpopulations, called “strata,” then randomly sample from each stratum (subgroup).

Cluster Sampling Is a way of selecting randomly, when you have a geographically dispersed population when time is limited. This method can help reduce the time and cost in data collection. Group the clients into clusters (could be branches or loan groups). Randomly choose the clusters. Then sample random individuals from only some randomly chosen clusters.

Stratified Sampling Stratified survey sampling enables you to focus on specific groups (for example, women or rural people), ensuring that they will be represented in the sample. Although random survey sampling, done correctly, will give the researcher roughly proportional samples of all groups, disproportional stratified sampling will guarantee that a certain group is adequately represented.

Parametric Statistics Assumes that the distribution of values for your variables are normal (Bell Curve), and also relatively similar to each other. In parametric statistics, thirty is a “magic minimum number”--meaning that it is generally accepted as the minimum cell size for each stratum or subgroup of a simple sample.

Minimum for Each Subgroup 30 = ‘minimum magic number’ for each subgroup To do any statistical analysis between subgroups, need a minimum of 30 in each subgroup in order to have any chance at all of finding ‘significant’ differences. BUT, 30 is NOT enough for your total sample. 30 = ‘minimum magic number’ for each subgroup To have a chance at finding statistically significant differences between subgroups of the sample, you need a minimum of 30 cases in each subgroup. BUT, 30 is not enough for your total sample.

If you want to compare between subgroups, you need 35 in each cell Since the magic minimum number is 30, and you may have some missing values in some of your interview forms, for practical purposes, you need to always have a minimum number of 35 completed surveys for each cell of the sampling frame.

Handling Sampling Problems in the Field If you cannot interview the client who is sampled (not available, refusal, etc.) Sample ‘at least’ an extra 40% and have alternates available to be interviewed in each area (subgroup) Help ensure that you complete 35 questionnaires for each subgroup (if you plan to do additional analysis and compare subgroups) Make better use of the interviewers’ time Cannot interview the client who is sampled (not at home; sick; refusal; etc.) .We have sampled 25% extra for these situations .We need a minimum number from each area 45 Coast - 35 Mountains - 40 Plain Your team will need to do more than the minimum numbers because some of the forms will prove to be not useable and we must have the minimum numbers for statistical purposes

Example of a Sampling Frame Survey Sample Region 1 Region 2 Region 3 Total Clients interviewed 112 100 88 300 Substitute sample (approx 40%) 45 40 35 120 157 140 123 420

What if there are not enough with the 40% extra? Check with the sample tracking coordinator to give you new names B. If there is not time, the field supervisor must adjust in the field 1) Use random number table and select clients from master list that have not already been selected 2) If you do not have a random number table, can ask someone to pick a number between # and ## at random Do NOT introduce bias 3) Write down the changes that you made and how you did it A. Check with the sample tracking coordinator to give you new names B. If there is not time, the field unit coordinator must adjust in the field while keeping bias out. Should have a complete client list and a random number table. 1). Take the random number table and choose the ones on your center list that have not been selected 2). If you do not have a random number table, choose random numbers in your head and then see which clients are selected Do NOT introduce bias by just choosing the closest person or the nicest looking person or the person who has a store with cold drinks 3) write down the changes that you made and how you did it and give it to the sample tracking coordinator that afternoon.

An excerpt from a Random Number Table 32 50 92 46 24 69 48 93 77 87 47 17 29 36 55 81 34 70 46 99 27 95 04 69 59 71 30 74 42 36 45 11 49 20 50 86 16 75 80 55 33 98 93 66 76 13 56 08 38 43 12 11 01 21 41 13 87 08 47 98 64 61 65 94 30 17 51 54 45 85 41 22 96 26 64 38 09 93 01 49 43 06 09 24 42 23 23 21 65 14 95 76 09 00 24 54 15 04 34 41 58 61 05 09 82 97 30 78 89 23 44 66 18 71 83 08 21 74 18 91 Can use: www.random.org/nform.html

Random walk sampling -- less expensive but more prone to bias IF YOU DON’T HAVE A CLIENT LIST Random walk sampling -- less expensive but more prone to bias Watch out for “tarmac bias”, selecting only houses that are easily accessible from the road

Example of BDS Sampling Investigative emphasis: final beneficiaries. Will use three subsectors (irrigation, cashews, potable water). Will focus only on end users of the technologies. Will focus on region surrounding Ziguinchor.

Example of Business Development Services Sampling Sample size = 200 Casamance region is the focus Program has three sub-sectors Sample in each sub-sector stratified according to major differences between types of clients Irrigation – individual owners and group owners Cashew processing – shellers and peelers Potable water – tubewells and rope pumps (rural and peri-urban) Of the total country of Senegal, the regions where the largest concentration of clients and USAID dollars was Casamance. The program is divided into three main sub-sectors: irrigation pumps, cashew processing, and potable water pumps. The percentage of clients interviewed in each sub-sector will be relative to the percentage of the total program budget dedicated to each sub-sector

Example of BDS Sampling Generate a list of the direct clients and divide by subgroup # of clients per stratum or subgroup depends on percentage the stratum constitutes in sector Select clients using a random number list Each direct client will provide information to the interviewers so that they can create a list of end users from which some will be chosen according to a predetermined random number list. Program will generate a list of the direct clients and disaggregate using the strata identified The number of clients to be interviewed in each strata will be calculated with respect to the percentage that group constitutes in its sector A sample of direct clients will be chosen using a random number list. Each direct client will provide information to the interviewers so that they can create a list of end users from which some will be chosen according to a predetermined random number list. Irrigation: Individual owners will provide names of other users. 10 groups will provide names of group members from which a minimum of 2 from each will be chosen. Cashew processing: Unit owners will provide a list of workers. Potable water: 15 rope pump owners, disaggregated by urban and rural locations, will provide a list of households using the pumps.

BDS Sampling Framework Example Subsector Irrigation Cashew Potable Water Total Total Number of Beneficiaries 5,500 800 3,500 9,800 Percentage of Total Beneficiary Population 56% 9% 35% 100% Total Number of Beneficiaries to be interviewed for PAT implementation (based on 300 + 40% extra, or 420) 235 38 147 420   Type of Client Individual Group Shellers Peelers Tubewell Rope Pump Percentage of Total 65% 45% 55% 10% 90% Number to be interviewed 153 82 17 21 15 132 Rural Urban 30% 70% 40 92