Copyright 2010, The World Bank Group. All Rights Reserved. Part 1 Sample Design Produced in Collaboration between World Bank Institute and the Development.

Slides:



Advertisements
Similar presentations
1/26/00 Survey Methodology Sampling, Part 2 EPID 626 Lecture 3.
Advertisements

Statistics for Managers Using Microsoft® Excel 5th Edition
Economics 105: Statistics Review #1 due next Tuesday in class Go over GH 8 No GH’s due until next Thur! GH 9 and 10 due next Thur. Do go to lab this week.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting Part II.
Determining the Sample Plan
Multiple Indicator Cluster Surveys Survey Design Workshop
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
MISUNDERSTOOD AND MISUSED
Dr. Chris L. S. Coryn Spring 2012
Who and How And How to Mess It up
Sampling.
Why sample? Diversity in populations Practicality and cost.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
Responding driven sampling Principles of Sampling Session 1.
11 Populations and Samples.
A new sampling method: stratified sampling
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
Sampling Methods.
Formalizing the Concepts: Simple Random Sampling.
Sampling Procedures and sample size determination.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Learning Objective Chapter 11 Basic Sampling Issues CHAPTER eleven Basic Sampling Issues Copyright © 2000 by John Wiley & Sons, Inc.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
Sampling: Theory and Methods
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
CHAPTER 12 – SAMPLING DESIGNS AND SAMPLING PROCEDURES Zikmund & Babin Essentials of Marketing Research – 5 th Edition © 2013 Cengage Learning. All Rights.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
URBDP 591 A Lecture 11: Sampling and External Validity Objectives What is sampling? Types of Sampling –Probability sampling –Non-probability sampling External.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Basic Sampling Issues CHAPTER twelve.
Copyright 2010, The World Bank Group. All Rights Reserved. Part 2 Sample Design Produced in Collaboration between World Bank Institute and the Development.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 7-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
7: Sampling Theory and Methods. 7-2 Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. Hair/Wolfinbarger/Ortinau/Bush, Essentials.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Basic Business Statistics
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
1 Aspects of Sampling for Household Surveys Kathleen Beegle Workshop 17, Session 1c Designing and Implementing Household Surveys March 31, 2009.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
Sampling. Census and Sample (defined) A census is based on every member of the population of interest in a research project A sample is a subset of the.
PRESENTED BY- MEENAL SANTANI (039) SWATI LUTHRA (054)
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Sampling Design and Procedure
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Random Sampling Error and Sample Size are Related
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section B 1.
SAMPLE DESIGN.
Meeting-6 SAMPLING DESIGN
Presentation transcript:

Copyright 2010, The World Bank Group. All Rights Reserved. Part 1 Sample Design Produced in Collaboration between World Bank Institute and the Development Data Group (DECDG)

Copyright 2010, The World Bank Group. All Rights Reserved. Sampling and Total Survey Error Impractical to survey entire populations Samples are drawn, data collected, and estimates are made that won’t equal true population values Sampling Error is error arising solely from the sampling process (measure: variance) Surveys are also subject to biases from nonsampling errors such as nonresponse

Copyright 2010, The World Bank Group. All Rights Reserved. Key Sample Design Issues Probability based to allow valid analysis –Unbiased estimates (nearly) with 100% response –Ability to measure variance Frame that covers target population well –Ability to identify and access chosen units Analytic needs (What estimates are needed?) –How good do variances have to be? Geographic or other stratification

Copyright 2010, The World Bank Group. All Rights Reserved. Household Frame Worldwide, labor force surveys use samples of households (HHs) or housing units Target population of adults that may be in the labor force Difficult to sample persons directly But fairly easy to sample households –Some countries have HH lists –Areas frames used when no list

Copyright 2010, The World Bank Group. All Rights Reserved. Area Frames An ideal frame for a household survey: Is 100% Complete (includes all HHs) Has 0% destroyed/empty HHs Has 0% units that are not HHs at all Has 0% chance of duplicating a HH Has 100% ability to locate a sampled HH Area Frames are very good in this regard and are widely used (see diagram)

Copyright 2010, The World Bank Group. All Rights Reserved. 6 HHs missing from frame (Undercoverage) Target Population *extra HHs in frame Frame *Survey work can identify extra units in the sample so that you don’t use them in estimates (avoids overcoverage). HH Frame Coverage Target Population Covered by Frame

Copyright 2010, The World Bank Group. All Rights Reserved. 2-Stage Area Sampling First-Stage Sample of Areas Define areas 100% covering a country –Can be large or small –Enumeration Areas (EAs) of HH Information on each EA from last census is available –Select a probability sample of areas Second-Stage Sample of Households List HHs only in the 1 st -stage sampled areas –Select a probability sample of the listed HH

Copyright 2010, The World Bank Group. All Rights Reserved. Why 2-Stage Area Sampling? Area frames are typically used if: No list of households or housing units OR there may be a list that is not up-to-date (misses new construction) 1 st -stage geographic units that contain the households are readily available (such as enumeration areas)

Copyright 2010, The World Bank Group. All Rights Reserved. Why 2-Stage Area Sampling? Area frames: Areas: 100% complete with 0% duplication Up-to-date if households in the stage 1 units are re-listed Avoid difficulties of: –Creating a list –Supplementing list with new construction

Copyright 2010, The World Bank Group. All Rights Reserved. List Households or Housing Units? After listing HHs, time is needed to sample the households to be surveyed Interviewers return to the field after the 2 nd -stage sampling operation is completed It is necessary that the interviewer be given enough information to re-locate a sampled household

Copyright 2010, The World Bank Group. All Rights Reserved. List Households or Housing Units? Housing Unit (HU) is a physical address –List by observation –Find later from an address (number & street) HHs often listed in more rural countries –Atypical HUs, and problems finding HUs –More intense HH listing, obtain head of household –Defined using family ties & living arrangements –Locate later by head of household

Copyright 2010, The World Bank Group. All Rights Reserved. Use of Households (HHs) HUs can be difficult to identify in some areas and many addresses difficult to locate if they are sampled. Villages? Farms? Cities? Towns? HHs identified by head of household can readily be found after sampling. Up-to-date if sampling and surveying occur shortly after the listing operation. 12

Copyright 2010, The World Bank Group. All Rights Reserved. A Typical 2-Stage Design Major Stratification by Geography –Subnational data desired (states, regions, etc.) Minimum of about 300 responding HH in each –Administrative convenience Stage 1 sample of EAs, systematic PPES –PPES = Probability Proportional to Estimated Size Stage 2 sample of HHs, systematic –List HHs in the EAs selected in the first stage –Select a fixed number of HHs per EA (ex: 10 HHs)

Copyright 2010, The World Bank Group. All Rights Reserved. Why sample EAs PPES? Enumeration areas are relatively small (ex: HHs) But they differ in size –That makes Probability Proportional to Estimates Size sampling efficient EA size from the last census can be used –“old” so size is estimated and not actual Size should be related to the labor force –# HHs, total population, # adults, etc.

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sampling EAs PPES Small geographic area of a country with only 7 EAs and 2,000 estimated HHs  sample 2 Introduce concept of Sampling Interval (SI) –In this example SI = 1,000 = 2,000/2 the total # estimated HHs divided by the # sample Eas Random number (rn) determines selections –rn is between 0 and 1 (ex.423) –1 st selection compute: SI * rn = 1,000 *.423 = 423 –2 nd selection compute SI * rn + SI = 423+1,000 = 1,423

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sampling EAs PPES EA HHest CumHH choose 2 EAs marked by * EA EA * The first EA with CumHH EA greater than 423 is EA2 EA EA * The first EA with CumHH EA greater than 1,423 is EA5 EA

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sampling EAs PPES HHest is the estimated size of each EA –The # HH from the last census CumHH column cumulates Hhest 2 EAs selected are the ones that first exceed the calculated numbers 423 and 1,423 –1st selection: EA2 CumHH of 550 exceeds 423 –2 nd selection: EA5 CumHH of 1,500 exceeds 1,423

Copyright 2010, The World Bank Group. All Rights Reserved. Example: EA Probabilities EA Probability of selection: HHest/SI The EA sizes are estimated –but the probabilities are valid –probabilities sum to 2, the sample size –As actual EA sizes become known for the sampled EAs, DO NOT change the probabilities Save the probabilities of sampled EAs

Copyright 2010, The World Bank Group. All Rights Reserved. Example: EA Probabilities EA HHest PPES probabilities EA = 350/1000 EA2* = 200/1000  save! EA = 400/1000 EA = 250/1000 EA5* = 300/1000  save! EA = 350/1000 EA = 150/

Copyright 2010, The World Bank Group. All Rights Reserved. Example: EA Weights Weight -- inverse of a sampling probability –In this case only of the EA probability EA2 probability.2 weight 5 = 1/.2 EA5 probability.3 weight = 1/.3 Simple check of EA probabilities –HHest * weight should equal the SI EA2 HHest *5 = 1,000 = SI EA5 HHest * = 1,000 = SI

Copyright 2010, The World Bank Group. All Rights Reserved. Example: List & Sample HHs Listing only needed for sampled EA2 & EA5 Select fixed sample n = 10 HH in each EA –Common range for n is HH per EA Sample HHs systematically from an EA –HHs not random, but periodicity is not a problem –HHs are listed in simple geographic order –HH sort by economic variables can add efficiency (rich/poor? urban/rural?)

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sample HH from EAs EA HHest PPES HH n HHprob EA EA HH is the actual number of HHs listed Probabilities of selection are based on HH ED2 HHprob = n/HH = 10/210 = ED5 HHprob = n/HH = 10/280 =

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sample HH from EA2 Calculate a sampling interval (SI) for EA2 –SI = HH/n = 210/10 = 21 –Choose a random start from 1-21, ex: rs = 12 The HHs are listed 1, 2, 3, 4, 5, …, 210 Choose rs, rs+SI, rs+2SI, rs+3SI, …, rs+9SI 12 th 33 rd 54 th 75 th 96 th 117 th 138 th 159 th 180 th 201 st

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Sample HH from EA5 Calculate a sampling interval for EA5 –SI = HH/n = 280/10 = 28 –Choose a random start from 1-28, ex: rs = 6 The HHs are listed 1, 2, 3, 4, 5, …, 280 Choose rs, rs+SI, rs+2SI, rs+3SI, …, rs+9SI 6 th 34 th 62 nd 90 th 118 th 146 th 174 th 202 nd 230 th 258 th

Copyright 2010, The World Bank Group. All Rights Reserved. Example: Total 2-Stage Probability Multiply the probabilities from each stage EA HHest PPES HH n HHprob EA EA Multiply the probabilities PPES * HHprob and the inverse is the weight for EA2.200  = weight 105 for EA5.300  = weight

Copyright 2010, The World Bank Group. All Rights Reserved. Self Weighting Samples Probabilities and weights in the example are different for EA2 and EA5 Can modify HH sampling to make the probabilities and weights equal (almost) Samples where all weights are equal (or all weights in a stratum) are called self-weighting samples. Small gains in efficiency

Copyright 2010, The World Bank Group. All Rights Reserved. Contribute! That was not too difficult, was it? Start a VSS discussion on sampling! Visit 27