Presentation is loading. Please wait.

Presentation is loading. Please wait.

SAMPLE DESIGN: WHO WILL BE IN THE SAMPLE? Lu Ann Aday, Ph.D. The University of Texas School of Public Health.

Similar presentations


Presentation on theme: "SAMPLE DESIGN: WHO WILL BE IN THE SAMPLE? Lu Ann Aday, Ph.D. The University of Texas School of Public Health."— Presentation transcript:

1 SAMPLE DESIGN: WHO WILL BE IN THE SAMPLE? Lu Ann Aday, Ph.D. The University of Texas School of Public Health

2 SAMPLE DESIGN: Key Components Target Population or Universe: group about which information is desired Sampling frame: operational definition of the target population which directly matches the target population, e.g., existing or constructed list of individuals from which the sample would actually be drawn Sample elements: types of individuals or units that will be drawn, i.e., ultimate sampling unit refers to final sampling unit that is usually the focus of the analysis, e.g., individuals

3 SAMPLE DESIGN: Types of Designs Probability Sample: Relies on laws of chance to pick the sample, where probability of selection is known, i.e., based on sampling fraction: n/N Nonprobability Sample: Relies on human judgment to pick the sample

4 SAMPLE DESIGN: Types of Nonprobability Designs Purposive: Pick people for certain purpose, e.g., focus groups Quota: Pick target number of people in certain categories, e.g., women 18-35 Chunk: Pick convenient “chunk” of people, e.g., church attendees Volunteer: Ask for volunteers, e.g., healthy male medical students Snowball: Identify small number of individuals representative of the population of interest, who then identify others that meet the same inclusion criteria, e.g., drug users

5 SAMPLE DESIGN: Types of Probability Designs Simple random sample Systematic random sample Stratified sample Cluster sample

6 SAMPLE DESIGN: Simple Random Sample Definition: Every unit in the population has a known, nonzero, and equal chance of being selected through a lottery-type procedure

7 SAMPLE DESIGN: Simple Random Sample Procedures Draw sample randomly from numbers assigned to sampling elements placed in a sampling “urn” OR Use a random numbers table to identify sampling elements to be included OR Use computer software to randomly select sample from computerized sampling frame

8 RANDOM NUMBERS TABLE: Example: 1-Select random starting point “X”; 2-Look at 1 st two digits of random numbers; 3-Proceed from left to right through table to identify elements from sampling frame (numbered 1-50) until the target sample size (n), e.g., 10, has been reached. 9156742595 X279583013404024 1795556349909994912720044 4650318584188454961802304 9215789634948247817184610 1457762765350658126339667

9 SAMPLE DESIGN: Systematic Random Sample Definition: Variation of simple random sample selected through randomly selecting a starting point and then taking every n’th unit thereafter, based on the sampling fraction

10 SAMPLE DESIGN: Systematic Random Sample Procedures 1-Determine the sampling interval required to sample the required number of cases, based on the sampling fraction: n/N, e.g, 10/50 = 1/5 2-Select a random starting point “X” within the first sampling interval, e.g., elements 1-5 3-Starting at “X”, sample every n/Nth case from the sampling frame until the target sample size (n), e.g., 10, has been reached

11 SYSTEMATIC RANDOM SAMPLE: Example, e.g., n/N=10/50 = 1/5 (20%) 111213141 212223242 3 X13 X23 X33 X43 X 414243444 515253545 616263646 717273747 8 X18 X28 X38 X48 X 919293949 1020304050

12 SAMPLE DESIGN: Stratified Sample Definition: Sample based on dividing the population into homogeneous strata and drawing random-type sample separately from all the strata Proportionate: Use same sampling fraction in each stratum Disproportionate: Use different sampling fraction in each (or selected) stratum

13 SAMPLE DESIGN: Stratified Sample Procedures 1-Order or group the sampling frame by relevant strata 2-Determine the sampling interval required to sample the required number of cases, based on the sampling fraction 3-Select a random starting point “X” within the first sampling interval 4-Starting at “X”, sample every n/Nth case from the sampling frame until the target sample size (n) has been reached

14 STRATIFIED SAMPLE: Example- Proportionate, e.g., n/N=1/20 (5%) in all strata STRATAN (%)n/Nn (%) A500 (5%)1/2025 (5%) B3000 (30%)1/20150 (30%) C2000 (20%)1/20100 (20%) D500 (5%)1/2025 (5%) E700 (7%)1/2035 (7%) F1600 (16%)1/2080 (16%) G700 (7%)1/2035 (7%) H1000 (10%)1/2050 (10%) 10000500

15 STRATIFIED SAMPLE: Example- Disproportionate, e.g., n/N=1/20 (5%) in strata B,C,F,H & 1/10 (10%) in strata A,D,E,G STRATAN (%)n/Nn (%) A500 (5%)1/1050 (8.1%) B3000 (30%)1/20150 (24.2%) C2000 (20%)1/20100 (16.1%) D500 (5%)1/1050 (8.1%) E700 (7%)1/1070 (11.3%) F1600 (16%)1/2080 (12.9%) G700 (7%)1/1070 (11.3%) H1000 (10%)1/2050 (8.1%) 10000620

16 SAMPLE DESIGN: Cluster Sample Definition: Sample based on dividing the population into heterogeneous clusters and drawing random-type sample separately from sample of clusters

17 CLUSTER SAMPLE: Example—Probability Proportionate to Size (PPS) (Aday & Cornelius, 2006, Table 6.2) (continued in next lecture) Block A: 100 HUs*Block F: 250 HUs*Block K: 200 HUs* Block B: 50 HUsBlock G: 125 HUs*Block L: 300 HUs* Block C: 75 HUsBlock H: 50 HUsBlock M: 125 HUs Block D: 150 HUs*Block I: 100 HUs*Block N: 150 HUs* Block E: 200 HUs*Block J: 50 HUsBlock O: 275 HUs*

18 CRITERIA FOR EVALUATING SAMPLE DESIGNS Precision—how close the estimates derived from the sample are to the true population value as a function of variable sampling error Accuracy—how close the estimates derived from the sample are to the true population value as a function of systematic sampling error (bias)

19 CRITERIA FOR EVALUATING SAMPLE DESIGNS (cont.) Complexity—number of stages and steps required to implement the sample design Efficiency—obtaining the most accurate and precise estimates at the lowest possible costs

20 ADVANTAGES & DISADVANTAGES: Simple Random Sample ADVANTAGES Requires little knowledge of population in advance DISADVANTAGES May not capture certain groups of interest May not be very efficient

21 ADVANTAGES & DISADVANTAGES: Systematic Random Sample ADVANTAGES Easy to analyze and compute sampling (standard) errors High precision DISADVANTAGES Periodic ordering of elements in sample frame may create biases in the data May not capture certain groups of interest May not be very efficient

22 ADVANTAGES & DISADVANTAGES: Stratified Sample ADVANTAGES Enables certain groups of interest to be captured Enables disproportionate sampling within strata Highest precision DISADVANTAGES Requires knowledge of population in advance May introduce more complexity in analyzing data and computing sampling (standard) errors

23 ADVANTAGES & DISADVANTAGES: Cluster Sample ADVANTAGES Lowers field costs Enables sampling of groups of individuals for which detail on individuals themselves may not be available DISADVANTAGES Introduces more complexity in analyzing data and computing sampling (standard) errors Lowest precision


Download ppt "SAMPLE DESIGN: WHO WILL BE IN THE SAMPLE? Lu Ann Aday, Ph.D. The University of Texas School of Public Health."

Similar presentations


Ads by Google