Published by Edith Howard. Modified over 9 years ago.
1
SAMPLE DESIGN: WHO WILL BE IN THE SAMPLE? Lu Ann Aday, Ph.D. The University of Texas School of Public Health
2
SAMPLE DESIGN: Key Components
Target population or universe: group about which information is desired
Sampling frame: operational definition of the target population, which should directly match the target population, e.g., an existing or constructed list of individuals from which the sample would actually be drawn
Sample elements: types of individuals or units that will be drawn; the ultimate sampling unit is the final sampling unit, which is usually the focus of the analysis, e.g., individuals
3
SAMPLE DESIGN: Types of Designs
Probability sample: Relies on laws of chance to pick the sample, where the probability of selection is known, i.e., based on the sampling fraction n/N (sample size n divided by population size N)
Nonprobability sample: Relies on human judgment to pick the sample
4
SAMPLE DESIGN: Types of Nonprobability Designs
Purposive: Pick people for a certain purpose, e.g., focus groups
Quota: Pick a target number of people in certain categories, e.g., women 18-35
Chunk: Pick a convenient “chunk” of people, e.g., church attendees
Volunteer: Ask for volunteers, e.g., healthy male medical students
Snowball: Identify a small number of individuals representative of the population of interest, who then identify others that meet the same inclusion criteria, e.g., drug users
5
SAMPLE DESIGN: Types of Probability Designs
Simple random sample
Systematic random sample
Stratified sample
Cluster sample
6
SAMPLE DESIGN: Simple Random Sample Definition: Every unit in the population has a known, nonzero, and equal chance of being selected through a lottery-type procedure
7
SAMPLE DESIGN: Simple Random Sample Procedures
Draw the sample randomly from numbers assigned to sampling elements placed in a sampling “urn”
OR
Use a random numbers table to identify the sampling elements to be included
OR
Use computer software to randomly select the sample from a computerized sampling frame
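The software option can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed tool: the frame of elements numbered 1-50, the sample size of 10, and the function name `simple_random_sample` are assumptions chosen to match the examples used elsewhere in the lecture.

```python
import random

def simple_random_sample(frame, n, seed=None):
    """Draw n elements without replacement.

    Every element of the frame has the same chance, n/N, of being selected.
    """
    rng = random.Random(seed)  # seed only makes the draw repeatable
    return rng.sample(frame, n)

# Hypothetical computerized sampling frame: elements numbered 1-50
frame = list(range(1, 51))
sample = simple_random_sample(frame, 10, seed=1)
print(sorted(sample))
```

Drawing without replacement means no element can appear twice, which is the software analogue of not returning a number to the urn.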
8
RANDOM NUMBERS TABLE: Example
1-Select random starting point “X”
2-Look at 1st two digits of random numbers
3-Proceed from left to right through table to identify elements from sampling frame (numbered 1-50) until the target sample size (n), e.g., 10, has been reached
 91567  42595 X27958  30134  04024
 17955  56349  90999  49127  20044
 46503  18584  18845  49618  02304
 92157  89634  94824  78171  84610
 14577  62765  35065  81263  39667
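The table walk can be traced in code. The sketch below rests on several assumptions that the slide does not state: the digits are read as five-digit groups, the starting point “X” is the third group (27958), values outside 1-50 are skipped, and a value that repeats is skipped so that no element is drawn twice. The function name `select_from_table` is hypothetical.

```python
# Random numbers from the slide, read as five-digit groups (an assumption)
TABLE = [
    "91567 42595 27958 30134 04024",
    "17955 56349 90999 49127 20044",
    "46503 18584 18845 49618 02304",
    "92157 89634 94824 78171 84610",
    "14577 62765 35065 81263 39667",
]

def select_from_table(rows, start_index, frame_size, n):
    """Read left to right from start_index, using the first two digits."""
    numbers = [num for row in rows for num in row.split()]
    chosen = []
    for num in numbers[start_index:]:
        element = int(num[:2])
        # Skip values outside the frame and elements already selected
        if 1 <= element <= frame_size and element not in chosen:
            chosen.append(element)
        if len(chosen) == n:
            break
    return chosen

picked = select_from_table(TABLE, start_index=2, frame_size=50, n=10)
print(picked)  # [27, 30, 4, 17, 49, 20, 46, 18, 2, 14]
```

Under these assumptions the tenth element is reached at 14577, the first number in the last row.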
9
SAMPLE DESIGN: Systematic Random Sample Definition: Variation of simple random sample selected by randomly choosing a starting point and then taking every k-th unit thereafter, where the sampling interval k = N/n is the inverse of the sampling fraction n/N
10
SAMPLE DESIGN: Systematic Random Sample Procedures
1-Determine the sampling interval required to sample the required number of cases, based on the sampling fraction: n/N, e.g., 10/50 = 1/5, which gives an interval of N/n = 5
2-Select a random starting point “X” within the first sampling interval, e.g., elements 1-5
3-Starting at “X”, sample every (N/n)th case, e.g., every 5th case, from the sampling frame until the target sample size (n), e.g., 10, has been reached
11
SYSTEMATIC RANDOM SAMPLE: Example, e.g., n/N = 10/50 = 1/5 (20%)
  1     11     21     31     41
  2     12     22     32     42
  3 X   13 X   23 X   33 X   43 X
  4     14     24     34     44
  5     15     25     35     45
  6     16     26     36     46
  7     17     27     37     47
  8 X   18 X   28 X   38 X   48 X
  9     19     29     39     49
 10     20     30     40     50
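The systematic procedure can be sketched as follows. This is a minimal illustration that assumes N is an exact multiple of n, so the interval N/n is a whole number; the function name `systematic_sample` and the fixed start of 3 (matching the marked elements in the example) are assumptions.

```python
import random

def systematic_sample(frame, n, start=None, seed=None):
    """Take every k-th element, where k = N/n is the sampling interval."""
    N = len(frame)
    k = N // n  # assumes N is an exact multiple of n
    if start is None:
        # Random start within the first interval (positions 1..k)
        start = random.Random(seed).randint(1, k)
    return frame[start - 1::k][:n]

frame = list(range(1, 51))  # elements numbered 1-50
selected = systematic_sample(frame, 10, start=3)
print(selected)  # [3, 8, 13, 18, 23, 28, 33, 38, 43, 48]
```

Only the starting point is random; once it is fixed, the rest of the sample is fully determined by the order of the frame, which is why a periodic ordering of the frame can bias the result.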
12
SAMPLE DESIGN: Stratified Sample
Definition: Sample based on dividing the population into homogeneous strata and drawing a random-type sample separately from each stratum
Proportionate: Use the same sampling fraction in each stratum
Disproportionate: Use a different sampling fraction in each (or selected) stratum
13
SAMPLE DESIGN: Stratified Sample Procedures
1-Order or group the sampling frame by relevant strata
2-Determine the sampling interval required to sample the required number of cases, based on the sampling fraction
3-Select a random starting point “X” within the first sampling interval
4-Starting at “X”, sample every (N/n)th case from the sampling frame until the target sample size (n) has been reached
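The four steps amount to sorting the frame by stratum and then sampling systematically through the sorted list. The sketch below uses a made-up frame of 100 elements in three strata (20 in A, 60 in B, 20 in C); the frame, the function name `stratified_systematic_sample`, and the fixed start of 4 are all assumptions for illustration.

```python
from collections import Counter

def stratified_systematic_sample(frame, stratum_of, n, start):
    """Order the frame by stratum, then take every k-th element."""
    ordered = sorted(frame, key=stratum_of)  # step 1: group by stratum
    k = len(ordered) // n                    # step 2: sampling interval N/n
    return ordered[start - 1::k][:n]         # steps 3-4: start at X, every k-th

# Hypothetical frame of (id, stratum) pairs with the strata interleaved
frame = [(i, "ABBBC"[i % 5]) for i in range(100)]
sample = stratified_systematic_sample(frame, lambda e: e[1], n=10, start=4)

counts = Counter(stratum for _, stratum in sample)
print(dict(counts))  # {'A': 2, 'B': 6, 'C': 2}
```

Because a single interval is applied across the sorted frame, each stratum contributes in proportion to its size, which corresponds to the proportionate case.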
14
STRATIFIED SAMPLE: Example - Proportionate, e.g., n/N = 1/20 (5%) in all strata
STRATA   N (%)         n/N    n (%)
A        500 (5%)      1/20   25 (5%)
B        3000 (30%)    1/20   150 (30%)
C        2000 (20%)    1/20   100 (20%)
D        500 (5%)      1/20   25 (5%)
E        700 (7%)      1/20   35 (7%)
F        1600 (16%)    1/20   80 (16%)
G        700 (7%)      1/20   35 (7%)
H        1000 (10%)    1/20   50 (10%)
Total    10000                500
15
STRATIFIED SAMPLE: Example - Disproportionate, e.g., n/N = 1/20 (5%) in strata B, C, F, H & 1/10 (10%) in strata A, D, E, G
STRATA   N (%)         n/N    n (%)
A        500 (5%)      1/10   50 (8.1%)
B        3000 (30%)    1/20   150 (24.2%)
C        2000 (20%)    1/20   100 (16.1%)
D        500 (5%)      1/10   50 (8.1%)
E        700 (7%)      1/10   70 (11.3%)
F        1600 (16%)    1/20   80 (12.9%)
G        700 (7%)      1/10   70 (11.3%)
H        1000 (10%)    1/20   50 (8.1%)
Total    10000                620
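The arithmetic behind the two allocation examples can be checked with a short script. The stratum sizes and sampling fractions come from the examples; the function names `allocate` and `shares` are hypothetical.

```python
from fractions import Fraction

# Stratum population sizes from the examples
STRATA_N = {"A": 500, "B": 3000, "C": 2000, "D": 500,
            "E": 700, "F": 1600, "G": 700, "H": 1000}

def allocate(sizes, fraction_for):
    """Sample size per stratum: n_h = N_h * (sampling fraction in stratum h)."""
    return {s: int(N_h * fraction_for(s)) for s, N_h in sizes.items()}

def shares(counts):
    """Each stratum's percentage of the total, rounded to one decimal."""
    total = sum(counts.values())
    return {s: round(100 * c / total, 1) for s, c in counts.items()}

# Proportionate: 1/20 in every stratum
prop = allocate(STRATA_N, lambda s: Fraction(1, 20))

# Disproportionate: 1/10 in strata A, D, E, G and 1/20 elsewhere
disp = allocate(STRATA_N,
                lambda s: Fraction(1, 10) if s in "ADEG" else Fraction(1, 20))

print(sum(prop.values()), shares(prop))  # total 500; shares match the population
print(sum(disp.values()), shares(disp))  # total 620; oversampled strata gain share
```

With one fraction throughout, each stratum's share of the sample equals its share of the population. With mixed fractions, the oversampled strata A, D, E, and G take a larger share of the sample than of the population.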
16
SAMPLE DESIGN: Cluster Sample Definition: Sample based on dividing the population into heterogeneous clusters, drawing a sample of clusters, and then drawing a random-type sample separately within each sampled cluster
17
CLUSTER SAMPLE: Example: Probability Proportionate to Size (PPS) (Aday & Cornelius, 2006, Table 6.2) (continued in next lecture)
Block A: 100 HUs*   Block F: 250 HUs*   Block K: 200 HUs*
Block B: 50 HUs     Block G: 125 HUs*   Block L: 300 HUs*
Block C: 75 HUs     Block H: 50 HUs     Block M: 125 HUs
Block D: 150 HUs*   Block I: 100 HUs*   Block N: 150 HUs*
Block E: 200 HUs*   Block J: 50 HUs     Block O: 275 HUs*
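One common way to select clusters with probability proportionate to size is to sample systematically along the cumulative size totals. The sketch below illustrates that general technique using the block sizes listed above. It is not the procedure from the cited table, which is deferred to the next lecture, and it does not attempt to reproduce the asterisks. The number of blocks drawn (5), the start value (100), and the function name `pps_systematic` are assumptions.

```python
from bisect import bisect_left
from itertools import accumulate

# Block sizes (HUs) as listed in the example
BLOCKS = {"A": 100, "B": 50, "C": 75, "D": 150, "E": 200,
          "F": 250, "G": 125, "H": 50, "I": 100, "J": 50,
          "K": 200, "L": 300, "M": 125, "N": 150, "O": 275}

def pps_systematic(sizes, m, start):
    """Select m clusters; larger clusters span more of the cumulative range."""
    names = list(sizes)
    cumulative = list(accumulate(sizes.values()))  # running total of sizes
    interval = cumulative[-1] / m                  # total size / clusters to draw
    points = [start + i * interval for i in range(m)]
    # Each selection point falls inside exactly one cluster's cumulative range
    return [names[bisect_left(cumulative, p)] for p in points]

chosen = pps_systematic(BLOCKS, m=5, start=100)
print(chosen)  # ['A', 'E', 'H', 'L', 'N']
```

In practice the start would be drawn at random between 1 and the interval (here 2200 / 5 = 440). A block larger than the interval could be hit more than once, which is one reason a small number of blocks is used in this sketch.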
18
CRITERIA FOR EVALUATING SAMPLE DESIGNS
Precision: how close the estimates derived from the sample are to the true population value as a function of variable sampling error
Accuracy: how close the estimates derived from the sample are to the true population value as a function of systematic sampling error (bias)
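A small simulation can make the distinction concrete. In the sketch below, the population is simply the values 1-50. Repeated simple random samples of 10 are drawn first from a complete frame and then from a hypothetical frame that omits elements 41-50. The spread of the sample means reflects variable error (precision), and the gap between their average and the true mean reflects systematic error (accuracy). All names and settings are assumptions for illustration.

```python
import random
from statistics import mean, stdev

def sample_means(frame, n, draws, rng):
    """Mean of each of `draws` simple random samples of size n."""
    return [mean(rng.sample(frame, n)) for _ in range(draws)]

rng = random.Random(0)  # fixed seed so the run is repeatable
population = list(range(1, 51))
true_mean = mean(population)  # 25.5

frames = {
    "full frame": population,
    "short frame": population[:40],  # hypothetical frame missing elements 41-50
}

results = {}
for label, frame in frames.items():
    estimates = sample_means(frame, n=10, draws=2000, rng=rng)
    bias = mean(estimates) - true_mean  # systematic error
    spread = stdev(estimates)           # variable sampling error
    results[label] = (bias, spread)
    print(f"{label}: bias={bias:.2f}, spread={spread:.2f}")
```

The short frame yields estimates that cluster tightly but around the wrong value: they are precise but not accurate. Drawing a larger sample from the same frame would narrow the spread without removing the bias.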
19
CRITERIA FOR EVALUATING SAMPLE DESIGNS (cont.)
Complexity: number of stages and steps required to implement the sample design
Efficiency: obtaining the most accurate and precise estimates at the lowest possible cost
20
ADVANTAGES & DISADVANTAGES: Simple Random Sample
ADVANTAGES: Requires little knowledge of population in advance
DISADVANTAGES: May not capture certain groups of interest; may not be very efficient
21
ADVANTAGES & DISADVANTAGES: Systematic Random Sample
ADVANTAGES: Easy to analyze and compute sampling (standard) errors; high precision
DISADVANTAGES: Periodic ordering of elements in the sampling frame may create biases in the data; may not capture certain groups of interest; may not be very efficient
22
ADVANTAGES & DISADVANTAGES: Stratified Sample
ADVANTAGES: Enables certain groups of interest to be captured; enables disproportionate sampling within strata; highest precision
DISADVANTAGES: Requires knowledge of population in advance; may introduce more complexity in analyzing data and computing sampling (standard) errors
23
ADVANTAGES & DISADVANTAGES: Cluster Sample
ADVANTAGES: Lowers field costs; enables sampling of groups of individuals for which detail on the individuals themselves may not be available
DISADVANTAGES: Introduces more complexity in analyzing data and computing sampling (standard) errors; lowest precision