MA151 Lecture 2: Sampling methods

Slides:



Advertisements
Similar presentations
Sampling.
Advertisements

Sampling ADV 3500 Fall 2007 Chunsik Lee. A sample is some part of a larger body specifically selected to represent the whole. Sampling is the process.
Producing Data: Sampling BPS chapter 8 © 2006 W.H. Freeman and Company.
 Sampling Design Unit 5. Do frog fairy tale p.89 Do frog fairy tale p.89.
Sample Surveys.  The first idea is to draw a sample. ◦ We’d like to know about an entire population of individuals, but examining all of them is usually.
Chapter 12 Notes Surveys, Sampling, & Bias Examine a Part of the Whole: We’d like to know about an entire population of individuals, but examining all.
Sampling Design Notes Pre-College Math.
Sampling. Sampling Can’t talk to everybody Select some members of population of interest If sample is “representative” can generalize findings.
Section 5.1 Designing Samples Malboeuf AP Statistics, Section 5.1, Part 1 3 Observational vs. Experiment An observational study observes individuals.
Population and sample. Population: are complete sets of people or objects or events that posses some common characteristic of interest to the researcher.
Section 5.1 Continued.  A simple random sample (SRS) of size n contains n individuals from the population chosen so that every set of n individuals has.
AP Review #4: Sampling & Experimental Design. Sampling Techniques Simple Random Sample – Each combination of individuals has an equal chance of being.
Chapter 2 Lesson 2.2a Collecting Data Sensibly 2.2: Sampling.
Section 5.1 Designing Samples AP Statistics
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Producing Data: Sampling BPS chapter 8 © 2010 W.H. Freeman and Company.
Chapter 5 Sampling: good and bad methods AP Standards Producing Data: IIB4.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Get out homework. Get out notes.. SECTION 5.1 CONTINUED Designing Samples.
Sample Design AP Statistics. Quick definitions Response Variable / Dependent Variable (the output) Explanatory Variables /Independent Variables (input)
Chapter 11 Sample Surveys. How do we gather data? Surveys Opinion polls Interviews Studies –Observational –Retrospective (past) –Prospective (future)
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Ten percent of U. S. households contain 5 or more people
Chapter 12 Sample Surveys.
Chapter 12 Sample Surveys.
Sample Surveys.
Sampling From Populations
Sampling and Experimentation
Chapter 12 Sample Surveys
Sampling.
Sampling Procedures Cs 12
Sources of Bias 1. Voluntary response 2. Undercoverage 3. Nonresponse
Sampling Design.
Part III – Gathering Data
Section 5.1 Designing Samples
Chapter 10 Samples.
Statistics – Chapter 1 Data Collection
Graduate School of Business Leadership
Population and samples
Sample Surveys Chapter 11.
Bias On-Level Statistics.
SAMPLING (Zikmund, Chapter 12.
CHAPTER 12 Sample Surveys.
Honors Statistics Chapter 12 Part 2
Chapter 12 Sample Surveys
Producing Data: Sampling
Inference for Sampling
Chapter 4 Sampling Design.
Welcome.
Sampling Population – any well-defined set of units of analysis; the group to which our theories apply Sample – any subset of units collected in some manner.
Section 5.1 Designing Samples
Sampling and Surveys How do we collect data? 8/20/2012.
Unit 1 Day 4 Biased or Unbiased in Sampling
Producing Data Chapter 5.
Chapter 12 Sample Surveys.
Sampling and Study Design
WARM – UP Use LINE 5 of the random digit table. 30. The World Series.
Warm Up Imagine you want to conduct a survey of the students at Leland High School to find the most beloved and despised math teacher on campus. Among.
SAMPLING (Zikmund, Chapter 12).
Section 5.1 Designing Samples
Chapter 5: Producing Data
MATH 2311 Section 6.1.
Chapter 5: Producing Data
Chapter 5 Producing Data.
Chapter 3 producing data
Chapter 4: Designing Studies
10/18/ B Samples and Surveys.
Designing Samples Section 5.1.
EQ: What is a “random sample”?
Presentation transcript:

MA151 Lecture 2: Sampling methods Grazyna Badowski University of Guam gbadowski@uguam.uog.edu

Population and sample Population is the entire collection of subjects of interest in a study. You need to specify clearly the population. Sample is the smaller group from the population on which the study collects data.

Sample Sample should be representative of population. The distribution of a sample should be similar to the distribution of the population. A sample that does not represent the population in some important way is said to be biased. How do you choose a representative sample?

Theoretical vs. accessible population Theoretical population is the population you would like to generalize your results to. And usually it is not the population that will be accessible to you. Source: http://www.socialresearchmethods.net/kb/sampprob.php

Sampling Frame The listing of the accessible population from which you'll draw your sample is called the sampling frame. Examples: telephone book, voters’ registration, list of students, patients’ registry

Probability Sampling Methods Simple Random Sampling Stratified Random Sampling Cluster Sampling Systematic Sampling Multistage Sampling

Probability Sampling Methods A probability sampling method is any method of sampling that utilizes some form of random selection. In order to have a random selection method, you must set up some process or procedure that assures that the different units in your population have equal probabilities of being chosen.

Simple Random Sampling Each possible sample of size n has the same chance of being selected. We need the list of all subjects in the population - sampling frame, and use random numbers to draw sample. SRS might not get good representation of subgroups in a population. SRS is simple and easy to explain, it is reasonable to generalize the results. But we might not get goo

Stratified Random Sampling First divide the population into separate groups, called strata, and then select a SRS from each stratum. Source: http://www.socialresearchmethods.net/kb/sampprob.php

Stratified Sampling – cont. It is useful when the strata are groups that the study will compare. For example, your strata could be ethnicity. The sampling is proportional if the proportions of the sample chosen are the same as those in the entire population.

Stratified Random Sampling Advantages Individual estimates for each stratum. Can get more accurate estimate of parameter if variable is less variable within each strata than within the whole population. Can use disproportional sampling when one stratum small. (weights) Use different interviewers within each strata.

Cluster Sampling The population is divided into groups called clusters, we select a random sample of clusters and measure only those clusters. Advantage: you only need a list of clusters, instead of a list of all individuals. If each cluster fairly represents the full population, cluster sampling will give us an unbiased sample.

Cluster vs. Stratified Sampling We stratify to ensure that our sample represents different groups in the population; strata are homogeneous, but differ from one another. Clusters are alike, each heterogeneous and resembling the overall population. We select clusters to make sampling more practical/affordable. For example, random sample of city blocks and then sample every family on the block.

Systematic Sampling We draw a sample by selecting individuals systematically. For example you might choose every 50th person on a list. You start with randomly selected individual. Subjects have to be randomly ordered.

Multistage Sampling Combined sampling methods. Example: sampling students in grade schools 1. Stratified sample from national school districts. 2. SRS of schools within districts. 3. SRS of classes within schools.

Difficulties in Sampling Wrong sampling frame. An SRS from an incomplete sampling frame introduces bias because the individuals included may differ from the ones not in the frame. People in prison, homeless, students, travelers.

Difficulties in Sampling 2. Not reaching selected individuals/low response rate. The problem is that those who don’t respond may differ from those who do. It is better to put resources into getting a smaller sample and ensure a high response rate. (revisit)

Disasters Voluntary response sample. viewers’/listeners’ opinions Convenience sampling: include individuals who are convenient. streetcorner interview

Literary Digest Poll of 1936 In 1936, the Literary Digest, predicted that Landon would beat FDR in that year's election by 57 to 43 percent.  The Digest mailed over 10 million questionnaires to names drawn from lists of automobile and telephone owners, and over 2.3 million people responded . At the same time Gallup sampled only 50,000 people and predicted that Roosevelt would win.  Roosevelt won with 62% of the vote.  Two mistakes: sampling frame wrong Low response rate, only 23% Telephone lists, club and associations membership

Biases in Random Samples Randomization doesn’t correct for certain problems with sampling Bias 1: Undercoverage: some groups in the population are left out of the process of choosing the sample Bias 2: Nonresponse: sampled individuals can not be contacted or do not cooperate Eg. 1936 presidential polls Low response rate: less than 25% of responded Undercoverage of poorer demographics: sample of voters relied heavily on lists of automobile and telephone owners, which were generally more affluent voters Well, at least we learned from those mistakes, right?

Nonprobability Sampling Convenience (most available) Purposive sampling Quota sampling (setting quotas and then using convenience sampling to obtain those quotas; similar to stratified ) Snowball sampling the researcher specifies the characteristics of the population of interest and then locates individuals who match those characteristics). For example, you might decide that you want to only include "boys who are in the 7th grade and have been diagnosed with ADHD" in your research study. You would then, try to find 50 students who meet your "inclusion criteria" and include them in your research study.

Experimental psychology example Mostly used convenience samples. Who? According to Joseph Henrich (Uof B.C.), most undergraduates are WEIRD and not representative of humanity as a whole. Western, Educated, Industrialized, Rich and Democratic He found a random American undergraduate is about 4,000 times more likely than an average human being to be the subject of such a study. He did analysis of leading psychology journals

Source: The Economist, May 26th, 2012

More Potential Problems with Surveys Response Bias: respondents may not answer truthfully to survey questions Illegal or unpopular behavior such as drug usage Controversial topics such as teen sexual activity Race or gender of interviewer can influence answers about race or gender-related questions Respondents often have trouble remembering past events eg. yearly nutrition and health surveys

More Potential Problems with Surveys Wording of questions can be confusing or intentionally lead the respondent Do you favor a ban on disposable diapers? It is estimated that disposable diapers account for less than 2% of the trash in today’s landfills. In contrast, beverage containers, third-class mail and yard wastes account for 21% of the trash in landfills. Given this, would it be fair to ban disposable diapers? Complicated multi-part forms that require lots of skipped questions lead to a drop off in response