Sampling: Surveys and How to Ask Questions

Slides:



Advertisements
Similar presentations
Sampling: Theory and Methods
Advertisements

Copyright © 2010 Pearson Education, Inc. Slide
Chapter 7 Sampling and Sampling Distributions
VI. Sampling: (Nov. 2, 4) Frankfort-Nachmias & Nachmias (Chapter 8 – Sampling and Sample Designs) King, Keohane and Verba (Chapter 4) Barbara Geddes
1. 2 GUIDELINES 1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. If.
Chi-Square and Analysis of Variance (ANOVA)
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
Putting Statistics to Work
Chapter 8 Estimation Understandable Statistics Ninth Edition
Collecting Data Sensibly
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Where do data come from and Why we don’t (always) trust statisticians.
Sampling A population is the total collection of units or elements you want to analyze. Whether the units you are talking about are residents of Nebraska,
Sampling.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Sampling: Surveys and How to Ask Questions Chapter 4.
Copyright ©2011 Brooks/Cole, Cengage Learning Sampling: Surveys and How to Ask Questions Chapter 5 1.
Chapter 4 How to get the Data Part1 n In the first 3 lectures of this course we spoke at length about what care we should take in conducting a study ourselves.
Chapter 12 Sample Surveys
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
Copyright © 2011 Pearson Education, Inc. Samples and Surveys Chapter 13.
5.1 Designing Samples.  Differentiate between an observational study and an experiment  Learn different types of sampling techniques  Use a random.
Producing Data: Sampling BPS - 5th Ed.Chapter 81.
CHAPTER 8 Producing Data: Sampling BPS - 5TH ED.CHAPTER 8 1.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 2 The Data Analysis Process and Collecting Data Sensibly.
Statistics The science of collecting, analyzing, and interpreting data. Planning A Study Using The Statistical Problem Solving Process: Ask a question.
Copyright © 2009 Pearson Education, Inc. Publishing as Longman. The 1936 Literary Digest Presidential Election Poll Case Study: Special Topic Lecture Chapter.
1 Stat 1510 Statistical Thinking & Concepts Producing Data: Sampling.
 Sampling Design Unit 5. Do frog fairy tale p.89 Do frog fairy tale p.89.
Chapter 12 Designing Good Samples. Doubting the Holocaust? An opinion poll conducted in 1992 for the American Jewish Committee asked: Does it seem possible.
Measurements, Mistakes and Misunderstandings in Sample Surveys Lecture 1.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
DATA COLLECTION METHODS Sampling
Sampling: Surveys and How to Ask Questions Chapter 4.
Sampling Design Notes Pre-College Math.
Section 5.1 Designing Samples Malboeuf AP Statistics, Section 5.1, Part 1 3 Observational vs. Experiment An observational study observes individuals.
Chapter 41 Sample Surveys in the Real World. Chapter 42 Thought Question 1 (from Seeing Through Statistics, 2nd Edition, by Jessica M. Utts, p. 14) Nicotine.
Deciding what and how to measure
Other Probability Sampling Methods
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Chapter 15 Sampling and Sample Size Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Section 5.1 Designing Samples AP Statistics
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Section 2.2. Census – obtaining information from an entire population Sample – obtaining information from a selected part of the population Bias – the.
Chapter Eleven The entire group of people about whom information is needed; also called the universe or population of interest. The process of obtaining.
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
SECTION 4.1. INFERENCE The purpose of a sample is to give us information about a larger population. The process of drawing conclusions about a population.
I can identify the difference between the population and a sample I can name and describe sampling designs I can name and describe types of bias I can.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Gathering Useful Data Chapter 3.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
5.3: SAMPLING. Errors in Sampling Sampling Errors- Errors caused by the act of taking a sample. Makes sample results inaccurate. Random Sampling Error.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 13 Samples and Surveys.
Copyright ©2011 Brooks/Cole, Cengage Learning Sampling: Surveys and How to Ask Questions 1.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Introduction/ Section 5.1 Designing Samples.  We know how to describe data in various ways ◦ Visually, Numerically, etc  Now, we’ll focus on producing.
Ten percent of U. S. households contain 5 or more people
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
Stat 100 Jan. 21 Read Chapter 4, Try problems 1,2, 5,7,9,14,15,17
Essential Statistics Producing Data: Sampling
Sampling.
Inference for Sampling
Essential Statistics Producing Data: Sampling
MATH 2311 Section 6.1.
Chapter 5: Producing Data
Basic Practice of Statistics - 5th Edition Producing Data: Sampling
MATH 2311 Section 6.1.
Presentation transcript:

Sampling: Surveys and How to Ask Questions Chapter 4 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

4.1 The Beauty of Sampling Sample Survey: a subgroup of a large population questioned on set of topics. Special type of observational study. Less costly and less time than a census. With proper methods, a sample of 1500 can almost certainly gauge the percentage in the entire population who have a certain trait or opinion to within 3%. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Conservative margin of error: The Margin of Error The sample proportion and the population proportion with a certain trait or opinion differ by less than the margin of error in at least 95% of all random samples. Conservative margin of error: For proportions: For percents: Add and subtract margin of error to create an approximate 95% confidence interval. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Example 4.1 The Importance of Religion for Adult Americans Poll of n = 1003 adult Americans: “How important would you say religion is in your own life?” Very important 65% Fairly important 23% Not very important 12% No opinion 0% Conservative margin of error is 3%: Approx. 95% confidence interval for the percent of all adult Americans who say religion is very important: 65%  3% or 62% to 68% Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Interpreting Confidence Interval The interval 62% to 68% may or may not capture the percent of adult Americans who considered religion to be very important in their lives. But, in the long run this procedure will produce intervals that capture the unknown population values about 95% of the time => called the 95% confidence level. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Advantages of a Sample Survey over a Census Sometimes a Census Isn’t Possible when measurements destroy units Speed especially if population is large Accuracy devote resources to getting accurate sample results Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Bias: How Surveys Can Go Wrong Results based on a survey are biased if method used to obtain those results would consistently produce values that are either too high or too low. Selection bias occurs if method for selecting participants produces sample that does not represent the population of interest. Nonresponse bias occurs when a representative sample is chosen but a subset cannot be contacted or doesn’t respond. Response bias occurs when participants respond differently from how they truly feel. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

4.2 Simple Random Sampling and Randomization Probability Sampling Plan: everyone in population has specified chance of making it into the sample. Simple Random Sample: every conceivable group of units of the required size has the same chance of being the selected sample. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Choosing a Simple Random Sample You Need: List of the units in the population. Source of random numbers : Minitab or random number tables (book). Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Simple Random Sample of Students Class of 270 students. Want a simple random sample of 10 students. Number the units: Students numbered 001 to 270 – place in one column. Generate random numbers: in an adjacent column, generate uniform random numbers (Calc – Random Data – Uniform …). Sort: both columns by random numbers. Choose: the top ten students are those selected in the sample. Save your results! Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Using a Table of Random Digits in a Randomized Experiment Randomization plays a key role in designing experiments to compare treatments. Completely randomized design = all units are randomly assigned to treatment conditions. Matched-pairs / Randomized Block design = randomize order treatments are assigned within pair/block. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Example 4.5 Assigning Children to Lift Weights In Case Study 3.2, 43 children randomly assigned – the first 15 to Group 1, the next 16 to Group 2, and the remaining 12 to Group 3. Using random numbers to assign children to groups: - assign labels to each child assign a random number to each child sort by random numbers first 15 go to group 1, next 16 to group 2, last 12 to group 3 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

4.3 Other Sampling Methods Not always practical to take a simple random sample, can be difficult to get a numbered list of all units. Example: College administration would like to survey a sample of students living in dormitories. Shaded squares show a simple random sample of 30 rooms. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Stratified Random Sampling Divide population of units into groups (called strata) and take a simple random sample from each of the strata. College survey: Two strata = undergrad and graduate dorms. Take a simple random sample of 15 rooms from each of the strata for a total of 30 rooms. Ideal: stratify so little variability in responses within each of the strata. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Cluster Sampling Divide population of units into groups (called clusters), take a random sample of clusters and measure only those items in these clusters. College survey: Each floor of each dorm is a cluster. Take a random sample of 5 floors and all rooms on those floors are surveyed. Advantage: need only a list of the clusters instead of a list of all individuals. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Systematic Sampling Order the population of units in some way, select one of the first k units at random and then every kth unit thereafter. College survey: Order list of rooms starting at top floor of 1st undergrad dorm. Pick one of the first 11 rooms at random => room 3, then pick every 11th room after that. Note: often a good alternative to random sampling but can lead to a biased sample. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Random-Digit Dialing Method approximates a simple random sample of all households in the United States that have telephones. List all possible exchanges (= area code + next 3 digits). Take a sample of exchanges (chance of being sampled based on white pages proportion of households with a specific exchange). Take a random sample of banks (= next 2 digits) within each sampled exchange. Randomly generate the last two digits from 00 to 99. Once a phone number determined, make multiple attempts to reach someone at that household. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Multistage Sampling Example: Using a combination of the sampling methods, at various stages. Example: Stratify the population by region of the country. For each region, stratify by urban, suburban, and rural and take a random sample of communities within those strata. Divide the selected communities into city blocks as clusters, and sample some blocks. Everyone on the block or within the fixed area may then be sampled. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Example 4.7 The Nationwide Personal Transportation Survey Nationwide Personal Transportation Survey: taken every 5 years by the U.S. Department of Transportation. 1995 Survey = 21,000 households. Interviews conducted by telephone using a computer-assisted telephone interviewing (CATI) system. Multistage Sample: U.S. households were stratified by region of country, size of metropolitan area, and whether there is a subway system. Households were then selected by random-digit dialing. Everyone in a selected household was included => each household was a cluster. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Example 4.8 A Los Angeles Times National Poll “… half of Americans polled said they view Jan. 1, 2000, as ‘just another New Year’s Day’ … About one in 10 report that they are stockpiling goods.” Los Angeles Times Times Poll 1,249 adults nationwide by telephone. Over a two-day period in February 1999. Telephone numbers chosen from all exchanges in nation. Random-digit dialing techniques used so listed and non-listed numbers could be contacted. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

4.4 Difficulties and Disasters in Sampling Some problems occur even when a sampling plan has been well designed. Using wrong sampling frame Not reaching individuals selected Self-selected sample Convenience/Haphazard sample Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Using the Wrong Sampling Frame The sampling frame is the list of units from which the sample is selected. This list may or may not be the same as the list of all units in the desired “target” population. Example: using telephone directory to survey general population excludes those who move often, those with unlisted home numbers, and those who cannot afford a telephone. Solution: use random-digit dialing. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Not Reaching the Individuals Selected Failing to contact or measure the individuals who were selected in the sampling plan leads to nonresponse bias. Telephone surveys tend to reach more women. Some people are rarely home. Others screen calls or may refuse to answer. Quickie polls: almost impossible to get a random sample in one night. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Nonresponse or Volunteer Response “In 1993 the GSS (General Social Survey) achieved its highest response rate ever, 82.4%. This is five percentage points higher than our average over the last four years.” GSS News, Sept 1993 The lower the response rate, the less the results can be generalized to the population as a whole. Response to survey is voluntary. Those who respond likely to have stronger opinions than those who don’t. Surveys often use reminders, follow up calls to decrease nonresponse rate. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Example 4.9 Which Scientists Trashed the Public? “82% (of scientists) trashed the media, agreeing with the statement ‘The media do not understand statistics well enough to explain new findings.’ ” Science (Mervis, 1998) Science Poll 1400 professionals (in science and in journalism). Only 34% response rate among scientists. Typical respondent was white, male physical scientist over age of 50 doing basic research. Respondents represent a narrow subset of scientists => inappropriate to generalize to all scientists. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Disasters in Sampling Example 4.10 A Meaningless Poll Responses from a self-selected group, convenience sample or haphazard sample rarely representative of any larger group. Example 4.10 A Meaningless Poll “Do you support the President’s economic plan?” Results from TV quickie poll and proper study: Those dissatisfied more likely to respond to TV poll and it did not give the “not sure” option. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Case Study 4.1 The Infamous Literary Digest Poll of 1936 Election of 1936: Democratic incumbent Franklin D. Roosevelt and Republican Alf Landon Literary Digest Poll: Sent questionnaires to 10 million people from magazine subscriber lists, phone directories, car owners, who were more likely wealthy and unhappy with Roosevelt. Only 2.3 million responses for 23% response rate. Those with strong feelings, the Landon supporters wanting a change, were more likely to respond. (Incorrectly) Predicted a 3-to-2 victory for Landon. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Case Study 4.1 The Infamous Literary Digest Poll of 1936 Election of 1936: Democratic incumbent Franklin D. Roosevelt and Republican Alf Landon Gallup Poll: George Gallup just founded the American Institute of Public Opinion in 1935. Surveyed a random sample of 50,000 people from list of registered voters. Also took a random sample of 3000 people from the Digest lists. (Correctly) Predicted Roosevelt the winner. Also predicted the (wrong) results of the Literary Digest poll within 1%. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

4.5 How to Ask Survey Questions Possible Sources of Response Bias in Surveys Deliberate bias: The wording of a question can deliberately bias the responses toward a desired answer. Unintentional bias: Questions can be worded such that the meaning is misinterpreted by a large percentage of the respondents. Desire to Please: Respondents have a desire to please the person who is asking the question. Tend to understate response to an undesirable social habit/opinion. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Possible Sources of Response Bias in Surveys (cont) Asking the Uninformed: People do not like to admit that they don’t know what you are talking about when you ask them a question. Unnecessary Complexity: If questions are to be understood, they must be kept simple. Some questions ask more than one question at once. Ordering of Questions: If one question requires respondents to think about something that they may not have otherwise considered, then the order in which questions are presented can change the results. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Possible Sources of Response Bias in Surveys (cont) Confidentiality and Anonymity: People will often answer questions differently based on the degree to which they believe they are anonymous. Easier to ensure confidentiality, promise not to release identifying information, than anonymity, researcher does not know the identity of the respondents. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Be Sure You Understand What Was Measured: Words can have different meanings. Important to get a precise definition of what was actually asked or measured. E.g. Who is really unemployed? Some Concepts Are Hard to Precisely Define: E.g. How to measure intelligence? Measuring Attitudes and Emotions: E.g. How to measure self-esteem and happiness? Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Open or Closed Questions: Should Choices Be Given? Open question = respondents allowed to answer in own words. Closed question = given list of alternatives, usually offer choice of “other” and can fill in blank. If closed are preferred, they should first be presented as open questions (in a pilot survey) for establishing list of choices. Results can be difficult to summarize with open questions. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Case Study 4.2 No Opinion of Your Own? Let Politics Decide 1978 Poll, Cincinnati, Ohio: people asked whether they “favored or opposed repealing the 1975 Public Affairs Act.” No such act, about one-third expressed opinion. 1995 Washington Post Poll: 1000 randomly selected people asked “Some people say the 1975 Public Affairs Act should be repealed. Do you agree or disagree that it should be repealed?” 43% expressed opinion, 24% agreeing should be repealed. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.

Case Study 4.2 No Opinion of Your Own? Let Politics Decide (cont) Second 1995 Washington Post Poll: polled two separate groups of 500 randomly selected adults. Group 1: “President Clinton [a Democrat] said that the 1975 Public Affairs Act should be repealed. Do you agree or disagree?” Of those expressing an opinion: 36% of the Democrats agreed should be repealed, 16% of the Republicans agreed should be repealed. Group 2: “The Republicans in Congress said that the 1975 Public Affairs Act should be repealed. Do you agree or disagree?” Of those expressing an opinion: 36% of the Republicans agreed should be repealed, 19% of the Democrats agreed should be repealed. Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc.