Statistical Thinking Sample Surveys in the Real World

Slides:



Advertisements
Similar presentations
+ Sampling and Surveys Inference for Sampling The purpose of a sample is to give us information about alarger population. The process of drawing conclusions.
Advertisements

Sampling Methods.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Chapter 1 The Where, Why, and How of Data Collection
AP Statistics Chapter 5 Notes.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Math 161 Fall 2008 Producing Data: Sampling
1Chapter 8 Sampling1 Chapter 8 Producing Data: Sampling.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 12 Sample Surveys
The Practice of Statistics
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
Social Research Methods
Chapter 41 Sample Surveys in the Real World. Chapter 42 Thought Question 1 (from Seeing Through Statistics, 2nd Edition, by Jessica M. Utts, p. 14) Nicotine.
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Chapter 1 Getting Started
Chapter 12: AP Statistics
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 1: Introduction to Statistics
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Producing Data: Sampling BPS - 5th Ed.Chapter 81.
CHAPTER 8 Producing Data: Sampling BPS - 5TH ED.CHAPTER 8 1.
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
1 Stat 1510 Statistical Thinking & Concepts Producing Data: Sampling.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Sample Surveys.  The first idea is to draw a sample. ◦ We’d like to know about an entire population of individuals, but examining all of them is usually.
Ch.7: March 02, Chapter 7 Producing Data: Sampling.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 41 Sample Surveys in the Real World. Chapter 42 Thought Question 1 (from Seeing Through Statistics, 2nd Edition, by Jessica M. Utts, p. 14) Nicotine.
Section 1-4 Collecting Sample Data. DEFINITIONS Observational Study: observing and measuring specific characteristics without attempting to modify the.
MAT 1000 Mathematics in Today's World. Last Time 1.Two types of observational study 2.Three methods for choosing a sample.
Slide 1 Copyright © 2004 Pearson Education, Inc. Misuses of Statistics  Bad Samples  Small Samples  Misleading Graphs  Pictographs  Distorted Percentages.
Chapter 1 Getting Started 1.1 What is Statistics?.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Lecture # 6:Designing samples or sample survey Important vocabulary Experimental Unit: An individual person,animal object on which the variables of interest.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 1-5 Collecting Sample Data.
Chapter Eleven The entire group of people about whom information is needed; also called the universe or population of interest. The process of obtaining.
1 Chapter 2: Sampling and Surveys. 2 Random Sampling Exercise Choose a sample of n=5 from our class, noting the proportion of females in your sample.
I can identify the difference between the population and a sample I can name and describe sampling designs I can name and describe types of bias I can.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
1 Research Methods in AD/PR COMM 420 Section 8 Tuesday / Thursday 3:35 pm -5:30 pm 143 Stuckeman Nan Yu 2007 Fall_COMM 420_Week NY.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Chapter 5 Sampling and Surveys. Section 5.3 Sample Surveys in the Real World.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Errors in Sampling Sampling Errors- Errors caused by the act of taking a sample. Makes sample results inaccurate. Random Sampling Error Errors caused by.
Sampling & Simulation Chapter – Common Sampling Techniques  For researchers to make valid inferences about population characteristics, samples.
Sampling Design and Procedure
Chapter 1 Getting Started Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Introduction/ Section 5.1 Designing Samples.  We know how to describe data in various ways ◦ Visually, Numerically, etc  Now, we’ll focus on producing.
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
Essential Statistics Producing Data: Sampling
Essential Statistics Producing Data: Sampling
Chapter 5: Producing Data
MATH 2311 Section 6.1.
Basic Practice of Statistics - 5th Edition Producing Data: Sampling
Presentation transcript:

Statistical Thinking Sample Surveys in the Real World Chapter 4 Sample Surveys in the Real World Here are some notes which pertain to this page. Chapter 4 Chapter 4

Statistical Thinking Thought Question When surveying students on their opinions on their professor’s teaching methods, do you think it matters who conducts the interviews? Explain your answer with an example. From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Sources of Error in Surveys Random sampling reduces bias in choosing a sample and allows control of variability. Sampling in the real world is more complex and less reliable than we might hope for. Confidence statements do not reflect all sources of error that are present in sampling. Chapter 4

Statistical Thinking Sampling Errors – Errors that are caused by the act of taking a sample. Random Sampling Error: the difference between a statistic and the population parameter it estimates; such an error results from chance sample fluctuations. - Measured by the margin or error. Nonsampling Errors – Errors that are not related to the act of taking a sample. Example: Sample data that are incorrectly collected, recorded, or analyzed (such as using a defective instrument, or copying the data incorrectly). page 23 of text Nonsampling errors can be much larger than the sampling errors. Chapter 4 Chapter 4

Sampling Errors Using the wrong sampling frame. Undercoverage: Excluding some units in the population. Chapter 4

Sampling Errors Disasters Using voluntary response (self selection). Statistical Thinking Sampling Errors Disasters Using voluntary response (self selection). Using a convenience or haphazard sample. cannot extend results to the population of interest (need a broad cross-section of the population). Chapter 4 Chapter 4

Sources of Nonsampling Errors Non-response bias: Cannot contact subjects or they do not respond. - Nonrespondents often behave or think differently from respondents. - low response rates can lead to huge biases. - Can be as high as 75% to 80% (see page 58 of textbook). Response Errors: Incorrect answers by respondents. Chapter 4

Nonsampling errors cont. Survey format effects: Factors such as question order, questionnaire layout, self -administered questionnaire or interviewer, can affect the results. Interviewer effects: Different interviewers asking the same questions can obtain different answers. Processing Errors: Data that are incorrectly collected, recorded, calculated etc. Chapter 4

Concerns when Asking Survey Questions Statistical Thinking Concerns when Asking Survey Questions Deliberate bias Unintentional bias Desire to please Asking the uninformed Unnecessary complexity Ordering of questions Confidentiality and anonymity Chapter 4 Chapter 4

Statistical Thinking Deliberate Bias “If you found a wallet with $20 in it, would you return the money?” “If you found a wallet with $20 in it, would you do the right thing and return the money?” Chapter 4 Chapter 4

Statistical Thinking Desire to Please “Is your instructor doing a good job presenting the course material in a clear and interesting way?” Yes No Chapter 4 Chapter 4

Asking the Uninformed: Case Study Statistical Thinking Asking the Uninformed: Case Study Washington Post National Weekly Edition (April 10-16, 1995, p. 36) A 1978 poll done in Cincinnati asked people whether they “favored or opposed repealing the 1975 Public Affairs Act.” There was no such act! About one-third of those asked expressed an opinion about it. Chapter 4 Chapter 4

Ordering of Questions e.g. One German poll asked these two questions: Would you say that traffic contributes more or less to air pollution than industry? (Result: 45% blamed traffic and 27% blamed industry.) Would you say that industry contributes more or less to air pollution than traffic? (Result: 24% blamed traffic and 57% blamed industry.) Chapter 4

Confidentiality and Anonymity Confidential answer respondent is known, but the information is a secret facilitates follow-up studies Anonymous answer the respondent is not known, or cannot be linked to his/her response usually yields more truthful answers Chapter 4

Dealing with errors Statistical methods are available for estimating the likely size of sampling errors. - margin of error gives the sampling error. All we can do with nonsampling errors is to try to minimize them at the study-design stage. - Substitute similar households for non-responders. - Weight the responses of those who responded such that the sample matches the population. Chapter 4

Statistical Thinking Pilot Survey One tests a survey on a relatively small group of people to try to identify any problems with the survey design before conducting the survey proper. Chapter 4 Chapter 4

Example: Page 67 #4.4 Chapter 4 Statistical Thinking Example: Page 67 #4.4 A survey of users of the Internet found that males outnumbered females by nearly 2 to 1. This was a surprise, because earlier surveys had put the ratio of men to women closer to 9 to 1. Later in the article we find this information: Detailed surveys were sent to more than 13,000 organizations on the Internet; 1,468 usable responses were received. According to Mr. Quarterman, the margin of error is 2.8 percent, with a confidence level of 95 percent. What was the response rate for this survey? (The response rate is the percentage of the planned sample that responded). The response rate is Chapter 4 Chapter 4

Statistical Thinking Example Cont. Use the quick method to estimate the margin of error of this survey. Is your result close to the 2.8% claimed? Very close to the reported value of 2.8%. Do you think that the small margin of error is a good measure of the accuracy of the survey’s results? Explain your answer. The reported margin of error is probably unreliable, because we know nothing about the 88.7% that did not respond; they may be more (or less) male-dominated than those that responded. Chapter 4 Chapter 4

Example Use the following to answer questions 1-3: BLANK A in a sampling method means that the sample results will systematically misrepresent the population in the same way when we take repeated samples. For example, if we contact only people listed in telephone directories, the sample suffers from BLANK B. If some people chosen for the sample refuse to participate, the sample suffers from BLANK C. Both BLANK B and BLANK C are common sources of BLANK A. 1. BLANK A should read  A) Bias. B) high variability.  C) random sampling error. D) imprecise measurement. 2. BLANK B should read  A) nonresponse. B) undercoverage.  C) voluntary response. D) double-blindness. 3. BLANK C should read  A) nonresponse. B) undercoverage.  C) voluntary response. D) double-blindness. Chapter 4

Statistical Thinking Questions to ask about a study (determining the soundness of statistical studies) Who funded the study, and who conducted the study (and are they related)? Who were the individuals (or objects) studied, and how were they selected [population, sample size, response rate]? What was the setting in which the measurements were taken [time, location, method of contact, etc.]? What was the exact nature of the measurements made or questions asked? Were there any other differences in the groups being compared [any confounding/lurking variables]? What was the magnitude of any claimed effects or differences? Chapter 4 Chapter 4

Case Study Questions to ask Statistical Thinking Case Study Questions to ask Brooks Shoes Brings Flawed Study to Court reported in Statistical Reasoning in Law and Public Policy, Vol. 2, pp. 517-520 Chapter 4 Chapter 4

Questions to ask Brooks’ lawyer instrumental in designing survey. Statistical Thinking Questions to ask Brooks Shoes Who funded the study, and who conducted the study (and are they related)? Funding Source: Brooks shoe company. Brooks’ lawyer instrumental in designing survey. Researchers: “Interviewers” – employees of the company? They were inadequately trained in how to conduct an unbiased survey. Chapter 4 Chapter 4

Questions to ask 121 spectators & participants at 3 track meets. Statistical Thinking Questions to ask Brooks Shoes Who were the individuals or objects studied, and how were they selected? 121 spectators & participants at 3 track meets. Not selected to be “representative” of the general public in the area (e.g., they were more educated). What was the setting in which the measurements were taken? Track meets, where people are more likely to be familiar with athletic shoes. Chapter 4 Chapter 4

Statistical Thinking Questions to ask Brooks Shoes What was the exact nature of the measurements made or questions asked? Identifying Brooks shoe. Also, “How long have you known about Brooks Running Shoes?” Were there any other differences in the groups being compared? Probably not a problem here. Chapter 4 Chapter 4

Statistical Thinking Questions to ask Brooks Shoes What was the magnitude of any claimed effects or differences? 71% recognized a Brooks shoe [33% of these due to the ‘V’ design] 39% thought Suave shoe was Brooks shoe [48% of these due to the ‘V’ design] Suave conducted a proper survey of 404 respondents and found that only 2.7% recognized a Brooks shoe from the ‘V’! Chapter 4 Chapter 4

Other Sampling Designs We might want our sample to represent certain characteristics of the population (example: gender, race, age, etc.) In practice, more complex (but easy to implement) sampling designs are used. Chapter 4

Stratified Random Sample first divide the population into groups of similar individuals, called strata second, choose a separate simple random sample in each stratum third, combine these simple random samples to form the full sample if only certain strata are (randomly) chosen to be used, and all subjects in these strata make up the sample, then we have a cluster sample. Chapter 4

Systematic Sample randomly select a member of the sampling frame for the sample using a set procedure or rule, select the rest of the individuals for the sample for example, randomly select an individual from the sampling frame, and then select every 25th member of the sampling frame to be in the sample Chapter 4

Multistage Sample divide the population of interest into groups randomly select some of those groups divide the resulting collection of individuals into smaller groups continue dividing the resulting collection of individuals into groups and randomly selecting some of those groups until you can simply list all of the resulting individuals and randomly select n of them for your sample Chapter 4

Probability Sampling Plans A probability sample is a sample chosen by chance. Some probability samples (such as stratified sampling) do not give an equal chance to all the samples they allow – thus, they are not simple random samples. Chapter 4

Probability Sampling Plans Simple random sampling (SRS) Stratified random sampling Cluster sampling Systematic sampling Random digit dialing Multistage sampling Chapter 4

Statistical Thinking _____________ Sampling - Select some starting point and then select every Kth element in the population Chapter 4 Chapter 4

Statistical Thinking _______________ Sampling - subdivide the population into subgroups (strata) that share the same characteristic, then draw a sample from each stratum Chapter 4 Chapter 4

Statistical Thinking __________________ Sampling - divide the population into sections (or clusters); randomly select some of those clusters; choose all members from selected clusters Students will most often confuse stratified sampling with cluster sampling. Both break the population into strata or sections. With stratified a few are selected from each strata. With cluster, choose a few of the strata and choose all the member from the chosen strata. Chapter 4 Chapter 4

Example 1. We divide the class into two groups: first year students and others. We then take random samples from each group. This is an example of   A) simple random sampling. B) stratified random sampling.   C) clustered sampling. D) systematic random sampling.   E) multistage sampling. 2. To take a sample of students in this class we make a list ordered by social security number and select every 5th student in this list to be in our sample. This is an example of  A) systematic sampling. B) clustered sampling.  C) simple random sampling. D) multistage sampling.  E) stratified random sampling. Chapter 4

Key Concepts Ask questions when you read about studies. Statistical Thinking Key Concepts Ask questions when you read about studies. Sampling & Nonsampling Errors. Concerns when asking survey questions Stratified Sampling, Cluster Sampling, Systematic Sampling, Multistage Sampling. Chapter 4 Chapter 4

Statistical Thinking Review The number of 2-year-old maple trees with heights taller than 28.3 ft is Numerical and continuous Numerical and discrete C) Categorical and continuous D) Categorical and discrete From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Statistical Thinking Review The following information is collected of sex offenders in a BC court: Gender, Year of birth, Weight, Height, Level of education attained. The levels of measurement of these variables are: A) Nominal, ordinal, ratio, ratio, ordinal B) Nominal, interval, ratio, ratio, ordinal C) Nominal, interval, ratio, ratio, nominal D) Ordinal, interval, ratio, interval, ordinal E) Nominal, interval, interval, ratio, nominal From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Statistical Thinking Review A psychologist at Malaspina surveys all students from each of 20 randomly selected classes. What sampling method was used? Cluster B) Simple Random C) Convenience D) Systematic E) Stratified From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Statistical Thinking Review A psychologist surveys 100 employees of a company and asks them their favorite colour. She codes the responses as follows: 0 for red, 1 for blue, 2 for green, 3 for yellow, 4 for purple, 5 for brown, 6 for other. What level of measurement is the data? From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Statistical Thinking Review A random sample of 500 smokers are asked to halve the number of cigarettes consumed each day so that any effect on pulse rate can be measured. This study is An observational study An experiment A census A parameter From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4

Review Which of the following statements is FALSE? Statistical Thinking Review Which of the following statements is FALSE? Non-response can cause bias in surveys because non- respondents often tend to behave differently from people who respond. B) Non-sampling errors are often bigger than the random sampling errors in surveys. C) Slight changes in the wording of questions can make a measurable difference to survey results. D) People will sometimes answer a question differently for different interviewers. E) Sophisticated statistical methods can always correct the results if the population you are sampling from is different from the population of interest. From Seeing Through Statistics, 2nd Edition by Jessica M. Utts. … is an example of interviewer bias. Chapter 4 Chapter 4