Where do data come from and Why we don’t (always) trust statisticians.

Slides:



Advertisements
Similar presentations
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Advertisements

VI. Sampling: (Nov. 2, 4) Frankfort-Nachmias & Nachmias (Chapter 8 – Sampling and Sample Designs) King, Keohane and Verba (Chapter 4) Barbara Geddes
1. 2 GUIDELINES 1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. If.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 4: Designing Studies Section 4.1 Samples and Surveys.
AP Statistics Section 5.1 B More on Sampling. Methods for sampling from large populations spread out over a wide area are usually more complex than an.
Chapter 7: Data for Decisions Lesson Plan
Chapter 5 Producing Data
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
AP Statistics Chapter 5 Notes.
CHAPTER 4 Designing Studies
The Practice of Statistics
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Austin Cole February 16, Outline I. Sampling a. Bad Sampling Methods b. Random Sampling II. Experiments III. Applying Sample to a Population IV.
4.2 Statistics Notes What are Good Ways and Bad Ways to Sample?
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
 Sampling Design Unit 5. Do frog fairy tale p.89 Do frog fairy tale p.89.
Chapter 1: The Nature of Statistics
Sampling is the other method of getting data, along with experimentation. It involves looking at a sample from a population with the hope of making inferences.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.1 Samples and Surveys.
Sampling Design Notes Pre-College Math.
Chapter 5: Producing Data “An approximate answer to the right question is worth a good deal more than the exact answer to an approximate question.’ John.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Chapter 12 Sample Surveys
CHAPTER 8: Producing Data Sampling ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
AP Review #4: Sampling & Experimental Design. Sampling Techniques Simple Random Sample – Each combination of individuals has an equal chance of being.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
BY: Nyshad Thatikonda Alex Tran Miguel Suarez. How to use this power point 1) Click on the box with the number. Best to click on the black part and not.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
SECTION 4.1. INFERENCE The purpose of a sample is to give us information about a larger population. The process of drawing conclusions about a population.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Collection of Data Jim Bohan
I can identify the difference between the population and a sample I can name and describe sampling designs I can name and describe types of bias I can.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.1 Samples and Surveys.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
1. What is one method of data collection? 2. What is a truly random way to survey/sample people?
The population in a statistical study is the entire group of individuals about which we want information The population is the group we want to study.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
CHAPTER 9: Producing Data Experiments ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
5.1: Designing Samples. Important Distinction Observational Study – observe individuals and measure variables but do not attempt to influence the responses.
Ten things about Experimental Design AP Statistics, Second Semester Review.
Ten percent of U. S. households contain 5 or more people
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
Producing Data 1.
Unit 4--Lesson 2. Lesson Objectives At the end of the lesson, students can: Identify common issues with sampling and surveys Design an experiment using.
Chapter 5 Data Production
Sources of Bias 1. Voluntary response 2. Undercoverage 3. Nonresponse
Section 5.1 Designing Samples
CHAPTER 4 Designing Studies
Inference for Sampling
Producing Data, Randomization, and Experimental Design
Producing Data, Randomization, and Experimental Design
CHAPTER 4 Designing Studies
Ten things about Experimental Design
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
What do Samples Tell Us Variability and Bias.
CHAPTER 4 Designing Studies
Designing Samples Section 5.1.
CHAPTER 4 Designing Studies
Presentation transcript:

Where do data come from and Why we don’t (always) trust statisticians.

Induction vs. Deduction the gist of statistics Deduction: “What is true about the whole, must be true about a part.” Induction: “What is true about the part might be true about the whole.”

Population vs. Sample Population is the entire group of individuals about which we want information. Sample is a part of population from which we actually collect information. We use samples to study population because, often, populations are impossible or impractical to study.

Real Life Example of a Bad Sample Ann Landers, a famous columnist, collected a sample of 10,000 people who wrote in to answer this question: “If you could do it all over again, would you have children?” 70% of the respondents said that they would not have children. When a sample was selected at random, 91% of the people said that they would have children.

Potential problems with sample surveys Undercoverage occurs when some groups in population are left out of the process of choosing the sample. Nonresponse occurs when an individual chosen for the sample cannot be contacted or refuses to respond.

Another Real life Example of a Bad Sample In 1936 Literary Digest mailed out 10,000,000 ballots asking who the respondents are going to vote for – A. Landon or F.D. Roosevelt. 2,300,000 ballots were returned, predicting a strong win (57%) for Landon.

Another Real life Example of a Bad Sample George Gallup surveyed 50,000 people chosen randomly. Comparison of forecasts: Gallup’s Prediction for Roosevelt 56% Gallup’s prediction of Digest 44% Digest prediction for Roosevelt 43% Actual vote 62% Literary Digest used their subscription list, phone directory, lists of car owners, club members.

Right and Wrong Ways to Sample A simple random sample is a sample where (1) each unit of population has an equal chance of being chosen and (2) all units are chosen independently. The sample is biased if at least one group of individuals has greater chances of being selected.

Example of a good sample You want to study effects of computers on GPA. You don’t have the resources to study all students. To select a sample of students for the study you Get a list of all students, Select at random students on the list, Collect information from the students selected, Compare those who have computer with those who don’t.

Example of a bad sample You want to study effects of computers on GPA. You don’t have the resources to study all students. To select a sample of students for the study you Use your friends. Hang an ad in the computer lab. Post an on-line questionnaire on WKU site.

Stratified Random Sample When we know proportions of each group in the population – Stratified random sample is better than SRS. In stratified sample, number of people chosen from each group is proportional to the size of that group in the population.

Confounding Two explanatory variables are confounded when their effects on the response variable cannot be distinguished from each other. Confounding is often a problem with a study that uses sample surveys to collect data (even if sampling is done right).

Observation vs. Experiment Observational study - observes individuals and measures variables but does not attempt to influence responses. Experiment imposes treatment on individuals to observe their responses.

How to design an Experiment The purpose of an experiment is to find out how one variable (response variable) changes in response to change in another variable (explanatory variable). Experiment: Subject Treatment Response

Placebo Effect Placebo effect – change in behavior due to participation in experiment. Placebo effect is a problem when experiment does not have a control group (a basis for comparison) To avoid the problem – design a randomized comparative experiment.

How to design a Randomized Comparative Experiment Randomly split the subjects into two groups: control group – receives no treatment treatment group – receives treatment Compare the results. Both will be equally affected by Placebo effect, so the difference between the groups shows whether the treatment works.

How to interpret results of an experiment Observe outcomes for treatment and control groups. If outcomes are different enough so that we can say that this difference would rarely occur by chance, we conclude that the difference is statistically significant.

Population vs. Sample Population is the entire group of individuals about which we want information. Sample is a part of population from which we actually collect information. Based on the sample, we make conclusion about the whole population.

Parameter vs. Statistic A Parameter is the number that describes the population. A Statistic is a number that describes the sample. We use statistics to estimate parameters.

Sampling Distribution The result of your study is a statistic, which can vary from sample to sample Sampling Distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population Estimate=True Parameter + Sampling Error

Bias and variability A statistic is biased if the mean of the sampling distribution is not equal to the true value of the parameter being estimated. Variability of a statistic is the spread of sampling distribution. Bias does not go away with larger samples. Variability goes away with larger samples.