Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?

Slides:



Advertisements
Similar presentations
Confidence Intervals for Population Means
Advertisements

Chapter 7: Data for Decisions Lesson Plan
Chapter 8: Estimating with Confidence
GATHERING DATA Chapter Experiment or Observe?
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
Launching into Inference: From Common Core to AP Statistics Daren Starnes The Lawrenceville School CMC South 2013.
Probability - 1 Probability statements are about likelihood, NOT determinism Example: You can’t say there is a 100% chance of rain (no possibility of.
Confidence Intervals for Proportions
1. Estimation ESTIMATION.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Chapter 10: Estimating with Confidence
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
Binomial Probability Distribution.
Chapter 7 Confidence Intervals and Sample Sizes
Hypothesis Testing in Linear Regression Analysis
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Chapter 1: Introduction to Statistics
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
POSC 202A: Lecture 9 Lecture: statistical significance.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Statistics: Concepts and Controversies What Is a Confidence Interval?
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
8.2 Estimating Population Means LEARNING GOAL Learn to estimate population means and compute the associated margins of error and confidence intervals.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimates and Sample Sizes Lecture – 7.4
Political Science 30: Political Inquiry Drawing a Good Sample.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Statistical Inference: Making conclusions about the population from sample data.
AP Statistics.  Observational study: We observe individuals and measure variables of interest but do not attempt to influence responses.  Experiment:
Section 2-1 Samples, Good and Bad. Remember: We select a sample in order to get information about some population (entire group of individuals about which.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7-2 Estimating a Population Proportion.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Agresti/Franklin Statistics, 1 of 56 Chapter 4 Gathering data Learn …. How to gather “good” data About Experiments and Observational Studies.
MAT 1000 Mathematics in Today's World. Last Time 1.Two types of observational study 2.Three methods for choosing a sample.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
Making Inferences. Sample Size, Sampling Error, and 95% Confidence Intervals Samples: usually necessary (some exceptions) and don’t need to be huge to.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Confidence Intervals and Tests of Proportions. Assumptions for inference when using sample proportions: We will develop a short list of assumptions for.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 8 : Estimation.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
12.2 Inference for a Population Proportion We are interested in the unknown proportion p of a population that has some outcome – call the outcome we are.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Inference: Probabilities and Distributions Feb , 2012.
Survey Research. Sampling and Inference June 9, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
CONFIDENCE INTERVALS: THE BASICS Unit 8 Lesson 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
Unit 4 Gathering Data LESSON SAMPLING WHAT ARE GOOD WAYS AND POOR WAYS TO SAMPLE?
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Statistics 19 Confidence Intervals for Proportions.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Intro to Probability and Statistics 1-1: How Can You Investigate Using Data? 1-2: We Learn about Populations Using Samples 1-3: What Role Do Computers.
Political Science 30: Political Inquiry
CHAPTER 8 Estimating with Confidence
The Diversity of Samples from the Same Population
Estimating a Population Proportion
Confidence Intervals: The Basics
Political Science 30 Political Inquiry
Presentation transcript:

Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?

Sample vs. the Population - 1 An observational study simply observes cases, WITHOUT attempting to impose a treatment and WITHOUT requiring any quasi- or natural experimental design. Researchers can ask their cases questions in order to measure some variable. Most of the time, researchers look closely at a small sample of the overall population.

Sample vs. the Population - 2 A population is the ENTIRE group of cases about which you want information. A sample is a SUBSET of the population which is used to gain information about the whole population. Population Sample

Sample vs. the Population - 3 A parameter is a number describing a population. It is a usually a mystery. A statistic is a number describing a sample. Statistics vary from sample to sample. If our sample is representative of the population, sample statistics will closely approximate population parameters.

How to Draw a Random Sample - 1 A simple random sample gives all members of the population an EQUAL chance to be drawn into the sample. Draw names out of a hat, a really big hat Label every case in the population with a number, then draw some random numbers In a telephone poll, random digit dialing uses a random number generator to get even those with unlisted numbers.

How to Draw a Random Sample - 2 WARNING! Random Sampling and Random Assignment are different! Random sampling selects which cases in your population you will study. Random assignment starts with a sample, then divides it into two or more groups. Observational studies may use random sampling, but not random assignment, and this hurts their internal validity.

How to Draw a Random Sample - 3 Sampling bias is consistent deviation of the sample statistic from the parameter Sampling variability describes how far apart statistics are over many samples.

Nonrandom methods of drawing a sample (Note: These are Bad!) = 1 A voluntary response sample includes the members of the population who voice their desire to be included in the sample “Literary Digest Poll” mailed 10 million ballots to magazine readers to volunteer participate in their Presidential election survey. 2 million surveys came back, predicting that FDR would lose 43%-57%. FDR won, 61%-39%.

Nonrandom methods of drawing a sample (Note: These are Bad!) - 2 A convenience sample studies the segment of the population that is easiest for the researcher to reach. Polls only of people who have telephones. (Less of a problem than it used to be). Studies by college students of their dormmates. Internet polls at or

Fundamentals of Significance Testing - 1 If you flip a “fair” (i.e., unbiased) coin 10 times there is LESS than a 25% chance you will obtain 5 heads and 5 tails. Put another way, there is approximately a 75% chance that you will get an UNEQUAL number of heads and tails (e.g., 7 and 3, 8 and 2, rather than 5 and 5).

Fundamentals of Significance Testing - 2 A “fair” (i.e., unbiased) coin will flip a DIFFERENT number of heads in repeated 10 flip trials. This is known as SAMPLING VARIATION (i.e., that repeated 10 flip samples of the same “fair” coin will NOT produce the same percentage of heads). Sampling variation raises this critical question:

Fundamentals of Significance Testing - 3 If we flip a coin 10 times and get an unequal number of head and tails did this occur because the coin was “biased” (not equally weighted – e.g., biased in favor of flipping heads) OR because of sampling variation (i.e., in the long run – say 10,000 flips - the coin would flip an equal number of heads and tails, it’s just in this 10 flip sample there was an unequal number of heads and tails)? This example is critical to the following discussion.

Fundamentals of Significance Testing - 4 In the coin flipping example, we will NEVER KNOW FOR SURE if the coin is UNBIASED. All we can do is ESTIMATE the PROBABILITY that it is UNBIANSED given the results that we have. Our “DECISION RULE” will be as follows: if the test results show that there is a 5%, or less chance, that the coin is unbiased, we will reject the notion that it is unbiased and conclude that it is biased.

Fundamentals of Significance Testing - 5 For example, if we find that the amount of time a person studies for this course (the independent variable), effects the grade they make in this course (the dependent variable), we need to ask the previous question applied to our circumstances: Is this because increased studying actually results in a higher grade, OR is the result due to sampling variation (i.e., on average, increased studying does not result in a higher grade, but did in our sample)?

Fundamentals of Significance Testing - 6 The remaining slides in this sequence are designed to help answer this critical question. Keep this discussion in mind as we proceed!

Do NOT worry about the “math” on the next group of slides. It is much more important that you understand the LOGIC of the text (i.e., the written words) than the mathematics. Immediately ahead we will construct “confidence intervals” for our statistical results. The following example should demonstrate the importance of a confidence interval. Confidence Intervals - 1

Confidence Intervals - 2 Suppose we randomly survey 1,100 adult Americans and find that 52% think President Obama is doing a good job as president. The president’s job approval rating is important because it impact’s the president’s ability both to get his legislative program enacted and his party’s ability to retain both the presidency and congressional strength.

Confidence Intervals - 3 Thus, it is important to know how much confidence we can place in the 52% approval rating Obama had in the poll. For example, how likely would the result from our SAMPLE be that Obama had a 52% approval rating when his ACTUAL approval rating among all adult Americans was 46%? In a two candidate race, you can’t win with 46% of the vote! The following slides show us how to answer such a question.

Confidence Intervals - 4 What are the two principles of any test of statistical significance? See if you can apply these principles to the formulas on the following slide.

Confidence Intervals – 5 – DON’T WRITE THE FORMULA! The 95% confidence interval around a sample proportion is: And the 99.7% confidence interval would be:

Confidence Intervals – 6 – DON’T WRITE THE FORMULA! Some years ago, in a poll of 505 likely voters, the Field Poll found 55% support for the recall of Gov. Davis

Confidence Intervals – 7 The margin of error for this poll was plus or minus 4.4 percentage points. This means that if we took many samples using the Field Poll’s methods, 95% of the samples would yield a statistic within plus or minus 4.4 percentage points of the true population parameter.