The Guessing Game The entire business of Statistics is dedicated to the purpose of trying to guess the value of some population parameter. In what follows.

Slides:



Advertisements
Similar presentations
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Confidence Intervals Chapter 8.
Advertisements

Estimation in Sampling
Chapter 10: Estimating with Confidence
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Sampling Distributions
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
Standard Normal Table Area Under the Curve
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
T scores and confidence intervals using the t distribution.
The standard error of the sample mean and confidence intervals
t scores and confidence intervals using the t distribution
The standard error of the sample mean and confidence intervals
Point and Confidence Interval Estimation of a Population Proportion, p
Confidence intervals using the t distribution. Chapter 6 t scores as estimates of z scores; t curves as approximations of z curves Estimated standard.
1 More about the Confidence Interval of the Population Mean.
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Estimation 8.
1 The Sample Mean rule Recall we learned a variable could have a normal distribution? This was useful because then we could say approximately.
T scores and confidence intervals using the t distribution.
Chapter 10: Estimating with Confidence
Inferential Statistics
Chapter 19: Confidence Intervals for Proportions
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Chapter 11: Estimation Estimation Defined Confidence Levels
Dan Piett STAT West Virginia University
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Confidence Intervals 1 Chapter 6. Chapter Outline Confidence Intervals for the Mean (Large Samples) 6.2 Confidence Intervals for the Mean (Small.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Confidence Interval Estimation for a Population Proportion Lecture 31 Section 9.4 Wed, Nov 17, 2004.
Determination of Sample Size: A Review of Statistical Theory
February 2012 Sampling Distribution Models. Drawing Normal Models For cars on I-10 between Kerrville and Junction, it is estimated that 80% are speeding.
READING HANDOUT #5 PERCENTS. Container of Beads Container has 4,000 beads 20% - RED 80% - WHITE Sample of 50 beads with pallet. Population - the 4,000.
Estimation of a Population Mean
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
Confidence Interval Estimation For statistical inference in decision making:
Section 6.1 Confidence Intervals for the Mean (Large Samples) Larson/Farber 4th ed.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
CONFIDENCE INTERVALS: THE BASICS Unit 8 Lesson 1.
Confidence Interval Estimation For statistical inference in decision making: Chapter 9.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Statistics 19 Confidence Intervals for Proportions.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Chapter 6 Confidence Intervals 1 Larson/Farber 4th ed.
Chapter 8: Estimating with Confidence
Confidence Intervals for Proportions
Chapter 8: Estimating with Confidence
Confidence Intervals for Proportions
Confidence Interval Estimation for a Population Proportion
Chapter 8: Estimating with Confidence
Chapter 8 Confidence Intervals.
Chapter 6 Confidence Intervals
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

The Guessing Game The entire business of Statistics is dedicated to the purpose of trying to guess the value of some population parameter. In what follows the population parameter (the target parameter) will be either a mean or a proportion p. All that we have at our disposal is n values taken at random from the population x 1, x 2, …, x n and their average and their variance:

Not much to go on, but when n is large (admittedly we don’t know what that means, but n ≥ 30 will make us feel better) we have the wonderful Central Limit Theorem that tells us that the distribution of the sample mean (the average) is approximately normal

Trouble is, we don’t know and we don’t know . We just have the following picture: and the one number we know, is somewhere on the horizontal line. Actually

“somewhere” is a cop-out, we know exactly where is, we just don’t know where the red curve is relative to ! Could be like this

or like this: (in both figures the blue dot is )

Looks somewhat hopeless, but there are some statements we can make for sure. The next slides show four of them! Hopefully you’ll catch on.Replace the blue dot (that represents ) with the RED DOT (that represents the standardization of )

80% chance the red dot is inside the blue bars. (why?)

85% chance the red dot is inside the blue bars. (why?)

90% chance the red dot is inside the blue bars. (why?)

95% chance the red dot is inside the blue bars. (why?)

The Theory What went on in each of the previous four slides? Let’s see. We picked a percentage of area 1.80% - first slide 2.85% - second 3.90% - third 4.95% - fourth From the chosen percentage we got (via the standard normal tables)

symmetric z-scores 1.80%gave us %gave us %gave us %gave us In fact, if you give me any positive area ≤ 1 (call it 1 -  ) I can find the corresponding symmetric z-scores by looking for the area value

(figure it out from the figure!) The two z-scores you get are written as ±Z   /2 and the number 1 - is called confidence coefficient if in decimal form confidence level if in percent form

Confidence Intervals Why are we using the word “confidence”? Confidence in what? Of course, we hope it is confidence in our prediction! In fact we want the confidence level to be just the probability that our prediction is correct. Trouble is …. We haven’t predicted anything !! We just have established that

For any confidence coefficient 1 - we can find z-values and so that P(red dot between and ) = 1 - Recall that the red dot stands for the standardized value

So we obtain the statement that a little 7-th grade algebra transforms into This is translated into English as: is inside the interval

… with probability 1 - Or, in slightly different (and more pompous sounding) words “We are (1 - )% confident that is inside the interval We call this interval the (1 - )% confidence interval. THAT’S OUR PREDICTION !

One last step: what do we use for sigma ? If we know it (sometimes we do) … HALLELUYA ! If not, we approximate sigma using the sample standard deviation where s is the (computed) sample standard deviation.

An example The numbers shown have been obtained as time T (in seconds) elapsed from the time the cage door is opened to the time of exit from the cage for 40 lab mice in separate cages; (20 of the mice have been given a tranquilizer, the other 20 a placebo, but this is for another problem later.) Construct the following confidence intervals for the mean of T

The Question: 1.90% confidence interval 2.95% confidence interval 3.99% confidence interval 4.30% confidence interval 5.10% confidence interval

The data: The sample mean and standard deviation are: therefore

The z-scores For each we compute z’s (from my “stats” program or from the table) 1.90% % % % % 0.126

The answers Using the formula we get the intervals 1.90% [2.760, 3.239] 2.95% [2.715, 3.285] 3.99% [2.625, 3.375] 4.30% [2.944, 3.056] 5.10% [2.982, 3.018] Note that the higher the confidence the wider the interval. Is this reasonable?

or who’s gonna win the elections? Quite often one needs to estimate what proportion p of a population prefers option A over option B. One takes a “large enough” random sample of the population, counts how many prefer A, divides by the size n of the sample and gets a number, denoted by (a statistic!).

Of course is a random variable, and it turns out that it is an unbiased estimator of p, that is E( ) = p If we knew the standard deviation  of we could construct confidence intervals for p as we did for the parameter. (If n is big enogh the Central Limit Theorem still holds) We can show that  = pq/n (remember that q = 1 - p), but this is tautological (we don’t know p !) However, if n is large enough, we can use  for p and proceed as with.

In other words, we use instead of and instead of and get the interval

An Example What proportion p of Notre Dame students know a language other than English? In a random sample of 1,500 Notre Dame students, 855 stated they knew some language other than English. Develop a 98% confidence interval for p based on this sample.

We have = 0.02 Therefore = 2.33 (why?) Now = 855/1500 = 0.57 and therefore we approximate with (why?) We get the 98% confidence interval as (0.57 – 2.33x0.128, x0.128) That is (0.54, 0.60)

THE FORMULAS To estimate with confidence 1 -