Stats: Modeling the World

Slides:



Advertisements
Similar presentations
Chapter 18 Sampling distribution models
Advertisements

Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 18 Sampling Distribution Models
Copyright © 2010 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Sample Distribution Models for Means and Proportions
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
WARM – UP 1.Phrase a survey or experimental question in such a way that you would obtain a Proportional Response. 2.Phrase a survey or experimental question.
Copyright © 2010 Pearson Education, Inc. Slide
AP Statistics Chapter 9 Notes.
Estimates and Sample Sizes Lecture – 7.4
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Chapter 18: Sampling Distribution Models
Sampling Distribution Models
Chapter 18: Sampling Distribution Models AP Statistics Unit 5.
Copyright © 2010 Pearson Education, Inc. Slide
From the Data at Hand to the World at Large
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
1 Chapter 18 Sampling Distribution Models. 2 Suppose we had a barrel of jelly beans … this barrel has 75% red jelly beans and 25% blue jelly beans.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
February 2012 Sampling Distribution Models. Drawing Normal Models For cars on I-10 between Kerrville and Junction, it is estimated that 80% are speeding.
Sampling Distribution Models Chapter 18. Toss a penny 20 times and record the number of heads. Calculate the proportion of heads & mark it on the dot.
Chapter 18: Sampling Distribution Models
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Chapter 18 Sampling distribution models math2200.
Understand that Sampling error is really sample variability
Chapter 7 Review.
MATH Section 4.4.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 15 Sampling Distribution Models.
Statistics 18 Sampling Distribution. The Central Limit Theorem for Sample Proportions Rather than showing real repeated samples, imagine what would happen.
Chapter 10 Confidence Intervals for Proportions © 2010 Pearson Education 1.
Sampling Distributions
Sampling Distributions Chapter 18
Sampling Distributions
Sampling Distributions – Sample Means & Sample Proportions
Topic 8: Sampling Distributions
Ch. 18 – Sampling Distribution Models (Day 1 – Sample Proportions)
Sampling Distribution Models
Sampling Distribution Models
FINAL EXAMINATION STUDY MATERIAL PART I
Understanding Sampling Distributions: Statistics as Random Variables
Central Limit Theorem Sampling Distributions Central Limit Theorem
Some useful results re Normal r.v
The Diversity of Samples from the Same Population
Chapter 18: Sampling Distribution Models
Sampling Distribution of a Sample Proportion
Sampling Distributions of Proportions
Sampling Distribution Models
MATH 2311 Section 4.4.
Chapter 9.1: Sampling Distributions
AP Statistics: Chapter 18
Chapter 10: Estimating with Confidence
WARM – UP 1. Phrase a survey or experimental question in such a way that you would obtain a Proportional Response. 2. Phrase a survey or experimental.
Sampling Distributions
Sampling Distributions
Confidence Intervals with Proportions
Sampling Distributions of Proportions section 7.2
Estimates and Sample Sizes Lecture – 7.4
8.3 Estimating a Population Mean
Essential Statistics Sampling Distributions
Introduction to Sampling Distributions
Sampling Distributions
Warmup Which of the distributions is an unbiased estimator?
MATH 2311 Section 4.4.
Presentation transcript:

Stats: Modeling the World Chapter 18 Sampling Distribution Models

Bias and Variability

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem Let’s imagine that we ask people “Do you believe in ghosts?” -In 2005 one poll 40% of 889 adults said yes, in another during the same year 48% of 809 adults said yes. Why the difference? How can surveys taken at the same time with the same question get different results?

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem The answer is at the heart of statistics! Different samples will lead to different proportions! Can we predict how much a sample proportion will vary? YES!

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem Imagine we take all possible random samples of 808 people that were asked the ghost question- Where would the center be??? Well, we don’t know! – but we do know that the center will be the true proportion – let’s call it p For argument’s sake let’s call p = 0.45 What would be the shape of the histogram?

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem It would look like this: We call this a sampling distribution of proportions and each sampling proportion is called (p=hat) This is a normal curve with a and a standard deviation of

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem For our example: and Remember the 68-95-99.7 Rule? This 95% of the data is within two standard deviations or 2(1.75%)=3.5% and the poll which gave us 48% belief in ghosts is consistent with our guess of 45% The 3.5% is called the sampling error (not really an error but the variability from one sample to another – better term: sampling variability

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem What Assumptions do we have to make to do all of this? INDEPENDENCE: the sampled values must be independent SAMPLE SIZE: the sample size, n, must be large enough We check these by checking 3 conditions Randomization: was the data collected representative of the population 10% condition: The sample size, n, must be no larger than 10% of the population Success/Failure condition: np and nq need to be at least 10

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem: Example The candy company that makes M & M’s claims that 10% of the candies are green. In a really large bag of 500 M & M’s a class of statistics students found that 12% were green. Is this unusual? Step 1:Check Conditions Randomization: It would be reasonable to assume the candies were placed in the bag randomly 10% condition: 500 M & M’s is less than the population of M & M’s Success/Failure: np=500(0.10)=50 and nq = 500(0.90)=450 are both greater than 10

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem: Example The candy company that makes M & M’s claims that 10% of the candies are green. In a really large bag of 500 M & M’s a class of statistics students found that 12% were green. Is this unusual? So, By the 68-95-99.7 Rule, 1.49 is within 2 standard deviations, so this result is not unusual.

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem: Example Suppose 13% of people are left-handed. At a particular college lecture hall there are 200 seats with built in desks, 15 of these of these desks are built for left-handed students. In a class of 90 students, what is the probability that there will not be enough seats for the left-handed students? PLAN: I want to find out the probability that in a group of 90 students, more than 15 will be left handed (15/90 = 16.7%). So my sample size is 90, the proportion of lefties in a population is 13%, and I want to find P(L > 16.7%)

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem: Example Suppose 13% of people are left-handed. At a particular college lecture hall there are 200 seats with built in desks, 15 of these of these desks are built for left-handed students. In a class of 90 students, what is the probability that there will not be enough seats for the left-handed students? Check Conditions: Independence: It is reasonable to assume that left-handedness is independent Randomization: The 90 students in the class can be thought of as a random sample 10% condition: 90 students is certainly less than 10% of all studnets Success/Failure: np = 90(0.13)=11.7 and nq = 90(0.87) = 78.3 which are both larger than 10

The Fundamental Theorem of Statistics: AKA The Central Limit Theorem: Example Suppose 13% of people are left-handed. At a particular college lecture hall there are 200 seats with built in desks, 15 of these of these desks are built for left-handed students. In a class of 90 students, what is the probability that there will not be enough seats for the left-handed students? OK— Conclusion: There is about a 14.5% chance that there will not be enough seats for the left-handed students in the class.

A cable company believes 20% of the subscribers of a cable television company watch the shopping channel at least once a week. The cable company is trying to decide whether to replace this channel with a new local station. A survey of 100 subscribers will be undertaken. The cable company has decided to keep the shopping channel if the sample result is greater than 25%. What is the approximate probability that the cable company will keep the shopping channel? What is the population proportion and standard deviation? Find your z-score and the probability: p = 0.20 St Dev= There is a probability of 0.106 that the cable company will keep the shopping channel

Suppose that 60% of the high school faculty in the state voted in favor of 4 years of math as a graduation requirement. The Californian will be contacting 200 faculty members selected at random as a follow-up. What is the probability that less than half of the faculty members polled voted for the requirement? What is the population proportion and standard deviation? Find your z-score and the probability: p = 0.6 St Dev= The probability that less than half of the faculty members polled voted for the requirement is 0.0021

It is estimated that 48% of all motorists in Brazil use their seat belts. If a police officer observes 400 cars go by in one morning, what is the probability that the proportion of drivers wearing seat belts is between 45% and 55%? What is the population proportion and standard deviation? Find your z-score and the probability: p = 0.48 St Dev= There is a 0.8824 probability that between 45% and 55% of the drivers are wearing their seat belts.

Sampling Distributions for Means Simulation

The Central Limit Theorem The Central Limit Theorem (The Fundamental Theorem of Statistics) says that the sampling distribution for any mean becomes nearly NORMAL as the sample size grows! The mean = And the standard deviation =

Consider the approximately normal population of heights of male college students with mean of 69 inches and standard deviation of 5 inches. A random sample of 25 heights is obtained. Find the mean and standard deviation of the average heights for sample sizes of 25. Find the probability that the sample mean of 25 college male students will be greater than 73 inches. Show your work!

The Wechsler Adult Intelligence Scale (WAIS) is a common “IQ Test” for adults. The distribution of WAIS scores for persons over 16 years of age is approximately normal with a mean of 100 and a standard deviation of 15. What is the probability that a randomly chosen individual has a WAIS score of 105 or higher? The probability that a randomly selected individual has a WAIS score above 105 is 0.3707

The Wechsler Adult Intelligence Scale (WAIS) is a common “IQ Test” for adults. The distribution of WAIS scores for persons over 16 years of age is approximately normal with a mean of 100 and a standard deviation of 15. What is the mean and standard deviation of the sampling distribution of the average WAIS score for a random sample of 60 people? What is the probability that the average WAIS score for a sample of 60 people is 105 or higher? Mean = 100 St Dev = The probability the average WAIS score for a sample of 60 people is 105 or higher is 0.0049 What if this population was NOT normally distributed?!?!?!

Within 2 standard deviations of the mean…. 157 to 167 dF The mean temperature of coffee sold in restaurants is normally distributed with a mean of 162 dF with a standard deviation of 10 dF. Find the mean and standard deviation for the sampling distribution for a random sample of 16 restaurants. Draw this model: What interval contains the middle 95% of the temperatures? What is the approximate probability that the sample of 16 restaurants has a mean temp below 158 dF? Mean = 162 St Dev = Within 2 standard deviations of the mean…. 157 to 167 dF The probability mean temp for the 16 restaurants is below 158 dF is is 0.0548

Based on data from the Highway Loss Data Institute, collision repair costs for Honda Odyssey minivans are found to have a distribution that is roughly bell shaped with a mean of $1786 and a standard deviation of $937 Suppose that your next door neighbor just had his Odyssey repaired and the bill was $2300. Is this an unusual repair bill? How do you know? We can see how unusual this is by looking at its z-score to see how far from the typical repair cost this bill is: This is less than one standard deviation from the mean, so this is fairly common. About 29% of cars have more expensive repair bills.

Based on data from the Highway Loss Data Institute, collision repair costs for Honda Odyssey minivans are found to have a distribution that is roughly bell shaped with a mean of $1786 and a standard deviation of $937 Suppose that you discovered that First OK Honda recently repaired five Odyssey’s and the average repair bill was $2300. How typical is this? For a sample size of 5… Mean = 1786 St Dev = This is a bit more than one standard deviation from the mean, so this is not extremely rare. More expensive repair bills will happen about 11% of the time with samples of 5 cars.

Mean = 3.9 St Dev = Roughly Normal If we have 160 minutes for 40 songs, each song can last (on average) 4 minutes The probability that the total amount of time needed exceeds the available airtime is 0.2776