Psych 5500/6500 The Sampling Distribution of the Mean Fall, 2008.

Slides:



Advertisements
Similar presentations
The Normal Curve. Introduction The normal curve Will need to understand it to understand inferential statistics It is a theoretical model Most actual.
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Chapter 6 – Normal Probability Distributions
A Sampling Distribution
Unit 7 Section 6.1.
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Sampling Distributions and Sample Proportions
Central Limit Theorem.
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Sampling Distributions
Chapter Six z-Scores and the Normal Curve Model. Copyright © Houghton Mifflin Company. All rights reserved.Chapter The absolute value of a number.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 11: Random Sampling and Sampling Distributions
Inferential Statistics
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Confidence Intervals. Estimating the difference due to error that we can expect between sample statistics and the population parameter.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Probability Tables. Normal distribution table Standard normal table Unit normal table It gives values.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Standardized Score, probability & Normal Distribution
Chapter 5 Sampling Distributions
The Normal Distribution The “Bell Curve” The “Normal Curve”
AP Statistics Chapter 9 Notes.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
PARAMETRIC STATISTICAL INFERENCE
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
Rule of sample proportions IF:1.There is a population proportion of interest 2.We have a random sample from the population 3.The sample is large enough.
Some probability distribution The Normal Distribution
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Standardized Distributions Statistics Introduction Last time we talked about measures of spread Specifically the variance and the standard deviation.
Anthony J Greene1 Where We Left Off What is the probability of randomly selecting a sample of three individuals, all of whom have an I.Q. of 135 or more?
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Distributions of the Sample Mean
BUS304 – Chapter 6 Sample mean1 Chapter 6 Sample mean  In statistics, we are often interested in finding the population mean (µ):  Average Household.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
LSSG Black Belt Training Estimation: Central Limit Theorem and Confidence Intervals.
Psych 5500/6500 Probability Fall, Gambler’s Fallacy The “gambler’s fallacy” is to think that the “law of averages” makes independent events no longer.
Chapter 10: Introduction to Statistical Inference.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Estimating a Population Mean:  Known
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Chapter 7 Confidence Intervals and Sample Size McGraw-Hill, Bluman, 7 th ed., Chapter 7 1.
MATH Section 4.4.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Sample Means. Parameters The mean and standard deviation of a population are parameters. Mu represents the population mean. Sigma represents the population.
Sampling Distributions
Chapter 7 Probability and Samples
Sampling Distributions
MATH 2311 Section 4.4.
Sampling Distributions
Chapter 5: Sampling Distributions
How Confident Are You?.
MATH 2311 Section 4.4.
Chapter Outline The Normal Curve Sample and Population Probability
Presentation transcript:

Psych 5500/6500 The Sampling Distribution of the Mean Fall, 2008

Sampling Distribution of the Mean The 'sampling distribution of the mean’ (SDM): the population of all the sample means you could get if you sampled a certain number of scores from a certain population.

For Example In a previous semester I asked the students to draw a sample from a deck of playing cards.

Original Population from Which the Sample Was Drawn 4 cards of each type (jacks counted as ’11’, queens as ’12’, kings as ’13’). This is a graph of individual scores in the population (i.e. ‘Y’). The mean of the population of playing cards is μ Y =7 and its standard deviation is σ Y =3.74. Note the population is not normally distributed, and the exact values of μ and σ are known (not estimated).

Sample Means When N=4 The students were asked to sample four cards and find the mean of the sample. Not surprisingly they obtained many different sample means. Sample means from when n=4: 6.75, 5.5, 7.25, 7.5, 6.25, 8, 9.75, 7, 9.5, 4.25, 3.75, 9, 5.25, 7.75, 7.5, 11.5, 2.25, 9.25, 3.5, 8.75, 5.5, 6, 7.25, 7.75, 9.75, 7, 9.5, 7.5, 6.5, 9.25, 7.25, 7.25, 9.5

SDM for N=4 This is a graph of the 23 sample means (rounded off to the nearest whole number). We are starting to see the shape of the sampling distribution of the mean when n=4. Note that the mean of the sample means looks to be around ‘7’ (the mean of the original population, which is why the sample mean is an unbiased estimate of the population mean).

Sample Means When N=8 The students were then asked to sample eight cards and find the sample mean. Sample means from when n=8: 6.25, 8.25, 5.75, 5.38, 6.63, 7.5, 7.5, 9.13, 8.38, 5.63, 6.13, 5.88, 8.13, 7.75, 7.13, 4.7, 5.63, 6.63, 9.13, 5.88, 5.88, 5.13, 8.63, 6.13, 7.5, 9.13, 8.13, 7.63, 6.75, 7.88, 7.38, 7.50, 7.85

SDM for N=8 Again, this is a graph of the sample means for when N=8. And again, the mean of the sample means looks to be the same as the mean of the original population (7).

Comparisons The next three slides show the three graphs. Note the following: 1. While the population from which we sampled was not normally distributed, the graphs of the sample means begin to look more like normal curves. 2. The variance of the sample means is less than the variance of the original population, as n moves from 4 to 8, the variance of the sample means decreases (the sample mean is a ‘consistent’ estimate of the population mean).

Original Population from Which the Sample Was Drawn This is a graph of individual scores (Y).

SDM for N=4 This is a graph of sample means (when n=4).

SDM for N=8 This is a graph of the sample means for when N=8.

Short Cut The preceding approach for finding the sampling distribution of the mean would actually require that we obtain an infinite number of sample means to arrive at a true picture of the population of sample means we could obtain if we sampled a certain number of scores from a certain population (i.e. the SDM). This is a good way to introduce the concept of SDM but we need a short cut for actually producing an SDM...

1) The Shape of the SDM You can count on the SDM being normally distributed if either of the following two conditions are met.  The SDM will be normally distributed if the population you sampled from is normally distributed.  The SDM will be normally distributed (even if the population you sampled from is not) if the N of your sample is large enough (Central Limit Theorem). Rule of thumb: N ≥ 30

2) The Mean of the SDM The mean of the population of sample means equals the mean of the population from which you sample (that is why the sample mean is an ‘unbiased’ estimate of the population mean).

3) The Standard Deviation of the SDM The standard deviation of the sample means is less than the standard deviation of the population from which you sampled, as the means will vary less than the scores do.

Example: Original Population Let’s say the population is normally distributed, which means that the SDM will be normally distributed as well.

SDM for N=4

SDM for N=64

Probability and the SDM When the SDM is normally distributed we can answer certain types of questions. The following slides take us through a typical question from the homework assignment.

Question We will begin by repeating a process learned in an earlier lecture. We are sampling from a population that is normally distributed with a mean of 55 and a standard deviation of 10. What is the probability of drawing a score from that population that is between 50 and 60?

Original Population Step 1: draw and label the population.

Original Population Step 2: shade in the area of question.

Original Population Step 3: compute the z scores and look up the area under the normal curve. The probability of obtaining a single score between 50  Y  60 = =.3830 p=.3830

Question Now we are going to ask a new question. If we sample nine scores from that population, what is the probability of obtaining a sample mean that is between 50 and 60?

SDM for N=9 Step 1: draw the sampling distribution of the mean, which is the population of all the sample means we could get if we sample 9 scores from the original population. We know the SDM is normally distributed, its mean is the same as the mean of the population, and we can compute the standard deviation of the curve (‘standard error’). Note this is a population of sample means.

SDM for N=9 Step 2: shade in the area of question.

SDM & Standard Score To figure out the shaded area of the normal curve we need to change the sample means of 50 and 60 to standard scores.

As always, the standard score will be the ‘raw’ score on the graph (this is a graph of sample means) – the mean of the graph (the mean of the sample means) divided by the standard deviation of the graph (the standard deviation of the sample means, a.k.a. the ‘standard error’)

SDM for N=9 Step 3: compute the z scores and look up the area under the normal curve. The probability of obtaining a sample mean between 50 and 60 = p=.8664

Looking Back When we sampled one score from a normal population that had μ=55 and σ=10 there was a 38.3% chance that the score would be within 5 of the population mean. When we sampled 9 scores from that population there was a 86.64% chance that the sample mean would be within 5 of the population mean.

1-tail and 2-tail p values We are very close to doing some statistical analyses to test specific hypothesis. The next step is to play with scenarios such as: You sample 36 scores from a population that has a μ=80 and σ=12. For what value of the sample mean is there only a 5% chance that you would obtain a sample mean that is that far or farther above the population mean?

To set up the problem first draw the population you will be sampling from, and then the SDM (population of sample means for N=36). We don’t know if the population is normally distributed, do we know if the SDM is?

Formulas

What sample mean would be 1.65 standard deviations above the mean on this curve?

Conditional Probability Let’s think of it as a conditional probability.

Another Example You sample 36 scores from a population that has a μ=80 and σ=12. For what value of the sample mean is there only a 5% chance that you would obtain a sample mean that is that far or farther below the population mean?

Conditional Probability

Final Example You sample 36 scores from a population that has a μ=80 and σ=12. For what values of the sample mean is there only a 5% chance that you would obtain a sample mean that is that far or farther away from the population mean (in either direction)?

For a normal curve the z scores that cut off a total of the 5% most extreme scores (in both directions) are:

Conditional Probability