Statistical Techniques I EXST7005 Distribution of Sample Means.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Assumptions for Z Confidence Intervals
Statistical Techniques I
Sampling Distributions
Sampling Distributions (§ )
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Central Limit Theorem.
Chapter 7 Introduction to Sampling Distributions
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Evaluating Hypotheses
Chapter 2 Simple Comparative Experiments
Introduction to Probability and Statistics Chapter 7 Sampling Distributions.
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Inferential Statistics
The Central Limit Theorem For simple random samples from any population with finite mean and variance, as n becomes increasingly large, the sampling distribution.
Standard error of estimate & Confidence interval.
Chapter 6 Sampling and Sampling Distributions
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Statistical Techniques I EXST7005 Review. Objectives n Develop an understanding and appreciation of Statistical Inference - particularly Hypothesis testing.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
Topic 5 Statistical inference: point and interval estimate
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Sampling W&W, Chapter 6. Rules for Expectation Examples Mean: E(X) =  xp(x) Variance: E(X-  ) 2 =  (x-  ) 2 p(x) Covariance: E(X-  x )(Y-  y ) =
Anthony J Greene1 Where We Left Off What is the probability of randomly selecting a sample of three individuals, all of whom have an I.Q. of 135 or more?
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Sampling Theory The procedure for drawing a random sample a distribution is that numbers 1, 2, … are assigned to the elements of the distribution and tables.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc.. Chap 7-1 Developing a Sampling Distribution Assume there is a population … Population size N=4.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Chapter 18: Sampling Distribution Models
Chapter 10: Introduction to Statistical Inference.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to Inference Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Lecture 5 Introduction to Sampling Distributions.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
m/sampling_dist/index.html.
Chapter Eleven Sample Size Determination Chapter Eleven.
Chapter 7, continued.... IV. Introduction to Sampling Distributions Suppose you take a second sample of n=30 and calculate your estimators again: s =
Sampling Distributions
Measures of Dispersion
Chapter 7 Sampling and Sampling Distributions
Sample Size Determination
Sampling Distributions
Behavioral Statistics

Chapter 7 Sampling Distributions.
Elementary Statistics
Chapter 7 Sampling Distributions.
Daniela Stan Raicu School of CTI, DePaul University
Daniela Stan Raicu School of CTI, DePaul University
Chapter 7 Sampling Distributions.
Sampling Distribution of a Sample Proportion
CHAPTER 15 SUMMARY Chapter Specifics
Sampling Distributions
Chapter 7 Sampling Distributions.
Sampling Distribution of a Sample Proportion
Sampling Distributions (§ )
Chapter 7 Sampling Distributions.
Presentation transcript:

Statistical Techniques I EXST7005 Distribution of Sample Means

OBJECTIVES n Usually we will be testing hypotheses about means. We will need some additional information about the nature of means of samples in order to do hypothesis tests.

Distribution of Sample Means Means are the basis for testing hypotheses about , the most common types of hypothesis tests. n Imagine a POPULATION from which we are drawing samples. è Population size = N  Mean =   Variance =  2 è Parent population values are: – Yi = Y1, Y2, Y3,..., YN

Distribution of Sample Means (continued) n The samples of size n form a DERIVED POPULATION n There are Nn possible samples of size n that can be drawn from a population of size N (sampling WITH replacement). n for each sample we calculate a mean   Yk =  Yi/n è where k = 1, 2, 3,..., Nn

Distribution of Sample Means (continued) n The Derived Population of Means of samples of size n n Population size = Nn Mean =  Y Variance =  Y n Derived population values   Yk =  Y1,  Y2,  Y3,...,  YNn

Distribution of Sample Means (continued) n Mean of the DERIVED POPULATION   Y =  Yk/Nn è where k = 1, 2, 3,..., Nn n Variance of the DERIVED POPULATION   2  Y =  Yk-  )2/Nn è where k = 1, 2, 3,..., Nn è n = the sample size è N = the population size è Population size = Nn

Example of a Derived Population n Parent Population: Yi = 0, 1, 2, 3   =  Yi/N = 6/4=1.5   2 =  Yi-  )2/N = [(0-1.5)2+(1-1.5)2+(2-1.5)2+(3- 1.5)2]/4 = 5/4 = 1.25   = 1.12 Original Population r.f.

n The Derived Population è where n = 2 and è Nn = 42 = 16 n Draw all possible samples of size 2 from the Parent Population (sampling with replacement, so that values will occur more than once), and calculate  Y for each sample (Nn). The Derived Population

SampleMean 0, , , , , , , , , , , , , , , , 33.0 The Derived Population

Frequency table of the Derived Population MeansFrequencyRelative Freq Sum =161

Derived Population Histogram of the Derived Population n Note that the derived population is shaped more like the normal distribution than the original population.

Probability statement from the two distributions Find P(1  Y  2) n For the original population, P(1  Y  2)= n For the derived population, P(1  Y  2)= r.f

Given a population with mean  and variance  2, if we draw all possible samples of size n (with replacement) from the population and calculate  Y, then the derived population of all possible sample means will have  Mean:  Y =   Variance:  Y =  2/n  Standard deviation:  Y =  /  n =  2/n THEOREM on the distribution of sample means

THEOREM on the distribution of sample means (continued) n Notice that the variance and standard deviation of the mean have "n" in the denominator. As a result, the variance of the derived population becomes smaller as the sample size increases regardless of the value of the population variance.

CENTRAL LIMIT THEOREM n AS THE SAMPLE SIZE (n) INCREASES, THE DISTRIBUTION OF SAMPLE MEANS OF ALL POSSIBLE SAMPLES, OF A GIVEN SIZE FROM A GIVEN POPULATION, APPROACHES A NORMAL DISTRIBUTION IF THE VARIANCE IS FINITE. If the base distribution is normal, then the means are normal regardless of n.

CENTRAL LIMIT THEOREM (continued) n Why is this important? (and it is very important!) n If we are more interested in the MEANS (and therefore the distribution of the means) than the original distribution, then normality is a more reasonable assumption. n Often, perhaps even USUALLY, we will be MORE INTERESTED in the MEANS of the DISTRIBUTION THAN IN THE DISTRIBUTIONS of the INDIVIDUALS.

NOTES on the distribution of sample means as n increases,  Y and  Y decrease.  Y   for any n  Y   for any n > 1 as n increases and  Y becomes smaller, the distribution of  Y's becomes closer to  Y. (i.e. we get a better estimate).

Some new terms Reliability (as a statistical concept) - the closer the estimate of  to the actual value of , the more "reliable" the estimate. n Accuracy (as a statistical concept) - this term refers to the lack of bias in the estimate, and not how small the variance is. An estimate may be very accurate, but have a great deal of scatter about the mean.

Estimation of  Y and  Y n In practice we cannot draw all possible samples. Recall that E(S2) =  so, S2  Y = S2/  n is an estimate of  Y è where; – E(S2  Y) =  Y – and; – S2  Y is an estimate of the variance of sample means of size n – S2 is the estimate of the variance of observations

Estimation of  Y and  Y (continued) S2  Y = S2/  n is called the STANDARD ERROR to distinguish it from the Standard deviation è it is also called the Standard Deviation OF THE MEANS è NOTE: that this division is by "n" for both populations and samples, not by "n-1" as with the calculation of variance for samples.

Estimation of  Y and  Y (continued) S2  Y is a measure of RELIABILITY of the sample means as an estimate of the true population mean.  i.e. the smaller S2  Y, the more reliable  Y as an estimate of  n Ways of increasing RELIABILITY  Basically, anything that decreases our estimate of  Y makes our estimate more reliable.

Estimation of  Y and  Y (continued) How do we decrease our estimate of  Y?  Increase the sample size; if n increases then  Y decreases.  Decrease the variance; if our estimate of  decreases then  Y decreases. – This can sometimes be done by; l refining our measurement techniques l finding a more homogeneous population to measure

The Z transformation for a DERIVED POPULATION n We will use the Z transformation for two purposes, individuals and means. è for individuals use – Zi = (Yi -  )/  è for means we will use – Zi = (  Yi -  Y)/  Y 

Summary n Most testing of hypotheses will concern tests of a derived population of means. The mean of the derived population of sample means is  Y The Variance of the derived population of sample means is  Y

Summary (continued) n The CENTRAL LIMIT THEOREM is an important aspect of hypothesis testing because it states that sample means tend to be more nearly normally distributed than the parent population. n Reliability and accuracy are statistical concepts relating to variability and lack of bias, respectively.