Sampling Distributions (§ )

Slides:



Advertisements
Similar presentations
Sampling Distributions
Advertisements

Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Sampling: Final and Initial Sample Size Determination
Sampling Distributions (§ )
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sampling Distributions
Chapter 7: Variation in repeated samples – Sampling distributions
Sampling We have a known population.  We ask “what would happen if I drew lots and lots of random samples from this population?”
AP Statistics Section 10.2 A CI for Population Mean When is Unknown.
BCOR 1020 Business Statistics
Standard error of estimate & Confidence interval.
Chapter 6: Sampling Distributions
Review of normal distribution. Exercise Solution.
Dan Piett STAT West Virginia University
AP Statistics Chapter 9 Notes.
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Estimation in Sampling!? Chapter 7 – Statistical Problem Solving in Geography.
AP Statistics 9.3 Sample Means.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.
Chapter 10: Confidence Intervals
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
One Sample Mean Inference (Chapter 5)
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to Inference Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
1 Probability and Statistics Confidence Intervals.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Sampling Distribution of the Sample Mean
Statistics 200 Objectives:
Chapter 6: Sampling Distributions
Statistical Inference
Inference for the Mean of a Population
Estimating the Value of a Parameter
Estimating the Value of a Parameter Using Confidence Intervals
ESTIMATION.
Inference: Conclusion with Confidence
Chapter 6: Sampling Distributions
Estimating the Population Mean Income of Lexus Owners
Statistical inference: distribution, hypothesis testing
Elementary Statistics
STATISTICS INFORMED DECISIONS USING DATA
Sampling Distribution Models
Econometric Models The most basic econometric model consists of a relationship between two variables which is disturbed by a random error. We need to use.
Chapter 6 Confidence Intervals.
Estimating the Value of a Parameter
Sampling Distributions
Sampling Distributions
Chapter 8: Confidence Intervals
The Normal Distribution
Interval Estimation Download this presentation.
How Confident Are You?.
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Sampling Distributions (§4.11 - 4.12) Typically we select sample data from a population in order to compute some statistic of interest. If we were to take two random samples from the same population, it would be very unlikely that we would find that we have computed the exact same value of the statistics. Hence, the value of the statistic will vary from sample to sample. That is the statistic itself is a random variable. In this lecture we discuss how statistics (functions of data) have distributions of their own, and how those distributions can be determined in some cases by means of the Central Limit Theorem.

Sampling Distribution of the Mean 1. Because no one sample is exactly like the next, the sample mean will vary from sample to sample, and hence is itself a random variable. 2. Random variables have distributions, and since the sample mean is a random variable it must have a distribution. 3. Regardless of the distribution of the measurements (discrete or continuous), the distribution of the sample mean can be approximated by a normal distribution [Central Limit Theorem]. 4. If the sample mean has a normal distribution, we can compute probabilities for specific events using the properties of the normal distribution.

A Resampling Experiment Suppose we are interested in finding the mean of a large population of individuals. A population too large to census. We decided to take a sample of 5 individuals, measure their responses and compute the sample mean. Now suppose we did this 1000 times (I.e. generate 1000 samples of size 5 and hence 1000 means). What would the distribution of these means look like? To make things interesting, assume the probability density function of the measurements in the populations has an exponential shape, with mean 1. Area under curve is one.

A sampling experiment: Draw a sample of size n from any population. Do many times. Compute the sample mean Add the sample mean to a list. Construct Histogram/Frequency Table

Example Continued Draw a random sample of 5 individuals from population. Compute sample mean. Add mean to list. If number of simulations less than 1000, return to 1 else go to 5. Make histogram of 1000 means. Sample Mean 1.3498 0.6293 0.7390 0.7377 1.2206

Mean of 10 Observations Draw a random sample of 10 individuals from population. Compute sample mean. Add mean to list. If number of simulations less than 1000, return to 1 else go to 5. Make histogram of 1000 means.

Comparison Note: vertical scales are different. Means of 5 Means of 10 Both have mean of about 1.0. Spread of right plot is narrower than left plot. Somehow, when we look at the distribution of samples of size 10 we have less spread than if we look at samples of size 5. Is there a general rule here?

Central Limit Theorem of Statistics If random samples, each with n measurements, are repeatedly drawn from the same population having true mean m and standard deviation s, then when n is large, the relative frequency histogram for the sample means (calculated from the repeated samples) will be approximately normal (bell-shaped) with mean m and standard deviation s/Ön, that is, Note: In addition, the approximation becomes closer to true normal as n increases.

Population and Sampling Distribution Distribution of means of random samples of size 10 from population. Distribution of measurements in population Random Sample Mean: m Standard Deviation: s Distribution: Anything Mean: m Standard Deviation: s/10 Distribution: Approx Normal

Illustrating the CLT for the distribution of the sample mean when drawing samples from an exponential distribution of mean 1 (based on 1000 draws).

Standard Error of the Mean The quantity s is referred to as the standard deviation. It is a measure of spread in the population. The quantity s/Ön is referred to as the standard error of the mean. It is a measure of spread in the distribution of means of random samples of size n from a population of measurements having true standard deviation s. I.e. it is just the standard deviation of

Uses of the Central Limit Theorem The Central Limit Theorem is “central” to Statistics because it allows us to make inferences (decisions) about unknown population parameters, from sample estimates (statistics). We can estimate the true mean and standard deviation of a population using the sample mean and sample standard deviation. Using the sample mean, the sample standard deviation, and the central limit theorem, we can develop hypothesis tests to determine whether the TRUE population mean is equal to some specific value, AND/OR, construct confidence intervals for the true mean.

Population: mean μ, std. dev. σ Draw sample (size n) Estimate: Draw Inferences: Quantify uncertainty in estimates (confidence intervals and hypothesis tests).