STA 291 - Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

SAMPLING DISTRIBUTIONS Chapter How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
AP Statistics: Section 9.1 Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Chapter 10: Estimating with Confidence
Sampling Distributions and Sample Proportions
Sociology 601: Class 5, September 15, 2009
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sampling Distributions
BCOR 1020 Business Statistics
Chapter 10: Estimating with Confidence
A P STATISTICS LESSON 9 – 1 ( DAY 1 ) SAMPLING DISTRIBUTIONS.
Chapter 5 Sampling Distributions
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Chapter 7: Sampling Distributions
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
AP Statistics Chapter 9 Notes.
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
Estimation of Statistical Parameters
Chapter 7 Statistical Inference: Confidence Intervals
Chapter 8 Introduction to Inference Target Goal: I can calculate the confidence interval for a population Estimating with Confidence 8.1a h.w: pg 481:
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
STAT 111 Introductory Statistics Lecture 9: Inference and Estimation June 2, 2004.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Lecture 14 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Agresti/Franklin Statistics, 1e, 1 of 139  Section 6.4 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 9 Samples.
9.3: Sample Means.
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Distributions of the Sample Mean
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Lecture 11 Dustin Lueker. 2  The larger the sample size, the smaller the sampling variability  Increasing the sample size to 25… 10 samples of size.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
STA Lecture 191 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234.
LECTURE 25 THURSDAY, 19 NOVEMBER STA291 Fall
AP Statistics: Section 9.1 Sampling Distributions.
STA Lecture 171 STA 291 Lecture 17 Chap. 10 Estimation – Estimating the Population Proportion p –We are not predicting the next outcome (which is.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Chapter 7: Sampling Distributions Section 7.1 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
1 Chapter 9: Sampling Distributions. 2 Activity 9A, pp
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Lecture 11 Dustin Lueker. 2  The larger the sample size, the smaller the sampling variability  Increasing the sample size to 25… 10 samples of size.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
MATH Section 4.4.
Chapter 7: The Distribution of Sample Means
LECTURE 26 TUESDAY, 24 NOVEMBER STA291 Fall
Chapter 8: Estimating with Confidence
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Lecture 13 Dustin Lueker. 2  Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
!! DRAFT !! STA 291 Lecture 14, Chap 9 9 Sampling Distributions
Sampling Distributions
LECTURE 24 TUESDAY, 17 November
STA 291 Spring 2010 Lecture 12 Dustin Lueker.
Week 10 Chapter 16. Confidence Intervals for Proportions
STA 291 Spring 2008 Lecture 11 Dustin Lueker.
MATH 2311 Section 4.4.
Sampling Distribution Models
STA 291 Spring 2008 Lecture 13 Dustin Lueker.
STA 291 Summer 2008 Lecture 12 Dustin Lueker.
STA 291 Spring 2008 Lecture 12 Dustin Lueker.
Presentation transcript:

STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal

Sampling Distributions – Sampling Distribution of Central limit theorem: no matter what the population look like, as long as we use SRS, and when sample size n is large, the above two sampling distribution are (very close to) normal. STA Lecture 162

is approximately normal with mean = p, SD = is approximately normal with mean = μ, SD = STA Lecture 163

4 Central Limit Theorem For random sampling, as the sample size n grows, the sampling distribution of the sample mean approaches a normal distribution. So does the sample proportion Amazing: This is the case even if the population distribution is discrete or highly skewed The Central Limit Theorem can be proved mathematically We will verify it experimentally in the lab sessions

STA Lecture 165 Central Limit Theorem Online applet 1 tml Online applet /spt/CLT-SampleMean.html

STA Lecture 166 Population distribution vs. sampling distribution Population distribution: = distribution of, a sample of size one from the population. In a simple random sample of size 4: each one has the distribution of the population. But the average of the 4 has a different distribution --– the sampling distribution of mean, when n=4.

STA Lecture 167 Has a distribution different from the population distribution: (1) shape is more normal (2) mean remains the same (3) SD is smaller (only half of the population SD)

STA Lecture 168 Population Distribution Distribution from which we select the sample Unknown, we want to make inference about its parameters Mean = ? Standard Deviation = ?

STA Lecture 169 Sample Statistic From the sample X 1, …, X n we compute descriptive statistics Sample Mean = Sample Standard Deviation = Sample Proportion = They all can be computed given a sample.

STA Lecture 1610 Sampling Distribution of a sample statistic Probability distribution of a statistic (for example, the sample mean) Describes the pattern that would occur if we could repeatedly take random samples and calculate the statistic as often as we wanted Used to determine the probability that a statistic falls within a certain distance of the population parameter The mean of the sampling distribution of is = The SD of is also called Standard Error =

STA Lecture 1611 The 3 features for the sampling distribution of sample mean also apply to sample proportion. (1. approach normal, 2. centered at p; 3. less and less SD) This sampling distribution tells us how far or how close between “p” and One quantity we can compute, the other we want to know

STA Lecture 1612 Central Limit Theorem For example: If the sample size is n = 100, then the sampling distribution of has mean p and SD (or standard error) =

STA Lecture 1613 Preview of estimation of p Estimation with error bound: Suppose we counted 57 “YES” in 100 interview. (SRS) Since we know just how far is to p. (that is given by the sampling distribution) 95% of the time is going to fall within two SD of p. SD = ?

57/100 = 0.57 = Sqrt of [ 0.57(1-0.57) ] = = 0.495/10 = STA Lecture 1614

Finally, with 95% probability, the difference between p and is within 2 SD or 2(0.0495) = STA Lecture 1615

STA Lecture 1616 Multiple choice question The standard error of a statistic describes 1.The standard deviation of the sampling distribution of that statistic 2.The variability in the values of the statistic for repeated random samples of size n. Both are true

STA Lecture 1617 Multiple Choice Question The Central Limit Theorem implies that 1.All variables have approximately bell-shaped sample distributions if a random sample contains at least 30 observations 2.Population distributions are normal whenever the population size is large 3.For large random samples, the sampling distribution of the sample mean (X-bar) is approximately normal, regardless of the shape of the population distribution 4.The sampling distribution looks more like the population distribution as the sample size increases

In previous page, 3 is correct. STA Lecture 1618

STA Lecture 1619 Chapter 10 Statistical Inference: Estimation of p –Inferential statistical methods provide predictions about characteristics of a population, based on information in a sample from that population –For quantitative variables, we usually estimate the population mean (for example, mean household income) –For qualitative variables, we usually estimate population proportions (for example, proportion of people voting for candidate A)

STA Lecture 1620 Two Types of Estimators Point Estimate –A single number that is the best guess for the parameter –For example, the sample mean is usually a good guess for the population mean Interval Estimate (harder) =point estimator with error bound –A range of numbers around the point estimate –To give an idea about the precision of the estimator –For example, “the proportion of people voting for A is between 67% and 73%”

STA Lecture 1621 Point Estimator A point estimator of a parameter is a sample statistic that predicts the value of that parameter A good estimator is –Unbiased: Centered around the true parameter –Consistent: Gets closer to the true parameter as the sample size gets larger –Efficient: Has a standard error that is as small as possible (made use of all available information)

STA Lecture 1622 Efficiency An estimator is efficient if its standard error is small compared to other estimators Such an estimator has high precision A good estimator has small standard error and small bias (or no bias at all) The following pictures represent different estimators with different bias and efficiency Assume that the true population parameter is the point (0,0) in the middle of the picture

STA Lecture 1623 Bias and Efficiency Note that even an unbiased and efficient estimator does not always hit exactly the population parameter. But in the long run, it is the best estimator.

Sample proportion is an unbiased estimator of the population proportion. It is consistent and efficient. STA Lecture 1624

STA Lecture 1625 Example: Estimators Suppose we want to estimate the proportion of UK students voting for candidate A We take a random sample of size n=400 The sample is denoted X 1, X 2,…, X n, where X i =1 if the ith student in the sample votes for A, X i =0 otherwise

STA Lecture 1626 Example: Estimators Estimator1 = the sample proportion Estimator2 = the answer from the first student in the sample (X 1 ) Estimator3 = 0.3 Which estimator is unbiased? Which estimator is consistent? Which estimator has high precision (small standard error)?

STA Lecture 1627 Attendance Survey Question On a 4”x6” index card –Please write down your name and section number –Today’s Question: –Table or web page for Normal distribution? Which one you like better?

STA Lecture 1628 Central Limit Theorem Usually, the sampling distribution of is approximately normal for n = 30 or above In addition, we know that the parameters of the sampling distribution are “μ” and For example: If the sample size is n=49, then the sampling distribution of has mean μ and SD (or standard error) =

STA Lecture 1629 Cont. Using the “empirical rule” with 95% probability will fall within 2 SD of its center, mu. (since the sampling distribution is approx. normal, so empirical rule apply. In fact, 2 SD should be refined to 1.96 SD) with 95% probability, the falls between

STA Lecture 1630 Unbiased An estimator is unbiased if its sampling distribution is centered around the true parameter For example, we know that the mean of the sampling distribution of “X-bar” equals “mu”, which is the true population mean So, “X-bar” is an unbiased estimator of “mu”

STA Lecture 1631 Unbiased However, for any particular sample, the sample mean “X-bar” may be smaller or greater than the population mean “Unbiased” means that there is no systematic under- or overestimation

STA Lecture 1632 Biased A biased estimator systematically under- or overestimates the population parameter In the definition of sample variance and sample standard deviation uses n-1 instead of n, because this makes the estimator unbiased With n in the denominator, it would systematically underestimate the variance

STA Lecture 1633 Point Estimators of the Mean and Standard Deviation The sample mean is unbiased, consistent, and (often) relatively efficient for estimating “mu” The sample standard deviation is almost unbiased for estimating population SD (no easy unbiased estimator exist) Both are consistent