Section 4.4 Sampling Distribution Models and the Central Limit Theorem Transition from Data Analysis and Probability to Statistics.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Lesson Estimating a Population Proportion.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
AP Statistics: Section 9.1 Sampling Distributions
Sampling Distributions and Sample Proportions
Sampling Distributions
Ch 9 實習.
Sampling Distributions
1 Sampling Distributions Chapter Introduction  In real life calculating parameters of populations is prohibitive because populations are very.
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 18 Sampling Distribution Models and the Central Limit Theorem Transition from Data Analysis and Probability to Statistics.
Chapter 11: Random Sampling and Sampling Distributions
SAMPLING DISTRIBUTION
1 Sampling Distributions Presentation 2 Sampling Distribution of sample proportions Sampling Distribution of sample means.
Chapter 5 Sampling Distributions
Copyright © 2010 Pearson Education, Inc. Slide
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
AP Statistics Chapter 9 Notes.
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Estimation This is our introduction to the field of inferential statistics. We already know why we want to study samples instead of entire populations,
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
AP STATISTICS LESSON INFERENCE FOR A POPULATION PROPORTION.
AP Statistics: Section 9.1 Sampling Distributions.
Chapter 18: Sampling Distribution Models
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Confidence Interval for p, Using z Procedure. Conditions for inference about proportion Center: the mean is ƥ. That is, the sample proportion ƥ is an.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Lesson Estimating a Population Proportion.
Chapter 18 Sampling distribution models math2200.
Sampling Theory and Some Important Sampling Distributions.
Lecture 5 Introduction to Sampling Distributions.
Sampling Distributions Sampling Distributions. Sampling Distribution Introduction In real life calculating parameters of populations is prohibitive because.
SAMPLING DISTRIBUTION. 2 Introduction In real life calculating parameters of populations is usually impossible because populations are very large. Rather.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
AP STATS: WARM UP I think that we might need a bit more work with graphing a SAMPLING DISTRIBUTION. 1.) Roll your dice twice (i.e. the sample size is 2,
Sampling Distribution Models and the Central Limit Theorem Transition from Data Analysis and Probability to Statistics.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Sampling Distributions
Chapter 6: Sampling Distributions
Ch 9 實習.
Sampling Distributions
Chapter 4 Section 4.4 Sampling Distribution Models
Parameter versus statistic
Sampling Distributions and Estimation
Chapter 17 Sampling Distribution Models and the Central Limit Theorem
Another Population Parameter of Frequent Interest: the Population Mean µ
Chapter 5 Sampling Distributions
Section 4.4 Sampling Distribution Models and the Central Limit Theorem
Chapter 5 Sampling Distributions
CONCEPTS OF ESTIMATION
Chapter 7 Sampling Distributions.
Problems: Q&A chapter 6, problems Chapter 6:
Econ 3790: Business and Economics Statistics
MATH 2311 Section 4.4.
Chapter 5 Sampling Distributions
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
SAMPLING DISTRIBUTION
Warmup Which of the distributions is an unbiased estimator?
Chapter 7 Sampling Distributions.
Ch 9 實習.
MATH 2311 Section 4.4.
Presentation transcript:

Section 4.4 Sampling Distribution Models and the Central Limit Theorem Transition from Data Analysis and Probability to Statistics

Probability: n From population to sample (deduction) Statistics: n From sample to the population (induction)

Sampling Distributions n Population parameter: a numerical descriptive measure of a population. (for example:  p (a population proportion); the numerical value of a population parameter is usually not known) Example:  mean height of all NCSU students p=proportion of Raleigh residents who favor stricter gun control laws n Sample statistic: a numerical descriptive measure calculated from sample data. (e.g, x, s, p (sample proportion))

Parameters; Statistics n In real life parameters of populations are unknown and unknowable. –For example, the mean height of US adult (18+) men is unknown and unknowable n Rather than investigating the whole population, we take a sample, calculate a statistic related to the parameter of interest, and make an inference. n The sampling distribution of the statistic is the tool that tells us how close the value of the statistic is to the unknown value of the parameter.

DEF: Sampling Distribution n The sampling distribution of a sample statistic calculated from a sample of n measurements is the probability distribution of values taken by the statistic in all possible samples of size n taken from the same population. Based on all possible samples of size n.

n In some cases the sampling distribution can be determined exactly. n In other cases it must be approximated by using a computer to draw some of the possible samples of size n and drawing a histogram.

n If a coin is fair the probability of a head on any toss of the coin is p = 0.5. n Imagine tossing this fair coin 5 times and calculating the proportion p of the 5 tosses that result in heads (note that p = x/5, where x is the number of heads in 5 tosses). n Objective: determine the sampling distribution of p, the proportion of heads in 5 tosses of a fair coin. Sampling distribution of p, the sample proportion; an example

Sampling distribution of p (cont.) Step 1: The possible values of p are 0/5=0, 1/5=.2, 2/5=.4, 3/5=.6, 4/5=.8, 5/5=1 n Binomial Probabilities p(x) for n=5, p = 0.5 xp(x) p P(p) The above table is the probability distribution of p, the proportion of heads in 5 tosses of a fair coin.

Sampling distribution of p (cont.) n E(p) =0* * * * * * = 0.5 = p (the prob of heads) n Var(p) = n So SD(p) = sqrt(.05) =.2236 n NOTE THAT SD(p) = p P(p)

Expected Value and Standard Deviation of the Sampling Distribution of p n E(p) = p n SD(p) = where p is the “success” probability in the sampled population and n is the sample size

Shape of Sampling Distribution of p n The sampling distribution of p is approximately normal when the sample size n is large enough. n large enough means np ≥ 10 and n(1-p) ≥ 10

Shape of Sampling Distribution of p Population Distribution, p=.65 Sampling distribution of p for samples of size n

Example n 8% of American Caucasian male population is color blind. n Use computer to simulate random samples of size n = 1000

The sampling distribution model for a sample proportion p Provided that the sampled values are independent and the sample size n is large enough, the sampling distribution of p is modeled by a normal distribution with E(p) = p and standard deviation SD(p) =, that is where q = 1 – p and where n large enough means np>=10 and nq>=10 The Central Limit Theorem will be a formal statement of this fact.

Example: binge drinking by college students n Study by Harvard School of Public Health: 44% of college students binge drink. n 244 college students surveyed; 36% admitted to binge drinking in the past week n Assume the value 0.44 given in the study is the proportion p of college students that binge drink; that is 0.44 is the population proportion p n Compute the probability that in a sample of 244 students, 36% or less have engaged in binge drinking.

Example: binge drinking by college students (cont.) n Let p be the proportion in a sample of 244 that engage in binge drinking. n We want to compute n E(p) = p =.44; SD(p) = n Since np = 244*.44 = and nq = 244*.56 = are both greater than 10, we can model the sampling distribution of p with a normal distribution, so …

Example: binge drinking by college students (cont.)

Example: texting by college students n 2008 study : 85% of college students with cell phones use text messageing. n 1136 college students surveyed; 84% reported that they text on their cell phone. n Assume the value 0.85 given in the study is the proportion p of college students that use text messaging; that is 0.85 is the population proportion p n Compute the probability that in a sample of 1136 students, 84% or less use text messageing.

Example: texting by college students (cont.) n Let p be the proportion in a sample of 1136 that text message on their cell phones. n We want to compute n E(p) = p =.85; SD(p) = n Since np = 1136*.85 = and nq = 1136*.15 = are both greater than 10, we can model the sampling distribution of p with a normal distribution, so …

Example: texting by college students (cont.)

Another Population Parameter of Frequent Interest: the Population Mean µ n To estimate the unknown value of µ, the sample mean x is often used. n We need to examine the Sampling Distribution of the Sample Mean x (the probability distribution of all possible values of x based on a sample of size n).

Example n Professor Stickler has a large statistics class of over 300 students. He asked them the ages of their cars and obtained the following probability distribution : x x p(x)1/141/142/142/142/143/143/14 n SRS n=2 is to be drawn from pop. n Find the sampling distribution of the sample mean x for samples of size n = 2.

Solution n 7 possible ages (ages 2 through 8) n Total of 7 2 =49 possible samples of size 2 n All 49 possible samples with the corresponding sample mean are on p. 47.

Solution (cont.) n Probability distribution of x: x p(x) 1/196 2/196 5/196 8/196 12/196 18/196 24/196 26/196 28/196 24/196 21/196 18/196 1/196 n This is the sampling distribution of x because it specifies the probability associated with each possible value of x n From the sampling distribution above P(4  x  6) = p(4)+p(4.5)+p(5)+p(5.5)+p(6) = 12/ / / / /196 = 108/196

Expected Value and Standard Deviation of the Sampling Distribution of x

Example (cont.) n Population probability dist. x p(x)1/141/142/142/142/143/143/14 n Sampling dist. of x x p(x) 1/196 2/196 5/196 8/196 12/196 18/196 24/196 26/196 28/196 24/196 21/196 18/196 1/196

Population probability dist. x p(x)1/141/142/142/142/143/143/14 Sampling dist. of x x p(x) 1/196 2/196 5/196 8/196 12/196 18/196 24/196 26/196 28/196 24/196 21/196 18/196 1/196 Population mean E(X)=  = E(X)=2(1/14)+3(1/14)+4(2/14)+ … +8(3/14)=5.714 E(X)=2(1/196)+2.5(2/196)+3(5/196)+3.5(8/196)+4(12/196)+4.5(18/196)+5(24/196) +5.5(26/196)+6(28/196)+6.5(24/196)+7(21/196)+7.5(18/196)+8(1/196) = Mean of sampling distribution of x: E(X) = 5.714

Example (cont.) SD(X)=SD(X)/  2 =  /  2

IMPORTANT

Sampling Distribution of the Sample Mean X: Example n An example –A fair 6-sided die is thrown; let X represent the number of dots showing on the upper face. –The probability distribution of X is x p(x) 1/6 1/6 1/6 1/6 1/6 1/6 Population mean  :  = E(X) = 1(1/6) +2(1/6) + 3(1/6) +……… = 3.5. Population variance  2  2 =V(X) = (1-3.5) 2 (1/6)+ (2-3.5) 2 (1/6)+ ……… ………. = 2.92

Suppose we want to estimate  from the mean of a sample of size n = 2. n What is the sampling distribution of in this situation?

/36 5/36 4/36 3/36 2/36 1/36 E( ) =1.0(1/36)+ 1.5(2/36)+….=3.5 V(X) = ( ) 2 (1/36)+ ( ) 2 (2/36)... = 1.46

Notice that is smaller than Var(X). The larger the sample size the smaller is. Therefore, tends to fall closer to , as the sample size increases.

The variance of the sample mean is smaller than the variance of the population. 123 Also, Expected value of the population = ( )/3 = 2 Mean = 1.5Mean = 2.5Mean = 2. Population Expected value of the sample mean = ( )/3 = 2 Compare the variability of the population to the variability of the sample mean. Let us take samples of two observations

Properties of the Sampling Distribution of x

BUS Topic The central tendency is down the center Unbiased Handout 6.1, Page 1 µ l Confidence l Precision

Consequences

A Billion Dollar Mistake n “Conventional” wisdom: smaller schools better than larger schools n Late 90’s, Gates Foundation, Annenberg Foundation, Carnegie Foundation n Among the 50 top-scoring Pennsylvania elementary schools 6 (12%) were from the smallest 3% of the schools n But …, they didn’t notice … n Among the 50 lowest-scoring Pennsylvania elementary schools 9 (18%) were from the smallest 3% of the schools

A Billion Dollar Mistake (cont.) n Smaller schools have (by definition) smaller n’s. n When n is small, SD(x) = is larger n That is, the sampling distributions of small school mean scores have larger SD’s n s-foundation-schools-oped- cx_dr_1119ravitch.html s-foundation-schools-oped- cx_dr_1119ravitch.html

We Know More! n We know 2 parameters of the sampling distribution of x :