Chapter 7: Sampling and Sampling distributions Statistical Inference is to make decisions that are based on data. We will study hypothesis testing, which.

Slides:



Advertisements
Similar presentations
Chapter 18 Sampling distribution models
Advertisements

Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Sampling Distributions
Sampling Distributions (§ )
Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
1 Sampling Distributions Chapter Introduction  In real life calculating parameters of populations is prohibitive because populations are very.
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Chapter Sampling Distributions and Hypothesis Testing.
Stat 301 – Day 20 Sampling Distributions for a sample proportion (cont.) (4.3)
Chapter 11: Random Sampling and Sampling Distributions
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 8: Confidence Intervals
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Biostatistics Dr. Chenqi Lu Telephone: Office: 2309 GuangHua East Main Building.
Determination of Sample Size: A Review of Statistical Theory
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
1 Chapter 8 Sampling Distributions of a Sample Mean Section 2.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Estimation Chapter 8. Estimating µ When σ Is Known.
PSY 307 – Statistics for the Behavioral Sciences Chapter 9 – Sampling Distribution of the Mean.
The Sampling Distribution of
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
Chapter 18 Sampling distribution models math2200.
Lecture 5 Introduction to Sampling Distributions.
Sampling Distributions Sampling Distributions. Sampling Distribution Introduction In real life calculating parameters of populations is prohibitive because.
SAMPLING DISTRIBUTION. 2 Introduction In real life calculating parameters of populations is usually impossible because populations are very large. Rather.
Chapter 14 Single-Population Estimation. Population Statistics Population Statistics:  , usually unknown Using Sample Statistics to estimate population.
Sampling Distributions
Sampling and Sampling Distributions
Chapter 7: Sampling Distributions
Ch5.4 Central Limit Theorem
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Sampling Distribution Estimation Hypothesis Testing
CHAPTER 7 Sampling Distributions
Statistics in Applied Science and Technology
AP Statistics: Chapter 7
CONCEPTS OF ESTIMATION
Sampling Distribution of the Sample Mean
MATH 2311 Section 4.4.
Normal Probability Distributions
Chapter 7: Sampling Distributions
Sampling Distribution of a Sample Proportion
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
CHAPTER 7 Sampling Distributions
Chapter 7: Sampling Distributions
Sampling Distributions (§ )
Chapter 7: Sampling Distributions
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
Chapter 7: Sampling Distributions
Warmup Which of the distributions is an unbiased estimator?
Presentation transcript:

Chapter 7: Sampling and Sampling distributions Statistical Inference is to make decisions that are based on data. We will study hypothesis testing, which is one of the tools of statistical inference. To do hypothesis testing, you will need to know sampling distributions, which are the distributions you encounter when doing sampling. For example, you want to estimate the mean of a distribution. You estimate the mean from your sample. If you continue to do this, you get many different values of sample means. If you find the frequency distribution of these means, that will approximate a sampling distribution. The continuous form of the distribution as the number of samples approaches infinity is a probability distribution know as a sampling distribution.

7.2 Sampling Distribution Central limit theorem: The sampling distribution of sample means approximates a normal distribution as the sample size becomes large no matter what the shape of the population distribution is. Population mean , variance  2 If sample size > 30, the distribution of the sample mean will approach a normal distribution.

Example Assume that a test is normally distributed with a mean of 800 and a standard deviation of 100. a) What is the probability that one person selected at random from the population will have a score at or above 850? z = (X -  )/  = (850 – 800)/100 = 0.5 P(z > 0.5) = b) What is the probability that a sample of 20 people selected at random from the population will have a mean score above 850? Part a requires a normal population to be valid; part b is valid without a normal population although a sample size of 30 or more would be better.

Learning Activity Sampling distribution calculations Assume a machine produces parts with a mean diameter of 60.2 and a s.d. of 2.4. What is the probability that a randomly selected part will have a diameter greater than 62? What is the probability that a sample of 17 parts will have a mean diameter greater than 62? See SamplDist.xls!Solution

Proportions ( 百分比 ) A proportion is a special case of a mean where the data are 0 and 1. The central limit theorem applies to proportions. (See 7.C-2 Proportion as a mean.) Mean of the proportion is (  the population proportion) Standard error of the proportion (s. d. of the proportion) where p represents a sample proportion, and  a population proportion. The mean of the proportion is  and its variance is  (1-  )/n.

Let X be a random variable representing the number of 1s in a sample of n 0s or 1s with the population proportion . Then X has a binomial distribution with E(X) = n  and V(X) = n  (1-  ). Now what are the mean and variance for the sample proportion p = X/n, which itself is a mean? The central limit theorem applies. That is, p is normally distributed if n is large, and E(p) = E(X/n) = n  /n = , V(p) = (1/n) 2 V(X) = (1/n 2 )  n  (1 -  ) =  (1-  )/n s.d. of p (or standard error) is Actually, the population proportion  is often unknown. We will use sample proportion p in the above equation.

TargetSample size EstimatorMeanStandard error  (mean) n sample mean   (proportion) np = X/n sample proportion   : population mean,  2 : population variance  : population proportion

7.A Generating Random Numbers In selecting random samples, it is necessary to generate random numbers. Random numbers are also used for Simulations and can be used to create sample datasets. In Excel you can generate random numbers Random numbersExcel function Between 0 and 1=RAND() [note: no 1s] Between 0 and 100=RAND()*100 [note: no 100s] Integers between 0 and 99=INT(RAND()*100) Integers between 1 to 100=INT(RAND()*100) + 1 Between a and b=RAND()*(b – a) + a You can also use MegaStat | Generate Random Numbers to generate numbers with uniform, normal or exponential distributions.

Learning Activity 7.A-1 Generating Random Numbers  Open RandomNumbers.xls!Start  Create a sample of 20 random numbers between 1 and 100 by using RAND() function.  Use ROUND function to round the value to 3 decimal places. See RandomNumbers.xls!RandInt.  Use MegaStat | Generate Random Numbers to generage 300 normal random numbers with mean of 100 and s.d of 16, specifying 0 decimal places and live function.  Look at how the normal random number are generated by =ROUND(NORMINV(RAND(),100,16),0). [The values from RAND() are random probabilities that are input into NORMINV() function to create normally distributed random numbers.] Note: NORMINV(probability, mean, standard_dev)  Check the histogram of the random numbers you created.

Randomizing data (to select random samples) You can rearrange (or shuffle) a column of existing values randomly. Learning Activity 7.A-2 Randomizing Data (take n samples randomly from a pool of data)  Open RandomNumbers.xls!PriceData.  Type =RAND() in cell C2 and copy it down through C125.  Click anywhere in the random number range and then click Excel’s Sort Ascending or Sort Descending.  You can take the first n values, they are your random samples.  If you want to put the numbers back in their original order, sort the No. column.

7.B Central Limit Theorem Simulation  Open CLT.xls. The Excel workbook contains 600 random samples (=RAND()*100,1), i.e. from a uniform population, and summarize the distribution of the population and the distribution of the means, that is the sampling distribution.)  See the distribution of the means. 7.C-2 Proportion as a mean A professor asks each of his student if he or she owns or rents and codes the data 1 = own, 0 = rent. The proportion of his students who owns their own home is p = count/n = 11/25 = You will get the same answer if you calculate the mean of the data.  Open Proportions.xls!Start.  Check Solution1.  Open Proportions.xls!Practice.  Check Solution2.