1 CHAPTER 7 Homework:5,7,9,11,17,22,23,25,29,33,37,41,45,51, 59,65,77,79 : The U.S. Bureau of Census publishes annual price figures for new mobile homes in Construction Reports. The information is obtained from sampling, not from a census. Suppose a random sample of 40 new mobile homes yields the prices shown in Table 7.1. Data are in thousands of dollars, rounded to the nearest hundred.
2 Table (a) Find the sample mean and sample standard deviation. (b) Find a point estimator for the population mean and a point estimator for the population standard deviation.
3 Output 7.01: Prices of Mobile Homes SAS summary statistics Analysis Variable : PRICE N Mean Std Dev Minimum Maximum
Figure 7.1 Prices of Mobil Homes
5 : (1) Point estimators do not provide any information of the reliability for itself. An interval estimator can provide the reliability information. Therefore, we want to construct an interval estimator of the population mean. (2) Usually, we need to have several reasonable assumptions about the population before we can actually construct the interval estimator.
6 Interval Estimator An interval estimator is a rule that tells us how to use sample data to form an interval estimator of the population parameters. Confidence Coefficient: The confidence coefficient of an interval estimator is the probability that this interval estimator encloses the population parameter. Confidence Level: The confidence level is the confidence coefficient expressed as a percentage.
7 Sec 7.1: Large sample interval estimation of the population mean (a) Is the sample representative? We need to make sure that the sample is randomly selected from the population. This is one of the most basic principles of statistical inference.
8 (b) Check the sample size If the sample size is greater than 30, you can use the large sample procedure because that: (i) the central limit theorem ensures at least approximate normality of the distribution of the sample mean and (ii) the law of large number ensures that the sample standard deviation provides a good estimator of the population standard deviation.
9 (c) Find the sample mean and standard error of the sample mean.
10 (d) Find z /2, where is equal to one minus confidence coefficient. The (1- )100% confidence interval for the population mean is then given by
11 Interpretation of a confidence interval: For any single given confidence interval of the population mean, we don't know whether it contains the population mean or not. But if this confidence interval (construction) procedure is used on a large number of random samples, then about (1 - ) * 100% of the intervals do contain the unknown population mean.
12 (Basic) Find a 90% confidence interval for the population mean, if (a). n = 125, sample mean = 13.1, and sample variance = (b). n = 50, sample mean = 21.9, and sample variance = 3.44.
13 (Basic) Acid rain, caused by the reaction of certain air pollutants with rain water, appears to be a growing problem in the north eastern section of the United States. Pure rainfall through clean air registers a pH value of 5.7. Suppose that water samples from 40 rainfalls are analyzed for pH and that mean and standard deviation are equal to 3.7 and 0.5, respectively. Find a 99% C.I. for the average pH in rainfall and interpret this interval. What assumptions must be made for the confidence interval to be valid?
14 (Intermediate) Suppose that a random sample (with sample mean ) of size n >= 30 is to be taken from a population with mean and standard deviation (a) Determine the probability that the interval will contain the population mean. (b) Interpret your result in part (a) in terms of percentages.
15 (Advance) (a) Figure 7.3 is the box plot of the density of earth data set. Table 1 Measurements of the density of the earth (b) Figure 7.5 is the bax plots of the same data wothout one extreme value. Briefly discuss the main features such as symmetry and outlier issue of this distribution. (c) What is your estimate of the density of the earth based on this density data set?
16 Output 7.02: SAS summary statistics Analysis Variable: DENSITY (All) N Mean Std Dev Minimum Maximum Analysis Variable: DENSITY (without 4.07) N Mean Std Dev Minimum Maximum
Figure 7.3 Density of the Earth
Figure 7.5 Density of the Earth
19 Sec 7.2: Small sample estimation of population mean: The student t Distribution (a) The p.d.f. of a t random variable is given by (b) The limiting distribution (as n approaches infinity) of t is normal distribution.
20 (c) Student t distribution is symmetric about its median or mean. (d) The variance of a student t random variable is larger than the variance of a standard normal random variable.
21 Figure 7.7 Student t Distribution Quantile aa Normal Curve t curve with 4 d.f.
22 (2). Steps to construct the confidence interval: (a) Is this sample representative? We need to make sure that the sample is randomly selected from the population. (b) Is the population approximately normal? Note that this small sample confidence interval is applicable only if the population has a normal distribution!!! This can be assessed roughly by using graphical tools to display the data.
23 (c) Find the sample mean and standard error (d) Find t /2,n-1 from Table VI, where (1 - ) is the confidence coefficient and n is the sample size. (e) The (1- )*100% confidence interval is
24 (Basic) A very costly experiment has been conducted to evaluate a new process for producing synthetic diamonds. Six diamonds have been produced by the new process with recorded weight 0.46, 0.61, 0.52, 0.48, 0.57, and 0.54 karat. (a) What assumptions do we need to use the t confidence interval? (b) Find a 95% C.I. of the population mean.
25 Output 7.03: SAS summary statistics Analysis Variable: DIAMOND N Mean Std Dev Minimum Maximum
26 (Basic) A random sample of size n=12 was selected from a normally distributed population. The sample mean is (a) Find the 95% C.I. of the population mean if the sample variance is 4.7. (b) Find the 95% C.I. of the population mean if the sample size is 64 and sample variance is 4.7. (c) Interpret the intervals in (a) and (b).
27 (Advance) It is recognized that the cigarette smoking has deleterious effect on lung function. A recent study found that the carbon monoxide diffusing capacity (DC) of the lung for 20 current smokers are as follows:
28 Output 7.04: SAS summary statistics Analysis Variable: CARBON N Mean Std Dev Minimum Maximum
29 (a). Find the sample mean and sample standard deviation. (b). What assumptions do you need to construct the t confidence interval of DC for current smokers. (c). Use stem-and-leaf display to check these assumptions. (d). Find 95% C.I. of DC for current smokers.
Figure 7.8 Diffusing Capacity Quantiles
31 Sec 7.3: Large Sample Estimation of the parameter for Binomial Population: Properties of Sample Proportion (1) Sample proportion is = x / n. (2) The expectation of sample proportion is E( ) = p. (3). Variance of is approximately
32 (4). For a sufficiently large sample, the sampling distribution of is approximately normal. (5). Usually, the sample size is large enough if the interval does not include 0 or 1. If the sample size is large enough, then an approximate 1- confidence interval for p is given by
33 (Basic) Suppose that 6841 U.S. households are selected at random in order to estimate the proportion, p, of all U.S. households that have a computer. If 2470 out of the 6841 households chosen have a computer, find the 95% C.I. for p.
34 (Intermediate) A telephone survey conducted by the Florida State University's Policy Science Program found that 74% of the 983 responding Florida adults were in favor of raising the drinking age from 19 to 21. (a). What assumptions is necessary for the 74% to be valid point estimator of the percentages of all the adult Floridians who favor raising the drinking age from 19 to 21? (b). Is it possible that the assumption in part (a) might not be satisfied in a telephone survey? (c). Assume that the assumptions in part (a) were satisfied by the pollsters, find a 95% C.I. for p.
35 Sec 7.4: Sample Size, Width of the Confidence Interval, and Confidence Level: (1). Width and Confidence Level: The width of a confidence interval increased when the confidence level of this interval increased.
36 (Intermediate) The U.S. Energy Information Administration surveys household in order to obtain data on monthly fuel expenditures for household vehicles. Suppose 60 monthly fuel expenditures for household vehicles are randomly selected and their mean is equal to with a standard deviation.
37 (a). Find a 95% confidence interval of household monthly expenditure on vehicles. (b). Find a 90% confidence interval of household monthly expenditure on vehicles. (c). Find a 99% confidence interval of household monthly expenditure on vehicles. (d). Discuss the relationship between confidence level and width of confidence interval.
38 (2). Samples Size and Width of Confidence Interval: Error Bound of Confidence Interval The error bound of the estimator of the population mean at confidence level 1 - is It is equal to (about) the half width of the confidence interval
39 Sample Size Requirements for Estimating the Population Mean The sample size required for a (1 - )*100% confidence interval for the population mean with given error bound B is given by
40 (Intermediate) Suppose that you wish to estimate the mean pH of rainfalls in an area that suffers heavy pollution due to the discharge of smoke from a power plant. (a). Assume that you know that sample standard deviation is 0.5 pH and that you wish to estimate within 0.1 of population mean with probability of Approximately how many rainfalls would have to include in your sample? (b). Would it be valid to select all your specimens from a single rainfall? (c). Suppose that sample s.d. is 0.2. Repeat part (a). Do you trust your answer? Explain.
41 Sample Size Requirement fot the Population Proportion The sample size required for a (1- )*100% confidence interval for the population proportion p with a given error bound B is given by
42 (Advance) Find the p for the following proportion p for n=100. (a). P=0.5. (b). P=0.6. (c). P=0.7. (d). P=0.4. (e). P=0.3.
43 Figure 7.10 Probabilities Standard Error
44 (Intermediate) Suppose that you want to estimate a binomial parameter p correct to within 0.04 (i.e., B = 0.04) at 95% confidence level. (a). How large should the sample size n be? (b). You suspect that P is equal to some value between 0.1 and 0.3. How large should the sample size n be? (c). You suspect that P is equal to some value between 0.6 and 0.8. How large should the sample size n be?