BCOR 1020 Business Statistics

Name: BCOR 1020 Business Statistics
Uploaded: 2017-08-24T17:52:48+00:00
Duration: PTM14S56
Description: BCOR 1020 Business Statistics

BCOR 1020 Business Statistics
Lecture 16 – March 13, 2008

Overview Chapter 8 – Sampling Distributions and Estimation An Example:
Sampling Variation Estimators and Sampling Distributions Sample Mean and the Central Limit Theorem Confidence Intervals Mean (m) with variance (s) known

Chapter 8 – Introductory Example
Suppose your business is planning on bringing a new product to market. There is a business case to proceed only if the cost of production is less than $10 per unit and At least 20% of your target market is willing to pay $25 per unit to purchase this product. How do you determine whether or not to proceed? You will likely conduct experiments/surveys to estimate these variables and make appropriate inferences

Chapter 8 – Introductory Example
Assume the cost of production can be modeled as a continuous variable. You can conduct a random sample of the manufacturing process and collect cost data. If 40 randomly selected production runs yield and average cost of $9.00 with a standard deviation of $1.00, what can you conclude? The percentage of your target market that is willing to pay $25 per unit to purchase this product can be modeled as the probability of a “success” for a binomial variable. You can conduct survey research on your target market. If 200 people are surveyed and 44 say they would pay $25 to purchase this product, what can you conclude?

Chapter 8 – Sampling Variation
Sample statistic – a random variable whose value depends on which population items happen to be included in the random sample. Depending on the sample size, the sample statistic could either represent the population well or differ greatly from the population. This sampling variation can easily be illustrated. Consider eight random samples of size n = 5 from a large population of GMAT scores for MBA applicants. The average of the sample means is fairly close to the population mean (m = 520), but there is considerable variation between samples.

Chapter 8 – Estimators and Sampling Distributions
An estimator is a statistic computed from a random sample which is used to estimate an unknown population parameter (q ). Generally denoted by Some Common Estimators and the parameters they estimate… The sample mean is an estimator for the population mean m. The sample std. dev. S is an estimator for the population std. dev. s. The sample proportion of successes is an estimator for the population proportion of successes p.

The sampling distribution of an estimator is the probability distribution of all possible values the statistic may assume when a random sample of size n is taken. An estimator is a random variable since samples vary Sampling error =  –  ^

Bias: Bias is the difference between the expected value of the estimator and the true parameter. Bias = E(  ) –  ^ An estimator is unbiased if E(  ) =  ^ On average, an unbiased estimator neither overstates nor understates the true parameter. Sampling error is random whereas bias is systematic. An unbiased estimator avoids systematic error.

Efficiency: Efficiency refers to the variance of the estimator’s sampling distribution. A more efficient estimator has smaller variance.

Consistency: A consistent estimator converges toward the parameter being estimated as the sample size increases.

Chapter 8 – Sample Mean and the Central Limit Theorem
The sample mean is an unbiased estimator of m. Therefore, where m is the mean of the population from which we are sampling. The standard error of the mean is the standard deviation of the sampling error of x : where s is the standard deviation of the population from which we are sampling.

If the population is exactly normal, then the sample mean follows a normal distribution.

For example, the average price, m, of a 5 GB MP3 player is $80.00 with a standard deviation, s, equal to $ What will be the mean and standard error from a sample of 20 players? E( X ) = E(X) = m = $80.00 sx = s n = 10 20 = $2.236 If the distribution of prices for these players is a normal distribution, then the sampling distribution on x is N(80.00, 2.236).

Clickers The average price, m, of a titanium mountain bike
is $3, with a standard deviation, s, equal to $ What will be the mean and standard error from a sample of 15 titanium mountain bikes? (A) (B) (C) (D) _ E( X ) = $ , sX = $16.67 _ E( X ) = $ , sX = $64.55 _ E( X ) = $ , sX = $250.00 _ E( X ) = $250.00, sX = $16.67

Central Limit Theorem (CLT) for a Mean: If a random sample of size n is drawn from a population with mean m and standard deviation s, the distribution of the sample mean x approaches a normal distribution with mean m and standard deviation sx = s/ n as the sample size increase. If the population is normal, the distribution of the sample mean is normal regardless of sample size.

In general, we can approximate the distribution of the sample average with the normal distribution if If the population from which we are sampling is symmetric, then a smaller n is required to use this approximation.

Range of Sample Means: The CLT permits a range or interval within which the sample means are expected to fall. m + z s n Where z is from the standard normal table. If we know m and s, the range of sample means for samples of size n are predicted to be: m s n 90% Interval m s n 95% Interval m s n 99% Interval

Illustration: GMAT Scores For samples of size n = 5 applicants, within what range would GMAT means be expected to fall? The parameters are m = and s = The predicted range for 95% of the sample means is: m s n = =

Sample Size and Standard Error: The standard error declines as n increases, but at a decreasing rate. Make the interval small by increasing n. m + z s n

Chapter 8 – Confidence Interval for a Mean (m) with Known s
What is a Confidence Interval? A sample mean x is a point estimate of the population mean m. (It is always wrong! – discuss) A confidence interval for the mean is a range mlower < m < mupper The confidence level is the probability that the confidence interval contains the true population mean. The confidence level (usually expressed as a %) is the area under the curve of the sampling distribution.

What is a Confidence Interval? The confidence interval for m with known s is:

Derivation of the Confidence Interval: We can choose a value of the standard normal, za, such that P(-za < Z < za) = 100(1-a)% {95% for example} If we are sampling from a normal population with mean m and known standard deviation s, then We then substitute this Z into the inequality in the probability above and solve for m …

Choosing a Confidence Level: A higher confidence level leads to a wider confidence interval. Greater confidence implies loss of precision. 95% confidence is most often used.

Choosing a Confidence Level: For example, to generate a 95% confidence interval, we choose za = 1.96.

Clickers Using the table below, what value of za would you choose to find a 99% confidence interval? A = B = 1.96 C = D = 2.576

Assumptions: The standard deviation, s, is known. The sample is drawn from a normal population or n is large enough to use the Central Limit Theorem. Interpretation: A confidence interval either does or does not contain m. The confidence level quantifies the risk. Out of % confidence intervals, approximately 95% would contain m, while approximately 5% would not contain m.

Is s Ever Known? Yes, but not very often. In quality control applications with ongoing manufacturing processes, assume s stays the same over time. In this case, confidence intervals are used to construct control charts to track the mean of a process over time.

Example: A business is considering whether or not to continue using a particular vendor that has had some trouble with shipping. If we assume shipment times have a normal distribution and it is known the standard deviation of shipment times from this vendor is s = 2.5 days and a random sample of 20 shipments from this vendor yields an average shipment time of 7.5 days, … find a 90% confidence interval for the vendors mean shipping time, m.

Example: Using the table below, we determine za for a 90% C.I. za = 1.645 From the problem, we have n = 20, s = 2.5 and days. The 90% C.I. for mean shipment time (in days) is

Clickers A = $8.00 < m < $10.00 B = $8.69 < m < $9.31
Suppose we have a random sample of 40 production runs with an average cost of production of If the standard deviation of the cost of production is known to be s = $1.00, Find the 95% Confidence Interval for the mean production cost. A = $8.00 < m < $10.00 B = $8.69 < m < $9.31 C = $7.04 < m < $10.96 D = $8.98 < m < $9.03

BCOR 1020 Business Statistics

Similar presentations

Presentation on theme: "BCOR 1020 Business Statistics"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

BCOR 1020 Business Statistics

Similar presentations

Presentation on theme: "BCOR 1020 Business Statistics"— Presentation transcript:

Similar presentations

About project

Feedback