Business Statistics for Managerial Decision

Slides:



Advertisements
Similar presentations
Hypothesis Testing When “p” is small, we reject the Ho.
Advertisements

AP Statistics Section 11.2 B. A 95% confidence interval captures the true value of in 95% of all samples. If we are 95% confident that the true lies in.
Objectives (BPS chapter 18) Inference about a Population Mean  Conditions for inference  The t distribution  The one-sample t confidence interval 
CHAPTER 9 Testing a Claim
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Introduction to Statistics
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Lecture 3 Miscellaneous details about hypothesis testing Type II error
Review of Basic Statistical Concepts Farideh Dehkordi-Vakil.
Section 9.3 One-sample z-test for a Population Mean
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Two-sample problems for population means BPS chapter 19 © 2006 W.H. Freeman and Company.
Confidence Intervals: Estimating Population Mean
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Experimental Statistics - week 2
Lesson Carrying Out Significance Tests. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
+ DO NOW What conditions do you need to check before constructing a confidence interval for the population proportion? (hint: there are three)
Estimating a Population Mean
10.1 Estimating With Confidence
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
PARAMETRIC STATISTICAL INFERENCE
CHAPTER 18: Inference about a Population Mean
Business Statistics for Managerial Decision Comparing two Population Means.
Two-Sided Hypothesis Tests. p ,29 p ,32,33 p
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
Essential Statistics Chapter 141 Thinking about Inference.
MATH 2400 Ch. 15 Notes.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Essential Statistics Chapter 161 Inference about a Population Mean.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Statistical Significance. p ,14,15 p ,22,24,26.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.3 Tests About a Population.
2003 Free Response Question #2 A simple random sample of adults living in a suburb of a large city was selected. The age and annual income of each adult.
+ Z-Interval for µ So, the formula for a Confidence Interval for a population mean is To be honest, σ is never known. So, this formula isn’t used very.
Chapter 7 Inference Concerning Populations (Numeric Responses)
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
When  is unknown  The sample standard deviation s provides an estimate of the population standard deviation .  Larger samples give more reliable estimates.
Inference about the mean of a population of measurements (  ) is based on the standardized value of the sample mean (Xbar). The standardization involves.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 18: Inference about a Population Mean
CHAPTER 9 Testing a Claim
AP STATISTICS LESSON 10 – 2 (DAY 3)
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 18: Inference about a Population Mean
Carrying Out Significance Tests
CHAPTER 18: Inference about a Population Mean
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

Business Statistics for Managerial Decision Inference for the Population Mean

Test for a Population Mean There are four steps in carrying out a significance test: State the hypothesis. Calculate the test statistic. Find the p-value. State your conclusion by comparing the p-value to the significance level .

Example: Blood pressures of executives The medical director of a large company is concerned about the effects of stress on the company’s younger executives. According to the National Center for health Statistics, the mean systolic blood pressure for males 35 to 44 years of age is 128 and the standard deviation in this population is 15. The medical director examines the records of 72 executives in this age group and finds that their mean systolic blood pressure is . Is this evidence that the mean blood pressure for all the company’s young male executives is higher than the national average? Use  = 5%.

Example: Blood pressures of executives Hypotheses: H0:  = 128 Ha:  > 128 Test statistic: P-value:

Example: Blood pressures of executives Conclusion: About 14% of the time, a SRS of size 72 from the general male population would have a mean blood pressure as high as that of executive sample. The observed is not significantly higher than the national average.

Example: A company-wide health promotion campaign The company medical director institutes a health promotion campaign to encourage employees to exercise more and eat a healthier diet. One measure of the effectiveness of such a program is a drop in blood pressure. Choose a random sample of 50 employees, and compare their blood pressures from annual physical examination given before the campaign and again a year later. The mean change in blood pressure for these n = 50 employees is . We take the population standard deviation to be  = 20. Use  = 5%.

Example: A company-wide health promotion campaign Hypotheses: H0:  =0 Ha:  < 0 Test statistic: P-value:

Example: A company-wide health promotion campaign Conclusion: A mean change in blood pressure of –6 or better would occur only 17 times in 1000 samples if the campaign had no effect on the blood pressures of the employees. This is convincing evidence that the mean blood pressure in the population of all employees has decreased.

Example:Testing Pharmaceutical products The Deely Laboratory analyzes pharmaceutical products to verify the concentration of active ingredients. Such chemical analyses are not perfectly precise. Repeated measurements on the same specimen will give slightly different results. The results of repeated measurements follow a Normal distribution quite closely, the analysis procedure has no bias, so that the mean  of the population of all measurements is the true concentration in the specimen. The standard deviation of this distribution is a property of the analytical procedure and is known to be  = 0.0068 gram per liter. The laboratory analyzes each specimen three times and reports the mean results.

Example:Testing Pharmaceutical products A client sends a specimen for which the concentration of active ingredient is supposed to be 0.86%. Deely’s three analyses give concentrations 0.8403 0.8363 0.8447 Is there significant evidence at the 1% level that the true concentration is not 0.86%?

Example:Testing Pharmaceutical products Hypotheses: H0:  = 0.86 Ha:   0.86 Test Statistic: The mean of the three analyses is . The one sample z test statistic is therefore

Example:Testing Pharmaceutical products We do not need to find the exact P-value to assess significance at the  = 0.01 level. Look in the table A under tail area 0.005 because the alternative is two-sided. The z-values that are significant at the 1% level are z > 2.575 and z < -2.575. Our observed z = -4.99 is significant

P-value versus fixed  In our example , we concluded that the test statistic z = -4.99 is significant at the 1% level. The observed z is far beyond the critical value for  = 0.01, and the evidence against H0 is far stronger than 1% significance suggests. The P-value P = .0000006 (from a statistical software) gives a better sense of how strong the evidence is. The P-value is the smallest level  at which the data are significant.

Inference for the Mean of a Population Both confidence intervals and tests of significance for the mean  of a Normal population are based on the sample mean , which estimates the unknown . The sampling distribution of depends on . There is no difficulty when  is known. When  is unknown, we must estimate it. The sample standard deviation s is used to estimate the population standard deviation .

The t-distribution Suppose we have a simple random sample of size n from a Normally distributed population with mean  and standard deviation . The standardized sample mean, or one-sample z statistic has the standard Normal distribution N(0, 1). When we substitute the standard deviation of the mean (standard error) s /n for the /n, the statistic does not have a Normal distribution.

The t-distribution It has a distribution called t-distribution. Suppose that a SRS of size n is drawn from a N(, ) population. Then the one sample t statistic has the t-distribution with n-1 degrees of freedom. There is a different t distribution for each sample size. A particular t distribution is specified by giving the degrees of freedom.

The t-distribution We use t(k) to stand for t distribution with k degrees of freedom. The density curves of the t-distributions are symmetric about 0 and are bell shaped. The spread of t distribution is a bit greater than that of standard Normal distribution. As degrees of freedom k increase, t(k) density curve approaches the N(0, 1) curve.

The one –Sample t Confidence Interval Suppose that an SRS of size n is drawn from a population having unknown mean . A level C confidence interval for  is Where t* is the value for the t (n-1) density curve with area C between –t* and t*. The margin of error is This interval is exact when the population distribution is Normal and is approximately correct for large n in other cases.

Example: Estimating the level of Vitamin C The following data are the amount of vitamin C, measured in milligram per 100 grams (mg/100g) of the corn soy blend (dry basis), for a random sample of size 8 from a production run: 26 31 23 22 11 22 14 31 We want to find a 95% confidence interval for , the mean vitamin C content of the corn soy blend (CSB) produced during this run.

Example: Estimating the level of Vitamin C The sample mean and the standard deviation s = 7.19 with degrees of freedom n-1 = 8-1 =7. The standard error of is From table we find t* = 2.365. The 95% confidence interval is

The one-sample t test:Summary

Example: Is the Vitamin C level correct? The specifications for the CSB state that the mixture should contain 2 pounds of vitamin premix fro every 2000 pounds of product. These specifications are designed to produce a mean () vitamin C content in the final product of 40mg/100 g. We test the null hypothesis that the mean vitamin C content of the production run in the previous example conforms to these specifications. Use  = 5%.

Example: Is the Vitamin C level correct? Hypotheses: H0:  = 40 Ha:   40 Test statistic: P-value: Because the degrees of freedom are n-1 =7, this t statistic has t(7) distribution.

Example: Is the Vitamin C level correct? From the largest entry in the df =7 line of the table we see that We conclude that the P-value is less than 20.0005, or P < .001. We reject H0 and conclude that the vitamin C content for this run is below the specification.

Matched Pairs t procedures Comparative studies are usually preferred to single-sample investigations because of the protection they offer against confounding. In a matched pairs study, subjects are matched in pairs and the outcomes are compared within each matched pair. One situation calling for matched pairs is before-and-after observations on the same subjects.

Matched Pairs t procedures A matched pair analysis is needed when there are two measurements or observations on each individual and we want to examine the change from the first to the second. For each individual, subtract the “before” measure from the “after” measure. Analyze the difference using the one-sample confidence interval and significance testing procedures.

Example: The effect of language instruction A company contracts with a language institute to provide individualized instruction in foreign languages for its executives who will be posted overseas. Is the instruction effective? Last year 20 executives studied French. All had some knowledge of French, so they were given the Modern Language Association’s listening test of understanding of spoken French before the instruction began. After several weeks of immersion in French, the executives took the listening test again. The following table gives the pretest and posttest scores.

Example: The effect of language instruction

Example: The effect of language instruction To analyze these data Subtract pretest score from the posttest score. These differences appear in the “gain” column in previous table. These 20 differences form a single sample. To assess whether the institute significantly improved the executives’ comprehension of spoken French we test H0:  = 0 Ha:  > 0

Example: The effect of language instruction Here  is the mean improvement that would be achieved if the entire population of executives received similar instruction. The null hypothesis says that no improvement occurs, and the alternative hypothesis says that posttest scores are higher on the average. The 20 differences have

Example: The effect of language instruction The one sample t statistic is therefore P-value is found from the t(19) distribution. T-table shows that 3.86 lies between the upper .001 and .0005 critical values of the t(19) distribution. The P-value therefore lies between these values.

Example: The effect of language instruction Conclusion: The improvement in scores was significant. We have strong evidence that the posttest scores are systematically higher. A statistically significant but very small improvement in language ability would not justify the expense of the individualized instruction. A confidence interval allows us to estimate the amount of improvement.

Example: The effect of language instruction Find a 90% confidence interval for the mean improvement in the entire population. The critical value t* = 1.729 from t-table for 90% confidence. The confidence interval is: