Hypothesis Testing Using a Single Sample

Slides:



Advertisements
Similar presentations
Chapter 10.  Real life problems are usually different than just estimation of population statistics.  We try on the basis of experimental evidence Whether.
Advertisements

1 Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Type I and Type II Errors One-Tailed Tests About a Population Mean: Large-Sample.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Objectives (BPS chapter 15)
Chapter 10: Hypothesis Testing
Pengujian Hipotesis Nilai Tengah Pertemuan 19 Matakuliah: I0134/Metode Statistika Tahun: 2007.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
HYPOTHESIS TESTS ABOUT THE MEAN AND PROPORTION
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Ch. 9 Fundamental of Hypothesis Testing
Statistics for Managers Using Microsoft® Excel 5th Edition
Hypothesis Testing with One Sample
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Hypothesis Testing A hypothesis is a conjecture about a population. Typically, these hypotheses will be stated in terms of a parameter such as  (mean)
Chapter 10 Hypothesis Testing
Overview Definition Hypothesis
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Section 10.2: Errors in Hypothesis Testing. Test Procedure – the method we use to determine whether H 0 should be rejected. Type 1 Error: the error of.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 9 Large-Sample Tests of Hypotheses
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Hypothesis Testing with One Sample Chapter 7. § 7.1 Introduction to Hypothesis Testing.
Hypothesis Tests with Proportions Chapter 10 Notes: Page 169.
Hypothesis Tests with Proportions Chapter 10. Write down the first number that you think of for the following... Pick a two-digit number between 10 and.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Chapter 20 Testing hypotheses about proportions
Testing of Hypothesis Fundamentals of Hypothesis.
Section 10.3: Large-Sample Hypothesis Tests for a Population Proportion.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Hypothesis Testing with One Sample Chapter 7. § 7.1 Introduction to Hypothesis Testing.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Section 10.4: Hypothesis Tests for a Population Mean.
Chapter 10 Lesson 10.2 Hypotheses and Test Procedures 10.2: Errors in Hypothesis Testing.
Chapter 11 Inference with Means Sampling Distribution, Confidence Intervals, and Hypothesis Tests.
Hypothesis Tests Hypothesis Tests Large Sample 1- Proportion z-test.
Inference with Proportions II Hypothesis Testing Using a Single Sample.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Hypothesis Testing Chapter Hypothesis Testing  Developing Null and Alternative Hypotheses  Type I and Type II Errors  One-Tailed Tests About.
AP Test Practice. A student organization at a university is interested in estimating the proportion of students in favor of showing movies biweekly instead.
CHAPTER 10 Asking and Answering Questions about a Population Proportion Created by Kathy Fritz.
AP Test Practice. After a frost warning was issued, the owner of a large orange grove asked his workers to spray all his trees with water. The water was.
Introduction to Inference Tests of Significance Proof
Chapter Nine Hypothesis Testing.
Hypothesis Testing Using a Single Sample
Hypothesis Tests with Proportions
Presentation transcript:

Hypothesis Testing Using a Single Sample Chapter 10 Hypothesis Testing Using a Single Sample

How do we answers questions like these using sample data? Sharing prescription drugs with others can be dangerous. Is this a common occurrence among teens? OR The National Association of Colleges and Employers stated that the average starting salary for students graduating with a bachelor’s degree in 2010 is $48,351. Is this true for your college? How do we answers questions like these using sample data? In Chapter 9, we used sample data to estimate the value of an unknown population characteristic. In this chapter, we will use sample data to test some claim or hypothesis about the population characteristic to see if it is plausible. To do this, we use a test of hypotheses or test procedure.

What is a test of hypotheses? Is it one of the values of the sample statistic that are likely to occur? A test of hypotheses is a method that uses sample data to decide between two competing claims (hypotheses) about the population characteristic. Is the value of the sample statistic . . . a random occurrence due to natural variation? OR Is it one that isn’t likely to occur? a value that would be considered surprising?

Hypothesis statements: You are usually trying to determine if this claim is believable. The null hypothesis, denoted by H0, is a claim about a population characteristic that is initially assumed to be true. The alternative hypothesis, denoted by Ha, is the competing claim. The hypothesis statements are ALWAYS about the population – NEVER about a sample! To determine what the alternative hypothesis should be, you need to keep the research objectives in mind.

Let’s consider a murder trial . . . What is the null hypothesis? What is the alternative hypothesis? To determine which hypothesis is correct, the jury will listen to the evidence. Only if there is “evidence beyond a reasonable doubt” would the null hypothesis be rejected in favor of the alternative hypothesis. You are trying to determine if the evidence supports this claim. So we will make one of two decisions: Reject the null hypothesis Fail to reject the null hypothesis This is what you assume is true before you begin. H0: the defendant is innocent If there is not convincing evidence, then we would “fail to reject” the null hypothesis. Remember that the actually verdict that is returned is “GUILTY” or “NOT GUILTY”. We never end up determining the null hypothesis is true – only that there is not enough evidence to say it’s not true. Ha: the defendant is guilty

The Form of Hypotheses: Null hypothesis H0: population characteristic = hypothesized value Alternative hypothesis Ha: population characteristic > hypothesized value Ha: population characteristic < hypothesized value Ha: population characteristic ≠ hypothesized value This one is considered a two-tailed test because you are interested in both direction. The null hypothesis always includes the equal case. This hypothesized value is a specific number determined by the context of the problem This sign is determined by the context of the problem. Notice that the alternative hypothesis uses the same population characteristic and the same hypothesized value as the null hypothesis. These are considered one-tailed tests because you are only interested in one direction. Let’s practice writing hypothesis statements.

H0: p = .1 Ha: p > .1 State the hypotheses : Sharing prescription drugs with others can be dangerous. A survey of a representative sample of 592 U.S. teens age 12 to 17 reported that 118 of those surveyed admitted to having shared a prescription drug with a friend. Is this sufficient evidence that more than 10% of teens have shared prescription medication with friends? State the hypotheses : What is the hypothesized value? What words indicate the direction of the alternative hypothesis? What is the population characteristic of interest? H0: p = .1 Ha: p > .1 The true proportion p of teens who have shared prescription medication with friends

State the hypotheses : H0: m = 8000 Ha: m < 8000 Compact florescent (cfl) lightbulbs are much more energy efficient than regular incandescent lightbulbs. Ecobulb brand 60-watt cfl lightbulbs state on the package “Average life 8000 hours”. People who purchase this brand would be unhappy if the bulbs lasted less than 8000 hours. A sample of these bulbs will be selected and tested. State the hypotheses : What is the hypothesized value? What words indicate the direction of the alternative hypothesis? What is the population characteristic of interest? H0: m = 8000 Ha: m < 8000 The true mean (m) life of the cfl lightbulbs

Because in variation of the manufacturing process, tennis balls produced by a particular machine do not have the same diameters. Suppose the machine was initially calibrated to achieve the specification of m = 3 inches. However, the manager is now concerned that the diameters no longer conform to this specification. If the mean diameter is not 3 inches, production will have to be halted. State the hypotheses : What words indicate the direction of the alternative hypothesis? What is the population characteristic of interest? H0: m = 3 Ha: m ≠ 3 The true mean m diameter of tennis balls

Must use a population characteristic - x is a statistics (sample) For each pair of hypotheses, indicate which are not legitimate and explain why Must use a population characteristic - x is a statistics (sample) Must be only greater than! Must use same number as in H0! H0 MUST be “=“ !

When you perform a hypothesis test you make a decision: reject H0 or fail to reject H0 When you make one of these decisions, there is a possibility that you could be wrong! That you made an error! Each could possibly be a wrong decision; therefore, there are two types of errors.

This is the lower-case Greek letter “alpha”. Type I error The error of rejecting H0 when H0 is true The probability of a Type I error is denoted by a. a is called the significance level of the test This is the lower-case Greek letter “alpha”.

This is the lower-case Greek letter “beta”. Type II error The error of failing to reject H0 when H0 is false The probability of a Type II error is denoted by b This is the lower-case Greek letter “beta”.

Here is another way to look at the types of errors: Suppose H0 is true and we fail to reject it, what type of decision was made? Suppose H0 is false and we reject it, what type of decision was made? Suppose H0 is true and we reject it, what type of decision was made? Suppose H0 is false and we fail to reject it, what type of decision was made? H0 is true H0 is false Reject H0 Fail to reject H0 Type I error Correct Type II error Correct

State a Type I error in context. The U.S. Bureau of Transportation Statistics reports that for 2009 72% of all domestic passenger flights arrived on time (meaning within 15 minutes of its scheduled arrival time). Suppose that an airline with a poor on-time record decides to offer its employees a bonus if, in an upcoming month, the airline’s proportion of on-time flights exceeds the overall 2009 industry rate of .72. State the hypotheses. Type I error – the airline decides to reward the employees when the proportion of on-time flights doesn’t exceeds .72 H0: p = .72 Ha: p > .72 State a Type I error in context. State a Type II error in context. Type II error – the airline employees do not receive the bonus when they deserve it.

State a Type I error in the context of this problem. In 2004, Vertex Pharmaceuticals, a biotechnology company, issued a press release announcing that it had filed an application with the FDA to begin clinical trials on an experimental drug VX-680 that had been found to reduce the growth rate of pancreatic and colon cancer tumors in animal studies. Data resulting from the planned clinical trials can be used to test: Let m = the true mean growth rate of tumors for patients taking the experimental drug H0: m = mean growth rate of tumors for patients not taking the experimental drug Ha: m < mean growth rate of tumors for patients not taking the experimental drug State a Type I error in the context of this problem. A potential consequence of making a Type I error would be that the company would continue to devote resources to the development of the drug when it really is not effective. What is a potential consequence of this error? A Type I error would be to incorrectly conclude that the experimental drug is effective in slowing the growth rate of tumors

Data resulting from the planned clinical trials can be used to test: In 2004, Vertex Pharmaceuticals, a biotechnology company, issued a press release announcing that it had filed an application with the FDA to begin clinical trials on an experimental drug VX-680 that had been found to reduce the growth rate of pancreatic and colon cancer tumors in animal studies. Data resulting from the planned clinical trials can be used to test: H0: m = mean growth rate of tumors for patients not taking the experimental drug Ha: m < mean growth rate of tumors for patients not taking the experimental drug State a Type II error in the context of this problem. A potential consequence of making a Type II error would be that the company might abandon development of a drug that was effective. What is a potential consequence of this error? A Type II error would be to conclude that the drug is ineffective when in fact the mean growth rate of tumors is reduced

The relationship between a and b The ideal test procedure would result in both a = 0 (probability of a Type I error) and b = 0 (probability of a Type II error). This is impossible to achieve since we must base our decision on sample data. Standard test procedures allow us to select a, the significance level of the test, but we have no direct control over b. Selecting a significance level a = .05 results in a test procedure that, used over and over with different samples, rejects a true H0 about 5 times in 100. So why not always choose a small a (like a = .05 or a = .01)?

The relationship between a and b Suppose this normal curve represents the sampling distribution for p when the null hypothesis is true. If the null hypothesis is false and the alternative hypothesis is true, then the true proportion is believed to be greater than .5 – so the curve should really be shifted to the right. Let’s consider the following hypotheses: H0: p = .5 Ha: p > .5 Let a = .05 This tail would represent b, the probability of failing to reject a false H0. This is the part of the curve that represents a or the Type I error. .5

The relationship between a and b If the null hypothesis is false and the alternative hypothesis is true, then the true proportion is believed to be greater than .5 – so the curve should really be shifted to the right. Let’s consider the following hypotheses: H0: p = .5 Ha: p > .5 Let a = .01 This tail would represent b, the probability of failing to reject a false H0. Notice that as a gets smaller, b gets larger!

How does one decide what a level to use? After assessing the consequences of type I and type II errors, identify the largest a that is tolerable for the problem. Then employ a test procedure that uses this maximum acceptable value –rather than anything smaller – as the level of significance. Remember, using a smaller a increases b.

Which type of error has a more serious consequence? The EPA has adopted what is known as the Lead and Copper Rule, which defines drinking water as unsafe if the concentration of lead is 15 parts per billion (ppb) or greater or if the concentration of copper is 1.3 ppb or greater. The manager of a community water system might use lead level measurements from a sample of water specimens to test the following hypotheses: H0: m = 15 versus Ha: m < 15 Which type of error has a more serious consequence? State a Type I error in context. State a Type II error in context. Since most people would consider the consequence of the Type I error more serious, we would want to keep a small – so select a smaller significance level of a = .01. What is a consequence of a Type I? What is a consequence of a Type II? A Type I error leads to the conclusion that a water source meets EPA standards when the water is really unsafe. There are possible health risks to the community A Type II error leads to the conclusion that a water source does NOT meet EPA standards when the water is really safe. The community might lose a good water source.

Large-Sample Hypothesis Test for a Population Proportion The fundamental idea behind hypothesis testing is: We reject H0 if the observed sample is very unlikely to occur if H0 is true.

Recall the General Properties for Sampling Distributions of p These three properties imply that the standardized variable has an approximately standard normal distribution when n is large. 1. 2. As long as the sample size is less than 10% of the population 3. When n is large, the sampling distribution of p is approximately normal.

In June 2006, an Associated Press survey was conducted to investigate how people use the nutritional information provided on food packages. Interviews were conducted with 1003 randomly selected adult Americans, and each participant was asked a series of questions, including the following two: Based on these data, is it reasonable to conclude that a majority of adult Americans frequently check nutritional labels when purchasing packaged foods? Question 1: When purchasing packaged food, how often do you check the nutritional labeling on the package? Question 2: How often do you purchase food that is bad for you, even after you’ve checked the nutrition labels? It was reported that 582 responded “frequently” to the question about checking labels and 441 responded “very often” or “somewhat often” to the question about purchasing bad foods even after checking the labels.

Nutritional Labels Continued . . . H0: p = .5 Ha: p > .5 p = true proportion of adult Americans who frequently check nutritional labels For this sample: We will create a test statistic using: We use p > .5 to test for a majority of adult Americans who frequently check nutritional labels. A test statistic indicates how many standard deviations the sample statistic (p) is from the population characteristic (p). This observed sample proportion is greater than .5. Is it plausible a sample proportion of p = .58 occurred as a result of chance variation, or is it unusual to observe a sample proportion this large when p = .5?

Next we find the P-value for this test statistic. Nutritional Labels Continued . . . H0: p = .5 Ha: p > .5 p = true proportion of adult Americans who frequently check nutritional labels For this sample: In the standard normal curve, seeing a value of 5.08 or larger is unlikely. It’s probability is approximately 0. Next we find the P-value for this test statistic. The P-value is the probability of obtaining a test statistic at least as inconsistent with H0 as was observed, assuming H0 is true. Since the P-value is so small, we reject H0. There is convincing evidence to suggest that the majority of adult Americans frequently check the nutritional labels on packaged foods. P-value ≈ 0

Computing P-values The calculation of the P-value depends on the form of the inequality in the alternative hypothesis. Ha: p > hypothesize value z curve P-value = area in upper tail Calculated z

Computing P-values The calculation of the P-value depends on the form of the inequality in the alternative hypothesis. Ha: p < hypothesize value z curve P-value = area in lower tail Calculated z

Computing P-values The calculation of the P-value depends on the form of the inequality in the alternative hypothesis. Ha: p ≠ hypothesize value P-value = sum of area in two tails z curve Calculated z and –z

Using P-values to make a decision: To decide whether or not to reject H0, we compare the P-value to the significance level a If the P-value > a, we “fail to reject” the null hypothesis. If the P-value < a, we “reject” the null hypothesis.

Summary of the Large-Sample z Test for p Null hypothesis: H0: p = hypothesized value Test Statistic: Alternative Hypothesis: P-value: Ha: p > hypothesized value Area to the right of calculated z Ha: p < hypothesized value Area to the left of calculated z Ha: p ≠ hypothesized value 2(Area to the right of z) of +z or 2(Area to the left of z) of -z

Summary of the Large-Sample z Test for p Continued . . . Assumptions: p is a sample proportion from a random sample The sample size n is large. (np > 10 and n(1 - p) > 10) If sampling is without replacement, the sample size is no more than 10% of the population size

A report states that nationwide, 61% of high school graduates go on to attend a two-year or four-year college the year after graduation. Suppose a random sample of 1500 high school graduates in 2009 from a particular state estimated the proportion of high school graduates that attend college the year after graduation to be 58%. Can we reasonably conclude that the proportion of this state’s high school graduates in 2009 who attended college the year after graduation is different from the national figure? Use a = .01. H0: p = .61 Ha: p ≠ .61 Where p is the proportion of all 2009 high school graduates in this state who attended college the year after graduation State the hypotheses.

College Attendance Continued . . . H0: p = .61 Ha: p ≠ .61 Where p is the proportion of all 2009 high school graduates in this state who attended college the year after graduation Assumptions: Given a random sample of 1500 high school graduates Since 1500(.61) > 10 and 1500(.39) > 10, sample size is large enough. Population size is much larger than the sample size.

The area to the left of -2.38 is approximately .0087 College Attendance Continued . . . H0: p = .61 Ha: p ≠ .61 Where p is the proportion of all 2009 high school graduates in this state who attended college the year after graduation Test statistic: P-value = 2(.0087) = .0174 Since P-value > a, we fail to reject H0. The evidence does not suggest that the proportion of 2009 high school graduates in this state who attended college the year after graduation differs from the national value. What potential error could you have made? Type II The area to the left of -2.38 is approximately .0087 Use a = .01

In December 2009, a county-wide water conservation campaign was conducted in a particular county. In 2010, a random sample of 500 homes was selected and water usage was recorded for each home in the sample. Suppose the sample results were that 220 households had reduced water consumption. The county supervisors wanted to know if their data supported the claim that fewer than half the households in the county reduced water consumption. H0: p = .5 Ha: p < .5 State the hypotheses. where p is the proportion of all households in the county with reduced water usage Calculate p.

Water Usage Continued . . . H0: p = .5 Ha: p < .5 where p is the proportion of all households in the county with reduced water usage Verify assumptions p is from a random sample of households 2. Sample size n is large because np = 250 >10 and n(1-p) = 250 > 10 3. It is reasonable that there are more than 5000 (10n) households in the county.

Calculate the test statistic and P-value Water Usage Continued . . . H0: p = .5 Ha: p < .5 where p is the proportion of all households in the county with reduced water usage Calculate the test statistic and P-value Look this value up in the table of z curve areas What potential error could you have made? Type I P-value = .0037 Since P-value < a, we reject H0. There is convincing evidence that the proportion of households with reduced water usage is less than half. Use a = .01

Since P-value < a, we reject H0. Used a = .01 Water Usage Continued . . . H0: p = .5 Ha: p < .5 where p is the proportion of all households in the county with reduced water usage Since P-value < a, we reject H0. Used a = .01 Confidence intervals are two-tailed, so we need to put .01 in the upper tail (since the curve is symmetrical). With .01 in each tail, that puts .98 in the middle – this is the appropriate confidence level Compute a 98% confidence interval: Notice that the hypothesized value of .5 is NOT in the 98% confidence interval and that we “rejected” H0! Let’s create a confidence interval with this data. What is the appropriate confidence level to use? .5 Since we are testing Ha: p < .05, a would also be in the lower tail. .98

Let’s compute a confidence interval for this problem. College Attendance Revisited . . . H0: p = .61 Ha: p ≠ .61 Where p is the proportion of all 2009 high school graduates in this state who attended college the year after graduation Since P-value > a, we fail to reject H0. Use a = .01 Let’s compute a confidence interval for this problem. This is a two-tailed test so a gets split evenly into both tails, leaving 99% in the middle. Notice that the hypothesized value of .5 IS in the 98% confidence interval and that we “failed to reject” H0! .99

Hypothesis Tests for a Population Mean

Let’s review the assumptions for a confidence interval for a population mean The assumptions are the same for a large-sample hypothesis test for a population mean. 1) x is the sample mean from a random sample, 2) the sample size n is large (n > 30), and 3) s, the population standard deviation, is known or unknown This is the test statistic when s is unknown. This is the test statistic when s is known. P-value is area under the z curve P-value is area under the t curve with df=n-1

The One-Sample t-test for a Population Mean Null hypothesis: H0: m = hypothesized value Test Statistic: Alternative Hypothesis: P-value: Ha: m > hypothesized value Area to the right of calculated t with df = n-1 Ha: m < hypothesized value Area to the left of calculated t with df = n-1 Ha: m ≠ hypothesized value 2(Area to the right of t) of +t or 2(Area to the left of t) of -t

The One-Sample t-test for a Population Mean Continued . . . Assumptions: x and s are the sample mean and sample standard deviation from a random sample The sample size n is large (n > 30) or the population distribution is at least approximately normal.

What is the mean and standard deviation of the sample? A study conducted by researchers at Pennsylvania State University investigated whether time perception, an indication of a person’s ability to concentrate, is impaired during nicotine withdrawal. After a 24-hour smoking abstinence, 20 smokers were asked to estimate how much time had elapsed during a 45-second period. Researchers wanted to see whether smoking abstinence had a negative impact on time perception, causing elapsed time to be overestimated. Suppose the resulting data on perceived elapsed time (in seconds) were as follows: 69 65 72 73 59 55 39 52 67 57 56 50 70 47 45 64 53 What is the mean and standard deviation of the sample? x = 59.30 s = 9.84 n = 20

Smoking Abstinence Continued . . . H0: m = 45 Ha: m > 45 Assumptions: It is reasonable to believe that the sample of smokers is representative of all smokers. 69 65 72 73 59 55 39 52 67 57 56 50 70 47 45 64 53 x = 59.30 s = 9.84 n = 20 Where m is the true mean perceived elapsed time for smokers who have abstained from smoking for 24-hours State the hypotheses. Since the boxplot is approximately symmetrical, it is plausible that the population distribution is approximately normal. Verify assumptions. To do this, we need to graph the data using a boxplot or normal probability plot 2) Since the sample size is not at least 30, we must determine if it is plausible that the population distribution is approximately normal. 50 40 60 70

Compute the test statistic and P-value. Smoking Abstinence Continued . . . H0: m = 45 Ha: m > 45 Test statistic: P-value ≈ 0 a = .05 Since P-value < a, we reject H0. There is convincing evidence that the mean perceived elapsed time is greater than the actual elapsed time of 45 seconds. 69 65 72 73 59 55 39 52 67 57 56 50 70 47 45 64 53 x = 59.30 s = 9.84 n = 20 Where m is the true mean perceived elapsed time for smokers who have abstained from smoking for 24-hours Compute the test statistic and P-value.

Compute the appropriate confidence interval. Smoking Abstinence Continued . . . H0: m = 45 Ha: m > 45 Since P-value < a, we reject H0. a = .05 69 65 72 73 59 55 39 52 67 57 56 50 70 47 45 64 53 x = 59.30 s = 9.84 n = 20 Compute the appropriate confidence interval. Where m is the true mean perceived elapsed time for smokers who have abstained from smoking for 24-hours Notice that the hypothesized value of 45 is NOT in the 90% confidence interval and that we “rejected” H0! Since this is a one-tailed test, a goes in the upper tail. .05 goes in the lower tail, leaving .90 in the middle.

What is the mean and standard deviation of the sample? A growing concern of employers is time spent in activities like surfing the Internet and emailing friends during work hours. The San Luis Obispo Tribune summarized the findings of a large survey of workers in an article that ran under the headline “Who Goofs Off More than 2 Hours a Day? Most Workers, Survey Says” (August 3, 2006). Suppose that the CEO of a large company wants to determine whether the average amount of wasted time during an 8-hour day for employees of her company is less than the reported 120 minutes. Each person in a random sample of 10 employees was contrasted and asked about daily wasted time at work. The resulting data are the following: 108 112 117 130 111 131 113 105 128 What is the mean and standard deviation of the sample? x = 116.80 s = 9.45 n = 10

Verify the assumptions. Surfing Internet Continued . . . 108 112 117 130 111 131 113 105 128 x = 116.80 s = 9.45 n = 10 H0: m = 120 Ha: m < 120 Assumptions: 1) The given sample was a random sample of employees Where m is the true mean daily wasted time for employees of this company The boxplot reveals some skewness, but there is no outliers. It is plausible that the population distribution is approximately normal. State the hypotheses. Verify the assumptions. 2) Since the sample size is not at least 30, we must determine if it is plausible that the population distribution is approximately normal. 110 120 130

Compute the test statistic and P-value. Surfing Internet Continued . . . 108 112 117 130 111 131 113 105 128 x = 116.80 s = 9.45 n = 10 H0: m = 120 Ha: m < 120 Test Statistic: P-value =.150 a = .05 Since p-value > a, we fail to reject H0. There is not sufficient evidence to conclude that the mean daily wasted time for employees of this company is less than 120 minutes. Where m is the true mean daily wasted time for employees of this company Compute the test statistic and P-value. What potential error could we have made? Type II

Power and the Probability of Type II Error

Recall Type I and Type II Errors: Suppose H0 is true and we fail to reject it, what type of decision was made? Suppose H0 is false and we fail to reject it, what type of decision was made? Suppose H0 is true and we reject it, what type of decision was made? Suppose H0 is false and we reject it, what type of decision was made? H0 is true H0 is false Reject H0 Fail to reject H0 Type I error Correct Power Type II error Correct The probability that we correctly reject H0 is called the power of the test.

Suppose that the student body president at a university is interested in studying the amount of money that students spend on textbooks each semester. The director of financial aid services believes that average amount spent on textbooks is $500 each semester, and uses this to determine the amount of financial aid for which a student is eligible. The student body president plans to ask each student in a random sample how much he or she spent on books this semester and use the data to test (using a = .05) the following hypotheses: H0: m = 500 versus Ha: m > 500 The power of a test depends on the true value of the mean! Because the actual value of m is unknown, we cannot know the power for the actual value of m. BUT, we can gain insight to the power of m by investigating some what if scenarios . . . However, if the true mean was $525, it is less likely that the sample would be mistaken for a sample from the population if the mean were $500. So, it is more likely that we will correctly reject H0. If the true mean is greater than $500, then we should reject H0. BUT, if the true mean is ONLY a little greater, say $505, then the sample mean might look like we expect if the true mean were $500. Thus we wouldn’t have convincing evidence to reject H0.

If the null hypothesis is false, then m > hypothesized value Let’s consider a one-sided, upper tail test. Fail to Reject H0 Reject H0 ma Power = 1 - b b a m0 If the null hypothesis is false, then m > hypothesized value

This is the z value with .95 area to its left. Textbooks Continued . . . H0: m = 500 Ha: m > 500 What is the probability of committing a Type I error? If m = 500 is true, for what values of the sample mean would you reject the null hypothesis? Suppose that s = $85 and n = 100. (Since n is large, the sampling distribution of x is approximately normal.) a = .05 This is the z value with .95 area to its left. Rejection Region Use: What is the value of this x? We would reject H0 for x > 513.98. .95

What is the probability of a Type II error (b)? Textbooks Continued . . . H0: m = 500 Ha: m > 500 What is the probability of a Type II error (b)? Suppose that s = $85 and n = 100. We would reject H0 for x > 513.98. This area (to the left of x = 513.98) is b. Rejection Region b = .239 Look this up in the table of areas for z curves

What is the power of the test if m = 520? Power = 1 - .239 = .761 Textbooks Continued . . . H0: m = 500 Ha: m > 500 What is the power of the test if m = 520? Power = 1 - .239 = .761 Suppose that s = $85 and n = 100. We would reject H0 for x > 513.98. Power is the probability of correctly rejecting H0. Notice that power is in the SAME curve as b Power = 1 – b b = .239 Rejection Region

If we reject H0, then m > 500. Textbooks Continued . . . H0: m = 500 Ha: m > 500 Find b and power. b = .03 power = .97 Suppose that s = $85 and n = 100. We would reject H0 for x > 513.98. If we reject H0, then m > 500. What if m = 530? Notice that, as the distance between the null hypothesized value for m and our alternative value for m increases, b decreases AND power increases. Rejection Region

If the null hypothesis is false, then m > 500. Textbooks Continued . . . H0: m = 500 Ha: m > 500 Find b and power. b = .68 power = .32 Suppose that s = $85 and n = 100. We would reject H0 for x > 513.98. If the null hypothesis is false, then m > 500. What if m = 510? Notice that, as the distance between the null hypothesized value for m and our alternative value for m decreases, b increases AND power decreases. Rejection Region

b will increase and power will decrease. Textbooks Continued . . . H0: m = 500 Ha: m > 500 What happens if we use a = .01? Suppose that s = $85 and n = 100. b will increase and power will decrease. Rejection Region Rejection Region Power Power b b

What happens to a, b, & power when the sample size is increased? The standard deviation will decrease making the curve taller and skinnier. b decreases and power increases! The significance level (a) remains the same – so the value where the rejection region begins must move. Fail to Reject H0 Reject H0 a m0 b Power ma

Effects of Various Factors on the Power of a Test The larger the size of the discrepancy between the hypothesized value and the actual value of the population characteristic, the higher the power. The larger the significance level a, the higher the power of a test. The larger the sample size, the higher the power of a test