Hypothesis Testing A hypothesis is a conjecture about a population.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Chapter 7 Hypothesis Testing
10- 1 Chapter Ten McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Introduction to Hypothesis Testing
Tests of Hypotheses: Large Samples Chapter Rejection region Acceptance
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Introduction to Hypothesis Testing
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Statistics for Managers Using Microsoft® Excel 5th Edition
Type II Error, Power and Sample Size Calculations
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.
Hypothesis Testing A hypothesis is a conjecture about a population. Typically, these hypotheses will be stated in terms of a parameter such as  (mean)
Chapter 10 Hypothesis Testing
Introduction to Hypothesis Testing
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
8 - 1 © 1998 Prentice-Hall, Inc. Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Chapter 10 Hypothesis Testing
Chapter 4 Introduction to Hypothesis Testing Introduction to Hypothesis Testing.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Statistical Methods 5.
Statistical Inference
HYPOTHESIS TESTING. Statistical Methods Estimation Hypothesis Testing Inferential Statistics Descriptive Statistics Statistical Methods.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 11 Introduction to Hypothesis Testing Sir Naseer Shahzada.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Chapter 9: Hypothesis Tests Based on a Single Sample 1.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Introduction to Inference Tests of Significance Proof
Chapter Nine Hypothesis Testing.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 9 -Hypothesis Testing
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Inference
Keller: Stats for Mgmt & Econ, 7th Ed
Business Statistics Topic 7
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Hypothesis Testing: Hypotheses
Overview and Basics of Hypothesis Testing
Chapter 9 Hypothesis Testing.
Chapter 11: Introduction to Hypothesis Testing Lecture 5b
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Daniela Stan Raicu School of CTI, DePaul University
Chapter 11: Introduction to Hypothesis Testing Lecture 5a
Statistics for Business and Economics
Chapter 9 Hypothesis Testing.
Chapter 9: Significance Testing
Chapter 9 Hypothesis Testing: Single Population
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Hypothesis Testing A hypothesis is a conjecture about a population. Typically, these hypotheses will be stated in terms of a parameter. A test of hypothesis is a statistical procedure used to make a decision about the conjectured value of a parameter. We will make our decision based on observed values of a statistic.

The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief about a parameter. Examples Is there statistical evidence in a random sample of potential customers, that support the hypothesis that more than 10% of the potential customers will purchase a new products? Is a new drug effective in curing a certain disease? A sample of patients is randomly selected. Half of them are given the drug while the other half are given a placebo. The improvement in the patients conditions is then measured and compared.

The Hypotheses There are two hypotheses which we are comparing. The null hypothesis, H0, specifies a value of a parameter. This hypothesis is assumed to be true, and the collected data will be analyzed to see if it is contradictory to the null hypothesis. The alternative hypothesis, Ha, gives an opposing statement about the value of the parameter. The collected data will be analyzed to see if it supports the alternative hypothesis.

Examples: Give the null hypothesis and the alternative hypothesis You are investigating a complaint that “special delivery mail takes too much time” to be delivered You want to show that people find the new design for a recliner chair more comfortable than the old design. You are trying to show that cigarette smoke has an effect on the quality of a person’s life

Examples: Give the null hypothesis and the alternative hypothesis The mean age of the students enrolled in evening classes at a certain college is greater than 26 years. The mean weight of packages shipped on Air Express during the past month was less than 36.7 lb. The mean life of fluorescent light bulbs is at least 1600 hours. The mean grade of a student in a statistics class is not 80.

This is what you want to prove Concepts of Hypothesis Testing The critical concepts of hypothesis testing. Example: An operation manager needs to determine if the mean demand during lead time is greater than 350. If so, changes in the ordering policy are needed. There are two hypotheses about a population mean: H0: The null hypothesis  = 350 Ha: The alternative hypothesis  > 350 This is what you want to prove

Assume the null hypothesis is true (= 350). = 350 Sample from the demand population, and build a statistic related to the parameter hypothesized (the sample mean). Pose the question: How probable is it to obtain a sample mean at least as extreme as the one observed from the sample, if H0 is correct?

Assume the null hypothesis is true (= 350). Since the is much larger than 350, the mean  is likely to be greater than 350. Reject the null hypothesis. = 350 In this case the mean  is not likely to be greater than 350. Do not reject the null hypothesis.

Statistical Terminology Once the data is collected, we seek an answer to the question: “If the null hypothesis is true, how likely are we to observe this type of data, or data which is more extreme in the direction of the alternative hypothesis?” The observed significance level, or p-value of a test of hypothesis is the probability of obtaining the observed value of the sample statistic, or one which is even more supportive of the alternative hypothesis, under the assumption that the null hypothesis is true.

Statistical Terminology For example, let’s assume that we are testing the hypothesis that the mean gas mileage for a certain type of car is 24 miles per gallon against an alternative that the mean is more than 24 mpg. H0:  = 24 mpg Ha:  > 24 mpg If a sample of 50 cars have a sample mean of 24.2 mpg, this gives a certain p-value. If the sample of 50 cars had a sample mean of 25.0 mpg, the p-value is even smaller, since this would be more unusual to see if H0 is true ( = 24 mpg).

Test Statistics and P-values Obviously, we must be able to calculate these p-values. A test statistic is a quantity calculated from the sampled value of a statistic, which is then used to calculate a p-value for a test of hypothesis. We recall that under appropriate conditions, we know the sampling distribution of the sample mean follows a normal distribution, and we can find a z-score using

Decision Rule Based on the z-score calculated, we can determine how likely we are to get a sample mean (x-bar) like we observed or one more supportive of Ha. Decision Rule: This tells us when we feel the observed data provided sufficient evident to conclude the alternative hypothesis is true. It will be phrased as: Accept the alternative hypothesis when the p-value of the test is less than . The value of  will be given in each exercise. It does NOT depend on the sample data. (More details later)

Testing the Population Mean When the Population Standard Deviation is Known Example A new billing system for a department store will be cost- effective only if the mean monthly account is more than $170. A sample of 400 accounts has a mean of $178. If accounts are approximately normally distributed with  = $65, can we conclude that the new system will be cost effective?

Testing the Population Mean ( is Known) – Solution The population of interest is the credit accounts at the store. We want to know whether the mean account for all customers is greater than $170. Ha :  > 170 The null hypothesis must specify a single value of the parameter  H0 :  = 170

P-value Method The p-value provides information about the amount of statistical evidence that supports the alternative hypothesis. The p-value of a test is the probability of observing a test statistic at least as extreme as the one computed, given that the null hypothesis is true. Let us demonstrate the concept on this example.

P-value Method The probability of observing a test statistic at least as extreme as 178, given that  = 170 is… The p-value

Interpreting the p-value Because the probability that the sample mean will assume a value of more than 178 when  = 170 is so small (.0069), there are reasons to believe that  > 170. Note how the event is rare under H0 when but... …it becomes more probable under H1, when

Interpreting the p-value We can conclude that the smaller the p-value the more statistical evidence exists to support the alternative hypothesis.

Interpreting the p-value Describing the p-value If the p-value is less than 1%, there is overwhelming evidence that supports the alternative hypothesis. If the p-value is between 1% and 5%, there is a strong evidence that supports the alternative hypothesis. If the p-value is between 5% and 10% there is a weak evidence that supports the alternative hypothesis. If the p-value exceeds 10%, there is no evidence that supports the alternative hypothesis.

The p-value Method  = 0.05 The p-value The p-value can be used when making decisions based on rejection region methods as follows: Define the hypotheses to test, and the required significance level  Perform the sampling procedure, calculate the test statistic and the p-value associated with it. Compare the p-value to Reject the null hypothesis only if p- value <; otherwise, do not reject the null hypothesis.  = 0.05 The p-value

Conclusions of a Test of Hypothesis If we accept the alternative hypothesis, we conclude that there is enough evidence to infer that the alternative hypothesis is true. If we do not accept the alternative hypothesis, we conclude that there is not enough statistical evidence to infer that the alternative hypothesis is true. The alternative hypothesis is the more important one. It represents what we are investigating.

Another Example Historically, the mean GPA of incoming freshmen at NKU has been 2.32 at the end of their first year. With recent changes, the administration believes the mean GPA has increased. How would this be tested using a .05 level of significance? H0: The mean GPA is still 2.32.  = 2.32 Ha: The mean GPA is above 2.32.  > 2.32 The decision rule is to accept Ha if the p-value is less than .05. The test statistic is

Performing the Test of Hypothesis In order to perform an actual test of hypothesis, we must identify several things, ensure the procedure is valid, and draw conclusions without stopping at calculations. For chapter 7, we are focusing on testing hypotheses which deal with the population mean  under the three assumptions: The population standard deviation  is known A random sample is obtained from the population The sample size is “large”. (Sampling Distn of )

Steps of the Test of Hypothesis for  with  Identify the null and alternative hypotheses Give the decision rule Identify the form of the test statistic and calculate it based on the sampled data. Verify the method is valid (n is at least 30 for now) Calculate the p-value based on the z-score and Ha Give an interpretation.

Example from Page 227 #6 H0:  = $122,000 Ha:  < $122,000 Decision Rule: Accept Ha if the p-value < .05. This is valid since n = 40 is at least 30. P-value = probability a z-score is “less than” 2.19 = 0.5 + 0.486 = 0.986 At the .05 level of significance, there is insufficient evidence to conclude the mean price for all new homes in Florida is below $122,000.

Possible Errors Realize, that a small p-value (or observed level of significance) suggests that the alternative hypothesis is true, but does not guarantee it is true. A Type I Error consists of concluding that the alternative hypothesis is true when, in fact, the null hypothesis is true. A Type II Error consists of concluding that the null hypothesis is true when, in fact, the alternative hypothesis is true.

Type I or Type II Errors We can examine each error and attempt to control how willing we are to commit each. The probability of committing a Type I Error is given by (alpha), the level of significance of the test. The probability of committing a Type II Error is given by (beta). We will decide the seriousness of each error, and then determine relative values for  and .

Type I or Type II Errors: #3 on page 234 H0: Lab is not contaminated with mercury vapor. Ha: Lab is contaminated with mercury vapor. A Type I Error occurs if we conclude the lab is contaminated when, in fact, the lab is not contaminated with mercury vapor. Consequences: The office would probably be closed, and procedures to reduce the mercury vapor levels would be implemented. This means costs for decontamination and lack of income from patients, but no one is endangered.

Type I or Type II Errors: #3 on page 234 H0: Lab is not contaminated with mercury vapor. Ha: Lab is contaminated with mercury vapor. A Type II Error occurs if we conclude the lab is NOT contaminated when, in fact, the lab is contaminated with mercury vapor. Consequences: The office would be open and new patients and workers could be exposed to high levels of mercury vapor. This means possible law suits, people getting sick, dangerous work environment. It could lead to a bad reputation for this dental office.

Type I or Type II Errors: #3 on page 234 H0: Lab is not contaminated with mercury vapor. Ha: Lab is contaminated with mercury vapor. Which error is worse? Probably Type II in my opinion. Hence, I want to avoid this, so I set the probability low. The Type I Error means a loss of income, so it’s not good for business, but it’s not as serious. I’d set  > . I’d like  small at .01, and  maybe at .05.

Type I or Type II Errors Be able to define a Type I Error and a Type II error in terms of a scenario. Be able to discuss the consequences of each type of error. Set values for  and  depending on those consequences. There are no rules on one error being worse than the other in all situations.