CHAPTER 17: Tests of Significance: The Basics

Slides:



Advertisements
Similar presentations
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Advertisements

Business Statistics for Managerial Decision
Chapter 9 Hypothesis Testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Chapter 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of Significance: The Basics.
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
Significance Tests: THE BASICS Could it happen by chance alone?
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
AP STATISTICS LESSON 10 – 2 DAY 1 TEST OF SIGNIFICANCE.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.1 Significance Tests: The Basics.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
CHAPTER 9 Testing a Claim
Significance Test A claim is made. Is the claim true? Is the claim false?
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Statistical Significance The power of ALPHA. “ Significant ” in the statistical sense does not mean “ important. ” It means simply “ not likely to happen.
AP STATISTICS LESSON 10 – 2 DAY 2 MORE DETAIL: STATING HYPOTHESES.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Chapter 9: Hypothesis Tests Based on a Single Sample 1.
AP Statistics Section 11.1 B More on Significance Tests.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Chapter 9 Day 2 Tests About a Population Proportion.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Tests of Significance: Stating Hypothesis; Testing Population Mean.
+ Chapter 9 Testing a Claim 9.1Significance Tests: The Basics 9.2Tests about a Population Proportion 9.3Tests about a Population Mean.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Section 9.1 First Day The idea of a significance test What is a p-value?
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.2 Tests About a Population Proportion.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
+ Chapter 9 Testing a Claim 9.1Significance Tests: The Basics 9.2Tests about a Population Proportion 9.3Tests about a Population Mean.
+ Testing a Claim Significance Tests: The Basics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
What Is a Test of Significance?
Chapter 9: Testing a Claim
Unit 5: Hypothesis Testing
CHAPTER 9 Testing a Claim
Warm Up Check your understanding p. 541
CHAPTER 9 Testing a Claim
CHAPTER 17: Tests of Significance: The Basics
CHAPTER 9 Testing a Claim
Chapter 9: Hypothesis Tests Based on a Single Sample
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Significance Tests: The Basics
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Significance Testing
CHAPTER 9 Testing a Claim
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
CHAPTER 9 Testing a Claim
Presentation transcript:

CHAPTER 17: Tests of Significance: The Basics Basic Practice of Statistics - 3rd Edition CHAPTER 17: Tests of Significance: The Basics Basic Practice of Statistics 7th Edition Lecture PowerPoint Slides Chapter 5

In chapter 17, we cover … The reasoning of tests of significance Stating hypotheses P-value and statistical significance Tests for a population mean Significance from a table* Resampling: Significance from a simulation* *

Statistical inference Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population parameter. The second common type of inference, called tests of significance, has a different goal: to assess the evidence provided by data about some claim concerning a population. A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to assess. Significance tests use an elaborate vocabulary, but the basic idea is simple: an outcome that would rarely happen if a claim were true is good evidence that the claim is not true.

The reasoning of tests of significance Artificial sweeteners in colas gradually lose their sweetness over time. Manufacturers test for loss of sweetness on a scale of -10 to 10, with negative scores corresponding to a gain in sweetness, positive to loss of sweetness. Suppose we know that for any cola, the sweetness loss scores vary from taster to taster according to a Normal distribution with standard deviation 𝜎= 1. The mean 𝜇 for all tasters measures loss of sweetness and is different for different colas. Here are the sweetness losses for a cola currently on the market, as measured by 10 trained tasters: 2.0 0.4 0.7 2.0 -0.4 2.2 -1.3 1.2 1.1 2.3 The average sweetness loss is given by the sample mean 𝑥 = 1.02. Most scores were positive. That is, most tasters found a loss of sweetness. But the losses are small, and two tasters (the negative scores) thought the cola gained sweetness. Are these data good evidence that the cola lost sweetness in storage?

The reasoning of tests of significance We make a claim and ask if the data give evidence against it. We seek evidence that there is a sweetness loss, so the claim we test is that there is not a loss. In that case, the mean loss for the population of all trained testers would be 𝜇 = 0. If the claim that 𝜇 = 0 is true, the sampling distribution of 𝑥 from 10 tasters is Normal with mean 𝜇 = 0 and standard deviation 𝜎 𝑛 = 2 10 =0.316 This is like calculations we did in Chapter 15—we can locate our 𝑥 of 1.02 in this distribution and comment on whether it is surprising. Figure 17.01 would be useful here!

Stating hypotheses A significance test starts with a careful statement of the claims we want to compare. The claim tested by a statistical test is called the null hypothesis (H0). The test is designed to assess the strength of the evidence against the null hypothesis. Often the null hypothesis is a statement of “no difference.” The claim about the population that we are trying to find evidence for is the alternative hypothesis (Ha). The alternative is one-sided if it states that a parameter is larger or smaller than the null hypothesis value. It is two-sided if it states that the parameter is different from the null value (it could be either smaller or larger). In the sweetness example, our hypotheses are H0 : 𝜇 = 0 Ha : 𝜇 > 0 The alternative hypothesis is one-sided because we are interested only in whether the cola lost sweetness.

Example Does the job satisfaction of assembly workers differ when their work is machine-paced rather than self-paced? Assign workers either to an assembly line moving at a fixed pace or to a self-paced setting. All subjects work in both settings, in random order. This is a matched pairs design. After two weeks in each work setting, the workers take a test of job satisfaction. The response variable is the difference in satisfaction scores, self-paced minus machine-paced. The parameter of interest is the mean 𝜇 of the differences in scores in the population of all assembly workers. The null hypothesis says that there is no difference between self-paced and machine-paced work, that is, 𝐻 0 : 𝜇 = 0 The authors of the study wanted to know if the two work conditions have different levels of job satisfaction. They did not specify the direction of the difference. The alternative hypothesis is therefore two-sided: 𝐻𝑎 : 𝜇≠0

P-value and statistical significance The null hypothesis H0 states the claim that we are seeking evidence against. The probability that measures the strength of the evidence against a null hypothesis is called a P-value. A test statistic calculated from the sample data measures how far the data diverge from what we would expect if the null hypothesis H0 were true. Large values of the statistic show that the data are not consistent with H0. The probability, computed assuming H0 is true, that the statistic would take a value as extreme as or more extreme than the one actually observed is called the P-value of the test. The smaller the P-value, the stronger the evidence against H0 provided by the data. Small P-values are evidence against H0 because they say that the observed result is unlikely to occur when H0 is true. Large P-values fail to give convincing evidence against H0 because they say that the observed result could have occurred by chance if H0 were true.

P-value and statistical significance Tests of significance assess the evidence against H0. If the evidence is strong, we can confidently reject H0 in favor of the alternative. Our conclusion in a significance test comes down to: P-value small → reject H0 → conclude Ha (in context) P-value large → fail to reject H0 → cannot conclude Ha (in context) There is no rule for how small a P-value we should require in order to reject H0 — it’s a matter of judgment and depends on the specific circumstances. But we can compare the P-value with a fixed value that we regard as decisive, called the significance level. We write it as α, the Greek letter alpha. When our P-value is less than the chosen α, we say that the result is statistically significant. If the P-value is smaller than alpha, we say that the data are statistically significant at level α. The quantity α is called the significance level or the level of significance.

Tests of significance TESTS OF SIGNIFICANCE: THE FOUR-STEP PROCESS STATE: What is the practical question that requires a statistical test? PLAN: Identify the parameter, state null and alternative hypotheses, and choose the type of test that fits your situation. SOLVE: Carry out the test in three phases: 1. Check the conditions for the test you plan to use. 2. Calculate the test statistic. 3. Find the P-value. CONCLUDE: Return to the practical question to describe your results in this setting.

Tests for a population mean z TEST FOR A POPulaTION mean Draw an SRS of size 𝑛 from a Normal population that has unknown mean 𝜇 and known standard deviation 𝜎. To test the null hypothesis that 𝜇 has a specified value, H0: 𝜇 = 𝜇0 Compute the one-sample z test statistic. 𝑧= 𝑥 − 𝜇 0 𝜎 𝑛 In terms of a variable Z having the standard Normal distribution, the approximate P-value for a test of H0 against Ha : 𝜇 > 𝜇 0 is 𝑃 𝑍≥𝑧 Ha : 𝜇 < 𝜇 0 is 𝑃 𝑍≤𝑧 Ha : 𝜇 ≠ 𝜇 0 is 2×𝑃 𝑍≥ 𝑧

Example Executives’ blood pressures STATE: NCHS reports that the systolic blood pressure for males 35 to 44 years of age has mean 128 and standard deviation 15. The medical director of a company finds, for the 72 executives in this age group, that 𝑥 =126.07 Is this evidence that the company's executives have a different mean systolic blood pressure from the general population? PLAN: Take 𝜇 to be the mean among all executives. We want to test the hypotheses 𝐻 0 :𝜇=128 𝐻 𝑎 :𝜇≠128 The alternative is two-sided because the medical director did not have a particular direction in mind before examining the data.

Example, cont’d. Executives’ blood pressures, cont’d. SOLVE: As part of the “simple conditions,” suppose we are willing to assume that executives' systolic blood pressures follow a Normal distribution with standard deviation 𝜎=15. Software can now calculate 𝑧 and 𝑃 for you. Going ahead by hand, the test statistic is 𝑧= 𝑥 − 𝜇 0 𝜎 𝑛 = 126.07−128 15 72 =−1.09 Using Table A or software, we find that the P-value is 0.2758. CONCLUDE: More than 27% of the time, an SRS of size 72 from the general male population would have a mean systolic blood pressure at least as far from 128 as that of the executive sample. The observed 𝑥 =126.07 is therefore not good evidence that executives differ from other men.

Significance from a table* Statistics in practice uses technology to get P-values quickly and accurately. In the absence of suitable technology, you can get approximate P-values by comparing your test statistic with critical values from a table. SIGNIFICANCE FROM A TABLE OF CRITICAL VALUES To find the approximate P-value for any z statistic, compare z (ignoring its sign) with the critical values z* at the bottom of Table C. If z falls between two values of z*, the P-value falls between the two corresponding values of P in the “One-sided P” or the “Two-sided P” row of Table C.

Resampling: Significance from a simulation* We saw in Section 15.3 that we can approximate the sampling distribution of 𝑥 by taking a very large number of SRS's of size 𝑛 and constructing the histogram of the values of the sample means, 𝑥 . A corresponding method of taking a large number of repeated SRS's from the population distribution when the null hypothesis is true and using these to approximate P-values is sometimes referred to as resampling. All we need to know is the population distribution under the assumption that the null hypothesis is true. We then resample, using software, many times from this population distribution, compute the value of the sample statistic for each sample, and determine the proportion of times we obtained sample values as or more extreme that that of our actual data. This proportion is an estimate of the P-value.

Resampling: Significance from a simulation* Comments about resampling: First, we must resample in the same manner that we obtain our data. If our data are obtained by an SRS, we resample by taking repeated SRS's from the population distribution determined by the null hypothesis. Second, resampling only provided an estimate of a P-value. Repeat the resampling and you will obtain a different estimate. Accuracy of the estimate is improved by taking a larger number of samples to estimate the sampling distribution. Finally, resampling requires the use of software.