Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.

Slides:



Advertisements
Similar presentations
One sample T Interval Example: speeding 90% confidence interval n=23 Check conditions Model: t n-1 Confidence interval: 31.0±1.52 = (29.48, 32.52) STAT.
Advertisements

Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Chapter 9: Inferences for Two –Samples
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
PSY 307 – Statistics for the Behavioral Sciences
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Stat 301- Day 32 More on two-sample t- procedures.
Estimating Means with Confidence
Inferences About Process Quality
Copyright © 2010 Pearson Education, Inc. Chapter 24 Comparing Means.
Chapter 9 Hypothesis Testing.
Hypothesis Testing Using The One-Sample t-Test
Hypothesis Testing: Two Sample Test for Means and Proportions
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Overview Definition Hypothesis
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Chapter 19: Two-Sample Problems STAT Connecting Chapter 18 to our Current Knowledge of Statistics ▸ Remember that these formulas are only valid.
More About Significance Tests
Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Comparing Two Population Means
Two Sample Tests Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Significance Tests: THE BASICS Could it happen by chance alone?
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 24 Comparing Means.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
© Copyright McGraw-Hill 2000
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
STA 2023 Module 11 Inferences for Two Population Means.
AP Statistics Chapter 24 Comparing Means.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Inference for distributions: - Comparing two means.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-3 Inferences About Two Means:
When  is unknown  The sample standard deviation s provides an estimate of the population standard deviation .  Larger samples give more reliable estimates.
Hypothesis Tests for 1-Proportion Presentation 9.
Hypothesis Testing Involving One Population Chapter 11.4, 11.5, 11.2.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
More About Confidence Intervals
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Statistical Inference about Regression
Presentation transcript:

Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13

Copyright ©2011 Brooks/Cole, Cengage Learning 2 Hypothesis testing about: a population mean a population mean difference (paired data) the difference between means of two populations Three Cautions: 1. Inference is only valid if the sample is representative of the population for the question of interest. 2. Hypotheses and conclusions apply to the larger population(s) represented by the sample(s). 3. If the distribution of a quantitative variable is highly skewed, consider analyzing the median rather than the mean – called nonparametric methods (Topic 2 on CD).

Copyright ©2011 Brooks/Cole, Cengage Learning Introduction to Hypothesis Tests for Means Steps in Any Hypothesis Test 1.Determine the null and alternative hypotheses. 2.Verify necessary data conditions, and if met, summarize the data into an appropriate test statistic. 3.Assuming the null hypothesis is true, find the p-value. 4.Decide whether or not the result is statistically significant based on the p-value. 5.Report the conclusion in the context of the situation.

Copyright ©2011 Brooks/Cole, Cengage Learning HT Module 3: Testing Hypotheses about One Mean Step 1: Determine null and alternative hypotheses 1. H 0 :  =  0 versus H a :    0 (two-sided) 2. H 0 :  =  0 versus H a :  <  0 (one-sided) 3. H 0 :  =  0 versus H a :  >  0 (one-sided) Remember a p-value is computed assuming H 0 is true, and  0 is the value used for that computation.

Copyright ©2011 Brooks/Cole, Cengage Learning 5 Situation 1: Population of measurements of interest is approximately normal, and a random sample of any size is measured. In practice, use method if shape is not notably skewed or no extreme outliers. Situation 2: Population of measurements of interest is not approximately normal, but a large random sample (n  30) is measured. If extreme outliers or extreme skewness, better to have a larger sample. Step 2: Verify Necessary Data Conditions …

Copyright ©2011 Brooks/Cole, Cengage Learning 6 The t-statistic is a standardized score for measuring the difference between the sample mean and the null hypothesis value of the population mean: Continuing Step 2: The Test Statistic This t-statistic has (approx) a t-distribution with df = n - 1.

Copyright ©2011 Brooks/Cole, Cengage Learning 7 For H a less than, the p-value is the area below t, even if t is positive. For H a greater than, the p-value is the area above t, even if t is negative. For H a two-sided, p-value is 2  area above |t|. Step 3: Assuming H 0 true, Find the p-value

Copyright ©2011 Brooks/Cole, Cengage Learning 8 These two steps remain the same for all of the hypothesis tests considered in this book. Choose a level of significance , and reject H 0 if the p-value is less than (or equal to) . Otherwise, conclude that there is not enough evidence to support the alternative hypothesis. Steps 4 and 5: Decide Whether or Not the Result is Statistically Significant based on the p-value and Report the Conclusion in the Context of the Situation

Copyright ©2011 Brooks/Cole, Cengage Learning 9 Example 13.1 Normal Body Temperature What is normal body temperature? Is it actually less than 98.6 degrees Fahrenheit (on average)? Step 1: State the null and alternative hypotheses H 0 :  = 98.6 H a :  < 98.6 where  = mean body temperature in human population.

Copyright ©2011 Brooks/Cole, Cengage Learning 10 Example 13.1 Normal Body Temp (cont) Data: random sample of n = 16 normal body temps Step 2: Verify data conditions … 98.4, 98.6, 98.8, 98.8, 98.0, 97.9, 98.5, 97.6, 98.4, 98.3, 98.9, 98.1, 97.3, 97.8, 98.4, 97.4 Boxplot shows no outliers nor strong skewness. Sample mean of 98.2 is close to sample median of

Copyright ©2011 Brooks/Cole, Cengage Learning 11 Example 13.1 Normal Body Temp (cont) Step 2: … Summarizing data with a test statistic Key elements: Sample statistic: = (under “Mean”) Standard error: (under “SE Mean”) (under “T”)

Copyright ©2011 Brooks/Cole, Cengage Learning 12 Example 13.1 Normal Body Temp (cont) Step 3: Find the p-value From output: p-value = From Table A.3: p-value is less than

Copyright ©2011 Brooks/Cole, Cengage Learning 13 Example 13.1 Normal Body Temp (cont) Step 4: Decide whether or not the result is statistically significant based on the p-value Using  = 0.05 as the level of significance criterion, the results are statistically significant because 0.003, the p-value of the test, is less than In other words, we can reject the null hypothesis. Step 5: Report the Conclusion We can conclude, based on these data, that the mean temperature in the human population is actually less than 98.6 degrees.

Copyright ©2011 Brooks/Cole, Cengage Learning 14 Rejection Region Approach Replaces Steps 3 and 4 with: Substitute Step 3: Find the critical value and rejection region for the test. Substitute Step 4: If the test statistic is in the rejection region, conclude that the result is statistically significant and reject the null hypothesis. Otherwise, do not reject the null hypothesis. Note: Rejection region method and p-value method will always arrive at the same conclusion about statistical significance.

Copyright ©2011 Brooks/Cole, Cengage Learning 15 Rejection Region Approach Summary (use row of Table A.2 corresponding to df) For Example 13.1 Normal Body Temperature? Alternative was one-sided to the left, df = 15, and  = Critical value from table A.2 is –1.75. Rejection region is t  – The test statistic was –3.22 so the null hypothesis is rejected. Same conclusion is reached.

Copyright ©2011 Brooks/Cole, Cengage Learning HT Module 4: Testing Hypotheses about Mean of Paired Differences Data: two variables for n individuals or pairs; use the difference d = x 1 – x 2. Parameter:  d = population mean of differences Sample estimate: = sample mean of the differences Standard deviation and standard error: s d = standard deviation of the sample of differences; Often of interest: Is the mean difference in the population different from 0?

Copyright ©2011 Brooks/Cole, Cengage Learning 17 Steps for a Paired t-Test Step 1: Determine null and alternative hypotheses H 0 :  d =  versus H a :  d   or H a :  d  Watch how differences are defined for selecting the H a. Step 2: Verify data conditions and compute test statistic Conditions apply to the differences. The t-test statistic is: Steps 3, 4 and 5: Similar to t-test for a single mean. The df = n – 1, where n is the number of differences.

Copyright ©2011 Brooks/Cole, Cengage Learning 18 Example 13.2 Effect of Alcohol Study: n = 10 pilots perform simulation first under sober conditions and then after drinking alcohol. Response: Amount of useful performance time. (longer time is better) Question: Does useful performance time decrease with alcohol use? Step 1: State the null and alternative hypotheses H 0 :  d = 0 versus H a :  d > 0 where  d = population mean difference between alcohol and no alcohol measurements if all pilots took these tests.

Copyright ©2011 Brooks/Cole, Cengage Learning 19 Example 13.2 Effect of Alcohol (cont) Data: random sample of n = 10 time differences Step 2: Verify data conditions … Boxplot shows no outliers nor extreme skewness.

Copyright ©2011 Brooks/Cole, Cengage Learning 20 Example 13.2 Effect of Alcohol (cont) Step 2: … Summarizing data with a test statistic Test of mu = 0.0 vs mu > 0.0 Variable N Mean StDev SE Mean T P Diff Key elements: Sample statistic: = (under “Mean”) Standard error: (under “SE Mean”) (under “T”)

Copyright ©2011 Brooks/Cole, Cengage Learning 21 Example 13.2 Effect of Alcohol (cont) Step 3: Find the p-value From output: p-value = From Table A.3: p-value is between and The value t = 2.68 is between column headings 2.58 and 3.00 in the table, and for df =9, the one-sided p-values are and

Copyright ©2011 Brooks/Cole, Cengage Learning 22 Example 13.2 Effect of Alcohol (cont) Steps 4 and 5: Decide whether or not the result is statistically significant based on the p-value and Report the Conclusion Using  = 0.05 as the level of significance criterion, we can reject the null hypothesis since the p-value of is less than Even with a small experiment, it appears that alcohol has a statistically significant effect and decreases performance time.

Copyright ©2011 Brooks/Cole, Cengage Learning HT Module 5: Testing Hypotheses about Difference between Two Means Step 1: Determine null and alternative hypotheses H 0 :  1 –  2 =  versus H a :  1 –  2   or H a :  1 –  2  Watch how Population 1 and 2 are defined. Lesson 1: the General (Unpooled) Case

Copyright ©2011 Brooks/Cole, Cengage Learning 24 Step 2: Verify data conditions and compute the test statistic. Both n’s are large or no extreme outliers or skewness in either sample. Samples are independent. The t-test statistic is: Steps 3, 4 and 5: Similar to t-test for one mean.

Copyright ©2011 Brooks/Cole, Cengage Learning 25 Example 13.4 Effect of Stare on Driving Question: Does stare speed up crossing times? Step 1: State the null and alternative hypotheses H 0 :  1 –  2 =  versus H a :  1 –  2 >  where 1 = no-stare population and 2 = stare population. Randomized experiment: Researchers either stared or did not stare at drivers stopped at a campus stop sign; Timed how long (sec) it took driver to proceed from sign to a mark on other side of the intersection.

Copyright ©2011 Brooks/Cole, Cengage Learning 26 Example 13.3 Effect of Stare (cont) Data: n 1 = 14 no stare and n 2 = 13 stare responses Step 2: Verify data conditions … No outliers nor extreme skewness for either group.

Copyright ©2011 Brooks/Cole, Cengage Learning 27 Example 13.3 Effect of Stare (cont) Step 2: … Summarizing data with a test statistic Sample statistic: = 6.63 – 5.59 = 1.04 seconds Standard error:

Copyright ©2011 Brooks/Cole, Cengage Learning 28 Example 13.3 Effect of Stare (cont) Steps 3, 4 and 5: Determine the p-value and make a conclusion in context. The p-value = 0.013, so we reject the null hypothesis, the results are “statistically significant”. The p-value is determined using a t-distribution with df = 21 (df using Welch approximation formula) and finding area to right of t = Table A.3  p-value is between and We can conclude that if all drivers were stared at, the mean crossing times at an intersection would be faster than under normal conditions.

Copyright ©2011 Brooks/Cole, Cengage Learning 29 Lesson 2: Pooled Two-Sample t-Test Based on assumption that the two populations have equal population standard deviations: Note: Pooled df = (n 1 – 1) + (n 2 – 1) = (n 1 + n 2 – 2).

Copyright ©2011 Brooks/Cole, Cengage Learning 30 The null and alternative hypotheses are: H 0 :  1 –  2 =  versus H a :  1 –  2   where 1 = female population and 2 = male population. Example 13.7 Male and Female Sleep Times Data:The 83 female and 65 male responses from students in an intro stat class. Note: Sample sizes similar, sample standard deviations similar. Use of pooled procedure is warranted. Q: Is there a difference between how long female and male students slept the previous night?

Copyright ©2011 Brooks/Cole, Cengage Learning 31 Example 13.5 Male and Female Sleep Times Two-sample T for sleep [without “Assume Equal Variance” option] Sex N Mean StDev SE Mean Female Male % CI for mu(f) – mu(m): (-0.10, 1.02) T-Test mu (f) = mu(m) (vs not =): T-Value = 1.62 P = 0.11 DF = 140 Two-sample T for sleep [with “Assume Equal Variance” option] Sex N Mean StDev SE Mean Female Male % CI for mu(f) – mu(m): (-0.10, 1.03) T-Test mu (f) = mu(m) (vs not =): T-Value = 1.62 P = 0.11 DF = 146 Both use Pooled StDev = 1.72

Copyright ©2011 Brooks/Cole, Cengage Learning Relationship Between Tests and Confidence Intervals For two-sided tests (for one or two means): H 0 : parameter = null value and H a : parameter  null value Note: 95% confidence interval  5% significance level 99% confidence interval  1% significance level If the null value is covered by a (1 –  )100% confidence interval, the null hypothesis is not rejected and the test is not statistically significant at level . If the null value is not covered by a (1 –  )100% confidence interval, the null hypothesis is rejected and the test is statistically significant at level .

Copyright ©2011 Brooks/Cole, Cengage Learning 33 Example 13.9 Ear Infections and Xylitol 95% CI for p 1 – p 2 is to Reject H 0 : p 1 – p 2 =  and accept H a : p 1 – p 2 >  with  = 0.025, because the entire confidence interval falls above the null value of 0. Note that the p-value for the test was 0.01, which is less than

Copyright ©2011 Brooks/Cole, Cengage Learning Choosing an Appropriate Inference Procedure Confidence Interval or Hypothesis Test? Is main purpose to estimate the numerical value of a parameter? … or to make a “maybe not/maybe yes” conclusion about a specific hypothesized value for a parameter?

Copyright ©2011 Brooks/Cole, Cengage Learning Choosing an Appropriate Inference Procedure Determining the Appropriate Parameter Is response variable categorical or quantitative? Is there one sample or two? If two, independent or paired?

Copyright ©2011 Brooks/Cole, Cengage Learning Evaluating Significance in Research Reports 1.Is the p-value reported? If know p-value, can make own decision, based on severity of Type 1 error and p-value. 2.If word significant is used, determine whether used in everyday sense or in statistical sense only. Statistically significant just means that a null hypothesis has been rejected, no guarantee the result has real-world importance. 3.If you read “no difference” or “no relationship” has been found, determine whether sample size was small. Test may have had very low power because not enough data were collected to be able to make a firm conclusion.

Copyright ©2011 Brooks/Cole, Cengage Learning Evaluating Significance in Research Reports 4.Think carefully about conclusions based on extremely large samples. If very large sample size, even weak relationship or small difference can be statistically significant. 5.If possible, determine what confidence interval should accompany a hypothesis test. Intervals provide information about magnitude of effect as well as information about margin of error in sample estimate. 6.Determine how many hypothesis tests were conducted in study. Sometimes researchers perform multitude of tests, but only few achieve statistical significance. If all null hypotheses true, then ~1 in 20 tests will achieve statistical significance just by chance at the.05 level of significance.