Math 3680 Lecture #15 Hypothesis Testing: The t Test.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Introduction to Statistics
Hypothesis Testing Using a Single Sample
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Chapter 9 Chapter 10 Chapter 11 Chapter 12
Sampling Distributions
Inference about a Mean Part II
T-Tests Lecture: Nov. 6, 2002.
8-4 Testing a Claim About a Mean
AP Statistics Section 10.2 A CI for Population Mean When is Unknown.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 11: Inference for Distributions
Inferences About Process Quality
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Statistics 270– Lecture 25. Cautions about Z-Tests Data must be a random sample Outliers can distort results Shape of the population distribution matters.
Hypothesis Testing Using The One-Sample t-Test
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Getting Started with Hypothesis Testing The Single Sample.
7.3 Hypothesis Testing for the Mean (Small Samples) Statistics Mrs. Spitz Spring 2009.
CHAPTER 23 Inference for Means.
Copyright © 2005 by Evan Schofer
Chapter 24: Comparing Means.
AM Recitation 2/10/11.
Hypothesis Testing:.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Overview Definition Hypothesis
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Section 7.3 Hypothesis Testing for the Mean (  Unknown).
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Slide 23-1 Copyright © 2004 Pearson Education, Inc.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Confidence Intervals and Significance Testing in the World of T Welcome to the Real World… The World of T T.
Created by Erin Hodgess, Houston, Texas Section 7-5 Testing a Claim About a Mean:  Not Known.
Hypothesis Tests: One Sample Mean
Student’s t-distributions. Student’s t-Model: Family of distributions similar to the Normal model but changes based on degrees-of- freedom. Degrees-of-freedom.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
CHAPTER 18: Inference about a Population Mean
Significance Tests: THE BASICS Could it happen by chance alone?
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Math 137 Hypothesis Testing: The t Test Fresno State Dr. Burger.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Sociology 5811: Lecture 11: T-Tests for Difference in Means Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Essential Statistics Chapter 161 Inference about a Population Mean.
AP Statistics Chapter 24 Comparing Means.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Aim: How do we use a t-test?
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Math 3680 Lecture #13 Hypothesis Testing: The z Test.
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
Lecture Slides Elementary Statistics Twelfth Edition
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Review of Power of a Test
Lecture Nine - Twelve Tests of Significance.
Math 4030 – 10a Tests for Population Mean(s)
CHAPTER 8 Confidence Estimating with Estimating a Population 8.3 Mean
Hypothesis Tests for a Population Mean,
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
CHAPTER 18: Inference about a Population Mean
Hypothesis Tests for Two Population Standard Deviations
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Hypothesis Tests for a Standard Deviation
CHAPTER 18: Inference about a Population Mean
CHAPTER 18: Inference about a Population Mean
Presentation transcript:

Math 3680 Lecture #15 Hypothesis Testing: The t Test

In all of the previous examples, we assumed that we knew the population standard deviation  In practice, this is an extremely rare situation! Instead, the sample standard deviation S is used in lieu of  This approximation is called the bootstrap estimate.

Example. The manufacturer of a new fiberglass tire claims that its average life will be at least 40,000 miles. To verify this claim, a sample of 12 tires is tested, and the lifetimes were found to be 36,10040,20033,80038,500 42,00035,80037,00041,000 36,80037,20033,00036,000 Test the manufacture’s claim using  = 0.05.

Solution. H 0 : The average life is at least 40,000 miles. H a : The average life is less than 40,000 miles. We choose  = Notice that we have to compute the sample SD this time; it’s not given.

We now can compute the test statistic But there’s a catch: since we used S instead of , the test statistic t does NOT follow the normal curve.

Since we used S instead of , the test statistic t does NOT follow the normal curve. Instead, there’s a theorem which says that the test statistic t follows the Student t distribution with n - 1 degrees of freedom. (There’s a slight catch with this theorem that we’ll discuss later.) For the current problem, instead of using the normal curve to compute the observed significance level, we will use the Student t distribution with 11 degrees of freedom.

For the sake of completeness, here’s the pdf of the Student’s t-distribution with r degrees of freedom: If you find this intimidating, don’t worry: we will never use it.

Red: t distributionBlue: standard normal curve

For the t distribution with 11 df, we can compute the critical value for  = 0.05 using Table 4 (p. 510). In Excel, be careful: the command is TINV(0.1,11). (The default is for two tails, not one tail.) 5%

In Excel, the command is TDIST(3.448,11,1). (The third entry specifies the number of tails.) Observed Significance Level

Conclusion: We reject the null hypothesis. There is good reason to believe that the average lifespan of the tires is less than 40,000 miles. Note: It is possible to compute power with the Student’s t-distribution, but the computations are much, much more complicated than the normal case (Larsen & Marx, 3rd ed., p. 447). Many statistical software packages are able to compute power for the t-test automatically.

Excel: Use the command =TTEST(A1:A12, B1:B12,1, 1) This is silly, I know, but you have to list the claimed average once for each entry in the list. The blue 1 stands for a one-tailed test; the second 1 is required.

Remember: If the sample is small (n < 30) and the population variance  is unknown, then we use the t-test and not the z-test. On the other hand, if either  is known or the sample is sufficiently large (n > 30), then we may safely use the z-test instead. Also, we must be careful about stating the null and alternative hypotheses so that we correctly choose whether to use a left-tail, a right-tail, or both tails.

Example. Before a substance can be deemed safe for landfilling, its chemical properties must be assessed. In a sample of six replicates of sludge from a New Hampshire wastewater treatment plant, the mean pH was 6.68 with a standard deviation of Can we conclude than the mean pH is less than 7.0? J. Benoit, T. Eighmy and B. Crannell, Journal of Geotechnical and Geoenvironmental Engineering 1999, pp

Example. Certain rectangles appear more pleasing to the eye than others. The ancient Greeks called a rectangle with the golden rectangle, and this ratio was called the golden ratio. The golden ratio has been claimed to be a deliberate design of various art and architecture.

The data below shows the width-to-length ratios of beaded rectangles used by the Shoshone Native Americans to decorate their leather goods. Does it appear that the golden rectangle is also an aesthetic standard for the Shoshones? C. Dubois, ed., Lowie’s Selected Papers in Anthropology (UC Press, Berkeley, 1960), pp

Robustness of the t Test

The t statistic is defined by If X 1, X 2, …, X n follow a normal distribution, then there’s a theorem that says that this t statistic follows the Student t-distribution with n - 1 degrees of freedom. But, in real life, this assumption is almost certainly not true. Models are idealized; real data are, well, real. Now what? (FYI: The exact answer is )

The good news is that the underlying pdf doesn’t have to be very close to normal in order for the test statistic to be close to the Student t-distribution. The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from a “triangular” distribution with pdf which is not too far off from bell shaped. Even for very small samples, the t distribution is accurate.

The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from a Uniform(0,1) distribution, which is symmetric but decisively not bell-shaped. Notice that convergence does not occur as quickly.

The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from an Exponential(1) distribution, which is neither symmetric nor bell-shaped. This time, the sample has to be of size 40 or so for the t distribution to be accurate… that’s mostly due to the Central Limit Theorem. (FYI: The exact answer is )

Observations The distribution of the t statistic is relatively unaffected by the pdf of the X i, as long as –The pdf is not too skewed, and –The sample size is not too small. As the sample size n increases, the distribution of the t statistic gets closer to the Student t-distribution with n -1 degrees of freedom.

Observations We succinctly describe this as saying that the t test is robust, meaning that it is not heavily dependent on the underlying assumption of normality. The practical importance of this robustness is that the t test can be used in real-life situations.

Practical Implications If n < 15, the data should be nearly normal. Make a histogram. If there are outliers or strong skewness, do not use the t-test. If 15  n  40, make a histogram to check that the data is unimodal, free of outliers, and reasonably symmetric. Again, make a histogram. If n > 40, the t-test is safe even if the data is skewed.