T- and Z-Tests for Hypotheses about the Difference between Two Subsamples.

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Estimation Let's return to our example of the random sample of 200 USC undergraduates. Remember that this is both a large and a random sample, and therefore.
Section 9.3 Inferences About Two Means (Independent)
1 Difference Between the Means of Two Populations.
Business Statistics - QBM117
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Chapter Goals After completing this chapter, you should be able to:
Don’t spam class lists!!!. Farshad has prepared a suggested format for you final project. It will be on the web
Inference about a Mean Part II
T-Tests Lecture: Nov. 6, 2002.
Two Population Means Hypothesis Testing and Confidence Intervals With Unknown Standard Deviations.
Hypothesis Testing Sample Means. Hypothesis Testing for Sample Means The goal of a hypothesis test is to make inferences regarding unknown population.
“There are three types of lies: Lies, Damn Lies and Statistics” - Mark Twain.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Hypothesis Testing: Two Sample Test for Means and Proportions
Probability Population:
Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Hypothesis Testing II The Two-Sample Case.
Hypothesis testing – mean differences between populations
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Comparing Two Population Means
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Statistics and Quantitative Analysis U4320
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Chapter 9: Testing Hypotheses
Testing Hypotheses about Differences among Several Means.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
1 Psych 5500/6500 Introduction to the F Statistic (Segue to ANOVA) Fall, 2008.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Two-Sample Hypothesis Testing. Suppose you want to know if two populations have the same mean or, equivalently, if the difference between the population.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
© Copyright McGraw-Hill 2004
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 3 Type I, II Error, Sample Size, and Power Henian Chen, M.D., Ph.D.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
Chapter 10: The t Test For Two Independent Samples.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
3. The X and Y samples are independent of one another.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Chapter 9 Hypothesis Testing.
Presentation transcript:

T- and Z-Tests for Hypotheses about the Difference between Two Subsamples

Random samples, partitioned into two independent subsamples (e.g., men and women). Question: Are the means of some variable (such as salary) significantly different between the two subsample? Key: The sampling distribution of all theoretically possible differences between subsample means. For large samples (i.e., when the Central Limit Theorem holds), this sampling distribution of mean differences is normally shaped; for smaller samples, the sampling distribution takes the shape of one of the Student’s t distributions, identified by degrees of freedom.

The key is: The difference between two means is a single value. In the case of these so-called “means difference tests,” the null hypothesis is that the means in general (i.e., in the universe) do NOT differ. Symbolically, H 0 :  2 -  1 = 0.00 There are two possible alternate hypotheses: nondirectional; H 1 :  2 -  1  0.0 and directional, either H 1 :  2 -  1 > 0.0 or H 1 :  2 -  1  0.0

In the 1984 General Social Survey, female respondents were asked whether or not their mothers had attended college. Then these female respondents were asked about their own education levels. A reasonable (alternate) hypothesis (H 1 ) would be: Women whose mothers attended college will themselves have more formal education than women whose mothers did not attend college. Thus, our two hypotheses are : H 1 :  2 -  1 > 0.0 H 0 :  2 -  1 = 0.00

Dividing respondents into women whose mothers attended college and women whose mothers did not led to the calculation of the following statistics for the two subsamples: Mothers Mothers ATTENDED DID NOT College Attend _ _ Y 2 = Y 1 = s 2 2 = 6.57 s 1 2 = 9.82 N 2 = 90 N 1 = 359 Notice that the sample means DO in fact differ. This is NOT the question.

Because these two subsamples combined exceed 100, we know that the Central Limit Theorem applies. We can convert the difference between the value of the sample mean differences and the presumed value of the mean difference in the universe under the null hypothesis (i.e., 0.0) to z-values by using the estimated standard error of the difference (the standard deviation of the sampling distribution of sample mean differences). Recall that in general standard errors are estimated by dividing the standard deviation of the sample by the square root of the sample size,

However, in the case of this means difference test, we have TWO subsamples and thus TWO standard deviations (actually variances in this example), one for each subsample. What do we do? Simply combine the two subsample variances, as follows:

With the information from above, this means

The algorithm for converting mean differences into z- units should look familiar: In this example,

Selecting alpha = 0.05 for a one-tailed test and looking for a critical value in Appendix 1 (pp ), we again interpolate between and , making z  = Since 8.43 is GREATER THAN the critical value of 1.645, we REJECT the null hypothesis at the 0.05 level and conclude that IN GENERAL women whose mothers attended college have higher education levels than women whose mothers did not attend college.

Z  = 1.645Z = 0.0Z = 8.43

For random samples whose combined size is less than 120, we cannot assume that the sampling distribution of mean differences will be normally shaped. This is because the Central Limit Theorem doesn't hold with samples this small. Student's t distributions must be used instead. The only new wrinkle here is that the standard error of the difference cannot be estimated as above. The sample standard deviations must be pooled in a way that is sensitive to the impact of even slight differences in small numbers. Consider the following example:

The 63-city data set that we are using this semester has been divided into two subsamples, one consisting of SUNBELT cities and the other of FROSTBELT cities. The question is: Did frostbelt cities lose population at a higher rate than sunbelt cities in the fifteen years between ? Sample statistics are these: Frostbelt CitiesSunbelt Cities _ _ Y 2 = Y 1 = 2.84 s 2 2 = 9.98 s 1 2 = N 2 = 37 N 1 = 26

In the sample, frostbelt cities clearly lost population (population change percent) at a greater rate than sunbelt cities (which GAINED population, percent). The question is, is this sample difference sufficient to infer a similar trend in the universe of American cities? Our alternate hypothesis is that in general frostbelt cities lost population at a higher rate than sunbelt cities, hence H 1 :  2 -  1 < 0.0 In other words, we expect  2 to be a larger negative number than  1. Notice again that the presence of the “less than” sign (“<”) dictates a one-tailed test, this time in the left-hand tail where negative mean differences are located.

Our null hypothesis is that there is no difference in the rates of population change, or H 0 :  2 -  1 = 0.00 Since the subsample sizes are relatively small, we can't simply slam the standard deviations together to estimate the standard error of the difference. We must weight the subsample standard deviations (actually the subsample variances) by the size of the subsamples (actually by the number of degrees of freedom in the subsamples) before estimating the standard error. This is called pooling.

In this example,

Now we can use this “pooled” constant to estimate the standard error of the difference. The estimation is where s = s pooled

Now we have the value of the standard error of the difference, 1.396; this is our “currency exchange rate” that will allow us to determine the value of the test statistic.

The algorithm should look familiar: In this example,

We now know that location on the sampling distribution converts to a t-location of on the underlying X-axis. But what is the exact SHAPE of this sampling distribution? It is the Student’s t distribution with 61 degrees of freedom. We have 63 cities selected randomly, but the cities have been subdivided into two subsamples (frostbelt and sunfelt cities). The 37 frostbelt-city subsample has 36 degrees of freedom ( = 36), and the 26 sunbelt-city subsample has 25 degrees of freedom ( = 25). More generally, the number of degrees of freedom in the t-test is df = N 2 + N 1 - 2

Consulting Appendix 2 (p. 543) in search of the critical value for df = 61, we once again find only critical values for df = 60 and for df = 120. Let's assume that we want to make our test with alpha = We could interpolate, by finding the value that is 1 / 60 th of the way from to This value is extremely small and would round back to But we need the critical value for the left (negative) tail, and Appendix 2 has only values for the right (positive) tail. Because Student's t distributions are all SYMMETRICAL, we simply add the negative sign to Thus, our critical value is t 0.05 = –

Since t = is GREATER THAN t  = , this means that our sample difference lies INSIDE the region of rejection in the left-hand tail. Thus, we REJECT the null hypothesis at the 0.05 level and conclude that in general frostbelt cities in the U.S. probably did lose population at a greater rate than sunbelt cities in the period 1960 and 1974.

t = 0.0 t = t  =

Using SAS to Produce T-Tests libname old 'a:\'; libname library 'a:\'; options nonumber nodate ps=66; proc ttest data=old.cities; class agecity2; var manufpct; title1 'An Example of a T-Test'; title2 'SAS Version 8.1'; title3 'PPD 404'; run;

An Example of a T-Test SAS Version 8.1 PPD 404 The TTEST Procedure Statistics Lower CL Upper CL Lower CL Upper CL Variable Class N Mean Mean Mean Std Dev Std Dev Std Dev Std Err MANUFPCT Newer MANUFPCT Older MANUFPCT Diff (1-2) T-Tests Variable Method Variances DF t Value Pr > |t| MANUFPCT Pooled Equal MANUFPCT Satterthwaite Unequal Equality of Variances Variable Method Num DF Den DF F Value Pr > F MANUFPCT Folded F

Reject the null hypothesis (H 0 ) when either: 1. the value of the statistical test (  2, z, t, F', or F) exceeds the critical value at the chosen  -level; or, 2. the p-value for the statistical test is smaller than the chosen value of .

Do NOT reject the null hypothesis (H 0 ) when either: 1. the value of the statistical test (  2, z, t, F', or F) is less than the critical value at the chosen  -level; or, 2. the p-value for the statistical test is greater than the chosen value of .

t = 0.0t = t  = t =

Exercise 1 Means Difference Test In the 1984 General Social Survey (GSS), 234 male respondents with at least some college had a mean occupational prestige score of (with a variance of ). In contrast, 351 male respondents with only a high school education or less had an average occupational prestige score of (with a variance of ). Test the null hypothesis (H 0 ) that there is no statistically significant difference in occupational prestige between these two groups. Assume that  = Perform a two-tailed test. Make your decision regarding the null hypothesis using z-values in Appendix 1 (“Proportions of Area under Standard Normal Curve"), pp What is the value of the standard error? 2. What is the value of Z? 3. What are the values of Z at the 2.5 percent and 97.5 percent areas under the normal curve? 4. Do you reject or accept this null hypothesis?

Exercise 1 Answers Means Difference Test 1. What is the value of the standard error? What is the value of Z? What are the values of Z at the 2.5 percent and 97.5 percent areas under the normal curve?  Do you reject or accept this null hypothesis? Reject

Exercise 2 Two Independent Samples t-test In an experiment to determine the effects of hunger on hand- eye coordination, the following results, representing the number of tasks completed success-fully, were obtained: Experimental Group (#1) Control Group (#2) (Hungry) (Normal) Mean S.D N 1012 Calculate the estimated standard error of the difference, obtain the value of t, and test the hypothesis that the normally-fed (control) group performed better than did the hungry (experimental) group. Use Student's t distribution (Appendix 2, p. 543), and assume that  = Perform a one-tailed test.

Exercise 2 (continued) Two Independent Samples t-test 1. Expressed symbolically, what is the alternate hypothesis? 2. Expressed symbolically, what is the null hypothesis? 3. What is the value of the standard error? 4. What is the value of t? 5. How many degrees of freedom in this problem? 6. What is the critical value of t df ? 7. Do you reject or accept the null hypothesis?

Exercise 2 Answers Two Independent Samples t-test 1. Expressed symbolically, what is the alternate hypothesis?  2 -  1 > Expressed symbolically, what is the null hypothesis?  2 -  1 = What is the value of the standard error? What is the value of t? How many degrees of freedom in this problem? What is the critical value of t df ? Do you reject or accept the null hypothesis? Reject