Lecture 9 TWO GROUP MEANS TESTS EPSY 640 Texas A&M University.



Two independent groups experiments: randomization distributions. Six scores (persons, things) can be randomly split into two groups of three in 20 ways.
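A minimal sketch of the randomization idea, assuming six hypothetical scores (the values 1 to 6 are illustrative, not data from the lecture). It lists every way to put three of the six scores in the first group and records the difference between the two group means for each split.

```python
from itertools import combinations

# Hypothetical scores; any six numbers would do.
scores = [1, 2, 3, 4, 5, 6]


def randomization_differences(values, group_size):
    """Mean of group 1 minus mean of group 2 for every possible split."""
    total = sum(values)
    n = len(values)
    diffs = []
    for idx in combinations(range(n), group_size):
        sum1 = sum(values[i] for i in idx)
        mean1 = sum1 / group_size
        mean2 = (total - sum1) / (n - group_size)
        diffs.append(mean1 - mean2)
    return diffs


diffs = randomization_differences(scores, 3)
print(len(diffs))     # number of distinct splits
print(sorted(diffs))  # the randomization distribution of mean differences
```

The sorted differences are symmetric about zero, which is the distribution the t-distribution is later used to approximate.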

Two independent groups experiments Differences between groups can be arranged as follows: look familiar?

t-distribution. Gosset discovered it. It is similar to the normal distribution but has heavier tails, and it is different for each sample size: for two groups it is based on N - 2 degrees of freedom. The randomization distribution of differences is approximated by the t-distribution.

t-distribution assumptions. NORMALITY –(Shapiro-Wilk W test in SPSS). HOMOGENEITY OF VARIANCES IN BOTH GROUPS’ POPULATIONS –Levene’s test in SPSS. INDEPENDENCE OF ERRORS –logical evaluation –Durbin-Watson test in serial data.

Null hypothesis for test of means for two independent groups. $H_0: \mu_1 - \mu_2 = 0$; $H_1: \mu_1 - \mu_2 \neq 0$. Fix a significance level, $\alpha$. Then we select a sample statistic. In this case we choose the sample mean for each group, and the test statistic is the sample difference $d = \bar{y}_1 - \bar{y}_2$.

Variance and standard deviation of differences in the population. The variance in the POPULATION of a difference of two independent scores is $\sigma^2_d = \sigma^2(y_1 - y_2) = \sigma^2_{y_1} + \sigma^2_{y_2}$, and $\sigma_d = \sqrt{\sigma^2(y_1 - y_2)}$ is the standard error of the difference. Example: $\sigma^2_1 = 100$, $\sigma^2_2 = 100$, so $\sigma^2(y_1 - y_2) = 100 + 100 = 200$ and $\sigma(y_1 - y_2) = 14.14$.

Variance and standard deviation of difference in means. The variance of the difference between the two sample MEANS is each population's score variance divided by its sample size, summed over the two groups: $\sigma^2_d = \sigma^2(\bar{y}_1 - \bar{y}_2) = \sigma^2_{y_1}/n_1 + \sigma^2_{y_2}/n_2$, and $\sigma_d = \sqrt{\sigma^2(\bar{y}_1 - \bar{y}_2)}$. Example: $\sigma^2_1 = 100$, $\sigma^2_2 = 100$, $n_1 = 16$, $n_2 = 16$. For single scores, $\sigma^2(y_1 - y_2) = 100 + 100 = 200$. For means, $\sigma^2_d = 100/16 + 100/16 = 12.5$, so $\sigma_d = 3.54$, the standard deviation of the mean difference.

MEANING OF VARIANCE OF POPULATION MEAN DIFFERENCE. WE ASSUMED EQUAL VARIANCES FOR THE TWO POPULATIONS. THUS, THE VARIANCE OF THE DIFFERENCE IS EQUAL TO THE SINGLE VARIANCE (AVERAGE OF THE TWO VARIANCES) TIMES THE SUM OF 1/SAMPLE SIZE: $\sigma^2_d = \sigma^2_{y_1}/n_1 + \sigma^2_{y_2}/n_2 = \sigma^2 (1/n_1 + 1/n_2)$

Mean difference = 0 under the null hypothesis. t-distribution, df = 16 + 16 - 2 = 30. Critical t(30) ≈ 2.04, so 2.04 SDs = 2.04 × 3.54 = 7.22 points are needed for significance from difference = 0.

Standard error of mean difference score for unequal sample sizes. The standard error of the sample difference is the square root of the product of two terms: the weighted average variance of the two samples, $[(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2] / (n_1 + n_2 - 2)$, and the sum of 1/sample size, $(1/n_1 + 1/n_2)$. This is the same as the previous slide; the only difference is the adjustment for different sample sizes in the two groups.

Null hypothesis for test of means for two independent groups. $t = d / s_d = (\bar{y}_1 - \bar{y}_2) / \sqrt{\{[(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2] / (n_1 + n_2 - 2)\}\{1/n_1 + 1/n_2\}}$. The first factor under the square root is the weighted average variance of the two groups; the second factor holds the sampling weights.
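The pooled-variance t-statistic can be sketched from summary statistics as follows. The function names are illustrative, and the two group means (55 and 50) are hypothetical; the variances of 100 and sample sizes of 16 are the values used in the earlier slide example, so the standard error should come out near 3.54.

```python
import math


def pooled_variance(n1, s1_sq, n2, s2_sq):
    """Weighted average of the two sample variances."""
    return ((n1 - 1) * s1_sq + (n2 - 1) * s2_sq) / (n1 + n2 - 2)


def pooled_t(mean1, s1_sq, n1, mean2, s2_sq, n2):
    """Return (t, standard error, degrees of freedom) for two independent groups."""
    sp_sq = pooled_variance(n1, s1_sq, n2, s2_sq)
    se = math.sqrt(sp_sq * (1 / n1 + 1 / n2))
    t = (mean1 - mean2) / se
    df = n1 + n2 - 2
    return t, se, df


# Means are hypothetical; variances and sample sizes follow the slide example.
t, se, df = pooled_t(55.0, 100.0, 16, 50.0, 100.0, 16)
print(round(se, 2), df, round(t, 3))
```

With equal sample sizes the pooled variance is the plain average of the two variances; the weighting only matters when the groups differ in size.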

Boy-Girl differences on Sense of Inadequacy on BASC for a nonrandom sample

REGRESSION APPROACH. SPSS output for the regression of SENSE OF INADEQUACY (dependent variable) on SEX (predictor, with constant): an ANOVA table (Sum of Squares, df, Mean Square, F, Sig. for Regression, Residual, Total) and a Model Summary (R, R Square, Adjusted R Square, Std. Error of the Estimate).

REGRESSION COEFFICIENTS FOR SEX PREDICTING SENSE OF INADEQUACY

VENN DIAGRAM OF REGRESSION. Circles for Sense of Inadequacy and sex, with regions labeled SSresidual and SSregression; $R^2 = .005$.

Path Diagram for Group Mean Difference. SEX → SENSE OF INADEQUACY ← ERROR, with error path = .997.

Correlation representation of the two independent groups experiment. $t^2 = r^2_{pb} / [(1 - r^2_{pb}) / (n - 2)]$, and equivalently $r^2_{pb} = t^2 / (t^2 + n - 2)$.

Correlation representation of the two independent groups experiment. $r_{pb} = [t^2 / (t^2 + n - 2)]^{1/2}$
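As a quick check on the two formulas, here is a small sketch that converts between t and the point-biserial correlation. The function names and the example values (t = 2.0, n = 32) are hypothetical; n is the total number of scores in both groups.

```python
import math


def r_pb_from_t(t, n):
    """Point-biserial correlation implied by a two-group t with n total scores."""
    r = math.sqrt(t * t / (t * t + n - 2))
    return math.copysign(r, t)  # keep the sign of the mean difference


def t_from_r_pb(r, n):
    """Two-group t implied by a point-biserial correlation (requires |r| < 1)."""
    return r * math.sqrt((n - 2) / (1 - r * r))


r = r_pb_from_t(2.0, 32)
t_back = t_from_r_pb(r, 32)
print(round(r, 4), round(t_back, 4))
```

Converting t to r and back recovers the original t, which is the sense in which the two tests are equivalent.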

Test of point biserial = 0. $H_0: \rho_{pb} = 0$; $H_1: \rho_{pb} \neq 0$. This is equivalent to the t-test for the difference between two means.

POINT-BISERIAL CORRELATION. Figure: Y plotted against X for the two groups M and F, each group's scores shown as a row of X's with its mean marked m.

Example. Willson (1997) studied two groups of college freshman engineering students, one group having participated in an experimental curriculum while the other was a random sample from the standard curriculum. One outcome of interest was performance on the Mechanics Baseline Test, a physics measure (Hestenes & Swackhammer, 1992). A significance level of .01 was selected for the hypothesis that the experimental group performed better than the standard curriculum group (a directional test). The data for the two groups, as they enter the computation: Exper, mean 47, SD 15, sample size 75; Std Cur, mean 37, SD 16, sample size 50. $t = (47 - 37) / \sqrt{\{[(74 \times 15^2) + (49 \times 16^2)] / (75 + 50 - 2)\}[1/75 + 1/50]} = 10 / \sqrt{[(16650 + 12544) / 123][1/75 + 1/50]}$. The t-statistic is compared with the tabled value for a t-statistic with 123 degrees of freedom at the .01 significance level. The observed probability of occurrence is .02691, greater than the intended level of significance. The conclusion was that the experimental curriculum group, while performing better than the standard group, did not significantly outperform it.

Confidence interval around d. $d \pm t_{\alpha} \sqrt{\{[(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2] / (n_1 + n_2 - 2)\}\{1/n_1 + 1/n_2\}}$. Thus, for the example, using the .01 significance level, the confidence interval is $d \pm t_{\alpha}(1.037) = (-1.281, 4.077)$. This includes 0 (zero), so we do not reject the null hypothesis.

Wilcoxon rank sum test for two independent groups. While the t-distribution approximates the randomization distribution of standardized differences of sample means for large sample sizes, for small samples it is not the best procedure for every unknown distribution. If we do not know that the population is normally distributed, a better alternative is the Wilcoxon rank sum test.
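A minimal sketch of the rank sum statistic, assuming two small hypothetical samples. It ranks the combined scores (tied scores share the average rank), sums the ranks of the first group, and standardizes that sum with the usual large-sample mean and standard deviation. No tie correction is applied, and for samples this small an exact table would normally be used instead of the normal approximation.

```python
import math


def midranks(values):
    """1-based ranks of values; tied values receive the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum(group1, group2):
    """Return (W, z): rank sum of group1 and its normal approximation."""
    n1, n2 = len(group1), len(group2)
    ranks = midranks(list(group1) + list(group2))
    w = sum(ranks[:n1])
    mean_w = n1 * (n1 + n2 + 1) / 2
    sd_w = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # no tie correction
    return w, (w - mean_w) / sd_w


# Hypothetical data for illustration only.
w, z = rank_sum([12, 15, 18, 20], [9, 10, 11, 14])
print(w, round(z, 3))
```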


Dependent groups experiments. $d = y_1 - y_2$ for each pair. Now the hypotheses about the new scores become $H_0: \mu_d = 0$; $H_1: \mu_d \neq 0$. The sample statistic is simply the mean sample difference, $\bar{d}$. The standard error of the difference can be computed from the standard deviation of the difference scores divided by $\sqrt{n}$, where n is the number of pairs.

Standard deviation of differences in related (dependent) data. $s^2(y_1 - y_2) = s_1^2 + s_2^2 - 2 r_{12} s_1 s_2$. Example: $s_1^2 = 100$, $s_2^2 = 144$, $r_{12} = .7$, so $s^2(y_1 - y_2) = 100 + 144 - 2(.7)(10)(12) = 244 - 168 = 76$ and $s(y_1 - y_2) = 8.72$.

Dependent groups experiments. $s_{\bar{d}} = \sqrt{[s_1^2 + s_2^2 - 2 r_{12} s_1 s_2] / n}$. Then the t-statistic is $t = \bar{d} / s_{\bar{d}}$.
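The dependent-samples computation can be sketched from summary statistics alone. The function names are illustrative; the variances (100 and 144) and correlation (.7) are the values from the earlier slide example, while the mean difference of 3.0 and the 25 pairs are hypothetical.

```python
import math


def difference_variance(s1_sq, s2_sq, r12):
    """Variance of y1 - y2 for correlated scores."""
    return s1_sq + s2_sq - 2 * r12 * math.sqrt(s1_sq) * math.sqrt(s2_sq)


def paired_t(mean_diff, s1_sq, s2_sq, r12, n_pairs):
    """Return (t, standard error of the mean difference, degrees of freedom)."""
    var_d = difference_variance(s1_sq, s2_sq, r12)
    se = math.sqrt(var_d / n_pairs)
    return mean_diff / se, se, n_pairs - 1


var_d = difference_variance(100.0, 144.0, 0.7)
print(round(var_d, 2), round(math.sqrt(var_d), 2))

# Mean difference and number of pairs are hypothetical.
t, se, df = paired_t(3.0, 100.0, 144.0, 0.7, 25)
print(round(t, 3), round(se, 3), df)
```

The larger the correlation between the paired scores, the smaller the difference variance, which is why a dependent design can be more sensitive than an independent one with the same number of scores.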

Dependent groups experiments. In a study of the change in grade point average for a group of college engineering freshmen, Willson (1997) recorded data over two semesters for a physics course, summarized by N, mean, and standard deviation for the two PHYS variables, with correlation $r_{12} = .5517$. To test the hypothesis that the grade average changed after the second semester from the first, for a significance level of .01, the dependent samples t-statistic is $t = [2.648 - 2.233] / \sqrt{[s_1^2 + 1.201^2 - 2(.5517)(s_1)(1.201)] / 128} = .415 / .1001 \approx 4.15$. This is greater than the tabled t-value $\pm t(128 - 1)$. Therefore, it was concluded that the students averaged higher the second semester than the first.

VENN DIAGRAM. Regions: SS Between pairs, SS Within pairs, SS Treatment, SS Treatment*Pair. Design for pairs of persons, each assigned to experimental or control condition.

VENN DIAGRAM. Regions: SS Between Persons, SS Within Persons, SS Time, SS Time*Person. Time 1 vs. Time 2 comparison of achievement for a group of persons.

Nonparametric test of difference in dependent samples: the sign test. A count of the positive (or negative) difference scores is compared with a binomial sign table. This sign test is identical to deciding whether a coin is fair by flipping it n times and counting the number of heads. Under the null hypothesis the count should equal n/2 to within sampling error, with standard error $.5\,n^{1/2}$. As n becomes large, the number of positive difference scores minus n/2, divided by the standard error, is approximately standard normal. An alternative to the sign test is the Wilcoxon signed rank test, or symmetry test.
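A small sketch of the sign test, assuming ten hypothetical difference scores. It counts the positive differences, computes an exact two-sided binomial probability in place of a printed sign table, and also reports the large-sample z described above. Dropping zero differences is a common convention and is an assumption here, not something stated in the slides.

```python
import math


def sign_test(differences):
    """Return (positives, n, exact two-sided p, normal-approximation z)."""
    pos = sum(1 for d in differences if d > 0)
    neg = sum(1 for d in differences if d < 0)
    n = pos + neg  # zero differences are dropped (assumed convention)
    if n == 0:
        raise ValueError("need at least one nonzero difference")
    k = min(pos, neg)
    tail = sum(math.comb(n, i) for i in range(k + 1)) / 2 ** n
    p_two_sided = min(1.0, 2 * tail)
    z = (pos - n / 2) / (0.5 * math.sqrt(n))
    return pos, n, p_two_sided, z


# Hypothetical difference scores for illustration only.
pos, n, p, z = sign_test([2, 3, -1, 4, 5, 1, 2, -2, 3, 6])
print(pos, n, p, round(z, 3))
```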

DIFFERENCE BETWEEN SENSE OF INADEQUACY AND ANXIETY IN BASC SAMPLE- NONPARAMETRIC

Summary of two group experimental tests of hypothesis. The table below is a compilation of the last two chapters, organized by: –sample size –one or two groups –normal distribution or not –known or unknown population variance(s)

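| One or Two Groups | Independent or Dependent | Normal Distribution Assumed? | Hypotheses | Population variance known? | Test Statistic | Distribution |
|---|---|---|---|---|---|---|
| One | not applicable | Yes | $H_0: \mu = a$; $H_1: \mu \neq a$ | $\sigma^2$ known | $z = (\bar{y}_. - a) / [\sigma^2/n]^{1/2}$ | normal |
| One | not applicable | Yes | $H_0: \mu = a$; $H_1: \mu \neq a$ | $\sigma^2$ unknown | $t = (\bar{y}_. - a) / [s^2/n]^{1/2}$ | t with n-1 df |
| One | not applicable | No | $H_0: \mu = a$; $H_1: \mu \neq a$ | $\sigma^2$ unknown | $S = \sum R^+_i$, summed over $y_i > a$; or $n^+ = \sum i^+$, where $i^+ = 1$ if $y_i > a$, 0 else | Wilcoxon signed rank; binomial (sign test) |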

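| One or Two Groups | Independent or Dependent | Normal Distribution Assumed? | Hypotheses | Population variance known? | Test Statistic | Distribution |
|---|---|---|---|---|---|---|
| Two | Independent | Yes | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1 = \sigma^2$, known | $z = (\bar{y}_{0.} - \bar{y}_{1.}) / [\sigma^2 (1/n_0 + 1/n_1)]^{1/2}$ | normal |
| Two | Independent | Yes | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1$, unknown | $t = (\bar{y}_{0.} - \bar{y}_{1.}) / [s^2 (1/n_0 + 1/n_1)]^{1/2}$, with $s^2 = [(n_0 - 1)s_0^2 + (n_1 - 1)s_1^2] / (n_0 + n_1 - 2)$ | t with $n_0 + n_1 - 2$ df |
| Two | Independent | No | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1$, unknown | $S = \sum R^+_i$ for one of the groups | Wilcoxon rank sum |
| Two | Dependent | Yes | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1 = \sigma^2$, known | $z = (\bar{y}_{0.} - \bar{y}_{1.}) / [2\sigma^2 (1 - \rho)/n]^{1/2}$, where $\rho$ = population correlation between $y_0$ and $y_1$ | normal |
| Two | Dependent | Yes | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1 = \sigma^2$, unknown | $t = (\bar{y}_{0.} - \bar{y}_{1.}) / [2 s^2 (1 - r)/n]^{1/2}$, where $r$ = sample correlation between $y_0$ and $y_1$; the difference-score variance is $s_0^2 + s_1^2 - 2 r s_0 s_1$, which equals $2s^2(1 - r)$ when the two variances are equal | t with n-1 df |
| Two | Dependent | No | $H_0: \mu_0 - \mu_1 = 0$; $H_1: \mu_0 - \mu_1 \neq 0$ | $\sigma^2_0 = \sigma^2_1 = \sigma^2$, unknown | $S = \sum R^+_i$ for positive differences | Wilcoxon signed rank |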