Presentation is loading. Please wait.

Presentation is loading. Please wait.

STAT 3120 Statistical Methods I Lecture Notes 6 Analysis of Variance (ANOVA)

Similar presentations


Presentation on theme: "STAT 3120 Statistical Methods I Lecture Notes 6 Analysis of Variance (ANOVA)"— Presentation transcript:

1 STAT 3120 Statistical Methods I Lecture Notes 6 Analysis of Variance (ANOVA)

2 Testing for Relationships Among Variables Dependent Variable Independent (predictor) Variable Hypothesis TestComments Categorical (Qualitative) Chi-SquareTests if variables are statistically independent (i.e. are they related or not?) QuantitativeCategorical (Qualitative) T-TEST ANOVA Non-Parametric Determines if categorical variable (factor) affects dependent variable; typically used for experimental or planned change studies Quantitative Regression Analysis Test establishes a regression model; used to explain, predict or control dependent variable

3 ANalysis Of VAriance A few points about ANOVA: Used to determine if statistical differences exist among three or more groups; (Question – why not use multiple ttests?) Assumes that the groups are of approximately equal size, have approximately normal distributions and are independent of each other; The test statistic for ANOVA is the F-stat.

4 ANOVA There are two “templates” to retain in your mind when conducting ANOVA. The first involves the structure of the data: This notation can be found on page 387. DATA MEAN Treatment 1Y11Y12Y13Y14Y1 Treatment 2Y21Y22Y23Y24Y2 Treatment 3Y31Y32Y33Y34Y3 Treatment 4Y41Y42Y43Y44Y4 Treatment 5Y51Y52Y53Y54Y5

5 ANOVA The second involves the actual computation of the test statistic: This notation can be found on page 389. Source of Variation Sum of Squares DFMean SquaresF Statistic Between Groups SSBt-1 (where t= number of treatments or different groups) s 2 B = SSB/t-1 s 2 B / s 2 W Within Groups SSWn T -ts 2 W = SSW/(n T -t) TotalTSSn T -1

6 ANOVA Consider an Experiment where 4 groups of 5 subjects are exposed to 4 different advertising strategies, and we measure the level of Retention for the subjects in each group as shown: GroupMEAN Verbal252728243127 Image222519182622 Repetition1714201316 Control182123171619

7 ANOVA If the sample means were identical or very similar, one would not claim that the Ad strategy affects retention. If the sample means are substantially different, one might conclude that the Ad strategy affects retention.

8 ANOVA According to the null hypothesis, the Ad strategy does not influence retention. According to the alternate hypothesis, the Ad strategy affects retention.

9 ANOVA If the classification into groups is ignored, one can compute the sum of squares of the observations about the mean of all of the scores. This is called the TOTAL SUM OF SQUARES. That value can be decomposed into two independent sources of variability.

10 Some of the variability among the observations is a result of differences among people (or things). This is called the WITHIN GROUPS sum of squares. Some of the variability among the observations is a result of differences among the groups. This is called the BETWEEN GROUPS sum of squares. ANOVA

11 Decomposition of Total Deviation SST = SSW + SSB  i  j (X ij -X) 2 =  i  j (X ij -X j ) 2 + n  j (X j -X) 2 _ _ _ SST = Total Sum of Squares SSW = Sum of Squares Within Groups SSB = Sum of Squares Between Groups X = mean of data for all the sample groups combined X j = mean of the jth sample group X ij = the ith element from the jth group n = number of samples in each group _ _

12 Computation of Sums of Squares  Total Sum of Squares (SST) –  i  j (X ij -X) 2 = 210 + 55 + 155 + 54 = 474  Within Groups Sum of Squares –  i  j (X ij -X j ) 2 = 30 + 50 + 30 + 34 = 144  Between Groups Sum of Squares –n  j (X j -X) 2 = 5(6 2 + 1 2 + 5 2 + 2 2 ) = 330 _ _ _

13 Decomposition of SST 474 Exp. Factor (330) All Other Variables (144) Between Within Impact of

14 Degrees of Freedom  Degrees of freedom must be computed for each source of variability.  The degrees of freedom for total is (n T -1) = 19.  The degrees of freedom for between groups is (t-1) = 3  The degrees of freedom for within groups is ( n T - t) = 16.

15 Analysis of Variance Table SOURCESUM OF SQUARES DEGREES OF FREEDOM MEAN SQUARE F-stat BETWEEN3303110 12.22 WITHIN144169 TOTAL47419


Download ppt "STAT 3120 Statistical Methods I Lecture Notes 6 Analysis of Variance (ANOVA)"

Similar presentations


Ads by Google