Presentation is loading. Please wait.

Presentation is loading. Please wait.

Analysis of Variance Introduction The Analysis of Variance is abbreviated as ANOVA The Analysis of Variance is abbreviated as ANOVA Used for hypothesis.

Similar presentations


Presentation on theme: "Analysis of Variance Introduction The Analysis of Variance is abbreviated as ANOVA The Analysis of Variance is abbreviated as ANOVA Used for hypothesis."— Presentation transcript:

1

2 Analysis of Variance Introduction

3 The Analysis of Variance is abbreviated as ANOVA The Analysis of Variance is abbreviated as ANOVA Used for hypothesis testing in Used for hypothesis testing in Simple Regression Simple Regression Multiple Regression Multiple Regression Comparison of Means Comparison of Means

4 Sources There is variation anytime that all of the data values are not identical There is variation anytime that all of the data values are not identical This variation can come from different sources such as the model or the factor This variation can come from different sources such as the model or the factor There is always the left-over variation that can’t be explained by any of the other sources. This source is called the error There is always the left-over variation that can’t be explained by any of the other sources. This source is called the error

5 Variation Variation is the sum of squares of the deviations of the values from the mean of those values Variation is the sum of squares of the deviations of the values from the mean of those values As long as the values are not identical, there will be variation As long as the values are not identical, there will be variation Abbreviated as SS for Sum of Squares Abbreviated as SS for Sum of Squares

6 Degrees of Freedom The degrees of freedom are the number of values that are free to vary once certain parameters have been established The degrees of freedom are the number of values that are free to vary once certain parameters have been established Usually, this is one less than the sample size, but in general, it’s the number of values minus the number of parameters being estimated Usually, this is one less than the sample size, but in general, it’s the number of values minus the number of parameters being estimated Abbreviated as df Abbreviated as df

7 Variance The sample variance is the average squared deviation from the mean The sample variance is the average squared deviation from the mean Found by dividing the variation by the degrees of freedom Found by dividing the variation by the degrees of freedom Variance = Variation / df Variance = Variation / dfVariationdfVariationdf Abbreviated as MS for Mean of the Squares Abbreviated as MS for Mean of the Squares MS = SS / df MS = SS / dfSSdfSSdf

8 F F is the F test statistic F is the F test statistic There will be an F test statistic for each source except for the error and total There will be an F test statistic for each source except for the error and total F is the ratio of two sample variances F is the ratio of two sample variances The MS column contains variances The MS column contains variances The F test statistic for each source is the MS for that row divided by the MS of the error row The F test statistic for each source is the MS for that row divided by the MS of the error row

9 F F requires a pair of degrees of freedom, one for the numerator and one for the denominator F requires a pair of degrees of freedom, one for the numerator and one for the denominator The numerator df is the df for the source The numerator df is the df for the source The denominator df is the df for the error row The denominator df is the df for the error row F is always a right tail test F is always a right tail test

10 The ANOVA Table The ANOVA table is composed of rows, each row represents one source of variation The ANOVA table is composed of rows, each row represents one source of variation For each source of variation … For each source of variation … The variation is in the SS column The variation is in the SS column The degrees of freedom is in the df column The degrees of freedom is in the df column The variance is in the MS column The variance is in the MS column The MS value is found by dividing the SS by the df The MS value is found by dividing the SS by the df

11 ANOVA Table The complete ANOVA table can be generated by most statistical packages and spreadsheets The complete ANOVA table can be generated by most statistical packages and spreadsheets We’ll concentrate on understanding how the table works rather than the formulas for the variations We’ll concentrate on understanding how the table works rather than the formulas for the variations

12 The ANOVA Table SourceSS(variation)dfMS(variance)F Explained* Error Total The explained* variation has different names depending on the particular type of ANOVA problem

13 Example 1 SourceSSdfMSF Explained18.93 Error72.016 Total The Sum of Squares and Degrees of Freedom are given. Complete the table.

14 Example 1 – Find Totals SourceSSdfMSF Explained18.93 Error72.016 Total90.919 Add the SS and df columns to get the totals.

15 Example 1 – Find MS SourceSSdfMSF Explained18.9 ÷ 3÷ 3÷ 3÷ 3 = 6.30 Error72.0 ÷ 16 = 4.50 Total90.9 ÷ 19 = 4.78 Divide SS by df to get MS.

16 Example 1 – Find F SourceSSdfMSF Explained18.936.301.40 Error72.0164.50 Total90.9194.78 F = 6.30 / 4.50 = 1.4

17 Notes about the ANOVA The MS(Total) isn’t actually part of the ANOVA table, but it represents the sample variance of the response variable, so it’s useful to find The MS(Total) isn’t actually part of the ANOVA table, but it represents the sample variance of the response variable, so it’s useful to find The total df is one less than the sample size The total df is one less than the sample size You would either need to find a Critical F value or the p-value to finish the hypothesis test You would either need to find a Critical F value or the p-value to finish the hypothesis test

18 Example 2 SourceSSdfMSF Explained106.621.322.60 Error26 Total Complete the table

19 Example 2 – Step 1 SourceSSdfMSF Explained106.6521.322.60 Error268.20 Total SS / df = MS, so 106.6 / df = 21.32. Solving for df gives df = 5. F = MS(Source) / MS(Error), so 2.60 = 21.32 / MS. Solving gives MS = 8.20.

20 Example 2 – Step 2 SourceSSdfMSF Explained106.6521.322.60 Error213.2268.20 Total31 SS / df = MS, so SS / 26 = 8.20. Solving for SS gives SS = 213.2. The total df is the sum of the other df, so 5 + 26 = 31.

21 Example 2 – Step 3 SourceSSdfMSF Explained106.6521.322.60 Error213.2268.20 Total319.831 Find the total SS by adding the 106.6 + 213.2 = 319.8

22 Example 2 – Step 4 SourceSSdfMSF Explained106.6521.322.60 Error213.2268.20 Total319.83110.32 Find the MS(Total) by dividing SS by df. 319.8 / 31 = 10.32

23 Example 2 – Notes Since there are 31 df, the sample size was 32 Since there are 31 df, the sample size was 32 Since the sample variance was 10.32 and the standard deviation is the square root of the variance, the sample standard deviation is 3.21 Since the sample variance was 10.32 and the standard deviation is the square root of the variance, the sample standard deviation is 3.21

24 Example 3 SourceSSdfMSF Explained56.7 Error1413.50 Total The sample size is n = 20. Work this one out on your own!

25 Example 3 - Solution SourceSSdfMSF Explained56.7511.340.84 Error189.01413.50 Total245.71912.93 How did you do?


Download ppt "Analysis of Variance Introduction The Analysis of Variance is abbreviated as ANOVA The Analysis of Variance is abbreviated as ANOVA Used for hypothesis."

Similar presentations


Ads by Google