Download presentation
Presentation is loading. Please wait.
Published byAmos Eaton Modified over 9 years ago
1
Sequential sums of squares … or … extra sums of squares
2
Sequential sums of squares: what are they? The reduction in the error sum of squares when one or more predictor variables are added to the regression model. Or, the increase in the regression sum of squares when one or more predictor variables are added to the regression model.
3
Sequential sums of squares: why? They can be used to test whether one slope parameter is 0. They can be used to test whether a subset (more than two, but less than all) of the slope parameters are 0.
4
Example: Brain and body size predictive of intelligence? Sample of n = 38 college students Response (Y): intelligence based on the PIQ (performance) scores from the (revised) Wechsler Adult Intelligence Scale. Predictor (X 1 ): Brain size based on MRI scans (given as count/10,000) Predictor (X 2 ): Height in inches Predictor (X 3 ): Weight in pounds
5
OUTPUT #1 The regression equation is PIQ = 4.7 + 1.18 MRI Predictor Coef SE Coef T P Constant 4.65 43.71 0.11 0.916 MRI 1.1766 0.4806 2.45 0.019 Analysis of Variance Source DF SS MS F P Regression 1 2697.1 2697.1 5.99 0.019 Error 36 16197.5 449.9 Total 37 18894.6
6
OUTPUT #2 The regression equation is PIQ = 111 + 2.06 MRI - 2.73 Height Predictor Coef SE Coef T P Constant 111.28 55.87 1.99 0.054 MRI 2.0606 0.5466 3.77 0.001 Height -2.7299 0.9932 -2.75 0.009 Analysis of Variance Source DF SS MS F P Regression 2 5572.7 2786.4 7.32 0.002 Residual 35 13321.8 380.6 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6
7
OUTPUT #3 The regression equation is PIQ = 111 + 2.06 MRI - 2.73 Height + 0.001 Weight Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 MRI 2.0604 0.5634 3.66 0.001 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6 Weight 1 0.0
8
Sequential sums of squares: definition using SSE notation SSR(X 2 |X 1 ) = SSE(X 1 ) - SSE(X 1,X 2 ) In general, you subtract the error sum of squares due to all of the predictors both left and right of the bar from the error sum of squares due to the predictor to the right of the bar. SSR(X 2,X 3 |X 1 ) = SSE(X 1 ) - SSE(X 1,X 2,X 3 )
9
Sequential sums of squares: definition using SSR notation SSR(X 2 |X 1 ) = SSR(X 1,X 2 ) – SSR(X 1 ) In general, you subtract the regression sum of squares due to the predictor to the right of the bar from the regression sum of squares due to all of the predictors both left and right of the bar. SSR(X 2,X 3 |X 1 ) = SSR(X 1,X 2,X 3 )-SSR(X 1 )
10
Decomposition of regression sum of squares In multiple regression, there is more than one way to decompose the regression sum of squares. For example:
11
OUTPUT #2 The regression equation is PIQ = 111 + 2.06 MRI - 2.73 Height Predictor Coef SE Coef T P Constant 111.28 55.87 1.99 0.054 MRI 2.0606 0.5466 3.77 0.001 Height -2.7299 0.9932 -2.75 0.009 Analysis of Variance Source DF SS MS F P Regression 2 5572.7 2786.4 7.32 0.002 Residual 35 13321.8 380.6 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6
12
OUTPUT #4 The regression equation is PIQ = 111 - 2.73 Height + 2.06 MRI Predictor Coef SE Coef T P Constant 111.28 55.87 1.99 0.054 Height -2.7299 0.9932 -2.75 0.009 MRI 2.0606 0.5466 3.77 0.00 Analysis of Variance Source DF SS MS F P Regression 2 5572.7 2786.4 7.32 0.002 Error 35 13321.8 380.6 Total 37 18894.6 Source DF Seq SS Height 1 164.0 MRI 1 5408.8
13
Decomposition of SSR: how?
15
Even more ways to decompose SSR when 3 or more predictors
16
Degrees of freedom and regression mean squares A sequential sum of squares involving one extra predictor variable has one degree of freedom associated with it: A sequential sum of squares involving two extra predictor variables has two degrees of freedom associated with it:
17
Sequential sums of squares in Minitab The SSR is automatically decomposed into one-degree-of-freedom sequential sums of squares, in the order in which the predictor variables are entered into the model. To get sequential sum of squares involving two or more predictor variables, sum the appropriate one-degree-of-freedom sequential sums of squares.
18
OUTPUT #3 The regression equation is PIQ = 111 + 2.06 MRI - 2.73 Height + 0.001 Weight Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 MRI 2.0604 0.5634 3.66 0.001 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6 Weight 1 0.0
19
OUTPUT #5 The regression equation is PIQ = 111 - 2.73 Height + 0.001 Weight + 2.06 MRI Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 MRI 2.0604 0.5634 3.66 0.001 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS Height 1 164.0 Weight 1 169.5 MRI 1 5239.2
20
Testing one slope β 1 = β MRI is 0 Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 MRI 2.0604 0.5634 3.66 0.001 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS Height 1 164.0 Weight 1 169.5 MRI 1 5239.2
21
Testing one slope β 2 = β HT is 0 Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 MRI 2.0604 0.5634 3.66 0.001 Weight 0.0006 0.1971 0.00 0.998 Height -2.732 1.229 -2.22 0.033 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Weight 1 940.9 Height 1 1934.7
22
Testing one slope β 3 = β WT is 0 Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 MRI 2.0604 0.5634 3.66 0.001 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6 Weight 1 0.0
23
Testing one slope β k is 0: why it works? Full model:Reduced model:
24
Testing one slope β k is 0: why it works? (cont’d) The general linear test statistic: becomes:
25
Testing whether β 2 = β 3 = 0 Full model:Reduced model:
26
Testing whether β 2 = β 3 = 0 (cont’d) The general linear test statistic: becomes:
27
OUTPUT #3 The regression equation is PIQ = 111 + 2.06 MRI - 2.73 Height + 0.001 Weight Predictor Coef SE Coef T P Constant 111.35 62.97 1.77 0.086 MRI 2.0604 0.5634 3.66 0.001 Height -2.732 1.229 -2.22 0.033 Weight 0.0006 0.1971 0.00 0.998 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS MRI 1 2697.1 Height 1 2875.6 Weight 1 0.0
28
Cumulative Distribution Function F distribution with 2 DF in numerator and 34 DF in denominator x P( X <= x ) 3.6700 0.9640 P-value is:
29
Getting P-value for F-statistic in Minitab Select Calc >> Probability Distributions >> F… Select Cumulative Probability. Use default noncentrality parameter of 0. Type in numerator DF and denominator DF. Select Input constant. Type in F-statistic. Answer appears in session window. P-value is 1 minus the number that appears.
30
Test whether β 1 = β 3 = 0 Analysis of Variance Source DF SS MS F P Regression 3 5572.7 1857.6 4.74 0.007 Error 34 13321.8 391.8 Total 37 18894.6 Source DF Seq SS Height 1 164.0 Weight 1 169.5 MRI 1 5239.2
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.