INFERENTIAL STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON
STRUCTURE OF THE CHAPTER Measures of difference between groups The t-test (a difference test for parametric data) Analysis of variance (a difference test for parametric data) The chi-square test (a difference test and a test of goodness of fit for non-parametric data) Degrees of freedom (a statistic that is used in calculating statistical significance in considering difference tests) The Mann-Whitney and Wilcoxon tests (difference tests for non-parametric data)
STRUCTURE OF THE CHAPTER The Kruskal-Wallis and Friedman tests (difference tests for non-parametric data) Regression analysis (prediction tests for parametric data) Simple linear regression (predicting the value of one variable from the known value of another variable) Multiple regression (calculating the different weightings of independent variables on a dependent variable) Standardized scores (used in calculating regressions and comparing sets of data with different means and standard deviations)
REGRESSION Regression is a statistical technique of modelling the relationship between variables. From knowing the values of one variable we can predict the values of another variable
SIMPLE LINEAR REGRESSION Simple linear regression – the model includes one explanatory variable (independent) and one explained variable (dependent) The relationship between examinations and stress
A SIMPLE REGRESSION
A SIMPLE REGRESSION (SPSS)
A SIMPLE REGRESSION The beta weighting (ß) is ‘the amount of standard deviation unit of change in the dependent variable for each standard deviation unit of change in the independent variable’. Here the standardized beta weighting is .966, i.e. it is highly statistically significant (=0.000 in the ‘Sig.’ column); this means that for every standard deviation unit change in the independent variable (‘hours per week on private study’) there is .966 of a unit rise in the dependent variable (‘score on final university examination’) i.e. there is nearly a one-to-one correspondence.
MULTIPLE LINEAR REGRESSION The model is a linear equation with at least two explanatory variables (independent) and one explained variable (dependent)
USING MULTIPLE REGRESSION Multiple regression is useful in calculating the relative weighting of two or more independent variables on a dependent variable. Using the beta () weighting, multiple regression calculates how many standard deviation units are changed in the dependent variable for each standard deviation unit of change in each of the independent variables. For example, let us say that we wished to investigate the relative weighting of ‘hours per week of private study’ and ‘motivation level’ as independent variables acting on the dependent variable ‘score on final university examination’.
Final examination score Hours of study per week Final examination score Level of motivation
USING MULTIPLE REGRESSION (SPSS) The Adjusted R square is .938, i.e. the amount of the dependent variable explained by the two independent variables is very high (93.8%).
USING MULTIPLE REGRESSION The analysis of variance (ANOVA) is highly statistically significant (=.000), i.e. the relationship between the independent variable and the dependent variable is very strong.
USING MULTIPLE REGRESSION (SPSS)
USING MULTIPLE REGRESSION The independent variable ‘hours per week of private study’ has the strongest positive predictive power (=.904) on the dependent variable ‘score on final university examination’, and this is statistically significant (the column ‘Sig.’ indicates that the level of significance, at .000, is stronger than .001); The independent variable ‘motivation level’ has strong positive predictive power (=.104) on the dependent variable ‘score on final university examination’, and this is statistically significant (the column ‘Sig.’ indicates that the level of significance, at .001); Though both independent variables have a statistically significant weighting on the dependent variable, the beta weighting of the independent variable ‘hours per week of private study’ (=.904) is much higher than that of the independent variable ‘motivation level’ (=.104) on the dependent variable ‘score on final university examination’, i.e. ‘hours per week on private study’ is a stronger predictor of ‘score on final university examination’ than ‘motivation level’.
USING MULTIPLE REGRESSION The researcher can predict that, if the hours per week spent in private study were known, and if the motivation level of the student was known, then the likely score on the final university examination could be predicted. The formula would be: ‘Score on final university examination’ = ( x ‘hours per week on private study’) + ( x ‘motivation level’) In the example, the for ‘hours per week on private study’ is 0.904, and the for ‘motivation level’ is 0.104. These are the relative weightings of the two independent variables. So, for example, for a student who spends 60 hours per week on private study and has a high motivation level (9) the formula becomes: ‘Score on final university examination’ = (0.904 x 60) + (0.104 x 9) = 54.24 + 0.936 = 55.176
For example, if the beta weighting were different, and for different factors, the relationship between examination mark, study time and intelligence could be: Examination mark = Study time + Intelligence Examination mark = 0.65 Study time + 0.30 Intelligence A student with an intelligence score of 110 who studies for 30 hours per week will obtain the following examination mark: Examination mark = (0.65 x 30) + (0.30 x 110) = 19.5 + 33 = 52.5 If the same student studies for 40 hours then: Examination mark = (0.65 x 40) + (0.30) x 110 = 26 + 33 = 59
STRESS IN TEACHING (SPSS): BETA WEIGHTINGS OF VARIABLES
STRESS IN TEACHING (SPSS) (removing variables affects beta weightings)
COLLINEARITY DIAGNOSTICS (MULTI-COLLINEARITY) Multicollinearity: The correlation between each independent variable should not be too high. Collinearity diagnostics indicates the level of correlation. If the collinearity is too high between two variables then it may be advisable to remove one. Multi-collinearity is tested by (a) Tolerance and (b) the Variance Inflation Factor.
COLLINEARITY DIAGNOSTICS (MULTI-COLLINEARITY) Tolerance: ‘An indicator of how much of the variability of the specified independent is not explained by the other independent variables in the model . . . . If this value is very small (less than .10), it indicates that the multiple correlation with other variables is high, suggesting the possibility of collinearity’ (Pallant, 2007: 156). ‘The VIF (Variance Inflation Factor), which is just the inverse of the Tolerance Factor (1 divided by Tolerance). VIF values above 10 would be a concern here, indicating multicollinearity’ (Pallant, 2007: 156).
COLLINEARITY DIAGNOSTICS (SPSS) (MULTI-COLLINEARITY) Coefficientsa Model Unstandardized Coefficients Standardized Coefficients t Sig. Collinearity Statistics B Std. Error Beta Tolerance VIF 1 (Constant) 22.577 1.366 16.531 .000 Hours per week on private study .714 .024 .904 29.286 .655 1.528 Motivation level .404 .119 .104 3.385 .001 a. Dependent Variable: Score on final university examination
COLLINEARITY DIAGNOSTICS (MULTI-COLLINEARITY) In the example: Hours per week on private study: Tolerance = .655 VIF = 1.528 Motivation level Tolerance = .655 There is no problem with multicollinearity.
SOME ASSUMPTIONS IN REGRESSION Random sampling; Ratio data; The removal of outliers: check outliers by calculating the Mahalanobis distance (in SPSS) ; The supposed linearity of the measures is justifiable; Interaction effects of independent variables (in non-recursive models) are measured; The selection for the inclusion and exclusion of variables is justifiable; The dependent variable and the residuals (the distance of the cases from the line of best fit) is approximately normally distributed;
SOME ASSUMPTIONS IN REGRESSION The variance of each variable is consistent across the range of values for all other variables (or at least the next assumption is true); The independent variables are approximately normally distributed, the variation is even across the levels/values of the variable (homoscedasticity). Collinearity/multicollinearity is avoided. Regressions are only as robust as the variables included, and the inclusion or removal of one or more independent variables affects their relative weightings on the dependent variable.
STEPWISE REGRESSION To find a model with predictive accuracy, working with a limited number of independent variables from a longer list of independent variables, to determine which ones have a statistically significant influence on the dependent variables. Stepwise multiple regression enters variables one at a time, in a sequence, to see which adds to the explanatory power of a model, by looking at its impact on the R-squared – whether it increases the R-square value.
STEPWISE REGRESSION Stepwise multiple regression enables the researcher to see which variables have predictive power and which do not, which to include and which to exclude in an explanatory model.
LOGISTIC REGRESSION To enable the researcher to work with categorical variables in a multiple regression where the dependent variable is a categorical variable. The independent variables may be categorical, discrete or continuous.
PROCEDURE FOR MULTIPLE REGRESSION Analyze Regression Linear. Send over dependent variable to Dependent box. Send over independent variables to Independent box. Click on Statistics. Tick the boxes Estimates, Confidence Intervals, Model fit, Descriptives, Part and partial correlations, Collinearity diagnostics, Casewise diagnostics and Outliers outside 3 standard deviations. Click Continue. Click on Options. Click on Exclude cases pairwise. Click Continue. Click on Plots. Send over *ZRESID to the Y box. Send over *ZPRED to the X box. Click on Normal probability plots. Click Continue. Click on Save. Click the Mahalanobis box and the Cook’s box. Click Continue. Click OK.
EXAMINING MULTIPLE REGRESSION SPSS OUTPUT Check collinearity statistics: Tolerance must be higher than .10; VIF (Variance Inflation Factor) must not be higher than 10. Check normality, linearity and homoscedasticity: Normality Probability Plot (Normal P-P Plot) to have points going in a straight diagonal line, bottom left to top right; Scatterplot to be a rectangle with scores concentrated in the centre (along the 0 point), avoiding curvilinear or uneven distribution. Check that the Cook’s Distance maximum value is below 1 and that the Mahal. Distance is lower than the critical value.
EXAMINING MULTIPLE REGRESSION SPSS OUTPUT Check the Adjusted R Square. Check ANOVA and its significance level. Check the Standardized Beta Coefficients and their significance levels. Square each Parts correlation coefficient to see the contribution of each variable to the total Adjusted R Square (i.e. how much of the total variance in the dependent variable is explained by each independent variable).
COLLINEARITY DIAGNOSTICS (MULTI-COLLINEARITY) Analyze Regression Linear Statistics Click ‘Collinearity diagnostics’ Click ‘Continue’ Continue with the multiple regression.
PROCEDURES FOR STEPWISE MULTIPLE REGRESSION IN SPSS ‘Analyze’ ‘Regression ‘ ‘Linear ‘ Enter dependent and independent variables In the ‘Method’ box, change ‘Enter’ to ‘Stepwise’ Click ‘OK’.
PROCEDURES FOR LOGISTIC REGRESSION IN SPSS Analyze Regression Binary Logistic Insert dependent variable in the ‘Dependent’ box Insert independent variables into the ‘Covariates’ box Click on ‘Categorical’ Move first categorical variable into the ‘Categorical Covariates’ box Click the radio button ‘First’ Click the ‘Change’ button Repeat this for every categorical variable Click ‘Continue’ to return to the first screen Click ‘Options’ Click the boxes ‘Classification plots’, Hosmer-Lemeshow goodness of fit’, ‘Casewise listing of residuals’ and ‘CI for Exp(B) Click ‘Continue’ to return to first screen Click ‘OK’.
THE NEED FOR A STANDARDIZED SCORE A child tells his parents that he scored a mark of 75 for a maths test; his parents scold him. A child tells his parents that he scored a mark of 2 for a history test; his parents praise him. A child tells his parents that he scored a mark of 25 for an English test and a mark of 60 for a Physics test; his parents praise him for both. A child tells his parents that he scored a mark of 80 for a Geography test and a mark of 120 for a Chemistry test; his parents scold him for both.
THE NEED FOR A STANDARDIZED SCORE We need to know how to judge whether a mark is high or low. We need to be able to compare marks between one test and another. Therefore we need to know the scale of the marks, the range of the marks, the mean of the marks, and the distribution of the marks either side of the mean.
THE NEED FOR A STANDARDIZED SCORE We need to know how to compare marks from a test which: uses one scale with marks from a test which uses another scale; has one range of marks with marks from a test that has another range of marks; has a mean which is different from the mean of another test; Has a distribution around the mean which is different from the distribution of another test.
THE Z-SCORE (STANDARDIZED SCORE) Standardizing scores lets us judge whether a mark is high or low. Standardizing scores lets us compare marks between one test and another when two different tests have different scales, range, means and distributions around the mean.
Z-SCORES z-scores have the same mean and standard deviation, even though the original sets of scores had different means and standard deviations, i.e. z-scores let you compare fairly. A z-score tells us how many standard deviations someone’s scores are above or below the mean.
Z-SCORES To calculate the z-score subtract the mean from the raw score and divide that answer by the standard deviation. For example if the raw score is 15, the mean is 10, and the standard deviation is 4, then 15-10 = 5 and 5 4 = 1.25. Here z-score tells us that the person’s score is 1.25 standard deviations above the mean. Is that score good or bad? How good or bad is it?
Two standard deviations either side of the mean accounts for 95 Two standard deviations either side of the mean accounts for 95.4% of the population. One standard deviation either side of the mean accounts for 68.3% of the population. The mean
STANDARDIZED SCORE (Z-SCORE) A z-score of +1.4 indicates that someone is 1.4 standard deviations above the mean. A z-score of -1.4 indicates that someone is 1.4 standard deviations below the mean. If the z-score is positive, it indicates that the value is above the mean. If the z-score is negative, it means that the value is below the mean. Is that z-score good or bad? How good or bad is it? We need to know about the probability of a certain value falling into a certain range of value.
The normal curve lets us interpret the probability of a score falling into a certain range of scores/values. 68% of the population lie between -1 and +1 standard deviations 95% of the population lie between -2 and +2 standard deviations 99% of the population lie between -1 and +1 standard deviations
The normal curve lets us interpret the probability of a score falling into a certain range of scores/values.
Let us say that, for 1200 people: The mean = 35 The standard deviation = 13 If a member of the group says he is 61 years old then it is clear that this person is much older than the average. But how much older? To be exact, we can convert his score into a z-score.
This tells us that 61 is 2 standard deviations above the mean. Refer to the ‘areas under the standard normal curve’ table (in statistics textbooks), for z = 2, the ‘area under curve beyond one point’ is 0.023. The proportion of people that are 61 years of age or more is only 2.3% of the total.
For example if z-score is 1.25. Is that score good or bad? How good or bad is it? Refer to the ‘areas under the standard normal curve’ table (in statistics textbooks), for z = ±1.25, the ‘area under curve beyond one point’ is 0.1056. The proportion of people who score 1.25 or more is only 10.56% of the total. So, the score is very good.
An online calculator of area under the curve for standardized scores is at: www.danielsoper.com/statcalc/calc02.aspx This calculator gives the cumulative area under the curve (a figure as a decimal fraction that is less than 1 (let us call it X). To find the area under the curve beyond that one point simply subtract this figure from 1 (the formula, then is 1-X).
CALCULATING Z-SCORES WITH SPSS Click on ‘Analyze’ ‘Descriptive Statistics’ Descriptives’. Send over the variables to ‘Variables’ ‘Click the box ‘Save standardized values as variables’ Click ‘OK’ Two new variables will be created.
T-SCORES Some people are uncomfortable with z-scores, as they don’t like negative scores and they do not like an average being 0. To overcome this, z-scores can be converted to T-scores. To convert a z-score to a T-score, multiply the z-score by 10 and add 50 to the result. For example a z-score of .5, multiplied by 10 gives 5, and then, with 50 added, gives 55. The T-score is 55. Many IQ tests and standardized tests convert z-scores. For example a common conversion in IQ tests is to multiply the z-score by 15 and add 100. So a z-score on an IQ test might be .5, multiplied by 15 gives 7.5, with 100 added gives 107.5, i.e. the IQ z-score converts to a T-score of 107.5.
THE MCCALL T-SCORE The McCall T-score has a mean of 50 and a standard deviation of 10: McCall T-score = 50±(z-score x 10) For the ± sign, the part in brackets should be added to the 50 if the z-score is positive (i.e. if the raw score is above the mean) and subtracted if the z-score is negative (i.e. if the raw score is below the mean).