Presentation is loading. Please wait.

Presentation is loading. Please wait.

3.3. SIMPLE LINEAR REGRESSION: DUMMY VARIABLES 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.

Similar presentations


Presentation on theme: "3.3. SIMPLE LINEAR REGRESSION: DUMMY VARIABLES 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves."— Presentation transcript:

1 3.3. SIMPLE LINEAR REGRESSION: DUMMY VARIABLES 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves

2 1. Introduction -Qualitative independent variable. -Quantitative dependent variable. -Two independent groups: simple linear regression. -More than two independent groups: multiple linear regression. -Codification dummy – Fictious variable: -0 = absence; 1 = presence -Example: 1 = woman; 0 = man (absence of being woman). 2

3 3 1. Introduction Linearity assumption is always accepted. The other assumptions must be tested.

4 4 2. Parameters estimation Regression equation: Formulas to calculate a and b previously used are still useful. In addition:

5 5 Example. Those are the results obtained by women (X=0) and men (X=1) in a linguistic ability test. XY 07 06 07 08 09 13 15 14 15 13 2. Parameters estimation

6 6 1.Calculate the regression equation in raw scores using the new formulas. 2. Interpret the parameters. 3. Calculate the regression equation in raw scores using the classic formulas. 4. Calculate the regression equation supposing that women were codified with 1 and men with 0. 2. Parameters estimation

7 7 1. Calculate the regression equation in raw scores using the new formulas. 2. Parameters estimation

8 8 2. Interpret the parameters. -a is the mean value of Y when X = 0. 7.4 is the mean result obtained in the linguistic ability test in women group. -b is the change in the mean value of Y in the group X = 1 compared to the group X = 0. The mean result is 3.4 lower in men than in women (concretely, the mean result in men is 4). 2. Parameters estimation

9 9 3. Calculate the regression equation in raw scores using the classic formulas. XYxyx*yx2x2 y2y2 07 -0,51,3-0,650,251,69 06 -0,50,3-0,150,250,09 07 -0,51,3-0,650,251,69 08 -0,52,3-1,150,255,29 09 -0,53,3-1,650,2510,89 13 0,5-2,7-1,350,257,29 15 0,5-0,7-0,350,250,49 14 0,5-1,7-0,850,252,89 15 0,5-0,7-0,350,250,49 13 0,5-2,7-1,350,257,29 557Σ-8,52,538,1 2. Parameters estimation

10 10 3. Calculate the regression equation in raw scores using the classic formulas. 2. Parameters estimation

11 11 4. Calculate the regression equation supposing that women were codified with 1 and men with 0. XY 17 16 17 18 19 03 05 04 05 03 2. Parameters estimation

12 12 4. Calculate the regression equation supposing that women were codified with 1 and men with 0. 2. Parameters estimation

13 13 3. Goodness of fit p = ratio of n 1 related to the whole sample. q = ratio of n 0 n 1 related to the whole sample. p + q = 1

14 14 3. Goodness of fit Example. Using the same data, calculate the goodness of fit using the two formulas proposed, and interpret the results obtained.

15 15 3. Goodness of fit Ratio of total variability of Y explained by X. 75.8% of variability of linguistic ability is explained by sex.

16 16 4. Model validation -  Null hypothesis is rejected. The variables are related. The model is valid. –  Null hypothesis is accepted. The variables are not related. The model is not valid. (k = number of independent variables)

17 17 4. Model validation Example. Using the same data, conclude about the model validation.

18 18 4. Model validation

19 19 4. Model validation 19  Conclusión: Null hypothesis is rejected. The variables are related. The model is valid. There is a statistically significant relationship between sex and linguistic ability.

20 4. Model validation Which is the final conclusion? Significant effect Non-significant effect High effect size ( ≥ 0.67) The effect probably exists The non- significance can be due to low statistical power Low effect size ( ≤ 0.18) The statistical significance can be due to an excessive high statistical power The effect probably does not exist 20

21 4. Model validation Which is the final conclusion? Significant effect Non-significant effect High effect size R 2 = 0.759 ( ≥ 0.67) The effect probably exists The non- significance can be due to low statistical power Low effect size ( ≤ 0.18) The statistical significance can be due to an excessive high statistical power The effect probably does not exist 21

22 22 5. Significance Of Pearson correlation coefficient:

23 23 5. Significance Of the slope:

24 24 5. Significance Of samples contrast:

25 25 5. Significance –  Null hypothesis is rejected. The model is valid. The slope is statistically different from 0. There is, therefore, relationship between variables. –  Null hypothesis is accepted. The model is not valid. The slope is statistically equal to 0. There is not, therefore, relationship between variables. 25

26 26 5. Significance Example. Calculate t using the different formulas of t and conclude about the significance.

27 27 5. Significance

28 28 5. Significance XYxyx*yx2x2 y2y2 Y2Y2 X2X2 07 -0,51,3-0,650,251,69490 06 -0,50,3-0,150,250,09360 07 -0,51,3-0,650,251,69490 08 -0,52,3-1,150,255,29640 09 -0,53,3-1,650,2510,89810 13 0,5-2,7-1,350,257,2991 15 0,5-0,7-0,350,250,49251 14 0,5-1,7-0,850,252,89161 15 0,5-0,7-0,350,250,49251 13 0,5-2,7-1,350,257,2991 557∑-8,52,538,13635

29 29 5. Significance

30 30 5. Significance 30 Conclussion: Null hypothesis is rejected. The model is valid. There is statistically significant relationship between sex and linguistic ability.


Download ppt "3.3. SIMPLE LINEAR REGRESSION: DUMMY VARIABLES 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves."

Similar presentations


Ads by Google