1 Psych 5510/6510 Chapter 13: ANCOVA: Models with Continuous and Categorical Predictors Part 3: Within a Correlational Design Spring, 2009.

Slides:



Advertisements
Similar presentations
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.2.
Advertisements

Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
1 Psych 5510/6510 Chapter Eight--Multiple Regression: Models with Multiple Continuous Predictors Part 3: Testing the Addition of Several Parameters at.
Understanding the General Linear Model
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Chapter 11 One-way ANOVA: Models with a Single Categorical Predictor
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Interactions in Regression.
1 Psych 5510/6510 Chapter Eight--Multiple Regression: Models with Multiple Continuous Predictors Part 2: Testing the Addition of One Parameter at a Time.
Simple Regression correlation vs. prediction research prediction and relationship strength interpreting regression formulas –quantitative vs. binary predictor.
Lecture 6: Multiple Regression
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Elaboration Elaboration extends our knowledge about an association to see if it continues or changes under different situations, that is, when you introduce.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Simple Linear Regression Analysis
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
Relationships Among Variables
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Chapter 8: Bivariate Regression and Correlation
Objectives of Multiple Regression
1 Psych 5500/6500 Confounding Variables Fall, 2008.
Chapter 13: Inference in Regression
Correlation and Correlational Research Slides Prepared by Alison L. O’Malley Passer Chapter 5.
Moderation & Mediation
Chapter 18 Four Multivariate Techniques Angela Gillis & Winston Jackson Nursing Research: Methods & Interpretation.
Regression Analyses II Mediation & Moderation. Review of Regression Multiple IVs but single DV Y’ = a+b1X1 + b2X2 + b3X3...bkXk Where k is the number.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
1 Psych 5510/6510 Chapter 12 Factorial ANOVA: Models with Multiple Categorical Predictors and Product Terms. Spring, 2009.
1 Psych 5510/6510 Chapter 10. Interactions and Polynomial Regression: Models with Products of Continuous Predictors Spring, 2009.
Correlational Research Chapter Fifteen Bring Schraw et al.
Does this point lie on this line? The Point-Slope format (y – y 1 ) = m(x – x 1 )
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Multiple Regression. Multiple Regression  Usually several variables influence the dependent variable  Example: income is influenced by years of education.
1 Psych 5500/6500 t Test for Dependent Groups (aka ‘Paired Samples’ Design) Fall, 2008.
Chapter 13 Multiple Regression
Stat 13, Tue 5/29/ Drawing the reg. line. 2. Making predictions. 3. Interpreting b and r. 4. RMS residual. 5. r Residual plots. Final exam.
1 Psych 5510/6510 Chapter 14 Repeated Measures ANOVA: Models with Nonindependent ERRORs Part 3: Factorial Designs Spring, 2009.
1 Psych 5510/6510 Chapter Eight--Multiple Regression: Models with Multiple Continuous Predictors Part 1: Testing the Overall Model Spring, 2009.
1 Psych 5510/6510 Chapter 13 ANCOVA: Models with Continuous and Categorical Predictors Part 2: Controlling for Confounding Variables Spring, 2009.
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
1 Psych 5510/6510 Chapter 14 Repeated Measures ANOVA: Models with Nonindependent Errors Part 1 (Crossed Designs) Spring, 2009.
FINDING A POLYNOMIAL PASSING THROUGH A POINT. Review: the Linear Factorization Theorem If where n > 1 and a n ≠ 0 then Where c 1, c 2, … c n are complex.
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
Outline of Today’s Discussion 1.Regression Analysis: Introduction 2.An Alternate Formula For Regression 3.Correlation, Regression, and Statistical Significance.
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Single-Subject and Correlational Research Bring Schraw et al.
Linear Regression Chapter 7. Slide 2 What is Regression? A way of predicting the value of one variable from another. – It is a hypothetical model of the.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
1 Psych 5510/6510 Chapter 14 Repeated Measures ANOVA: Models with Nonindependent ERRORs Part 2 (Crossed Designs) Spring, 2009.
More on regression Petter Mostad More on indicator variables If an independent variable is an indicator variable, cases where it is 1 will.
CIE Centre A-level Further Pure Maths
Data Analysis: Statistics for Item Interactions. Purpose To provide a broad overview of statistical analyses appropriate for exploring interactions and.
Multiple Regression Scott Hudson January 24, 2011.
Regression. Why Regression? Everything we’ve done in this class has been regression: When you have categorical IVs and continuous DVs, the ANOVA framework.
Topics: Multiple Regression Analysis (MRA)
Regression Analysis.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Data Analysis Module: Correlation and Regression
Learning Objectives For models with dichotomous intendant variables, you will learn: Basic terminology from ANOVA framework How to identify main effects,
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Regression.
Regression and Residual Plots
Introduction to bivariate data
Multivariate Statistics
Inferential Statistics
Chapter 15 Multiple Regression.
Presentation transcript:

1 Psych 5510/6510 Chapter 13: ANCOVA: Models with Continuous and Categorical Predictors Part 3: Within a Correlational Design Spring, 2009

2 Contexts We will be looking at three contexts in which this will be useful: 1.Within a ‘true experimental’ design, where we can use this approach to increase the power of the design and to add sophistication to our model. 2.Within a ‘quasi-experimental’ or ‘static group’ design, where we can use this approach to control a confounding variable. 3.Within a correlational design, where we can introduce a categorical variable to better understand a continuous variable.

3 Context 3: Using Categorical Variables to Better Understand Continuous Variables So far we have used a continuous variable to help our analysis of categorical variables. Now we switch to focusing on using a categorical variable to help our understanding of the relationship between a continuous variable and the dependent variable.

4 The Variables 1.Dependent Variable: support for a woman’s right to choose abortion (1-10 scale). 2.Continuous Independent Variable: political ideology (1-10 scale of conservative to liberal) 3.Categorical Independent Variable: Religious preference (Protestant, Catholic, or Jewish).

5 Right to Choose (RTC) & Political Ideology The focus of our research is on the relationship between right to choose (RTC) and political ideology (conservative to liberal...a continuous variable). Dependent variable: Y RTC = view on right to choose Independent variable: X Pol = political ideology Model C: Ŷ RTC = β 0 Model A: Ŷ RTC = β 0 + β 1 X Pol Ŷ RTC = X Pol PRE=.117 F*=52.53 p<.0001 It is worthwhile to add ideology to a model of RTC.

6 Our attention turns to the value of adding religious preference (a categorical variable) to the model. We are particularly interested in what religious preference might add to our understanding of the relationship between the variables of primary interest (RTC and ideology). We will begin by examining the relationship between religious preference and RTC. Note this example came from the text, I apologize if anyone is offended by anything about this example.

7 Religion and RTC Dependent variable: Y RTC = view on right to choose Independent variable: X Rel = religion (Catholic, Protestant, Jewish). This is a categorical variable with three values so it will take two contrast codes to code it. Comparison 1: Catholic vs. (Protestant & Jewish) Comparison 2: Protestant vs. Jewish Model C: Ŷ RTC = β 0 Model A: Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2

8 Ŷ RTC = X X 2 PRE=.17 F*=41.02 p<.0001 It is worthwhile to use religion (coded by X1 and X2) in a model of RTC. To answer more specific questions: For b1=-.84, p<.0001 Can conclude Catholics differ from (Jews and Protestants combined) in RTC. For b2=.23, p=.51 Unable to conclude Jews differ from Protestants in RTC. Even though this comparison is not significant, we will continue to include it in later analyses to see what crops up as we work with the model (e.g. to see if it interacts with our other variable).

9 What We Know So Far We are modeling people’s attitudes on ‘right to choose’, we have established that: 1. Ideology predicts RTC. 2. Religion predicts RTC. We know that Catholics differ from the group of (Protestants and Jews combined) in RTC We were unable to show that Protestants and Jews differ in RTC.

10 Ideology & Religion Both political ideology (a continuous variable ) and religion (a categorical variable) predict RTC, next let’s examine the relationship between our two predictor variables. To accomplish that we can see to what degree religion predicts ideology. By the way, why not the other way around? (think about it) Model C: Ŷ Pol = β 0 Model A: Ŷ Pol = β 0 + β 1 X 1 + β 2 X 2

11 Ideology & Religion Ŷ Pol = X X 2 PRE=.126 F*=28.72 p< Yes, there is a relationship between ideology and religion. To answer more specific questions: For b1=-.21, p=.0036 Can conclude Catholics differ from the group of (Jews and Protestants combined) in their political ideology. For b2=1.06, p=.001 Can conclude Jews differ from Protestants in their political ideology.

12 What This Tells Us We have found that our two predictor variables are related, this is interesting for at least two reasons. 1.It adds to our knowledge about the predictor variables in our model, we now know that religion and ideology are related. We also have more specific information about this relationship thanks to our contrasts. 2.It tells us that our predictor variables (religion and ideology) are somewhat redundant, which is important to know as it might influence our interpretation of significance when all the variables are in the model at once.

13 Expanding our Understanding Again our primary interest was in the relationship between attitudes on RTC and political ideology. We now know that ideology and religion are somewhat redundant. This raises the possibility that religion is a confounding variable in the relationship between ideology and RTC. If religion is influencing both RTC and ideology then perhaps that is the only reason why we are getting a relationship between RTC and ideology. At this point what we want to know is what is the relationship between RTC and ideology when religion is held constant?

14 Answering that Question We can answer that question quite easily: Model C: Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 Model A: Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 + β 3 X Pol Ŷ RTC = X X X Pol The partial correlation coefficient for X Pol =.382, this is the correlation between ideology and RTC when religion is held constant (i.e. the correlation between ideology and RTC for people who are of the same religion). PRE=.382²=.146 F*=67.85 p< It is worthwhile to add political ideology to a model that already contains the person’s religion.

15 Looking at the b’s Ŷ RTC = X X X Pol b1=-.73, p<.001 b2=-.42, p=.025 b3=.52, p<.001 Catholics vs. (Jews and Protestants combined), Jews vs. Protestants, and political ideology, are all worthwhile to add last to a model that contains the other variables.

16 Continuing to Expand Understanding Because religion is a categorical variable, it is easy to examine the relationship between ideology and RTC within each religion. This will shed quite a bit more light on the relationship of primary interest (ideology and RTC).

17 Protestant Respondents Ŷ RTC = X X X Pol Being Protestant is coded by a score of -1 on X1, and -1 on X2 (see slide 7). Plugging in those values: Ŷ RTC = (-1) -.42(-1) +.52X Pol Ŷ RTC = X Pol This is the relationship between RTC and ideology for Protestant respondents.

18 Catholic Respondents Catholic respondents have a score of 2 on X1, and 0 on X2. Plugging in those values: Ŷ RTC = X X X Pol Ŷ RTC = (2) -.42(0) +.52X Pol Ŷ RTC = X Pol This is the relationship between right to choose and political ideology for Catholic respondents.

19 Jewish Respondents Jewish respondents have a score of -1 on X1, and 1 on X2. Plugging in those values: Ŷ RTC = X X X Pol Ŷ RTC = (-1) -.42(1) +.52X Pol Ŷ RTC = X Pol This is the relationship between right to choose and political ideology for Jewish respondents.

20 The Relationship Within Each Religion Putting these slides together, the relationships between political ideology and right to choose within each of the three religions are: –Jews: Ŷ RTC = X Pol –Catholic: Ŷ RTC = X Pol –Protestant: Ŷ RTC = X Pol

21 Political Ideology RTC

22 Additive Model The regression equations and the previous graph show the same slope for all three religions, this is because our model so far is an additive model (it has no interaction terms): Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 + β 3 X Pol

23 Interactions By not having any interaction terms in our model we have precluded having the slopes change as we move from one religious preference to another. Let’s add some interaction terms to see if it is worthwhile to include them in the model.

24 Testing for Interactions We will introduce two new variables to represent the interaction between political ideology and the two contrasts that code religion. Interaction 1: β 4 X 1 X Pol Interaction 2: β 5 X 2 X Pol Now we regress Y on all six predictors. Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 + β 3 X Pol + β 4 X 1 X Pol + β 5 X 2 X Pol.

25 Testing for an Interaction First, we will test to see if a non-additive model is worthwhile: Model C: Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 + β 3 X Pol Model A: Ŷ RTC = β 0 + β 1 X 1 + β 2 X 2 + β 3 X Pol + β 4 X 1 X Pol + β 5 X 2 X Pol Ŷ RTC = X X X Pol X 1 X Pol X 2 X Pol PRE=.042, F*=8.53, p=.0002 OK, a non-additive model is worthwhile.

26 Simple Relationships Ŷ RTC = X X X Pol X 1 X Pol X 2 X Pol If we plug in the values for X1 and X2 that code the various religions then we arrive at the following formulas: Protestants: Ŷ RTC = X Pol Catholics: Ŷ RTC = X Pol Jews: Ŷ RTC = X Pol

27 Political Ideology RTC

28 Individual Interactions Ŷ RTC = X X X Pol -.16X 1 X Pol -.06X 2 X Pol X 1 X Pol This is the interaction between the first contrast (Protestant and Jewish combined) vs. (Catholic) and political ideology. Go back to the previous slide. It looks like if we combined the Protestant and Jewish slopes and compared that to the slope for the Catholics we might find a difference in the slopes. β 4 = -.16, p< That difference in the slopes is indeed statistically significant.

29 Individual Interactions Ŷ RTC = X X X Pol X 1 X Pol X 2 X Pol X 2 X Pol This is the interaction between the first contrast (Protestant vs. Jewish) and political ideology. Go back to the graph. It looks like there is not much of a difference between the slopes for Protestant and Jewish groups. β 5 = -.06, p=.68. This interaction is indeed not statistically significant

30 Interpreting the b’s Ŷ RTC = bo + b1X 1 +b2 X 2 + b3X Pol + b4X 1 X Pol + b5X 2 X Pol Ŷ RTC = X X X Pol X 1 X Pol X 2 X Pol b1=.07 Variable X1 codes the contrast Catholic vs. (Jewish and Protestant). In an additive model the p value for b1 tells us whether we can conclude that contrast is significant. When we add the interaction of X1 and X Pol to the model then the p value for b1 tells us whether we can conclude the contrast is significant for the special case where X Pol = 0.

31 When X Pol =0 there is little difference between Catholics and (Jews and Protestants combined). Not only that, it is of little theoretical interest, for our scale for X Pol runs from 1 to 10, it can’t even be zero. RTC Xpol

32 Centering X Pol Let’s solve that problem by centering X Pol by subtracting the mean of X Pol (4.8) form each X Pol score. X Pol’ = X Pol X1 and X2 are contrasts, they are automatically centered (they have a mean of zero).

33 Interpreting the b’s Ŷ RTC = X X X Pol X 1 X Pol X 2 X Pol Ŷ RTC = X X X Pol’ X 1 X Pol’ X 2 X Pol’ Using X Pol’ rather than X Pol changed the values of b1 and b2 but not the values of b3, b4, b5. Remember, b1 is the slope of the relationship between X1 and Y when all the other variables equal 0, and we have changed X Pol to X Pol’, so that would effect X1. The same goes for b2. b3 didn’t change because X1 and X2 didn’t change. And, the interaction terms (b4 and b5) are not affected by centering data.

34 The effect of centering X Pol is just to move the Y axis to the mean of X Pol (4.8). b1 reflects Catholic vs. (Protestant and Jewish) at the new axis, b2 reflects Protestant vs. Jewish at the new axis. Note that the contrast reflected by b1 is now much larger and the contrast reflected by b2 is slightly larger. Both are more meaningful at the new axis (reflecting the contrast when X Pol equals its mean, rather than when it equals zero).

35 Our Model Ŷ RTC = b0 +b1X 1 +b2 X 2 + b3X Pol’ + b4X 1 X Pol’ + b5X 2 X Pol’ Ŷ RTC = X X X Pol’ X 1 X Pol’ X 2 X Pol’ All the b’s are statistically significant except for b5. We can see from the graph that the slopes for Protestant and Jewish don’t differ much, it’s not worth having that difference in our model. Final model (to date): Ŷ RTC = 0 + β1X 1 + β2 X 2 + β3X Pol’ + β4X 1 X Pol’