Chapter 10: Relationships Between Two Variables: CrossTabulation Independent and Dependent Variables Constructing a Bivariate Table Computing Percentages in a Bivariate Table Dealing with Ambiguous Relationships Between Variables Reading the Research Literature Properties of a Bivariate Relationship Elaboration Statistics in Practice
Introduction Bivariate Analysis: A statistical method designed to detect and describe the relationship between two variables. Cross-Tabulation: A technique for analyzing the relationship between two variables that have been organized in a table.
Understanding Independent and Dependent Variables Example: If we hypothesize that English proficiency varies by whether person is native born or foreign born, what is the independent variable, and what is the dependent variable? Independent: nativity Dependent: English proficiency
Constructing a Bivariate Table Bivariate table: A table that displays the distribution of one variable across the categories of another variable. Column variable: A variable whose categories are the columns of a bivariate table. Row variable: A variable whose categories are the rows of a bivariate table. Cell: The intersection of a row and a column in a bivariate table. Marginals: The row and column totals in a bivariate table.
Percentages Can Be Computed in Different Ways: Column Percentages: column totals as base Row Percentages: row totals as base
Support for Abortion by Job Security Absolute Frequencies Support for Abortion by Job Security Abortion Job Find Easy Job Find Not Easy Row Total Yes 24 25 49 No 20 26 46 Column Total 44 51 95
Support for Abortion by Job Security Column Percentages Support for Abortion by Job Security Abortion Job Find Easy Job Find Not Easy Row Total Yes 55% 49% 52% No 45% 51% 48% Column Total 100% 100% 100% (44) (51) (95)
Support for Abortion by Job Security Row Percentages Support for Abortion by Job Security Abortion Job Find Easy Job Find Not Easy Row Total Yes 49% 51% 100% (49) No 43% 57% 100% (46) Column Total 46% 54% 100% (95)
Properties of a Bivariate Relationship Does there appear to be a relationship? How strong is it? What is the direction of the relationship?
Existence of a Relationship IV: Number of Traumas DV: Support for Abortion If the number of traumas were unrelated to attitudes toward abortion among women, then we would expect to find equal percentages of women who are pro-choice (or anti-choice), regardless of the number of traumas experienced.
Existence of the Relationship
# of Traumas Abortion 1 2+ Total Yes 46% 38% 22% 35% No 54% 62% 78% 65% (N) 100% (27) (44) (33) (104) # of Traumas Abortion 1 2+ Total Yes 46% No 54% (N) 100% (27) (44) (33) (104)
Example Education Income Less than HS HS College Total $0-$2,0000 54% 28% 12% 32% $2,0000-$3,0000 36% 62% 10% $3,0000+ 78% 100%
Determining the Strength of the Relationship A quick method is to examine the percentage difference across the different categories of the independent variable. The larger the percentage difference across the categories, the stronger the association. We rarely see a situation with either a 0 percent or a 100 percent difference.
Direction of the Relationship Positive relationship: A bivariate relationship between two variables measured at the ordinal level or higher in which the variables vary in the same direction. Negative relationship: A bivariate relationship between two variables measured at the ordinal level or higher in which the variables vary in opposite directions.
A Positive Relationship
A Negative Relationship
Elaboration Elaboration is a process designed to further explore a bivariate relationship; it involves the introduction of control variables. A control variable is an additional variable considered in a bivariate relationship. The variable is controlled for when we take into account its effect on the variables in the bivariate relationship.
Three Goals of Elaboration Elaboration allows us to test for non-spuriousness. Elaboration clarifies the causal sequence of bivariate relationships by introducing variables hypothesized to intervene between the IV and DV. Elaboration specifies the different conditions under which the original bivariate relationship might hold.
Testing for Nonspuriousness Direct causal relationship: a bivariate relationship that cannot be accounted for by other theoretically relevant variables. Spurious relationship: a relationship in which both the IV and DV are influenced by a causally prior control variable and there is no causal link between them. The relationship between the IV and DV is said to be “explained away” by the control variable.
Number of Firefighters Property Damage The Bivariate Relationship Between Number of Firefighters and Property Damage Number of Firefighters Property Damage (IV) (DV)
Process of Elaboration Partial tables: bivariate tables that display the relationship between the IV and DV while controlling for a third variable. Partial relationship: the relationship between the IV and DV shown in a partial table.
The Process of Elaboration Divide the observations into subgroups on the basis of the control variable. We have as many subgroups as there are categories in the control variable. Re-examine the relationship between the original two variables separately for the control variable subgroups. Compare the partial relationships with the original bivariate relationship for the total group.
Intervening Relationship Intervening variable: a control variable that follows an independent variable but precedes the dependent variable in a causal sequence. Intervening relationship: a relationship in which the control variable intervenes between the independent and dependent variables.
Intervening Relationship: Example Renzi's Hypothesis(1975) Religion Preferred Family Size Support for Abortion (IV) (Intervening Control Variable) (DV) Renzi, Mario.1975. Ideal Family Size as an Intervening Variable between Religion and Attitudes Towards Abortion, Journal for the Scientific Study of Religion, Vol. 14, No. 1,pp. 23-27
Conditional Relationships Conditional relationship: a relationship in which the control variable’s effect on the dependent variable is conditional on its interaction with the independent variable. The relationship between the independent and dependent variables will change according to the different conditions of the control variable.
Conditional Relationships Another way to describe a conditional relationship is to say that there is a statistical interaction between the control variable and the independent variable.
Conditional Relationships: example Stance on Legal Abortion Pro-choice Pre-life Always wrong 37 98 Not Wrong 63 2
Conditional Relationships Male Stance on Legal Abortion Pro-choice Pre-life Always wrong 29 96 Not Wrong 71 4 Female Stance on Legal Abortion Pro-choice Pre-life Always wrong 46 100 Not Wrong 54
Conditional Relationships
A more complex example Data: 326 defendants were recorded as being indicted for homicide in 20 Florida counties during 1976-1977. Frequency Col Pct Defendant’s race Black White Death Penalty 149 89.76 141 88.13 No Yes 17 10.24 19 11.88
Partial Tables: Example Defendant’s race Black White Victim’s race Death Penalty 97 52 9 132 No Yes 6 11 19
Partial Tables: Example Victim’s race=Black Death Penalty Defendant race Total Frequency Col Pct Black White No 97 94.17 9 100.00 106 Yes 6 5.83 0 0.00 6 103 9 112
Partial Tables: Example Victim’s race=White Death Penalty Defendant race Total Frequency Col Pct Black White No 52 82.54 132 87.42 184 Yes 11 17.46 19 12.58 30 63 151 214