University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 3 Multivariate analysis.

Slides:



Advertisements
Similar presentations
Computing in Archaeology Session 12. Multivariate statistics © Richard Haddlesey
Advertisements

Soc 3306a Lecture 6: Introduction to Multivariate Relationships Control with Bivariate Tables Simple Control in Regression.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
(Hierarchical) Log-Linear Models Friday 18 th March 2011.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
BACKGROUND RESEARCH QUESTIONS  Does the time parents spend with children differ according to parents’ occupation?  Do occupational differences remain.
TEACHING DATA STRUCTURES AND RELATIONSHIPS USING THE GENERAL SOCIAL SURVEYS Stephen Sweet Ithaca College Professional Enhancement Programs Conference Macalester.
Interpreting published multivariate analyses (using logistic regression) Friday 11 th March 2011.
Analysing Cross-Tabulations
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
(Correlation and) (Multiple) Regression Friday 5 th March (and Logistic Regression too!)
Social Research Methods
DTC Quantitative Research Methods Three (or more) Variables: Extensions to Cross- tabular Analyses Thursday 13 th November 2014.
Dr. Michael R. Hyman, NMSU Cross-tabulations and Banners.
Cross-tabulations and Banners. Cross-tabulation Way to organize data by groups or categories, thus facilitating comparisons; joint frequency distribution.
Chapter 7 Correlational Research Gay, Mills, and Airasian
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Problem 1: Relationship between Two Variables-1 (1)
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Beyond Bivariate: Exploring Multivariate Analysis.
Chapter 15 – Elaborating Bivariate Tables
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Soc 3306a Lecture 10: Multivariate 3 Types of Relationships in Multiple Regression.
Chapter 1: The What and the Why of Statistics
Cross-Tabular Analysis Cross-Tabular Analysis
SLWK – 609 Research Methods- online- M. Secret Podcast 5 BASIC CONCEPTS OF RESEARCH DESIGN
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
Introducing additional variables POL 242 Renan Levine.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
DTC Quantitative Methods Three (or more) Variables: Extensions to Analyses Using Cross- tabulations or ANOVA Thursday 1 st March 2012.
Chapter 10: Relationships Between Two Variables: CrossTabulation
Multiple Regression 3 Sociology 5811 Lecture 24 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Section 7.4 ~ The Search for Causality Introduction to Probability and Statistics Ms. Young.
Agenda Review Homework 5 Review Statistical Control Do Homework 6 (In-class group style)
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 1 Bivariate Analysis with SPSS Revisited.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 5 Multiple Regression.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Survival Analysis/Event History Analysis:
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Statistics in Applied Science and Technology Supplemental: Elaborating Crosstabs: Adding a Third Variable.
Chapter 10: Cross-Tabulation Relationships Between Variables  Independent and Dependent Variables  Constructing a Bivariate Table  Computing Percentages.
Introduction. The Role of Statistics in Science Research can be qualitative or quantitative Research can be qualitative or quantitative Where the research.
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
Quantitative Methods in Social Research 2012/13 Week 7 (morning) session 22 nd February 2013 Analysing Cross-Tabulations.
DTC Quantitative Research Methods Regression I: (Correlation and) Linear Regression Thursday 27 th November 2014.
Chapter Two Methods in the Study of Personality. Gathering Information About Personality Informal Sources of Information: Observations of Self—Introspection,
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Copyright © 2010 by Nelson Education Limited. Elaborating Bivariate Tables.
Chapter 6 – 1 Relationships Between Two Variables: Cross-Tabulation Independent and Dependent Variables Constructing a Bivariate Table Computing Percentages.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Logistic Regression II/ (Hierarchical)
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Survey Design: Some Implications for.
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
Chapter 17 Basic Multivariate Techniques Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 6 Regression: ‘Loose Ends’
RESEARCH METHODS Lecture 32. The parts of the table 1. Give each table a number. 2. Give each table a title. 3. Label the row and column variables, and.
Chapter 1: The What and the Why of Statistics
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard)   Week 5 Multiple Regression  
Hypothesis Testing.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Logistic Regression II/ (Hierarchical)
The Correlation Coefficient (r)
DTC Quantitative Methods Three (or more) Variables: Extensions to Analyses Using Cross- tabulations or ANOVA Thursday 27th February 2014  
DTC Quantitative Methods Three (or more) Variables: Extensions to Analyses Using Cross- tabulations or ANOVA Thursday 28th February 2013  
Social Research Methods
Multivariate Statistics
Theme 4 Elementary Analysis
The Correlation Coefficient (r)
Presentation transcript:

University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 3 Multivariate analysis

Multivariate analysis Introductory modules covering quantitative methods often focus on univariate measures and then on bivariate (i.e. two-way) relationships. However, inter-relationships and phenomena in the social world are usually more complex than a bivariate analysis allows for. Multivariate analyses are thus commonly used in quantitative social research, as a reflection of this complexity. Later in the term we will look at multiple (linear) regression, logistic regression and (hierarchical) log-linear models, all of which are types of multivariate analysis. However, this week we consider the rationale for multivariate analysis in broad terms, and have a look at simple cross-tabular techniques for conducting this form of analysis.

Multivariate analysis de Vaus (1996: 198) suggests that we can use multivariate analysis to elaborate bivariate relationships, in order to answer the following questions: 1.Why does the relationship [between two variables] exist? What are the mechanisms and processes by which one variable is linked to another? 2.What is the nature of the relationship? Is it causal or non-causal? 3.How general is the relationship? Does it hold for people in general, or is it specific to certain subgroups? This is because multivariate analysis enables the identification of: Spurious relationships Intervening variables The replication of relationships The specification of relationships

Age Height Reading ability Spurious relationship Spurious relationships A spurious relationship exists where two variables are not related but a relationship between them is generated by their relationships with a third variable. For example:

Intervening variables Sometimes, although there is a real (non-spurious) relationship between two variables, we want to establish why that relationship exists. For example, if we discover that there is a relationship between risk of unemployment and ethnicity, we want to know why that is the case. One possibility is that some ethnic groups have lower educational levels and that this has implications for their ability to get work. In this case education would be an intervening variable. Intervening variables enable us to answer questions about the bivariate relationship between two variables – suggesting that (in this case) the relationship between ethnicity and unemployment is not direct but (at least in part) occurs via educational levels. EducationEthnicityUnemployment

Is it spurious or intervening? When we do statistical tests we will find similar results for a spurious variable and an intervening variable: In both cases the effect of the independent variable on the dependent variable will be moderated by the third variable. So how do we know whether this third variable provides evidence of a spurious relationship or is an intervening variable? –There is no hard-and-fast statistical rule for deciding this. –But if we are suggesting that a variable is intervening, the logic of the process must make sense – i.e. you must have a cogent theoretical reason for thinking that your independent variable affects the intervening variable which in turn affects the dependent variable. –This kind of causal process is easiest to argue for when the timing supports it, i.e. when the intervening variable can be seen to occur in between the independent and dependent variables (e.g. education in the earlier example of the relationship between ethnicity and unemployment).

Replication Sometimes when we have found a basic (‘zero-order’) relationship between two variables (e.g. ethnicity and unemployment), we want to demonstrate that this relationship exists within different subgroups of the population (e.g. for both men and women; for those of different ages…). Where the relationship IS replicated we can rule out the possibility that it is produced by the variable in question, either as an intervening variable or in a spurious way.

Specification Sometimes a particular variable only has an effect in specific situations. The variable that determines these situations is said to ‘interact’ with the independent variable. For example, an example in de Vaus’s book suggests that going to a religious school makes boys more religious but has little or no effect on girls. In this case type of school interacts with gender: religious education only affects students’ religiosity in combination with being male.

Specification (interactions) Not at allVery How religious was your education? Religiousness high low boys girls Not at allVery How religious was your education? Religiousness high low boys girls Interaction between No interaction sex and religiousness of school Graphical representation of the relationship between religious education and religiousness, controlling for sex:

Using Cramér’s V to classify a multivariate situation If the Cramér’s V values for the layers are all similar, then we have a situation of replication. If the Cramér’s V values are smaller for the layered cross-tabulation than the value for the original cross- tabulation, then we either have a situation where the third variable is acting as an intervening variable, or one where it is inducing a spurious relationship between the original two variables. Deciding between these two options involves reflecting on whether the third variable makes sense conceptually as part of some causal mechanism linking the original two variables. If we use SPSS to produce a cross-tabulation of two variables, then we can elaborate this relationship by introducing a third variable as a layer variable. Examining the Cramér’s V values for the original cross-tabulation and for the layers of the elaborated cross-tabulation tells us what kind of situation we are looking at:

If the Cramér’s V values for the layered cross-tabulation vary in size, perhaps with some being smaller than the original value and some being as large or larger than it, then the situation is one of specification. However, if one or more of the Cramér’s V values is larger than the original value, then a failure to take account of the third variable in the first instance may also have been suppressing an underlying relationship between the two variables. This latter situation is a variation on the theme of spuriousness: in this case, the absence of a bivariate relationship is spurious rather than the presence of one!) Using Cramér’s V to classify a multivariate situation (continued)

Multivariate analyses can utilise a variety of techniques (depending on the form of the data, research questions to be addressed, etc. – we will be looking at multiple (linear) regression, logistic regression and log-linear models), in order to determine whether the relationship between two variables persists or is altered when we ‘control for’ a third (or fourth, or fifth...) variable. Multivariate analysis can also enable us to establish which variable(s) has/have the greatest impact on a dependent variable – e.g. Is sex more important than ‘race’ in determining income? It is often important for a multivariate analysis to check for interactions between the effects of independent variables, as discussed earlier under the heading of specification. More generally…