Concerns about Causality in the Network Influence Model 1 Outcome behavior Prior behavior Behavior of network members selection influence.

Slides:



Advertisements
Similar presentations
MANOVA (and DISCRIMINANT ANALYSIS) Alan Garnham, Spring 2005
Advertisements

EMR 6550: Experimental and Quasi- Experimental Designs Dr. Chris L. S. Coryn Kristin A. Hobson Fall 2013.
Multiple Regression and Model Building
Quantitative Methods.
Random Assignment Experiments
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Alvin Kwan Division of Information & Technology Studies
Experimental Design making causal inferences. Causal and Effect The IV precedes the DV in time The IV precedes the DV in time The IV and DV are correlated.
The Use and Interpretation of the Constant Term
Choosing a Functional Form
Causal-Comparative Research
TOOLS OF POSITIVE ANALYSIS
Chapter One: The Science of Psychology
Leon-Guerrero and Frankfort-Nachmias,
Linear Regression/Correlation
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
PAI786: Urban Policy Class 2: Evaluating Social Programs.
Correlation Nabaz N. Jabbar Near East University 25 Oct 2011.
McGraw-Hill © 2006 The McGraw-Hill Companies, Inc. All rights reserved. Correlational Research Chapter Fifteen.
Chapter 8: Bivariate Regression and Correlation
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Chapter 13: Inference in Regression
© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.
Replacement Cases Framework Conclusion Correlational Framework overview Thresholds for inference and % bias to invalidate The counterfactual paradigm Internal.
CORRELATION & REGRESSION
Is there a relationship between the lengths of body parts ?
Chapter 15 Correlation and Regression
Influence and Correlation in Social Networks Priyanka Garg.
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
Chapter One: The Science of Psychology. Ways to Acquire Knowledge Tenacity Tenacity Refers to the continued presentation of a particular bit of information.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Correlational Designs
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Correlational Research Chapter Fifteen Bring Schraw et al.
Inductive Generalizations Induction is the basis for our commonsense beliefs about the world. In the most general sense, inductive reasoning, is that in.
Replacement Cases Framework overview Thresholds for inference and % bias to invalidate The counterfactual paradigm Internal validity example: kindergarten.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Mediation: Solutions to Assumption Violation
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Lecture 02.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Question paper 1997.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
5 May Crawford School 1 Comparative Case Studies – 1 Semester 1, 2009 POGO8096/8196: Research Methods Crawford School of Economics and Government.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
Single-Subject and Correlational Research Bring Schraw et al.
Causal inferences This week we have been discussing ways to make inferences about the causal relationships between variables. One of the strongest ways.
More on regression Petter Mostad More on indicator variables If an independent variable is an indicator variable, cases where it is 1 will.
The Psychologist as Detective, 4e by Smith/Davis © 2007 Pearson Education Chapter One: The Science of Psychology.
Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.
Replacement Cases Framework Conclusion Correlational Framework overview Thresholds for inference and % bias to invalidate The counterfactual paradigm Internal.
Chapter 9 Introduction to the t Statistic
Concerns about Causality
Is there a relationship between the lengths of body parts?
Betsy Sinclair University of Chicago IGNITE, PolNet 2012, Boulder
Econometrics ITFD Week 8.
Lecture 02.
AERA workshop April 4, 2014 (AERA on-line video – cost is $95)
ST3004: Research Methods Research Design
Welcome to on-line audience, ask questions with microphone
(AERA on-line video – cost is $105)
An Introduction to Correlational Research
Class 2: Evaluating Social Programs
Class 2: Evaluating Social Programs
Positive analysis in public finance
Alternative Scenarios and Related Techniques
(AERA on-line video – cost is $105)
Presentation transcript:

Concerns about Causality in the Network Influence Model 1 Outcome behavior Prior behavior Behavior of network members selection influence

Answer: Control for Prior Behavior! 2 There are heightened concerns about potential dependencies in estimating any social network model (e.g., Robins et al., 2007; Steglich, Snijders and Pearson, 2010 ). Regarding model 91) estimated influence is biased if the errors are not independent of the network exposure term (see Ord, 1975, equations ); the estimate of influence will be positively biased if there is some unexplained aspect of enforcement behavior that is related to the network exposure. The most compelling source of such dependencies would be if people choose to interact with others whose behaviors are similar to their own, known as selection in the network literature. Those who tended to engage in enforcement at time 1 might have chosen to interact with similar others between time 1 and time 2, and also would have been inclined to engage in enforcement behaviors at time 2. Because the network exposure term is likely confounded with prior enforcement behavior, model 1 includes a control for prior enforcement behavior. A second concern in the influence model would arise if the model of a fisherman’s behaviors was a function of the contemporaneous behaviors of his/her network members. This would essentially put the outcome on both sides of the model in which case the errors would be directly related to the exposure term. It is for this reason that we model enforcement behavior as a function of the previous behaviors of others in one’s network. This avoids creating dependencies between the errors and predictors by putting the same variables on both sides of the model. Even given our approach there may still be concerns about omitted variables that create dependencies between the errors and the exposure term. Therefore we quantify the robustness of our inferences to potential omitted variables (Frank, 2000 ). Steglich, Christian E.G. Tom A.B. Snijders, and Michael Pearson (2010). Dynamic Networks and Behavior: Separating Selection from Influence. Sociological Methodology, 40, Dynamic Networks and Behavior: Separating Selection from Influence. Ord, Keith. "Estimation methods for models of spatial interaction."Journal of the American Statistical Association (1975): Robins, Garry L., Tom A.B. Snijders, Peng Wang, Mark Handcock, and Philippa Pattison. Recent developments in exponential random graph (p*) models for social networks. Social Networks 29 (2007), Recent developments in exponential random graph (p*) models for social networks

Verify with Simulation (student Ran Xu) 3

4

home reflection Must Control for Prior 5

home reflection 6

home reflection 7 Must Control for Prior

Concerns about Causality: Omitted Variables 8 Response 2 Shalizi and Thomas (2011) show that network influence is unidentifiable if there is a latent trait X(i) that causes both a tie in the network A(i,j) and the prior behavior or belief Y(I,t-1) (see their Figure 1 below, actually it is Figure 2 in the published article). Shalizi, Cosma Rohilla, and Andrew C. Thomas. "Homophily and contagion are generically confounded in observational social network studies." Sociological Methods & Research 40, no. 2 (2011):

Concerns about Causality: in Terms of Shalizi & Thomas 9 Outcome behavior Prior behavior Behavior of network members selection influence Unobserved Latent Trait X A ij Y t-1 YtYt

home reflection When Regression Doesn’t Work  Instrumental variables?  Alternative assumptions, may be not better  Exclusion restriction  Strong instrument  Large n  Propensity scores?  No better than the covariates that go into it  Heckman, 2005; Morgan & Harding, 2006, page 40; Rosenbaum, 2002, page 297; Shadish et al., 2002, page 164);  How could they be better than covariates?  Propensity=f(covariates). Propensity=f(covariates).  Shalizi & Thomas:  Examine effects of non-neighbors  Looks for bounds (or sensitivity)  Examine clustering PAUL ROSENBAUM

Possible Instrument for Network Influence: Third parties 11 Outcome behavior Prior behavior Behavior of network members selection influence Instrument: Friends of network members who are not friends of i A ij Y t-1 YtYt

home reflection Paul Holland says: PAUL ROSENBAUM

home reflection It’s all in how you talk about it!  Do best method you can  Include relevant controls!  But science is as much in the nature of the discourse as the method  Virtue epistemology  Greco, 2009; Kvanig, 2003; Sosa, 2007  What would it take to invalidate the inference?  How much bias must be present to invalidate an inference?   Frank, K.A., Maroulis, S., Duong, M., and Kelcey, B What would it take to Change an Inference?: Using Rubin’s Causal Model to Interpret the Robustness of Causal Inferences. Education, Evaluation and Policy Analysis. Vol 35: Frank, K.A., Maroulis, S., Duong, M., and Kelcey, B What would it take to Change an Inference?: Using Rubin’s Causal Model to Interpret the Robustness of Causal Inferences. Education, Evaluation and Policy Analysis. Vol 35:   spreadsheet for calculating indices [KonFound-it! © ] spreadsheet for calculating indices [KonFound-it! © ]   powerpoint with examples and calculations (includes reference to STATA, SAS and SPSS) powerpoint with examples and calculations 

Effects of Social Capital on Implementation of Computers in the Classroom 14 Coefficient was reduced by only 7% when controlling for prior.

Importance of Controlling for the Prior: Longitudinal Data 15 Coefficient was reduced by only 7% when controlling for prior.

16 They then argue that influence is identifiable if all of the information about the latent variable X on the prior behavior Y(I, t-1) is contained within the observed variable Z(i) (panel a), or all of the information about X on the network tie A(I,j) is conveyed by z. This is the familiar condition that X is not confounded with the exposure term either because it is unrelated to Y or unrelated to the network tie (Frank, 2000). Shalizi and Thomas (2011) then note that “whether we face the unidentifiable situation of Figure 2 or the identifiable case of Figure 3 currently depends upon subject matter knowledge rather than statistical techniques” (page 218). Correspondingly, we argue that the issue of identification cannot be fully resolved. Instead it can and should be debated in terms of the latent variable X. But the debate should not be in terms of whether such a variable X exists. Of course it does, except in rare circumstances where the selection of network members or the expression of attributes at t-1 is completely determined by observed characteristics. Instead we acknowledge that X exists, but ask how large must it be to invalidate the inference of network influence. In our particular case, X must be correlated with the outcome Y(t) and with the network exposure term (based on the sum of the Yj t-1) at ____ to invalidate our inference.

Modeling influence and selection simultaneously 17 Shalizi and Thomas (2011) prove ­­that network influence is unidentifiable if there is a latent trait X(i) that causes both a tie in the network A(i,j) and the prior behavior or belief Y(i,t-1) (see their Figure 2). They then argue that influence is identifiable if all of the information about the latent variable X on the prior behavior Y(I, t-1) is contained within the observed variable Z(i), or all of the information about X on the network tie A(I,j) is conveyed by z. This is the familiar condition that the latent variable X is not confounded with the network exposure term either because it is unrelated to Y or unrelated to the network tie, conditioning on other variables in the model. Shalizi and Thomas (2011) then note that “whether we face the unidentifiable situation [of Figure 2] or the identifiable case [of Figure 3] currently depends upon subject matter knowledge rather than statistical techniques” (page 218). They then go on to state on page 218 that “It is noteworthy that the most successful attempts at explicit modeling that handle both homophily and influence, as found in the work of Leenders (1995); Steglich et al. (2004) involves, all at once, strong parametric (exponential-family) assumptions, plus the assumption that observable covariates carry all of the dependence from X [latent variable] to Y [observable attribute of individual] and A [network relation]. Thus the identification of the model depends on whether the observed covariates carry all of the dependencies. This is no different than the typical assumption of regression; the assumption applies whether or not one simultaneously models influence and selection. Shalizi and Thomas make a symmetric argument for the identification of the selection model. It depends on the quality of the covariates and not on simultaneously modeling influence. Shalizi and Thomas 2011

home reflection Christakis & Fowler: Contagion of Obesity 18

home reflection 19

home reflection C&F: Methods 20 Lagged controls?

home reflection Christakis & Fowler Model  Should we use simultaneous or staggered behavior?  They use both:  y it = ρ 1 ∑ i’ w ii’t y i’t /∑ i’ w ii’t + ρ 2 ∑ i’ w ii’t y i’t-1 /∑ i’ w ii’t + γ y it-1 +e it  Obesity 2000 =ρ 1 obesity of friends ρ 2 obesity of friends γ own obesity it-1 +e t :  Lyons: ρ 1 and ρ 2 have opposite signs. Hmmmm.  Collinearity problems? 21

Christakis and Fowler Debate: Lyons 22

home reflection Lyons, Russell The spread of evidence-poor medicine via flawed social-network analysis, Stat., Politics, Policy 2, 1 (2011), Article 2. DOI: / See Andrew Gelman: reasonable-to-me-there-could-well-be-something-important-that-im-missing-but-until-i-hear-otherwise- for-example-in-a-convincing-reply-by-christakis-and-f/ reasonable-to-me-there-could-well-be-something-important-that-im-missing-but-until-i-hear-otherwise- for-example-in-a-convincing-reply-by-christakis-and-f/ Critique of Christakis and Fowler “influence” model pages 5-6

home reflection Christakis & Fowler Model  Should we use simultaneous or staggered behavior?  They use both:  y it = ρ 1 ∑ i’ w ii’t y i’t /∑ i’ w ii’t + ρ 2 ∑ i’ w ii’t y i’t-1 /∑ i’ w ii’t + γ y it-1 +e it  Obesity 2000 =ρ 1 obesity of friends ρ 2 obesity of friends γ own obesity it-1 +e t  Same data on right and left hand sides of model  Lyons: ρ 1 and ρ 2 have opposite signs: hmmm  Collinearity problems? 24

home reflection Christakis & Fowler Model  Should we use simultaneous or staggered behavior?  They use both:  y it = ρ 1 ∑ i’ w ii’t y i’t /∑ i’ w ii’t + ρ 2 ∑ i’ w ii’t y i’t-1 /∑ i’ w ii’t + γ y it-1 +e it  Obesity 2000 =ρ 1 obesity of friends ρ 2 obesity of friends γ own obesity it-1 +e t  Same data on right and left hand sides of model  Lyons: ρ 1 and ρ 2 have opposite signs: hmmm  Collinearity problems? 25

home reflection C&F: Methods 26 directionality

home reflection Christakis & Fowler: Directionality Results 27 Are they statistically different from one another? No.

28 Articles on Causality

29

30

31

Articles on Causality 32

33