Analysing the co-evolution of social networks and “behavioural” dimensions with SIENA Christian SteglichUniversity of Groningen Tom SnijdersUniversity.

Slides:



Advertisements
Similar presentations
Statistical Social Network Analysis - Stochastic Actor Oriented Models Johan Koskinen The Social Statistics Discipline Area, School of Social Sciences.
Advertisements

Analysing network-behavioural co-evolution with SIENA Christian SteglichUniversity of Groningen Tom SnijdersUniversity of Groningen Mike PearsonNapier.
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Assessing influence and selection in network-behavioural co-evolution with an application to smoking and alcohol consumption among adolescents. Christian.
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Where we are Node level metrics Group level metrics Visualization
CHAPTER 13 M ODELING C ONSIDERATIONS AND S TATISTICAL I NFORMATION “All models are wrong; some are useful.”  George E. P. Box Organization of chapter.
The Statistical Analysis of the Dynamics of Networks and Behaviour. An Introduction to the Actor-based Approach. Christian Steglich and Tom Snijders ——————
Succession Model We talked about the facilitation, inhibition, and tolerance models, but all of these were verbal descriptions Today we are going to consider.
Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
SA-1 Probabilistic Robotics Planning and Control: Partially Observable Markov Decision Processes.
1 How to make a selection table like the one below from SIENA output. The table shows contributions to the objective function for highest / lowest possible.
Some results from Scottish data Topic smoking behaviour and friendship Problem influence and/or selection Theory drifting smoke rings (Pearson, West, Michell)
Advanced model specification for SIENA models: score tests and forward model selection Mark HuismanUniversity of Groningen Christian SteglichUniversity.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Analysing the co-evolution of social networks and “behavioural” dimensions with SIENA Christian SteglichUniversity of Groningen Tom SnijdersUniversity.
Some results from Scottish data The Statistical Analysis of the Dynamics of Networks and Behaviour: An Application to Smoking and Drinking Behaviour among.
FRIENDSHIP AND DELINQUENCY OF ADOLESCENTS FRIENDSHIP AND DELINQUENCY OF ADOLESCENTS First results of a four wave study on adolescent’s behavior and relations.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Joint social selection and social influence models for networks: The interplay of ties and attributes. Garry Robins Michael Johnston University of Melbourne,
Chapter 3 Probability 3-1 Overview 3-2 Fundamentals 3-3 Addition Rule
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
EdPsy 511 August 28, Common Research Designs Correlational –Do two qualities “go together”. Comparing intact groups –a.k.a. causal-comparative and.
References References (continued) ANOMALIES Risk-taking Network Density Paradox Apparent contradictions in research findings Network density is an.
Business and Economics 7th Edition
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Sunbelt 2009statnet Development Team ERGM introduction 1 Exponential Random Graph Models Statnet Development Team Mark Handcock (UW) Martina.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Statistics for Managers Using Microsoft® Excel 5th Edition
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 4-2 Basic Concepts of Probability.
Models of Influence in Online Social Networks
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 4 Probability 4-1 Overview 4-2 Fundamentals 4-3 Addition Rule
Slide 1 Definition Figures 3-4 and 3-5 Events A and B are disjoint (or mutually exclusive) if they cannot both occur together.
Particle Filtering in Network Tomography
Exploring the dynamics of social networks Aleksandar Tomašević University of Novi Sad, Faculty of Philosophy, Department of Sociology
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Basic Principle of Statistics: Rare Event Rule If, under a given assumption,
Research Methods Chapter 8 Data Analysis. Two Types of Statistics Descriptive –Allows you to describe relationships between variables Inferential –Allows.
Introduction to Probability  Probability is a numerical measure of the likelihood that an event will occur.  Probability values are always assigned on.
Variation This presentation should be read by students at home to be able to solve problems.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
EDPSY Chp. 2: Measurement and Statistical Notation.
Susan O’Shea The Mitchell Centre for Social Network Analysis CCSR/Social Statistics, University of Manchester
1 EXAKT SKF Phase 1, Session 2 Principles. 2 The CBM Decision supported by EXAKT Given the condition today, the asset mgr. takes one of three decisions:
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
Hierarchy Overview Background: Hierarchy surrounds us: what is it? Micro foundations of social stratification Ivan Chase: Structure from process Action.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Introduction to Statistical Models for longitudinal network data Stochastic actor-based models Kayo Fujimoto, Ph.D.
Edpsy 511 Basic concepts Exploratory Data Analysis.
Network Science K. Borner A.Vespignani S. Wasserman.
1 Automated Planning and Decision Making 2007 Automated Planning and Decision Making Prof. Ronen Brafman Various Subjects.
Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
Business and Economics 6th Edition
Agent-Based Methods for Dynamic Social Networks
CHAPTER 29: Multiple Regression*
Tabulations and Statistics
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
An Introduction to Correlational Research
Correlations: Correlation Coefficient:
Kalman Filter: Bayes Interpretation
Business and Economics 7th Edition
Longitudinal Social Network Data
Presentation transcript:

Analysing the co-evolution of social networks and “behavioural” dimensions with SIENA Christian SteglichUniversity of Groningen Tom SnijdersUniversity of Groningen Patrick WestUniversity of Glasgow Andrea KnechtUtrecht University SIENA workshop Groningen, 8-11 January,2007 Funded by The Netherlands Organization for Scientific Research (NWO) under grant

Some notation & clarification Social networks: Tie variables “Behavioural” dimensions:...can be any changeable dependent actor variable z i : overt behaviour attitudes cognitions... a1a1 a2a2 a4a4 a3a3 a5a5

Social network dynamics often depend on actors’ characteristics… – patterns of homophily: interaction with similar others can be more rewarding than interaction with dissimilar others – patterns of exchange: selection of partners such that they complement own abilities …but also actors’ characteristics can depend on the social network: – patterns of assimilation: spread of innovations in a professional community pupils copying ‘chic’ behaviour of friends at school traders on a market copying (allegedly) successful behaviour of competitors – patterns of differentiation: division of tasks in a work team

Example 1: (Andrea Knecht, 2003/04) Data on the co-evolution of petty delinquency (graffiti, fighting, stealing, breaking something, buying illegal copies) and friendship among first- grade pupils at Dutch secondary schools (“bridge class”). 125 school classes 4 measurement points Questions to be addressed: Is petty crime a dimension that plays a role in friendship formation? Is petty crime a habit that is acquired by peer influence? The following slides show how the type of data “look like” that we are analysing. Note:we analyse panel data – in principle, continuous-time data is easier to analyse – but the methods are not yet implemented.

Friendship ties inherited from primary school girls yellowboys green

1st wave: August/September 2003 node size indicates strength of delinquency

2nd wave: November/December 2003

3rd wave: February/March 2004

4th wave: May/June 2004

What do these pictures tell us? – there is segregation of the friendship network according to gender (although not as strong as in other classes) – delinquency is stronger among boys than among girls Questions unanswered: – to what degree can social influence and social selection processes account for the observed dynamics? More general…

How to analyse this? - structure of complete networks is complicated to model - additional complication due to the interdependence with behavior - and on top of that often incomplete observation (panel data) beh(t n )beh(t n+1 ) net(t n )net(t n+1 ) persistence (?) selection influence persistence (?)

Agenda for this talk: - Presentation of the stochastic modelling framework - An illustrative research question (Example 2) - Data for Example 2 - Software - Analysis - Interpretation of results - Summary

Brief sketch of the stochastic modelling framework (1) -Stochastic process in the space of all possible network-behaviour configurations (huge!) -First observation of the network as the process’ starting value. beh net For the simplest case of dichotomous ties and one dichotomous actor characteristic, the cardinality of the state space increases at a squared exponential rate with the number of actors:

16 possible states for a network consisting of one dyad only. (assuming actor characteristics to be dichotomous)

Brief sketch of the stochastic modelling framework (2) -Change is modelled as occurring in continuous time. -Network actors drive the process: individual decisions. two domains of decisions*: decisions about network neighbours ( selection, deselection ), decisions about own behaviour. per decision domain two submodels: When can actor i make a decision? (rate function) Which decision does actor i make? (objective function) -Technically: Continuous time Markov process. -Beware: model-based inference! * assumption: conditional independence, given the current state of the process.

How does the model look like? State space Pair (x,z)(t) contains adjacency matrix x and vector(s) of behavioural variables z at time point t. Stochastic process Co-evolution is modelled by specifying transition probabilities between such states (x,z)(t 1 ) and (x,z)(t 2 ). Continuous time model invisibility of to-and-fro changes in panel data poses no problem, evolution can be modelled in smaller units (‘micro steps’). Observed changes are quite complex – they are interpreted as resulting from a sequence of micro steps.

Micro steps that are modelled explicitly network micro steps: (x,z)(t 1 ) and (x,z)(t 2 ) differ in one tie variable x ij only. behavioural micro steps: (x,z)(t 1 ) and (x,z)(t 2 ) differ (by one) in one behavioural score variable z i only. Actor-driven model Micro steps are modelled as outcomes of an actor’s decisions; these decisions are conditionally independent, given the current state of the process. Schematic overview of model components Timing of decisionsDecision rules Network evolution Network rate functionNetwork decision rule Behavioural evolution Behavioural rate functionBehavioural decision rule

Timing of decisions / transitions Waiting times between decisions are assumed to be exponentially distributed (Markov process); they can depend on state, actor and time. Network micro step / network decision by actor i Choice options - change tie variable to one other actor j - change nothing Maximize objective function + random disturbance Deterministic part, depends on network-behavioural neighbourhood of actor i Random part, i.i.d. over x, z, t, i, j, according to extreme value type I Choice probabilities resulting from distribution of  are of multinomial logit shape x(i  j) is the network obtained from x by changing tie to actor j; x(i  i) formally stands for keeping the network as is

Network micro step / network decision by actor i Objective function f is linear combination of “effects”, with parameters as effect weights. Examples: reciprocity effect measures the preference difference of actor i between right and left configuration transitivity effect i i i i j j j j kk

Other possible effects to include in the network objective function: (from Steglich, Snijders & Pearson 2004)

Possible changes of network ties in the dyad case: (diagram renders possible network micro steps only)

Behavioural micro step by actor i Choice options - increase, decrease, or keep score on behavioural variable Maximize objective function + random disturbance Choice probabilities analogous to network part Assume independence also of the network random part Objective function is different from the network objective function

Possible effects to include in the behavioural objective function(s): (from Steglich, Snijders & Pearson 2004)

Possible changes of behaviour in the dyad case: (diagram renders possible behavioural micro steps only)

Total process model Transition intensities (‘infenitesimal generator’) of Markov process: Here  = waiting times,  = change in behavioural, z(i,  ) = behavioural vector after change. Together with starting value, process model is fully defined. Parametrisation of process implies equilibrium distribution, process is a ‘drift’ from 1st observation towards regions of high probability under this equilibrium.

Modelling selection and influence Influence and selection are based on a measure of behavioural similarity Similarity of actor i to network neighbours : Actor i has two ways of increasing friendship similarity: –by choosing friends j who behave the same (network effect): –by adapting own behaviour to that of friends j (behaviour effect): assimilation (social influence) homophily (social selection) i j i i i i i i i j j j j j j j

Remarks on model estimation The likelihood of an observed data set cannot be calculated in closed form, but can at least be simulated.  ‘third generation problem’ of statistical analysis,  simulation-based inference is necessary. Currently available: – Method of Moments estimation (Snijders 2001, 1998) – Maximum likelihood approach (Snijders & Koskinen 2003) Implementation: program SIENA, part of the StOCNET software package (see link in the end).

Example (2): A set of illustrative research questions: To what degree is music taste acquired via friendship ties? Does music taste (co-)determine the selection of friends? Data: social network subsample of the West of Scotland Study (West & Sweeting 1996) three waves, 129 pupils (13-15 year old) at one school pupils named up to 6 friends Take into account previous results on same data (Steglich, Snijders & Pearson 2004): What is the role played by alcohol consumption in both friendship formation and the dynamics of music taste?

43.Which of the following types of music do you like listening to? Tick one or more boxes. Rock  Indie  Chart music  Jazz  Reggae  Classical  Dance  60’s/70’s  Heavy Metal  House  Techno  Grunge  Folk/Traditional  Rap  Rave  Hip Hop  Other (what?)…………………………………. Music question: 16 items Before applying SIENA: data reduction to the 3 most informative dimensions

scale CLASSICAL scale ROCK scale TECHNO

32.How often do you drink alcohol? Tick one box only. More than once a week  About once a week  About once a month  Once or twice a year  I don’t drink (alcohol)  Alcohol question: five point scale General: SIENA requires dichotomous networks and behavioural variables on an ordinal scale.

Some descriptives: average dynamics of the four behavioural variables global dynamics of friendship ties (dyad counts)

Software: The models briefly sketched above are instantiated in the SIENA program. Optionally, evolution models can be estimated from given data, or evolution processes can be simulated, given a model parametrisation and starting values for the process. SIENA is implemented in the StOCNET program package, available at (release 14-feb-05). Currently, it allows for analysing the co-evolution of one social network (directed or undirected) and multiple behavioural variables.

Identification of data sourcefiles Recoding of variables and identification of missing data Specifying subsets of actors for analyses

Data specification: insert data into the model’s “slots”.

Model specification: select parameters to include for network evolution.

Model specification: select parameters to include for behavioural evolution.

Model specification: some additional features.

Model estimation: stochastic approximation of optimal parameter values.

Network objective function: – intercept: outdegree – network-endogenous: reciprocity distance-2 – covariate-determined: gender homophily gender ego gender alter – behaviour-determined: beh. homophily beh. ego beh. alter Rate functions were kept as simple as possible (periodwise constant). Analysis of the music taste data: Behaviour objective function(s): – intercept: tendency – network-determined: assimilation to neighbours – covariate-determined: gender main effect – behaviour-determined: behaviour main effect “behaviour” stands shorthand for the three music taste dimensions and alcohol consumption.

Results: network evolution Ties to just anyone are but costly. Reciprocated ties are valuable (overcompensating the costs). There is a tendency towards transitive closure. There is gender homophily: alter boy girl boy ego girl table gives gender-related contributions to the objective function There is alcohol homophily: alter low high low ego high table shows contributions to the objective function for highest / lowest possible scores There is no general homophily according to music taste: alter techno rock classical techno egorock classical table renders contributions to the objective function for highest possible scores & mutually exclusive music tastes

Results: behavioural evolution Assimilation to friends occurs: – on the alcohol dimension, – on the techno dimension, – on the rock dimension. There is evidence for mutual exclusiveness of: – listening to techno and listening to rock, – listening to classical and drinking alcohol. The classical listeners tend to be girls.

Summary (1) Does music taste (co-)determine the selection of friends? Somewhat. There is no music taste homophily (possible exception: classical music). Listening to rock music seems to coincide with popularity, listening to classical music with unpopularity. To what degree is music taste acquired via friendship ties? It depends on the specific music taste: Listening to techno or rock music is ‘learnt’ from peers, listening to classical music is not – maybe a ‘parent thing’?

Summary (2) What is the role played by alcohol consumption in friendship formation? There is homophilous selection going on: Friends select each other based on similarity in alcohol consumption. What is the role played by alcohol consumption in the dynamics of music taste? Only for the classical scale, an effect was found: Drinking alcohol reduces the chances of listening to classical music (and vice versa). Literature: Christian Steglich, Tom Snijders, and Patrick West, Applying SIENA: An illustrative analysis of the co-evolution of adolescents' friendship networks, taste in music, and alcohol consumption. Methodology 2(1),