Statistical Social Network Analysis - Stochastic Actor Oriented Models Johan Koskinen The Social Statistics Discipline Area, School of Social Sciences.

Slides:



Advertisements
Similar presentations
Mathematical Preliminaries
Advertisements

Chapter 13: Query Processing
© Jim Barritt 2005School of Biological Sciences, Victoria University, Wellington MSc Student Supervisors : Dr Stephen Hartley, Dr Marcus Frean Victoria.
Health Promotion in Fachhochschule Nordhausen. openingdefinitionfacts results 2 health promotion in Fachhochschule Nordhausen lecturer Sheila Fitzgerald.
& dding ubtracting ractions.
STATISTICS Sampling and Sampling Distributions
STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
STATISTICS HYPOTHESES TEST (I)
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
9.2 – Arithmetic Sequences and Series
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
1 Contact details Colin Gray Room S16 (occasionally) address: Telephone: (27) 2233 Dont hesitate to get in touch.
1 Correlation and Simple Regression. 2 Introduction Interested in the relationships between variables. What will happen to one variable if another is.
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Class 6: Hypothesis testing and confidence intervals
5-1 Chapter 5 Probability 1.
Discrete time Markov Chain
The basics for simulations
Chapter 4: Basic Estimation Techniques
On Comparing Classifiers : Pitfalls to Avoid and Recommended Approach
Von Karman Institute for Fluid Dynamics RTO, AVT 167, October, R.A. Van den Braembussche von Karman Institute for Fluid Dynamics Tuning of Optimization.
Tests of Significance and Measures of Association
6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.
Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.
Freshmen Orientation Survey Summer 2004 Report Mississippi State University Reported by: Rosy Nigro, MS Office of Assessment Division of Student Affairs.
1 Generating Network Topologies That Obey Power LawsPalmer/Steffan Carnegie Mellon Generating Network Topologies That Obey Power Laws Christopher R. Palmer.
Briana B. Morrison Adapted from William Collins
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
Chris Morgan, MATH G160 January 8, 2012 Lecture 13
Thomas Jellema & Wouter Van Gool 1 Question. 2Answer.
Chapter 6 The Mathematics of Diversification
Text Categorization.
Graphs, representation, isomorphism, connectivity
An introduction to exponential random graph models (ERGM)
Using AVS/Express in an Introductory Computer Graphics Course.
Multivariate Data/Statistical Analysis SC504/HS927 Spring Term 2008 Week 18: Relationships between variables: simple ordinary least squares (OLS) regression.
Statistical Analysis SC504/HS927 Spring Term 2008
Discrete Mathematics Dr.-Ing. Erwin Sitompul
Evaluation of precision and accuracy of a measurement
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
& dding ubtracting ractions.
Assessing influence and selection in network-behavioural co-evolution with an application to smoking and alcohol consumption among adolescents. Christian.
Simple Linear Regression Analysis
Multiple Regression and Model Building
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Chris Baerveldt, Marijtje van Duijn, Lotte Vermeij, Dianne van Hemert Ethnic Boundaries and Personal Choice NOSMO, Nijmegen, November 25, 2005.
By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Basics of Statistical Estimation
Epp, section 10.? CS 202 Aaron Bloomfield
Where we are Node level metrics Group level metrics Visualization
The Statistical Analysis of the Dynamics of Networks and Behaviour. An Introduction to the Actor-based Approach. Christian Steglich and Tom Snijders ——————
Some results from Scottish data The Statistical Analysis of the Dynamics of Networks and Behaviour: An Application to Smoking and Drinking Behaviour among.
Joint social selection and social influence models for networks: The interplay of ties and attributes. Garry Robins Michael Johnston University of Melbourne,
References References (continued) ANOMALIES Risk-taking Network Density Paradox Apparent contradictions in research findings Network density is an.
CS6800 Advanced Theory of Computation Fall 2012 Vinay B Gavirangaswamy
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Social Media Mining Graph Essentials.
Statistical Inference for Large Directed Graphs with Communities of Interest Deepak Agarwal.
Probability Theory School of Mathematical Science and Computing Technology in CSU Course groups of Probability and Statistics.
G Lect 21 G Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Introduction to Statistical Models for longitudinal network data Stochastic actor-based models Kayo Fujimoto, Ph.D.
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
Network Science: A Short Introduction i3 Workshop
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Simple Linear Regression
Longitudinal Social Network Data
Presentation transcript:

Statistical Social Network Analysis - Stochastic Actor Oriented Models Johan Koskinen The Social Statistics Discipline Area, School of Social Sciences Mitchell Centre for Network Analysis Workshop: Monday, 29 August 2011

Stochastic actor-oriented models - data Study of 32 freshman university students, 7 waves in 1 year. (van de Bunt, van Duijn, & Snijders, Computational & Mathematical Organization Theory, 5, (1999), 167 – 192)

Stochastic actor-oriented models - data

The mini-step Say that actor i wants to make a change to his network A piece of notation Adjacency matrix before change x = after change ( i j ) x ( i j ) = Lets say actor i = 1 changes relation to actor j = 2

The mini-step Say that actor i wants to make a change to his network A piece of notation Adjacency matrix before change x = after change ( i j ) x ( i j ) = Lets say actor i = 1 changes relation to actor j = 2

The mini-step Say that actor i wants to make a change to his network A piece of notation Adjacency matrix before change x = after change ( i j ) x ( i j ) = Lets say actor i = 1 changes relation to actor j = 2

The mini-step Say that actor i wants to make a change to his network A piece of notation Adjacency matrix before change x = after change ( i j ) x ( i j ) = Lets say actor i = 1 changes relation to actor j = 2

The mini-step A sample y path is a series of toggles Where a toggle set element

The mini-step x =

The mini-step x =

The mini-step x = x (1 2) =

The mini-step x = x (1 2) =

The mini-step x = x (1 2) = x (1 3) =

The mini-step x = x (1 2) = x (1 3) =

The mini-step x = x (1 2) = x (1 3) = x (1 4) =

The mini-step x = x (1 2) = x (1 3) = x (1 4) =

The mini-step In Snijders stochastic actor-oriented model the actors degree of satisfaction with new state is a weighted sum of e.g. The number of outgoing ties The number of reciprocated ties The number of ties to people of type A The number of ties to people like the actor The number of friends that are also friends The number of indirect friends

The mini-step In Snijders stochastic actor-oriented model the actors degree of satisfaction with new state is a weighted sum of e.g. The number of outgoing ties The number of reciprocated ties The number of ties to people of type A The number of ties to people like the actor The number of friends that are also friends The number of indirect friends

The mini-step In Snijders stochastic actor-oriented model the actors degree of satisfaction with new state is a weighted sum of e.g. The number of outgoing ties The number of reciprocated ties The number of ties to people of type A The number of ties to people like the actor The number of friends that are also friends The number of indirect friends Assume that we believe that these local structures are important to actors when they change their out-going ties

The mini-step x = x (1 2) = x (1 3) = x (1 4) = 1 edge 1 reciprocated tie 1 indirect others +1 edge +2 transitive triangles 1 indirect others 1 edge ±0 reciprocated tie ±0 indirect others

The mini-step x = We model the preference of changing actor: Preference actor i = β 1 × edge + β 2 × reciprocated tie + β 3 × indirect others + ε

The mini-step x = We model the preference of changing actor: Preference actor i = β 1 × edge + β 2 × reciprocated tie + β 3 × indirect others + ε random component Importance of structure

The mini-step x = We model the preference of changing actor: Preference actor i = β 1 × edge + β 2 × reciprocated tie + β 3 × indirect others + ε - If parameter positive: preference for creating reciprocated ties - If parameter negative: preference against creating reciprocated ties

The mini-step x = We model the preference of changing actor: Preference actor i = β 1 × edge + β 2 × reciprocated tie + β 3 × indirect others + ε Random component captures non- systematic, idiosyncratic factors

The evaluation function x = The evaluation function is the systematic part of the preference (e.g.): f i (y) = β 1 × edge + β 2 × reciprocated tie + β 3 × indirect others

The mini-step x = x (1 2) = x (1 3) = x (1 4) = The evaluation function gives the probability of change:

Empirical example The Gerhard van de Bunt data: 32 university freshmen (24 fem, 8 male). 3 observations used (t1, t2, t3) Densities increase from 0.15 at t1 via 0.18 to 0.22 at t3.

Simple model Rate parameters: about 3 opportunities for change/actor between observations; out-degree parameter negative: on average, cost of friendship ties higher than their benefits; reciprocity effect strong and highly significant (t = 1.79/0.27 = 6.6). estimates.e. Rate t1-t2 3.51(0.54) Rate t2-t3 3.09(0.49) Out-degree 1.10(0.15) Reciprocity 1.79(0.27)

The evaluation function in the simple model The evaluation function is: This expresses how much actor i likes the network Adding reciprocated tie (i.e., for which x ji = 1) gives i Adding non-reciprocated tie (i.e., for which x ji = 0) gives i.e., this has negative benefits

Empirical example no 2 Excerpt of 50 girls from the Teenage Friends and Lifestyle Study data set (West and Sweeting 1995) s50-smoke.dat. Smoking: 1 (non), 2 (occasional) and 3 (regular, i.e. more than once per week). s50-drugs.dat. Cannabis use: 1 (non), 2 (tried once), 3 (occasional) and 4 (regular). s50-alcohol.dat. Alcohol: 1 (non), 2 (once or twice a year), 3 (once a month), 4 (once a week) and 5 (more than once a week). s50-sport.dat. Sport: 1 (not regular) and 2 (regular). s50-familyevent.dat. Binary information over whether or not the number of persons has changed with which the pupil shares his home address.

Empirical example no 2 A script for fitting data: friend.data.w1 <- as.matrix(read.table("s50-network1.dat"))# read data friend.data.w2 <- as.matrix(read.table("s50-network2.dat")) friend.data.w3 <- as.matrix(read.table("s50-network3.dat")) drink <- as.matrix(read.table("s50-alcohol.dat")) smoke <- as.matrix(read.table("s50-smoke.dat")) friendship <- sienaNet( array( c( friend.data.w1, friend.data.w2, friend.data.w3 ), dim = c( 50, 50, 3 ) ) )# create dependent variable smoke1 <- coCovar( smoke[, 1 ] )# create constant covariate alcohol <- varCovar( drink )# create time varying covariate

Empirical example no 2 A script for fitting data: mydata <- sienaDataCreate( friendship, smoke1, alcohol )# define data myeff <- getEffects( mydata )# create effects structure print01Report( mydata, myeff, modelname = 's50_3_init' )# siena01 for reports myeff <- includeEffects( myeff, transTrip, cycle3 )# add structural effects myeff <- includeEffects( myeff, egoX, altX, egoXaltX, interaction1 = "alcohol" )# add covariate effects myeff <- includeEffects( myeff, simX, interaction1 = "smoke1" ) mymodel <- sienaModelCreate( useStdInits = TRUE, projname = 's50_3' )# define model as data + effects ans <- siena07( mymodel, data = mydata, effects = myeff) # estimate model

Co-evolution models We model the preference of actor when changing behaviour:

Co-evolution models We model the preference of actor when changing behaviour:

Co-evolution models We model the preference of actor when changing behaviour:

Co-evolution models We model the preference of actor when changing behaviour: Similar evaluation function:

Wrap-up The package RSiena supports analysis of - un-directed networks - bi-partite networks - multiple networks - … and more forthcoming Updates and news are published on Next RSiena workshop: 14 Sept in Zurich