Sunbelt 2009statnet Development Team www.statnet.org ERGM introduction 1 Exponential Random Graph Models Statnet Development Team Mark Handcock (UW) Martina.

Slides:



Advertisements
Similar presentations
Sampling Research Questions
Advertisements

Introduction Simple Random Sampling Stratified Random Sampling
Ricard V. Solè and Sergi Valverde Prepared by Amaç Herdağdelen
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
An introduction to exponential random graph models (ERGM)
Where we are Node level metrics Group level metrics Visualization
Analysis and Modeling of Social Networks Foudalis Ilias.
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
Analysis of Social Media MLD , LTI William Cohen
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4.
Statistical inference for epidemics on networks PD O’Neill, T Kypraios (Mathematical Sciences, University of Nottingham) Sep 2011 ICMS, Edinburgh.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.

Introduction to Econometrics The Statistical Analysis of Economic (and related) Data.
Joint social selection and social influence models for networks: The interplay of ties and attributes. Garry Robins Michael Johnston University of Melbourne,
A gentle introduction to Gaussian distribution. Review Random variable Coin flip experiment X = 0X = 1 X: Random variable.
Exponential Random Graph Models (ERGM) Michael Beckman PAD777 April 9, 2010.
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Modern methods The classical approach: MethodProsCons Time series regression Easy to implement Fairly easy to interpret Covariates may be added (normalization)
Introduction Statistical options tend to be limited in most GIS applications. This is likely to be redressed in the future. We will look at spatial statistics.
Hearing About A Job: Networks, Information, and Labor Market Segregation Katherine Stovel and Christine Fountain University of Washington March
The Very Small World of the Well-connected. (19 june 2008 ) Lada Adamic School of Information University of Michigan Ann Arbor, MI
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Prof. SankarReview of Random Process1 Probability Sample Space (S) –Collection of all possible outcomes of a random experiment Sample Point –Each outcome.
Bump Hunting The objective PRIM algorithm Beam search References: Feelders, A.J. (2002). Rule induction by bump hunting. In J. Meij (Ed.), Dealing with.
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Concepts and Notions for Econometrics Probability and Statistics.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Composition of environmental decision-making networks – a case study in Helsinki, Finland M.Soc.Sc. Arho Toikka HERC Seminar Series Department.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Social Network Analysis and Complex Systems Science
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Chapter Outline 3.1THE PERVASIVENESS OF RISK Risks Faced by an Automobile Manufacturer Risks Faced by Students 3.2BASIC CONCEPTS FROM PROBABILITY AND STATISTICS.
Geo597 Geostatistics Ch9 Random Function Models.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.
The Dirichlet Labeling Process for Functional Data Analysis XuanLong Nguyen & Alan E. Gelfand Duke University Machine Learning Group Presented by Lu Ren.
Neighbourhood-based models for social networks: model specification issues Pip Pattison, University of Melbourne [with Garry Robins, University of Melbourne.
Exponential Random Graph Models Under Measurement Error Zoe Rehnberg with Dr. Nan Lin Washington University in St. Louis ARTU 2014.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
A two minute introduction to: Exponential random graph (p*)models for social networks SNAC Workshop, Illinois, November 2005 Garry Robins, University of.
POSC 202A: Lecture 4 Probability. We begin with the basics of probability and then move on to expected value. Understanding probability is important because.
Learning Simio Chapter 10 Analyzing Input Data
Lecture 2: Statistical learning primer for biologists
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
T tests comparing two means t tests comparing two means.
Introduction to Statistical Models for longitudinal network data Stochastic actor-based models Kayo Fujimoto, Ph.D.
TEMPORAL EXPONENTIAL- FAMILY RANDOM GRAPH MODELING (TERGMS) WITH STATNET Prof. Steven Goodreau Prof. Martina Morris Prof. Michal Bojanowski EUSN BARCELONA.
Near repeat burglary chains: describing the physical and network properties of a network of close burglary pairs. Dr Michael Townsley, UCL Jill Dando Institute.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.
Analysis of Social Media MLD , LTI William Cohen
Groups of vertices and Core-periphery structure
Exponential random graph models for multilevel networks
Test for Goodness of Fit
Chapter Outline 3.1 THE PERVASIVENESS OF RISK
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Review of Probability Theory
Using R for Network Models and Infectious Disease Transmission
Introduction to logistic regression a.k.a. Varbrul
Network Science: A Short Introduction i3 Workshop
SA3202 Statistical Methods for Social Sciences
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Introduction to Econometrics
Social Network Analysis with Apache Spark and Neo4J
Presentation transcript:

Sunbelt 2009statnet Development Team ERGM introduction 1 Exponential Random Graph Models Statnet Development Team Mark Handcock (UW) Martina Morris (UW) Carter Butts (UCI) Dave Hunter (PSU) Steven Goodreau (UW) Jim Moody (Duke) Skye Bender-deMoll (at Large) A brief introduction

Sunbelt 2009statnet Development Team ERGM introduction 2 Fair use Feel free to use these slides for purposes such as teaching (including multiple copies for classroom use), scholarship, research, criticism, comment, or reporting. If you present these slides in public, please retain the authorship information from the footer below. Thanks from the statnet development team

Sunbelt 2009statnet Development Team ERGM introduction 3 What is a statistical model? The word “model” means different things in different subfields A statistical model is a –formal representation of a –stochastic process –specified at one level (e.g., person, dyad) that –aggregates to a higher level (e.g., population, network)

Sunbelt 2009statnet Development Team ERGM introduction 4 Statistical analysis, given a model Estimate parameters of the process –Joint estimation of multiple, possibly correlated, effects Inference from sampled data to population –Uncertainty in parameter estimates Goodness of fit –Traditional diagnostics Model fit (BIC, AIC) Estimation diagnostics (MCMC performance) –Network-specific GOF Network statistics in the model as covariates Network properties not in the model

Sunbelt 2009statnet Development Team ERGM introduction 5 Why take a statistical approach? Descriptive vs. generative goals –Descriptive: numerical summary measures Nodal level: e.g., centrality, geodesic distribution Configuration level: e.g., cycle census Network level: e.g., centralization, clustering, small world, core/periphery –Generative: micro foundations for macro patterns Recover underlying dynamic process from x-sectional data Test alternative hypotheses Extrapolate and simulate from model

Sunbelt 2009statnet Development Team ERGM introduction 6 ERGMs are a hybrid: Traditional generalized linear models (statistical) but –Unit of analysis: relation (dyad) –Observations may be dependent (like a complex system) –Complex nonlinear and threshold effects –Estimation is different Agent-based models (mathematical) but –Can estimate model parameters from data –Can test model goodness of fit

Sunbelt 2009statnet Development Team ERGM introduction 7 Substantive considerations Different processes can lead to similar macro signatures For example: “clustering” typically observed in social nets Sociality – highly active persons create clusters Homophily – assortative mixing by attribute creates clusters Triad closure – triangles create clusters Want to be able to fit these terms simultaneously, and identify the independent effects of each process on the overall outcome.

Sunbelt 2009statnet Development Team ERGM introduction 8 Basic statistical model coefficient*covariate Probability of the graph Simplest model: Bernoulli random graph (Erdös-Rènyi) All ties x ij are equally probable and independent (iid) So the probability of the graph just depends on the cumulative probability of each tie:

Sunbelt 2009statnet Development Team ERGM introduction 9 Re-expressed in terms of p(tie) Probability of the graph Conditional log odds of the tie.  is the “change statistic”, the change in the value of the covariate g(y) when the ij tie changes from 1 to 0 So  is the impact of the covariate on the log-odds of a tie Probability of ij tie, conditional on the rest of the graph

Sunbelt 2009statnet Development Team ERGM introduction 10 attributes of nodes Heterogeneity by group – Average activity – Mixing by group Individual heterogeneity attributes of links Heterogeneity in – Duration – Types (sex, drug…) configurations Degree distributions (or stars) Cycle distributions (2, 3, 4, etc.) Shared partner distributions What kinds of covariates? What creates heterogeneity in the probability of a tie being formed? Dyad Independent Terms Dyad Dependent Terms

Sunbelt 2009statnet Development Team ERGM introduction 11 Two modeling components Covariates: what terms in the model? Coefficients: Homogeneity constraints for coefficients on terms? For example: covariate: edges, etc. coefficient: each edge has the same probability vs. each edge has different probabilities or some edges have different probabilities