Presentation is loading. Please wait.

Presentation is loading. Please wait.

Autocorrelation in Social Networks: A Preliminary Investigation of Sampling Issues Antonio Páez Darren M. Scott Erik Volz Sunbelt XXVI – International.

Similar presentations


Presentation on theme: "Autocorrelation in Social Networks: A Preliminary Investigation of Sampling Issues Antonio Páez Darren M. Scott Erik Volz Sunbelt XXVI – International."— Presentation transcript:

1 Autocorrelation in Social Networks: A Preliminary Investigation of Sampling Issues Antonio Páez Darren M. Scott Erik Volz Sunbelt XXVI – International Network for Social Network Analysis Network Autocorrelation Analysis

2 Spatial analysis ◘Central tenet: First Law of Geography “Everything is related to everything else, but near things are more related than distant things” (Tobler, 1970) ◘Spatial analysis (Miller, 2004)

3 Spatial statistical models ◘Statistical representation of this principle ○Spatially autoregressive model Y= X  +  WY +WY + Spatial spillovers Economic externalities … (e.g. Fingleton, 2003;2004)

4 Connectivity matrix W ◘Key element of the model ○Defines the spatial structure of the study area ○Position relative to other units

5 First Law: General principle ◘Distance in social space “Everyone is related to everyone else, but near people are more related than distant people” ◘Akerlof’s social distance (1997) “Agents who are initially close interact more strongly while those who are socially distant have little interaction”

6 Social network analysis ◘Network models Y= X  +  WY +WY + Social influence (e.g. Leenders, 2002; Marsden and Friedkin, 1994)

7 Geo-referencing ◘Nature of connectivity is relatively unambiguous even if definition of weights is not

8 Social referencing ◘Identification of network connections

9 Specification of W ◘Research questions Within a linear autoregressive framework: ○What is the effect of under-specifying matrix W… ? (how much effort should go into trying to observe/identify network connections?) ○What is the effect of different network topologies…? On quality of estimators, model identification (Previous work by Stetzer, 1982; Griffith, 1996)

10 Experimental setup ◘Assumptions ○Closed system (interactions with the rest of the world are negligible) ○All individuals are observed, their attributes can be obtained ○Not all network connections are identified »Deliberate effort to minimize observation cost: select individuals and identify all their connections

11 Experimental setup ◘Simulate networks with different topologies (Matrix W ) ○Poisson distribution / exponential distribution ○Degree distribution: 1.5, 3.5, 5.5, 7.5 ○Clustering: 0.2, 0.3, 0.4, 0.5, 0.6, 0.7 Random networks with tunable degree distribution and clustering – Volz, 2004

12 Experimental setup ◘Simulate data ○  : 0.1, 0.3, 0.5, 0.7, 0.9 ○  1 =2.0;  2 =1.0 ○ X 1 : const; X 2 : uniform (1,10) (see Anselin and Florax, 1995) ○  : standard normal ○ n =100 (number of observations) Y =  WY +X 1  1 +X 2  2 + 

13 Experimental setup ◘Randomly sample from connectivity matrix W (e.g. 95% of individual connections) ○ s : 0.95, 0.90, 0.85, 0.80, 0.75, 0.70, 0.65, 0.60, 0.55, 0.50 ◘Estimate coefficients ○1,000 replications @ each level of sampling ◘Calculate mse: bias – variance ◘Model identification: likelihood ratio test

14 Results Degree Distribution ( d ) Clustering ( c ) 1.53.57.55.5 0.2 0.3 0.4 0.5 0.6 0.7 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.2 0.3 0.4 0.5 0.6 0.7 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.2 0.3 0.4 0.5 0.6 0.7 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.2 0.3 0.4 0.5 0.6 0.7 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  = 0.10.1 0.30.3 0.50.5 0.7 0.90.70.9  =

15 Summary and conclusions ◘Specification of connectivity matrix in social network settings ◘Resources available for observing network connections – sampling strategies ◘Simulation experiment using networks with controlled topologies: quality of estimators, power of identification tests

16 Summary and conclusions ◘Main control is degree of network autocorrelation ◘Clustering: relatively small effect

17 Summary and conclusions ◘Weak network autocorrelation (  = 0.1 ~ 0.3) ○Effect of under-specification on coefficients is relatively small ○Tests may fail to identify the effect ◘Moderate network autocorrelation (  = 0.5) ○Effect on coefficients becomes noticeable @ s~0.75, and this effect is sharper with increasing degree distribution ○Tests correctly reject null hypothesis of no autocorrelation 90% of time @ p=0.05

18 Summary and conclusions ◘Strong network autocorrelation (  = 0.7 ~ 0.9) ○Quality of estimators deteriorates very rapidly, even @ s~0.90 ○Tests lose power at higher degree distributions ◘Further research ○Alternative sampling schemes (e.g. snowball, referral) ○Over-specification of connectivity matrix W ○“Seeding” matrix W

19 d = 1.5; c = 0.2;  = 0.1 d c 

20 d = 1.5; c = 0.2;  = 0.3 d c 

21 d = 1.5; c = 0.2;  = 0.5 d c 

22 d = 1.5; c = 0.2;  = 0.7 d c 

23 d = 1.5; c = 0.2;  = 0.9 d c 

24 d = 1.5; c = 0.3;  = 0.1 d c 

25 d = 1.5; c = 0.3;  = 0.3 d c 

26 d = 1.5; c = 0.3;  = 0.5 d c 

27 d = 1.5; c = 0.3;  = 0.7 d c 

28 d = 1.5; c = 0.3;  = 0.9 d c 

29 d = 1.5; c = 0.4;  = 0.1 d c 

30 d = 1.5; c = 0.4;  = 0.3 d c 

31 d = 1.5; c = 0.4;  = 0.5 d c 

32 d = 1.5; c = 0.4;  = 0.7 d c 

33 d = 1.5; c = 0.4;  = 0.9 d c 

34 d = 1.5; c = 0.5;  = 0.1 d c 

35 d = 1.5; c = 0.5;  = 0.3 d c 

36 d = 1.5; c = 0.5;  = 0.5 d c 

37 d = 1.5; c = 0.5;  = 0.7 d c 

38 d = 1.5; c = 0.5;  = 0.9 d c 

39 d = 1.5; c = 0.6;  = 0.1 d c 

40 d = 1.5; c = 0.6;  = 0.3 d c 

41 d = 1.5; c = 0.6;  = 0.5 d c 

42 d = 1.5; c = 0.6;  = 0.7 d c 

43 d = 1.5; c = 0.6;  = 0.9 d c 

44 d = 1.5; c = 0.7;  = 0.1 d c 

45 d = 1.5; c = 0.7;  = 0.3 d c 

46 d = 1.5; c = 0.7;  = 0.5 d c 

47 d = 1.5; c = 0.7;  = 0.7 d c 

48 d = 1.5; c = 0.7;  = 0.9 d c 

49 d = 3.5; c = 0.2;  = 0.1 d c 

50 d = 3.5; c = 0.2;  = 0.3 d c 

51 d = 3.5; c = 0.2;  = 0.5 d c 

52 d = 3.5; c = 0.2;  = 0.7 d c 

53 d = 3.5; c = 0.2;  = 0.9 d c 

54 d = 3.5; c = 0.3;  = 0.1 d c 

55 d = 3.5; c = 0.3;  = 0.3 d c 

56 d = 3.5; c = 0.3;  = 0.5 d c 

57 d = 3.5; c = 0.3;  = 0.7 d c 

58 d = 3.5; c = 0.3;  = 0.9 d c 

59 d = 3.5; c = 0.4;  = 0.1 d c 

60 d = 3.5; c = 0.4;  = 0.3 d c 

61 d = 3.5; c = 0.4;  = 0.5 d c 

62 d = 3.5; c = 0.4;  = 0.7 d c 

63 d = 3.5; c = 0.4;  = 0.9 d c 

64 d = 3.5; c = 0.5;  = 0.1 d c 

65 d = 3.5; c = 0.5;  = 0.3 d c 

66 d = 3.5; c = 0.5;  = 0.5 d c 

67 d = 3.5; c = 0.5;  = 0.7 d c 

68 d = 3.5; c = 0.5;  = 0.9 d c 

69 d = 3.5; c = 0.6;  = 0.1 d c 

70 d = 3.5; c = 0.6;  = 0.3 d c 

71 d = 3.5; c = 0.6;  = 0.5 d c 

72 d = 3.5; c = 0.6;  = 0.7 d c 

73 d = 3.5; c = 0.6;  = 0.9 d c 

74 d = 3.5; c = 0.7;  = 0.1 d c 

75 d = 3.5; c = 0.7;  = 0.3 d c 

76 d = 3.5; c = 0.7;  = 0.5 d c 

77 d = 3.5; c = 0.7;  = 0.7 d c 

78 d = 3.5; c = 0.7;  = 0.9 d c 

79 d = 5.5; c = 0.2;  = 0.1 d c 

80 d = 5.5; c = 0.2;  = 0.3 d c 

81 d = 5.5; c = 0.2;  = 0.5 d c 

82 d = 5.5; c = 0.2;  = 0.7 d c 

83 d = 5.5; c = 0.2;  = 0.9 d c 

84 d = 5.5; c = 0.3;  = 0.1 d c 

85 d = 5.5; c = 0.3;  = 0.3 d c 

86 d = 5.5; c = 0.3;  = 0.5 d c 

87 d = 5.5; c = 0.3;  = 0.7 d c 

88 d = 5.5; c = 0.3;  = 0.9 d c 

89 d = 5.5; c = 0.4;  = 0.1 d c 

90 d = 5.5; c = 0.4;  = 0.3 d c 

91 d = 5.5; c = 0.4;  = 0.5 d c 

92 d = 5.5; c = 0.4;  = 0.7 d c 

93 d = 5.5; c = 0.4;  = 0.9 d c 

94 d = 5.5; c = 0.5;  = 0.1 d c 

95 d = 5.5; c = 0.5;  = 0.3 d c 

96 d = 5.5; c = 0.5;  = 0.5 d c 

97 d = 5.5; c = 0.5;  = 0.7 d c 

98 d = 5.5; c = 0.5;  = 0.9 d c 

99 d = 5.5; c = 0.6;  = 0.1 d c 

100 d = 5.5; c = 0.6;  = 0.3 d c 

101 d = 5.5; c = 0.6;  = 0.5 d c 

102 d = 5.5; c = 0.6;  = 0.7 d c 

103 d = 5.5; c = 0.6;  = 0.9 d c 

104 d = 5.5; c = 0.7;  = 0.1 d c 

105 d = 5.5; c = 0.7;  = 0.3 d c 

106 d = 5.5; c = 0.7;  = 0.5 d c 

107 d = 5.5; c = 0.7;  = 0.7 d c 

108 d = 5.5; c = 0.7;  = 0.9 d c 

109 d = 7.5; c = 0.2;  = 0.1 d c 

110 d = 7.5; c = 0.2;  = 0.3 d c 

111 d = 7.5; c = 0.2;  = 0.5 d c 

112 d = 7.5; c = 0.2;  = 0.7 d c 

113 d = 7.5; c = 0.2;  = 0.9 d c 

114 d = 7.5; c = 0.3;  = 0.1 d c 

115 d = 7.5; c = 0.3;  = 0.3 d c 

116 d = 7.5; c = 0.3;  = 0.5 d c 

117 d = 7.5; c = 0.3;  = 0.7 d c 

118 d = 7.5; c = 0.3;  = 0.9 d c 

119 d = 7.5; c = 0.4;  = 0.1 d c 

120 d = 7.5; c = 0.4;  = 0.3 d c 

121 d = 7.5; c = 0.4;  = 0.5 d c 

122 d = 7.5; c = 0.4;  = 0.7 d c 

123 d = 7.5; c = 0.4;  = 0.9 d c 

124 d = 7.5; c = 0.5;  = 0.1 d c 

125 d = 7.5; c = 0.5;  = 0.3 d c 

126 d = 7.5; c = 0.5;  = 0.5 d c 

127 d = 7.5; c = 0.5;  = 0.7 d c 

128 d = 7.5; c = 0.5;  = 0.9 d c 

129 d = 7.5; c = 0.6;  = 0.1 d c 

130 d = 7.5; c = 0.6;  = 0.3 d c 

131 d = 7.5; c = 0.6;  = 0.5 d c 

132 d = 7.5; c = 0.6;  = 0.7 d c 

133 d = 7.5; c = 0.6;  = 0.9 d c 

134 d = 7.5; c = 0.7;  = 0.1 d c 

135 d = 7.5; c = 0.7;  = 0.3 d c 

136 d = 7.5; c = 0.7;  = 0.5 d c 

137 d = 7.5; c = 0.7;  = 0.7 d c 

138 d = 7.5; c = 0.7;  = 0.9 d c 


Download ppt "Autocorrelation in Social Networks: A Preliminary Investigation of Sampling Issues Antonio Páez Darren M. Scott Erik Volz Sunbelt XXVI – International."

Similar presentations


Ads by Google