Local Networks Overview Personal Relations: Core Discussion Networks Getting Deals Done Questions to answer with local network data Mixing Local Context.

Slides:



Advertisements
Similar presentations
Conceptualization, Operationalization, and Measurement
Advertisements

Cross Cultural Research
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Random Assignment Experiments
Social Network Analysis
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Sampling Distributions
Chapter 14 Comparing two groups Dr Richard Bußmann.
Business Statistics for Managerial Decision
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Chapter 3 Doing Sociological Research 1. Sociology & the Scientific Method The research process: 1.Developing a research question 2.Creating a research.
Beginning the Research Design
Chi-square Test of Independence
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
A/S/L? Homophily of Online and Face to Face Social Ties Gustavo S. Mesch & Ilan Talmud Department of Sociology and Anthropology, University of Haifa.
Lecture 9: One Way ANOVA Between Subjects
Today Concepts underlying inferential statistics
RESEARCH METHODS IN EDUCATIONAL PSYCHOLOGY
Chapter 4 Selecting a Sample Gay, Mills, and Airasian
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Experiments and Observational Studies.  A study at a high school in California compared academic performance of music students with that of non-music.
Measures of Central Tendency
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Overview Granovetter: Strength of Weak Ties What are ‘weak ties’? why are they ‘strong’? Burt: Structural Holes What are they? What do they do? How do.
Cultural Difference: Investment Attitudes and Behaviors of High Income Americans Tahira K. Hira – Iowa State University
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter 5 Selecting a Sample Gay, Mills, and Airasian 10th Edition
. Chapter 3 Displaying and Describing Categorical Data.
Methods of Media Research Communication covers a broad range of topics. Also it draws heavily from other fields like sociology, psychology, anthropology,
User Study Evaluation Human-Computer Interaction.
Assignment 2: remarks FIRST PART Please don’t make a division of labor so blatantly obvious! 1.1 recode - don't just delete everything that looks suspicious!
Sociological Research Methods Sociology: Chapter 2, Section 1.
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Local Networks Overview Personal Relations: GSS Network Data To Dwell Among Friends Questions to answer with local network data Mixing Local Context Social.
Multiple Regression 3 Sociology 5811 Lecture 24 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Module 2 Research Strategies.
Gathering Useful Data. 2 Principle Idea: The knowledge of how the data were generated is one of the key ingredients for translating data intelligently.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 11: Bivariate Relationships: t-test for Comparing the Means of Two Groups.
PCB 3043L - General Ecology Data Analysis.
Bell Ringer Write down your Student ID# and Phone #, and Sports Jersey # from Fall Sport. Some of you will receive candy based on your student ID and Phone.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Structural Holes & Weak Ties
The Analysis of Variance ANOVA
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Introduction to Social Network Analysis Columbia University April 2007 James Moody Duke University.
T-tests Chi-square Seminar 7. The previous week… We examined the z-test and one-sample t-test. Psychologists seldom use them, but they are useful to understand.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chi-Squared Test of Homogeneity Are different populations the same across some characteristic?
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Chi Square Procedures Chapter 14. Chi-Square Goodness-of-Fit Tests Section 14.1.
Conducting surveys and designing questionnaires. Aims Provide students with an understanding of the purposes of survey work Overview the stages involved.
Local Networks Overview Personal Relations: Core Discussion Networks
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Structural Holes & Weak Ties
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Presentation transcript:

Local Networks Overview Personal Relations: Core Discussion Networks Getting Deals Done Questions to answer with local network data Mixing Local Context Social Support Collecting (Ego-)Network Data Cloning Headless Frogs Examples Effect of missing data Strategies for Analysis Content Structure Software: UCINET & PAJEK

Core Discussion networks Question asked: “From time to time, most people discuss important matters with other people. Looking back over the last six months -- who are the people with whom you discussed matters important to you? Just tell me their first names or initials.” Why this question? Only time for one question Normative pressure and influence likely travels through strong ties Similar to ‘best friend’ or other strong tie generators Local Networks Core Discussion Nets

Types of measures: Network Range: the extent to which a person’s ties connects them to a diverse set of other actors. Includes: Size, density, homogeneity Network Composition: The types of alters in ego’s networks. Can include many things, here it is about kin. Local Networks Core Discussion Nets

Distribution of total network size, GSS 1985 Percent Local Networks Core Discussion Nets

Network size by: Age: Drops with age at an increasing rate. Elderly have few close ties. Education: Increases with education. College degree ~ 1.8 times larger Sex (Female): No gender differences on network size. Race: African Americans networks are smaller (2.25) than White Networks (3.1). Local Networks Core Discussion Nets

Proportion Kin, GSS 1985 Local Networks Core Discussion Nets

Proportion Kin by: Age: Age Proportion Kin Local Networks Core Discussion Nets

Proportion Kin by: Education: Proportion decreases with education, but they nominate more of both kin and non-kin in absolute numbers. Sex (Female): Females name slightly more kin than males do. Race: African American cite fewer kin (absolute and proportion) than do Whites. Local Networks Core Discussion Nets

Network Density Recall that density is the average value of the relation among all pairs of ties. Here, density is only calculated over the alters in the network. R  = Local Networks Core Discussion Nets

Density Local Networks Core Discussion Nets

Network Density Age: Increases as we age. Education: Decreases among the most educated. Race: No differences by race. Size of Place: People from large cities have lower density than do those in small cities. Local Networks Core Discussion Nets

Network Heterogeneity Heterogeneity is the variance in type of people in your network. Networks tend to be more homogeneous than the population. Marsden reports differences by Age, Education, Race and Gender. He finds that: Age distribution is fairly wide, almost evenly distributed, though lower than the population at large Homogenous by education (30% differ by less than a year, on average) Very homogeneous with respect to race (96% are single race) Heterogeneous with respect to gender Local Networks Core Discussion Nets

Network Heterogeneity Heterogeneity differs by: Age: Tends to decrease as we age Education: Heterogeneity increases with education Race: No differences in age. Minorities tend to have higher race-heterogeneity (consistent with Blau’s intergroup mixing model) and lower gender heterogeneity. Size of place: Large settings tend to be correlated with greater heterogeneity in the network. Local Networks Core Discussion Nets

Social Network Data Cloning Headless Frogs How good is the name generator? Bearman and Parigi ask about what is being captured in the GSS name generator, which because of it’s placement in the GSS has become a standard question. Others have done this, and found that the resulting list of names does not differ significantly (see Straits 2000). Bearman & Parigi argue that to understand the network, you need to understand what it is people are really talking about. The basic assumption of the GSS question is that people talk about important matters to people who are important to them.

Social Network Data Cloning Headless Frogs Key Questions: 1)What do people talk about? 2)Why do so many people not report talking about anything with anybody? 3)Given the heterogeneity of the topics discussed, is there a foundation from which one could use the GSS data to describe anything meaningful about core discussion networks? 4)Is there a pattern of topics to alters and how does this affect comparative analyses?

Social Network Data Cloning Headless Frogs Key Questions: 1)What do people talk about? & Who did they talk to? Note that the topic was heavily dependent on the questionnaire order. In this survey, it was the first question.

Social Network Data Cloning Headless Frogs Key Questions: 1)Why do so many people not report talking about anything with anybody?

Social Network Data Cloning Headless Frogs talks about what with who? Connections are significant cells from table 5.

Social Network Data Cloning Headless Frogs Who talks about what? Connections are large values from figure 1. Males Females

Social Network Data Cloning Headless Frogs 1)Why do so many people not report talking about anything with anybody? 44% report nobody to talk to More likely to be without spouses, unemployed and non-white 56% report nothing important to talk about.

Social Network Data Cloning Headless Frogs

Social Network Data Cloning Headless Frogs End result suggest using questions that are linked directly to conversation domains of substantive interest. Or, more generally, defining relationships that are of importance for your topic of study.

Local Networks Getting Deals Done If networks provide resources and “social capital,” then different types of networks should matter for business transactions. -Builds on a long line of research about risk control, managing uncertainty and information gathering through networks. -The argument turns on a subtle difference: the networks that help you get information about a deal (and thus bring a deal to the table) are not the same as the networks that generate approval for the deal, and in fact might work in different ways.

Local Networks Getting Deals Done Outline of the argument & findings.

Local Networks Getting Deals Done Claims the results suggest a paradox: a) Uncertain deals require strong networks b) But embeddedness in strong networks makes it less likely a deal will close.* This leads them to the ‘multiple lens’ hypothesis. That decisions are best made when subject to information that comes from multiple, disconnected sources. (*note this works on two different notions of ‘strong’ in the actual models, so there’s some empirical slippage here…)

Local Networks Fischer’s Work. What does Fischer have to say about Age homogeneity in local nets? (hidden)

Fischer’s Work. What does Fischer have to say about marital homogeneity in local nets? (hidden)

Fischer’s Work. What does Fischer have to say about the size of local nets (by context)? (hidden)

Fischer’s Work. What does Fischer have to say about the density of local nets (by context)? (hidden)

Fischer’s Work. What does Fischer have to say about the % of Kin (by context & Relation type)? (hidden)

Fischer: the effect of urbanism. “Urbanism generally increases people’s access to people like themselves and to people unlike themselves -- simply by increasing the sheer numbers of both. For people whose status is a majority or plurality… the increase in similars is of marginal importance; such people are available almost anywhere, and especially in small communities. The increase in dissimilars.. That urbanism brings may be of more consequence since they are now available in large numbers for the first time. The result: in cities majority people may know more minority people; they tend toward networks heterogeneity, if in any direction at all. For people in minority status…the increase in dissimilar others is of little importance; such unlike people are all around them in most places. The increase in people similar to themselves is is more consequential since they now have a large pool of like people to choose from for the first time. The result: In citis minority people know more minority people like themselves; they tend toward network homogeneity. (hidden)

Existing Sources of Social Network Data: There are lots of network data archived. Check INSNA for a listing. The PAJEK data page includes a number of exemplars for large-scale networks. 1-Mode Data Local Network data: Fairly common, because it is easy to collect from sample surveys. GSS, NHSL, Urban Inequality Surveys, etc. Pay attention to the question asked Key features are (a) number of people named and (b) whether alters are able to nominate each other. Social Network Data Network Data Sources: Existing data sources

Existing Sources of Social Network Data: 1-Mode Data Partial network data: Much less common, because cost goes up significantly once you start tracing to contacts. Snowball data: start with focal nodes and trace to contacts CDC style data on sexual contact tracing Limited snowball samples: Colorado Springs drug users data Geneology data Small-world network samples Limited Boundary data: select data within a limited bound Cross-national trade data Friendships within a classroom Family support ties Social Network Data Network Data Sources: Existing data sources

Existing Sources of Social Network Data: 1-Mode Data Complete network data: Significantly less common and never perfect. Start by defining a theoretically relevant boundary Then identify all relations among nodes within that boundary Co-sponsorship patterns among legislators Friendships within strongly bounded settings (sororities, schools) Examples: Add Health on adolescent friendships Hallinan data on within-school friendships McFarland’s data on verbal interaction Electronic data on citations or coauthorship (see Pajek data page) See INSNA home page for many small-scale networks Social Network Data Network Data Sources: Existing data sources

Boundary Specification Problem Network methods describe positions in relevant social fields, where flows of particular goods are of interest. As such, boundaries are a fundamentally theoretical question about what you think matters in the setting of interest. See Marsden (19xx) for a good review of the boundary specification problem In general, there are usually relevant social foci that bound the relevant social field. We expect that social relations will be very clumpy. Consider the example of friendship ties within and between a high- school and a Jr. high: Social Network Data Network Data Sources: Collecting network data

a)Network data collection can be time consuming. It is better (I think) to have breadth over depth. Having detailed information on <50% of the sample will make it very difficult to draw conclusions about the general network structure. b)Question format: If you ask people to recall names (an open list format), fatigue will result in under-reporting If you ask people to check off names from a full list, you can often get over-reporting c) It is common to limit people to a small number if nominations (~5). This will bias network measures, but is sometimes the best choice to avoid fatigue. d) Concrete relational indicators are best (who did you talk to?) over attitudes that are harder to define (who do you like?) Social Network Data Network Data Sources: Collecting network data

Boundary Specification Problem Social Network Data Network Data Sources: Collecting network data While students were given the option to name friends in the other school, they rarely do. As such, the school likely serves as a strong substantive boundary

Local Network data: When using a survey, common to use an “ego-network module.” First part: “Name Generator” question to elicit a list of names Second part: Working through the list of names to get information about each person named Third part: asking about relations among each person named. Social Network Data Network Data Sources: Collecting network data GSS Name Generator: “From time to time, most people discuss important matters with other people. Looking back over the last six months -- who are the people with whom you discussed matters important to you? Just tell me their first names or initials.” Why this question? Only time for one question Normative pressure and influence likely travels through strong ties Similar to ‘best friend’ or other strong tie generators Note there are significant substantive problems with this name generator

Electronic Small World name generator: Social Network Data Network Data Sources: Collecting network data

Local Network data: The second part usually asks a series of questions about each person GSS Example: “Is (NAME) Asian, Black, Hispanic, White or something else?” Social Network Data Network Data Sources: Collecting network data ESWP example: Will generate N x (number of attributes) questions to the survey

Local Network data: The third part usually asks about relations among the alters. Do this by looping over all possible combinations. If you are asking about a symmetric relation, then you can limit your questions to the n(n-1)/2 cells of one triangle of the adjacency matrix: Social Network Data Network Data Sources: Collecting network data GSS: Please think about the relations between the people you just mentioned. Some of them may be total strangers in the sense that they wouldn't recognize each other if they bumped into each other on the street. Others may be especially close, as close or closer to each other as they are to you. First, think about NAME 1 and NAME 2. A. Are NAME 1 and NAME 2 total strangers? B. ARe they especially close? PROBE: As close or closer to eahc other as they are to you?

Local Network data: The third part usually asks about relations among the alters. Do this by looping over all possible combinations. If you are asking about a symmetric relation, then you can limit your questions to the n(n-1)/2 cells of one triangle of the adjacency matrix: Social Network Data Network Data Sources: Collecting network data

Snowball Samples: Snowball samples work much the same as ego-network modules, and if time allows I recommend asking at least some of the basic ego-network questions, even if you plan to sample (some of) the people your respondent names. Start with a name generator, then any demographic or relational questions. Have a sample strategy Random Walk designs (Klovdahl) Strong tie designs All names designs Get contact information from the people named Snowball samples are very effective at providing network context around focal nodes. Detailed treatments of snowball sampling estimates are given in Frank (). Social Network Data Network Data Sources: Collecting network data

Snowball Samples: Social Network Data Network Data Sources: Collecting network data

Complete Network data Data collection is concerned with all relations within a specified boundary. Requires sampling every actor in the population of interest (all kids in the class, all nations in the alliance system, etc.) The network survey itself can be much shorter, because you are getting information from each person (so ego does not report on alters). Two general formats: Recall surveys (“Name all of your best friends”) Check-list formats: Give people a list of names, have them check off those with whom they have relations. Social Network Data Network Data Sources: Collecting network data

Social Network Data Network Data Sources: Collecting network data Complete network surveys require a process that lets you link answers to respondents. You cannot have anonymous surveys. Recall: Need Id numbers & a roster to link, or hand- code names to find matches Checklists Need a roster for people to check through

Social Network Data Network Data Sources: Missing Data Whatever method is used, data will always be incomplete. What are the implications for analysis? Example 1. Ego is a matchable person in the School Ego M M M M Out Un True Network Ego M M M M Out Un Observed Network Un Out

Example 2. Ego is not on the school roster M M M M M Un True Network M M M M M Un Observed Network Un Social Network Data Network Data Sources: Missing Data

Example 3: Node population: 2-step neighborhood of Actor X Relational population: Any connection among all nodes 1-step 2-step 3-step F F FullFull (0) Full F F (0) F (0) Full (0) Unknown UK Full (0) Social Network Data Network Data Sources: Missing Data

Example 4 Node population: 2-step neighborhood of Actor X Relational population: Trace, plus All connections among 1-step contacts F F FullFull (0) Full Unknown F F (0) F (0) Full (0) Unknown UK Full (0) 1-step 2-step 3-step Social Network Data Network Data Sources: Missing Data

Example 5. Node population: 2-step neighborhood of Actor X Relational population: Only tracing contacts F F FullFull (0) UnknownFull Unknown F F (0) F (0) Full (0) Unknown UK Full (0) 1-step 2-step 3-step Social Network Data Network Data Sources: Missing Data

Example 6 Node population: 2-step neighborhood from 3 focal actors Relational population: All relations among actors FullFull (0) Full (0) Full (0) Full (0) Unknown UK Full (0) Full Focal 1-Step 2-Step 3-Step Focal1-Step2-Step3-Step Social Network Data Network Data Sources: Missing Data

Example 7. Node population: 1-step neighborhood from 3 focal actors Relational population: Only relations from focal nodes FullFull (0) Unknown Full (0) Full (0) Full (0) Unknown UK Full (0) Full Focal 1-Step 2-Step 3-Step Focal1-Step2-Step3-Step Social Network Data Network Data Sources: Missing Data

Local Network Analysis Introduction Local network analysis uses data from a simple ego-network survey. These might include information on relations among ego’s contacts, but often not. Questions include: Population Mixing The extent to which one type of person is tied to another type of person (race by race, etc.) Local Network Composition Peer behavior Cultural milieu Opportunities or Resources in the network Social Support Local Network Structural Network Size Density Holes & Constraint Concurrency Dyadic behavior Frequency of contact Interaction content Specific exchange behaviors

Local Network Analysis Introduction Advantages Cost: data are easy to collect and can be sampled Methods are relatively simple extensions of common variable-based methods social scientists are already familiar with Provides information on the local network context, which is often the primary substantive interest. Can be used to describe general features of the global network context Population mixing, concurrency, activity distribution (limited) Disadvantages Treats each local network as independent, which is false. The poor performance of ‘number of partners’ for predicting STD spread is a clear example. Impossible to account for how position in a larger context affects local network characteristics. “popular with who” If “structure matters”, ego-networks are strongly constrained to limit the information you can get on overall structure

Local Network Analysis Network Composition Perhaps the simplest network question is “what types of alters does ego interact with”? Network composition refers to the distribution of types of people in your network. Networks tend to be more homogeneous than the population. Using the GSS, Marsden reports heterogeneity in Age, Education, Race and Gender. He finds that: Age distribution is fairly wide, almost evenly distributed, though lower than the population at large Homogenous by education (30% differ by less than a year, on average) Very homogeneous with respect to race (96% are single race) Heterogeneous with respect to gender

Questions that you can ask / answer Mixing The extent to which one type of person is tied to another type of person (race by race, etc.) Aspects of the local context: Peer delinquency Cultural milieu Opportunities Social Support: Extent of resources (and risks) present in a type of network environment. Structural context (next class) Local Network Analysis General Questions

Calculating local network information. 1) From data, such as the GSS, which has ego-reported information on alter 2) From global network data, such as Add Health, where you have self-reports on alters behaviors. Local Network Analysis Mechanics

Calculating local network information 1: GSS style data. This is the easiest situation. Here you have a separate variable for each alter characteristic, and you can construct density items by summing over the relevant variables. You would, for example, have variables on age of each alter such as: Age_alt1 age_alt2 age_alt3 age_alt4 age_alt You get the mean age, then, with a statement such as: meanage=mean(Age_alt1, age_alt2, age_alt3, age_alt4, age_alt5); Be sure you know how the program you use (SAS, SPSS) deals with missing data. Local Network Analysis Mechanics

Calculating local network information 2: From a global network. There are multiple options when you have complete network information. Type of tie: Sent, Received, or both? Once you decide on a type of tie, you need to get the information of interest in a form similar to that in the example above. Local Network Analysis Mechanics

Calculating local network information from a global network. An example network: All senior males from a small (n~350) public HS. Local Network Analysis Mechanics

You need to: Construct a dataset with (a) ego's id. This allows you to link each person in the network. (b) age of each person, (c) the friendship nominations variables. Then you need to: a) Identify ego's friends b) Identify their age c) compare it to ego's age d) count it if it is greater than ego's. There is a SAS program described in the exercise that shows you how to do this kind of work, using the graduate student network data. Suppose you want to identify ego’s friends, calculate what proportion of ego’s female friends are older than ego, and how many male friends they have (this example came up in a model of fertility behavior). Local Network Analysis Mechanics

1) Go over how to translate network data from one program to another UCINET PAJEK 2) Go over the use of ego-net macros in SAS Local Network Analysis Mechanics