Are people associating based on gender similarity?

Slides:



Advertisements
Similar presentations
An introduction to exponential random graph models (ERGM)
Advertisements

E(X 2 ) = Var (X) = E(X 2 ) – [E(X)] 2 E(X) = The Mean and Variance of a Continuous Random Variable In order to calculate the mean or expected value of.
Normal Distribution; Sampling Distribution; Inference Using the Normal Distribution ● Continuous and discrete distributions; Density curves ● The important.
School of Information University of Michigan SI 614 Random graphs & power law networks preferential attachment Lecture 7 Instructor: Lada Adamic.
Lecture 9 Measures and Metrics. Structural Metrics Degree distribution Average path length Centrality Degree, Eigenvector, Katz, Pagerank, Closeness,
A/S/L? Homophily of Online and Face to Face Social Ties Gustavo S. Mesch & Ilan Talmud Department of Sociology and Anthropology, University of Haifa.
(hyperlink-induced topic search)
Sampling Methods.
Sampling Methods.
Section 1.2 Continued Discrimination in the Workplace: Inference through Simulation: Discussion.
The table shows a random sample of 100 hikers and the area of hiking preferred. Are hiking area preference and gender independent? Hiking Preference Area.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Networks and Surrounding Contexts Chapter 4, from D. Easley and J. Kleinberg book.
Probability Definition: Probability: the chance an event will happen. # of ways a certain event can occur # of possible events Probability =  Probability.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Math 3Warm Up4/23/12 Find the probability mean and standard deviation for the following data. 2, 4, 5, 6, 5, 5, 5, 2, 2, 4, 4, 3, 3, 1, 2, 2, 3, 4, 6,
Essential Statistics Chapter 191 Comparing Two Proportions.
The  2 (chi-squared) test for independence. One way of finding out is to perform a  2 (chi-squared) test for independence. We might want to find out.
Characteristics of Travelers Within Canada Data Management Project Gosia Przada MDM4U Mr. Brown Sacred Heart Catholic HS.
Lecture 9 Measures and Metrics. Cocitation and Bibliographic coupling 2.
Section 7.13: Homophily (or Assortativity)
Mingze Zhang, Mun Choon Chan and A. L. Ananda School of Computing
Hypothesis Testing Hypothesis testing is an inferential process
BIG IDEA Percent / / out of 100.
Standardized scores and the Normal Model
Multiply and Divide Fractions
Non-Parametric Statistics •. Fastest growing branch of
Testing Hypotheses about a Population Proportion
What does a population that is normally distributed look like?
Local Networks Overview Personal Relations: Core Discussion Networks
Lecture 9 Measures and Metrics.
Guerilla Data Inc. Presents:.
Adoption of Health Information Exchanges and Physicians’ Referral Patterns: Are they Mutually Reinforcing? SAEEDE EFTEKHARI*, School of Management, State.
Comparing Two Proportions
Section 8.6: Clustering Coefficients
Focus: Sociology is a behavioral science that looks a human behavior in groups. Sociologists must maintain objectivity, perspective and imagination. Sociology.
The Normal Distribution…
Milgram’s experiment really demonstrated two striking facts about large social networks: first, that short paths are there in abundance;
surrounding contexts:
Lecture 10 Measures and Metrics.
Network Science: A Short Introduction i3 Workshop
Section 8.6 of Newman’s book: Clustering Coefficients
Models of Network Formation
Models of Network Formation
Continuous Distributions
Models of Network Formation
Hypothesis testing. Chi-square test
Models of Network Formation
Clustering Coefficients
Lecture 36 Section 14.1 – 14.3 Mon, Nov 27, 2006
Assortativity (people associate based on common attributes)
Chi-square test or c2 test
Katz Centrality (directed graphs).
The 2 (chi-squared) test for independence
Testing Hypotheses about a Population Proportion
Section 8.3: Degree Distribution
Day 63 Agenda:.
Case Studies in Information Networks Part I
Some statistics questions answered:
EXAMPLE.
Degree Distribution Ralucca Gera,
Commute to Work Two-Way Frequency Table Female Male Total
Testing Hypotheses about a Population Proportion
Express the fraction to decimal.
Mathematical Foundations of BME Reza Shadmehr
1. What animal 2. Male or Female ? 4. Male or Female? Why? Why?
Fractions: Adding and Subtracting mixed numbers
“The Spread of Physical Activity Through Social Networks”
Testing Hypotheses about a Population Proportion
Presentation transcript:

Are people associating based on gender similarity?

First Make a Block Model Attribute N1 N2 N3 N4 N5 N6 N7 N8 N9 Male 1 Female

First Make a Block Model Attribute N1 N2 N3 N4 N5 N6 N7 N8 N9 Female 1 Male

First Make a Block Model Attribute N2 N4 N9 N1 N3 N5 N6 N7 N8 Female 1 Male

First Make a Block Model Block Densities Attribute N2 N4 N9 N1 N3 N5 N6 N7 N8 Female 1 Male

Naïve Approach – calculate the fraction of same gender ties 1 2 3 4 5 6 7 8 9 10 N1 N6 N3 N2 N7 N4 N8 N5 N9 72% (13/18) of the edges are between vertices of the same gender

Finding the number of same-class ties (“Turn off the mixed-class ties with a Kronecker Delta”) Kronecker Delta 𝛿 𝑐 𝑖 , 𝑐 𝑗 = 0, 𝑖𝑓 𝑐 𝑖 ≠ 𝑐 𝑗 &1, 𝑖𝑓 𝑐 𝑖 = 𝑐 𝑗

Finding the number of same-class ties (“Turn off the mixed-class ties with a Kronecker Delta”) Kronecker Delta Actual number of same-class ties 𝛿 𝑐 𝑖 , 𝑐 𝑗 = 0, 𝑖𝑓 𝑐 𝑖 ≠ 𝑐 𝑗 &1, 𝑖𝑓 𝑐 𝑖 = 𝑐 𝑗 𝑒𝑑𝑔𝑒𝑠 (𝑖,𝑗) 𝛿 𝑐 𝑖 , 𝑐 𝑗 = 1 2 𝑖𝑗 𝐴 𝑖𝑗 𝛿 𝑐 𝑖 , 𝑐 𝑗 =13

Kleinberg’s method of estimating the number of expected edges…

Proportion of Males and Females P(male) p = 6/9 N3 N2 P(Female) q = 3/9 N7 N4 N8 N5 N9

Probability of Selecting a Male or Female P(male) p = 6/9 p = 2/3 N3 N2 P(Female) q = 3/9 q = 1/3 N7 N4 N8 N5 N9

Probability of a Male selecting a Male-Male, Female-Female, Male-Female N1 N6 P(male) p = 6/9 p = 2/3 N3 P(m-m) p2 =4/9 N2 P(Female) q = 3/9 q = 1/3 N7 N4 P(f-f) q2 =1/9 N8 N5 N9 P(male-female) P(female-male) 2pq = 4/9

Male-Male, Female-Female, Male-Female Ties Expected number of Male-Male, Female-Female, Male-Female Ties N1 N6 P(male) p = 6/9 p = 2/3 N3 P(m-m) p2 =4/9 p2 =8/18 N2 P(Female) q = 3/9 q = 1/3 N7 N4 P(f-f) q2 =1/9 q2 =2/18 N8 N5 N9 P(male-female) P(female-male) 2pq = 4/9 2pq = 8/18

Expected number of Male-Male, Female-Female, Male-Female Ties P(male) p = 6/9 p = 2/3 N3 P(m-m) p2 =4/9 p2 =8/18 8 M-M N2 P(Female) q = 3/9 q = 1/3 N7 N4 P(f-f) q2 =1/9 q2 =2/18 2 F-F N8 N5 N9 P(male-female) P(female-male) 2pq = 4/9 2pq = 8/18 8 M-F Total expected # of same gender ties: 10

Newman’s approach “make connections at random while preserving the vertex degrees. Ignoring vertex degrees and making connections truly at random has been show to give much poorer results” 1 2 𝑖𝑗 𝑘 𝑖 𝑘 𝑗 2𝑚 𝛿 𝑐 𝑖 , 𝑐 𝑗

Expected number of same-class ties 1 2 𝑖𝑗 𝑘 𝑖 𝑘 𝑗 2𝑚 𝛿 𝑐 𝑖 , 𝑐 𝑗 =10.36

Measuring the Presence of Homophily – Calculating modularity If there is no homophily effect, we should expect to see 10.36 same gender ties. Since we see 13 same gender ties instead of 10.36, there is some evidence of homophily We see about 3 more same gender ties than we would expect if gender had no effect on tie formation. 1 2 𝑖𝑗 𝐴 𝑖𝑗 𝛿 𝑐 𝑖 , 𝑐 𝑗 − 1 2 𝑖𝑗 𝑘 𝑖 𝑘 𝑗 2𝑚 𝛿 𝑐 𝑖 , 𝑐 𝑗 = 1 2 𝑖𝑗 𝐴 𝑖𝑗 − 𝑘 𝑖 𝑘 𝑗 2𝑚 𝛿 𝑐 𝑖 , 𝑐 𝑗

Measuring the Presence of Homophily - Calculating modularity If there is no homophily effect, we should expect to see 57% same gender ties. Since we see 72% same gender ties instead of 57%, there is some evidence of homophily We see 14.6% more same gender ties than what we would expect if gender had no effect on tie formation. The modularity score is 0.146 𝑄= 1 2𝑚 𝑖𝑗 𝐴 𝑖𝑗 − 𝑘 𝑖 𝑘 𝑗 2𝑚 𝛿 𝑐 𝑖 , 𝑐 𝑗 =0.146

A much easier way to calculate modularity using a “Mixing Matrix”

Making Sociology Relevant: What do we want to say? A few empirical facts: Some racially heterogeneous schools are socially segregated

Making Sociology Relevant: What do we want to say? A few empirical facts: … while other heterogeneous schools are socially integrated. Why?

Making Sociology Relevant: What do we want to say?

Scalar Characteristics Assortative Mixing by Scalar Characteristics