Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Graph-Based Approach to Link Prediction in Social Networks Using a Pareto-Optimal Genetic Algorithm Jeff Naruchitparames University of Nevada, Reno -

Similar presentations


Presentation on theme: "A Graph-Based Approach to Link Prediction in Social Networks Using a Pareto-Optimal Genetic Algorithm Jeff Naruchitparames University of Nevada, Reno -"— Presentation transcript:

1 A Graph-Based Approach to Link Prediction in Social Networks Using a Pareto-Optimal Genetic Algorithm Jeff Naruchitparames University of Nevada, Reno - CSE CS 790: Complex Networks, Fall 2010

2 biological social 2

3 3

4 4

5 ‣S‣S ocial networks = ‣D‣D ynamic, judgmental environment ‣A‣A ffect friendships over time 5 very dynamicheterogeneous

6 6

7 7 ‣1‣1 -2 hop distance only ‣F‣F riend-of-friend

8 ‣M‣M ultiple hops; >1 ‣S‣S tructural; purely graph- based ‣N‣N o explicit correlation between potential friends... 8

9 ‣ Silva, et. al., ‣ A Graph-based Recommendation System Using Genetic Algorithms, 2010 9

10 10

11 11

12 Friends-of-Friends 2 hops Filter Order 12

13 Filtering “It’s more probable that you know a friend of your friend than any other random person” Mitchell M., Complex Systems: Network Thinking, 2006. 13

14 14

15 15

16 Indexes 16

17 ‣H‣H eterogeneity ‣H‣H uman behavior and preferences ‣M‣M ultiple hops 17 What’s missing?

18 Pretty much a filtering problem... 18 My approach

19 ‣C‣C omponents (for filtering) ‣B‣B etweenness centrality ‣C‣C ommunity detection ‣C‣C lique Percolation Method (CPM) ‣F‣F riends of friends ‣1‣1 0-dimensional Pareto-optimal genetic algorithm 19 My approach

20 Betweenness Centrality 20

21 Community Detection 21

22 ‣R‣R emove duplicates ‣R‣R emove our test cases ‣(‣( More on this later...) 22

23 The Genetic Algorithm Part 23

24 Pareto Fronts 24

25 The Features 1. # of shared friends 2. location 3. age range 4. general interest 5. music 6. attended same events 7. groups 8. movies 9. education 10. religion/politics 25

26 Pareto Optimality ‣L‣L ocalized to implementation of selection ‣F‣F eature subset selection ‣W‣W e want to find the best combination of these subsets that can give us the best solutions for how we determine friendships 26

27 Pareto Optimality and Feature Subset Selection 27 F1F1F1F1 F2F2F2F2 F3F3F3F3 F4F4F4F4 F5F5F5F5 F6F6F6F6 F7F7F7F7 F8F8F8F8 F9F9F9F9 F 10 C1C1C1C10101000110 C2C2C2C21100010101... CnCnCnCn0000100100

28 A Point System 28 F1F1F1F1 F2F2F2F2 F3F3F3F3 F4F4F4F4 F5F5F5F5 F6F6F6F6 F7F7F7F7 F8F8F8F8 F9F9F9F9 F 10 U1U1U1U1-3-11---2044- U2U2U2U2-1-13---319-... UnUnUnUn-10-14---4961-

29 Pareto Optimality ‣C‣C ompare with the test cases we removed earlier... ‣F‣F or all chromosomes in population, do: ‣I‣I f ALL test cases ≥ optimal Pareto front ‣C‣C alculate fitness ‣G‣G ood to go ‣E‣E lse ‣C‣C alculate fitness ‣C‣C ontinue onto next chromosome 29

30 Fitness Function ∑ ∑ p i ln( f j ) p i-1 30 n10 i=1j=1

31 Continuing on with the Evolutionary Process ‣A‣A pply fitness proportional selection ‣R‣R andomly select 2 parents to mate ‣A‣A pply 1-point crossover (82% chance) ‣B‣B it mutation (0.05% chance) ‣D‣D o this until ALL test cases better than Pareto front OR fitness does not improve for 5 consecutive generations 31

32 1-Point Crossover 32

33 ‣C‣C omplex network theory + Genetic algorithm + social theory ‣B‣B etweenness centrality ‣C‣C ommunity detection ‣C‣C lique Percolation Method ‣B‣B inary 10-dimensional Pareto-optimal genetic algorithm ‣D‣D ominant, fitness proportional selection ‣S‣S everal levels of filtering and selection (aka filtering ☺) 33 Conclusion

34 ‣B‣B etter fitness function (need to ask Sociologists) ‣W‣W eighted chromosome for Pareto optimization (as opposed to binary) ‣P‣P rove all this stuff actually works (sociology standpoint??) ‣P‣P arallelize or GPU-ize the code (it’s in Python) 34 Future Work

35 35


Download ppt "A Graph-Based Approach to Link Prediction in Social Networks Using a Pareto-Optimal Genetic Algorithm Jeff Naruchitparames University of Nevada, Reno -"

Similar presentations


Ads by Google