1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong #, Jing Zhang +, Jie Tang +, Nitesh V. Chawla #, Bai Wang* # University of Notre Dame + Tsinghua.

Slides:



Advertisements
Similar presentations
Oracle Labs Graph Analytics Research Hassan Chafi Sr. Research Manager Oracle Labs Graph-TA 2/21/2014.
Advertisements

Confluence: Conformity Influence in Large Social Networks
Finding your friends and following them to where you are by Adam Sadilek, Henry Kautz, Jeffrey P. Bigham Presented by Guang Ling 1.
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
1 Inferring User Demographics and Social Strategies in Mobile Social Networks Yuxiao Dong #, Yang Yang +, Jie Tang +, Yang Yang #, Nitesh V. Chawla # #
Ming Yan, Jitao Sang, Tao Mei, ChangSheng Xu
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
1 Yuxiao Dong *$, Jie Tang $, Sen Wu $, Jilei Tian # Nitesh V. Chawla *, Jinghai Rao #, Huanhuan Cao # Link Prediction and Recommendation across Multiple.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Analysis of Large-Scale Cell Phone Networks Course Project Leman Akoglu Bhavana Dalvi Skyler Speakman April
N EIGHBORHOOD F ORMATION AND A NOMALY D ETECTION IN B IPARTITE G RAPHS Jimeng Sun, Huiming Qu, Deepayan Chakrabarti & Christos Faloutsos Jimeng Sun, Huiming.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
Who Will Follow You Back? Reciprocal Relationship Prediction* 1 John Hopcroft, 2 Tiancheng Lou, 3 Jie Tang 1 Department of Computer Science, Cornell University,
Honglei Zhuang1, Jing Zhang2, George Brova1,
A Multi-Agent Learning Approach to Online Distributed Resource Allocation Chongjie Zhang Victor Lesser Prashant Shenoy Computer Science Department University.
國立陽明大學生資學程 陳虹瑋. Genetic Algorithm Background Fitness function ……. population selection Cross over mutation Fitness values Random cross over.
To Trust of Not To Trust? Predicting Online Trusts using Trust Antecedent Framework Viet-An Nguyen 1, Ee-Peng Lim 1, Aixin Sun 2, Jing Jiang 1, Hwee-Hoon.
Dependency networks Sushmita Roy BMI/CS 576 Nov 26 th, 2013.
1 Yuxiao Dong *, Jie Tang $, Tiancheng Lou #, Bin Wu &, Nitesh V. Chawla * How Long will She Call Me? Distribution, Social Theory and Duration Prediction.
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Active Learning for Networked Data Based on Non-progressive Diffusion Model Zhilin Yang, Jie Tang, Bin Xu, Chunxiao Xing Dept. of Computer Science and.
Personalized Influence Maximization on Social Networks
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Data Mining and Machine Learning Lab Network Denoising in Social Media Huiji Gao, Xufei Wang, Jiliang Tang, and Huan Liu Data Mining and Machine Learning.
Jure Leskovec PhD: Machine Learning Department, CMU Now: Computer Science Department, Stanford University.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Discovering Meta-Paths in Large Heterogeneous Information Network
Predicting Positive and Negative Links in Online Social Networks
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
1 Yang Yang *, Yizhou Sun +, Jie Tang *, Bo Ma #, and Juanzi Li * Entity Matching across Heterogeneous Sources *Tsinghua University + Northeastern University.
1 Panther: Fast Top-K Similarity Search on Large Networks Jing Zhang 1, Jie Tang 1, Cong Ma 1, Hanghang Tong 2, Yu Jing 1, and Juanzi Li 1 1 Department.
Link Prediction Topics in Data Mining Fall 2015 Bruno Ribeiro
Date: 2014/05/27 Author: Xiangnan Kong, Bokai Cao, Philip S. Yu Source: KDD’13 Advisor: Jia-ling Koh Speaker: Sheng-Chih Chu Multi-Label Classification.
Network Community Behavior to Infer Human Activities.
Dependency networks Sushmita Roy BMI/CS 576 Nov 25 th, 2014.
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
Du, Faloutsos, Wang, Akoglu Large Human Communication Networks Patterns and a Utility-Driven Generator Nan Du 1,2, Christos Faloutsos 2, Bai Wang 1, Leman.
Towards Social User Profiling: Unified and Discriminative Influence Model for Inferring Home Locations Rui Li, Shengjie Wang, Hongbo Deng, Rui Wang, Kevin.
1 Yang Yang *, Yizhou Sun +, Jie Tang *, Bo Ma #, and Juanzi Li * Entity Matching across Heterogeneous Sources *Tsinghua University + Northeastern University.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Class 2: Graph Theory IST402.
PhD Thesis Proposal Evolution in Social Networks Candidate Giulio Rossetti Supervisor Dino Pedreschi Supervisor Fosca Giannotti Pisa, Computer Science.
ClusCite:Effective Citation Recommendation by Information Network-Based Clustering Date: 2014/10/16 Author: Xiang Ren, Jialu Liu,Xiao Yu, Urvashi Khandelwal,
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
Link Prediction Class Data Mining Technology for Business and Society
Jie Tang Computer Science, Tsinghua
P.Demestichas (1), S. Vassaki(2,3), A.Georgakopoulos(2,3)
Uncovering the Mystery of Trust in An Online Social Network
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Cross-lingual Knowledge Linking Across Wiki Knowledge Bases
Jie Tang Computer Science, Tsinghua
Link Prediction Seminar Social Media Mining University UC3M
Link Prediction and Network Inference
CIKM Competition 2014 Second Place Solution
CIKM Competition 2014 Second Place Solution
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Asymmetric Transitivity Preserving Graph Embedding
GANG: Detecting Fraudulent Users in OSNs
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Representation Learning on Networks
Presentation transcript:

1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong #, Jing Zhang +, Jie Tang +, Nitesh V. Chawla #, Bai Wang* # University of Notre Dame + Tsinghua University * Beijing University of Posts and Telecommunications

: : : :29 …… :00 …… Mobile Networks

: : : :29 …… :00 …… Mobile Networks

4 Disease-Gene Networks 1.K.I. Goh, M. E. Cusick, D. Valle, B. Childs, M. Vidal, and A.-L. Barabási. The human disease network. PNAS D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One J. Menche, A. Sharma, M. Kitsak, S. D. Ghiassian, M. Vidal, J. Loscalzo, A.-L. Barabási. Uncovering disease-disease relationships through the incomplete interactome. Science Disease network Cross networkGene network

5 Coupled Networks Given a source network G S = (V S, E S ) and a target network G T = (V T, E T ), they compose coupled networks if there exists a cross link e ij with one node v i ∈ V S and the other node v j ∈ V T. The cross network G C = (V C, E C ) is a bipartite network containing all the cross links in the coupled networks. Source network Target network Cross network Coupled networks

6 Coupled Link Prediction InputOutput Source network Cross networkTarget network Given the source network G S and the cross network G C in coupled networks G = (G S, G T, G C ), the task is to find a predictive function: f : (G S, G C ) → Y T where Y T is the set of labels for the potential links in the target network G T.

7 Challenges InputOutput Source network Cross networkTarget network Incompleteness Heterogeneity Asymmetry

8 Related Work: Traditional Link Prediction 1. D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. CIKM’03. B D C A E F G t1t1 B D C A E F G InputOutput t2t2

9 Related Work: Heterogeneous Link Prediction Input 1. Y. Sun, J. Han, C. C. Aggarwal, N. V. Chawla. Will Will This Happen? Relationship Prediction in Heterogeneous Information Networks. WSDM’12. Output

10 Related Work: Transfer Link Prediction InputOutput 1.Y. Dong, J. Tang, S. Wu, J. Tian, N. V. Chawla. J. Rao, H. Cao. Link Prediction and Recommendation across Heterogeneous Networks. ICDM’12 2.J. Tang, T. Lou, J. Kleinberg, S. Wu. Transfer Link Prediction across Heterogeneous Networks. TOIS Source network Target network

11 Related Work: Cross-Domain Link Prediction InputOutput 1. J. Tang, S. Wu, J. Sun, H. Su. Cross-Domain Collaboration Recommendation. KDD’12. Source network Cross network Target network

12 Related Work: Anchor Link Prediction InputOutput 1.X. Kong, J. Zhang, P. S. Yu. Inferring anchor links across multiple heterogeneous social networks. CIKM’13. 2.Y. Zhang, J. Tang, Z. Yang, J. Pei, and P. S. Yu. COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency. KDD’15. A network Self-linkage network B network

13 Challenges InputOutput Source network Cross networkTarget network Incompleteness Heterogeneity Asymmetry

14 CoupledLP Framework 1. Implicit Target Network Construction Solve Incompleteness 2. Coupled Factor Graph Model Solve Asymmetry Solve Heterogeneity

15 CoupledLP Framework InputOutput Source network Cross networkTarget network 99% 80% 75% 87% Implicit Target Network Construction Incompleteness 1. V. Leroy, B. B. Cambazoglu, and F. Bonchi. Cold start link prediction. In KDD ’10.

16 CoupledLP: Implicit Target Network 1. R. Guha, R. Kumar, P. Raghavan, and A. Tomkins. Propagation of trust and distrust. In WWW ’04 Atomic Propagations for constructing an implicit target network DirectCouplingCo-citation MM MM T MTMMTM ++ top z%

17 CoupledLP Framework InputOutput Source network Cross networkTarget network 99% 80% 75% 87% Coupled Factor Graph AsymmetryHeterogeneity

18 Basic Idea y 12 y 34 y 23 v1v1 v2v2 v3v3 v4v4 v5v5 v6v6 1.F. R. Kschischang, B. J. Frey, and H. andrea Loeliger. Factor graphs and the sum-product algorithm. In IEEE TOIT Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12 Input NetworkFactor Graph

19 CoupledLP: Coupled Factor Graph

20 CoupledLP: Coupled Factor Graph model source and target network separatelymeta-path Asymmetry Attribute Factor 1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12.

21 CoupledLP: Coupled Factor Graph model source and target network separatelymeta-path Heterogeneity Asymmetry Meta-path Factor 1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12.

22 CoupledLP: Coupled Factor Graph 1.Y. Sun, J. Han, C. C. Aggarwal, N. V. Chawla. Will Will This Happen? Relationship Prediction in Heterogeneous Information Networks. In WSDM’12. meta-path disease express express -1 associate gene disease express -1 gene express ? ? ……

23  Factor Initialization: exponential-linear CoupledLP: Coupled Factor Graph  Objective Function: model source & target network separately meta-path bridge source & target networks

24 Learning: Gradient Decent method 1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12. CoupledLP: Coupled Factor Graph

25 CoupledLP Framework 1. Implicit Target Network Construction Solve Incompleteness 2. Coupled Factor Graph Model Solve Asymmetry Solve Heterogeneity

26 Experiments: Data 1.D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14. k: average degree; cc: clustering coefficient; ac: associative coefficient Healthcare Networks Disease (D)---Gene (G)

27 Experiments: Data 1.D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14. k: average degree; cc: clustering coefficient; ac: associative coefficient Healthcare Networks Disease (D)---Gene (G) Mobile Phone Call Networks Two Operators: Aa---Ab

28 Experiments: Data 1.D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14. k: average degree; cc: clustering coefficient; ac: associative coefficient Healthcare Networks Disease (D)---Gene (G) Mobile Phone Call Networks Two Operators: Aa---Ab Mobile Phone Call Networks Three Operators: Ea---Eb--- Ec

29 Experiments: Data k: average degree; cc: clustering coefficient; ac: associative coefficient AsymmetryHeterogeneity

30 Experiments: Coupled Networks Coupled Prediction Cases

31 Experiments AUPR or AUROC or k 10 Coupled Prediction Cases

32 Baselines 1.R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. New perspectives and methods in link prediction. In KDD ’10. 2.L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM’11. AUPR or AUROC or k Unsupervised Methods: Common Neighbors (CN) Adamic Adar (AA) Jaccard Coefficient (JC) Preferential Attachment (PA) PropFlow (PF) Implicit Target Network (IT) Supervised Methods: Logistic Regression (LRC) o LRC-IT Decision Tree (DT) o DT-IT CoupledLP CoupledLP-IT LRC-IT, DT-IT, CoupledLP-IT: NO Implicit Target network construction features

33 Experiments AUPR or AUROC or k Training Links: source links between nodes with cross links 1% target links Test Links: 99% target links

34 Evaluation Metrics 1.R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. New perspectives and methods in link prediction. In KDD ’10. 2.L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM’11. AUPR or AUROC or k Area Under Precision Recall Curve (AUPR) Area Under Receiver Operating Characteristic Curve (AUROC) Precision at Top k

35 AUPR Results AUPR

36 AUROC Results AUROC

37 Results k

38 Effects of Implicit Target Network AUPR

39 Effects of Implicit Target Network AUPR

40 Future Work 1. Efficiency of CoupledLP 2. One-step prediction framework 3. User behavior in coupled networks …

41 Conclusion Output Source network Cross networkTarget network Input  Coupled Link Prediction Problem  CoupledLP Framework Implicit Target Network Construction Solve Incompleteness Coupled Factor Graph Model Solve Asymmetry Solve Heterogeneity

42 Thank You! Questions Data & Code:

43 Effects of Implicit Target Network x-axis: pruning threshold z y-axis: AUPR / AUROC +5% AUPR +8% AUROC Unsupervised methods on implicit target network