Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Advertisements

Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Note to other teachers and users of these.
Finding your friends and following them to where you are by Adam Sadilek, Henry Kautz, Jeffrey P. Bigham Presented by Guang Ling 1.
Modeling Relationship Strength in Online Social Networks Rongjian Xiang 1, Jennifer Neville 1, Monica Rogati 2 1 Purdue University, 2 LinkedIn WWW 2010.
1 Yuxiao Dong *$, Jie Tang $, Sen Wu $, Jilei Tian # Nitesh V. Chawla *, Jinghai Rao #, Huanhuan Cao # Link Prediction and Recommendation across Multiple.
Graph Data Management Lab School of Computer Science , Bristol, UK.
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Web Projections Learning from Contextual Subgraphs of the Web Jure Leskovec, CMU Susan Dumais, MSR Eric Horvitz, MSR.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
1 Applications of Relative Importance  Why is relative importance interesting? Web Social Networks Citation Graphs Biological Data  Graphs become too.
1 Contact Prediction, Routing and Fast Information Spreading in Social Networks Kazem Jahanbakhsh Computer Science Department University of Victoria August.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Iterative Readability Computation for Domain-Specific Resources By Jin Zhao and Min-Yen Kan 11/06/2010.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
윤언근 DataMining lab.  The Web has grown exponentially in size but this growth has not been isolated to good-quality pages.  spamming and.
Abstract Background: In this work, a candidate gene prioritization method is described, and based on protein-protein interaction network (PPIN) analysis.
Predicting Positive and Negative Links in Online Social Networks
The Matrix: Using Intermediate Features to Classify and Predict Friends in a Social Network Michael Matczynski Status Report April 14, 2006.
PREDIcT: Towards Predicting the Runtime of Iterative Analytics Adrian Popescu 1, Andrey Balmin 2, Vuk Ercegovac 3, Anastasia Ailamaki
1 K-Means+ID3 A Novel Method for Supervised Anomaly Detection by Cascading K-Means Clustering and ID3 Decision Tree Learning Methods Author : Shekhar R.
Finding Top-k Shortest Path Distance Changes in an Evolutionary Network SSTD th August 2011 Manish Gupta UIUC Charu Aggarwal IBM Jiawei Han UIUC.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
LINK PREDICTION IN CO-AUTHORSHIP NETWORK Le Nhat Minh ( A N) Supervisor: Dongyuan Lu 1.
Comparing the use of Network Structure and Profile Characteristics in Discovering Groups in Social Networks John Johnson.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Network Lasso: Clustering and Optimization in Large Graphs
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying, Eric Hsueh-Chan Lu, Wen-Ning Kuo and Vincent S. Tseng Institute of Computer Science.
© Devi Parikh 2008 Devi Parikh and Tsuhan Chen Carnegie Mellon University April 3, ICASSP 2008 Bringing Diverse Classifiers to Common Grounds: dtransform.
Jure Leskovec (Stanford), Daniel Huttenlocher and Jon Kleinberg (Cornell)
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.
Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Link Prediction Class Data Mining Technology for Business and Society
P.Demestichas (1), S. Vassaki(2,3), A.Georgakopoulos(2,3)
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Xiang Li,1 Lili Mou,1 Rui Yan,2 Ming Zhang1
Romantic Partnerships and the Dispersion of Social Ties: A Network Analysis of Relationship Status on Facebook By: Lars Backstrom - Facebook Inc, Jon Kleinberg.
Source: Procedia Computer Science(2015)70:
Estimating Link Signatures with Machine Learning Algorithms
Link Prediction Seminar Social Media Mining University UC3M
Collective Network Linkage across Heterogeneous Social Platforms
Edge Weight Prediction in Weighted Signed Networks
Link Prediction and Network Inference
Networks with Signed Edges
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Asymmetric Transitivity Preserving Graph Embedding
CS224w: Social and Information Network Analysis
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
GANG: Detecting Fraudulent Users in OSNs
Mingzhen Mo and Irwin King
Affiliation Network Models of Clusters in Networks
Semi-Supervised Learning
PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs.
Analysis of Large Graphs: Overlapping Communities
Modeling Topic Diffusion in Scientific Collaboration Networks
Presentation transcript:

Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present by Mo Mingzhen

Problem Friendship is important on social networks How to predict the future interaction How to recommend potential friends to new user? Link Prediction Problem

Motivation Predicting future interaction brings direct business consequences: possible collaborations Beyond social networks: predicting coauthor/collaborations In link prediction problem, how to combine the node and edge attributes remains an open challenge

Method Based on the Supervised Random Walks – Combines the network structure with the characteristics of nodes and edges Develop an algorithm to estimate the edge strength – bias a PageRank-like random walk to visits given nodes more often

Problem Formulation Given G(V, E) A start point s, learning candidate C = {c i } Destination nodes D = {d 1,…,d k }, no-link nodes L = {l 1,…,l n }, C = D ∪ L For edge (u, v) we compute the strength a uv = f w (ψ uv )

Optimization p is the vector of PageRank scores A “soft” version

Algorithm

Experiments on Synthetic Data A scale-free graph G with 10,000 nodes Evaluated by classification accuracy Strength func. *AUC – Area under the ROC curve. 1.0 means perfect classification and 0.5 means random guessing.

Experiments on Real Data Four co-authorship networks and the Facebook network of Iceland Strength func.

Interaction Procedure The method basically converges in only about 25 iterations

Results LR: logistic regression, precision at top 20

Methods Comparison some unsupervised baselines & two supervised learning methods

Conclusion The Supervised Random Walks has great improvement over Random Walks. It outperforms supervised machine learning techniques It combines rich node and edge features with the structure of the network Apply to: recommendations, anomaly detection, missing link, and expertise search and ranking