Link Prediction Topics in Data Mining Fall 2015 Bruno Ribeiro

Slides:



Advertisements
Similar presentations
Purnamrita Sarkar (Carnegie Mellon) Deepayan Chakrabarti (Yahoo! Research) Andrew W. Moore (Google, Inc.)
Advertisements

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Supervised Learning Techniques over Twitter Data Kleisarchaki Sofia.
Data Mining Classification: Alternative Techniques
Analysis and Modeling of Social Networks Foudalis Ilias.
Relationship Mining Network Analysis Week 5 Video 5.
An Introduction to Variational Methods for Graphical Models.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
LINK PREDICTION IN CO-AUTHORSHIP NETWORK Le Nhat Minh ( A N) Supervisor: Dongyuan Lu Aobo Tao Chen 1.
More on Rankings. Query-independent LAR Have an a-priori ordering of the web pages Q: Set of pages that contain the keywords in the query q Present the.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
CS 590M Fall 2001: Security Issues in Data Mining Lecture 3: Classification.
Purnamrita Sarkar (UC Berkeley) Deepayan Chakrabarti (Yahoo! Research) Andrew W. Moore (Google, Inc.) 1.
1 Fast Incremental Proximity Search in Large Graphs Purnamrita Sarkar Andrew W. Moore Amit Prakash.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
The structure of the Internet. The Internet as a graph Remember: the Internet is a collection of networks called autonomous systems (ASs) The Internet.
CPSC 422, Lecture 18Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18 Feb, 25, 2015 Slide Sources Raymond J. Mooney University of.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Discriminative and generative methods for bags of features
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Bayes Net Perspectives on Causation and Causal Inference
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Link Prediction.
Jerry Scripps N T O K M I N I N G E W R. Overview What is network mining? What is network mining? Motivation Motivation Preliminaries Preliminaries definitions.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
Online Social Networks and Media Absorbing Random Walks Link Prediction.
Purnamrita Sarkar (Carnegie Mellon) Deepayan Chakrabarti (Yahoo! Research) Andrew W. Moore (Google, Inc.)
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
DATA MINING LECTURE 10 Classification k-nearest neighbor classifier Naïve Bayes Logistic Regression Support Vector Machines.
1 Data Mining Lecture 5: KNN and Bayes Classifiers.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 24 – Classifiers 1.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Undirected Models: Markov Networks David Page, Fall 2009 CS 731: Advanced Methods in Artificial Intelligence, with Biomedical Applications.
The Link Prediction Problem for Social Networks David Libel-Nowell, MIT John Klienberg, Cornell Saswat Mishra sxm
A Graph-based Friend Recommendation System Using Genetic Algorithm
Collective Classification A brief overview and possible connections to -acts classification Vitor R. Carvalho Text Learning Group Meetings, Carnegie.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Classification Techniques: Bayesian Classification
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Link Prediction.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
LINK PREDICTION IN CO-AUTHORSHIP NETWORK Le Nhat Minh ( A N) Supervisor: Dongyuan Lu 1.
Slides for “Data Mining” by I. H. Witten and E. Frank.
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Contribution and Proposed Solution Sequence-Based Features Collective Classification with Reports Results of Classification Using Reports Collective Spammer.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Mining information from social media
DATA MINING LECTURE 10b Classification k-nearest neighbor classifier
1 Relational Factor Graphs Lin Liao Joint work with Dieter Fox.
Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Classification COMP Seminar BCB 713 Module Spring 2011.
Dependency Networks for Inference, Collaborative filtering, and Data Visualization Heckerman et al. Microsoft Research J. of Machine Learning Research.
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
Link Prediction Class Data Mining Technology for Business and Society
Link Prediction Seminar Social Media Mining University UC3M
A Theoretical Justification of Link Prediction Heuristics
Link Prediction and Network Inference
Network Science: A Short Introduction i3 Workshop
A Theoretical Justification of Link Prediction Heuristics
Theoretical Justification of Popular Link Prediction Heuristics
Asymmetric Transitivity Preserving Graph Embedding
Discriminative Probabilistic Models for Relational Data
Topological Signatures For Fast Mobility Analysis
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 18
Presentation transcript:

Link Prediction Topics in Data Mining Fall 2015 Bruno Ribeiro

Overview Deterministic Heuristics Methods Matrix / Probabilistic Methods Supervised Learning Approaches © 2015 Bruno Ribeiro

Goal Input: Snapshot of social network at time t Ouput: Predict edges that will be added to network at time t’ > t © 2015 Bruno Ribeiro

Product Recommendation Example of link prediction application 2 i j 5 2 (c) 2015, Bruno Ribeiro

Twitter’s Who to Follow Another example of link prediction application © 2015 Bruno Ribeiro

Other Applications A few more examples: Fraud Detection: (Beutel, Akoglu, Faloutsos, KDD’15) Focuses on discovering surprising link patterns Anomaly detection: (Rattigan et al, 2005) Focuses on finding unlikely existing links © 2015 Bruno Ribeiro

Deterministic Heuristics © 2015 Bruno Ribeiro

A Very Naïve Approach Every entity is assigned a score Score(v) = how many friends person already has © 2015 Bruno Ribeiro

General Approach Assign connection weight score(u, v) to non-existing edge (u, v) Rank edges and recommend ones with highest score Can be viewed as computing a measure of proximity or “similarity” between nodes u and v. © 2015 Bruno Ribeiro

Shortest Path Negated length of shortest path between u and v. All nodes that share one neighbor will be linked Problem: Network diameter often very small © 2015 Bruno Ribeiro

Common Neighbors Common neighbors uses as score the number of common neighbors between vertices u and v. Score can be high when vertices have too many neighbors © 2015 Bruno Ribeiro

Jaccard Similarity Fixes common neighbors “too many neighbors” problem by dividing the intersection by the union Between 0 and 1 © 2015 Bruno Ribeiro

Adamic / Adar This score gives more weight to neighbors that that are not shared with many others. 5 neighbors 50 neighbors Too active Less evidence of missing link between Alice and Bob Less active More evidence of missing link between Bob and Carol Alice Bob Carol © 2015 Bruno Ribeiro

Preferential Attachment The probability of co-authorship of u and v is proportional to the product of the number of collaborators of u and v © 2015 Bruno Ribeiro

Katz (1953) Exponentially weighted sum of number of paths of length l For small α << 1 : predictions ≈ common neighbors © 2015 Bruno Ribeiro

Hitting Time where, Hu,v is the random walk hitting time between u and v © 2015 Bruno Ribeiro

Personalized PageRank Stationary probability walker is at v under the following random walk: With probability α, jump back to u With probability 1 – α, go to random neighbor of current node © 2015 Bruno Ribeiro

SimRank N(u) and N(v) are number of in-degrees of nodes u and v Only directed graphs score(u,v) ∊ [0,1]. If u=v then, s(u,v)=1 © 2015 Bruno Ribeiro

Latent Space Models © 2015 Bruno Ribeiro

Low Rank Reconstruction Represent the adjacency matrix A with a lower rank matrix Ak. If Ak(u,v) has large value for a missing A(u,v)=0, then recommend link (u,v) Vn⨉k A Ak ≈ Uk⨉ n = © 2015 Bruno Ribeiro

Product Recommendations Users have a propensity rate to buy certain category of product (W) In a category some products have a propensity rate to be bought (H) 2 i j 5 (c) 2015, Bruno Ribeiro 2 (Non-negative matrix factorization)

Example Euclidean Generative Model Vertices uniformly distributed in latent unit hyperplane Vertices close in latent space more likely to be connected Probability of edge = f (distance on latent space) Unit plane © 2015 Bruno Ribeiro The problem of link prediction is to infer distances in the latent space, i.e., find the nearest neighbor in latent space not currently linked Raftery, A. E., Handcock, M. S., & Hoff, P. D. (2002). Latent space approaches to social network analysis. J. Amer. Stat. Assoc., 15, 460.

Bayesian Networks Directed Graphical Models Bayesian networks and Prob. Relational Models (Getoor et al., 2001, Neville & Jensen 2003) Captures the dependence of link existence on attributes of entities Constrains probabilistic dependency to directed acyclic graph Undirected Graphical Models Markov Networks © 2015 Bruno Ribeiro

Example of Today’s Approaches Whom To Follow and Why Credit: N. Barbieri, F. Bonchi, G. Manco KDD’14

Relational Markov Networks A Relational Markov Network (RMN) specifies cliques and the potentials between attributes of related entities at the template level Single model provides distribution for entire graph Train model to maximize the probability of observed edge and labels Use trained model to predict edges and unknown attributes

Link Prediction Heuristic Latent Space Rational Link Prediction Heuristic Most likely neighbor of node v ? Generative model Model parameters (latent space) θ Node u Node v © 2015 Bruno Ribeiro Observed edges

Relationship between Deterministic & Probabilistic methods? © 2015 Bruno Ribeiro Yes = (Sarkar, Chakrabarti, Moore, COLT’10)

How do we justify these observations? Empirically Especially if the graph is sparse How do we justify these observations? Link prediction accuracy* Random Shortest Path Common Neighbors Adamic/Adar Ensemble of short paths *Liben-Nowell & Kleinberg, 2003; Brand, 2005; Sarkar & Moore, 2007 Credit: Sarkar, Chakrabarti, Moore

Raftery’s Model (cont) Two sources of randomness Point positions: uniform in D dimensional space Linkage probability: logistic with parameters α, r α, r and D are known Higher probability of linking α determines the steepness 1 ½ radius r Credit: Sarkar, Chakrabarti, Moore

Raftery’s Model: Probability of New Edge j i Logistic function integrated over volume determined by dij Credit: Sarkar, Chakrabarti, Moore

Connection to Common Neighbors k j r Distinct radius per node gives Adamic/Adar Empirical Bernstein bounds (Maurer & Pontil, 2009) © 2015 Bruno Ribeiro Decreases with N Net size No. common neigh. i k j j

Supervised Learning Approaches Create binary classifier that predicts whether an edge exists between two nodes © 2015 Bruno Ribeiro

Types of Features Classifier SVM Decision Trees Label Features Keywords Frequency of an item Topological Features Shortest Distance Degree Centrality PageRank index SVM Decision Trees Deep Belief Network (DBN) K-Nearest Neighbors (KNN) Naive Bayes Radial Basis Function (RBF) Logistic Regression © 2015 Bruno Ribeiro Link Prediction Heuristic