A Viewpoint-based Approach for Interaction Graph Analysis

Slides:



Advertisements
Similar presentations
Complex Networks Advanced Computer Networks: Part1.
Advertisements

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
1 Intrusion Monitoring of Malicious Routing Behavior Poornima Balasubramanyam Karl Levitt Computer Security Laboratory Department of Computer Science UCDavis.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Analysis and Modeling of Social Networks Foudalis Ilias.
Relationship Mining Network Analysis Week 5 Video 5.
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
Copyright 2006, Data Mining Research Laboratory An Event-based Framework for Characterizing the Evolutionary Behavior of Interaction Graphs Sitaram Asur,
By: Roma Mohibullah Shahrukh Qureshi
Juan (Susan) Pan, Daniel Boston, and Cristian Borcea Department of Computer Science New Jersey Institute of Technology.
1 Analyzing Kleinberg’s (and other) Small-world Models Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
CSE 222 Systems Programming Graph Theory Basics Dr. Jim Holten.
DIDS part II The Return of dIDS 2/12 CIS GrIDS Graph based intrusion detection system for large networks. Analyzes network activity on networks.
Computer Science 1 Web as a graph Anna Karpovsky.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Clustering Unsupervised learning Generating “classes”
Gravity Control™: Is a new generation graphic user interface for searching, sorting and managing large amounts of data from different sources. Makes interaction.
My work: 1. Co-cluster users and content to summarize user  content relationships. 2. Define a new similarity index to efficiently answer complex queries.
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
Evolutionary Clustering and Analysis of Bibliographic Networks Manish Gupta (UIUC) Charu C. Aggarwal (IBM) Jiawei Han (UIUC) Yizhou Sun (UIUC) ASONAM 2011.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
1 Applications of Relative Importance  Why is relative importance interesting? Web Social Networks Citation Graphs Biological Data  Graphs become too.
MapReduce and Graph Data Chapter 5 Based on slides from Jimmy Lin’s lecture slides ( (licensed.
Resisting Denial-of-Service Attacks Using Overlay Networks Ju Wang Advisor: Andrew A. Chien Department of Computer Science and Engineering, University.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Understanding Crowds’ Migration on the Web Yong Wang Komal Pal Aleksandar Kuzmanovic Northwestern University
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Networks Igor Segota Statistical physics presentation.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
Finding Top-k Shortest Path Distance Changes in an Evolutionary Network SSTD th August 2011 Manish Gupta UIUC Charu Aggarwal IBM Jiawei Han UIUC.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
Local/Global Term Analysis for Discovering Community Differences in Social Networks David Fuhry, Yiye Ruan, and Srinivasan Parthasarathy Data Mining Research.
CS 8751 ML & KDDData Clustering1 Clustering Unsupervised learning Generating “classes” Distance/similarity measures Agglomerative methods Divisive methods.
Percolation and diffusion in network models Shai Carmi, Department of Physics, Bar-Ilan University Networks Percolation Diffusion Background picture: The.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Scaling Properties of the Internet Graph Aditya Akella, CMU With Shuchi Chawla, Arvind Kannan and Srinivasan Seshan PODC 2003.
Networks are connections and interactions. Networks are present in every aspect of life. Examples include economics/social/political sciences. Networks.
Informatics tools in network science
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
Selected Topics in Data Networking Explore Social Networks:
Community Detection based on Distance Dynamics Reporter: Yi Liu Student ID: Department of Computer Science and Engineering Shanghai Jiao Tong.
Web Page Clustering using Heuristic Search in the Web Graph IJCAI 07.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Dynamic Network Analysis Case study of PageRank-based Rewiring Narjès Bellamine-BenSaoud Galen Wilkerson 2 nd Second Annual French Complex Systems Summer.
Exploiting Input Features for Controlling Tunable Approximate Programs Sherry Zhou Department of electronic engineering Tsinghua University.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Graph clustering to detect network modules
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
DATA MINING Spatial Clustering
Groups of vertices and Core-periphery structure
Multi-level predictive analytics and motif discovery across large dynamic spatiotemporal networks and in complex sociotechnical systems: An organizational.
Personalized Social Image Recommendation
Graph Analysis by Persistent Homology
Distributed Representations of Subgraphs
Effective Social Network Quarantine with Minimal Isolation Costs
Lecture 13 Network evolution
Department of Computer Science University of York
CASE − Cognitive Agents for Social Environments
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Jiawei Han Department of Computer Science
SEG5010 Presentation Zhou Lanjun.
Lecture 21 Network evolution
Practical Applications Using igraph in R Roger Stanton
The Impact of Changes in Network Structure on Diffusion of Warnings
Presentation transcript:

A Viewpoint-based Approach for Interaction Graph Analysis Sitaram Asur and Srinivasan Parthasarathy Department of Computer Science and Engineering Ohio State University, Columbus, Ohio

Motivation Massive social network datasets give you both more and less [Kleinberg, 2007]: More: can observe global phenomena that are genuine, but literally invisible at smaller scales. Less: Don’t really know what any one node or link means. Easy to measure things; hard to pose nuanced questions Community-based analysis useful but limited All nodes in clusters generally treated the same Problem : To extract the local neighborhood of interest for a node To use structure and topology to quantify local relationships To observe effect of changes in the graph from the viewpoint of given node or set of nodes Potentially useful in search, recommendation and advertising

Viewpoint Neighborhood (VPN) VPN(S) : the graph rooted at source node S containing only nodes with some degree of importance to S and their interconnections. But how to measure importance ? Initial Solution : Use distance from source Depth-limited VPN for a node Subgraph representing the set of nodes reachable at a distance <=k from the node and the interactions among them Can be constructed using Depth-limited search (DLS) from the source node But is this enough?

Viewpoint Neighborhoods Problems with Depth-limited VPN All nodes the same distance away are treated the same Hub nodes need to be differentiated Criteria for constructing a VPN Inverse Distance Weighting: Involvement of a node to a VPN inversely proportional to its distance from the source node Intuition : Node is more affected by closer events Link Structure: Local topological information is important Well-connected nodes in the VPN are more important to source node Hub Nodes: Hub nodes can bloat neighborhoods by bringing in many unimportant nodes Need to expand hub nodes with low probability

Activation Spread Model Source node begins activation with a budget M It distributes M among its immediate neighbors activating them Each node retains some amount, activates its neighbors and continues the distribution Distribution handled by Activation Function Each node is activated at most once If a node is touched more than once, it retains the amount it receives Threshold used to hasten convergence Activated nodes form the VPN of source node Value present with each node represents its commitment value for VPN Related to the heat diffusion model for graphs

Betweenness-based Activation Activation Functions Inverse-degree Activation Down-weights nodes with high degrees Each node x retains 1/degree(x) of the amount received Rest distributed equally among its descendants Strong emphasis on hubs Weaker emphasis on link structure Betweenness-based Activation Compute local betweenness values for nodes within VPN Consider shortest paths between source node and members of the VPN Ratio of betweenness values used to distribute Strong emphasis on link structure Can be made to handle hubs by using inverse-degree to construct basic VPN first M/2 M/2 M/6 5M/6 M/6 M/6 M/6 M/2 Inverse Degree Activation Betweenness-based Activation

Activation Functions Semantic Activation Use semantic features from content to extract neighborhoods Semantic similarity w.r.t source used to decide distribution ratios Eliminates noise and irrelevant nodes Useful in personalized and keyword search applications In practice, combination of different activation functions can be employed Domain-specific features can be included

Neighborhood Sizes - Wikipedia Time Number of Nodes 1 6133 2 24160 3 56468 4 105829 5 147079 Global increase (23x) in number of nodes does not affect size of local neighborhoods too much!

Temporal Analysis for VPNs D C B A S Characterize evolution of Viewpoint neighborhoods over time Critical Events Grow Shrink Continue Mutate Attraction Repulsion E C H G A B S G H E F C S D C A B S DBLP : grow/shrink ratio ~1, low continue, high mutate, attract/repel ratio ~ 1 Wikipedia : grow/shrink ratio >>1, high continue events, attract >> repel

Behavioral Measures Incremental behavioral measures composed from events Stability, Sociability, Impact, Popularity

Conclusions Viewpoint Neighborhoods To identify a neighborhood of interest for a node and quantify local relationships within General activation spread model with different activation functions capturing topological, semantic and domain-specific attributes Extension to find the joint VPN of a group of nodes Evolutionary analysis to identify changes to VPNs over time Critical events to define behavior of neighborhoods Behavioral measures for sociability, stability, impact and popularity Pattern mining over VPNs Core Subgraphs to identify core influential structures w.r.t certain nodes Transformation Subgraphs to measure the effect of changes on the graph on specific viewpoint neighborhoods

Acknowledgements Grants: NSF: CAREER-IIS-0347662 NSF SGER Grant IIS-0742999 DOE: DE-FG02-04ER2561