Xiaowei Ying, Xintao Wu Dept. Software and Information Systems Univ. of N.C. – Charlotte 2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia.

Slides:



Advertisements
Similar presentations
William Liu1, Harsha Sirisena2, Krzysztof Pawlikowski2
Advertisements

Complex Networks Luis Miguel Varela COST meeting, Lisbon March 27 th 2013.
Social network partition Presenter: Xiaofei Cao Partick Berg.
ICDE 2014 LinkSCAN*: Overlapping Community Detection Using the Link-Space Transformation Sungsu Lim †, Seungwoo Ryu ‡, Sejeong Kwon§, Kyomin Jung ¶, and.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Introduction to Network Theory: Modern Concepts, Algorithms
Analysis and Modeling of Social Networks Foudalis Ilias.
Modularity and community structure in networks
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
Spectrum Based RLA Detection Spectral property : the eigenvector entries for the attacking nodes,, has the normal distribution with mean and variance bounded.
Advanced Topics in Data Mining Special focus: Social Networks.
V4 Matrix algorithms and graph partitioning
Xiaowei Ying Xintao Wu Univ. of North Carolina at Charlotte 2009 SIAM Conference on Data Mining, May 1, Sparks, Nevada Graph Generation with Prescribed.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Mining and Searching Massive Graphs (Networks)
Leting Wu Xiaowei Ying, Xintao Wu Dept. Software and Information Systems Univ. of N.C. – Charlotte Reconstruction from Randomized Graph via Low Rank Approximation.
Lecture 21: Spectral Clustering
Social Networks and Graph Mining Christos Faloutsos CMU - MLD.
1 Epidemic Spreading in Real Networks: an Eigenvalue Viewpoint Yang Wang Deepayan Chakrabarti Chenxi Wang Christos Faloutsos.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Topologically biased random walks with application for community finding Vinko Zlatić Dep. Of Physics, “Sapienza”, Roma, Italia Theoretical Physics Division,
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Structure based Data De-anonymization of Social Networks and Mobility Traces Shouling Ji, Weiqing Li, and Raheem Beyah Georgia Institute of Technology.
Models of Influence in Online Social Networks
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Spectral coordinate of node u is its location in the k -dimensional spectral space: Spectral coordinates: The i ’th component of the spectral coordinate.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Preserving Link Privacy in Social Network Based Systems Prateek Mittal University of California, Berkeley Charalampos Papamanthou.
Wherefore Art Thou R3579X? Anonymized Social Networks, Hidden Patterns, and Structural Stenography.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
1 Burning a graph as a model of social contagion Anthony Bonato Ryerson University Institute of Software Chinese Academy of Sciences.
Using Graph Theory to Study Neural Networks (Watrous, Tandon, Conner, Pieters & Ekstrom, 2012)
Resisting Structural Re-identification in Anonymized Social Networks Michael Hay, Gerome Miklau, David Jensen, Don Towsley, Philipp Weis University of.
Self-Similarity of Complex Networks Maksim Kitsak Advisor: H. Eugene Stanley Collaborators: Shlomo Havlin Gerald Paul Zhenhua Wu Yiping Chen Guanliang.
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
Workshop on Applications of Wireless Communications (WAWC 2008) 21 August 2008, Lappeenranta - Finland CONTROLLING EPIDEMICS IN WIRELESS NETWORKS Ranjan.
Mining Social Networks for Personalized Prioritization Shinjae Yoo, Yiming Yang, Frank Lin, II-Chul Moon [KDD ’09] 1 Advisor: Dr. Koh Jia-Ling Reporter:
Xiaowei Ying, Xintao Wu Univ. of North Carolina at Charlotte PAKDD-09 April 28, Bangkok, Thailand On Link Privacy in Randomizing Social Networks.
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Xiaowei Ying, Leting Wu, Xintao Wu University of North Carolina at Charlotte Privacy and Spectral Analysis on Social Network Randomization.
On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-Line Social Networks Anthony Bonato Ryerson University WAW’2009 February 13, 2009 nt.
Synchronization in complex network topologies
Privacy Preserving Payments in Credit Networks By: Moreno-Sanchez et al from Saarland University Presented By: Cody Watson Some Slides Borrowed From NDSS’15.
When Affinity Meets Resistance On the Topological Centrality of Edges in Complex Networks Gyan Ranjan University of Minnesota, MN [Collaborators: Zhi-Li.
Miniconference on the Mathematics of Computation
Mix networks with restricted routes PET 2003 Mix Networks with Restricted Routes George Danezis University of Cambridge Computer Laboratory Privacy Enhancing.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Privacy Preserving in Social Network Based System PRENTER: YI LIANG.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P9-1 Large Graph Mining: Power Tools and a Practitioner’s guide Christos Faloutsos Gary Miller Charalampos.
Arizona State University Fast Eigen-Functions Tracking on Dynamic Graphs Chen Chen and Hanghang Tong - 1 -
Privacy Issues in Graph Data Publishing Summer intern: Qing Zhang (from NC State University) Mentors: Graham Cormode and Divesh Srivastava.
Xiaowei Ying, Kai Pan, Xintao Wu, Ling Guo Univ. of North Carolina at Charlotte SNA-KDD June 28, 2009, Paris, France Comparisons of Randomization and K-degree.
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
Graph clustering to detect network modules
Random Walk for Similarity Testing in Complex Networks
Network analysis.
Graph Analysis by Persistent Homology
Community detection in graphs
Graph and Tensor Mining for fun and profit
Large Graph Mining: Power Tools and a Practitioner’s guide
Properties and applications of spectra for networks
Detecting Important Nodes to Community Structure
Advanced Topics in Data Mining Special focus: Social Networks
Chapter 9 Graph algorithms
Presentation transcript:

Xiaowei Ying, Xintao Wu Dept. Software and Information Systems Univ. of N.C. – Charlotte 2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Randomizing Social Network: A Spectrum Preserving Approach

Randomizing Social Network: a Spectrum Preserving Approach, SDM SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Framework Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 2

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Background & Motivation Social Network 3 Randomizing Social Network: a Spectrum Preserving Approach, SDM08 Friendship in Karate club [Zachary, 77] Biological association network of dolphins [Lusseau et al., 03] Collaboration network of scientists [Newman, 06] Network of US political books (105 nodes, 441 edges) Books about US politics sold by Amazon.com. Edges represent frequent co-purchasing of books by the same buyers. Nodes have been given colors of blue, white, or red to indicate whether they are "liberal", "neutral", or "conservative".

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Background & Motivation Privacy Issues in Social Network: Social network contains much private relation information; Anonymization is not enough for protecting the privacy. Subgraph attacks [Backstrom et al., WWW07, Hay et al., 07]. 4 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Background & Motivation Graph Randomization/Perturbation: 1. Random Add/Del edges (no. of edges unchanged) 2. Random Switch edges (nodes’ degree unchanged) 5 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Background & Motivation Graph perturbation is resilient to subgraph attacks (refer to our paper for more details). 6 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Motivation Graph Randomization/Perturbation: Data utility: How will the graph structure change due to perturbation? How to preserve graph structural features better? Data privacy: Protection on the link privacy. 7 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum & Structure Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 8

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum and Structure Numerous properties and measures of networks (Graph G contains n nodes and m edges): Harmonic mean of shortest distance; Transitivity(cluster coefficient) Subgraph centrality; Modularity (community structure); And many others 9 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum and Structure Adjacency Matrix (Graph G contains n nodes and m edges): Adjacency Spectrum A is symmetric, it has n real eigenvalues: 10 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum and Structure Laplacian Matrix: Laplacian Spectrum 11 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum and Structure Many real graph structural features are related to adjacency/Laplacian spectrum, e.g.: No. of triangles: Subgraph centrality: Graph diameter: k disconnected parts in the graph ⇔ k 0’s in the Laplacian spectrum. 12 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Graph Spectrum and Structure Two important eigenvalues: and 1. The maximum degree, chromatic number, clique number etc. are related to ; 2. Epidemic threshold for virus propagates in the network is related to [Wang et al., KDD03]; 3. indicates the community structure of the graph: clear community structure ⇔ ≈ Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum & Perturbation Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 14

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum & Perturbation Graph Perturbation: 1. Random Add/Del edges (no. of edges doesn’t change) 2. Random Switch edges (nodes’ degree doesn’t change) 15 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum & Perturbation Both topological and spectral graph features change along the perturbation, and they shows similar trends. (Networks of US political books, 105 nodes and 441 edges) 16 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum & Perturbation General bound on spectrum in perturbation: Do the randomization for k times (refer to our paper for more details) 17 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 18

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Intuition: since spectrum is related to many graph topological features, can we preserve more structural features by controlling the movement of eigenvalues? 19 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Spectral Switch (apply to adjacency matrix): To increase the eigenvalue: To decrease the eigenvalue: 20 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Spectral Switch (apply to Laplacian matrix): To decrease the eigenvalue: To increase the eigenvalue: 21 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Evaluation: (Networks of US political books, 105 nodes and 441 edges) 22 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Spectrum Preserving Randomization Similarly, we also develop Spectral Add/Del strategy (Refer to our paper for more details) In summary, by controlling the movement of the eigenvalues, spectrum can preserving randomization strategies better preserve the graph structure. 23 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Privacy Protection Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 24

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Privacy Protection Privacy protection measure: A-prior probability (without the released data): Posterior probability (with released the data & perturbation parameters): The absolute measure The relative measure 25 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Privacy Protection How many times shall we do add/del or switches? Objective: the minimum level of protection should be above some threshold: For random add/del and random switch 26 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Privacy Protection Spectral strategy and random strategy do not differ much in protecting the privacy: In the graph, there exits both up-edge pairs and down-edge pairs. Their proportions affect the privacy protection of spectral strategy Further study in future work 27 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Conclusion & Future Work Background & Motivation Graph Spectrum & Structure Spectrum & Perturbation Spectrum Preserving Randomization Privacy Protection Conclusion & Future Work 28

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Conclusion 1. Graph structure and spectrum are closely related, and perturbation can significantly change both. 2. Spectrum preserving randomization strategies can better preserve the graph structure; 3. Privacy protection issues for random perturbation. 29 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Future Work 1. Further study on privacy issues of spectral strategy; 2. A more flexible algorithm for other eigenvalues; 3. Algorithms controlling the magnitude of eigenvalues’ change. 30 Randomizing Social Network: a Spectrum Preserving Approach, SDM08

Thank You! Questions? 31

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Reference 1. L. Backstrom et al., Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography, M. Hay et al., Anonymizing social networks, Y. Wang et al., Epidemic spreading in real networks: An eigenvalue viewpoint, Randomizing Social Network: a Spectrum Preserving Approach, SDM08

2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia Privacy Protection Privacy protection measure: We can proof, for a given (i, j) Therefore, to calculation the measure is based on calculating the number of false edges (refer to our paper for more details). 33 Randomizing Social Network: a Spectrum Preserving Approach, SDM08