Semantic Graph Representation Learning in Attributed Networks

Slides:



Advertisements
Similar presentations
A Unified Framework for Context Assisted Face Clustering
Advertisements

Collaborative Filtering in Social Tagging System on Joint Item-Tag Recommendations Date : 2011/11/7 Source : Jing Peng et. al (CIKM’10) Speaker : Chiu.
Linked data: P redicting missing properties Klemen Simonic, Jan Rupnik, Primoz Skraba {klemen.simonic, jan.rupnik,
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
IJCAI Wei Zhang, 1 Xiangyang Xue, 2 Jianping Fan, 1 Xiaojing Huang, 1 Bin Wu, 1 Mingjie Liu 1 Fudan University, China; 2 UNCC, USA {weizh,
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
HCC class lecture 14 comments John Canny 3/9/05. Administrivia.
1 Prototype Hierarchy Based Clustering for the Categorization and Navigation of Web Collections Zhao-Yan Ming, Kai Wang and Tat-Seng Chua School of Computing,
POTENTIAL RELATIONSHIP DISCOVERY IN TAG-AWARE MUSIC STYLE CLUSTERING AND ARTIST SOCIAL NETWORKS Music style analysis such as music classification and clustering.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
The Minimal Communication Cost of Gathering Correlated Data over Sensor Networks EL 736 Final Project Bo Zhang.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Student : Sheng-Hsuan Wang Department.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Querying Structured Text in an XML Database By Xuemei Luo.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Neural and Evolutionary Computing - Lecture 9 1 Evolutionary Neural Networks Design  Motivation  Evolutionary training  Evolutionary design of the architecture.
Mining Social Networks for Personalized Prioritization Shinjae Yoo, Yiming Yang, Frank Lin, II-Chul Moon [KDD ’09] 1 Advisor: Dr. Koh Jia-Ling Reporter:
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
SINGULAR VALUE DECOMPOSITION (SVD)
Character Identification in Feature-Length Films Using Global Face-Name Matching IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 7, NOVEMBER 2009 Yi-Fan.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
Detecting Communities Via Simultaneous Clustering of Graphs and Folksonomies Akshay Java Anupam Joshi Tim Finin University of Maryland, Baltimore County.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Algorithmic Detection of Semantic Similarity WWW 2005.
A Clustering Method Based on Nonnegative Matrix Factorization for Text Mining Farial Shahnaz.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
2016/9/301 Exploiting Wikipedia as External Knowledge for Document Clustering Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, and Xiaohua Zhou Proceeding.
Correlation Clustering
Nonlinear Dimensionality Reduction
Data Transformation: Normalization
CSE5544 Final Project Proposal
Document Clustering Based on Non-negative Matrix Factorization
Personalized Social Image Recommendation
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Associative Query Answering via Query Feature Similarity
Hyper-parameter tuning for graph kernels via Multiple Kernel Learning
Integrating Meta-Path Selection With User-Guided Object Clustering in Heterogeneous Information Networks Yizhou Sun†, Brandon Norick†, Jiawei Han†, Xifeng.
Kai Lei1, Meng Qin1, Bo Bai2,*, Gong Zhang2
Using Friendship Ties and Family Circles for Link Prediction
Outline Nonlinear Dimension Reduction Brief introduction Isomap LLE
Learning with information of features
Postdoc, School of Information, University of Arizona
CSc4730/6730 Scientific Visualization
Discovering Functional Communities in Social Media
Learning Emoji Embeddings Using Emoji Co-Occurrence Network Graph
Approximating the Community Structure of the Long Tail
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
Word Embedding Word2Vec.
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
[jws13] Evaluation of instance matching tools: The experience of OAEI
COLING‘18 Hailong Jin and Lei Hou and Juanzi Li(DCST,Tsinghua)
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Asymmetric Transitivity Preserving Graph Embedding
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
Mingzhen Mo and Irwin King
Kostas Kolomvatsos, Christos Anagnostopoulos
Human-object interaction
Non-Negative Matrix Factorization
WSExpress: A QoS-Aware Search Engine for Web Services
Modeling IDS using hybrid intelligent systems
Peng Cui Tsinghua University
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presented by Nick Janus
Learning to Detect Human-Object Interactions with Knowledge
Visual Grounding.
Presentation transcript:

Semantic Graph Representation Learning in Attributed Networks Meng Qin(覃孟)1, Kai Lei1,* (mengqin_az@foxmail.com) 1 ICNLAB, SECE, Peking University Shenzhen Graduate School 2019-8-11 (Sun.)

Outline Motivation Problem Definition Methodology Experimental Evaluation Conclusion Graph Representation Learning in attributed networks joint opt. of net. structure & semantic Transform! Network Embedding with Heterogeneous Entities Nonlinear high-order feat. among structure & semantic support advanced semantic-oriented net. inferences (e.g., Semantic Community Detection)

Motivation

Motivation Graph representation learning / Network Embedding (NE) A significant topic in the research of network analysis Powerful to support the downstream net. inference tasks e.g., community detection, link prediction, etc. Encode the network into a low-dim. vector representation With the primary properties preserved [Peng C., et al. 2018]

Motivation (Cont) Existing NE techniques Categorized according to the information sources they utilize Network Structure (Topology) Primary information source available for NE High-Order Node Proximity (e.g., LINE, DeepWalk, node2vec, GraRep, etc.) Community Structure (e.g., M-NMF, DNR, etc.) Network Semantic (Attribute/Content) carries orthogonal & complementary knowledge beyond topology potentially enhance the learned representations Node Attribute (e.g., TADW, AANE, SNE, CANE, etc.)

[Shaosheng C. et al. 2011] (k=1,2,3,4) Motivation (Cont) Limitations of existing NE techniques Only explore non-linear high-order feat. of Net. Topo. With Net. Attribute as the auxiliary regularization Inherently ignore the Non-Linear High-Order Feat. Among Net. Structure & Semantic?! Only Use attribute info. to improve the learned embeddings Cannot be directly applied to some advanced semantic-oriented tasks e.g., Semantic Community Detection community membership & semantic descriptions [Xiao W. et al. 2016] [Shaosheng C. et al. 2011] (k=1,2,3,4)

Motivation (Cont) Semantic Graph Representation (SGR) Reformulate attributed network into an abstracted weighted graph With heterogeneous entities Simultaneously learn the embeddings of node and attribute (keyword) Deal with semantic-oriented downstream tasks Introduce an enhancement scheme based on graph regularization To incorporate other side info. e.g., community structure

Problem Definition

Problem Definition Attributed Network Undirected Unweighted Net. with Discrete Attribute (Keywork) Attributed Network G={V, E, A, F} Node Set: V={v1,…,vn}; Edge Set: E={(vi, vj)|vi, vj∈V} Attribute Set: A={a1,…,am}; Attribute Map: F={f(v1),…,f(vm)} Abstracted Weighted Graph G’={V’, E’} Node Set: V’=V∩A; (Weighted) Edge Set: E’={E1, E2, E3} 3 Types of Relation: E1={W(vi, vj)}; E2={W(vi, aw)}; E3={W(aw, as)} Semantic Graph Representation Learn f’ to map {vi, aj} to k-dim. vec. According to E’, with G primary properties preserved

Methodology

Methodology Semantic Graph Rep. (SGR) Model (1) Construct Heterogeneous (Weighted) Graph G’ Integrate 3 types of relations {E1, E2, E3} Construct the Heterogeneous Adjacency Matrix (2) Learn Embeddings {xvi, xaj} Based on the Weighted Topology of G’ Explore the high-order proximity among entities V’=V ∩A

Methodology (Cont) Construct Heterogeneous Graph G’ Node Relation E1={W(vi, vj)}=E{(vi, vj)} i.e., Topology of original Network G Described by Adjacency Matrix (of G) A∈Rn×n Aij=Aji=1, if (vi, vj) ∈E; Aij=Aji=0, otherwise Attribute Relation E3={W(as, aw)} Described by (Normalized) Node Similarity Matrix P∈Rm×m Use R0∈Rn×n to describe network attribute (R0)iw=1, if aw∈f(vi); (R0)iw=0, otherwise Similarity of each attribute pair (vi, vj)

Heterogeneous Relation E2 Methodology (Cont) Construct Heterogeneous Graph G’ Heterogeneous Relation E2={(vi, aw)} Explore Higher-order Substructure 3 Motif {M1, M2, M3} 3 Relation Matrices {R0∈Rn×m, R1∈Rn×m, R2∈Rn×m} (Rt)iw as co-occurrence counts of (vi, aw) in Mt (t∈{1, 2}) Combine the normalized relation matrices Heterogeneous Adjacency Matrix B∈R(n+m)×(n+m) Node Relation E1 Heterogeneous Relation E2 Attribute Relation E3

Methodology (Cont) Learn Embeddings {xvi, xaj} Basic Unified Model Use the MF Obj. of DeepWalk [Jiezhong Q., 2018] Learn the Embeddings via SVD Use top-k singular values to approximate the reconstruction Adopt X* as the final result

Methodology (Cont) Learn Embeddings {xvi, xaj} Side-Enhancement Use other side info. to enhance the learned emb. Based on Graph Regularization: Example Side Info. Community Structure – Modularity Matrix Q Attribute Similarity – Attribute Similarity Matrix S Side-Enhancement Obj. Updating Rule:

Experimental Evaluation

Experimental Evaluation Performance Evaluation 12 real attributed network 11 Baselines/Competitors (Only) With Topo. High-Order Proximity: DeepWalk, node2vec, SDNE Community Structure: M-NMF, DNR With Topo. & Attribute TADW, AANE, FSCNMF Downstream Applications Node Clustering (a.k.a., Community Detection) – Metric: NMI, AC Node Classification – Metric: AC, Macro-F1

Experimental Evaluation (Cont) Performance evaluation SGR(0): with Default Param. Setting SGR(1): with Fine-Tuned Param. SGR(R): with Side-Enhancement ‘-’: no further performance improvement

Experimental Evaluation (Cont) Performance evaluation SGR(0): with Default Param. Setting SGR(1): with Fine-Tuned Param. SGR(R): with Side-Enhancement ‘-’: no further performance improvement

Experimental Evaluation (Cont) Case study (for Semantic Community Detection) LastFM dataset Collected from online music platform With user friendship (topo.) and tag (attribute) Use X-means to determine Clustering Mem. of nodes & attributes t-SNE Dim. Reduction Vis. of Node & Attribute Emb. Vis. of Cluster Centers

Experimental Evaluation (Cont) Case study Generate Semantic Desc. for each node cluster (community) Select top-5 keywork (with min. dist.) for each Desc. Two Strategies: Case 1: 1 Comprehensive Desc. for each Community Case 2: Mutl. Topics for each community / 1 Desc. For each Topic Case 1 Case 2

Conclusion

Conclusion In this study In our future work Reformulate Net. Embedding in Attributed Network Introduce SGR Explore Non-linear High-order Proximity among Net. Struct. & Semantic Deal with Semantic-oriented application In our future work A more comprehensive but simpler parameter setting strategy Reduce computation time via distributed SVD Here is the brief conclusion of this work.

Semantic Graph Representation Learning in Attributed Networks Thank You Very Much! Q&A Meng Qin (megnqin_az@foxmail.com)