Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi

Slides:



Advertisements
Similar presentations
Multi-label Relational Neighbor Classification using Social Context Features Xi Wang and Gita Sukthankar Department of EECS University of Central Florida.
Advertisements

Linked data: P redicting missing properties Klemen Simonic, Jan Rupnik, Primoz Skraba {klemen.simonic, jan.rupnik,
Analysis and Modeling of Social Networks Foudalis Ilias.
Social networks, in the form of bibliographies and citations, have long been an integral part of the scientific process. We examine how to leverage the.
Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Distributed Representations of Sentences and Documents
POTENTIAL RELATIONSHIP DISCOVERY IN TAG-AWARE MUSIC STYLE CLUSTERING AND ARTIST SOCIAL NETWORKS Music style analysis such as music classification and clustering.
Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.
TransRank: A Novel Algorithm for Transfer of Rank Learning Depin Chen, Jun Yan, Gang Wang et al. University of Science and Technology of China, USTC Machine.
Evolutionary Clustering and Analysis of Bibliographic Networks Manish Gupta (UIUC) Charu C. Aggarwal (IBM) Jiawei Han (UIUC) Yizhou Sun (UIUC) ASONAM 2011.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Annealing Paths for the Evaluation of Topic Models James Foulds Padhraic Smyth Department of Computer Science University of California, Irvine* *James.
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Eric H. Huang, Richard Socher, Christopher D. Manning, Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA ImprovingWord.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Predicting Positive and Negative Links in Online Social Networks
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
Exploiting Wikipedia Categorization for Predicting Age and Gender of Blog Authors K Santosh Aditya Joshi Manish Gupta Vasudeva Varma
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Paired Sampling in Density-Sensitive Active Learning Pinar Donmez joint work with Jaime G. Carbonell Language Technologies Institute School of Computer.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
Deep Visual Analogy-Making
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Unsupervised Streaming Feature Selection in Social Media
Memoryless Document Vector Dongxu Zhang Advised by Dong Wang
CIS750 – Seminar in Advanced Topics in Computer Science Advanced topics in databases – Multimedia Databases V. Megalooikonomou Link mining ( based on slides.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
Ganesh J1, Manish Gupta1,2 and Vasudeva Varma1
Plan for today Introduction Graph Matching Method Theme Recognition Comparison Conclusion.
A Fast Kernel for Attributed Graphs Yu Su University of California at Santa Barbara with Fangqiu Han, Richard E. Harang, and Xifeng Yan.
Document Clustering with Prior Knowledge Xiang Ji et al. Document Clustering with Prior Knowledge. SIGIR 2006 Presenter: Suhan Yu.
Artificial Intelligence DNA Hypernetworks Biointelligence Lab School of Computer Sci. & Eng. Seoul National University.
DeepWalk: Online Learning of Social Representations
Doc2Sent2Vec: A Novel Two-Phase Approach for Learning Document Representation Ganesh J 1 Manish Gupta 1,2 Vasudeva Varma 1 1 IIIT, Hyderabad, India, 2.
Medical Semantic Similarity with a Neural Language Model Dongfang Xu School of Information Using Skip-gram Model for word embedding.
Unsupervised Sparse Vector Densification for Short Text Similarity
Distributed Representations for Natural Language Processing
Recommendation in Scholarly Big Data
Comparison with other Models Exploring Predictive Architectures
On Dataless Hierarchical Text Classification
A Deep Learning Technical Paper Recommender System
Summary Presented by : Aishwarya Deep Shukla
Vector-Space (Distributional) Lexical Semantics
mengye ren, ryan kiros, richard s. zemel
Distributed Representations of Subgraphs
Finding Clusters within a Class to Improve Classification Accuracy
Distributed Representation of Words, Sentences and Paragraphs
Word Embeddings with Limited Memory
Weakly Learning to Match Experts in Online Community
Learning Emoji Embeddings Using Emoji Co-Occurrence Network Graph
Word embeddings based mapping
Word embeddings based mapping
Biased Random Walk based Social Regularization for Word Embeddings
Socialized Word Embeddings
Asymmetric Transitivity Preserving Graph Embedding
GANG: Detecting Fraudulent Users in OSNs
Attention for translation
Presentation transcript:

Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi Author2Vec: Learning Author Representations by Combining Content and Link Information Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi

Problem Learn representations (or feature vectors) for each author in bibliographic co-authorship network. The representation must capture the network properties of each author (i.e. authors who work in the same research area must be closer in the vector space), in a compact form.

Applications The representations learned will help solve the following network mining tasks using off-the-shelf machine learning algorithms. Author classification Author recommendation Co-authorship prediction Author visualization

State-of-the-art: DeepWalk (Perozzi et al.) Existing Work State-of-the-art: DeepWalk (Perozzi et al.) DeepWalk converts a graph into a collection of sequences containing vertices using uniform sampling (truncated random walk). Assuming each sequence as a sentence, they run the Skip-gram model (Mikolov et al.) to learn representation for each vertex.

Challenges Link sparsity problem in real world information network. For instance, two authors who write scientific articles related to the field ‘Machine Learning’ are not considered to be similar by DeepWalk if they are not connected.

Overcoming link sparsity problem Can we use the content information (research article content) to bring authors who write similar content, closer? Can it complement the model focusing on link information only? In this work, we experiment with two models: one capturing the network information and the other capturing the textual information.

Problem formulation Co-authorship network, G = (V, E) Nodes ‘u’ are authors. Edge(u, v) – Edge exists if authors ‘u’ and ‘v’ co-author at least one article. Pu – Set of papers published by author ‘u’. Author2Vec’s goal is to learn author representation 𝒗u∈Rd (∀u ∈ V), where d is the embedding size.

Content-Info model (1) Model author-paper relationship rc(u,p). Input: representation for author u (𝒗𝑢), representation for paper p (𝒗𝑝) Task: Predict whether paper ‘p’ is written by author ‘u’ or not. Output: 1 (-ve pair) or 2 (+ve pair) Generate negative samples with papers not written by the author ‘u’ Aim is to push the representations closer to her content, away from irrelevant content. Note 𝒗𝑝 is initialized by running Paragraph2Vec (Le et al.) on all the abstracts.

Content-Info model (2) For every pair, we run these equations: ℎ 𝐶 (∗) =𝑣𝑢 ° 𝑣𝑝 (angle) ℎ 𝐶 (+) =| 𝑣𝑢 − 𝑣𝑝 | (distance) ℎ 𝐶 = tanh( 𝑊 𝐶 (∗) ℎ 𝐶 (∗) + 𝑊 𝐶 (+) ℎ 𝐶 (+) + 𝑏 𝐶 (ℎ) ) 𝐿 𝐶 =P 𝑟 𝐶 u,p =l =softmax( 𝑈 𝐶 . ℎ 𝐶 + 𝑏 𝐶 (𝑝) ) Where 𝑟 𝐶 denotes whether u wrote paper p; l=1 u did not write paper p; l=2

Link-Info model (1) Model author-author relationship rL(u,v). Input: representation for author ‘u’ (𝒗𝑢), representation for author ‘v’( 𝒗𝑣) Task: Predict whether ‘v’ has collaborated with ‘u’ or not. Output: 1 (-ve pair) or 2 (+ve pair) Generate negative samples with authors who have not collaborated with author ‘u’. Aim is to push the authors who share similar network structure closer, away from irrelevant authors.

Link-Info model (2) For every pair, we run these equations: ℎ 𝐿 (∗) =𝑣𝑢 ° 𝑣𝑣 (angle) ℎ 𝐿 (+) =| 𝑣𝑢 − 𝑣𝑣 | (distance) ℎ 𝐿 = tanh( 𝑊 𝐿 (∗) ℎ 𝐿 (∗) + 𝑊 𝐿 (+) ℎ 𝐿 (+) + 𝑏 𝐿 (ℎ) ) 𝐿 𝐿 =P 𝑟 𝐿 u,v =l =softmax( 𝑈 𝐿 . ℎ 𝐿 + 𝑏 𝐿 (𝑝) ) Where 𝑟 𝐿 denotes whether u and v wrote some paper together; l=1 u and v never wrote a paper together; l=2

Author2Vec Overall objective function: 𝑳 = 𝑳 𝑪 + 𝑳 𝑳 Uses Stochastic Gradient Descent (SGD) and Backpropagation to learn unknown parameters Learning rate is fixed at 0.1.

Evaluation (1) Dataset: DBLP (Chakraborty et al.) 711810 papers (along with abstracts) 500361 authors 24 computer science fields (paper tags) Baseline: DeepWalk

Evaluation (2) Tasks- Link Prediction Clustering Training years: 1990-2009 Testing year: 2010 Logistic Regression Clustering K-Means (with k=24) Pick the field in which the author publishes the most as his/her tag.

Evaluation (3) Performance comparison Task Link Prediction Clustering Model\Metric Accuracy (%) NMI (%) DeepWalk 81.965 19.956 Content-info 80.707 19.823 Link-info 72.808 19.163 Author2Vec 83.894 20.122

Conclusion Author2Vec fuses content and link information to learn author embeddings given a co-authorship network. Future Directions: Considering weighted graphs (‘weight’ indicates the number of papers co-authored). Incorporating the global network information.

References [1] Ahmed, A., Shervashidze, N., Narayanamurthy, S., Josifovski, V., Smola, A.J.: Distributed Large-scale Natural Graph Factorization. In: WWW. (2013) 37-48 [2] Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: online learning of social representations. In: KDD. (2014) 701-710 [3] Le, Q., Mikolov, T.: Distributed Representations of Sentences and Documents. In: ICML. (2014) 1188-1196 [4] Chakraborty, T., Sikdar, S., Tammana, V., Ganguly, N., Mukherjee, A.: Computer Science Fields as Ground-truth Communities: Their Impact, Rise and Fall. In: ASONAM. (2013) 426-433 [5] Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed Representations of Words and Phrases and their Compositionality. In: NIPS. (2013) 3111-3119 [6] Nowell, D.L., Kleinberg, J.: The link-prediction problem for social networks. In: Journal of the American Society for Information Science and Technology. (2007) 1019-1031 [7] Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory. In: ACL. (2015) 1556-1566