Non-Negative Tensor Factorization with RESCAL Denis Krompaß 1, Maximilian Nickel 1, Xueyan Jiang 1 and Volker Tresp 1,2 1 Department of Computer Science.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Nonnegative Matrix Factorization with Sparseness Constraints S. Race MA591R.
Multilinear Algebra for Analyzing Data with Multiple Linkages Tamara G. Kolda plus: Brett Bader, Danny Dunlavy, Philip Kegelmeyer Sandia National Labs.
Probabilistic Latent-Factor Database Models Denis Krompaß 1, Xueyan Jiang 1,Maximilian Nickel 2 and Volker Tresp 1,3 1 Department of Computer Science.
Informatics and Mathematical Modelling / Intelligent Signal Processing 1 Morten Mørup Decomposing event related EEG using Parallel Factor Morten Mørup.
Fitting the PARAFAC model Giorgio Tomasi Chemometrics group, LMT,MLI, KVL Frederiksberg. Denmark
Informatics and Mathematical Modelling / Intelligent Signal Processing 1 Morten Mørup Extensions of Non-negative Matrix Factorization to Higher Order data.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Probabilistic Clustering-Projection Model for Discrete Data
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
ERPWAVELAB 1st International Summer School in Biomedical Engineering1st International Summer School in Biomedical Engineering August 8, st International.
Principal Component Analysis
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
Curve-Fitting Regression
Distributed Nonnegative Matrix Factorization for Web-Scale Dyadic Data Analysis on MapReduce Chao Liu, Hung-chih Yang, Jinliang Fan, Li-Wei He, Yi-Min.
Efficient and Numerically Stable Sparse Learning Sihong Xie 1, Wei Fan 2, Olivier Verscheure 2, and Jiangtao Ren 3 1 University of Illinois at Chicago,
Malicious parties may employ (a) structure-based or (b) label-based attacks to re-identify users and thus learn sensitive information about their rating.
Large-Scale Factorization of Type- Constrained Multi-Relational Data Denis Krompaß 1, Maximilian Nickel 2 and Volker Tresp 1,3 1 Department of Computer.
Informatics and Mathematical Modelling / Intelligent Signal Processing ISCAS Morten Mørup Approximate L0 constrained NMF/NTF Morten Mørup Informatics.
Online Stacked Graphical Learning Zhenzhen Kou +, Vitor R. Carvalho *, and William W. Cohen + Machine Learning Department + / Language Technologies Institute.
Online Dictionary Learning for Sparse Coding International Conference on Machine Learning, 2009 Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro.
Cao et al. ICML 2010 Presented by Danushka Bollegala.
Informatics and Mathematical Modelling / Intelligent Signal Processing 1 EUSIPCO’09 27 August 2009 Tuning Pruning in Sparse Non-negative Matrix Factorization.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Online Learning for Matrix Factorization and Sparse Coding
Fast Max–Margin Matrix Factorization with Data Augmentation Minjie Xu, Jun Zhu & Bo Zhang Tsinghua University.
1 Information Retrieval through Various Approximate Matrix Decompositions Kathryn Linehan Advisor: Dr. Dianne O’Leary.
Non Negative Matrix Factorization
Ensemble Solutions for Link-Prediction in Knowledge Graphs
Video Tracking Using Learned Hierarchical Features
FODAVA-Lead Education, Community Building, and Research: Dimension Reduction and Data Reduction: Foundations for Interactive Visualization Haesun Park.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
SAND C 1/17 Coupled Matrix Factorizations using Optimization Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar Sandia National Laboratories SIAM Conference.
Efficient and Numerically Stable Sparse Learning Sihong Xie 1, Wei Fan 2, Olivier Verscheure 2, and Jiangtao Ren 3 1 University of Illinois at Chicago,
Pseudo-supervised Clustering for Text Documents Marco Maggini, Leonardo Rigutini, Marco Turchi Dipartimento di Ingegneria dell’Informazione Università.
June 5, 2006University of Trento1 Latent Semantic Indexing for the Routing Problem Doctorate course “Web Information Retrieval” PhD Student Irina Veredina.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Querying Factorized Probabilistic Triple Databases Denis Krompaß 1, Maximilian Nickel 2 and Volker Tresp 1,3 1 Department of Computer Science. Ludwig Maximilian.
JAVA AND MATRIX COMPUTATION
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Dual Transfer Learning Mingsheng Long 1,2, Jianmin Wang 2, Guiguang Ding 2 Wei Cheng, Xiang Zhang, and Wei Wang 1 Department of Computer Science and Technology.
A Clustering Method Based on Nonnegative Matrix Factorization for Text Mining Farial Shahnaz.
Extensions of Non-Negative Matrix Factorization (NMF) to Higher Order Data HONMF (Higher Order Non-negative Matrix Factorization) NTF2D/SNTF2D ((Sparse)
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
NONNEGATIVE MATRIX FACTORIZATION WITH MATRIX EXPONENTIATION Siwei Lyu ICASSP 2010 Presenter : 張庭豪.
Panel Discussion on Granular Computing at RSCTC2004 J. T. Yao University of Regina Web:
Non-negative Matrix Factorization
Matrix Factorization and its applications By Zachary 16 th Nov, 2010.
IB Computer Science Content developed by Dartford Grammar School Computer Science Department Objects as a programming concept.
Facets: Fast Comprehensive Mining of Coevolving High-order Time Series Hanghang TongPing JiYongjie CaiWei FanQing He Joint Work by Presenter:Wei Fan.
Paper Presentation Social influence based clustering of heterogeneous information networks Qiwei Bao & Siqi Huang.
Semi-Supervised Clustering
Information Management for Digital Humanities and Diplomatics
Zhu Han University of Houston Thanks for Dr. Hung Nguyen’s Slides
A Three-way Model for Collective Learning on Multi-Relational Data
Interactive Machine Learning with a GPU-Accelerated Toolkit
by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Nonparametric Latent Feature Models for Link Prediction
Luca Lugini Publication by Yingze Wang, Guang Xiang, and Shi-Kuo Chang
Goodfellow: Chapter 14 Autoencoders
Chap. 7 Regularization for Deep Learning (7.8~7.12 )
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
The European Conference on e-learing ,2017/10
MississaugaTalks! Saif Shaikh March 5, 2016 Code and the City
August 8, 2006 Danny Budik, Itamar Elhanany Machine Intelligence Lab
Non-Negative Matrix Factorization
An Efficient Projection for L1-∞ Regularization
Latent Semantic Analysis
Goodfellow: Chapter 14 Autoencoders
Presentation transcript:

Non-Negative Tensor Factorization with RESCAL Denis Krompaß 1, Maximilian Nickel 1, Xueyan Jiang 1 and Volker Tresp 1,2 1 Department of Computer Science. Ludwig Maximilian University, Oettingenstraße 67, Munich, Germany 2 Corporate Technology, Siemens AG, Otto-Hahn-Ring 6, Munich, Germany

Overview 1.Non-Negative Matrix Factorization 2.Multiplicative Updates 3.RESCAL 4.Non-Negativity for RESCAL 5.Experiments 6.Benefits and Drawbacks

Non-Negative Matrix Factorization Factorize a Matrix/Tensor into non-negative factors e.g. X = AV – Allows interpretation of latent factors – Can be directly used for clustering – Enforces sparse factors

Multiplicative Updates Introduced by Lee & Seung in 2000 Used by Mørup and Hanson to infer NN Tucker decomposition Define a cost-function C(θ) Derive the partial derivative with respect to θ i Identify negative and positive part of the derivative and construct update function: Negative part of the derivative Positive part of the derivative

RESCAL Tensor Factorization for Relational Learning Three-way-tensor factorization model Showed very good results in various relational learning tasks [5,8]

Non-Negative Constraint for RESCAL Regularized Least-Squares Cost Function Regularized Kullback-Leibler-Divergence Cost Function

Normalization and Integrating Entity Attribute Information Normalization of Factor Matrix A [13] Add attribute information to the model [8] Include Also for KL- Divergence Cost- Function

Experiments Nations 14 x 14 x 56 multi-relational data that consist of relations between nations (treaties, immigration, etc). Additionally the dataset contains attribute information for each entry. Kinship 104 x 104 x 26 multi-relational data that consist of several kinship relations within the Alwayarra tribe. UMLS 135 x 135 x 49 multi relational data that consist if biomedical relationships between categorized concepts of the Unified Medical Language System (UMLS). 10 x cross-validation Initialized the non-negative factor matrices with Non-negative Double Singular Value Decomposition method (NNDSVD)[9] Nonzero entries were defined as entries smaller than -1.0e-9 or greater than 1.0e-9.

Results In Kinships and UMLS case, the performance is similar to the original RESCAL Worse performance on the nations dataset Sparsity of latent factors significantly lower for UMLS and Kinships Minimizing KL-Divergence Cost functions leads to more sparse factors KL-Divergence LS

Conclusion Extended non-negative matrix factorization to relational learning tasks with the RESCAL model employing multiplicative update rules. Derived update rules for Least-Squares and KL-Divergence based cost functions including: Regularization, Normalization and Attribute Information (+) Benefits: +Updates also exploit data sparsity +Little loss in predictive performance +Significant gain in sparsity of latent factor representation (-) Drawbacks: -Slower convergence even after using non-random initialization of factor matrices as proposed by [9]

References 1.Harshman, RA: Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-modal factor analysis. UCLA Working Papers in Phonetics, 16, 1-84 (1970) 2.Carroll, JD, Chang, JJ: Analysis of individual differences in multidimensional scaling via an N-way generalization of "Eckert- Young" decomposition. Psychometrika, 35, (1970) 3.Tucker, LR: Some Mathematical notes on three-mode factor analysis. Psychometrika, 31, (1966) 4.Lee, DD, Seung, H.S.: Algorithms for Non-negative Matrix Factorization. In NIPS, (2000) 5.Nickel, M, Tresp, V, Kriegel, HP: A Three-Way Model for Collective Learning on Multi-Relational Data. In Proceedings of the 28th International Conference on Machine Learning (2011) 6.Wang, F, Li, P, König, AC: Efficient Document Clustering via Online Nonnegative Matrix Factorizations In Proceedings of SDM'11, (2011) 7.Kohen, Y: The BellKor Solution to the Netflix Grand Prize. (2009) 8.Nickel, M, Tresp, V, Kriegel, HP: Factorizing YAGO. Scalable Machine Learning for Linked Data. In Proceedings of the 21st International World Wide Web Conference (WWW2012) (2012) 9.Boutsidis, C, Gallopoulos, E: SVD-based initialization: A head start for nonnegative matrix factorization. Pat. Recogn. 41(4), (2008) 10.Langville, AN, Meyer, CD, Albright R: Initializations for the Nonnegative Matrix Factorization. KDD (2006) 11.Lee, DD, Seung, HS: Learning the parts of objects by non-negative matrix factorization. Nature, 401, (1999) 12.Mørup, M, Hanson, LK: Algorithms for Sparse Non-negative Tucker decomposition. Neural Comput. 20(8), (2008) 13.Eggert, J, Körner, E: Sparse coding and NMF. In Neural Networks, volume 4, pages (2004)