Liang Zheng and Yuzhong Qu

Slides:



Advertisements
Similar presentations
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Advertisements

An Interactive-Voting Based Map Matching Algorithm
Jose-Luis Blanco, Javier González, Juan-Antonio Fernández-Madrigal University of Málaga (Spain) Dpt. of System Engineering and Automation May Pasadena,
Introduction Distance-based Adaptable Similarity Search
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
CUBELSI : AN EFFECTIVE AND EFFICIENT METHOD FOR SEARCHING RESOURCES IN SOCIAL TAGGING SYSTEMS Bin Bi, Sau Dan Lee, Ben Kao, Reynold Cheng The University.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Image Similarity and the Earth Mover’s Distance Empirical Evaluation of Dissimilarity Measures for Color and Texture Y. Rubner, J. Puzicha, C. Tomasi and.
Discriminative and generative methods for bags of features
Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures.
Transfer Learning for WiFi-based Indoor Localization
Jierui Xie, Boleslaw Szymanski, Mohammed J. Zaki Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 12180, USA {xiej2, szymansk,
Margin Based Sample Weighting for Stable Feature Selection Yue Han, Lei Yu State University of New York at Binghamton.
EVENT IDENTIFICATION IN SOCIAL MEDIA Hila Becker, Luis Gravano Mor Naaman Columbia University Rutgers University.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
Lifetime and Coverage Guarantees Through Distributed Coordinate- Free Sensor Activation ACM MOBICOM 2009.
Problem Statement A pair of images or videos in which one is close to the exact duplicate of the other, but different in conditions related to capture,
Probabilistic Question Recommendation for Question Answering Communities Mingcheng Qu, Guang Qiu, Xiaofei He, Cheng Zhang, Hao Wu, Jiajun Bu, Chun Chen.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Eric Hsueh-Chan Lu 2 and Vincent S. Tseng 1 1 Institute of Computer Science and Information.
Hierarchical Dirichlet Process (HDP) A Dirichlet process (DP) is a discrete distribution that is composed of a weighted sum of impulse functions. Weights.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Developing Trust Networks based on User Tagging Information for Recommendation Making Touhid Bhuiyan et al. WISE May 2012 SNU IDB Lab. Hyunwoo Kim.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Character Identification in Feature-Length Films Using Global Face-Name Matching IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 7, NOVEMBER 2009 Yi-Fan.
Erasmus University Rotterdam Introduction Content-based news recommendation is traditionally performed using the cosine similarity and TF-IDF weighting.
Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.
Paired Sampling in Density-Sensitive Active Learning Pinar Donmez joint work with Jaime G. Carbonell Language Technologies Institute School of Computer.
H. Lexie Yang1, Dr. Melba M. Crawford2
A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?
Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic Kazunari Sugiyama 8 th January, 2010.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
A Clustering-based QoS Prediction Approach for Web Service Recommendation Shenzhen, China April 12, 2012 Jieming Zhu, Yu Kang, Zibin Zheng and Michael.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Facial Smile Detection Based on Deep Learning Features Authors: Kaihao Zhang, Yongzhen Huang, Hong Wu and Liang Wang Center for Research on Intelligent.
南台科技大學 資訊工程系 Region partition and feature matching based color recognition of tongue image 指導教授:李育強 報告者 :楊智雁 日期 : 2010/04/19 Pattern Recognition Letters,
Presented by Edith Ngai MPhil Term 3 Presentation
Guillaume-Alexandre Bilodeau
Trustworthiness Management in the Social Internet of Things
CACTUS-Clustering Categorical Data Using Summaries
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
WSRec: A Collaborative Filtering Based Web Service Recommender System
CATEGORIZATION OF NEWS ARTICLES USING NEURAL TEXT CATEGORIZER
Chinese Academy of Sciences, Beijing, China
The Earth Mover's Distance
Presenter: Hajar Emami
Learning to Detect a Salient Object
William Norris Professor and Head, Department of Computer Science
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
William Norris Professor and Head, Department of Computer Science
Postdoc, School of Information, University of Arizona
Liang Zheng, Yuzhong Qu Nanjing University, China
Topic Oriented Semi-supervised Document Clustering
An Empirical Study of Property Collocation on Large Scale of Knowledge Base 龚赛赛
Property consolidation for entity browsing
Rui Yang, Wei Hu and Yuzhong Qu
Expandable Group Identification in Spreadsheets
An Interactive Approach to Collectively Resolving URI Coreference
A Framework for Benchmarking Entity-Annotation Systems
Block Matching for Ontologies
Danyun Xu, Gong Cheng*, Yuzhong Qu
Topological Signatures For Fast Mobility Analysis
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
Kostas Kolomvatsos, Christos Anagnostopoulos
Who Are Similar to Einstein: A Multi-Type Object Similarity Measure for Entity Recommendation Zheng Liang.
Low-Rank Sparse Feature Selection for Patient Similarity Learning
Presentation transcript:

Liang Zheng and Yuzhong Qu An EMD-based Similarity Measure for Multi-Type Entities by Using Type Hierarchy Liang Zheng and Yuzhong Qu

Outline Introduction Similarity Measures Based on EMD Entity Type Weighting Evaluation Conclusion

Introduction Recommending entities with similar types is an important part of entity recommendation. In general, an entity is associated to a set of types in a knowledge base. The traditional similarity measures between two multi-type entities are determined by collection intersection. Several similarity measures between two types have been proposed by exploiting hierarchical structure in some domain. The weighting of each type within a collection of entity types, represents the contribution to the similarity between the two entities.

Introduction In this study, we measure multi-type entity similarity with the earth mover's distance (EMD), which not only takes into account pairwise type similarity, but also the weighting of entity type. The weighting of types is the key factor in the EMD. we also devise a PageRank-based weighting scheme by using type hierarchy.

Similarity Measures Based on EMD The Earth Mover's Distance (EMD) is proposed by Rubner et al. to measure dissimilarity between two multi-dimensional distributions in a feature space. Computing the EMD is based on a solution to the transportation problem.

Entity Type Weighting Statistics-based Scheme Hierarchy-based Scheme

Entity Type Weighting PageRank-based Scheme

Evaluation The performance of the PageRank(PR)-based type weighting scheme and the EMD-based similarity measure on real-world datasets (i.e., DBpedia, Last.fm). We use DBpedia as the knowledge base, and select four entities from different popular types as our test case. Two “gold standard", one is about weighting type for each of the four entities, the other is about similar entities for each of the four entities. Next, we create a “ground truth” of similar artists recommendation from Last.fm. We compare our EMD-based similarity measure with two traditional recommendation methods

Evaluation Experimental Results for Type Weighting Schemes

Evaluation Experimental Results for Different Similarity Measures

Evaluation Experimental Results for Entity Recommendation

Conclusion We propose an EMD-based similarity measure for multi-type entities, which not only takes into account pairwise type similarity, but also the weighting of types. We also devise a PageRank-based weighting scheme by using type hierarchy. The experimental results show that PageRank-based weighting scheme outperforms base-line weighting schemes and that our EMD-based similarity measure outperforms traditional similarity measures.