Andriy Shepitsen, Jonathan Gemmell, Bamshad Mobasher, and Robin Burke

Slides:



Advertisements
Similar presentations
Albert Gatt Corpora and Statistical Methods Lecture 13.
Advertisements

Clustering Categorical Data The Case of Quran Verses
Personalization in Folksonomies Based on Tag Clustering
Todays topic Social Tagging By Christoffer Hirsimaa.
CVPR2013 Poster Representing Videos using Mid-level Discriminative Patches.
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Introduction Information Management systems are designed to retrieve information efficiently. Such systems typically provide an interface in which users.
 Users annotate things (resources) with labels (tags)  These annotations are shared, creating a collaborative dataset called a folksonomy coffee java.
The Vector Space Model …and applications in Information Retrieval.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.
1 Hierarchical Tag visualization and application for tag recommendations CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An Efficient Concept-Based Mining Model for Enhancing.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Leveraging Conceptual Lexicon : Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.
Date: 2012/10/18 Author: Makoto P. Kato, Tetsuya Sakai, Katsumi Tanaka Source: World Wide Web conference (WWW "12) Advisor: Jia-ling, Koh Speaker: Jiun.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Dr. Susan Gauch When is a rock not a rock? Conceptual Approaches to Personalized Search and Recommendations Nov. 8, 2011 TResNet.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Mining the Structure of User Activity using Cluster Stability Jeffrey Heer, Ed H. Chi Palo Alto Research Center, Inc – SIAM Web Analytics Workshop.
Presented by: Apeksha Khabia Guided by: Dr. M. B. Chandak
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
11 A Hybrid Phish Detection Approach by Identity Discovery and Keywords Retrieval Reporter: 林佳宜 /10/17.
Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco.
BioSnowball: Automated Population of Wikis (KDD ‘10) Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/11/30 1.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)
Center for E-Business Technology Seoul National University Seoul, Korea Social Ranking: Uncovering Relevant Content Using Tag-based Recommender Systems.
21/11/20151Gianluca Demartini Ranking Clusters for Web Search Gianluca Demartini Paul–Alexandru Chirita Ingo Brunkhorst Wolfgang Nejdl L3S Info Lunch Hannover,
DOCUMENT UPDATE SUMMARIZATION USING INCREMENTAL HIERARCHICAL CLUSTERING CIKM’10 (DINGDING WANG, TAO LI) Advisor: Koh, Jia-Ling Presenter: Nonhlanhla Shongwe.
Extracting Keyphrases to Represent Relations in Social Networks from Web Junichiro Mori and Mitsuru Ishizuka Universiry of Tokyo Yutaka Matsuo National.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
Machine Learning Queens College Lecture 7: Clustering.
Date: 2011/1/11 Advisor: Dr. Koh. Jia-Ling Speaker: Lin, Yi-Jhen Mr. KNN: Soft Relevance for Multi-label Classification (CIKM’10) 1.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Web Search Personalization with Ontological User Profile Advisor: Dr. Jai-Ling Koh Speaker: Shun-hong Sie.
PERSONALIZED DIVERSIFICATION OF SEARCH RESULTS Date: 2013/04/15 Author: David Vallet, Pablo Castells Source: SIGIR’12 Advisor: Dr.Jia-ling, Koh Speaker:
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
Clustering Algorithms Sunida Ratanothayanon. What is Clustering?
Hybrid Content and Tag-based Profiles for recommendation in Collaborative Tagging Systems Latin American Web Conference IEEE Computer Society, 2008 Presenter:
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Mining Tag Semantics for Social Tag Recommendation Hsin-Chang Yang Department of Information Management National University of Kaohsiung.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Data Mining and Text Mining. The Standard Data Mining process.
Queensland University of Technology
Personalized Folksonomies Based on Hierarchical Tag Clustering
Postdoc, School of Information, University of Arizona
Movie Recommendation System
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Presentation transcript:

Personalized Recommendation in Social Tagging Systems Using Hierarchical Clustering Andriy Shepitsen, Jonathan Gemmell, Bamshad Mobasher, and Robin Burke (2008 ACM conference on Recommender systems ) Advisor: Dr. Jai-Ling Koh Speaker: Shun-hong Sie

Outline Introduction Recommendation algorithm Tag clustering for recommendation Experimental evaluation Conclusion

Introduction Collaborative or social tagging systems allow Internet users to manage Resources can be categorized by several tags, rather than one directory or a single branch of a hierarchy. Users may share or discover resources through the collaborative network and onnect to people with similar interests.

Introduction Collaborative tagging applications benefit from the opinions of many users rather than a dominant view provided by a few. As users annotate resources, the system is able to monitor both their interest in resources as well as their vocabulary for describing those resources. 這也帶來挑戰,不同的標籤、語彙對於同一資源的描述

Introduction

Introduction The recommendation task in a folksonomy encompassing only one subject domain appears to be easier, unclouded by the effect of additional domains present in larger folksonomies and aided by a denser dataset. A recommendation strategy designed to reduce the topic space fairs well in both single topic and multi-topic folksonomies, but provides more significant improvement in a multi-topic folksonomy. 在通俗分類,推薦通常只有圍繞在單一主題領域顯的相對簡單,如主題明確的資料集與大型專題領域通俗分類.

Recommendation algorithm A folksonomy can be described as a four-tuple. D=(U,R,T,A) U:user R:resource T:Tags A:annotations

Recommendation algorithm The annotations, A, are represented as a set of triples containing a user, tag and resource.

Recommendation algorithm Each user, u, is modeled as a vector over the set of tags, where each weight, w(ti), in each dimension corresponds to the importance of a particular tag, ti.

Recommendation algorithm Resources can also be modeled as a vector over the set of tags. The tag frequency, tf, for a tag, t, and a resource, r, is the number of times the resource has been annotated with the tag.

Recommendation algorithm The distinctiveness is measured by the log of the total number of resources, N, divided by the number of resources to which the tag was applied, nt.

Recommendation algorithm With either term weighting approach, a similarity measure between a query, q, represented as a vector over the set of tags, and a resource, r, also modeled as a vector over the set of tags, can be calculated.

Recommendation algorithm

Personalized recommendation process Calculate Cosine Similarity. Calculate Relevance of all to u. Calculate the user’s interest in each cluster. Calculate each resource’s closest clusters. Infer the users’s interest in each resource. Calculate Personalized Rank Score.

Tag clustering for recommendation Evaluated several clustering techniques: maximal complete link, K-means, and hierarchical agglomerative clustering The complete link and hierarchical clustering algorithms shows comparable results in different systems The maximal complete link algorithm is effective with ambiguous tags and hierarchical clustering allows to filter out the clusters which do not have strong relation to the query tag.

Hierarchical Agglomerative Clustering

Hierarchical Agglomerative Clustering 1.Assign every tag to a singleton cluster. 2.Combine all tags in one hierarchical cluster. a.Combine clusters. b.Lower similarity. 3.Cut the tree to clusters.

Context-Dependent Hierarchical Agglomerative Clustering 3. Identify the user context. 4. Divide a branch of the tree into separate clusters.

Experimental evaluation Del.icio.us (http://delicious.com/) Last.Fm (http://cn.last.fm/)

Experimental evaluation Using the selected tag in the basic recommendation algorithm the rank of the target resource, rb, was recorded. The rank of the resource in the personalized recommendations, rp, was also recorded.

Experimental evaluation

Experimental evaluation

Experimental evaluation

Experimental evaluation

Conclusion Clusters of tags can be effectively used as a means to ascertain the user’s interest as well as determine the topic of a resource. Clusters therefore provide an effective means to bridge the gap between users and resources. Using additional information in clustering algorithm.