Diversification And Refinement In Collaborative Filtering Recommender Date: 2012/02/16 Source: Rubi Boim et. al (CIKM’11) Speaker: Chiang,guang-ting Advisor:

Slides:



Advertisements
Similar presentations
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Advertisements

Using Trees to Depict a Forest
Google News Personalization: Scalable Online Collaborative Filtering
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
Personalized Query Classification Bin Cao, Qiang Yang, Derek Hao Hu, et al. Computer Science and Engineering Hong Kong UST.
Heaps1 Part-D2 Heaps Heaps2 Recall Priority Queue ADT (§ 7.1.3) A priority queue stores a collection of entries Each entry is a pair (key, value)
Augmenting Data Structures Advanced Algorithms & Data Structures Lecture Theme 07 – Part I Prof. Dr. Th. Ottmann Summer Semester 2006.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Sequence Clustering and Labeling for Unsupervised Query Intent Discovery Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: WSDM’12 Date: 1 November,
Time-sensitive Personalized Query Auto-Completion
DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.
Sentiment Diversification with Different Biases Date : 2014/04/29 Source : SIGIR’13 Advisor : Prof. Jia-Ling, Koh Speaker : Wei, Chang 1.
Exploring the Filter Bubble: The Effect of Using Recommender Systems on Content Diversity Date: 2014/8/25 Author: Tien T.Nguyen, Pik-Mai Hui, F. Maxwell.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
2-dimensional indexing structure
Using Trees to Depict a Forest Bin Liu, H. V. Jagadish EECS, University of Michigan, Ann Arbor Presented by Sergey Shepshelvich 1.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Techniques and Data Structures for Efficient Multimedia Similarity Search.
Collaborative Filtering CMSC498K Survey Paper Presented by Hyoungtae Cho.
Recommender systems Ram Akella November 26 th 2008.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Priority Queues and Heaps Bryce Boe 2013/11/20 CS24, Fall 2013.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
Item Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karpis, Joseph KonStan, John Riedl (UMN) p.s.: slides adapted from:
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Mining High Utility Itemsets without Candidate Generation Date: 2013/05/13 Author: Mengchi Liu, Junfeng Qu Source: CIKM "12 Advisor: Jia-ling Koh Speaker:
1 Retrieval and Feedback Models for Blog Feed Search SIGIR 2008 Advisor : Dr. Koh Jia-Ling Speaker : Chou-Bin Fan Date :
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
1 Efficient Search Ranking in Social Network ACM CIKM2007 Monique V. Vieira, Bruno M. Fonseca, Rodrigo Damazio, Paulo B. Golgher, Davi de Castro Reis,
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
IMPROVING E-COMMERCE COLLABORATIVE RECOMMENDATIONS BY SEMANTIC INFERENCE OF NEIGHBORS’ PRACTICAL EXPERTISE 6 th International Workshop on Semantic Media.
Google News Personalization: Scalable Online Collaborative Filtering
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
FINDING RELEVANT INFORMATION OF CERTAIN TYPES FROM ENTERPRISE DATA Date: 2012/04/30 Source: Xitong Liu (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling.
Dynamic Diversification of Continuous Data Marina Drosou and Evaggelia Pitoura Computer Science Department University of Ioannina, Greece
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
The Sweet Spot between Inverted Indices and Metric-Space Indexing for Top-K–List Similarity Search Evica Milchevski , Avishek Anand ★ and Sebastian Michel.
LOGO Summarizing Conversations with Clue Words Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou (WWW ’07) Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
LOGO 1 Corroborate and Learn Facts from the Web Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Shubin Zhao, Jonathan Betz (KDD '07 )
A more efficient Collaborative Filtering method Tam Ming Wai Dr. Nikos Mamoulis.
Query Sensitive Embeddings Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios, Stan Sclaroff.
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Similarity & Recommendation Arjen P. de Vries CWI Scientific Meeting September 27th 2013.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
LINDEN : Linking Named Entities with Knowledge Base via Semantic Knowledge Date : 2013/03/25 Resource : WWW 2012 Advisor : Dr. Jia-Ling Koh Speaker : Wei.
Topical Clustering of Search Results Date : 2012/11/8 Resource : WSDM’12 Advisor : Dr. Jia-Ling Koh Speaker : Wei Chang 1.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
1 An Efficient Optimal Leaf Ordering for Hierarchical Clustering in Microarray Gene Expression Data Analysis Jianting Zhang Le Gruenwald School of Computer.
Presented by: Siddhant Kulkarni Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Efficient Similarity Search : Arbitrary Similarity Measures, Arbitrary Composition Date: 2011/10/31 Source: Dustin Lange et. al (CIKM’11) Speaker:Chiang,guang-ting.
Where Did You Go: Personalized Annotation of Mobility Records
Analysis and design of algorithm
Spatial Online Sampling and Aggregation
Movie Recommendation System
Enriching Taxonomies With Functional Domain Knowledge
Recommendation Systems
Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research)
Donghui Zhang, Tian Xia Northeastern University
Recommender Systems Problem formulation Machine Learning.
Presentation transcript:

Diversification And Refinement In Collaborative Filtering Recommender Date: 2012/02/16 Source: Rubi Boim et. al (CIKM’11) Speaker: Chiang,guang-ting Advisor: Dr. Koh. Jia-ling 1

Index Introduction Priorty-Cover-Tree Priority-medoids Recommendation refinement Experiments Conclusion 2

Introduction E-commerce has grown rapidly over the past few years. An important advantage of e-commerce is the great variety of items that online stores offer. 3

Introduction Motivation : 4

Introduction Goal : ▫Proposes a novel technique for diversifying the recommendations that they give to users. ▫Provides a natural zoom-in mechanism to focus on items (clusters) of interest and recommending diversified similar items. 5

Diversity Over-specialized ! Solving: ranking + diversity =>Use “Priority-medoids” to balance these two “NP-hard” Problem =>”Priorty-Cover-Tree” 6 Harry Potter – 1 Harry Potter – 2 Harry Potter – 3 Harry Potter – 4 Harry Potter – 5

7

8

Estimations of similarity Each item is viewed as a vector of ratings in a multi-dimensional speace :rate(i) The distance between item vectors is then used as a measure for the items similarity:dist(i,j) 9

Priorty Cover-Tree(PCT) Cover-tree : a data structure originally designed to speed up a nearest neighbor search. The use of cover-trees as a tool for selecting medoids, in the context of diversification of query results. A key difference between Cover-tree and Priorty Cover-Tree is that item ratings were not taken into consideration. 10

Priorty Cover-Tree(PCT) 11

Priorty Cover-Tree(PCT) 12 rating low high

Construction 1.Sorting the set I of items according to their rating, in a descending order. 2.InsertSingleItem() function. ▫This function finds the first level into which the given item can be inserted. ▫It works in a recursive fashion. ▫But it may have several candidate nodes that may act as the item’s parent, we prefers the node with the smallest dist() measure to the inserted items. 13 Priorty Cover-Tree(PCT)

14 invariant 2 invariant 3 Construction example

Representative Selection 15 Priorty Cover-Tree(PCT)

16 Representative Selection example

Recommendation refinement Each item i that is presented on the screen, DiRec offers a “more of that” zoom-in button that allows the user to view further related items. 17 user Priority- medoids Priorty Cover- Tree Add, prune Recommend item Update tree Representative Selection Refinement Recommendations

Priority-medoids Balancing rating and distance. Medoids: (cluster’s representative) is an element in the cluster s.t. the sum of the distances from it to the other items in the cluster is minimal. Priority-medoids : the representatives are the ones having highest rating in their corresponding clusters. 18

Priority-medoids 19

20 Recommendation refinement Update tree

Experiments Implemented the above algorithms in DiRec, a plug-in that allows CF Recommender systems to diversify the recommendations that they present to users. Experiments setting: ▫Data: provided by Netflix, 100 million distinct raw movie ratings (such as 1 to 5 “stars”), by almost 500,000 users, and no semantic information on the movies 21

Experiments Algorithms: (PCT stands for Priority Cover-Tree) ▫PCT-R : Max-rating, ▫PCT-D: Max-diversity ▫PCT-C: Max-coverage ▫CT: Cover-Tree(did’t consider rating) ▫Swap: use predefined thresholds to balance rating and diversity ▫optR: selecting the k items having highest rating and ignoring diversity. ▫optD: ignores the rating of items and selects k items whose pairwise distance values is the highest. 22

Randomly chose a set of 1000 users from the Netflix data set and run the experiment for each of them. The results presented here are the average of all 1000 users, k=5. 23

24 -R

Conclusions Priority-medoids that declaratively balances the rating and the diversity of the recommended items. Priority cover-trees as a tool for efficient selection of item representatives. DiRec allows for an effective realization of a natural “zoom-in” mechanism,presenting to users similar yet diversified items. Experiments demonstrated the effectiveness of our recommendation diversification technique and its superiority over previous proposals. 25