Combining Content-based and Collaborative Filtering Department of Computer Science and Engineering, Slovak University of Technology

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Mining customer ratings for product recommendation using the support vector machine and the latent class model William K. Cheung, James T. Kwok, Martin.
A Vector Space Model for Automatic Indexing
The best indicator that a passenger will show up to board the flight is that she called in for a special meal Filtering and Recommender Systems Content-based.
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Filtering and Recommender Systems Content-based and Collaborative Some of the slides based On Mooney’s Slides.
Item-based Collaborative Filtering Idea: a user is likely to have the same opinion for similar items [if I like Canon cameras, I might also like Canon.
CS345 Data Mining Recommendation Systems Netflix Challenge Anand Rajaraman, Jeffrey D. Ullman.
Lecture 14: Collaborative Filtering Based on Breese, J., Heckerman, D., and Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative.
Filtering and Recommender Systems Content-based and Collaborative Some of the slides based On Mooney’s Slides.
1 Collaborative Filtering Rong Jin Department of Computer Science and Engineering Michigan State University.
RECOMMENDATION SYSTEMS
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
Agent Technology for e-Commerce
If you want to change the world, change the metaphor
1 Introduction to Recommendation System Presented by HongBo Deng Nov 14, 2006 Refer to the PPT from Stanford: Anand Rajaraman, Jeffrey D. Ullman.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.
Collaborative Filtering & Content-Based Recommending
Relevant words extraction method for recommender system Presentation slides.
Item-based Collaborative Filtering Recommendation Algorithms
School of Electronics Engineering and Computer Science Peking University Beijing, P.R. China Ziqi Wang, Yuwei Tan, Ming Zhang.
Citation Recommendation 1 Web Technology Laboratory Ferdowsi University of Mashhad.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
A Hybrid Recommender System: User Profiling from Keywords and Ratings Ana Stanescu, Swapnil Nagar, Doina Caragea 2013 IEEE/WIC/ACM International Conferences.
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Toward the Next generation of Recommender systems
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
Developing Trust Networks based on User Tagging Information for Recommendation Making Touhid Bhuiyan et al. WISE May 2012 SNU IDB Lab. Hyunwoo Kim.
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Badrul M. Sarwar, George Karypis, Joseph A. Konstan, and John T. Riedl
1 01/10/09 1 INFILE CEA LIST ELDA Univ. Lille 3 - Geriico Overview of the INFILE track at CLEF 2009 multilingual INformation FILtering Evaluation.
Collaborative Data Analysis and Multi-Agent Systems Robert W. Thomas CSCE APR 2013.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
EigenRank: A ranking oriented approach to collaborative filtering By Nathan N. Liu and Qiang Yang Presented by Zachary 1.
Improving Recommendation Lists Through Topic Diversification CaiNicolas Ziegler, Sean M. McNee,Joseph A. Konstan, Georg Lausen WWW '05 報告人 : 謝順宏 1.
Techniques for Collaboration in Text Filtering 1 Ian Soboroff Department of Computer Science and Electrical Engineering University of Maryland, Baltimore.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Collaborative Filtering Zaffar Ahmed
Pearson Correlation Coefficient 77B Recommender Systems.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
Natural Language Processing Topics in Information Retrieval August, 2002.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
User Modeling and Recommender Systems: recommendation algorithms
Artificial Intelligence Techniques Internet Applications 4.
CYUT ISKM 2004/01/13 1 Fuzzy logic methods in recommender systems Author: Ronald R. Yager Source:Fuzzy set and systems, Vol. 134, 2003, pp Presented.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Collaborative Deep Learning for Recommender Systems
Analysis of massive data sets Prof. dr. sc. Siniša Srbljić Doc. dr. sc. Dejan Škvorc Doc. dr. sc. Ante Đerek Faculty of Electrical Engineering and Computing.
Collaborative Filtering With Decoupled Models for Preferences and Ratings Rong Jin 1, Luo Si 1, ChengXiang Zhai 2 and Jamie Callan 1 Language Technology.
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
Sampath Jayarathna Cal Poly Pomona
Personalized Social Image Recommendation
Collaborative Filtering Nearest Neighbor Approach
ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Presentation transcript:

Combining Content-based and Collaborative Filtering Department of Computer Science and Engineering, Slovak University of Technology Gabriela Polčicová Pavol Návrat

Overview Information Filtering and its Types Combined Method Experiment with Information Filtering Methods Conclusions

Information Filtering (1) –delivery of relevant information to the people who need it Types of Information Filtering –Content-based - for textual documents –Collaborative - for communities of users Interests –information about interests - stored in profiles –expressing opinions to documents - ratings Ratings {i, j, r ij } –for user i, item j, the value of rating r ij

Information Filtering (2) Filter Learning interests Estimating the value of rating Choosing recommendations Rated items {user, item, value} Unrated items {user, item} Recommendations {user, item, estimation}

Content-based Filtering (1) Basic idea –recommending documents based on content and properties of document Profile –consists of keywords with assigned weights –only documents matching profile are recommended Recommendations –based on objective measurable properties

Content-based Filtering (2) Documents rated by the user Documents of interest Documents unrated by the user PROFILE Keywords, phrases with weights Documents matching profile => recommended documents Documents, ratings

Collaborative Filtering (1) Basic idea –automating “word of mouth” –leverage opinions of like-minded users while making decisions Schema –collecting users’ opinions –searching for like-minded users –making recommendations

Collaborative Filtering (2) Profile of current user Profile of user 1 Profile of user 2 Profile of user 3 Profile of user 4 Profile of user 5 Documents from like-minded users’ profiles => recommended documents

k ci =  (r cj - r c ) (r ij - r i ) j  I ci  (r cj - r c ) 2  (r ij - r i ) 2 j  I ci Recommendations computation: weighted sum of ratings r cj = r c +  (r ij - r i ) k ci i  U cj  |k ci | i  U cj Collaborative Filtering (3) Similarity measure: Pearson Correlation Coefficient

Combining Content-based and Collaborative Filtering (1) Computing of estimates for missing ratings by Content- based Filtering method for each user Searching for like-minded users –computing coefficient k ci between current and i-th user (only from ratings) –computing coefficient k ci ’ between current and i-th user (from both ratings and estimates) New recommendations computation –using ratings (with coefficients k ci ) and also ratings with estimates (with coefficient k ci ’) as weights in weighted sum of ratings and estimates

Datasets for Experiments Data: –EachMovie - users‘ ratings for movies –IMDB - textual information for CBF (movies‘ descriptions) Datasets: –A - ratings from the period up to Mar 1, 1996 (810 ratings from 71 users) –B - ratings from the period uo to Mar 15, 1996 (2407 ratings from 131 users) –C - ratings from the period up to Apr 1, 1996 (12290 ratings from 651 users)

EachMovie Data and Constant Method Constant Method r cj = 5

Experiments with Combination of Content- based and Collaborative Filtering (2) Dataset Divide dataset into training set (90%) and test set (10%) Apply filtering methods and evaluate their performance Content-based Filtering method Collaborative Filtering method Combined Filtering method recommendations test, training sets Evaluation of methods’ performance Constant method recommendations test set

Metrics Coverage = percentage of items for which the method is able to compute estimates Accuracy = F-measure = NMAE = 2.Precision.Recall Precision + Recall |R  L| + |R  L| |L| + |L| |R  L| |R| |R  L| |L|  |r ij - r ij | n.s Precision = Recall = R - set of recommended items L - set of liked items

Results of Experiments

Conclusions Combination of content-based and collaborative filtering might help in initial phase Future work Weighting of coefficients Comparing method with additional methods

Content-based Filtering - Vector Representation of Documents and Profiles W j = (0, …, 0, 0.5, 0, …, 0, 0.3, 0, …, 0, 0.2, 0, …, 0) profile i =  r j.w ij n j = 1 D = ( …, computer, …, learning, …, machine, …. ) Document j computer machine learning TF-IDF W. Profile |W|. |Profile| Sim(W, Profile) =

Collaborative Filtering - Example ABCDEFG current

k ci =  (r cj - r c ) (r ij - r i ) j  I ci  (r cj - r c ) 2  (r ij - r i ) 2 j  I ci Recommendations computation: weighted sum of ratings and estimates r cj = r c +  (r ij - r i ) k ci +  (r ij - r i ) k ci ’ i  U cj CBF  |k ci | +  |k ci ’| i  U’ cj i  U cj i  U’ cj Combining Content-based and Collaborative Filtering (2) Similarity measure: Pearson Correlation Coefficient ’ ’ ’’ CBF

Experiments with Combination of Content- based and Collaborative Filtering (1) Content-based Filtering Method (CBF) –documents and profiles: vector representation - weighted keywords (TF-IDF) –estimation computation: normalized dot product of document and profile vectors Collaborative Filtering (CF) –Pearson correlation coefficient –weighted sum of ratings Combination of CF and CBF –Pearson correlation coefficients –weighted sum of ratings and CBF estimations Constant Method (r cj = 5)