Recommender Systems. Outline Limitations of Recommender Systems SMARTMUSEUM Case Study.

Presentation transcript:

Recommender Systems

Outline: Limitations of Recommender Systems. SMARTMUSEUM Case Study.

The Netflix Prize (2006 – 2009): An open competition to build a collaborative filtering algorithm, with a $1,000,000 cash prize. The winner beat Netflix’s then-current accuracy by 10.06%. How many approaches were used? “Over 500” [1]. However, the winning model wasn’t implemented...

Why wasn’t it implemented? 1. The competition used 100 million ratings, while Netflix had around 5 billion at the time. 2. They didn’t predict user requirements changing from DVDs to streaming.

SMARTMUSEUM: A Mobile Recommender System for the Web of Data

Smart Museum: A mobile recommender system. Presents users with recommendations for sites and for on-site works of art. Provides descriptions and associated multimedia content. Ontology-based system (item-based).

Limitations of mobile recommender systems: Heterogeneous content (different structures, different vocabularies, semantic differences). Over-specialisation (objects too similar to past preferences). Content vs context.

How Recommendations Are Made: The system recommends objects on the basis of a user profile and context information, such as the physical location and motivation of the user.

User Profile

User Profile (Cont.)

System Overview: Scenarios. The mobile outdoor scenario: uses GPS or a cell identifier, combined with the user profile and visit time. The desktop scenario: users can create a user profile specifying preferences and abilities.

System Overview: Scenarios (Cont.). The mobile indoor scenario: The user switches to indoor mode manually or via an RFID sensor. Users can use their current profile or a pre-defined one (avoids cold-start). Object descriptions come from actively maintained collections. Objects are annotated with contexts and with weightings. Users can retrieve related content for each recommendation. Liking or disliking an object updates the user profile based on the triples occurring in the object’s annotations.

Example Triple

System Components: Metadata service. Context service. User profile service. Filtering service.

Metadata Service: Responsible for storing the object annotations obtained from crawling the web. Checks designated URLs that point to data dumps of triples and stores these in the database for further access. The data is then sent to the filtering service for indexing.

Context Service: Maps RFID identifiers and GPS coordinates to URIs in the ontologies. Maps manual and sensor-based contextual data to the concepts defined in the ontologies. Spatial search constraints corresponding to the user’s context are used to limit the set of objects that can be matched and recommended. The context information is used to retrieve the parts of the user’s profile that are seen as relevant given their previous behaviour.
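The slide does not say how the spatial search constraints are implemented. As a minimal sketch, assuming each object carries a latitude and longitude and the constraint is a simple radius around the user's GPS position, candidate objects could be narrowed down like this. The function names, the radius, and the approximate coordinates are all hypothetical.

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = math.radians(lat2 - lat1)
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    return 2 * earth_radius_km * math.asin(math.sqrt(a))

def within_radius(user_position, object_positions, radius_km):
    """Return ids of objects whose position lies within radius_km of the user."""
    user_lat, user_lon = user_position
    return [
        object_id
        for object_id, (lat, lon) in object_positions.items()
        if haversine_km(user_lat, user_lon, lat, lon) <= radius_km
    ]

# Illustrative, approximate coordinates (assumptions, not from the slides).
OBJECT_POSITIONS = {
    "duomo_di_milano": (45.4641, 9.1919),
    "florence_cathedral": (43.7731, 11.2560),
}
nearby = within_radius((45.4650, 9.1900), OBJECT_POSITIONS, radius_km=5.0)
```

Only the objects that pass this filter would then be handed to the ranking step.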

User Profile Service: Stores user profiles and maintains context information alongside the profiles. Adapts to explicit relevance feedback, given by marking specific objects as relevant or non-relevant. The user’s interest is modelled as a conditional probability.

Filtering Service: Indexes the content from the metadata service. Filters recommendations at the mobile client’s request, based on the user profile and context.

The Recommender System: User profiling. Data indexing (won’t cover). Result ranking. Query expansion. Feature balancing (count-based normalisation of content vs context triples). Result clustering.

User Profiling: Context-aware user profiling requires flexible models that can represent different context variables. Therefore, we have adopted a probabilistic user profiling model. A user profile consists of a set of profile entries e, where e = (triple, contextTriple, tone). The triples are assumed to be independent, which is a simplification but keeps the computation simple and has been shown to perform well in practice [2].

User Profiling (Example Entry): Duomo di Milano. Assume the user ‘likes’ the Duomo di Milano. We would insert the following into the user profile: triple = …, contextTriple = …, tone = positive.

Computing Likelihoods: To compute the weight of each triple given a context, we take the likelihood of the context generating that triple: $P(t \mid ct) = \frac{\mathrm{count}(t \mid ct)}{\mathrm{count}(ct)}$. This is computed separately for positive and negative feedback.
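The count-based estimate above can be sketched in a few lines. This is an illustration only, assuming a profile is a flat list of entries holding a triple, a context triple, and a feedback tone. The `ProfileEntry` class, the `likelihood` function, and the example triples are hypothetical, not taken from the SMARTMUSEUM implementation.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ProfileEntry:
    triple: tuple          # (subject, predicate, object) from an object annotation
    context_triple: tuple  # context in which the feedback was given
    tone: str              # "positive" or "negative"

def likelihood(entries, triple, context_triple, tone):
    """Estimate P(t | ct) = count(t | ct) / count(ct) for one feedback tone."""
    same_context = [
        e for e in entries
        if e.tone == tone and e.context_triple == context_triple
    ]
    if not same_context:
        return 0.0  # context never observed with this tone
    matches = sum(1 for e in same_context if e.triple == triple)
    return matches / len(same_context)

# Hypothetical triples, for illustration only.
CHURCH = ("object", "type", "church")
PAINTING = ("object", "type", "painting")
IN_MILAN = ("user", "location", "Milan")

profile = [
    ProfileEntry(CHURCH, IN_MILAN, "positive"),
    ProfileEntry(CHURCH, IN_MILAN, "positive"),
    ProfileEntry(PAINTING, IN_MILAN, "positive"),
    ProfileEntry(PAINTING, IN_MILAN, "negative"),
]
p_church_liked = likelihood(profile, CHURCH, IN_MILAN, "positive")
```

Calling `likelihood` once with `"positive"` and once with `"negative"` gives the two separate estimates mentioned on the slide.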

Information Filtering: $w_{i,j} = tf_{i,j} \times idf_i$, where $tf_{i,j} = \frac{N_{i,j}}{\sum_k N_{k,j}}$ and $idf_i = \log \frac{N}{n_i}$. Here $N_{i,j}$ is the number of times triple $i$ is mentioned in object $j$, $N$ is the total number of objects, and $n_i$ is the number of objects in which triple $i$ appears. (Note: this is the weighting for a feature, not an object.)

Why use this weighting? NZ is likely to have more occurrences than Wellington, because Wellington has NZ in its “deductive closure” (subsumption). The weighting counters this so that Wellington is preferred. It also allows specific triples to be matched to more general ones.
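A small sketch of the tf-idf weighting may help show why the rarer, more specific feature wins. It assumes each object is represented as a list of features, with plain strings standing in for triples. The function name and the toy data are hypothetical.

```python
import math
from collections import Counter

def tfidf_weights(objects):
    """Compute w[i, j] = tf[i, j] * idf[i] for every feature i in every object j.

    objects maps an object id to the list of features (triples) annotating it;
    a feature may be repeated within one object.
    """
    total_objects = len(objects)
    # n_i: number of objects in which feature i appears at least once.
    object_frequency = Counter()
    for features in objects.values():
        object_frequency.update(set(features))

    weights = {}
    for object_id, features in objects.items():
        counts = Counter(features)              # N_{i,j}
        mentions = sum(counts.values())         # sum over k of N_{k,j}
        weights[object_id] = {
            feature: (count / mentions)
            * math.log(total_objects / object_frequency[feature])
            for feature, count in counts.items()
        }
    return weights

# Every object mentions "NZ" (the general concept); only one mentions "Wellington".
toy_objects = {
    "a": ["NZ", "Wellington"],
    "b": ["NZ"],
    "c": ["NZ", "NZ"],
}
toy_weights = tfidf_weights(toy_objects)
```

Because "NZ" appears in all three objects its idf is log(3/3) = 0, so its weight vanishes, while "Wellington" keeps a positive weight in object "a".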

Result Ranking: Item-based. Cosine similarity is computed between each object and the profile of the user. The previously calculated weightings and probabilities are included in the ranking.
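As a rough sketch of the ranking step, assuming both the user profile and each object are sparse vectors that map a feature to a weight, cosine similarity can be computed and used to order the objects. The names `cosine` and `rank_objects` and the sample vectors are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(feature, 0.0) for feature, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty vector is similar to nothing
    return dot / (norm_u * norm_v)

def rank_objects(profile, objects):
    """Return object ids ordered by decreasing similarity to the profile."""
    scored = [(cosine(profile, vector), object_id)
              for object_id, vector in objects.items()]
    # Sort by score descending, breaking ties by id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [object_id for _, object_id in scored]

sample_profile = {"church": 1.0}
sample_objects = {
    "a": {"church": 1.0},
    "b": {"painting": 1.0},
    "c": {"church": 1.0, "painting": 1.0},
}
ranking = rank_objects(sample_profile, sample_objects)
```

In this toy case object "a" matches the profile exactly, "c" matches partially, and "b" shares no feature with it.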

Query Expansion (ontology-based): Users interested in the Duomo di Milano may also be interested in Milan or Florence (nearby). Relatedness is measured with the Wu-Palmer similarity measure [3]. Concepts above a specified threshold are selected for expansion (e.g. Milan, Florence).
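The Wu-Palmer measure scores two concepts as 2 · depth(lcs) / (depth(a) + depth(b)), where lcs is their deepest common ancestor. The sketch below applies it to a toy taxonomy. The taxonomy, the convention that the root has depth 1, and the 0.5 threshold are assumptions made for illustration; the slide does not give the threshold or the ontology actually used.

```python
# Hypothetical taxonomy: each concept maps to its parent, the root maps to None.
PARENT = {
    "Place": None,
    "Italy": "Place",
    "New Zealand": "Place",
    "Milan": "Italy",
    "Florence": "Italy",
    "Wellington": "New Zealand",
    "Duomo di Milano": "Milan",
}

def path_to_root(concept, parent):
    """List of concepts from the given one up to the root, inclusive."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = parent[concept]
    return path

def wu_palmer(a, b, parent):
    """Wu-Palmer similarity, counting the root as depth 1."""
    ancestors_b = set(path_to_root(b, parent))
    # First shared concept on the way up from a is the deepest common ancestor.
    lcs = next(c for c in path_to_root(a, parent) if c in ancestors_b)
    depth = lambda c: len(path_to_root(c, parent))
    return 2 * depth(lcs) / (depth(a) + depth(b))

def expand(concept, candidates, parent, threshold):
    """Candidates whose similarity to the concept reaches the threshold."""
    return sorted(
        c for c in candidates
        if c != concept and wu_palmer(concept, c, parent) >= threshold
    )

expanded = expand("Duomo di Milano", ["Milan", "Florence", "Wellington"],
                  PARENT, threshold=0.5)
```

With this toy taxonomy, Milan and Florence pass the threshold while Wellington, which only shares the root with the Duomo, does not.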

Result Clustering: After result ranking we have the similarity between objects and the user profile. Users may want results based on different preferences. Clustering prevents over-specialisation in recommendations. Uses the FastICA algorithm [4].

Experimental Evaluation: A recommendation experiment and a linking experiment. Tested on objects indexed with the Getty Vocabularies, which were used to build concepts, terms, and descriptions of the objects. Museum professionals provided relevance assessments for the dataset (500 objects, 28 user profiles). The accuracy of the methods was measured in terms of recall, precision, and mean average precision.
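For readers unfamiliar with the three metrics, here is a minimal sketch using their standard definitions. It assumes each profile yields a ranked list of object ids and a set of ids judged relevant; the function names and the sample data are hypothetical and unrelated to the actual evaluation dataset.

```python
def precision_recall(ranked, relevant):
    """Precision and recall of a ranked result list against a relevant set."""
    hits = len(set(ranked) & relevant)
    precision = hits / len(ranked) if ranked else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

def average_precision(ranked, relevant):
    """Mean of the precision values at each rank where a relevant item occurs."""
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant) if relevant else 0.0

def mean_average_precision(runs):
    """Average of average_precision over (ranked, relevant) pairs, one per profile."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)

sample_ranked = ["a", "b", "c", "d"]
sample_relevant = {"a", "c"}
sample_precision, sample_recall = precision_recall(sample_ranked, sample_relevant)
sample_ap = average_precision(sample_ranked, sample_relevant)
```

Here the relevant items sit at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2.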

Results

When used together, query expansion, feature balancing, and clustering improve filtering accuracy. Clustering improved the MAP of the filtering process by 11% (only when combined). Clustering is effective for reducing over-specialisation and increasing the diversity of the results.

User Trials: Outline. Conducted at the Museum of Fine Arts in Malta and at Museo Galileo. 24 participants were recruited (11 and 13 respectively). Participants were given a 30-minute presentation about the system. Users filled out a questionnaire based on a modified version of the System Usability Scale.
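The trial used a modified version of the System Usability Scale, and the slides do not describe the modification. For orientation only, the sketch below scores the standard, unmodified ten-item SUS: odd-numbered items contribute (answer - 1), even-numbered items contribute (5 - answer), and the sum is multiplied by 2.5 to give a 0 to 100 score. The function name is hypothetical.

```python
def sus_score(responses):
    """Score one standard SUS questionnaire.

    responses: ten answers on a 1-5 scale, in questionnaire order.
    """
    if len(responses) != 10:
        raise ValueError("the standard SUS has exactly ten items")
    total = 0
    for index, answer in enumerate(responses, start=1):
        if not 1 <= answer <= 5:
            raise ValueError("answers must be between 1 and 5")
        # Odd items are positively worded, even items negatively worded.
        total += (answer - 1) if index % 2 == 1 else (5 - answer)
    return total * 2.5

neutral_score = sus_score([3] * 10)
best_score = sus_score([5, 1] * 5)
```

A participant who answers 3 to every item scores 50, and the most favourable possible answers score 100.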

User Trials: Questionnaire

User Trials: Results. Users were asked for ideas for improvements: Indoor map support. Explanations of how other objects are related to the one being examined and to the user profile. Support for planning a tour beforehand.

Improvements: Testing generalisation in other domains. Utilising sensors that don’t require users to read RFID tags. Indoor map functionality. Incorporating other sensor technologies (e.g. cameras, microphones, accelerometers, compasses). Collaborative filtering based on other users’ profiles and recommendations.

Author Conclusions: Using ontologies to represent objects and enhance information retrieval leads to substantial improvement in recommendation accuracy (strong evidence). Post-retrieval clustering increased the diversity of recommendations and improved the general retrieval performance.

Additional Resources: SMARTMUSEUM demonstrations. Onboarding New Users in Recommender Systems: systems/#more-5563. Introduction to Recommender Systems MOOC by the University of Minnesota.

References
[1] Chen, E. (2011). Winning the Netflix Prize: A Summary. Retrieved from summary/
[2] Manning, C.D., Schuetze, H. (1999). Foundations of Statistical Natural Language Processing. 1st ed. The MIT Press.
[3] Wu, Z., Palmer, M. (1994). Verbs semantics and lexical selection. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics. Morristown, NJ, USA: Association for Computational Linguistics; p. 133 – 138.
[4] Hyvarinen, A., Oja, E. (1997). A fast fixed-point algorithm for independent component analysis; p – 1492.