INFSCI 3954 The Adaptive Web Session 5: Personalized Search on the Web Peter Brusilovsky

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Adaptive Hypermedia Dr. Alexandra Cristea
Chapter 5: Introduction to Information Retrieval
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Basic IR: Modeling Basic IR Task: Slightly more complex:
To See, or Not to See—Is That the Query? Robert R. Korfhage Dept. of Information Science University of Pittsburgh 1991 Reviewed by Yi-Bu Chen LIS 551 Information.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
Personalised Search on the World Wide Web Originally by Micarelli, Gasparetti, Sciarrone & Gauch
1 Information Retrieval and Web Search Introduction.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Adaptive Hypermedia: What is it and why are we doing it? Dr. Alexandra Cristea
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Chapter 5: Information Retrieval and Web Search
INFSCI 2955 Adaptive Web Systems Session 1: Personalized Web Search Peter Brusilovsky
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
Personalized Information Retrieval in Context David Vallet Universidad Autónoma de Madrid, Escuela Politécnica Superior,Spain.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Which of the two appears simple to you? 1 2.
1 Query Operations Relevance Feedback & Query Expansion.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Learning Probabilistic User Profiles: Interesting Web Sites, Notifying User of Relevant Changes to Web Pages, and Locating Grant Opportunities M. Ackerman,
Lecture 2 Jan 13, 2010 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
Chapter 6: Information Retrieval and Web Search
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Search Engine Architecture
Lecture 2 Jan 15, 2008 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
IR Theory: Relevance Feedback. Relevance Feedback: Example  Initial Results Search Engine2.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Adaptive Information Systems: From Adaptive Hypermedia to the Adaptive Web Peter Brusilovsky School of Information Sciences University of Pittsburgh,
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
AnnotatEd: A Social Navigation and Annotation Service for Web-based Educational Resources Rosta Farzan & Peter Brusilovsky Personalized Adaptive Web Systems.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
User Modeling and Recommender Systems: Introduction to recommender systems Adolfo Ruiz Calleja 06/09/2014.
ASSIST: Adaptive Social Support for Information Space Traversal Jill Freyne and Rosta Farzan.
Peter Brusilovsky. Index What is adaptive navigation support? History behind adaptive navigation support Adaptation technologies that provide adaptive.
Open-Corpus Adaptive Hypermedia Peter Brusilovsky School of Information Sciences University of Pittsburgh, USA
1 FollowMyLink Individual APT Presentation First Talk February 2006.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Adaptivity, Personalisation and Assistive Technologies Hugh Davis.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Web-Technology Lecture 14.
Information Retrieval and Web Search
Search Engine Architecture
Information Retrieval and Web Search
Information Retrieval on the World Wide Web
Information Retrieval and Web Search
Information Integration for Digital Libraries
Information Retrieval
Author: Kazunari Sugiyama, etc. (WWW2004)
CSA3212: User Adaptive Systems
Search Engine Architecture
Information Retrieval and Web Design
Information Retrieval and Web Search
Presentation transcript:

INFSCI 3954 The Adaptive Web Session 5: Personalized Search on the Web Peter Brusilovsky

User Profile Common term for user models in IR/IF A user’s profile is a collection of information about the user of the system. These information is used to get the user to more relevant information Views on user profiles in IR community –Classic (Korfhage) - a reference point –Modern (Belkin) - starting part of the user model

Core vs. Extended User Profile Core profile –contains information related to the user search goals and interests Extended profile –contains information related to the user as a person in order to understand or model the use that a person will make with the information retrieved

Simple (vector) Core Profiles Primitive profile (any model) –A set of search terms (0-1 vector) For Boolean model of IR –A Boolean query For vector model of IR (dominated) –A set of terms with their weights (vector) –An overlay (set of weights) over a simple domain model that is just a list of terms that could be of interest to the users

Advanced Core Profiles Domain model is a network of terms –A subset of the model (simple overlay) –A weighted overlay over DM Domain model is a hierarchy of topics –typically, each topic is a vector of terms –A subset of the model (simple overlay) –A weighted overlay over DM

Group Profiles A system can maintain a group profile in parallel or instead of user profile Could resolve the privacy issue (navigation with group profile) Could be use for new group members at the beginning Could be used in addition to the user profile to add group “wisdom”

Extended Profile Knowledge: about the system and the subject Goals: local and global Interests Background: profession, language, prospect, capabilities Preferences (types of docs, authors, sources…)

Who Maintains the Profile? Profile is provided and maintained by the user/administrator –Sometimes the only choice The system constructs and updates the profile (automatic personalization) Collaborative - user and system –User creates, system maintains –User can influence and edit –Does it help or not?

General System Types Search-oriented Web IR systems –Ad-hoc IR –Information Filtering Content-based recommender systems Browsing-oriented IR systems –Hypertext and the Web Information Visualization Personalized information spaces –Bookmarking systems, MyLibrary systems Search services –Cover several functions around single user model

What Can Be Adapted? Adaptive search and filtering Adaptive presentation –Presenting a page (analyze results) Adaptive information visualization Adaptive navigation support –Presenting search results (analyze results) –Presenting links on a page (proceed from) Adaptive collection (crawling)

Adaptive Information Filtering Goals: –Improve the long-term user profile to get better filtering results Methods –Variations of relevance feedback for improving the profile –Machine learning approaches to learn users’ “true” long-tem interests

Example: Adaptive News Server Adapts to long-term user preferences What to consider: –Which news items the user browses –How many pages in a new item the user read (mobile platform) Uses machine learning Significantly improved the number of messages read.

Adaptive Search Goals: –Find documents (pages) that are most suitable for the individual user Methods: –Employ user profiles representing long- term interests (Korfhage) –Use heuristics for adaptation to user types and actions

User Profile for Adaptive Search The profile is used to give a context to the query in order to reduce ambiguity. The profile of interests will allow to distinguish what the user asked about “Pirates” really wants. For example the background of the user can be helpful to understand what kind of information he is looking for. A query about the theory of groups has a different meaning for a mathematician and a sociologist. Moreover a student in math is interested in the basic concepts, while a an expert is interested in advanced materials

Using Profiles for Search The user profile can be applied in four ways: To modify the query itself (pre-filter) To process results of a query (post- filter), To change the usual way of retrieval –Profile is treated as a reference point Special case: meta-adaptation for meta- search

Examples of Systems Pre-filter with QE - Koutrika, Mobasher Pre-filter with RF : SmartGuide Post-filters: Syskill & Webert, WIFS Adaptive Meta-Search engines –Adaptive selection and ranking of sources

Pre-Filter: Query Expansion User profile is applied to add terms to the query (works with any IR model) –Similar terms could be added to resolve indexer-user mismatch –Related terms could be added to resolve ambiguity Requires an advanced profile

Example: Koutrika & Ioannidis’05 Advanced relevance network for query expansion java -> java and programming -> java and (programming or development) A Unified User-Profile Framework for Query Disambiguation and Personalization Georgia Koutrika and Yannis Ioannidis,

Pre-Filter: Relevance Feedback In this case the profile is used to “move” the query vector (vector IR only) Imagine that: –the documents, –the query –the user profile are represented by the same set of weighted index terms

Pre-filter: Linear Transformation The query q=q 1, q 2, … q n The profile p=p 1, p 2, … p n The query modified by the user profile will be something like that: modified q i =Kp i +(1-K)q i i=1,2,…n

Pre-filter: Linear Transformation modified q i =Kp i +(1-K)q i In this case we add the terms of the profile to the query ones, weighted by K for K=0 modified q i =q i the query is unmodified for K=1 modified q i =p i the query is substituted by the profile

Piecewise Linear Transformation if the term appears in the query and in the profile then the linear transformation is applied if the term appears in the query but not in the profile is left unmodified or diminished slightly if the term appears in the profile but not in the query it is not introduced, or introduced with a weight lower than in the profile.

Example: SmartGuide Access to the CIS-like information User has a long-term interests profile and current queries Information is searched using a combination of both Profile is initiated from a stereotype and kept updated Increased user satisfaction, decreased navigation overhead Gates, K. F., Lawhead, P. B., and Wilkins, D. E. (1998) Toward an adaptive WWW: a case study in customized hypermedia. New Review of Multimedia and Hypermedia 4,

Post-Filter The user profile is used to organize the results of the retrieval process –present to the user the most interesting documents –Filter out irrelevant documents Extended profile can be used effectively In this case the use of the profile adds an extra step to processing Similar to classic information filtering problem Typical way for adaptive Web IR

Post-Filter: Re-Ranking Re-ranking is a typical approach for post-filtering Each document is rated according to its relevance (similarity) to the user or group profile This rating is fused with the relevance rating returned by the search engine The results are ranked by fused rating –User model: WIFS, group model: I-Spy

Example: WIFS (Micarelli) Adaptive post-filter to AltaVista search engine Maintains an advanced stereotype- based user model (Humos subsystem) User model is updated by watching the user The model is used to filter and re-order the links returned by AltaVista

Post-Filter: Annotations The result could be relevant to the user in several aspects. Fusing this relevance with query relevance is error prone and leads to a loss of data Results are ranked by the query relevance, but annotated with visual cues reflecting other kinds of relevance –User interests - Syskill and Webert, group interests - KnowledgeSea

Example: Syskill and Webert First example of annotation Post-filter to Lycos Hot, cold, lukewarm Pazzani, M., Muramatsu, J., and Billsus, D. (1996) Syskill & Webert: Identifying interesting Web sites. In: Proceedings of the Thirteen National Conference on Artificial Intelligence, AAAI'96, Portland, OR, August, 1996, AAAI Press /MIT Press, pp , also available at yskill.html.

Example: Knowledge Sea II General annotation Question Praise Negative Positive Similarity score Document with high traffic (higher rank) Document with positive annotation (higher rank)

Separate Reference Points In this case documents are retrieved if they are “near” the query or the profile. In the following discussion we assume that the similarity is measured by distance where D is the document and Q is the query

We have different way to integrate query and profile as separate reference points: –Disjunctive model of query-profile integration –Conjunctive model of query-profile integration –Ellipsoidal model –Cassini oval model Separate reference points

Disjunctive model In this case we will take the document if the following condition is satisfied: The D document should be “near” the query Q or the profile P

Conjunctive model Condition to satisfy The D document should be “near” the query Q and the profile P In this case if profile and query have little, or noting, in common very few documents are retrieved

Ellipsoidal model this is the equation of a ellipse. P Q P Q If the profile and query are “far apart” a lot of documents not relevant are retrieved Condition to satisfy

Cassini model Condition to satisfy P Q P Q P Q P Q