Relevance Feedback User tells system whether returned/disseminated documents are relevant to query/information need or not Feedback: usually positive sometimes.

Slides:



Advertisements
Similar presentations
Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Advertisements

Chapter 5: Query Operations Hassan Bashiri April
Information Retrieval and Organisation Chapter 11 Probabilistic Information Retrieval Dell Zhang Birkbeck, University of London.
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Chapter 5: Introduction to Information Retrieval
Introduction to Information Retrieval (Part 2) By Evren Ermis.
Information Retrieval Models: Probabilistic Models
1 Relevance Feedback and other Query Modification Techniques 課程名稱 : 資訊擷取與推薦技術 指導教授 : 黃三益 教授 報告者 : 博一 楊錦生 (d ) 博一 曾繁絹 (d )
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
Database Management Systems, R. Ramakrishnan1 Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and Hearst’s slides.
Learning Techniques for Information Retrieval Perceptron algorithm Least mean.
Chapter 5: Query Operations Baeza-Yates, 1999 Modern Information Retrieval.
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 11: Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
Modeling Modern Information Retrieval
Modern Information Retrieval Chapter 5 Query Operations.
1 Query Language Baeza-Yates and Navarro Modern Information Retrieval, 1999 Chapter 4.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
Query Reformulation: User Relevance Feedback. Introduction Difficulty of formulating user queries –Users have insufficient knowledge of the collection.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
1 CS 430 / INFO 430 Information Retrieval Lecture 10 Probabilistic Information Retrieval.
9/21/2000Information Organization and Retrieval Ranking and Relevance Feedback Ray Larson & Marti Hearst University of California, Berkeley School of Information.
Other IR Models Non-Overlapping Lists Proximal Nodes Structured Models Retrieval: Adhoc Filtering Browsing U s e r T a s k Classic Models boolean vector.
Modern Information Retrieval Chapter 5 Query Operations 報告人:林秉儀 學號:
Query Operations: Automatic Global Analysis. Motivation Methods of local analysis extract information from local set of documents retrieved to expand.
Chapter 5: Information Retrieval and Web Search
Information Retrieval and Web Search Relevance Feedback. Query Expansion Instructor: Rada Mihalcea Class web page:
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement and Relevance Feedback.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Query Operations J. H. Wang Mar. 26, The Retrieval Process User Interface Text Operations Query Operations Indexing Searching Ranking Index Text.
1 Query Operations Relevance Feedback & Query Expansion.
Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.
1 Computing Relevance, Similarity: The Vector Space Model.
CPSC 404 Laks V.S. Lakshmanan1 Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and Hearst’s slides at UC-Berkeley.
Relevance Feedback: New Trends Derive global optimization methods: More computationally robust Consider the correlation between different attributes Incorporate.
Relevance Feedback Hongning Wang What we have learned so far Information Retrieval User results Query Rep Doc Rep (Index) Ranker.
IR Theory: Relevance Feedback. Relevance Feedback: Example  Initial Results Search Engine2.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
© 2004 Chris Staff CSAW’04 University of Malta of 15 Expanding Query Terms in Context Chris Staff and Robert Muscat Department of.
Query Suggestion. n A variety of automatic or semi-automatic query suggestion techniques have been developed  Goal is to improve effectiveness by matching.
C.Watterscsci64031 Probabilistic Retrieval Model.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision.
Information Retrieval and Web Search Relevance Feedback. Query Expansion Instructor: Rada Mihalcea.
Relevance Feedback Hongning Wang
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Introduction to Information Retrieval Introduction to Information Retrieval Lecture Probabilistic Information Retrieval.
Relevance Feedback Prof. Marti Hearst SIMS 202, Lecture 24.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
ASSOCIATIVE BROWSING Evaluating 1 Jin Y. Kim / W. Bruce Croft / David Smith by Simulation.
1 CS 430: Information Discovery Lecture 21 Interactive Retrieval.
Lecture 12: Relevance Feedback & Query Expansion - II
Multimedia Information Retrieval
Information Retrieval Models: Probabilistic Models
Compact Query Term Selection Using Topically Related Text
Relevance Feedback Hongning Wang
Murat Açar - Zeynep Çipiloğlu Yıldız
Basic Information Retrieval
CS 4501: Information Retrieval
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Query Operations Berlin Chen 2003 Reference:
CS 430: Information Discovery
Relevance Feedback and Query Modification
Retrieval Utilities Relevance feedback Clustering
CS 430: Information Discovery
Presentation transcript:

Relevance Feedback User tells system whether returned/disseminated documents are relevant to query/information need or not Feedback: usually positive sometimes negative always incomplete Hypothesis: relevant docs should be more like each other than like non-relevant docs

Relevance Feedback: Purpose Augment keyword retrieval: Query Reformulation give user opportunity to refine their query tailored to individual exemplar based – different type of information from the query Iterative, subjective improvement Evaluation!

Relevance Feedback: Examples Image Retrieval

Relevance Feedback: Early Usage by Rocchio Modify original keyword query strengthen terms in relevant docs weaken terms in non-relevant docs modify original query by weighting based on amount of feedback

Relevance Feedback: Early Results Evaluation: how much feedback needed how did recall/precision change Conclusion: improved recall & precision over even 1 iteration and return of up to 20 non- relevant docs Promising technique

Query Reformulation User does not know enough about document set to construct optimal query initially. Querying is iterative learning process repeating two steps: 1. expand original query with new terms (query expansion) 2. assign weights to the query terms (term reweighting)

Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster based 1. Local analysis: derive information from retrieved document set 2. Global analysis: derive information from corpus

Vector Based Reformulation Rocchio (~1965)with adjustable weights Ide Dec Hi (~1968) counts only the most similar non-relevant document

Probabilistic Reformulation Recall from earlier: still need to estimate probabilities: do so using relevance feedback!

Estimating Probabilities by Accumulating Statistics D r is set of relevant docs D r,i is set of relevant docs with term k i ni is number of docs in corpus containing term k i

Computing Similarity (Term Reweighting) assume: term independence and binary document indexing Cons: no term weighting, no query expansion, ignores previous weights

Croft Extensions include within document frequency weights initial search variant Last term is normalized within-document frequency. C and K are adjustable parameters.

Query Reformulation: Summary so far… Relevance feedback can produce dramatic improvements. However, must be careful that previously judged documents are not part of improvement and techniques have limitations. Next round of improvements requires clustering…

Croft Feedback Searches Use probability updates as in Robertson

Assumptions 1. Initial query was a good approximation. 2. Ideal query is approximated by shared terms in relevant documents.

Assumptions 1. Initial query was a good approximation. polysemy? synonyms? slang? concept drift? 2. Ideal query is approximated by shared terms in relevant documents.