Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Chapter 5: Introduction to Information Retrieval
Basic IR: Modeling Basic IR Task: Slightly more complex:
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
Introduction to Information Retrieval (Part 2) By Evren Ermis.
Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.
To See, or Not to See—Is That the Query? Robert R. Korfhage Dept. of Information Science University of Pittsburgh 1991 Reviewed by Yi-Bu Chen LIS 551 Information.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
ISP 433/533 Week 2 IR Models.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
© Anselm Spoerri Lecture 10 Visual Tools for Text Retrieval (cont.)
Internet Resources Discovery (IRD) Search Engines Quality.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
© Anselm Spoerri Lecture 13 Housekeeping –Term Projects Evaluations –Morse, E., Lewis, M., and Olsen, K. (2002) Testing Visual Information Retrieval Methodologies.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
WXGB6106 INFORMATION RETRIEVAL Week 3 RETRIEVAL EVALUATION.
Information Retrieval
Overview of Search Engines
Search and Retrieval: Relevance and Evaluation Prof. Marti Hearst SIMS 202, Lecture 20.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Evaluation Experiments and Experience from the Perspective of Interactive Information Retrieval Ross Wilkinson Mingfang Wu ICT Centre CSIRO, Australia.
1 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Larry.
Content-Based Image Retrieval
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Personalized Search Xiao Liu
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Distributed Information Retrieval Server Ranking for Distributed Text Retrieval Systems on the Internet B. Yuwono and D. Lee Siemens TREC-4 Report: Further.
Supporting Effective Access through User- and Topic- Based Language Models.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
University of Malta CSA3080: Lecture 6 © Chris Staff 1 of 20 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
ICIP 2004, Singapore, October A Comparison of Continuous vs. Discrete Image Models for Probabilistic Image and Video Retrieval Arjen P. de Vries.
Copyright 2010, The World Bank Group. All Rights Reserved. COMMUNICATION AND DISSEMINATION, PART 2 DEVELOPING DISSEMINATION PRODUCTS 1.
Language Model in Turkish IR Melih Kandemir F. Melih Özbekoğlu Can Şardan Ömer S. Uğurlu.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval
Lesson 13 Databases Unit 2—Using the Computer. Computer Concepts BASICS - 22 Objectives Define the purpose and function of database software. Identify.
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
CS798: Information Retrieval Charlie Clarke Information retrieval is concerned with representing, searching, and manipulating.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
User Interfaces for Information Access Prof. Marti Hearst SIMS 202, Lecture 26.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Relevance Feedback Prof. Marti Hearst SIMS 202, Lecture 24.
What Happens After the Search? User Interface Ideas for Information Retrieval Results Marti A. Hearst Xerox PARC.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
SIMS 202, Marti Hearst Final Review Prof. Marti Hearst SIMS 202.
Text Visualization Lecture 11
INFORMATION VISUALIZATION (CS 5984) PRESENTATION
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Information Retrieval and Web Design
Presentation transcript:

Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004

Purpose of Information Retrieval (IR) “The purpose of information retrieval is to help users effectively access large collections of objects with the goal of satisfying users’ stated information needs.” -- W. Bruce Croft

Too Few or Too Many Your Search:{collaborative};{visualization};{tool} Search Results:Records found: 2 / Total characters: 5667{collaborative};{visualization};{tool} Your Search:{collaborative,visualization,tool} Search Results:Records found: 3213 / Total characters: {collaborative,visualization,tool}

The Search Results…

Outline Background on IR InfoCrystal (Spoerri, 1993) TileBars (Hearst, 1995) Evaluation of a Tool for Visualization of Information Retrieval Results (Veerasamy & Belkin, 1996)

Background on IR Common approaches of text retrieval –Boolean term specification e.g. information retrieval AND (query language OR human factors) –Similarity search: vector space model, probabilistic models, and etc. Rank documents according to how close they are to the terms in the query

Functionalities of IR Visualization Systems Generating Boolean Queries Keyword-based / Full text Relationships between queries and retrieved documents Search support Providing overview of query words in the document space Document length Frequency of query terms Query terms distribution in the document Transparency of Ranking

Outline Background on IR InfoCrystal (Spoerri, 1993) TileBars (Hearst, 1995) Evaluation of a Tool for Visualization of Information Retrieval Results (Veerasamy & Belkin, 1996)

InfoCrystal Formation Shape coding Proximity coding Rank coding Color or texture coding Orientation coding Size or Brightness &saturation coding

InfoCrystal Numbers indicate the amount of documents retrieved Ranking vs. proximity principle Users can select relationships by clicking icons The threshold slider

Features of InfoCrystal A visualization tool and a visual query language Visualize all the possible discrete and continuous relationships among N concepts User can selectively emphasize the qualitative or the quantitative information Users can specify Boolean and vector-space queries graphically

Functionality Check Generating Boolean Queries Keyword-based / Full text Relationships between queries and retrieved documents Search support Providing overview of query words in the document space Document length Frequency of query terms Query terms distribution in the document Transparency of Ranking

Critique Pros –Very smart idea –Nice comparison with relevant previous work Cons –No user studies to test the effectiveness of the visualization –Concentrate on the short comings all other systems

Outline Background on IR InfoCrystal (Spoerri, 1993) TileBars (Hearst, 1995) Evaluation of a Tool for Visualization of Information Retrieval Results (Veerasamy & Belkin, 1996)

TileBars Three Term sets Click on a tile to see the contents of the document. Term frequency and distribution information is important for determining relevance. Large rectangle indicates a document

Functionality Check Generating Boolean Queries Keyword-based / Full text Relationships between queries and retrieved documents Search support Providing overview of query words in the document space Document length Frequency of query terms Query terms distribution in the document Transparency of Ranking

Critique Pros –One of the first paper focused on long texts information access –Provides information on how different query facets overlap in different sections of a long document Cons –No user studies to test the effectiveness of the visualization –Good for long text retrieval, constrained by length

Outline Background on IR InfoCrystal (Spoerri, 1993) TileBars (Hearst, 1995) Evaluation of a Tool for Visualization of Information Retrieval Results (Veerasamy & Belkin, 1996)

Another IR Visualization

Metrics for Evaluation Test effectiveness, usability, and acceptability of the visualization tool Prediction: the visualization tool will make better decisions about which documents to look at than those without visualization Parameters: –# of documents saved per search (s-p-s) –Interactive trec precision (i-t-p) –Interactive user precision (i-u-p) –Precision of the seach

Experiment 1 36 subjects, 3 groups –Group “with-out: with” initial tutorial, 1 st search without visualization, intermediate tutorial, 2 nd search with visualization tool –Group with: with –Group without: without Results –No significant differences between any two groups in any of the four measures

Experiment 2 36 subjects, 2 groups –Group “viz” –Group “noviz” Results –Favor “viz” group, but not significant –One explanation: visualization of this sort is helpful for naïve searchers, but loses its effect when users become more experienced with the IR system

Critique Pros –Initial attempt to evaluate visualization tool for IR –Generate possible metrics for evaluation Cons –Many confounds in the experiment –No user feedback was reported –Did not state why the authors decided to choose the particular vis tool to evaluate

Conclusion How can we use visualization to help us to filter the huge information collection? What are the key features that make a IR visualization useful? How can we design better user studies to test these systems? Would the combination of IR visualization tools and IR intelligent agents be more powerful, and can assists users better?