User Interfaces for Information Access Prof. Marti Hearst SIMS 202, Lecture 26.

Slides:



Advertisements
Similar presentations
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , , November 29, 1999.
Advertisements

Computer Concepts BASICS 4th Edition
Query Chain Focused Summarization Tal Baumel, Rafi Cohen, Michael Elhadad Jan 2014.
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
10/4/01 IS202: Information Organization & Retrieval Interfaces for Information Retrieval Ray Larson & Warren Sack IS202: Information Organization and Retrieval.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
To See, or Not to See—Is That the Query? Robert R. Korfhage Dept. of Information Science University of Pittsburgh 1991 Reviewed by Yi-Bu Chen LIS 551 Information.
Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.
Information Retrieval in Practice
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
© Anselm Spoerri Lecture 10 Visual Tools for Text Retrieval (cont.)
1 Block-based Web Search Deng Cai *1, Shipeng Yu *2, Ji-Rong Wen * and Wei-Ying Ma * * Microsoft Research Asia 1 Tsinghua University 2 University of Munich.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
9/18/2001Information Organization and Retrieval Vector Representation, Term Weights and Clustering (continued) Ray Larson & Warren Sack University of California,
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
Web Site Design Marti Hearst (UCB SIMS) SIMS 213, UI Design & Development April 6, 1999.
INFM 700: Session 7 Unstructured Information (Part II) Jimmy Lin The iSchool University of Maryland Monday, March 10, 2008 This work is licensed under.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
SIMS 296a-3: UI Background Marti Hearst Fall ‘98.
Marti Hearst SIMS 247 SIMS 247 Lecture 20 Visualizing Text & Text Collections (cont.) April 2, 1998.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
WMES3103: INFORMATION RETRIEVAL WEEK 10 : USER INTERFACES AND VISUALIZATION.
Interfaces for Querying Collections. Information Retrieval Activities Selecting a collection –Lists, overviews, wizards, automatic selection Submitting.
UCB CS Research Fair Search Text Mining Web Site Usability Marti Hearst SIMS.
ISP 433/633 Week 12 User Interface in IR. Why care about User Interface in IR Human Search using IR depends on –Search in IR and search in human memory.
Overview of Search Engines
Search and Retrieval: Relevance and Evaluation Prof. Marti Hearst SIMS 202, Lecture 20.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Chapter 15: Informational Reading
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
1 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Larry.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement and Relevance Feedback.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Concept Mapping concepts and exercises K. Yue. Why Learning is difficult? When we learn a new ‘thing’? There are many concepts: 10s to 1000s, depending.
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Virtual University - Human Computer Interaction 1 © Imran Hussain | UMT Imran Hussain University of Management and Technology (UMT) Lecture 40 Observing.
Document Collections cs5984: Information Visualization Chris North.
Comparing and Ranking Documents Once our search engine has retrieved a set of documents, we may want to Rank them by relevance –Which are the best fit.
Graphic Organizers. Free Template from 2 Index of workshop Graphic Organizers workshop.
ITGS Databases.
1 FollowMyLink Individual APT Presentation Third Talk February 2006.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Visualization in Text Information Retrieval Ben Houston Exocortex Technologies Zack Jacobson CAC.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets Selection part 2: Result sets  Examination.
1 CS 430: Information Discovery Lecture 5 Ranking.
Basics of Databases and Information Retrieval1 Databases and Information Retrieval Lecture 1 Basics of Databases and Information Retrieval Instructor Mr.
Search and Retrieval: Finding Out About Prof. Marti Hearst SIMS 202, Lecture 18.
Search and Retrieval: Query Languages Prof. Marti Hearst SIMS 202, Lecture 19.
Relevance Feedback Prof. Marti Hearst SIMS 202, Lecture 24.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
What Happens After the Search? User Interface Ideas for Information Retrieval Results Marti A. Hearst Xerox PARC.
1 CS 430: Information Discovery Lecture 21 Interactive Retrieval.
SIMS 202, Marti Hearst Final Review Prof. Marti Hearst SIMS 202.
Plan for Today’s Lecture(s)
Text Based Information Retrieval
SIMS 202 Information Organization and Retrieval
Compact Query Term Selection Using Topically Related Text
Visualizing Document Collections
Document Clustering Matt Hughes.
INFORMATION VISUALIZATION (CS 5984) PRESENTATION
cs5984: Information Visualization Chris North
Information Retrieval and Web Design
Presentation transcript:

User Interfaces for Information Access Prof. Marti Hearst SIMS 202, Lecture 26

Marti A. Hearst SIMS 202, Fall 1997 User Interfaces for Information Access n User Interfaces is a large topic n Focus on only two aspects here n How to impart information about the relationship between n the query, n the document collection, and n the retrieval results n How to support the iterative process

Marti A. Hearst SIMS 202, Fall 1997 Today v Imparting Information about query, collection, and retrieval results v Supporting the Process of search v Videos!

Marti A. Hearst SIMS 202, Fall 1997 UIs for Information Access n Why Text is Tough n Collections are very large n Information is high-dimensional n Text is difficult to summarize n summarization produces more text n visualization doesn’t show the essence

Marti A. Hearst SIMS 202, Fall 1997 Assuming: User in the loop Full text documents Goal: reduce cognitive load on user n simple query n informative retrieval resultsQuestion: What is the relationship between the n query n retrieved documents? User Interface Goals

Marti A. Hearst SIMS 202, Fall 1997 To impart information (about the collection and retrieval results) n Interfaces should n give hints about the roles terms play in the collection n give hints about what will happen if various terms are combined n show explicitly why documents are retrieved in response to the query n summarize compactly the subset of interest

Marti A. Hearst SIMS 202, Fall 1997 Retrieval Result Display: Current Approaches n Ranked list of titles n Number of hits of each term n Inter-document similarity

Marti A. Hearst SIMS 202, Fall 1997

Marti A. Hearst SIMS 202, Fall 1997 Main Topics and Subtopics Observation: n Long texts often consist of main topics and a sequence of subtopical discussions n Standard IR does not take this into account

Marti A. Hearst SIMS 202, Fall 1997 Variations on Topic and Subtopic Distribution B A A B B A

Marti A. Hearst SIMS 202, Fall 1997 A Scenario v Query: funding for cold fusion research v A relevant article exists, in which: n the main topic is cold fusion n there is a subtopic funding discussion n but “cold fusion” is not adjacent to it v A standard system either: n retrieves all docs with both term sets n discards this doc -- no term overlap

Marti A. Hearst SIMS 202, Fall 1997 TextTiling Partition expository texts into multi- paragraph discourse units that reflect the texts’ subtopic structure 1-2 Intro: Magellan Space Probe 3-4 Intro: Venus 5-7 Missing Craters on Venus 8-11 Volcanic Action River Styx Crustal spreading Recent volcanism Future of Magellan

Marti A. Hearst SIMS 202, Fall 1997 Display of Retrieval Results Goal: minimize time/effort for deciding which documents to examine in detail Idea: show the roles of the query terms in the retrieved documents, making use of document structure

Marti A. Hearst SIMS 202, Fall 1997 TileBars v Graphical Representation of Term Distribution and Overlap v Simultaneously Indicate: n relative document length n query term frequencies n query term distributions n query term overlap

Marti A. Hearst SIMS 202, Fall 1997 Query terms: What roles do they play in retrieved documents? DBMS (Database Systems) Reliability Mainly about both DBMS & reliability Mainly about DBMS, discusses reliability Mainly about, say, banking, with a subtopic discussion on DBMS/Reliability Mainly about high-tech layoffs Example

Marti A. Hearst SIMS 202, Fall 1997

Marti A. Hearst SIMS 202, Fall 1997

Marti A. Hearst SIMS 202, Fall 1997 Exploiting Visual Properties n Variation in gray scale saturation imposes a universal, perceptual order (Bertin et al. ‘83) n Varying shades of gray show varying quantities better than color (Tufte ‘83) n Differences in shading should align with the values being presented (Kosslyn et al. ‘83)

Marti A. Hearst SIMS 202, Fall 1997 Key Aspect: Query formation n Conjunct of disjuncts n Each disjunct is a concept n osteoporosis, bone loss n prevention, cure n research, Mayo clinic, study n User does not have to specify which are main topics, which are subtopics n Ranking algorithm gives higher weight to overlap of topics

Marti A. Hearst SIMS 202, Fall 1997 Main Topic Context n Potential Problem with TileBars Given retrieved documents in which no query terms are well-distributed, The user does not know the context in which the query terms are used n Solution: Accompany with main topic display

Marti A. Hearst SIMS 202, Fall 1997 TileBars Summary n Compact, graphical representation of term distribution for full text retrieval results n simultaneously display term frequency, distribution, overlap, and doc length n allow for simple user-determined ordering strategies n Part of a larger effort: user-centric, content-sensitive information access

Marti A. Hearst SIMS 202, Fall 1997 TileBars v Preliminary User Studies v users understand them v find them helpful in some situations v sometimes terms need to be disambiguated v Future: v Integrate with Main Topic Context Indicators v Modify for Web information

Marti A. Hearst SIMS 202, Fall 1997 Other Approaches v Show how often each query term occurs in retrieved documents n VIBE (Korfhage ‘91) n InfoCrystal (Spoerri ‘94) n Problems: n can’t see overlap of terms within docs n quantities not represented graphically n more than 4 terms hard to handle n no help in selecting terms to begin with

Marti A. Hearst SIMS 202, Fall 1997 InfoCrystal (Spoerri 94)

Marti A. Hearst SIMS 202, Fall 1997 Support the Process n Two recent similar approaches that focus on supporting the process n SketchTrieve (Hendry & Harper, in reader) n DLITE (Cousins, on videotape)

Marti A. Hearst SIMS 202, Fall 1997 Informal Interface n Informal does not mean less useful n Show how the search is n unfolding or evolving n expanding or contracting n Prompt the user to n reformulate and abandon plans n backtrack to points of task deferral n make side-by-side comparisons n define and discuss problems

Marti A. Hearst SIMS 202, Fall 1997 SketchTrieve: An Informal Interface (Hendry & Harper 96, 97) n A “spreadsheet” for information access n Make use of layout, space, and locality n comprehension and explanation n search planning n A data-flow notation for information seeking n link sources to queries n link both to retrieved documents n align results in space for comparison

Marti A. Hearst SIMS 202, Fall 1997 Connecting Results with Next Query

Marti A. Hearst SIMS 202, Fall 1997 Improving User Interfaces n The surface has only been scratched n This is where the big improvements will happen (in my opinion).