The Perfect Search Engine Is Not Enough Jaime Teevan †, Christine Alvarado †, Mark S. Ackerman ‡ and David R. Karger † † MIT, CSAIL ‡ University of Michigan
Let Me Interview You! – What’s the last you read? What did you do with it? – Have you gone back to an you’ve read before? Web: Files: – What’s the last Web page you visited? How did you get there? – Have you looked for anything on the Web? – What’s the last file you looked at? How did you get to it? – Have you looked for a file?
Search Overview: Understanding Introduction Related work Methodology What we learned – How? – Why? – Who? – So what? Directed Introduction Related work Methodology What we learned – How? – Why? – Who? – So what?
Haystack: Personal Information Storage Web pages Files Calendar Contacts Haystack
Directed Search in Haystack What was that paper I read last week about Information Retrieval? Haystack
Directed Search in Haystack Ah yes! Thank you. Haystack “Perfect Search Engine”
Related Work Directed search – Lab studies [Capra03, Maglio97] – Log analysis [Broder02, Spink01] Observational studies [Malone83] Information Seeking – Marchionini, O’Day and Jeffries, Bates, Belkin, … – Evolving information need
Modified Diary Study Subjects: 15 CS graduate students Ten interviews each (2/day x 5 days) Two question types – Last /file/Web page looked at – Last /file/Web page looked for Supplemented with direct observation and an hour-long semi-structured interview
DirectedSearch Overview: Understanding Introduction Related work Methodology What we learned – How? – Why? – Who? – So what?
Directed Search Today Target: Connie Monroe’s office number Type into a search engine: “Connie Monroe, office number”
What We Observed Interviewer: Have you looked for anything on the Web today? Jim: I had to look for the office number of the Harvard professor. I: So how did you go about doing that? J: I went to the homepage of the Math department at Harvard
What We Observed I: So you went to the Math department, and then what did you do over there? J: It had a place where you can find people and I went to that page and they had a dropdown list of visiting faculty, and so I went to that link and I looked for her name and there it was.
What We Observed J: I knew that she had a very small Web page saying, “I’m here at Harvard. Here’s my contact information.”
Strategies Looking for Information Teleporting Orienteering
Why Do People Orienteer? Easier than saying what you want You know where you are You know what you find The tools don’t work
Easier Than Saying What You Want Describing the target is hard – Can’t – Prefer not to Habit – “Whichever way I remember first.” Search for source – E.g., Your last search
You Know Where You Are Stay in known space – URL manipulation – Bookmarks – History Backtracking – Following an information scent – Never end up at a dead end
You Know What You Find Context gives understanding of answer “I was looking for a specific file. But even when I saw its name, I wouldn’t have known that that was the file I wanted until I saw all of the other names in the same directory…” Understanding negative results “I basically clicked on every single button until I was convinced… I don’t think that it exists…”
Individual Search Behavior Search behavior varied by individual Categorize based on usage – Filers – Pilers People who pile information take small steps People who file information take big steps
How Individuals Search For Files Filers Pilers Big steps Small steps
More to Learn from the Data Differences in finding v. re-finding How organization relates to search Importance of type ( , files and Web) Looked at v. looked for Keep in mind population
Support orienteering Applying What We Learned Advantages to orienteering – Easier than saying what you want – You know where you are – You know what you find Individual differences in step size – Highlight source (e.g., flag sources with info) – Integrate tools used for steps – Support exhaustive search – Allow for different step sizes
More to Learn from the Data Differences in finding v. re-finding How organization relates to search Importance of type ( , files and Web) Looked at v. looked for Keep in mind population
Structural Consistency Important All must be the same to re-find the information!
Supports orienteering for re-finding Allows access to new information Preserve What User Remembers
File or Pile Filer Piler
Searching Other Collections Ah yes! Thank you.
Keep Population in Mind CS grad students not representative Very familiar with search tools Would expect to see lots of tool use
Orienteer to specific information Relating How and What People only keyword search 39% of the time What people look for related to how they look SpecificGeneralDocument Other Keyword Surprise:
Relating How and Corpus and files: Almost never keyword searched Easy to associate information with document Web: Used keyword search much more often FilesWeb Other Keyword
Relating What and Corpus FilesWeb Specific39733 General10730 Document searches were primarily for specific information File searches were primarily for documents Web searches were more evenly distributed