ACL/HLT – June 18, 2008 Using Context to Support Searchers in Searching Susan Dumais Microsoft Research

Slides:



Advertisements
Similar presentations
Ken Hinckley, Shengdong Zhao, Raman Sarin, Patrick Baudisch, Edward Cutrell, Michael Shilman & Desney Tan oronto.
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
SCOPUS Searching for Scientific Articles By Mohamed Atani UNEP.
Office 2007 November 28, 2006 Sam Nasr Software Engineer Berbee Information Networks.
Introduction Lesson 1 Microsoft Office 2010 and the Internet
For Details Visit : or For any Help Contact the Librarian EBSCOhost 2.0.
Business Development Suit Presented by Thomas Mathews.
Microsoft Office Illustrated Fundamentals Unit C: Getting Started with Unit C: Getting Started with Microsoft Office 2010 Microsoft Office 2010.
Services Course Windows Live SkyDrive Participant Guide.
Macromedia Dreamweaver MX 2004 – Design Professional Dreamweaver GETTING STARTED WITH.
® Microsoft Office 2010 Browser and Basics.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
South Dakota Library Network MetaLib User Interface South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD © South Dakota.
Authentication Administration Storage Compliance Authentication Administration Storage Compliance Audio Conferencing and Calendaring .
FindAll: A Local Search Engine for Mobile Phones Aruna Balasubramanian University of Washington.
Personalization and Search Jaime Teevan Microsoft Research.
IDM 2003 Workshop Stuff I’ve Seen: Susan Dumais Microsoft Research A System for Personal Information Retrieval and.
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
Search Engines and Information Retrieval
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation. All.
1 of 5 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Transient Life: Collecting and sharing personal information Stephanie Smale and Saul Greenberg University of Calgary, Canada.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Stuff I’ve Seen: A System for Personal Information Retrieval and Re-use by Seher Acer Elif Demirli Susan Dumais, Edward Cutrell, JJ Cadiz, Gavin Jancke,
Overview of Search Engines
1 of 7 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Section 2: Finding and Refinding Jaime Teevan Microsoft Research 1.
By Kyle Rector Senior, EECS, OSU. Agenda Background My Approach Demonstration How it works The Survey Plans for User Evaluation Future Plans.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Crystal Hoyer Program Manager IIS Team Preview of features that will be announced at MIX09 Please do not blog, take pictures or video of session.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Search Engines and Information Retrieval Chapter 1.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Facets of Personalization Jaime Teevan Microsoft Research (CLUES) with S. Dumais, E. Horvitz, D. Liebling, E. Adar, J. Elsas, R. Hughes.
Ken Hinckley, Shengdong Zhao, Raman Sarin, Patrick Baudisch, Edward Cutrell, Michael Shilman & Desney Tan oronto r.
Sackler – May 11, 2003 Organizing Search Results Susan Dumais Microsoft Research.
COMPREHENSIVE Windows Tutorial 4 Working with the Internet and .
Personal Information Management Vitor R. Carvalho : Personalized Information Retrieval Carnegie Mellon University February 8 th 2005.
Research & Learning For Libraries and Patrons that need to stay Ahead of the Learning Curve Presenter Name Here Books24x7® for Libraries.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
0 SharePoint Search 2013 Rafael de la Cruz SharePoint Developer Seneca Resources twitter.com/delacruz_rafael
인지구조기반 마이닝 소프트컴퓨팅 연구실 박사 2 학기 박 한 샘 2006 지식기반시스템 응용.
Discovering and Using Groups to Improve Personalized Search Jaime Teevan, Merrie Morris, Steve Bush Microsoft Research.
Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.
Personalizing Search Jaime Teevan, MIT Susan T. Dumais, MSR and Eric Horvitz, MSR.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
2004/051 >> Supply Chain Solutions That Deliver Users.
User Guide Enhanced Knowledge Hub. 2 Note Accessing Knowledge Hub 1 2 Access K-Hub by selecting: 1.Knowledge Hub tab, OR 2.Knowledge Hub under My Communities.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Microsoft Office System UK Developers Conference Radisson Edwardian, Heathrow 29 th & 30 th June 2005.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Fishing for Information InkSeine is a tool for thought that lets you fish for useful information directly from your ink notes. Fishermen catching salmon.
SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.
Summon® 2.0 Discovery Reinvented
Newly updated World eBooks
SIS: A system for Personal Information Retrieval and Re-Use
William Jones, Harry Bruce
Presentation transcript:

ACL/HLT – June 18, 2008 Using Context to Support Searchers in Searching Susan Dumais Microsoft Research

ACL/HLT – June 18, 2008 Search Today User Context Task/Use Context Query Words Ranked List Query Words Ranked List Using Context to Support Searchers Document Context

ACL/HLT – June 18, 2008 Web Info through the Years Number of pages indexed Number of pages indexed 7/94 Lycos – 54,000 pages 7/94 Lycos – 54,000 pages 95 – 10^6 millions 95 – 10^6 millions 97 – 10^7 97 – 10^7 98 – 10^8 98 – 10^8 01 – 10^9 billions 01 – 10^9 billions 05 – 10^10 … 05 – 10^10 … Types of content Types of content Web pages, newsgroups Web pages, newsgroups Images, videos, maps Images, videos, maps News, blogs, spaces News, blogs, spaces Shopping, local, desktop Shopping, local, desktop Books, papers Books, papers Health, finance, travel … Health, finance, travel … Whats available How its accessed

ACL/HLT – June 18, 2008 Some Support for Searchers The search box The search box Spelling suggestions Spelling suggestions Query suggestions Query suggestions Advanced search operators and options (e.g.,, +/-, site:, language:, filetype:, intitle:) Advanced search operators and options (e.g.,, +/-, site:, language:, filetype:, intitle:) Richer snippets Richer snippets But, we can do better … using context But, we can do better … using context

ACL/HLT – June 18, 2008 Key Contexts Users: Users: Individual, group (topic, time, location, etc.) Individual, group (topic, time, location, etc.) Short-term or long-term models Short-term or long-term models Explicit or implicit capture Explicit or implicit capture Documents/Domains: Documents/Domains: Document-level metadata, usage/change patterns Document-level metadata, usage/change patterns Relations among documents Relations among documents Tasks/Uses: Tasks/Uses: Information goal – Navigational, fact-finding, informational, monitoring, research, learning, social, etc. Information goal – Navigational, fact-finding, informational, monitoring, research, learning, social, etc. Physical setting – Device, location, time, etc. Physical setting – Device, location, time, etc.

ACL/HLT – June 18, 2008 Using Contexts Identify: Identify: What context(s) are of interest? What context(s) are of interest? Accommodate: Accommodate: What do we do differently for different contexts? What do we do differently for different contexts? Outcome (Q|context) >> Outcome (Q) Outcome (Q|context) >> Outcome (Q) Influence points within the search process Influence points within the search process Articulating the information need Articulating the information need Initial query, subsequent interaction/dialog Initial query, subsequent interaction/dialog Selecting and/or ranking content Selecting and/or ranking content Presenting results Presenting results Using and sharing results Using and sharing results

ACL/HLT – June 18, 2008 Context in Action Research prototypes: provide insights about algorithmic, user experience, and policy challenges User Contexts: User Contexts: Finding and Re-Finding (Stuff Ive Seen) Finding and Re-Finding (Stuff Ive Seen) Personalized Search (PSearch) Personalized Search (PSearch) Novelty in News (NewsJunkie) Novelty in News (NewsJunkie) Document/Domain Contexts: Document/Domain Contexts: Metadata and search (Phlat) Metadata and search (Phlat) Visualizing patterns in results (GridViz) Visualizing patterns in results (GridViz) Task/Use Contexts: Task/Use Contexts: Pages as context (Community Bar, IQ) Pages as context (Community Bar, IQ) Richer collections as context (NewsJunkie, PSearch) Richer collections as context (NewsJunkie, PSearch) Working, understanding, sharing (SearchTogether, InkSeine) Working, understanding, sharing (SearchTogether, InkSeine)

ACL/HLT – June 18, 2008 SIS: Stuff Ive Seen Unified index of stuff youve seen Many info silos (e.g., files, , calendar, contacts, web pages, rss, im) Unified index, not storage Index of content and metadata (e.g., time, author, title, size, access) Re-finding vs. finding Vista Desktop Search (and Live Toolbar) Dumais et al., SIGIR 2003 Stuff Ive Seen Windows Live- DS Also, Spotlight, GDS, X1, …

ACL/HLT – June 18, 2008 SIS Demo

ACL/HLT – June 18, 2008 SIS Usage Experiences Internal deployment ~3000 internal Microsoft users Analyzed: Free-form feedback, Questionnaires, Structured interviews, Log analysis (characteristics of interaction), UI expts, Lab expts Personal store characteristics 5k – 500k items Query characteristics Short queries (1.6 words) Few advanced operators or fielded search in query box (~7%) Many advanced operators and query iteration in UI (48%) Filters (type, date); modify query; re-sort results

ACL/HLT – June 18, 2008 Importance of people, time, and memory People 25% of queries contained names People in roles (to:, from:) vs. people as entities in text Log(Freq) = * log(DaysSinceSeen) Time Age of items opened 5% today; 21% last week 50% of the cases in 36 days Web (11); Mail (36); Files (55) Date most common sort field, even when Rank was the default Support for episodic memory Few searches for best topical match … many other criteria SIS Usage Data, contd

ACL/HLT – June 18, 2008 SIS Usage Data, contd Observations about unified access Metadata quality is variable rich, pretty clean Web: little, available to application Files: some, but often wrong Memory depends on abstractions Useful date is dependent on the object ! Appointment, when it happens File, when it is changed and Web, when it is seen People attribute vs. contains To, From, Cc, Attendee, Author, Artist

ACL/HLT – June 18, 2008 Ranked list vs. Metadata (for personal content) Why Rich Metadata? People remember many attributes in re-finding - Often: time, people, file type, etc. - Seldom: only general overall topic Rich client-side interface - Support fast iteration/refinement - Fast filter-sort-scroll vs. next-next-next

ACL/HLT – June 18, 2008 Re-finding on the Web 50-80% URL visits are revisits 50-80% URL visits are revisits 30-40% of queries are re-finding queries 30-40% of queries are re-finding queries Teevan et al., SIGIR 2007

ACL/HLT – June 18, 2008 Cutrell et al., CHI 2006 Shell for WDS; publically available Shell for WDS; publically available Features: Features: Search / Browse (faceted metadata) Search / Browse (faceted metadata) Unified Tagging Unified Tagging In-Context Search In-Context Search Phlat: Search and Metadata

ACL/HLT – June 18, 2008 Phlat: Faceted metadata Tight coupling of search and browse Tight coupling of search and browse Q Results & Q Results & Associated metadata w/ query previews Associated metadata w/ query previews 5 default properties to filter on (extensible) 5 default properties to filter on (extensible) Includes tags Includes tags Property filters integrated with query Property filters integrated with query Query = words and/or properties Query = words and/or properties No stuck filters No stuck filters Search == Browse Search == Browse

ACL/HLT – June 18, 2008 Phlat: Tagging Apply a single set of user-generated tags to all content (e.g., files, , web, rss, etc.) Tagging interaction Tag widget or drag-to-tag Tag structure Allow but do not require hierarchy Tag implementation Tags directly associated with files as NTFS or MAPI properties

ACL/HLT – June 18, 2008 Phat: In-Context Search Selecting a result … Linked view to show associated tags Rich actions Open, drag-drop, etc. Pivot on metadata Sideways search Refine or replace query

ACL/HLT – June 18, 2008 Phlat Phlat shell for Windows Desktop Search Tight coupling of searching/browsing Rich faceted metadata support Including unified tagging across data types In-context search and actions Download:

ACL/HLT – June 18, 2008 Web Search using Metadata Many queries include implicit metadata Many queries include implicit metadata portrait of barak obama portrait of barak obama recent news about midwest floods recent news about midwest floods good painters near redmond good painters near redmond starbucks near me starbucks near me overview of high blood pressure overview of high blood pressure … Limited support for users to articulate this Limited support for users to articulate this

ACL/HLT – June 18, 2008 Search in Context Search is not the end goal … Support information access in the context of ongoing activities (e.g., writing talk, finding out about, planning trip, buying, monitoring, etc.) Search always available Search from within apps (keywords, regions, full doc) Show results within app Csikszentmihalyi) Maintains flow (Csikszentmihalyi) Can improve relevance

ACL/HLT – June 18, 2008 Documents as (a simple) Context Recommendations Recommendations People who bought this also bought … People who bought this also bought … Contextual Ads Contextual Ads Ads relevant to page Ads relevant to page Community Bar Community Bar Notes, Chat, Tags, Inlinks, Queries Notes, Chat, Tags, Inlinks, Queries Implict Queries (IQ) Implict Queries (IQ) Also Y!Q, Watson, Rememberance Agent Also Y!Q, Watson, Rememberance Agent Proactive query specification depending on current document content and activities

ACL/HLT – June 18, 2008 Background search on top k terms, based on users index Score = tf doc / log(tf corpus +1) Quick links for People and Subject. Top matches for this Implicit Query (IQ). Document Contexts (Implicit Query, IQ ) Dumais et al., SIGIR 2004 Proactively find info related to item being read/created Proactively find info related to item being read/created Quick links Quick links Related content Related content Challenges Challenges Relevance, fine Relevance, fine When to show? (useful) When to show? (useful) How to show? (peripheral awareness) How to show? (peripheral awareness)

ACL/HLT – June 18, 2008 InkSeine: Search in Context Tablet PC application for active note taking Tablet PC application for active note taking Unifies ink, search and gather functions into a fluid workflow Unifies ink, search and gather functions into a fluid workflow Note taking, enriched w/: Note taking, enriched w/: Search from ink Search from ink Show results in app Show results in app Integrate results, links and clippings into notes Integrate results, links and clippings into notes Maintain work flow Maintain work flow Inking for thinking Inking for thinking Hinckley et al., CHI 2007

ACL/HLT – June 18, 2008 PSearch: Personalized Search (Even Richer Context) Today: People get the same results, independent of current session, previous search history, etc. PSearch: Uses rich client-side info to personalize results Teevan et al., SIGIR 2005 Building a user profile Personalized ranking When to personalize? How to personalize display? ACM SIGIR Special Interest Group on Information Retrieval Home Page Welcome to the ACM SIGIR Web site … SIGIR thanks Doug Oard, Bill Hersh, David Carmel, Noriko Kando, Diane Kelly… Get ready for SIGIR 2008! sigir.org

ACL/HLT – June 18, 2008 Building a User Profile Type of information: Type of information: – Explicit: Judgments, categories – Content: Past queries, web pages, desktop – Behavior: Visited pages, dwell time Time frame: Short term, long term Time frame: Short term, long term Who: Individual, group Who: Individual, group Where the profile resides: Where the profile resides: – Local: Richer profile, improved privacy – Server: Richer communities, portability PSearch

ACL/HLT – June 18, 2008 Personalized Ranking Personalized Ranking Personal Rank = f(Cont, Beh, Web) Personal Rank = f(Cont, Beh, Web) Pers_Content Match: sim(result, user_content_profile) Pers_Content Match: sim(result, user_content_profile) Pers_Behavior Match: visited URLs Pers_Behavior Match: visited URLs Web Match: web rank Web Match: web rank

ACL/HLT – June 18, 2008 N Personalized Search (Matching: Relevance Feedback + Interaction) nini riri R (r i +0.5)(N-n i -R+r i +0.5) (n i -r i +0.5)(R-r i +0.5) w i = log Score = Σ tf i * w i (N) (n i ) w i = log (r i +0.5)(N-n i -R+r i +0.5) (n i -r i +0.5)(R-r i +0.5) w i = log riri R Where: N = N+R, n i = n i +r i World Client

ACL/HLT – June 18, 2008 When to Personalize? Personal ranking Personal relevance (explicit or implicit) Group ranking Decreases as you add more people Gap is potential for personalization (p4p) Potential for Personalization Personalization works well for some queries, … but not for others Framework for understanding when to personalize

ACL/HLT – June 18, 2008 More Personalized Search PSearch - rich long-term context; single individual PSearch - rich long-term context; single individual Short-term session/task context Short-term session/task context Session analysis Session analysis Query: ACL, ambiguous in isolation Query: ACL, ambiguous in isolation Natural language … summarization … ACL Natural language … summarization … ACL Knee surgery … orthopedic surgeon … ACL Knee surgery … orthopedic surgeon … ACL Groups of similar people Groups of similar people Groups: Location, demographics, interests, behavior, etc. Groups: Location, demographics, interests, behavior, etc. Mei & Church (2008) Mei & Church (2008) H(URL) = 22.4 H(URL) = 22.4 Search: H(URL|Q) = 2.8 Search: H(URL|Q) = 2.8 Personalization: H(URL|Q, IP) = 1.2 Personalization: H(URL|Q, IP) = 1.2 Many models … smooth individual, group, global models Many models … smooth individual, group, global models

ACL/HLT – June 18, 2008 Beyond Search - Gathering Info Support for more than retrieving documents Retrieve -> Analyze -> Use Lightweight scratchpad or workspace support Iterative and evolving nature of search Resuming at a later time or on other device Sharing with others ScratchPad

ACL/HLT – June 18, 2008 SearchTogether Collaborative web search prototype Sync. or async. sharing w/ others or self Collaborative search tasks E.g., Planning travel, purchases, events; understanding medical info; researching joint project or report Today little support links, instant messaging, phone SearchTogether adds support for Awareness (history, metadata) Coordination (IM, recommend, split) Persistence (history, summaries) SearchTogether Morris et al., UIST 2007 Beyond Search – Sharing & Collaborating

ACL/HLT – June 18, 2008 Looking Ahead … Continued advances in scale of systems, diversity of resources, ranking, etc. Tremendous new opportunities to support searchers by Understanding user intent Modeling user interests and activities over time Representing non-content attributes and relations Supporting the search process Developing interaction and presentation techniques that allow people to better express their information needs Supporting understanding, using, sharing results Considering search as part of richer landscape

ACL/HLT – June 18, 2008 Using Context to Support Searchers User Context Document Context Task/Use Context Query Words Ranked List Think Outside the IR Box(es)

ACL/HLT – June 18, 2008 Thank You ! Questions/Comments … More info, Windows Live Desktop Search, Phlat, Search Together,

ACL/HLT – June 18, 2008 Stuff Ive Seen S. T. Dumais, E. Cutrell, J. J. Cadiz, G. Jancke, R. Sarin & D. C. Robbins (2003). Stuff I've Seen: A system for personal information retrieval and re-use. SIGIR 2003.Stuff I've Seen: A system for personal information retrieval and re-use. Download: and Vista Search Phlat E. Cutrell, D. C. Robbins, S. T. Dumais & R. Sarin (2006). Fast, flexible filtering with Phlat - Personal search and organization made easy. CHI 2006.Fast, flexible filtering with Phlat - Personal search and organization made easy. Download: Memory Landmarks M. Ringel, E. Cutrell, S. T. Dumais & E. Horvitz (2003). Milestones in time: The value of landmarks in retrieving information from personal stores. Interact 2003.Milestones in time: The value of landmarks in retrieving information from personal stores. Personalized Search J. Teevan, S. T. Dumais & E. Horvitz (2005). Personalizing search via automated analysis of interests and activities. SIGIR Implicit Queries S. T. Dumais, E. Cutrell, R. Sarin & E. Horvitz (2004). Implicit queries (IQ) for contextualized search. SIGIR Revisitation on Web J. Teevan, E. Adar, R. Jones & M. Potts (2007). Information re-retrieval. SIGIR InkSeine K. Hinckley, S. Zhao, R. Sarin, P Baudisch, E. Cutrell & M. Shilman (2007). InkSeine: In situ search for active note taking. CHI Download: Search Together M. Morris & E. Horvitz (2007). Search Together: An interface for collaborative web search. UIST Download: References