IDM 2003 Workshop Stuff I’ve Seen: Susan Dumais Microsoft Research A System for Personal Information Retrieval and.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

ACL/HLT – June 18, 2008 Using Context to Support Searchers in Searching Susan Dumais Microsoft Research
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
For Details Visit : or For any Help Contact the Librarian EBSCOhost 2.0.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Authentication Administration Storage Compliance Authentication Administration Storage Compliance Audio Conferencing and Calendaring .
An introduction to Cambridge Collections Online… Full online access to collections of classic and newly- published scholarly titles in PDF format Contains.
Elibrary.worldbank.org World Bank eLibrary User Guide Take full advantage of your eLibrary subscription!
Page 1 of 29 Net-Scale Technologies, Inc. Network Based Personal Information and Messaging Services Urs Muller Beat Flepp
DEV392: Extending SharePoint Products And Technologies Through Web Parts And ASP.NET Clint Covington, Program Manager Data And Developer Services - Office.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Information Retrieval in Practice
ISP 433/533 Week 8 IR in libraries. Goal Universal Access to Information Vannevar Bush 1945 article Memex A memex is a device in which an individual stores.
Case study - usability evaluation Howell Istance.
WinFS. Overview of WinFS WinFS stands for Windows Future storage. WinFS is the code name of a Windows storage subsystem, being developed by Microsoft.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Oct 19, 2005.
Transient Life: Collecting and sharing personal information Stephanie Smale and Saul Greenberg University of Calgary, Canada.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Information & Library Services SwetsWise User Guide Emma Crowley Senior Academic Services Librarian
We are partners in learning.. Note: Office 365 works best in Internet Explorer V 9 or above. Some features do not work in PWCS’s Chrome Browser or in.
Stuff I’ve Seen: A System for Personal Information Retrieval and Re-use by Seher Acer Elif Demirli Susan Dumais, Edward Cutrell, JJ Cadiz, Gavin Jancke,
Search on Journal of Dairy Science ® An Overview April
Overview of Search Engines
The Public Library Catalogue as a Social Space: A Case Study of Social Discovery Systems in Two Canadian Public Libraries Louise Spiteri. School of Information.
Introducing Microsoft Lync 2010 Connect and Collaborate.
The Future of Search Brett Roberts Chief Technology Officer Microsoft NZ.
By Kyle Rector Senior, EECS, OSU. Agenda Background My Approach Demonstration How it works The Survey Plans for User Evaluation Future Plans.
AppExchange Partner Academy- Building Your Application Listing By Jesse Dailey.
2012 National BDPA Technology Conference Creating Rich Data Visualizations using the Google API Yolanda M. Davis Senior Software Engineer AdvancED August.
Software All parts of the computer people can NOT touch, such as programs, files, documents and any other data.
Classroom User Training June 29, 2005 Presented by:
Database Types of database programs Charles w. Bachman Well- Designed Databases Database Management Systems Types of database programs Daabase Techniques.
Search Engines and Information Retrieval Chapter 1.
CIS 375—Web App Dev II Microsoft’s.NET. 2 Introduction to.NET Steve Ballmer (January 2000): Steve Ballmer "Delivering an Internet-based platform of Next.
Sackler – May 11, 2003 Organizing Search Results Susan Dumais Microsoft Research.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Personal Information Management Vitor R. Carvalho : Personalized Information Retrieval Carnegie Mellon University February 8 th 2005.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
PTT GSP Knowledge Management System User Training Ekkarin Sereechuenpojit System Engineer Infrastructure Solutions Wannee Govitsutthisak System Engineer.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
Search and Navigation Based on the paper, “Improved Search Engines and Navigation Preference in Personal Information Management” Ofer Bergman, Ruth Beyth-Marom,
WISER : OxLIP+ Workshops in Information Skills and Electronic Research Oxford Libraries Information Platform Craig Finlay Gillian Beattie.
0 SharePoint Search 2013 Rafael de la Cruz SharePoint Developer Seneca Resources twitter.com/delacruz_rafael
인지구조기반 마이닝 소프트컴퓨팅 연구실 박사 2 학기 박 한 샘 2006 지식기반시스템 응용.
Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.
Developer TECH REFRESH 15 Junho 2015 #pttechrefres h Understand your end-users and your app with Application Insights.
Getting Started with SharePoint 2010 Gareth Johns IT Skills Development Advisor.
R7 Integrator and Enterprise Integrator: You won’t believe this is XA… Deborah Vermillion, VP Consulting Services, CPIM, CIRM Belinda Daub, Senior Consultant.
CiteSight: Contextual Citation Recommendation with Differential Search Avishay Livne 1, Vivek Gokuladas 2, Jaime Teevan 3, Susan Dumais 3, Eytan Adar 1.
We facilitate access to information We facilitate access to information
Unplugged FAST meets SharePoint (FS4SP)
WIRED Future Quick review of Everything What I do when searching, seeking and retrieving Questions? Projects and Courses in the Fall Course Evaluation.
Introducing Microsoft Lync 2010 Connect and Collaborate.
Microsoft Office 2013 Try It! Chapter 4 Storing Data in Access.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
CONFIDENTIAL Overview NTP Software Object Store and Cloud Connector™ (OSCC™) has a carefully structured architecture that includes a number of collaborative.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Vannevar Bush: As we may think. Consider a future device for individual use, which is a sort of mechanized private file and library. It needs a name,
21st ACM Symposium on Operating Systems Principles, Oct 2007 DejaView: A Personal Virtual Computer Recorder.
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
AdisInsight User Guide July 2015
Information Retrieval in Practice
Summon® 2.0 Discovery Reinvented
Search Engine Architecture
SIS: A system for Personal Information Retrieval and Re-Use
Communication and Information Resource Centre Administrator
Introduction to Information Retrieval
Haystack: an Adaptive Personalized Information Retrieval System
Gizem MISIRLI Gülden OLGUN
Presentation transcript:

IDM 2003 Workshop Stuff I’ve Seen: Susan Dumais Microsoft Research A System for Personal Information Retrieval and Re-Use

IDM 2003 Workshop Outline Search today Search with Stuff I’ve Seen (SIS) With: Edward Cutrell, JJ Cadiz, Gavin Jancke, Raman Sarin, Daniel Robbins Experiences with SIS Deployment Usage data UI innovations Next steps for SIS

IDM 2003 Workshop Search Today … Many locations, interfaces for finding things (e.g., web, mail, local files, help, history, notes) Often slow “… the No.1 question we're trying to solve [in Longhorn] is ‘Where's my stuff?’ Right now, file space on any PC is a cesspool. “ Bill Gates, FORTUNE interview, June 23, 2002

IDM 2003 Workshop Search With SIS Unified index of stuff you’ve seen All types of information, e.g., files of all types, , calendar, contacts, web pages, etc. Full-text index of content plus metadata attributes (e.g., creation time, author, title, size) Automatic and immediate update of index Rich UI possibilities, since it’s your content  Get back to information you’ve seen  Re-use vs. initial discovery

IDM 2003 Workshop Related Work Several systems for improving access for specific sources (e.g., web, mail, files, photos, music) Some integration across sources KFTF [Jones et al., 2002] Lifestreams/Scopeware [Fertig, Freeman, Gelernter, 1996] MyLife Bits [Gemmell et al., 2002] Haystack [Adar et al., 1999; Huynh et al. 2002] Commercial products OS: Mac Sherlock, Windows Indexing Service Apps: Enfish, retriever, dtSearch, X1, etc. What’s new with SIS … Full content and metadata for many different sources Extensible architecture Usage experiences and experimental data UI focus

IDM 2003 Workshop SIS Architecture Indexing infrastructure uses MS Search components (note: IR platform) Gatherer – interface to content sources, e.g., files, http, MAPI Filters – decode different file types, e.g., word, powerpoint, html, pdf, journal Tokenizer – break into words, including date normalization, stemming, etc. Indexer – standard inverted index Retriever – Boolean, best match (Okapi) User interface Client side indexing and storage

IDM 2003 Workshop SIS Design Principles Indexing … No additional work is required User sees something, and it gets indexed Retrieval … Fast, flexible Interactive refinement Sort and filter on metadata Note: Sort/filter automatically triggers query UI experiments Previews, Top/Side, Previews, Richer visualizations Richer visualizations

IDM 2003 Workshop SIS Demo

IDM 2003 Workshop SIS Demo Points Search Fast Integrates content from many places Search by full-text or properties, including null queries Sort and filter results Update index in real time, with no explicit user action ?Right-click and other advanced functionality ?Saved queries, queries from other apps, IQ UI alternatives Top/Side Preview/Not Default sort order

IDM 2003 Workshop Evaluating SIS Internal deployment ~1500 downloads Users include: program management, test, sales, development, administrative, executives, etc. Research techniques Free-form feedback Questionnaires; Structured interviews Usage patterns from log data UI experiments (randomly deploy different versions) Lab studies for richer UI (e.g., timeline, trends) But even here must work with users’ own content

IDM 2003 Workshop Top vs. Side Views Previews vs. Not Sort By Date vs. Rank

IDM 2003 Workshop SIS Usage Data Detailed analysis for 234 people, 6 weeks usage Personal store characteristics 5k – 100k items; index <150 meg Query characteristics Short queries (1.59 words) Few advanced operators or fielded search in query box (7.5%) Frequent use of query iteration (48%) 50% refined queries involve filters – type, date most common 35% refined queries involve changes to query 13% refined queries involve re-sort Query content Vs. Spink et al.’s analysis of web queries Importance of people 29% of the queries involve people’s names

IDM 2003 Workshop SIS Usage Data, cont’d Characteristics of items opened File types opened 76% 14% Web pages 10% Files Age of items opened 7% today 22% within the last week 46% within the last month Ease of finding information Easier after SIS for web, , files Non-SIS search decreases for web, , files Log(Freq) = * log(DaysSinceSeen)

IDM 2003 Workshop SIS Usage, cont’d UI Usage Small effects of Top/Side, Previews Sort order Date by far the most common sort field, even for people who had Okapi Rank as default Importance of time Few searches for “best” match; many other criteria … Number of Queries Issued

IDM 2003 Workshop SIS Usage, cont’d Observations about unified access Metadata quality is variable rich, pretty clean Web: little, not very useful for retrieval Files: some, but often wrong Human annotation: don’t depend on it … Need abstractions, e.g., “Useful date” Initially, used ‘date seen’ But … Appointment, when it happens and Web, seen Files, changed What do people remember about time? Memory landmarks

IDM 2003 Workshop SIS, Timeline w/ Landmarks Timeline interface Timeline interface Augmented with landmarks as pointers into human memory Augmented with landmarks as pointers into human memory General: holidays, world events General: holidays, world events Personal: important photos, appointments Personal: important photos, appointments Heuristics or Bayesian models to identify memorable events Heuristics or Bayesian models to identify memorable events

IDM 2003 Workshop SIS, Timeline w/ Landmarks Search ResultsDistribution of Results Over Time Memory Landmarks - General (world, calendar) - Personal (appts, photos)

IDM 2003 Workshop SIS, Timeline Experiment Dates OnlyLandmarks + Dates Search Time (s) With Landmarks Without Landmarks

IDM 2003 Workshop SIS, Visualizing Trends Summarize the results of a search Summarize the results of a search Abstraction beyond individual results Abstraction beyond individual results Grid-based design Grid-based design Axes represent topic, time, people Axes represent topic, time, people Cells encode frequency, recency Cells encode frequency, recency Supports activities like: Supports activities like: What newsgroups are active (on topic x)? What newsgroups are active (on topic x)? What people are active, authoritative (on topic x)? What people are active, authoritative (on topic x)? When did I last interact w/ people? When did I last interact w/ people?

IDM 2003 Workshop SIS, Visualizing Trends

IDM 2003 Workshop SIS, Grid vs. List Experiment Grid View List View

IDM 2003 Workshop Next Steps Continue explorations of rich UI Augment index with “usage” data SIS as service, with many entry points “Contextualize” retrieval Retrieve using Implicit Queries Identify Stuff I Should See Flat-land Good search makes filing less important Attributes rather than directory locations

IDM 2003 Workshop SIS Summary Unified index of stuff you’ve seen Fast access to full-text and metadata Heterogeneous content: files, , web, etc. Automatic and immediate update of index Studied usage with several techniques Ease of finding improves with SIS Importance of people and time Short queries, quick iteration Novel UI to leverage personal memories New capabilities for personal information management More info,

IDM 2003 Workshop Vannevar Bush’s Vision Consider a future device for individual use, which is a sort of mechanized private file and library. It needs a name, and, to coin one at random, "memex" will do. A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility. It is an enlarged intimate supplement to his memory. Consider a future device for individual use, which is a sort of mechanized private file and library. It needs a name, and, to coin one at random, "memex" will do. A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility. It is an enlarged intimate supplement to his memory. V. Bush (1945). As we may think. Atlantic Monthly, 176, July 1945,