Faceted Metadata in Search Interfaces
Marti Hearst, UC Berkeley School of Information
This Research Supported by NSF IIS-9984741.


Search Engines: Technology, Society, and Business
Marti Hearst: UC Berkeley SIMS
Focus: Search and Navigation of Large Collections
Image Collections
E-Government Sites
Shopping Sites
Digital Libraries
Example: the University of California Library Catalog


Web Sites and Collections
A 2001 Forrester Research report showed that while 76% of firms rated search as “extremely important”, only 24% considered their Web site’s search to be “extremely useful”.
Johnson, K., Manning, H., Hagen, P.R., and Dorsey, M. Specialize Your Site's Search. Forrester Research, Cambridge, MA (Dec. 2001).

What do we want done differently?
Organization of results
Hints of where to go next
Flexible ways to move around
…
How to structure the information?

The Problem With Hierarchy
Forces a choice of one dimension vs. another
–Either you commit to one path,
–Or you have to provide many redundant combinations
Examples
–Each topic followed by all time periods followed by all locations AND
–Each topic followed by all locations followed by all time periods AND
–Each location followed by all topics followed by all time periods
… etc.
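
To make the redundancy concrete, here is a minimal Python sketch (not part of the talk) that enumerates every ordering a fixed hierarchy would need to offer for the three dimensions named on this slide. The function name `hierarchy_orderings` is an illustrative assumption.

```python
from itertools import permutations
from math import factorial

def hierarchy_orderings(dimensions):
    """Return every top-to-bottom ordering of the given dimensions."""
    return [" > ".join(order) for order in permutations(dimensions)]

dimensions = ["topic", "time period", "location"]
orderings = hierarchy_orderings(dimensions)
for path in orderings:
    print(path)

# The number of redundant hierarchies grows as n! in the number of dimensions.
assert len(orderings) == factorial(len(dimensions))
```

With only three dimensions there are already six separate hierarchies to build and maintain; a fourth dimension would raise that to twenty-four.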


How to Structure Information for Search and Browsing?
Hierarchy is too rigid
Full meaning is too complex
Hierarchical faceted metadata:
–A useful middle ground

What are facets?
Sets of categories, each of which describes a different aspect of the objects in the collection.
Each of these can be hierarchical.
(Not necessarily mutually exclusive or exhaustive, but often that is a goal.)
Example facets: Time/Date, Topic, Role, GeoRegion

Facet example: Recipes
Course: Main Course
Cooking Method: Stir-fry
Cuisine: Thai
Ingredient: Red Bell Pepper, Curry, Chicken
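
As a rough illustration of the recipe slide above, one item can be stored as a record whose facets each hold one or more labels. This is only a sketch: the record layout, the recipe title, and the helper `has_label` are assumptions made for the example and are not described in the talk.

```python
# Hypothetical record: one recipe described along four independent facets.
recipe = {
    "title": "Thai chicken curry stir-fry",  # illustrative title, not from the slide
    "facets": {
        "Course": ["Main Course"],
        "Cooking Method": ["Stir-fry"],
        "Cuisine": ["Thai"],
        "Ingredient": ["Red Bell Pepper", "Curry", "Chicken"],
    },
}

def has_label(item, facet, label):
    """True if the item carries the given label within the given facet."""
    return label in item["facets"].get(facet, [])

print(has_label(recipe, "Cuisine", "Thai"))
print(has_label(recipe, "Cuisine", "Chicken"))
```

Keeping each facet as its own key means a label is always interpreted within its facet: "Chicken" is an ingredient here, never a cuisine.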

Example of Faceted Metadata: Categories for Biomedical Journal Articles
1. Anatomy [A]: Lung
2. Organisms [B]: Mouse
3. Diseases [C]: Cancer
4. Chemicals and Drugs [D]: Tamoxifen

Goal: assign labels from facets

Motivation
Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off.
Nature > Animal > Mammal > Horse
Occupations > Cowboy
Clothing > Hats > Cowboy Hat
Media > Engraving > Wood Eng.
Location > North America > America
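
The hierarchical labels on this slide can be sketched as paths from a facet root down to the most specific category. The tuple representation and the helper `matches` below are assumptions for illustration, not the project's actual data model.

```python
# Each label is a path from a facet root down to the most specific category.
image_labels = [
    ("Nature", "Animal", "Mammal", "Horse"),
    ("Occupations", "Cowboy"),
    ("Clothing", "Hats", "Cowboy Hat"),
    ("Media", "Engraving", "Wood Eng."),
    ("Location", "North America", "America"),
]

def matches(labels, query_path):
    """True if any label lies at or below query_path in its facet hierarchy."""
    query_path = tuple(query_path)
    n = len(query_path)
    return any(label[:n] == query_path for label in labels)

print(matches(image_labels, ("Nature", "Animal")))
print(matches(image_labels, ("Nature", "Plant")))
```

A prefix match of this kind lets a user who selects a broad category such as Nature > Animal still reach items labeled with a more specific category beneath it.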

Motivation
Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off.
By using facets, what are we not capturing?
The hat flew off; the bandana stayed on.
The thong is part of the hat.
The bandana is on the cowboy (not the horse).
The saddle is on the horse (not the cowboy).

Hierarchical Faceted Metadata
A simplification of knowledge representation
Does not represent relationships directly
BUT can be understood well by many people when browsing rich collections of information.

How to Use in an Interface?
Users don’t like new search interfaces.
How to show lots of information without overwhelming or confusing?
There are many ways to do it wrong.
–Say I want unabridged nonfiction audiobooks
–Audible.com, BooksOnTape.com, and BrillianceAudio: no way to browse a given category and simultaneously select unabridged versions
–Amazon.com: has finally gotten browsing over multiple kinds of features working; this is a recent development, but it is still restricted in what can be added to the query


A Solution (The Flamenco Project)
Incorporating Faceted Hierarchical Metadata into Interfaces for Large Collections
Key Goals:
–Support integrated browsing and keyword search
  –Provide an experience of “browsing the shelves”
–Add power and flexibility without introducing confusion or a feeling of “clutter”
–Allow users to take the path most natural to them
Method:
–User-centered design, including needs assessment and many iterations of design and testing
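
One way to picture "integrated browsing and keyword search" is a single filter that accepts a keyword, a set of facet selections, or both. The sketch below assumes a simple in-memory item list; the function `search`, the field names, and the sample items are hypothetical and do not describe how Flamenco itself is built.

```python
def search(items, keyword=None, constraints=None):
    """Filter items by an optional keyword and optional facet constraints.

    constraints maps a facet name to a required label, e.g. {"Cuisine": "Thai"}.
    """
    constraints = constraints or {}
    results = []
    for item in items:
        # Keyword test: case-insensitive substring match on the item text.
        if keyword and keyword.lower() not in item["text"].lower():
            continue
        # Facet test: every selected label must be present on the item.
        if all(label in item["facets"].get(facet, ())
               for facet, label in constraints.items()):
            results.append(item)
    return results

# Made-up sample collection for illustration only.
items = [
    {"text": "Green curry with chicken",
     "facets": {"Cuisine": ["Thai"], "Course": ["Main Course"]}},
    {"text": "Chicken noodle soup",
     "facets": {"Cuisine": ["American"], "Course": ["Soup"]}},
    {"text": "Mango sticky rice",
     "facets": {"Cuisine": ["Thai"], "Course": ["Dessert"]}},
]

thai_chicken = search(items, keyword="chicken", constraints={"Cuisine": "Thai"})
print([item["text"] for item in thai_chicken])
```

Because the keyword and the facet selections are just two filters over the same result set, the user can start with either one and add the other at any point.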

Nobel Prize Winners Collection


Faceted Metadata Approach


Art History Images Collection


Information previews
Use the metadata to show where to go next
–More flexible than canned hyperlinks
–Less complex than full search
Help users see and return to previous steps
Reduces mental work
–Recognition over recall
–Suggests alternatives
More clicks are OK iff (J. Spool):
–The “scent” of the target does not weaken
–Users feel they are going towards, rather than away from, their target.
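
A common way to "use the metadata to show where to go next" is to count, for the current result set, how many items carry each facet label, and to display those counts beside the links. The following is a minimal sketch under that assumption; `facet_previews` and the sample results are illustrative, not taken from the talk.

```python
from collections import Counter

def facet_previews(results):
    """Count how many current results carry each (facet, label) pair."""
    counts = Counter()
    for item in results:
        for facet, labels in item["facets"].items():
            for label in labels:
                counts[(facet, label)] += 1
    return counts

# Made-up result set for illustration only.
results = [
    {"facets": {"Media": ["Engraving"], "Location": ["North America"]}},
    {"facets": {"Media": ["Engraving"], "Location": ["Europe"]}},
    {"facets": {"Media": ["Painting"], "Location": ["Europe"]}},
]

previews = facet_previews(results)
for (facet, label), count in sorted(previews.items()):
    print(f"{facet}: {label} ({count})")
```

Labels with a count of zero can simply be omitted, so every link offered leads to at least one result.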

What is Tricky About This?
It is easy to do it poorly
It is hard not to be overwhelming
–Most users prefer simplicity unless complexity really makes a difference
–Small details matter
It is hard to “make it flow”

Search Usability Design Goals
1. Strive for Consistency
2. Provide Shortcuts
3. Offer Informative Feedback
4. Design for Closure
5. Provide Simple Error Handling
6. Permit Easy Reversal of Actions
7. Support User Control
8. Reduce Short-term Memory Load
From Shneiderman, Byrd, & Croft, Clarifying Search, D-Lib Magazine, Jan

Most Recent Usability Study
Participants & Collection
–32 Art History Students
–~35,000 images from SF Fine Arts Museum
Study Design
–Within-subjects
  –Each participant sees both interfaces
  –Balanced in terms of order and tasks
–Participants assess each interface after use
–Afterwards they compare them directly
Data recorded in behavior logs, server logs, paper surveys; one or two experienced testers at each trial.
Used 9-point Likert scales.
Session took about 1.5 hours; pay was $15/hour

The Baseline System
Floogle
Take the best of the existing keyword-based image search systems

sword


Evaluation Quandary
How to assess the success of browsing?
–Timing is usually not a good indicator
–People often spend longer when browsing is going well.
  –Not the case for directed search
–Can look for comprehensiveness and correctness (precision and recall) …
–… But subjective measures seem to be most important here.
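
For reference, precision and recall, the two objective measures mentioned on this slide, can be computed from the set of items a participant retrieved and the set judged relevant. This is a generic sketch of the standard definitions, not the study's own analysis code; the function name and sample item identifiers are assumptions.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for two collections of item identifiers."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    # Precision: fraction of retrieved items that are relevant (correctness).
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: fraction of relevant items that were retrieved (comprehensiveness).
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

p, r = precision_recall(["a", "b", "c", "d"], ["a", "b", "e"])
print(f"precision={p:.2f} recall={r:.2f}")
```

In this made-up example two of the four retrieved items are relevant (precision 0.5), and two of the three relevant items were found (recall about 0.67).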

Hypotheses
We attempted to design tasks to test the following hypotheses:
–Participants will experience greater search satisfaction, feel greater confidence in the results, produce higher recall, and encounter fewer dead ends using FC over Baseline
–FC will be perceived to be more useful and flexible than Baseline
–Participants will feel more familiar with the contents of the collection after using FC
–Participants will use FC to create multi-faceted queries

Post-Test Comparison (Faceted vs. Baseline)
Overall Assessment:
–More useful for your tasks
–Easiest to use
–Most flexible
–More likely to result in dead ends
–Helped you learn more
–Overall preference
Which Interface Preferable For:
–Find images of roses
–Find all works from a given period
–Find pictures by 2 artists in same media

Post-Interface Assessments
All significant at p<.05 except “simple” and “overwhelming”

Perceived Uses of Interfaces (Baseline vs. FC)

Advantages of the Approach
Honors many of the most important usability design goals
–User control
–Provides context for results
–Reduces short term memory load
–Allows easy reversal of actions
–Provides consistent view
Allows different people to add content without breaking things
Can make use of standard technology

Advantages of the Approach
Systematically integrates search results:
–reflect the structure of the info architecture
–retain the context of previous interactions
Gives users control and flexibility
–Over order of metadata use
–Over when to navigate vs. when to search
Allows integration with advanced methods
–Collaborative filtering, predicting users’ preferences

Disadvantages
Does not model relations explicitly
Does it scale to millions of items?
–Adaptively determine which facets to show for different combinations of items
Requires faceted metadata!

Usability Studies
Usability studies done on 3 collections:
–Recipes: 13,000 items
–Architecture Images: 40,000 items
–Fine Arts Images: 35,000 items
Conclusions:
–Users like and are successful with the dynamic faceted hierarchical metadata, especially for browsing tasks
–Very positive results, in contrast with studies on earlier iterations.

Opportunities
New opportunity: Tagging, folksonomies
–(Flickr, del.icio.us)
–People are creating facets in a decentralized manner
–They are assigning multiple facets to items
–This is done on a massive scale
–This leads naturally to meaningful associations
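
One simple reading of "meaningful associations" is tag co-occurrence: two tags that are often applied to the same item are probably related. The sketch below counts co-occurring tag pairs over a made-up set of tagged items; it is an assumed illustration, not a method described in the talk.

```python
from collections import Counter
from itertools import combinations

def tag_cooccurrence(tagged_items):
    """Count how often each unordered pair of tags appears on the same item."""
    counts = Counter()
    for tags in tagged_items:
        # Sort so that each pair is always stored in the same order.
        for pair in combinations(sorted(set(tags)), 2):
            counts[pair] += 1
    return counts

# Made-up tag assignments for illustration only.
tagged_items = [
    ["sunset", "beach", "california"],
    ["beach", "california", "surfing"],
    ["sunset", "mountains"],
]

pairs = tag_cooccurrence(tagged_items)
print(pairs.most_common(1))
```

Pairs with high counts are candidates for grouping into the same facet or for suggesting related tags to the user.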


This Doesn’t Solve Everything
Harder to determine what’s related to more complex terms
Still not good for finding a recipe using potatoes


Linking Metadata Into Tasks
Old Yahoo restaurant guide combined:
–Region
–Topic (restaurants)
–Related Information
  –Other attributes (cuisines)
  –Other topics related in place and time (movies)

Green: restaurants & attributes
Red: related in place & time
Yellow: geographic region

Other Possible Combinations
Region + A&E
City + Restaurant + Movies
City + Weather
City + Education: Schools
Restaurants + Schools
…

Creating Tasks from HFM
Recipes Example:
–Click Ingredient > Avocado
–Click Dish > Salad
–Implies task of “I want to make a Dish type d with an Ingredient i that I have lying around”
–Maybe users will prefer to select tasks like these over navigating through the metadata.
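
The idea of equating a task with a sequence of facet types can be sketched as a lookup from the ordered facets a user clicked to a task template. The template table and the function `task_from_clicks` are hypothetical names introduced for this example only.

```python
# Hypothetical task templates keyed by the ordered facet types a user clicked.
TASK_TEMPLATES = {
    ("Ingredient", "Dish"):
        "I want to make a {Dish} with {Ingredient} that I have lying around",
}

def task_from_clicks(clicks):
    """Map an ordered list of (facet, label) selections to a task sentence.

    Returns None when no template is registered for that facet sequence.
    """
    facet_sequence = tuple(facet for facet, _ in clicks)
    template = TASK_TEMPLATES.get(facet_sequence)
    if template is None:
        return None
    return template.format(**dict(clicks))

task = task_from_clicks([("Ingredient", "Avocado"), ("Dish", "Salad")])
print(task)
```

An interface could offer such sentences directly, letting users pick a task instead of navigating the facets one click at a time.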

Summary
Flexible application of hierarchical faceted metadata (HFM) is a proven approach for navigating large information collections.
–Midway in complexity between simple hierarchies and deep knowledge representation.
–Perhaps HFM is a good stepping stone to deeper semantic relations
–Currently in use on e-commerce sites; spreading to other domains

Opportunities
Creating hierarchical faceted categories
–Assigning items to those categories
–Adaptively adding new facets as data changes
A new approach to personalization:
–User-tailored facet combinations
Create task-based search interfaces
–Equate a task with a sequence of facet types
Make use of folksonomy data!

Acknowledgements
Flamenco team
–Brycen Chun
–Ame Elliott
–Jennifer English
–Kevin Li
–Rashmi Sinha
–Emilia Stoica
–Kirsten Swearingen
–Ping Yee
Thanks also to NSF (IIS )

Thank you!
Marti Hearst, UC Berkeley School of Information
This Research Supported by NSF IIS