How can Search Interfaces Enhance the Value of Semantic Annotations (and Vice Versa?) Keynote Talk ESAIR’13: Sixth International Workshop on Exploiting.

Slides:



Advertisements
Similar presentations
Federated Searching Feedback From Usability Testing & Focus Groups Sarah C. Williams Angela Bonnell Milner Library, Illinois State University.
Advertisements

Microsoft® Office Word 2007 Training
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Information retrieval mon jan data…. framework for today’s lecture…
Faceted Metadata for Information Architecture and Search CHI 2007 Course Notes Session I Marti Hearst, School of Information, UC Berkeley Preston Smalley.
1 Using Words to Search a Thousand Images Hierarchical Faceted Metadata in Search & Browsing Marti Hearst SIMS, UC Berkeley Research funded by: NSF CAREER.
Innovation in Search? Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Semi-Automated Creation of Facet Hierarchies Marti Hearst School of Information, UC Berkeley Joint work with Dr. Emilia Stoica.
Castanet: Using WordNet to Build Facet Hierarchies Emilia Stoica and Marti Hearst School of Information, Berkeley.
Measuring Information Architecture CHI 01 Panel Position Statement Marti Hearst UC Berkeley.
Algorithms and Problem Solving-1 Algorithms and Problem Solving.
Faceted Metadata for Site Navigation and Search Marti Hearst 12/17/2009.
Social Tagging and Search Marti Hearst UC Berkeley.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Algorithms and Problem Solving. Learn about problem solving skills Explore the algorithmic approach for problem solving Learn about algorithm development.
Module 10b: Wrapup IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Faceted Metadata for Information Architecture and Search CHI Course - April 24, 2006 Session I Marti Hearst, School of Information, UC Berkeley Preston.
Nearly-Automated Metadata Hierarchy Creation Emilia Stoica and Marti Hearst SIMS University of California, Berkeley.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Measuring Information Architecture Marti Hearst UC Berkeley.
Qualitative Data Analysis Systems
Measuring Information Architecture Marti Hearst UC Berkeley.
Semi-Automated Creation of Facet Hierarchies Marti Hearst School of Information, UC Berkeley Joint work with Dr. Emilia Stoica.
A metadata-based approach Marti Hearst Associate Professor BT Visit August 18, 2005.
Yahoo Visit Day Joint Reseach Opportunities Marti Hearst UC Berkeley School of Information.
Best Practices for Search for the Federal Government Marti Hearst Web Manager University November 10, 2009.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Confidential © 2005 eBay Inc. All rights reserved. eBay and the eBay logo are registered trademarks of eBay Inc. Faceted Metadata for Information Architecture.
Faceted Metadata for Information Architecture and Search Marti Hearst, SIMS at UC Berkeley Preston Smalley & Corey Chandler, eBay User Experience & Design.
UIs for Faceted Navigation Recent Advances and Remaining Open Problems HCIR’08 Marti Hearst, UC Berkeley (including some slides from Corey Chandler of.
Measuring Information Architecture Marti Hearst UC Berkeley.
Incorporating Metadata into Search UIs Marti Hearst UC Berkeley.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Transforming Tags to (Faceted) Tagsonomies Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Considering a Faceted Search-based Model Marti Hearst UCB SIMS NAS CSTB DNS Meeting on Internet Navigation and the Domain Name.
Knowledge organisation and information architecture, Nils Pharo Knowledge organisation and the Web Nils Pharo, 6th November 2002.
Faceted Metadata for Information Architecture and Search CHI Course - April 24, 2006 Session I Marti Hearst, School of Information, UC Berkeley Preston.
Information Architecture Donna Maurer Usability Specialist.
Library 10 – Information Competency Search Engines.
Accessible Word Document Training Microsoft Word 2010.
Information retrieval thur jan data…. framework for today’s lecture…
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Information Retrieval – and projects we have done. Group Members: Aditya Tiwari ( ) Harshit Mittal ( ) Rohit Kumar Saraf ( ) Vinay.
3.02 The Information Superhighway
Lesson 12 — The Internet and Research
Personalization of the Digital Library Experience: Progress and Prospects Nicholas J. Belkin Rutgers University, USA
Information retrieval wed sept data…. -start at 6.45.
More on “The Huddersfield Method” A lightweight, pattern-driven method based on SSM, Domain Driven Design and Naked Objects.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Chapter 4c, Database H Definition H Structure H Parts H Types.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Web Directories: Group 5 Jack Baker Laura Bingham Morgan Stewart.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 3 The Relational Database Model.
Do This file can be found at
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
1 CS 430 Database Theory Winter 2005 Lecture 3: A Fifty Minute Introduction to Data Modeling.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Education 499-R01 Search Basics.
NLP Support for Faceted Navigation in Scholarly Collections
Evaluating Adaptive Authoring of AH
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Document Clustering Matt Hughes.
Algorithms and Problem Solving
Presentation transcript:

How can Search Interfaces Enhance the Value of Semantic Annotations (and Vice Versa?) Keynote Talk ESAIR’13: Sixth International Workshop on Exploiting Semantic Annotations in Information Retrieval Marti Hearst, UC Berkeley October 28, 2013

2 Talk Structure  Four Interlocking Points about State-of-the-Art  Together: Roadmap for a New Direction:  Richer, usable search that makes use of semantic information in a realistic way.

3 Four Interlocking Points  Faceted Navigation solves a Search UI problem  Now users are ready for the next evolution  Auto-suggest is a good Search UI paradigm  Is currently limited by data/tasks applied to  Behavior Log Analysis has made great strides  Results are not being fully utilized in Search UIs  Current Knowledge Base use focuses on head queries  The real value will come in addressing harder queries

4 Faceted Navigation Solves a UI Problem  Used in thousands of websites and tools now  Successful because:  Conforms to user expectations, even if they don’t fully understand the underlying model  Helps navigate better than keyword search alone  Reduces likelihood of empty results  Suggests new ways to go  The category structure is logical, understandable

5  Inflexible  Force the user to start with a particular category  What if I don ’ t know the animal ’ s diet, but the interface makes me start with that category?  Wasteful  Have to repeat combinations of categories  Makes for extra clicking and extra coding  Difficult to modify  To add a new category type, must duplicate it everywhere or change things everywhere The Problem with Hierarchy

6 The Problem With Hierarchy start furscalesfeathers swimflyrun slither furscalesfeathersfurscalesfeathers fish rodents insects fish rodents insects fish rodents insects fish rodents insects fish rodents insects fish rodents insects fish rodents insects fish rodents insects fish rodents insects salmonbatrobinwolf …

7 The Idea of Facets  Facets are a way of labeling data  A kind of Metadata (data about data)  Can be thought of as properties of items  Facets vs. Categories  Items are placed INTO a category system  Multiple facet labels are ASSIGNED TO items

8 What are facets?  Sets of categories, each of which describe a different aspect of the objects in the collection.  Each of these can be hierarchical.  (Not necessarily mutually exclusive nor exhaustive, but often that is a goal.) Time/DateTopicRoleGeoRegion 

9 The Idea of Facets  Create INDEPENDENT categories (facets)  Each facet has labels (sometimes arranged in a hierarchy)  Assign labels from the facets to every item  Example: recipe collection Course Main Course Cooking Method Stir-fry Cuisine Thai Ingredient Bell Pepper Curry Chicken

10 The Idea of Facets  Break out all the important concepts into their own facets  Sometimes the facets are hierarchical  Assign labels to items from any level of the hierarchy Preparation Method Fry Saute Boil Bake Broil Freeze Desserts Cakes Cookies Dairy Ice Cream Sorbet Flan Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple

11 Using Facets  Now there are multiple ways to get to each item Preparation Method Fry Saute Boil Bake Broil Freeze Desserts Cakes Cookies Dairy Ice Cream Sherbet Flan Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple Fruit > Pineapple Dessert > Cake Preparation > Bake Dessert > Dairy > Sherbet Fruit > Berries > Strawberries Preparation > Freeze

12 Using Facets  The system only shows the labels that correspond to the current set of items  Start with all items and all facets  The user then selects a label within a facet  This reduces the set of items (only those that have been assigned to the subcategory label are displayed)  This also eliminates some subcategories from the view.

13 Advantages of Facets  Can ’ t end up with empty results sets  (except with keyword search)  Helps avoid feelings of being lost.  Easier to explore the collection.  Helps users infer what kinds of things are in the collection.  Evokes a feeling of “ browsing the shelves ”  Is preferred over standard search for collection browsing in usability studies.  (Interface must be designed properly)

14 Advantages of Facets  Seamless to add new facets and subcategories  Seamless to add new items.  Helps with “ categorization wars ”  Don ’ t have to agree exactly where to place something  Interaction can be implemented using a standard relational database.  May be easier for automatic categorization

15 Facets vs. Hierarchy  Early Flamenco studies compared allowing multiple hierarchical facets vs. just one facet.  Multiple facets was preferred and more successful.

16 Evolution of IA at eBay Flat Structure (2000 and earlier) Clothing, Shoes & Accessories  Shoes Women ’ s Shoes - Boots - Pumps - Sandals

17 Evolution of IA at eBay Issues with approach:  Products had to be categorized in just one way. Ex: Where are all the red Women ’ s shoes?  Adding more descriptors meant creating a deep and complicated category structure. Ex: Shoes > Women ’ s > Boots > Black > Size 8 Flat Structure (2000 and earlier) Clothing, Shoes & Accessories  Shoes Women ’ s Shoes - Boots - Pumps - Sandals

18 + Product Facets (2001 – 2005) Clothing, Shoes & Accessories  Shoes  Women ’ s Shoes - Style (Boots, Pumps, Sandals…) - Size (6, 6.5, 7, 7.5…) - Color (Black, Red, Tan…) - Condition (New, Used…) Evolution of IA at eBay Added Facets (flat)

19 Evolution of IA at eBay Issues with approach:  Encourages over-constrained queries (Values “ ANDED ” together)  Placing facets behind dropdowns reduces the exposure of the values to the user  Left-Navigation Placement is only used a minority of the time by users  While effective within a product domain their still is a need for facets above that level Ex: Everything Coach makes that is Red. + Product Facets (2001 – 2005) Clothing, Shoes & Accessories  Shoes  Women ’ s Shoes - Style (Boots, Pumps, Sandals…) - Size (6, 6.5, 7, 7.5…) - Color (Black, Red, Tan…) - Condition (New, Used…)

20 Evolution of IA at eBay Faceted Metadata (May 2005 Magellan Test) Clothing, Shoes & Accessories  Shoes  Women ’ s Shoes - Style (Boots, Pumps, Sandals…) - Size (6, 6.5, 7, 7.5…) - Color (Black, Red, Tan…) - Condition (New, Used…) - Brand (Nine West, Coach…)  Brands  Coach  Louis Vuitton  Materials  Cotton  Leather Added Hierarchical Facets Moved to a Top Positioned Link Structure

21 Matching items

22 Matching items

23 Matching items

24 Matching items

25 Matching items

26 Matching items

27 Matching items

28 Matching items

29 Matching items Key new addition: query can contain terms that correspond to facets; system responds gracefully.

30 Matching items

31 Matching items

32 Matching items

33 Matching items

34 Matching items

35 Matching items

36 Queries consisting of facet terms Lesson: “ Parsing ” feels natural to users (and the text in the search box is not sacred) athletic shoes

37 Limitation of Faceted Metadata  Do not naturally capture MAIN THEMES  Facets do not show RELATIONS explicitly Aquamarine Red Orange Door Doorway Wall  Which color associated with which object? Photo by J. Hearst, jhearst.typepad.com

38 Motivation Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off. Nature Animal Mammal Horse Occupations Cowboy Clothing Hats Cowboy Hat Media Engraving Wood Eng. Location North America America

39 Motivation Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off. By using facets, what we are not capturing? The hat flew off; The bandana stayed on. The thong is part of the hat. The bandana is on the cowboy (not the horse). The saddle is on the horse (not the cowboy).

40 Hierarchical Faceted Metadata  A simplification of knowledge representation  Does not represent relationships directly  BUT can be understood well by many people when browsing rich collections of information.

41 Linking Metadata Into Tasks  Old Yahoo restaurant guide combined:  Region  Topic (restaurants)  Related Information  Other attributes (cuisines)  Other topics related in place and time (movies)

42 Green: restaurants & attributes Red: related in place & time Yellow: geographic region

43 Other Possible Combinations  Region + A&E  City + Restaurant + Movies  City + Weather  City + Education: Schools  Restaurants + Schools  …

44 Creating Tasks from HFM  Recipes Example:  Click Ingredient > Avocado  Click Dish > Salad  Implies task of “I want to make a Dish type d with an Ingredient i that I have lying around”  Maybe users will prefer to select tasks like these over navigating through the metadata.

45 Taking Facets to the Next Level  Allow queries to express combinations of facets  Be smart about recognizing those already expressed  Go beyond just co-occurrence of nouns  Also allow relations between concepts  Be smart about recognizing information needs:  Tasks that are combinations of concepts  Complex relations among concepts.

46 Query Auto-suggest

47 Query auto-suggest

48 Auto-suggest at the Next Level

49 Taking Auto-Suggest to the Next Level  It’s gotten more conservative lately on the web  It can try to organize possible results  As seen in the earlier NextBio interface  Group by gene, compound, tissue  As seen in the art museum database  Group by exhibition, artwork, video  The trick is knowing which categories are relevant

50 Knowledge bases: popular head queries

51 Knowledge base: popular head queries

52 Knowledge Base: Relevant Information  What do people most want to know about Albert Einstein?  His theories of physics?  His brain?  His formulae?  His views on life and god and the dropping of the bomb.  What do you get?  His high school.  The names of his wife and kids.

53 Knowledge Base Results

54 This Can Be Better!  If the searcher doesn’t know what they want…  The tools should lead them to new discovery  If they do have some ideas  The tools should show good next paths  But get beyond faceted choices  While still not being chaotic and unpredictable

55 Query and Behavior Log Analysis  Long “long tail” queries  30.4% of the unique queries in the two-month period were not issued to any search engine in the previous 6 months  16.8% were both long (4 or more words) and unseen  “Mount Rainier’s scenic hiking trails”  Focuses on matching Open Directory categories  Only matches 1 in 5 times  Potentially would work better with richer representations Mining Past Query Trails to Label Long and Rare Search Engine Queries, Bailey, White, Liu, Kumaran, ACM TWEB 4(4) 2009

56 Association Rules on Queries  Goal: identify underperforming queries  Approach: use lexical and topical attributes and association rule mining  They do this to classify queries as underperforming  I suggest this as a way to determine what parts of the knowledge base to show for these queries Playing by the Rules: Mining Query Associations to Predict Search Performance, Kim, Hassan, White, Wang, WSDM 2013.

57 Review of the Main Points  Faceted Navigation solves a Search UI problem  Now users are ready for the next evolution  Auto-suggest is a good Search UI paradigm  Is currently limited by data/tasks applied to  Behavior Log Analysis has made great strides  Results are not being fully utilized in Search UIs  Current Knowledge Base use focuses on head queries  The real value will come in addressing harder queries

58 Putting It All Together  KBs can improve harder queries. Here’s how:  Create Search UIs that:  Encourage longer queries by:  Matching different meaning components  Like faceted navigation did, but more richly,  Based on behavior log analysis,  Match concepts against with knowledge base relations, and  Scaffold with auto-suggest interface to help get it right.

59 Envisionment

60 Motivation Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off. By using facets, what we are not capturing? The hat flew off; The bandana stayed on. The thong is part of the hat. The bandana is on the cowboy (not the horse). The saddle is on the horse (not the cowboy).

How can Search Interfaces Enhance the Value of Semantic Annotations (and Vice Versa?) Keynote Talk ESAIR’13: Sixth International Workshop on Exploiting Semantic Annotations in Information Retrieval Marti Hearst, UC Berkeley October 28, 2013