Dynamic Facet Ordering for Faceted Product Search Engines*

Slides:



Advertisements
Similar presentations
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
RCQ-ACS: RDF Chain Query Optimization Using an Ant Colony System WI 2012 Alexander Hogenboom Erasmus University Rotterdam Ewout Niewenhuijse.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Connecting Customer Relationship Management Systems to Social Networks 7th International Conference on Knowledge Management, Services, and Cloud Computing.
RCQ-GA: RDF Chain Query Optimization using Genetic Algorithms BNAIC 2009 Alexander Hogenboom, Viorel Milea, Flavius Frasincar, and Uzay Kaymak Erasmus.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
A machine learning approach to improve precision for navigational queries in a Web information retrieval system Reiner Kraft
Academic Advisor: Prof. Ronen Brafman Team Members: Ran Isenberg Mirit Markovich Noa Aharon Alon Furman.
Automatically Annotating Web Pages Using Google Rich Snippets 11th Dutch-Belgian Information Retrieval Workshop (DIR 2011) February 4, 2011 Frederik Hogenboom.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Interface for the University Library Catalogue Implementing Direct Manipulation Proposal 4.
Federated Search of Text Search Engines in Uncooperative Environments Luo Si Language Technology Institute School of Computer Science Carnegie Mellon University.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Sentiment Analysis with a Multilingual Pipeline 12th International Conference on Web Information System Engineering (WISE 2011) October 13, 2011 Daniëlla.
Evaluation IMD07101: Introduction to Human Computer Interaction Brian Davison 2010/11.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Supporting the Automatic Construction of Entity Aware Search Engines Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti Dipartimento di Informatica.
Learning with large datasets Machine Learning Large scale machine learning.
06 - Boundary Models Overview Edge Tracking Active Contours Conclusion.
VTT-STUK assessment method for safety evaluation of safety-critical computer based systems - application in BE-SECBS project.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Ontology Updating Driven by Events Dutch-Belgian Database Day 2012 (DBDBD 2012) November 21, 2012 Frederik Hogenboom Jordy Sangers.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
1 Web Search Personalization via Social Bookmarking and Tagging Michael G. Noll & Christoph Meinel Hasso-Plattner-Institut an der Universit¨at Potsdam,
1 Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007)
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Querying Structured Text in an XML Database By Xuemei Luo.
25/03/2003CSCI 6405 Zheyuan Yu1 Finding Unexpected Information Taken from the paper : “Discovering Unexpected Information from your Competitor’s Web Sites”
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
A fast algorithm for the generalized k- keyword proximity problem given keyword offsets Sung-Ryul Kim, Inbok Lee, Kunsoo Park Information Processing Letters,
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin & Lawrence Page Presented by: Siddharth Sriram & Joseph Xavier Department of Electrical.
CS2003 Usability Engineering Human-Centred Design Dr Steve Love.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Erasmus University Rotterdam Introduction Content-based news recommendation is traditionally performed using the cosine similarity and TF-IDF weighting.
Lexico-semantic Patterns for Information Extraction from Text The International Conference on Operations Research 2013 (OR 2013) Frederik Hogenboom
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
1 Random Walks on the Click Graph Nick Craswell and Martin Szummer Microsoft Research Cambridge SIGIR 2007.
ASSOCIATIVE BROWSING Evaluating 1 Jin Y. Kim / W. Bruce Croft / David Smith by Simulation.
Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.
Stored Procedures for Web Search Engine Dejan Sunderic
Efficient Similarity Search : Arbitrary Similarity Measures, Arbitrary Composition Date: 2011/10/31 Source: Dustin Lange et. al (CIKM’11) Speaker:Chiang,guang-ting.
What this activity will show you
MIS2502: Data Analytics Advanced Analytics - Introduction
Simultaneous Support for Finding and Re-Finding
Erasmus University Rotterdam
IST 516 Fall 2011 Dongwon Lee, Ph.D.
Numerical Measures: Centrality and Variability
Data Mining (and machine learning)
CS & CS Capstone Project & Software Development Project
Efficient Distribution-based Feature Search in Multi-field Datasets Ohio State University (Shen) Problem: How to efficiently search for distribution-based.
Martin Rajman, EPFL Switzerland & Martin Vesely, CERN Switzerland
6.2 Grid Search of Chi-Square Space
Learning Literature Search Models from Citation Behavior
CS246: Information Retrieval
Retrieval Performance Evaluation - Measures
Software Agent.
Project title Company name.
Presentation transcript:

Dynamic Facet Ordering for Faceted Product Search Engines* Introduction Figure 3 shows the steps taken to compute the property score (one of the steps in Figure 2). If the property is a qualitative property, our approach computes the disjoint facet count for this property. Using the disjoint facet count a temporary property score (as a measure of specificity) is computed based on the Gini impurity measure. If the property is a quantitative property, in order to exploit the ratio scale of this data, we compute the Gini coefficient, which represents the temporary property score (as a measure of dispersion). The temporary scores are weighted based on the property popularity among products to favour properties that have facets spread among many products and to better cope with missing facets in the dataset. At the end, the properties are sorted in the descending order of the property scores. The facets associated to a qualitative property are sorted in the descending order of the disjoint facet counts. Many of the current Web shops that use faceted search have a manual, expert-based selection procedure for displaying product facets. Figure 1 shows an example of displaying such facets for Amazon.com. However, such approaches fail to consider that after each query the resulting set of products changes, and thus the previously fixed ordering of facets might not be that relevant any more. Facet Ordering Algorithm Figure 2 shows the main flow in a user search session in our approach. The user may not know the name of the product (s)he is looking for, but knows the characteristics of the desired product. At the beginning, the user is presented with the complete set of products available in the Web shop. Our solution computes the property scores and the facet scores in parallel, after which the user is presented an ordered list of properties and, per property, an ordered list of facets. If the result set is too large, the user performs a drill-down (by selecting additional properties and their corresponding facets) and thus the query is updated triggering a new search iteration. If the result set is small enough, the user scans the returned products. If the desired product is not found, the user performs a roll-up (deselecting properties or selecting additional facets) and thus the query is updated again, triggering a new iteration. On the other hand, if the product is found, the search session ends. Evaluation For our experiments, we have gathered data from Tweakers Pricewatch. The complete catalogue contains 794 mobile phones, 53 properties, and 1,816 facets, from which 348 are qualitative and 1,468 are quantitative. We performed a large-scale simulation study based on 3 drill-down models (Least Scanning, Best Facet, and Combined Model), 4 ordering schemes (Expert-based, Greedy Count, Kim et al. (2014), and our approach), 794 target products, and 50 repetitions for the Combined Model. Our approach has the lowest number of clicks, has the lowest number of roll-ups, and has the highest percentage of successful sessions. In addition to the simulation study, we also performed a user experiment with 27 real users that were given 10 tasks to find products that match given product descriptions. Each user performed half of the experiments with our system and half of the experiments with the expert-based fixed facets system. Again the experiments show that the users need less clicks to find the desired product using our system. *Damir Vandic, Steven S. Aanen, Flavius Frasincar, and Uzay Kaymak, “Dynamic Facet Ordering for Faceted Product Search Engines,” IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 5, pp. 1004–1016, 2017. Figure 1. Fixed facet search in Amazon.com. Figure 3. Activity diagram describing the property score computation. Figure 2. Activity diagram describing the main flow of a search session. Contact Flavius Frasincar Postal: Erasmus University Rotterdam P.O. Box 1738 NL-3000 DR Rotterdam The Netherlands E-Mail: frasincar@ese.eur.nl Web: https://personal.eur.nl/frasincar/ Damir Vandic, Steven S. Aanen, Flavius Frasincar, and Uzay Kaymak