On Enhancing the User Experience in Web Search Engines Franco Maria Nardini.

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

CONTRIBUTIONS Ground-truth dataset Simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser) Implicit.
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Chapter 5: Introduction to Information Retrieval
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Center for E-Business Technology Seoul National University Seoul, Korea Socially Filtered Web Search: An approach using social bookmarking tags to personalize.
Evaluating Search Engine
Information Retrieval in Practice
Search Engines and Information Retrieval
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
Information Retrieval in Practice
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
1 Information Retrieval and Web Search Introduction.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
Web Logs and Question Answering Richard Sutcliffe 1, Udo Kruschwitz 2, Thomas Mandl University of Limerick, Ireland 2 - University of Essex, UK 3.
1 CS 430: Information Discovery Lecture 20 The User in the Loop.
Basic IR Concepts & Techniques ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Web Search – Summer Term 2006 II. Information Retrieval (Basics Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Overview of Search Engines
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 30, (2014) BERLIN CHEN, YI-WEN CHEN, KUAN-YU CHEN, HSIN-MIN WANG2 AND KUEN-TYNG YU Department of Computer.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
TREC 2009 Review Lanbo Zhang. 7 tracks Web track Relevance Feedback track (RF) Entity track Blog track Legal track Million Query track (MQ) Chemical IR.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.
Lecture 2 Jan 13, 2010 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Personalized Search Xiao Liu
Authors: Rosario Sotomayor, Joe Carthy and John Dunnion Speaker: Rosario Sotomayor Intelligent Information Retrieval Group (IIRG) UCD School of Computer.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Search Engine Architecture
Lecture 2 Jan 15, 2008 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
 hd.jpg hd.jpg Information Retrieval and Interaction.
IR Theory: Relevance Feedback. Relevance Feedback: Example  Initial Results Search Engine2.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Ben Carterette Paul Clough Evangelos Kanoulas Mark Sanderson.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Personalizing Web Search using Long Term Browsing History Nicolaas Matthijs, Cambridge Filip Radlinski, Microsoft In Proceedings of WSDM
Lucene. Lucene A open source set of Java Classses ◦ Search Engine/Document Classifier/Indexer 
Query Suggestion. n A variety of automatic or semi-automatic query suggestion techniques have been developed  Goal is to improve effectiveness by matching.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Post-Ranking query suggestion by diversifying search Chao Wang.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
UOS Personalized Search Zhang Tao 장도. Zhang Tao Data Mining Contents Overview 1 The Outride Approach 2 The outride Personalized Search System 3 Testing.
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Potential for Personalization Transactions on Computer-Human Interaction, 17(1), March 2010 Data Mining for Understanding User Needs Jaime Teevan, Susan.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Information Retrieval in Practice
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Evaluation Anisio Lacerda.
Information Retrieval (in Practice)
Search Engine Architecture
SIS: A system for Personal Information Retrieval and Re-Use
Information Retrieval on the World Wide Web
CS246: Leveraging User Feedback
Information Retrieval and Web Design
Presentation transcript:

On Enhancing the User Experience in Web Search Engines Franco Maria Nardini

About Me I joined the HPC Lab in 2006 – Master Thesis Ph.D. in 2011, University of Pisa – Thesis: “Query Log Mining to Enhance User Experience in Search Engines” mail: web: skype: francomaria.nardini

Query Suggestion with Daniele Broccolo, Lorenzo Marcon Raffaele Perego, Fabrizio Silvestri

Our Contribution: Search Shortcuts

Search Shortcuts: – It uses the “happy ending” stories in the query log to help new users; Efficient: – All the “stuff” is stored on a inverted index: retrieval problem; Effective: (head, torso, tail) – New evaluation methodology confirming this evidencies: TREC Diversity Track. Daniele Broccolo, Lorenzo Marcon, Franco Maria Nardini, Fabrizio Silvestri, Raffaele Perego, Generating Suggestions for Queries in the Long Tail with an Inverted Index, IP&M, 2011.

Some Results

What’s Next?! Why not to use Machine Learning? – Machine learning is helping a lot in the IR community; – Better and “fine-graned” ranking as it could take into account important signals that are not fully- exploited nowadays; – It may helps in filtering redundant suggestions and choosing the “best” expressive ones (for each intent). under exploration with Marcin Sydow (PJIIT), Raffaele Perego, Fabrizio Silvestri

Signals Which signals we would like to capture? – Relevance to the given query; – Diversity with respect to a subtopic list; – Serendipity of suggestions; – Novelty with respect to news/trends on Twitter; How do we catch them? How do we combine them? The “training” set is a problem.

Query Suggestion: Ranking A two-step architecture – First step to produce a list of candidates; – Second step as a ML architecture composed of two different (cascade) stages of ranking: First round to rank suggestions w.r.t. the query; Second round to understand “diversity”.

Diversification of Web Search Engine Results with Gabriele Capannini, Raffaele Perego, Fabrizio Silvestri

Our Contribution We design a method for efficiently diversify results from Web search engines. – Same effectiveness of other state-of-the-art approaches; – Extremely fast in doing the “hard” work; Intents behind “ambiguous” queries are mined from query logs; Capannini G., Nardini F.M., Silvestri F., Perego R., A Search Architecture Enabling Efficient Diversification of Search Results, Proc. DDR Workshop Capannini G., Nardini F.M., Silvestri F., Perego R., Efficient Diversification of Web Search Results. Proceedings of VLDB 2011 (PVLDB), Volume 4, Issue 7.

Our Contribution

Some Results

What’s Next? A modern ranking architecture: – Effective: Users should be happy of the results they receive; – Efficient: Low response times (< 0.1 s); – Easy to adapt: Continuous crawling from the Web; Continuous users’ feedback; with Berkant Barla Cambazoglu (Yahoo! Barcelona), Gabriele Capannini, Raffaele Perego, Fabrizio Silvestri

Let’s Plug All Together BM25 Scorer 1 … … Scorer n Query Index Second Phase First Phase Results Scorer div SS A way for efficiently diversifying “ambiguous” queries; SS teaches how to “diversify” the current user query; Scorer div computes the diversity “signal” of each document and rerank the final results list; Possible intents behind the query

Retrieval over Query Sessions with M-Dyaa AlBakour (University of Glasgow)

Main Goals Question 1) – Can Web search engines improve their performance by using previous user interactions? (including previous queries, clicks on ranked results, dwell times, etc.) Question 2) – How do we evaluate system performance over an entire query session instead of a single query?

TREC Session Track Two editions of the challenge: 2010, 2011 – query, previous queries; – urls + docs, urls + docs + dwell time; – Two different evaluations: last subtop., all subtop. “Query expansion” with Search Shortcuts: – weighted by means of user interaction data; – “history-based” recommendation; Follow-up with tuning of the parameters. Ibrahim Adeyanju, Franco Maria Nardini, M-Dyaa Albakour, Dawei Song, Udo Kruschwitz, RGU-ISTI-Essex at TREC 2011 Session Track, TREC Conference, Franco Maria Nardini, M-Dyaa Albakour, Ibrahim Adeyanju, Udo Kruschwitz, Studying Search Shortcuts in a Query Log to Improve Retrieval Over Query Sessions, SIR 2012 in conjunction with ECIR 2012.

Some Results What’s Next? Entity-based representation of the user session. – to reduce the “sparsity” of the space.

Challenges How those systems really affect (and modify) the behavior of the user? – Is it possible to quantify it? (metrics?) – What do we need to observe? Toward the “perfect result page”: – accurate models for blending different sources of results.

Little Announcement Models and Techniques for Tourist Facilities Evaluation and Test Collections User Interaction and Interfaces Paper Deadline 06/25/2012

Questions!?!