Computational User Intent Modeling

Slides:



Advertisements
Similar presentations
L3S Research Center University of Hanover Germany
Advertisements

Struggling or Exploring? Disambiguating Long Search Sessions
A Two-Dimensional Click Model for Query Auto-Completion Yanen Li 1, Anlei Dong 2, Hongning Wang 1, Hongbo Deng 2, Yi Chang 2, ChengXiang Zhai 1 1 University.
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
SIGIR 2013 Recap September 25, 2013.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Evaluating Search Engine
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Adapting Deep RankNet for Personalized Search
In Situ Evaluation of Entity Ranking and Opinion Summarization using Kavita Ganesan & ChengXiang Zhai University of Urbana Champaign
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Modern Retrieval Evaluations Hongning Wang
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Fan Guo 1, Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
25/03/2003CSCI 6405 Zheyuan Yu1 Finding Unexpected Information Taken from the paper : “Discovering Unexpected Information from your Competitor’s Web Sites”
Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
1 Web-Page Summarization Using Clickthrough Data* JianTao Sun, Yuchang Lu Dept. of Computer Science TsingHua University Beijing , China Dou Shen,
MODEL ADAPTATION FOR PERSONALIZED OPINION ANALYSIS MOHAMMAD AL BONI KEIRA ZHOU.
Algorithmic Detection of Semantic Similarity WWW 2005.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
1/14/ :59 PM1/14/ :59 PM1/14/ :59 PM Research overview Koen Victor, 12/2007.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Finding the Right Facts in the Crowd: Factoid Question Answering over Social Media J. Bian, Y. Liu, E. Agichtein, and H. Zha ACM WWW, 2008.
A New Algorithm for Inferring User Search Goals with Feedback Sessions.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Beyond Ranking: Optimizing Whole-Page Presentation
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
Introduction to Machine Learning, its potential usage in network area,
Evaluation Anisio Lacerda.
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting.
Evaluation of IR Systems
An Empirical Study of Learning to Rank for Entity Search
Personalized Social Image Recommendation
Content-Aware Click Modeling
Learning to Rank Shubhra kanti karmaker (Santu)
Lecture 24: NER & Entity Linking
The Four Dimensions of Search Engine Quality
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Author: Kazunari Sugiyama, etc. (WWW2004)
ISWC 2013 Entity Recommendations in Web Search
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz
March 2019 Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs) Submission Title: [Security vs. Sequence Length Considerations]
Retrieval Evaluation - Measures
Efficient Multiple-Click Models in Web Search
Retrieval Performance Evaluation - Measures
Communication Driven Remapping of Processing Element (PE) in Fault-tolerant NoC-based MPSoCs Chia-Ling Chen, Yen-Hao Chen and TingTing Hwang Department.
GhostLink: Latent Network Inference for Influence-aware Recommendation
Presentation transcript:

Computational User Intent Modeling Hongning Wang March 6, 2013

Research Summary Joint relevance and freshness learning [WWW’12] Content-Aware Click Modeling [WWW’13] Cross-Session Search Task Extraction [WWW’13] 9/18/2018

Understanding User Intent is Important “Apple Company” @ Oct. 4, 2011 Release of iPhone 4S Let’s take the event happened on “apple company” last October as an example. When a user submit the query “Apple Company” into a news search engine, on Oct. 4, 2011, she was expecting to find a list of news reports covering the new release of Apple’s iPhone4S.

Understanding User Intent is Important “Apple Company” @ Oct. 5, 2011 Steve Jobs passed away However, the news talking about release of iPhone 4s became less relevant to the query just one day after, when Apple Inc.’s former CEO Steve Jobs passed away on Oct. 5, 2011. Such case is quite common in news search, and urges us to give seriously consider and model freshness and the trade-off between relevance and freshness in news search scenario. Release of iPhone 4S

Relevance v.s. Freshness Topical relatedness Metric: tf*idf, BM25, Language Model Freshness Temporal closeness Metric: age, elapsed time Trade-off Query specific To meet user’s information need In traditional web search, relevance, is narrowly defined as topical relatedness, where various ranking features have been proposed to capture such relevance, including tf*idf, BM25 and language modeling approaches. While in news search, the increased importance of freshness is one distinctive characteristic. The emphasis on freshness is crucial for news search, as even the seemingly current event can quickly become outdated, dwarfed by the importance of new developments. In addition, the relative emphasis, or trade-off, between these two aspects are highly specific to the query and time when the query was issued. And such trade-off reflects the end-user’s underlying information need.

Joint Relevance and Freshness Learning Our Contribution Joint Relevance and Freshness Learning JRFL: (Relevance, Freshness) -> Click Query => trade-off URL => relevance/freshness Click => overall impression Inspired by this example, we proposed to model the users' click behavior in news search as a direct consequence of examining the relevance and freshness aspects of the returned documents. And we name our method as Joint Relevance and Freshness Learning, or JRFL in short. We can abstract the user’s decision process discussed in the last slide by this figure. In addition to give a good estimate of a candidate URL’s relevance and freshness aspects, a good ranking function should be able to infer the trade-off and return the optimally combined ranking results for individual queries as well, because the relative emphasis the users put over these two aspects can vary substantially, reflecting the searching intention for the specific news event. However, we cannot explicitly obtain the users' relevance/freshness judgments and the preferences over these two aspects, since their evaluation process is not directly observable from the search engine. Fortunately, the users' click patterns are recorded, from which we can assume the clicked documents are more meaningful to her than the non-clicked ones. Therefore, we model relevance and freshness as two latent factors and assume a linear combination of these two, which is also latent, generates the observed click preferences.

Quantitative Comparison Ranking performance Random bucket clicks To validate the effectiveness of the proposed JRFL model in real news search tasks, we quantitatively compare it with all our baseline methods on: random bucket clicks, normal clicks, and editorial judgments. All the click-based learning algorithms are trained on all 150K click preferences. Since all these models have several parameters to be tuned, we report their best performance on the corresponding testing set according to MAP@4 metric in the following results and perform t-test to validate the significance of improvement (against the second best performance accordingly). On this unbiased random bucket clicks, we can find the proposed JRFL model achieves encouraging improvement over the second best GBRank model, especially on MAP@4 the relative improvement is over 5.88%. This improvement confirms that properly integrating relevance and freshness can indeed improve the user's search satisfaction.

Content-Aware Click Modeling Study the underlying mechanism of user clicks Freshness weight=0.8 R=0.39 F=2.34 Y=1.95 R=1.72 F=2.18 Y=2.01 R=2.41 F=1.76 Y=2.09

Modeling User Clicks Match my query? Redundant doc? Shall I move on?

Content-Aware Click Modeling Our Contribution Content-Aware Click Modeling Encode rich dependency within user browsing behaviors via descriptive features Chance to further examine the result documents: e.g., position, # clicks, distance to last click Chance to click on an examined and relevant document: e.g., clicked/skipped content similarity Relevance quality of a document: e.g., ranking features

Experimental Results Take advantage of both counting-based and feature-based methods

Learning to Extract Search Tasks An atomic information need that may result in one or more queries 5/29/2012 S1 5/29/2012 5:26 bank of america 5/29/2012 S2 5/29/2012 11:11 macy's sale 5/29/2012 11:12 sas shoes 5/30/2012 S1 5/30/2012 10:19 credit union 5/30/2012 S2 5/30/2012 12:25 6pm.com 5/30/2012 12:49 coupon for 6pm shoes 5/29/2012 S1 5/29/2012 5:26 bank of america 5/29/2012 S2 5/29/2012 11:11 macy's sale 5/29/2012 11:12 sas shoes 5/30/2012 S1 5/30/2012 10:19 credit union 5/30/2012 S2 5/30/2012 12:25 6pm.com 5/30/2012 12:49 coupon for 6pm shoes

Solution Heuristic constraints Structural knowledge Semi-supervised Our Contribution Solution Heuristic constraints Structural knowledge Identical queries Sub-queries Identical clicked URLs Same task => tasks sharing related queries Latent Semi-supervised Structural Learning

Semi-supervised Structural Learning Our Contribution Semi-supervised Structural Learning Structural inference Hierarchical clustering on best links Flexibility Exact inference exists

Experimental Results

plausible explanation of task structure 1 il volo singing tous les visages de l'amour 1.1 french version of album by il volo 1.1.1 french version of album by il volo 1.1.1.1 french version of album by il volo 2 amazon.com international sites 2.1 amazon.com international 3 pottery barn warehouse clearance sale 4 amazon.com phone number 4.1 amazon.com phone number 4.1.1 amazon customer service phone number 4.1.1.1 amazon customer service phone number 5 condo rentals in salter path, n.c. 6 piero barone's 19th birthday plans 6.1 piero barone family 6.1.1 piero barone family 6.2 piero barone's 19th birthday plans 6.2.1 +piero barone's 19th birthday plans 6.2.2 piero barone's 19th birthday plans 6.2.2.1 piero barone singing piove 6.2.2.1.1 piero barone singing piove

Publications Hongning Wang, Anlei Dong, Lihong Li, Yi Chang and Evgeniy Gabrilovich. Joint Relevance and Freshness Learning From Clickthroughs for News Search. The 2012 World Wide Web Conference (WWW'2012), p579-588. Hongning Wang, ChengXiang Zhai, Anlei Dong and Yi Chang. Content-Aware Click Modeling. The 23rd International World-Wide Web Conference (WWW'2013) (To Appear) Hongning Wang, Yang Song, Ming-Wei Chang, Xiaodong He, Ryen White and Wei Chu. Learning to Extract Cross-Session Search Tasks. The 23rd International World-Wide Web Conference (WWW'2013) (To Appear) Yang Song, Hao Ma, Hongning Wang and Kuansan Wang. Exploring and Exploiting User Search Behaviors on Mobile and Tablet Devices to Improve Search Relevance. The 23rd International World-Wide Web Conference (WWW'2013) (To Appear) Ryen White, Wei Chu, Ahmed Hassan, Xiaodong He, Yang Song and Hongning Wang. Enhancing Personalized Search by Mining and Modeling Task Behavior. The 23rd International World-Wide Web Conference (WWW'2013) (To Appear) Chi Wang, Hongning Wang, Jialu Liu, Ming Ji, Lu Su, Yuguo Chen, Jiawei Han. On the Detectability of Node Grouping in Networks. SIAM International Conference on Data Mining (SDM'2013) (To Appear) Hongbo Deng, Jiawei Han, Hao Li, Heng Ji, Hongning Wang and Yue Lu. Exploring and Inferring User-User Pseudo-Friendship for Sentiment Analysis with Heterogeneous Networks. SIAM International Conference on Data Mining (SDM'2013) (To Appear) Mianwei Zhou, Hongning Wang and Kevin Chen-Chuan Chang. Learning to Rank from Distant Supervision: Exploiting Noisy Redundancy for Relational Entity Search. The 29th IEEE International Conference on Data Engineering (ICDE'2013) Yue Lu, Hongning Wang, ChengXiang Zhai and Dan Roth. Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions. The 21st ACM International Conference on Information and Knowledge Management (CIKM'2012), p1642-1646.

Thank you! Q&A 9/18/2018

User’s Judgment on Relevance and Freshness v.s. Relevance User’s searching behavior Freshness weight=0.8 R=1.72 F=2.18 Y=2.01 R=2.41 F=1.76 Y=2.09 R=0.39 F=2.34 Y=1.95 Suppose, when a user submits a query to a news search engine and gets an according list of ranked news documents, she has a clear mind of her emphasis over relevance and freshness on the expected results. Then, she would sequencially judge the usefulness of each document by her underlining sense of relevance and freshness, and gives it an overall impression grade by her preference over relevance and freshness at that particular time. Once she has such impressions in mind, she would deliberately click the documents most interesting to her and skip all the others.

User Clicks Are Biased Position-bias Higher position More clicks Not necessarily relevant Modeling Clicks => Decompose relevance-driven clicks from position-driven clicks

Learning to Extract Search Tasks An atomic information need that may result in one or more queries An impression tѱ = 30 minutes