Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei.

Slides:



Advertisements
Similar presentations
Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
Advertisements

Learning to Suggest: A Machine Learning Framework for Ranking Query Suggestions Date: 2013/02/18 Author: Umut Ozertem, Olivier Chapelle, Pinar Donmez,
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
Date : 2014/06/10 Author :Shahab Kamali Frank Wm. Tompa Source : SIGIR’13 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng Retrieving Documents With Mathematical.
Term Level Search Result Diversification DATE : 2013/09/11 SOURCE : SIGIR’13 AUTHORS : VAN DANG, W. BRUCE CROFT ADVISOR : DR.JIA-LING, KOH SPEAKER : SHUN-CHEN,
Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
DQR : A Probabilistic Approach to Diversified Query recommendation Date: 2013/05/20 Author: Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Source:
Knowledge Base Completion via Search-Based Question Answering
Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date : 2013/09/03 Author : Xinfan Meng, Furu Wei, Xiaohua, Liu, Ming Zhou, Sujian Li and.
Date : 2013/09/17 Source : SIGIR’13 Authors : Zhu, Xingwei
Time-sensitive Personalized Query Auto-Completion
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search Date: 2014/5/20 Author: Karthik Raman, Paul N. Bennett, Kevyn Collins-Thompson.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
1 The Four Dimensions of Search Engine Quality Jan Pedersen Chief Scientist, Yahoo! Search 19 September 2005.
What’s The Difference??  Subject Directory  Search Engine  Deep Web Search.
Quality-Aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim, Aixin Sun, and Roger H. L. Chiang. In Proceedings.
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Date: 2012/10/18 Author: Makoto P. Kato, Tetsuya Sakai, Katsumi Tanaka Source: World Wide Web conference (WWW "12) Advisor: Jia-ling, Koh Speaker: Jiun.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
1 Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007)
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
南台科技大學 資訊工程系 A web page usage prediction scheme using sequence indexing and clustering techniques Adviser: Yu-Chiang Li Speaker: Gung-Shian Lin Date:2010/10/15.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Intent Subtopic Mining for Web Search Diversification Aymeric Damien, Min Zhang, Yiqun Liu, Shaoping Ma State Key Laboratory of Intelligent Technology.
ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM11) Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting Date:2011/06/20 1.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Text Based Information Retrieval Text Based Information Retrieval H02C8A H02C8B Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
1 Date: 2012/9/13 Source: Yang Song, Dengyong Zhou, Li-wei Heal(WSDM’12) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Query Suggestion by Constructing.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Post-Ranking query suggestion by diversifying search Chao Wang.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
Advisor: Koh Jia-Ling Nonhlanhla Shongwe EFFICIENT QUERY EXPANSION FOR ADVERTISEMENT SEARCH WANG.H, LIANG.Y, FU.L, XUE.G, YU.Y SIGIR’09.
Multi-Aspect Query Summarization by Composite Query Date: 2013/03/11 Author: Wei Song, Qing Yu, Zhiheng Xu, Ting Liu, Sheng Li, Ji-Rong Wen Source: SIGIR.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
E-commerce in Your Inbox Product Recommendations at Scale
QUERY-PERFORMANCE PREDICTION: SETTING THE EXPECTATIONS STRAIGHT Date : 2014/08/18 Author : Fiana Raiber, Oren Kurland Source : SIGIR’14 Advisor : Jia-ling.
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Click Through Rate Prediction for Local Search Results
Next Question Prediction
Summarizing answers in non-factoid community Question-answering
The Four Dimensions of Search Engine Quality
Mining Query Subtopics from Search Log Data
Beliefs and Biases in Web Search
Citation-based Extraction of Core Contents from Biomedical Articles
Struggling and Success in Web Search
Intent-Aware Semantic Query Annotation
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Connecting the Dots Between News Article
Ranking using Multiple Document Types in Desktop Search
Preference Based Evaluation Measures for Novelty and Diversity
Presentation transcript:

Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei Duan, Heung-Yeung Shum Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Sz-Han,Wang

OUTLINE INTRODUCTION METHOD USER INTENT MINING MODELS EXPERIMENT CONCLUSION

INTRODUCTION Community question answering How to leverage historical content to answer new queries? YAHOO!ANSWER Quora

INTRODUCTION Existing methods usually focus on long and syntactically structured queries. When searching CQA archives, users influenced by web search are used to issuing short queries. On many CQA sites, the search results are not satisfactory when an input query is short.

INTRODUCTION Goal: Improve search relevance for short queries in CQA question search. How to improving search relevance? Propose an intent-based language model by leveraging search intent that is mined from question descriptions in CQA archives, web query logs, and web search results.

OUTLINE INTRODUCTION METHOD USER INTENT MINING MODELS EXPERIMENT CONCLUSION

USER INTENT MINING Mining user intent from three different sources: (1) question descriptions in CQA archives (2) web search logs (3) the top search results from a commercial search engine

Intent Mining from CQA Archives Reveal an asker’s specific needs for a question Example: Question: Why do you love Baltimore? Description: Maryland、Charm City

Intent Mining from CQA Archives Extract intent from the descriptions: Predict user intent for short queries given a short query q relevance score rank terms by Pcqa(t|q) Get the intent word set W = {(t, ϕ)} from CQA archives Source-Question a b c d Target-Description e f P(e | a) P(e | b) P(e | c) P(e | d) P(f | a) P(f | b) P(f | c) P(f | d) Term to term translation model

Intent Mining from Query Log Conveys common preferences about the query Example: Query: Beijing Top intent : travel Most searchers of “Beijing” are interested in travel guides

Intent Mining from Query Log Extract intent from both the queries and URLs: given a query, collects queries that share the same suffix or prefix and aggregates the co-clicked URLs of these queries queries and URLs are clustered based on word overlap and the similarity of co-click patterns Merge the terms from queries and URLs to get the intent word set W = {(t, ϕ)} from query log

Intent Mining from Web Search Results Contain popular subtopics related to the query Example: Apple just announced the “iPhone 6” Query: iPhone Questions about the new product may be more attractive than those about “iPhone 5”

Intent Mining from Web Search Results Extract popular intent for short queries from the top search results Given a query Crawl the newest search results Parse URLs, titles, and snippets Form an intent candidate set Calculate the intent final score intent final score=𝛼BM25(t,h,H)+𝛽BM25(t,s,S)+𝛾BM25(t,u,U) Rank intent based on the final score Get the intent word set W = {(t, ϕ)} from web search results title URL snippet

MODELS Language Model for Information Retrieval Translation-based Language Model Translation-based Language Model plus answer language model

MODELS Intent-based Language Model intent from source i is Wi = {(tij, ϕij )}, 1≤i≤3, 1≤j≤N

OUTLINE INTRODUCTION METHOD USER INTENT MINING MODELS EXPERIMENT CONCLUSION

EXPERIMENT Data set Collected a one-year query log from a commercial search engine and randomly sampled 1,782 queries

EXPERIMENT For each sampled query, retrieved several candidate questions from the indexed data Recruited human judges to label the relevance of the candidate questions regarding the queries with one of four levels: “Excellent”, “Good”, “Fair”, and “Bad”.

EXPERIMENT Evaluation results on Yahoo data and Quora data

EXPERIMENT Evaluation results of different intent models

OUTLINE INTRODUCTION METHOD USER INTENT MINING MODELS EXPERIMENT CONCLUSION

CONCLUSION Propose an intent-based language model that takes advantage of both the state-of-the-art question retrieval models and the extra intent information mined from three data sources. The evaluation results show that with user intent prediction, our model can significantly improve state-of-the-art relevance models on question retrieval for short queries.