Personalized Query Classification (CIKM’09) Date: 2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen




Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.

Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
Personalized Query Classification Bin Cao, Qiang Yang, Derek Hao Hu, et al. Computer Science and Engineering Hong Kong UST.
Optimizing search engines using clickthrough data
Time-sensitive Personalized Query Auto-Completion
Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search Date: 2014/5/20 Author: Karthik Raman, Paul N. Bennett, Kevyn Collins-Thompson.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Evaluating Search Engine
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
COMP 630L Paper Presentation Javy Hoi Ying Lau. Selected Paper “A Large Scale Evaluation and Analysis of Personalized Search Strategies” By Zhicheng Dou,
1 Web Query Classification Query Classification Task: map queries to concepts Application: Paid advertisement Question: how do Baidu/Google make money?
Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.
Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date:
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Topics and Transitions: Investigation of User Search Behavior Xuehua Shen, Susan Dumais, Eric Horvitz.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Personalized Search Cheng Cheng (cc2999) Department of Computer Science Columbia University A Large Scale Evaluation and Analysis of Personalized Search.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM11) Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting Date:2011/06/20 1.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval Ben Cartrette and Praveen Chandar Dept. of Computer and Information Science.
LOGO Finding High-Quality Content in Social Media Eugene Agichtein, Carlos Castillo, Debora Donato, Aristides Gionis and Gilad Mishne (WSDM 2008) Advisor.
BioSnowball: Automated Population of Wikis (KDD ‘10) Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/11/30 1.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Ranking Clusters for Web Search Gianluca Demartini Paul–Alexandru Chirita Ingo Brunkhorst Wolfgang Nejdl L3S Info Lunch Hannover,
Improving Search Engines Using Human Computation Games (CIKM09) Date: 2011/3/28 Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting 1.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
LOGO Identifying the Influential Bloggers in a Community Nitin Agarwal, Huan Liu, Lei Tang and Philip S. Yu WSDM 2008 Advisor : Dr. Koh Jia-Ling Speaker.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Post-Ranking query suggestion by diversifying search Chao Wang.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
{ Adaptive Relevance Feedback in Information Retrieval Yuanhua Lv and ChengXiang Zhai (CIKM ‘09) Date: 2010/10/12 Advisor: Dr. Koh, Jia-Ling Speaker: Lin,
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
1 Learning to Impress in Sponsored Search Xin Supervisors: Prof. King and Prof. Lyu.
Automatic Categorization of Query Results Kaushik Chakrabarti, Surajit Chaudhuri, Seung-won Hwang Sushruth Puttaswamy.
Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.
Usefulness of Quality Click-through Data for Training Craig Macdonald, Iadh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
1 Clustering Web Queries John S. Whissell, Charles L.A. Clarke, Azin Ashkan CIKM ’ 09 Speaker: Hsin-Lan, Wang Date: 2010/08/31.
Click Through Rate Prediction for Local Search Results
Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei.
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Using Link Information to Enhance Web Page Classification
Presentation transcript:

Personalized Query Classification (CIKM’09) Date: 2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1

Agenda • Introduction • PQC: Personalized Query Classification • Experiments • Conclusions and Future Work 2

Introduction • Query Classification (QC) aims to classify Web queries into topical categories. • Since queries are short and ambiguous, the same query may need to be classified into different categories depending on different users' perspectives. 3

Introduction (cont) • Users' preferences hidden in clickthrough logs help improve the understanding of users' queries. • We propose to connect QC with user preference learning from clickthrough logs, yielding Personalized Query Classification (PQC). • To tackle the sparseness of clickthrough logs, we propose a collaborative ranking model that leverages similar users' information. 4

PQC: Personalized Query Classification • Overall Model • User Preference for PQC • Collaborative Ranking Model for Learning User Preferences • Combining Long-Term Preferences and Short-Term Preferences to Improve PQC 5

Overall Model • QC aims to estimate P(c | q), the probability of category c given query q. • PQC aims to estimate P(c | q, u), the probability of category c given query q and user u. • Assume that user u has stable interests that are independent of the current query q. 6

Overall Model (cont) • Goal: to estimate P(c | q, u). • We use the winning solution to QC in KDDCUP 2005 to estimate P(c | q). • To estimate P(c | q, u), we target the problem of estimating the user's preference over categories, P(c | u). 7
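The slides leave the combination rule implicit; below is a minimal sketch, assuming that q and u are independent given the category c so that P(c | q, u) ∝ P(c | q) · P(c | u) / P(c). The function name, dictionary layout, and smoothing constant are illustrative assumptions, not taken from the paper.

    def personalized_scores(p_c_given_q, p_c_given_u, p_c):
        """All arguments map category -> probability; returns a normalized P(c|q,u) estimate."""
        scores = {}
        for c, pq in p_c_given_q.items():
            prior = p_c.get(c, 1e-9)                      # smoothed category prior
            scores[c] = pq * p_c_given_u.get(c, prior) / prior
        total = sum(scores.values()) or 1.0
        return {c: s / total for c, s in scores.items()}  # normalize over categories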

User Preference for PQC • An intuitive idea is that the historical queries submitted by a user can help learn user preferences: we can treat the problem of estimating user preferences as a query classification problem, e.g., by aggregating the classifier's predictions P(c | q') over the user's historical queries q'. • This approach can be used to learn short-term user preferences within a short time period, e.g., within a search session. 8
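As a toy illustration of the session-based idea (assumed details: simple averaging of the classifier's outputs; classify and the data layout are hypothetical names, not from the paper):

    from collections import defaultdict

    def short_term_preference(session_queries, classify):
        """classify(q) returns {category: P(c|q)}; returns an averaged short-term P(c|u)."""
        acc = defaultdict(float)
        for q in session_queries:
            for c, p in classify(q).items():
                acc[c] += p
        n = max(len(session_queries), 1)
        return {c: v / n for c, v in acc.items()}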

User Preference for PQC (cont) • This method has several problems for preference learning: 1) the sessions containing the current query may not reflect all search interests of a given user; 2) it does not utilize the user's click history information; 3) the sessions of one user may be too limited to infer user preferences. 9

Collaborative Ranking Model • We use a collaborative ranking model to estimate P(c | u), in two steps: 1. Generating pairwise preferences 2. A log-likelihood function for learning user preferences 10

1. Generating Pairwise Preferences • C_q: the set of categories a query classifier assigns to a query q. • C_p: the set of categories of a clicked page p. • Each category of a clicked page (C_p) is taken to be preferred by the user over the query's remaining predicted categories (C_q \ C_p), yielding pairwise category preferences. 11
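A hedged sketch of the pair-generation step, under the reading above that clicked-page categories are preferred over the query's other predicted categories; only the roles of C_q and C_p come from the slide, the function name and set-based layout are assumptions.

    def generate_preference_pairs(query_categories, clicked_page_categories):
        """query_categories: set C_q predicted for q; clicked_page_categories: set C_p.
        Returns (c_i, c_j) pairs meaning the user prefers c_i over c_j."""
        preferred = clicked_page_categories & query_categories
        rejected = query_categories - clicked_page_categories
        return [(ci, cj) for ci in preferred for cj in rejected]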

2. Learning User Preferences • To model the pairwise preferences, we use the Bradley–Terry model to define their likelihood: P(c_i ≻_u c_j) = s_ui / (s_ui + s_uj), where s_ui is the preference score for user u and category i. • For each user u, we have a set of such pairwise preferences. (figure: preference scores shown on an exponential scale) 12

2. Learning User Preferences (cont) • We model the preference score s_ui as the joint probability P(u, c_i) under a PLSA model: s_ui = P(u, c_i) = Σ_z P(z) P(u | z) P(c_i | z). 13

2. Learning User Preferences (cont) • Our log-likelihood function for estimating the preference scores combines the Bradley–Terry model with the PLSA model. 14
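The formula itself did not survive extraction; a plausible reconstruction from the stated ingredients (Bradley–Terry likelihood over the preference pairs, PLSA factorization of the scores), with notation assumed, is:

    \mathcal{L}(\Theta) = \sum_{u} \sum_{(c_i \succ_u c_j)} \log \frac{s_{ui}}{s_{ui} + s_{uj}},
    \qquad
    s_{ui} = P(u, c_i) = \sum_{z} P(z)\, P(u \mid z)\, P(c_i \mid z).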

2. Learning User Preferences (cont) • We use a gradient-descent algorithm to optimize the objective function. • The complexity of calculating the gradient is on the order of N, where N is the number of preference pairs in the training data. • Therefore, our algorithm can handle large-scale data with millions of users. • After obtaining the preference score matrix, we can get P(c | u). 15
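A minimal, self-contained training sketch, not the authors' implementation: for brevity it learns a per-(user, category) log-score directly instead of the PLSA-factorized score, and uses plain stochastic gradient ascent on the Bradley–Terry log-likelihood. Each epoch touches every preference pair once, matching the O(N) gradient cost stated above.

    import math
    from collections import defaultdict

    def train_bt_scores(pairs, lr=0.1, epochs=20):
        """pairs: (user, preferred_category, rejected_category) triples.
        Learns theta[(u, c)]; s_uc = exp(theta[(u, c)]) plays the role of the
        Bradley-Terry preference score."""
        theta = defaultdict(float)
        for _ in range(epochs):
            for u, ci, cj in pairs:
                diff = theta[(u, ci)] - theta[(u, cj)]
                p = 1.0 / (1.0 + math.exp(-diff))   # model's P(c_i preferred over c_j)
                g = 1.0 - p                          # gradient of this pair's log-likelihood
                theta[(u, ci)] += lr * g
                theta[(u, cj)] -= lr * g
        return theta

    def preference_distribution(theta, user, categories):
        """Normalize exp(theta) over the categories to read off P(c | u) for one user."""
        scores = {c: math.exp(theta[(user, c)]) for c in categories}
        z = sum(scores.values()) or 1.0
        return {c: s / z for c, s in scores.items()}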

Combining Long-Term Preferences and Short-Term Preferences to Improve PQC • Users typically have long-term preferences as well as short-term preferences. • We can use the session-based method above to learn short-term preferences and the collaborative ranking model to learn long-term preferences. 16
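The slides do not give the combination rule; one simple assumed option is a linear interpolation of the two preference estimates, with the mixing weight alpha being a hypothetical tuning parameter:

    def combine_preferences(long_term, short_term, alpha=0.5):
        """Both arguments map category -> P(c|u); alpha weights the long-term estimate."""
        cats = set(long_term) | set(short_term)
        return {c: alpha * long_term.get(c, 0.0) + (1 - alpha) * short_term.get(c, 0.0)
                for c in cats}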

Experiments (dataset) • The clickthrough log dataset is obtained from a commercial search engine. • Duration: 2007/11/1 ~ 2007/11/15 • We randomly select 10,000 users, covering 22,696 queries and 51,366 URLs. 17

Experiments (evaluation) • We check the consistency between PQC's predictions and the user's clicking behavior. • We define a consistency metric over three sets: the top-k categories predicted for the query q, all categories predicted for the query q, and the categories obtained from the user's clicked pages p. 18
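The exact metric formula did not survive extraction; one plausible reading, counting how many of the top-k predicted categories also appear among the clicked-page categories, is sketched below purely as an assumption:

    def consistency_at_k(predicted_ranked, clicked_categories, k):
        """predicted_ranked: categories for q ordered by PQC score (best first);
        clicked_categories: set of categories of the user's clicked pages."""
        top_k = predicted_ranked[:k]
        hits = sum(1 for c in top_k if c in clicked_categories)
        return hits / max(min(k, len(clicked_categories)), 1)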

Experiments (classification labels) • We use the categories defined in the ACM KDDCUP’05 dataset. • 67 categories in total, forming a hierarchy. 19

Effects of Personalization on QC 20

Effect of Long-Term and Short-Term 21

On User Preferences Prediction 22

Conclusions • We developed a personalized query classification model, PQC, which can significantly improve the accuracy of QC. • The PQC solution uses a collaborative ranking model for user preference learning that leverages many similar users' preferences. • We also proposed an evaluation method for PQC using clickthrough logs. 23

Future Work • A functional output for PQC, e.g., categorizing queries into commercial/non-commercial ones. • Since users' interests change over time, we can take time into consideration. • Apply the PQC solution to other personalized services such as personalized search and advertising. 24