1 Learning User Clicks in Web Search Ding Zhou et al. The Pennsylvania State University IJCAI 2007.

Slides:



Advertisements
Similar presentations
Basic IR: Modeling Basic IR Task: Slightly more complex:
Advertisements

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Crawling, Ranking and Indexing. Organizing the Web The Web is big. Really big. –Over 3 billion pages, just in the indexable Web The Web is dynamic Problems:
Named Entity Mining From Click-Through Data Using Weakly Supervised LDA Gu Xu 1, Shuang-Hong Yang 1,2, Hang Li 1 1 Microsoft Research Asia, China 2 College.
1 Fuchun Peng Microsoft Bing 7/23/  Query is often treated as a bag of words  But when people are formulating queries, they use “concepts” as.
Personalized URLs (PURLs). Step One Collect Data from customers through multiple touch points (point-of-purchase, website, customer service, etc)
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
Web Markov Skeleton Processes and Applications Zhi-Ming Ma 10 June, 2013, St.Petersburg
BBM: Bayesian Browsing Model from Petabyte-scale Data Chao Liu, MSR-Redmond Fan Guo, Carnegie Mellon University Christos Faloutsos, Carnegie Mellon University.
Introduction to Data-driven Animation Jinxiang Chai Computer Science and Engineering Texas A&M University.
Looking at both the Present and the Past to Efficiently Update Replicas of Web Content Luciano Barbosa * Ana Carolina Salgado ! Francisco Tenorio ! Jacques.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Ph.D. DefenceUniversity of Alberta1 Approximation Algorithms for Frequency Related Query Processing on Streaming Data Presented by Fan Deng Supervisor:
Learning to Extract Symbolic Knowledge from the World Wide Web Changho Choi Source: Mark Craven,
How can Behavioral Targeting Help Online Advertising? Yan et. al October 23, 2014 Sam Hewitt.
Build a SharePoint App with Microsoft Access. About me.
Current Gen. Consoles By Jonathan Mrema 5/6/2015.
Daphne Koller Bayesian Networks Application: Diagnosis Probabilistic Graphical Models Representation.
The College of Saint Rose CSC 460 / CIS 560 – Search and Information Retrieval David Goldschmidt, Ph.D. from Search Engines: Information Retrieval in Practice,
1 Announcements Research Paper due today Research Talks –Nov. 29 (Monday) Kayatana and Lance –Dec. 1 (Wednesday) Mark and Jeremy –Dec. 3 (Friday) Joe and.
Language Identification of Search Engine Queries Hakan Ceylan Yookyung Kim Department of Computer Science Yahoo! Inc. University of North Texas 2821 Mission.
David Pardoe Doran Chakraborty Peter Stone The University of Texas at Austin Department of Computer Science TacTex-09: A Champion Bidding Agent for Ad.
Statistical problems in network data analysis: burst searches by narrowband detectors L.Baggio and G.A.Prodi ICRR TokyoUniv.Trento and INFN IGEC time coincidence.
Lecture 8—Probability and Statistics (Ch. 3) Friday January 25 th Quiz on Chapter 2 Classical and statistical probability The axioms of probability theory.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
Know your Neighbors: Web Spam Detection Using the Web Topology Presented By, SOUMO GORAI Carlos Castillo(1), Debora Donato(1), Aristides Gionis(1), Vanessa.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Fan Guo 1, Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Understanding Query Ambiguity Jaime Teevan, Susan Dumais, Dan Liebling Microsoft Research.
External user invited This creates invitation in Access Request List Invitation sent to guest with invitation URL Guest clicks URL. Verification.
Host and Application Security Lesson 20: How the Web Does not Work.
Dependent and Independent Events. Events are said to be independent if the occurrence of one event has no effect on the occurrence of another. For example,
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
A Hierarchical Monothetic Document Clustering Algorithm for Summarization and Browsing Search Results Kummamuru et al. Presented by Bei Yu Sept. 22 nd,
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
Introduction to touchdevelop ✿ art read only resources Disclaimer: This document is provided “as-is”. Information and views expressed in this document,
Evaluation of Recommender Algorithms for an Internet Information Broker based on Simple Association Rules and on the Repeat-Buying Theory WEBKDD 2002 Edmonton,
Advisor : Prof. Sing Ling Lee Student : Chao Chih Wang Date :
Xbox 360/Xbox Live Dani Helmlinger
Understanding and Predicting Personal Navigation.
A New Approach for English- Chinese Named Entity Alignment Donghui Feng Yayuan Lv Ming Zhou USC MSR Asia EMNLP-04.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Testing Predictive Performance of Ecological Niche Models A. Townsend Peterson, STOLEN FROM Richard Pearson.
HOW TO FIND RESEARCH PROJECT PAPER HOW TO FIND RESEARCH PROJECT PAPER Click here to begin.
Identifying Spam Web Pages Based on Content Similarity Sole Pera CS 653 – Term paper project.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
SOFTWARE REQUIREMENTS. INTERNET GAMING SOFTWARE Windows XP/VistaAdobe Reader DirectX 9.0cActiveX Windows media player 3D hardware Acccelerator 2X DVD.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
By: Bryce  1.They have evolved a long way.  2.They are better than the other game consoles.  3. About million people have a xbox360 or more.
By This video tutorial will show you how to download Thief The Bank Heist DLC full game free on your Xbox 360.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Ariel Fuxman, Panayiotis Tsaparas, Kannan Achan, Rakesh Agrawal (2008) - Akanksha Saxena 1.
© The Author(s) Published by Science and Education Publishing.
Unit 8 Probability.
Internet Search What you need to know!.
Search User Behavior: Expanding The Web Search Frontier
Experimental Probability
Mining Query Subtopics from Search Log Data
Evidence from Behavior
To insert a hyperlink ( a web page address, URL) using text
Query Type Classification for Web Document Retrieval
CLICK TO START.
CLICK TO START.
Call Now : Click : -
Call Now : Click : -
Call Now : Click : -
Presentation transcript:

1 Learning User Clicks in Web Search Ding Zhou et al. The Pennsylvania State University IJCAI 2007

2 What is Click Prediction? microsoft xbox 360 kinect game

3 What is Click Prediction? microsoft xbox 360 kinect game 

4 Click Prediction Model P( | microsoft xbox 360 kinect game) P( | microsoft xbox 360 kinect game) P( | microsoft xbox 360 kinect game)

5 Two approaches 1. Full Model: P F ( TW/kinect | “ microsoft xbox 360 kinect game ” ) 2. Independent Model: P I ( | microsoft, xbox, 360, kinect, game)

6 Two Approaches: Pros and Cons Full Model High prediction accuracy Low coverage Independent Model Low prediction accuracy High coverage

7 New Approach: Conditional Probability Hierarchy P(url | microsoft xbox 360 kinect game) = P F (url | “ microsoft xbox 360 kinect game ” ) + (1- )P I (url | microsoft, xbox, 360, kinect, game)

8 New Approach: Conditional Probability Hierarchy P(url | microsoft xbox 360 kinect game) = P F (url | “ microsoft xbox 360 kinect game ” ) + (1- )P I (url | microsoft, xbox, 360, kinect, game) P I (url | microsoft, xbox, 360, kinect, game) = P(url|microsoft)P(url|xbox)P(url|360)P(url|kinect) P(url|game)

9 New Approach: Conditional Probability Hierarchy P(url | microsoft xbox 360 kinect game) = P F (url | “ microsoft xbox 360 kinect game ” ) + (1- )P I (url | microsoft, xbox, 360, kinect, game) P I (url | microsoft, xbox, 360, kinect, game) = P(url|microsoft)P(url|xbox)P(url|360)P(url|kinect) P(url|game)

10 New Approach: Conditional Probability Hierarchy microsoft xbox360kinect game P(url|microsoft xbox)P(url|360 kinect) P(url|360 kinect game) P(url|microsoft xbox 360 kinect game)

11 New Approach: Conditional Probability Hierarchy P(url | microsoft xbox 360 kinect game) = P F (url | “ microsoft xbox 360 kinect game ” ) + (1- )P I (url | microsoft, xbox, 360, kinect, game) P I (url | microsoft, xbox, 360, kinect, game) = f( P F (url | “ microsoft xbox ” ), (1- )P I (url | microsoft, xbox), P F (url | “ 360 kinect game ” ), (1- )P I (url | 360, kinect, game))

12 New Approach: Conditional Probability Hierarchy P(url | microsoft xbox 360 kinect game) = P F (url | “ microsoft xbox 360 kinect game ” ) + (1- )P I (url | microsoft, xbox, 360, kinect, game) is directly proportional to the occurrence frequency of “ microsoft xbox 360 kinect game ”

13 How to predict? What ’ s the threshold probability for considering P(url | microsoft xbox 360 kinect game) as a click???

14 How to predict? What ’ s the threshold probability for considering P(url | microsoft xbox 360 kinect game) as a click??? Not said in paper …

15 Experimental Corpus CiteSeer 1,826,817 query-click pairs

16 Prediction Accuracy

17 Prediction Coverage

18 Accuracy-Coverage Tradeoff