Detecting Online Commercial Intention (OCI)

Slides:



Advertisements
Similar presentations
Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.
Advertisements

Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Optimizing search engines using clickthrough data
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Distant Supervision for Emotion Classification in Twitter posts 1/17.
■ Google’s Ad Distribution Network ■ Primary Benefits of AdWords ■ Online Advertising Stats and Trends ■ Appendix: Basic AdWords Features ■ Introduction.
Moving Beyond the Search Silo. Before There was Adtech, There was Search.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
By Andrei Broder, IBM Research 1 A Taxonomy of Web Search Presented By o Onur Özbek o Mirun Akyüz.
Web queries classification Nguyen Viet Bang WING group meeting June 9 th 2006.
By Abdul Muizz Creative Director / Marketing Strategist Digital Marketing.
The Google Display Network. Why Display Matters.
Query Log Analysis Naama Kraus Slides are based on the papers: Andrei Broder, A taxonomy of web search Ricardo Baeza-Yates, Graphs from Search Engine Queries.
Website Content, Forms and Dynamic Web Pages. Electronic Portfolios Portfolio: – A collection of work that clearly illustrates effort, progress, knowledge,
Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd sourced Content Author - Anindya Ghose, Panagiotis G.
PRIMISTA ONLINE MARKETING MADE EASY. Slide 2 Agenda Presentation Topics: 1.Introduction to Targeted Marketing 2.Ad Distribution Network 3.Primary Benefits.
CAMBRIDGE COMMUNITY TELEVISION Web Analytics Review & Recommendations.
WEB FORUM MINING BASED ON USER SATISFACTION PAGE 1 WEB FORUM MINING BASED ON USER SATISFACTION By: Suresh Pokharel Information and Communications Technologies.
Fall 2006 Davison/LinCSE 197/BIS 197: Search Engine Strategies 5-1 Search Engine Strategies: Road Map INTRO: What is SEM PLANNING: Things to Know BEFORE.
Tag-based Social Interest Discovery
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large- scale Data Collections Xuan-Hieu PhanLe-Minh NguyenSusumu Horiguchi GSIS,
Searching the Web Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
CIS 430 November 6, 2008 Emily Pitler. 3  Named Entities  1 or 2 words  Ambiguous meaning  Ambiguous intent 4.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
Improving Cloaking Detection Using Search Query Popularity and Monetizability Kumar Chellapilla and David M Chickering Live Labs, Microsoft.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
Improving Classification Accuracy Using Automatically Extracted Training Data Ariel Fuxman A. Kannan, A. Goldberg, R. Agrawal, P. Tsaparas, J. Shafer Search.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
GrammAds: Keyword and Ad Creative Generator for Online Advertising Campaigns Author : Stamatina Thomaidou, Konstantinos Leymonis, and Michalis Vazirgiannis.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
CoCQA : Co-Training Over Questions and Answers with an Application to Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein.
 Who Uses Web Search for What? And How?. Contribution  Combine behavioral observation and demographic features of users  Provide important insight.
Post-Ranking query suggestion by diversifying search Chao Wang.
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
1 Search Engine Strategies SJ 2006 Presented By Anne Frisbie VP, Category Yahoo! Search Marketing Power of Search Reach and Engage the Empowered Consumer.
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Distinguishing humans from robots in web search logs preliminary results using query rates and intervals Omer Duskin Dror G. Feitelson School of Computer.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Web Intelligence and Intelligent Agent Technology 2008.
Resources of a Resource By, Anupama Atmakur Pooja Adudodla.
Google Display Network. Targeting options.
The Google Display Network. Why Display Matters..
ecommerce + affiliate marketing 2003 ecommerce store 2004 – 2006 ecommerce + affiliate marketing 2007 – present TopLine Media Group.
Click Through Rate Prediction for Local Search Results
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Business Plan Strategy
Search Engine Optimisation
Translating Google Analytics into Marketing Metrics
“ ” The challenge The solution The results Vegas.com deals aces high
Web Design and Development
Consumers Online Before firms can begin to sell their products online, they must first understand what kinds of people they will find online and how.
Training Deck – SEM & Facebook Ads
Mining Query Subtopics from Search Log Data
Text Analysis and Search Analytics
Text Analysis and Search Analytics
Ping LUO*, Fen LIN^, Yuhong XIONG*, Yong ZHAO*, Zhongzhi SHI^
WSExpress: A QoS-Aware Search Engine for Web Services
Journal of Web Semantics 55 (2019)
Presentation transcript:

Detecting Online Commercial Intention (OCI) Honghua (Kathy) Dai, Zaiqing Nie, Lee Wang, Lingzhi Zhao, Ji-Rong Wen, and Ying Li 11/12/2018

Agenda Motivations and introduction to OCI (Online Commercial Intention) A machine learning-based approach for OCI detection Experiments Conclusion and future work 11/12/2018

Motivation Serving ads will be more effective and less annoying, when user has intent to purchase We are interested in detecting web pages / queries that show intention to commit a commercial activity (purchase, rent, bid, or sell…) 11/12/2018

OCI vs 3 search goal categories Navigational The immediate intent is to reach a particular site Informational The intent is to acquire some information assumed to be present on one or more web pages. Transactional The intent is to perform some web-mediated activity OCI can bee seen as a new dimension of user search goals. Commercial Non-Commercial Navigational walmart hotmail Informational Digital camera San Francisco Transactional / Resource U2 music download Collide lyrics 11/12/2018

Define the OCI detection problem A binary classification problem OCI: Query/Page -> {Commercial, Non-Commercial} We can derive the commercial sense from a confidence value that ranges from 0 (no commercial intent) to 1 (strong commercial intent) 11/12/2018

Framework of Detecting Page OCI 11/12/2018

Keywords selection Select significant and reliable keywords Significance: Frequency: Keyword selection threshold For simplicity we use the same threshold for the two measures in the experiments. 11/12/2018

Page feature composition We define two aspects of properties for each keyword in a page p: keyword occurrences in inner text keyword occurrences in tag attributes As the result, a page p is represented by a feature vector using these two aspects 11/12/2018

Detecting query OCI Challenges Solution Only few search queries contain explicit commercial indicators, such as “buy”, “price”, “rent”, “discount”, etc. Search queries are usually short. Solution Enrich query from external resource (search engine) First result page (Query snippets) Top N landing URLs Query classification problem -> page classification problem 11/12/2018

Search result page and Landing URLs 11/12/2018

Query OCI Detector based on Top N Landing URLs 11/12/2018

Query OCI Detector based on first search result page Build a dedicated search result page classifier for this purpose 11/12/2018

Labeling process We adopted majority vote: 3 human labelers voted for the labels Initial Web pages and queries were randomly selected from our page/query repository. Pages Queries Commercial 4074 602 Non-Commercial 21823 790 Total 25897 1408 11/12/2018

Experiment Results - Page OCI detector Reach best performance (CF) when keyword selection threshold = 0.1 (using SVM as the classifier) CP, CR and CF are the precision, recall and F1 metrics for detecting commercial intent. Keyword Selection Threshold Keyword Number CP CR CF 0.1 391 0.930 0.925 0.928 11/12/2018

Experiment Results- Query OCI detector Model based on first result page returns best performance. 11/12/2018 Query OCI Detector Performance

OCI Distribution among Query Frequency Ranges 11/12/2018

Conclusions The notion of OCI (Online Commercial Intention) and the problem of detecting OCI from pages and queries. The framework of building machine learning models to detect OCI based on Web page content. Based on this framework, we build models to detect OCI from search queries. 11/12/2018

Conclusions (cont.) Our framework trains learning models from two types of data sources for a given search query: content of first search result page (query snippets) content of top landing URLs returned by search engine. Experiments showed that the model based on the first search result page achieved better performance. We also discovered an interesting phenomenon that the portion of queries having commercial intention is higher in frequent query sets. 11/12/2018

Future work Utilize search query click through logs Reduce labeling effort Take user online context into consideration in studying user’s online intention Detect at which commercial activity phase a user is (research/commit). 11/12/2018

Future Work (Cont.) A lot of more interesting research problems! Detect more detailed commercial intentions in different verticals Traveling intention and preferences. Branding awareness and preferences. Study how specific the user intention is: “Halo2” vs “video games” “cheap airline ticket new york to las vegas” vs “book a flight” Study the correlations between conversion rate and user intention. A lot of more interesting research problems! We are HIRING! Contact:KathyDai@microsoft.com 11/12/2018

Thank You for Your Attention! 11/12/2018