RapStar’s Solution to Data Mining Hackathon on Best Buy Mobile Site Kingsfield, Dragon.

Slides:



Advertisements
Similar presentations
Data Mining and Text Analytics Advertising Laura Quinn.
Advertisements

Adaptive Mesh Subdivision for Precomputed Radiance Transfer Jaroslav Křivánek Univ. of Central Florida CTU Prague IRISA – INRIA Rennes Sumanta Pattanaik.
Group 3 Chad Mills Esad Suskic Wee Teck Tan. Outline  System and Data  Document Retrieval  Passage Retrieval  Results  Conclusion.
WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.
Political Party, Gender, and Age Classification Based on Political Blogs Michelle Hewlett and Elizabeth Lingg.
Media Queries Jeffery Davis CIT media is used to define different style rules for different media types and devices. Media queries.
Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.
Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data Misha Bilenko and Ryen White presented by Matt Richardson.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.
1 Statistical correlation analysis in image retrieval Reporter : Erica Li 2004/9/30.
Research Plan of Affective learning Affective Learning Group.
Why we search: Visualizing and predicting user behavior E Adar, D S Weld, B N Bershad, S Gribble Raju Balakrishnan.
RRT-Connect path solving J.J. Kuffner and S.M. LaValle.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Search engines. The number of Internet hosts exceeded in in in in in
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
1 A Graph-Theoretic Approach to Webpage Segmentation Deepayan Chakrabarti Ravi Kumar
Introduction With TimeCard users can tag SharePoint events with information that converts them into time sheets. This way they can report.
Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Finding the missing number PPT Game. Divide your class into teams. Divide your class into teams. Start with the first question. Start with the first question.
Web Searching. How does a search engine work? It does NOT search the Web (when you make a query) It contains a database with info on numerous Web sites.
Qingqing Gan Torsten Suel CSE Department Polytechnic Institute of NYU Improved Techniques for Result Caching in Web Search Engines.
Improving Classification Accuracy Using Automatically Extracted Training Data Ariel Fuxman A. Kannan, A. Goldberg, R. Agrawal, P. Tsaparas, J. Shafer Search.
Data Mining Algorithms for Large-Scale Distributed Systems Presenter: Ran Wolff Joint work with Assaf Schuster 2003.
in Search PDF, EBooks, Acrobat Files Faster Relevant Lightweight Quick Read Feature Search PDF's from University websites.
Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
Michael Bendersky, W. Bruce Croft Dept. of Computer Science Univ. of Massachusetts Amherst Amherst, MA SIGIR
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
CPSC 203 Introduction to Computers T59 & T64 By Jie (Jeff) Gao.
C LUSTERING FOR T AXONOMY E VOLUTION By -Anindya Das - Sneha Bankar.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Estimating Types in Binaries using Predictive Modeling Omer Katz, Ran El-Yaniv, Eran Yahav Technion, Israel 21/01/2016.
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
GENERATING RELEVANT AND DIVERSE QUERY PHRASE SUGGESTIONS USING TOPICAL N-GRAMS ELENA HIRST.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
 Abbreviation of fourth generation wireless technology  It will provide a comprehensive IP solution where voice, data and multimedia can be given to.
13 Trends That Will Drive SEO in 2016 Presented By, Chennaiseocompany
Adversarial Information System Tanay Tandon Web Enhanced Information Management April 5th, 2011.
1 SINAI at CLEF 2004: Using Machine Translation resources with mixed 2-step RSV merging algorithm Fernando Martínez Santiago Miguel Ángel García Cumbreras.
What is LinkScanner In AVG Antivirus AVG antivirus software 2017 has came up with several new and latest features for their users to make their.
Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ
Search User Behavior: Expanding The Web Search Frontier
Introduction to Data Mining
AdWords and Generalized On-line Matching
Demographics and Weblog Hackathon – Case Study
Easy methods to control your RSS Feeds Footer in WordPress Guided By: wpglobalsupportwpglobalsupport.
BACK SOLUTION:
Backpage York An Alternative To Backpage. Backpage.me.uk Backpage.me.uk provides similar features like backpage with some additional services. Backpage.
Case-Based Reasoning System for Bearing Design
Mining Query Subtopics from Search Log Data
Backpage mobile ibackpage.com ibackpage.com is very innovative and creative website that lets you post any kind of ad as per your.
Classification and Prediction
Agenda What is SEO ? How Do Search Engines Work? Measuring SEO success ? On Page SEO – Basic Practices? Technical SEO - Source Code. Off Page SEO – Social.
For use on your feedback page
CS246: Leveraging User Feedback
CLICK TO START.
INF 141: Information Retrieval
Predicting Loan Defaults
CLICK TO START.
Title: select a descriptive title
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Divide 9 × by 3 ×
Call Now : Click : -
Call Now : Click : -
Call Now : Click : -
Presentation transcript:

RapStar’s Solution to Data Mining Hackathon on Best Buy Mobile Site Kingsfield, Dragon

Beat Benchmark

Use Time information Time is a good feature in data mining.

Use Time information Divided data into 12 time periods based on click_time field Use frequency at time period where click_time belongs to as “prior” instead of global frequency.

Use Time information Smooth data

Unigram to Bigram

Data Processing The most important part: Query Correction – Lemmatization – Split words and number – Query correction(in small version) A lot of thing that can help to improve: – “x box”, “x men” – New algorithm for query correction Rank predictions that user clicked lower.

Conclusion Data Preprocessing and feature Engineering are most important things.