Download presentation
Presentation is loading. Please wait.
Published byRalph Chandler Modified over 9 years ago
1
RapStar’s Solution to Data Mining Hackathon on Best Buy Mobile Site Kingsfield, Dragon
2
Beat Benchmark
4
Use Time information Time is a good feature in data mining.
5
Use Time information Divided data into 12 time periods based on click_time field Use frequency at time period where click_time belongs to as “prior” instead of global frequency.
6
Use Time information Smooth data
7
Unigram to Bigram
8
Data Processing The most important part: Query Correction – Lemmatization – Split words and number – Query correction(in small version) A lot of thing that can help to improve: – “x box”, “x men” – New algorithm for query correction Rank predictions that user clicked lower.
9
Conclusion Data Preprocessing and feature Engineering are most important things.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.