Presentation is loading. Please wait.

Presentation is loading. Please wait.

Kaggle competition Airbnb Recruiting: New User Bookings

Similar presentations


Presentation on theme: "Kaggle competition Airbnb Recruiting: New User Bookings"— Presentation transcript:

1 Kaggle competition Airbnb Recruiting: New User Bookings
Advanced Network Database Lab Kaggle competition Airbnb Recruiting: New User Bookings Where will a new guest book their first travel experience?

2 Registration Site: https://www.kaggle.com/competitions
Account: IKDD1(Group Number)

3 Airbnb AirBed&Breakfast https://www.airbnb.com.tw/
Book rooms with locals, rather than hotels

4 Airbnb Competition url: new-user-bookings Data url: new-user-bookings/data Leaderboard: new-user-bookings/leaderboard

5 Data Attribute

6 Classification

7 Prediction

8 Decision Tree

9 Sklearn – Python tool Simple and efficient tools for data mining and data analysis! Decision tree url : learn.org/stable/modules/tree.html

10 Homework 1 Registration
Apply a simple algorithm to build the classifier Use the classifier to predict the country a new user will make his or her first booking Submit the result to Kaggle Deadline: next Thursday (12/10)

11 Homework 2 Oral report Deadline: next Thursday (12/17)

12 Homework 3 Try different algorithms to build the best classifier
Use the classifier to predict the survival passengers Submit the result to Kaggle

13 Final project Deadline: 12/23 23:59 Submission:
Submit the results to kaggle your project to Project file content: code prediction result report

14 Report The details of the your best method
The description of the methods that you tried The important attributes or surprised features you found

15 Grading Homework 1: 20% Homework 2: 10% Final Project : 70%
The ranking: 30% Algorithm and coding : 30% Report: 10%

16 XGBoost General purpose gradient boosting library, including generalized linear model and gradient boosted decision tree SITE:

17 tslm A linear model with time series components
SITE: r.org/packages/cran/forecast/docs/tslm

18 H2o.randomForest Random Forest (RF) is a powerful classification tool. When given a set of data, RF generates a forest of classification trees, rather than a single classification tree. Each of these trees generates a classification for a given set of attributes. The classification from each H2O tree can be thought of as a vote; the most votes determines the classification. SITE: ml


Download ppt "Kaggle competition Airbnb Recruiting: New User Bookings"

Similar presentations


Ads by Google