Presentation is loading. Please wait.

Presentation is loading. Please wait.

COMP 4332 / RMBI 4330 Big Data Mining (Spring 2015) Lei Chen Hong Kong University of Science and Technology

Similar presentations


Presentation on theme: "COMP 4332 / RMBI 4330 Big Data Mining (Spring 2015) Lei Chen Hong Kong University of Science and Technology"— Presentation transcript:

1

2 COMP 4332 / RMBI 4330 Big Data Mining (Spring 2015) Lei Chen Hong Kong University of Science and Technology leichen@cse.ust.hk http://www.cse.ust.hkhttp://www.cse.ust.hk/~leichen

3 Topics Review of Basics Practical Data Mining –Imbalanced Data –Text and Web Mining –Big Data –Social Recommendation –Social Media and Social Networks Hands on: 2 Major Projects Student Presentations 2015/9/11Course Introduction2

4 Outcome and Objective Student will know the current state of the art in Data Mining Student will be able to implement a practical data mining project Student will be able to present their ideas well Prepared for PG study, Internship, etc. 2015/9/11Course Introduction3

5 Projects: based on KDDCUPs Project 1: –KDDCUPs on credit rating and customer retention (KDDCUP 2009) Project 2: –Micro-blog (Weibo) User Recommendation (KDDCUP 2012) Project 3 (Optional): KDDCUP 2013 2015/9/11Course Introduction4

6 2015/9/11Course Introduction5 5 KDDCUP Examples —KDDCUP from past years —2007: —Predict if a user is going to rate a movie? —Predict how many users are going to rate a movie? —2006: —Predict if a patient has cancer from medical images —2005: —Given a web query (“Apple”), predict the categories (IT, Food) —1998: —Given a person, predict if this person is going to donate money —In general, we wish to —Input: Data —Output: —Build model —Apply model to future data

7 2015/9/11Course Introduction6 Important Sites  Course Web Site  http://course.cse.ust.hk/comp4332 http://course.cse.ust.hk/comp4332  TA: Yue Wang ywangby@connect.ust.hk  Assignment Hand-in: CASS

8 2015/9/11Course Introduction7 Prerequisites  Statistics and Probability would help,  But will be reviewed in class  Machine Learning/Pattern Recognition would help,  We will review some most important algorithms  One programming language  We will teach new languages in the tutorial

9 2015/9/11Course Introduction8 Grading  Assignments: 20%  Course Projects: 60%  Presentations: 10%  Term Paper: 10%

10 2015/9/11Course Introduction9 More info Textbooks: –Listed on Course Website –Buy them online if you wish


Download ppt "COMP 4332 / RMBI 4330 Big Data Mining (Spring 2015) Lei Chen Hong Kong University of Science and Technology"

Similar presentations


Ads by Google