Presentation is loading. Please wait.

Presentation is loading. Please wait.

Asst. Prof. Sotarat Thammaboosadee, Ph.D.

Similar presentations


Presentation on theme: "Asst. Prof. Sotarat Thammaboosadee, Ph.D."— Presentation transcript:

1 Asst. Prof. Sotarat Thammaboosadee, Ph.D.
EGIT532- Data Science and Big Data Analytics Individual Project Specification Asst. Prof. Sotarat Thammaboosadee, Ph.D.

2 Project Individual Project. Submit report in pdf via email.
Before 12 May 2019 subject: project-61xxxxx

3 Topics Problems Source of Data Data Mining Tasks Data Mining Process
Business understanding Data understanding Data preprocessing Model building Model Evaluation Deployment

4 Problems What are the motivations to apply data science with your data?

5 Source of Data Any data sources
At least 10,000 examples At least 8 attributes or text data But if you take more concentration for this stage, it may be a part of your thesis/thematic paper.

6 Data Mining Tasks Classification Clustering Association Etc…. What?
Why? Association Etc….

7 Business Understanding
Provide some paragraph to introduce your work.

8 Presentation Please provide one or more flow chart of your data mining process. You may capture the Rapidminer workflow Please rename each box in a meaningful name

9 Data Understanding Type of each attributes Example data set Meaning?
Statistical report Data profile Visualization

10 Data preprocessing More than one method
State the reason why you choose them. Data visualization or profiling of each processing step

11 Model building More than one algorithm
Maybe several algorithm in one model, depend on your design More (reasonable) complex process will get more points

12 Model Evaluation Compare between each preprocessing method and each algorithm Select appropriate criteria

13 Deployment What do you obtain from the results? Using visualization
Knowledge Application Policy Etc…


Download ppt "Asst. Prof. Sotarat Thammaboosadee, Ph.D."

Similar presentations


Ads by Google