EGIT532 - Data Science and Big Data Analytics
Individual Project Specification
Asst. Prof. Sotarat Thammaboosadee, Ph.D.
Project
- Individual project.
- Submit the report as a PDF via email before 12 May 2019.
- Email subject: project-61xxxxx
Topics
- Problems
- Source of Data
- Data Mining Tasks
- Data Mining Process
  - Business understanding
  - Data understanding
  - Data preprocessing
  - Model building
  - Model evaluation
  - Deployment
Problems
- What is your motivation for applying data science to your data?
Source of Data
- Any data source
- At least 10,000 examples
- At least 8 attributes, or text data
- If you put more effort into this stage, the work may become part of your thesis/thematic paper.
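The size requirements above can be checked before starting the analysis. The following is a minimal sketch in Python, which is an assumption of this example (the slides themselves refer to RapidMiner). The function name `check_dataset_size`, the CSV input format, and the reading that the 8-attribute minimum does not apply to text data are all illustrative assumptions, not part of the specification.

```python
import csv
import io

MIN_EXAMPLES = 10_000  # minimum number of examples, from the specification
MIN_ATTRIBUTES = 8     # minimum number of attributes, from the specification


def check_dataset_size(csv_file, is_text_data=False):
    """Return (n_examples, n_attributes, ok) for a CSV file with a header row.

    Assumption: for text data the attribute minimum is not enforced.
    """
    reader = csv.reader(csv_file)
    header = next(reader)
    n_examples = sum(1 for row in reader if row)  # blank lines are skipped
    n_attributes = len(header)
    ok = n_examples >= MIN_EXAMPLES and (
        is_text_data or n_attributes >= MIN_ATTRIBUTES
    )
    return n_examples, n_attributes, ok


# Synthetic in-memory data, used only to make the sketch runnable.
sample = io.StringIO(
    "a,b,c,d,e,f,g,h\n" + "\n".join("1,2,3,4,5,6,7,8" for _ in range(10_000))
)
print(check_dataset_size(sample))  # (10000, 8, True)
```

For a real file, pass an open file object instead of the synthetic sample, for example one opened with `open(path, newline="")`.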
Data Mining Tasks
- What?
  - Classification
  - Clustering
  - Association
  - Etc.
- Why?
Business Understanding
- Provide a few paragraphs introducing your work.
Presentation
- Provide one or more flowcharts of your data mining process.
- You may use a screenshot of your RapidMiner workflow.
- Rename each box with a meaningful name.
Data Understanding
- Type of each attribute
- Example data set
- Meaning?
- Statistical report
- Data profile
- Visualization
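As an illustration of what a data profile might contain, here is a minimal sketch in Python (an assumption of this example; the slides refer to RapidMiner, which provides its own statistics view). The function name `profile`, the two-way numeric/nominal type split, and the sample rows are hypothetical.

```python
import statistics


def profile(rows):
    """Summarize each attribute of `rows`, a list of dicts with string values.

    An attribute is reported as numeric when every non-missing value parses
    as a float, and as nominal otherwise. An attribute with no non-missing
    values is reported as nominal.
    """
    report = {}
    for name in rows[0]:
        values = [r[name] for r in rows if r[name] not in ("", None)]
        missing = len(rows) - len(values)
        try:
            numbers = [float(v) for v in values]
            report[name] = {
                "type": "numeric",
                "missing": missing,
                "min": min(numbers),
                "max": max(numbers),
                "mean": statistics.mean(numbers),
            }
        except ValueError:
            report[name] = {
                "type": "nominal",
                "missing": missing,
                "distinct": len(set(values)),
            }
    return report


# Hypothetical sample rows; empty strings stand for missing values.
rows = [
    {"age": "25", "city": "Bangkok"},
    {"age": "31", "city": ""},
    {"age": "", "city": "Chiang Mai"},
]
for attribute, summary in profile(rows).items():
    print(attribute, summary)
```

A report of this kind covers the attribute types and basic statistics; the meaning of each attribute and the visualizations still have to be supplied separately.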
Data Preprocessing
- Use more than one method.
- State why you chose each method.
- Provide a data visualization or profile for each preprocessing step.
Model Building
- Use more than one algorithm.
- A model may combine several algorithms, depending on your design.
- A more complex (but reasonable) process will earn more points.
Model Evaluation
- Compare the results across each preprocessing method and each algorithm.
- Select appropriate evaluation criteria.
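One way to organize such a comparison is a grid with one result per (preprocessing method, algorithm) pair. The sketch below is in Python, which is an assumption of this example rather than the tool named in the slides. The choice of preprocessing methods (none, min-max, z-score), algorithms (1-nearest-neighbor, nearest centroid), holdout accuracy as the criterion, and the synthetic data are all illustrative, not prescribed by the specification.

```python
import math
import random
import statistics


def rescale(rows, shift, scale):
    """Apply (value - shift) / scale to every attribute of every row."""
    return [[(v - s) / k for v, s, k in zip(row, shift, scale)] for row in rows]


def identity(train, test):
    return train, test


def min_max(train, test):
    """Rescale attributes to [0, 1] using the range of the training data."""
    cols = list(zip(*train))
    lo = [min(c) for c in cols]
    span = [(max(c) - min(c)) or 1.0 for c in cols]  # avoid division by zero
    return rescale(train, lo, span), rescale(test, lo, span)


def z_score(train, test):
    """Standardize attributes using the training mean and standard deviation."""
    cols = list(zip(*train))
    mu = [statistics.mean(c) for c in cols]
    sd = [statistics.pstdev(c) or 1.0 for c in cols]  # avoid division by zero
    return rescale(train, mu, sd), rescale(test, mu, sd)


def nearest_neighbor(train_x, train_y, x):
    """1-NN: label of the closest training example (Euclidean distance)."""
    best = min(range(len(train_x)), key=lambda i: math.dist(train_x[i], x))
    return train_y[best]


def nearest_centroid(train_x, train_y, x):
    """Label of the closest class mean (recomputed per call, for brevity)."""
    centroids = {}
    for label in set(train_y):
        members = [r for r, y in zip(train_x, train_y) if y == label]
        centroids[label] = [statistics.mean(c) for c in zip(*members)]
    return min(centroids, key=lambda label: math.dist(centroids[label], x))


def accuracy(predict, train_x, train_y, test_x, test_y):
    hits = sum(predict(train_x, train_y, x) == y for x, y in zip(test_x, test_y))
    return hits / len(test_y)


def compare(train_x, train_y, test_x, test_y):
    """Return {(preprocessing, algorithm): holdout accuracy}."""
    preprocessors = {"none": identity, "min-max": min_max, "z-score": z_score}
    algorithms = {"1-NN": nearest_neighbor, "nearest centroid": nearest_centroid}
    results = {}
    for p_name, prep in preprocessors.items():
        tr, te = prep(train_x, test_x)
        for a_name, algo in algorithms.items():
            results[(p_name, a_name)] = accuracy(algo, tr, train_y, te, test_y)
    return results


def make_synthetic(n):
    """Two classes; the second attribute is large-scale noise."""
    xs, ys = [], []
    for _ in range(n):
        label = random.choice(["a", "b"])
        center = 0.0 if label == "a" else 3.0
        xs.append([random.gauss(center, 1.0), random.gauss(0.0, 1000.0)])
        ys.append(label)
    return xs, ys


random.seed(0)
train_x, train_y = make_synthetic(200)
test_x, test_y = make_synthetic(100)
for (p_name, a_name), acc in compare(train_x, train_y, test_x, test_y).items():
    print(f"{p_name:8s} {a_name:17s} accuracy = {acc:.2f}")
```

Accuracy is used here only because it is simple to compute. Depending on the task, criteria such as precision, recall, or a clustering measure may be more appropriate, and cross-validation is usually more reliable than a single holdout split.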
Deployment
- What do you obtain from the results? Use visualization.
  - Knowledge
  - Application
  - Policy
  - Etc.