Download presentation
Presentation is loading. Please wait.
Published byCharlene Carter Modified over 8 years ago
1
DATA MODELING & PREPARATION Biz Pro 9 th Study Group
2
DM Task Formulation Data Collection Data Cleaning Data Exploration Data Modeling Define PurposeModel Selection Perform. Evaluation Model Deployment Initial Phase 90% Efforts Middle Phase 90% Professions Final Phase 90% Domains
3
Data Preparation and Exploration Data Preparation Data Visualization Dimension Reduction Prediction Linear Regression K-Nearest Neighbors Neural Networks Classification K-Nearest Neighbors Decision Trees Logistic Regression Neural Networks Unsupervised Cluster Analysis Performance Evaluation Cross Validation Performance Measures Time Series Forecasting Regression Methods Smoothing Methods Linear Processes Non-linear Processes Model Deployment Scoring New Data Domain Expertise Analysis
4
SPSS is a Windows based program Perform data entry and analysis and to create tables and graphs After SPSS 18.0, the company is acquisited by IBM (2009)
5
Menu Bar & Icons Data View/ Variable View
6
Just like Spreadsheet In EXCEL
7
The Variable View allows you to name your variables, to identify missing values, assign variable and value labels etc.
8
檔案 >> 開啟 >> 資料
12
Most of the time, the given variables are not coded as we need. Example – Gender = “Male” and “Female” But when modeling, we want “Is_male” = 1 or 0 Example – Score = 0 – 100 (Raw Score) But when analyzing, we want “grades” = A,B,C,D,X Example – Binning a Continuous Variables (Discretization)
13
轉換 >> 重新編碼成同一變數
14
轉換 >>Visual Binning/ 最適 Binning
16
轉換 >> 建立虛擬變數 (Need Python Essentials) 重新編碼成同一 / 不同變數 (Spend some time…)
17
資料 >> 選擇觀察值 (Only Numeric Variable Works)
18
資料 >> 合併檔案 >> 新增變數 (Need Same Variable Names)
19
Merged files according to start_station Merged files according to end_station
20
END! Thanks!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.