Outline Class Intros Overview of Course & Series Example Research Projects Beginning R
What are your goals? What types of problems? datasets? Which software do you currently use? Introductions
Specialized Software Programming Language Statistical Software DataViz Who What When Where Why or How?
Course / Certificate Overview u
TBAN100 APPLIED STATISTICS TBAN120 DATABASE MANAGEMEN T TBAN130 DATA MINING TBAN110 PREDICTIVE MODELING
Comparison of Data-Related Certificates
Modeler Survey of Data-Related Roles
Predictive Analytics Data Analysis SoftwareStatistics Data Mining
Continuous & Categorical Variables ContinuousCategorical Continuous Categorical Histogram Scatter Bar Cross Table Boxplot Predictor Variable (X-Axis) Pie Height Smartphone? Yes or No (Frequency) Outcome, Dependent Variable (Y-Axis) Mosaic Cross Table Logistic Regression Linear Regression Decision Trees
Breadth vs. Depth vs. Relevancy Class Project
Variables Y X’s Height Independent Variables Dependent Variables Y X4 X3 X2X1
Extensions to Linear Regression Interaction (Non-Linear) Structural Equation Modeling Moderation Mediation Advanced Lasso Ridge Regularized
HR Example
TBAN110 PREDICTIVE MODELING
TBAN120 DATABASE MANAGEMENT SYSTEMS Normalization Database creation Generating Reports Changing Databases Join Structured Query Language
TBAN130 DATA MINING