Data Analytics at CNU Dmitriy Shaltayev

Slides:



Advertisements
Similar presentations
Introduction to Data Mining with XLMiner
Advertisements

© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Formulas and Function Course Overview Logical Functions Data and Time Functions Math Functions Trigonometric Functions Statistical Functions Text Functions.
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
Multivariate Statistics for the Environmental Sciences Peter J. A. Shaw Chapter 1 Introduction.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Lean Principles for Managers (1 day) Understand the true purpose of Lean and its application. Includes a simple simulation. Managing a Process (2 days)
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Highline Class, BI 348 Basic Business Analytics using Excel, Chapter 01 Intro to Business Analytics BI 348, Chapter 01.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Outline Class Intros Overview of Course & Series Example Research Projects Beginning R.
A way to integrate IR and Academic activities to enhance institutional effectiveness. Introduction The University of Alabama (State of Alabama, USA) was.
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Nurissaidah Ulinnuha. Introduction Student academic performance ( ) Logistic RegressionNaïve Bayessian Artificial Neural Network Student Academic.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Kaggle Competition Prudential Life Insurance Assessment
Random Forests Ujjwol Subedi. Introduction What is Random Tree? ◦ Is a tree constructed randomly from a set of possible trees having K random features.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
PREDICTING SONG HOTNESS
Alteryx User Group May 25 th Agenda Introductions What is Alteryx? Community and learning Learning resources Alteryx community Online user groups.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Wake Technical Community College “Wake Tech” Largest community college in NC 70,000+ students a year attending.
Audit Analytics --An innovative course at Rutgers Qi Liu Roman Chinchila.
Introduction to Machine Learning, its potential usage in network area,
What Business Analytics Can Do For You!
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Product Overview.
Department of Supply Chain Management & Analytics Proposed Bachelor of Science (B.S.) in Business - Analytics Track Paolo Catasti, PhD, MBA, CSSBB Teaching.
A data visualization course for undergraduate data science students
Introduction to Machine Learning
Management Information Systems
MIS2502: Data Analytics Advanced Analytics - Introduction
21 Essential Data Visualization Tools
How to Select the Right Chart for Your Data
Business analytics Lessons from an undergraduate introductory course
A linear approach to predicting house prices
Three Using Visuals in Written and Oral Communication.
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
Advanced Analytics Using Enterprise Miner
Data Mining: Concepts and Techniques Course Outline
Machine Learning & Data Science
MIS5101: Data Analytics Advanced Analytics - Introduction
An Excel-based Data Mining Tool
David M. Levine, Baruch College (CUNY)
Understanding Your Numbers: Data Visualisation with PowerBI
CSc4730/6730 Scientific Visualization
Lecturer: Geoff Hulten TAs: Kousuke Ariga & Angli Liu
Prepared by: Mahmoud Rafeek Al-Farra
Workshop: JMP & R for Analytics Instruction
Analytics Plus Product Overview 1.
Experiences and Lessons Learned from UNC Wilmington
Analytics: Its More than Just Modeling
Course Lab Introduction to IBM Watson Analytics
Course Introduction CSC 576: Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
HEADLINE GOES HERE AND HERE Tableau Data Visualization Software
Exemplary Practices for Displaying Public Health Data
MIS2502: Data Analytics Introduction to Advanced Analytics
Analytics Plus Product Overview.
Business concentration, minor and certificate programs
Python for Data Analysis
Welcome! Knowledge Discovery and Data Mining
Information Visualization - Week 01
An Introduction to Data Science using Python
Palestinian Central Bureau of Statistics
Presentation transcript:

Data Analytics at CNU Dmitriy Shaltayev Associate Professor of Management Christopher Newport University

Introduction to Data Analytics BUSN 305 Undergraduate program only at the moment Four majors: ACCT, FINC, MGMT, MKTG Business core class (required for all majors, except ACCT) Replaced general Management Information Systems class in fall 2017 Class received favorable evaluations after first semester Material is still being tweaked around

Software Used Tried combination of JMP and R Selection was done in favor of JMP for an introductory class Tableau is used for visualizations Continuation of JMP from business statistics class Course is meant to be a “story telling with data” class Focus is on both informing students of popular data analytics techniques and effectively value of communicating results

BUSN 305 Course Content What is “big data”? After this introduction never using terms “big data”, working with “small data” Descriptive/diagnostic/predictive/prescriptive Focus is on the first three Visualization: Use JMP Focus is on Tableau Types of charts, focus on line, bar, bubble, boxplot, map (bubble and filled), treemap, tables/highlight tables/ heat maps Why pie charts are bad?

BUSN 305 Course Content Pre-attentive attributes Use of color in visualizations, color blindness Invited speaker to discuss effective oral presentations delivery techniques Supervised methods: Classification/regression trees Multiple regression Logistic regression Neural networks Unsupervised methods: Clustering (k-means, hierarchical)

Learning by Doing Four group projects (groups of 3) Project 1 – build charts with Tableau (line, bar, map, treemap), create a fully interactive dashboard Project 2 – use classification tree to predict delayed flights from IAD/DCA/BWI to JFK/LGA/EWR Introduced to idea of binning continuous variable Project 3 – predict movie box office based on budget, genre, cast composition. Choose three movies and predict their box office Project 4 – choose data set, create presentation, deliver it to class (10-15 minutes) Graded by both peers and instructor

Learning by Doing Each project should follow the structure: What is the issue at hand and why is that important Describe the data which was given and any data manipulations (binning) Any preliminary insights from data visualization Explain the choice of modeling tool selection Discuss model properties Which variables are important/significant How accurate is the model? Model applications examples

Advanced Data Analytics (MGMT 495) Issues relevant to data analytics How data are stored (database design principles) How data are retrieved (SQL “by hand”) What are the most popular data formats (.xlsx, .csv, .json, .xml) Data quality issues Outliers Missing values and imputation Skewed distributions

Advanced Data Analytics (MGMT 495) Introduction to Python programming Variables Built-in functions If-else For loops and while loops User-defined functions Importing libraries Most popular visualization libraries: pandas, seaborn, matplotlib

Advanced Data Analytics (MGMT 495) Advanced modeling techniques Random forest A/B testing Association rules Text mining Feature engineering

Where Is It Going? Plans are to split a current management concentration and create a “HR/Leadership” and “Decision Analytics” concentrations While introductory class is required for everyone, advanced analytics will be in “Decision Analytics” stream Trying to develop partnerships with companies Sentara Healthcare Booz Allen Hamilton Ferguson Enterprises