Syllabus. We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book.

Slides:



Advertisements
Similar presentations
Data Mining Classification: Alternative Techniques
Advertisements

Clementine Server Clementine Server A data mining software for business solution.
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Data Mining – Intro.
Chapter 5 Data mining : A Closer Look.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Peter Myers Bitwise Solutions Pty Ltd. Predictive Analytics PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned.
1 Chapter 1: Introduction 1.1 Introduction to SAS Enterprise Miner.
Chapter 1: Introduction
April 11, 2008 Data Mining Competition 2008 The 4 th Annual Business Intelligence Symposium Hualin Wang Manager of Advanced.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Anomaly detection with Bayesian networks Website: John Sandiford.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Machine Learning Lecture 11 Summary G53MLE | Machine Learning | Dr Guoping Qiu1.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Outline Class Intros Overview of Course & Series Example Research Projects Beginning R.
Time Series 1.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
A Few Handful Many Time Stamps One Time Snapshot Many Time Series Number of Variables Mobile Phone Galton Height Census Titanic Survivors Stock Market.
Statistical Tools for Solar Resource Forecasting Vivek Vijay IIT Jodhpur Date: 16/12/2013.
STAT 3610/5610 – Time Series Analysis Topics to be covered Chapters 1 – 8 Chapter 9 – Skip Chapters 10 – 12.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Loan Default Model Saed Sayad 1www.ismartsoft.com.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Classification Derek Hoiem CS 598, Spring 2009 Jan 27, 2009.
Overview of the final test for CSC Overview PART A: 7 easy questions –You should answer 5 of them. If you answer more we will select 5 at random.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression Regression Trees.
Finding Hidden Intelligence with Predictive Analysis of Data Mining Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd
Theoretic Frameworks for Data Mining Reporter: Qi Liu.
No Yes No Yes Longitudinal & Time Series Cross-Sectional & Panel Data PEW Mobile Phone Galton Children Height Census Text Sentiment Old Faithful Web Analytics.
1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
Main Themes Few vs. Many Variables Linear vs. Non-Linear Statistics vs. Machine Learning.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Stock market forecasting using LASSO Linear Regression model
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
Oracle Advanced Analytics
Software Defects Cmpe 550 Fall 2005
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining – Intro.
KAIR 2013 Nov 7, 2013 A Data Driven Analytic Strategy for Increasing Yield and Retention at Western Kentucky University Matt Bogard Office of Institutional.
Decision Trees in Analytical Model Development
Boosting and Additive Trees (2)
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
USE OF DATA ANALYTICS TO PREDICT THE DEMAND OF BIKES
Introduction to R Programming with AzureML
Data Mining Lecture 11.
Probabilistic Models for Linear Regression
Session 2:50 PM - 4:20 PM; Saturday, Nov 18, 2017
Dr. Morgan C. Wang Department of Statistics
Analytics: Its More than Just Modeling
Biointelligence Laboratory, Seoul National University
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Cases. Simple Regression Linear Multiple Regression.
Azure Machine Learning
FOUNDATIONS OF BUSINESS ANALYTICS Introduction to Machine Learning
Presentation transcript:

Syllabus

We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book

Probabilities Probability Distribution

Conditional Probability & Bayesian Networks

Linear Regression

More Regression Interaction (Non-Linear) Structural Equation Modeling Moderation Mediation Advanced Lasso Ridge Regularized

No Yes No Yes Longitudinal & Time Series Cross-Sectional & Panel Data PEW Mobile Phone Galton Children Height Census Stock Market Historical River Levels Old Faithful Web Analytics Titanic Survivors Bank Loans

plot(stl(beer,s.window="periodic")) Time Series

Datasets: Training and Test Develop Model Using Training Dataset and Apply to Test Data

Bank Loan

Decision Trees

Principle Components Analysis & Factor Analysis Here 13 variables are reduced to 4.

People Variables Cluster Analysis Customers are grouped by common characteristics

People Variables Variable/Dimension Reduction Principle Components Analysis & Factor Analysis

Tom Brady Not Tom Brady Machine Learning

Same Data, Different Algorithms

One aspect of Predictive Modeling is comparing the performance of various models towards then choosing the one which performs best

“Combine predictions from multiple, complementary models… one model’s strengths compensating for the weaknesses of others.” Ensembles of People and Approaches

Text Mining / Sentiment Analysis

Social Network Analysis

Conditional Probability & Bayesian Networks