USE OF DATA ANALYTICS TO PREDICT THE DEMAND OF BIKES

Slides:



Advertisements
Similar presentations
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Introduction to Data Mining with XLMiner
Neural Network Based Approach for Short-Term Load Forecasting
Load Forecasting Eugene Feinberg Applied Math & Statistics Stony Brook University NSF workshop, November 3-4, 2003.
Copyright 2013 John Wiley & Sons, Inc. Chapter 8 Supplement Forecasting.
April 11, 2008 Data Mining Competition 2008 The 4 th Annual Business Intelligence Symposium Hualin Wang Manager of Advanced.
Midterm Review. 1-Intro Data Mining vs. Statistics –Predictive v. experimental; hypotheses vs data-driven Different types of data Data Mining pitfalls.
U NIVERSITY OF M ASSACHUSETTS A MHERST Department of Computer Science 2011 Predicting Solar Generation from Weather Forecasts Using Machine Learning Navin.
INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.
Loan Default Model Saed Sayad 1www.ismartsoft.com.
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
1 FORECASTING ENERGY PRODUCT PRICES M.E. Malliaris Loyola University Chicago S.G. Malliaris Massachusetts Institute of Technology.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
Syllabus. We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book.
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Factor & Cluster Analyses. Factor Analysis Goals Data Process Results.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Bike Sharing Demand Prediction PRESENTED BY:- AKSHAY PATIL 14MCB1031 RESEARCH FACILITATOR: PROF. BVANSS PRABHAKAR RAO M.TECH 1 ST YEAR RBL.
Data Summit 2016 H104: Building Hadoop Applications Abhik Roy Database Technologies - Experian LinkedIn Profile:
DESIGN OF AN INNOVATIV E E N ERGY - AWARE IT ECOSYS T EM FOR MOTIVATING BEHAVIOU R AL CHANGES T O WARDS THE ADO P TION OF ENERGY EFFICIENT LIFEST Y LES.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Energy Consumption Forecast Using JMP® Pro 11 Time Series Analysis
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
CSE 4705 Artificial Intelligence
Predicting the performance of US Airline carriers
Principal Component Analysis (PCA)
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
LENDING CLUB LOAN ANALYSIS
PREDICTING Flight Delays
Decision Trees in Analytical Model Development
Belinda Boateng, Kara Johnson, Hassan Riaz
Kin 304 Regression Linear Regression Least Sum of Squares
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
CSE 4705 Artificial Intelligence
Predicting the Market Value of the Property Using JMP® Pro 11
Neural Networks for Machine Learning Lecture 1e Three types of learning Geoffrey Hinton with Nitish Srivastava Kevin Swersky.
BPK 304W Regression Linear Regression Least Sum of Squares
Prediction as Data Mining Task
Predict House Sales Price
Predicting Academic Performance of University Students
Using Data Analytics to Predict Liquor Sales in Iowa State
NBA Draft Prediction BIT 5534 May 2nd 2018
Vincent Granville, Ph.D. Co-Founder, DSC
Employee Turnover: Data Analysis and Exploration
IDENTIFYING BERNIE SANDERS’ VOTER BASE THROUGH PREDICTIVE ANALYTICS
Predicting Government Spending on Professional Services
Prepared by: Mahmoud Rafeek Al-Farra
Machine Learning with Weka
Analytics: Its More than Just Modeling
Course Introduction CSC 576: Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
How decision tree is derived from a data set
Somi Jacob and Christian Bach
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Chapter 7: Transformations
Artificial Intelligence 10. Neural Networks
Seasonal Forecasting Using the Climate Predictability Tool
Adequacy of Linear Regression Models
1 A Novel Multi-Target Prediction Framework and its Application to Flood Forecasting BY ANUSHA NEMILIDINNE CHRISTOPH F.EICK YUE CAO CHRISTARINY.
Cases. Simple Regression Linear Multiple Regression.
Machine Learning: Decision Tree Learning
Factor Analysis.
A Data Partitioning Scheme for Spatial Regression
Regression and Correlation of Data
Primer on Neural networks
What is Artificial Intelligence?
Presentation transcript:

USE OF DATA ANALYTICS TO PREDICT THE DEMAND OF BIKES BIT 5534: Applied Business Intelligence and Analytics Project report by: Surabhi Anurag, Trushit Vaishnav, Shikha Varkie Group 2

Business Objective To determine the demand for the bike rentals based upon the various parameters such as temperature, working day, humidity, weather, windspeed etc. Temperature, working day or holiday, month, weather,humidity,windspeed etc

Data Source and Description Kaggle.com : Bike Sharing Demand Dataset Total attributes : 16 Total observations in dataset : 17380 Training set: 10886 Validation set: 6494

Data Analysis Process Define Business Objective Data Collection / Selection Exploratory Data Analysis Data Preparation Data Transformation Data Modelling Model Selection Conclusion Exploratory: To access data Quality for instance if we have any missing values, outliers,or if we need any data transforamtions.

Exploratory Data Analysis

Correlation

Multiple Linear Regression

Principal Component Analysis

Decision Trees R Square RMSE N Number of Splits AICc Training 0.841   R Square RMSE N Number of Splits AICc Training 0.841 762.41202 593 20 9599.51 Validation 0.842 793.3215 135

Conclusion All models including Multiple Linear Regression, Principal Component Analysis, Clustering, Decision Trees and Neural Networks performed well. Our business objective is typical demand forecasting problem. Thus, we choose regression model as the final model because: Model performs decently. Business objective is supervised learning with dependent and independent variables clearly identified.