Stock market forecasting using LASSO Linear Regression model

Slides:



Advertisements
Similar presentations
Stock Price Prediction Based on Social Network A survey Presented by: CHEN En.
Advertisements

Division of Operation and Maintenance Engineering Wear prediction of grinding mill liners Farzaneh Ahmadzadeh, Jan Lundberg
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Application of Stacked Generalization to a Protein Localization Prediction Task Melissa K. Carroll, M.S. and Sung-Hyuk Cha, Ph.D. Pace University, School.
An Overview of Machine Learning
R ADIAL BASIS ARTIFICIAL NEURAL NETWORK MODELS FOR PREDICTING SOLUBILITY INDEX OF ROLLER DRIED GOAT WHOLE MILK POWDER Sumit Goyal G. K. Goyal Sumit Goyal.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Chapter 3 Forecasting McGraw-Hill/Irwin
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Mr. Perminous KAHOME, University of Nairobi, Nairobi, Kenya. Dr. Elisha T.O. OPIYO, SCI, University of Nairobi, Nairobi, Kenya. Prof. William OKELLO-ODONGO,
Chapter 5 Data mining : A Closer Look.
Classification and Prediction: Regression Analysis
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
The Stock Market Forecasting and Risk Management System using Genetic Programming Li Wang Ross School of Business Ann Arbor, MI
NEURAL NETWORKS FOR TECHNICAL ANALYSIS: A STUDY ON KLCI 授課教師:楊婉秀 報告人:李宗霖.
Data Mining BS/MS Project Decision Trees for Stock Market Forecasting Presentation by Mike Calder.
Data Mining Chun-Hung Chou
Support Vector Machine Applications Electrical Load Forecasting ICONS Presentation Spring 2007 N. Sapankevych 20 April 2007.
Chapter 2 – Business Forecasting Takesh Luckho. What is Business Forecasting?  Forecasting is about predicting the future as accurately as possible,
Crowdsourcing Predictors of Behavioral Outcomes. Abstract Generating models from large data sets—and deter¬mining which subsets of data to mine—is becoming.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Shrinkage Estimation of Vector Autoregressive Models Pawin Siriprapanukul 11 January 2010.
11 CSE 4705 Artificial Intelligence Jinbo Bi Department of Computer Science & Engineering
Data Mining Techniques in Stock Market Prediction
AUTHORS: ASAF SHABTAI, URI KANONOV, YUVAL ELOVICI, CHANAN GLEZER, AND YAEL WEISS "ANDROMALY": A BEHAVIORAL MALWARE DETECTION FRAMEWORK FOR ANDROID.
A* Lasso for Learning a Sparse Bayesian Network Structure for Continuous Variances Jing Xiang & Seyoung Kim Bayesian Network Structure Learning X 1...
19 October 2015All rights reserved, Edward Tsang & Serafin Martinez jaramillo CHASM Co-evolutionary Heterogeneous Artificial Stock Markets Serafín Martínez.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting Huang, C. L. & Tsai, C. Y. Expert Systems with Applications 2008.
Jennifer Lewis Priestley Presentation of “Assessment of Evaluation Methods for Prediction and Classification of Consumer Risk in the Credit Industry” co-authored.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression Regression Trees.
Syllabus. We covered Regression in Applied Stats. We will review Regression and cover Time Series and Principle Components Analysis. Reference Book.
Financial Time Series Analysis with Wavelets Rishi Kumar Baris Temelkuran.
Patch Based Prediction Techniques University of Houston By: Paul AMALAMAN From: UH-DMML Lab Director: Dr. Eick.
CpSc 881: Machine Learning
Hybrid Load Forecasting Method With Analysis of Temperature Sensitivities Authors: Kyung-Bin Song, Seong-Kwan Ha, Jung-Wook Park, Dong-Jin Kweon, Kyu-Ho.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Soft Computing methods for High frequency tradin.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Forecasting is the art and science of predicting future events.
Financial Data mining and Tools CSCI 4333 Presentation Group 6 Date10th November 2003.
A Decision Tree Classification Model For Determining The Location For Solar Power Plant A PRESENTATION BY-  DISHANT MITTAL  DEV GAURAV VIT UNIVERSITY,VELLORE.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Neural Networks Lecture 4 out of 4. Practical Considerations Input Architecture Output.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Managerial Decision Modeling 6 th edition Cliff T. Ragsdale.
DEMAND FORECASTING & MARKET SEGMENTATION. Why demand forecasting?  Planning and scheduling production  Acquiring inputs  Making provision for finances.
Asymmetries in Forecasting Energy Product Prices APF Conference, Deutsche Bundesbank Frankfurt, Germany 2004 M. E. Malliaris Loyola University Chicago.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
Introduction to Machine Learning, its potential usage in network area,
John A. Paulson School of Engineering and Applied Sciences
Market Intelligence Analysis
Discussion/Presentation of Park and Basu: “Alternative Evaluation Metrics for Risk Adjustment Models” Stephen P. Ryan, Olin.
Artificial neural networks
Mechanical Engineering Haldia Institute of Technology
An Investigation of Market Dynamics and Wealth Distributions
CSE 4705 Artificial Intelligence
2. Industry 4.0: novel sensors, control algorithms, and servo-presses
Mid Semester UEB Energy Analysis of a Wastewater Treatment Plant by using Artificial Neural Network Presented by: Dr. Nor Azuana Ramli Electrical Engineering.
Li Wang Ross School of Business Ann Arbor, MI 48105
L. Isella, A. Karvounaraki (JRC) D. Karlis (AUEB)
Dr. Morgan C. Wang Department of Statistics
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Jong-Min Kim1 and Hojin Jung2
Using Machine Learning to Analyze Serial Killer Patterns
Presentation transcript:

Stock market forecasting using LASSO Linear Regression model

Abstract The forecasting of stock price movement in general is considered to be a thought-provoking and essential task for financial time series' exploration. In this paper, a Least Absolute Shrinkage and Selection Operator (LASSO) method based on a linear regression model is proposed as a novel method to predict financial market behavior. LASSO method is able to produce sparse solutions and performs very well when the numbers of features are less as compared to the number of observations. Experiments were performed with Goldman Sachs Group Inc. stock to determine the efficiency of the model.

Why is prediction of stock market crucial ? Stock market helps in flourishing commerce and industry that ultimately has an effect on the country's economy. Moreover, it provides a generic platform for the sellers and buyers of stocks listed on the stock market. The buyers and sellers are basically retail and institutional investors. If the stock’s future price can be predicted, it can resist the significant losses and can certainly increase the profits

Related Works J. L. Ticknor proposed a Bayesian Regularized Artificial Neural Network as the prediction model. Daily market prices and financial technical indicators were used as inputs. This technique reduces the potential for over-fitting and over-training and hence improves the quality of prediction. S. Deng, Takashi M., Kei S., Tatsuro S., and Akito Sakurai proposed a prediction model which extracts features from time series data and social networks. Using features such as frequency of news and comments, a multiple kernel learning regression framework was used as the model. Paul D. Yoo, Maria H. Kim, and Tony Jan made a survey of various machine learning techniques for the task of prediction. They concluded that incorporating event information with prediction model plays important role for accurate prediction.

Related Works M. P. Naeini, et al. proposed neural networks for prediction. They used two kinds of neural networks. The first being a feed-forward multi-layer perceptron(MLP) and the other being an Elman recurrent network to predict the stock price. They inferred that the latter can predict the direction of the changes of the stock value better than MLP. S. Aseervatham, et al. proposed a new model selection method which tried to approach the Ridge regression method’s results. This method first computes the Ridge solution and then performs feature selection. B. B. Nair, et al. proposed an automated decision tree-adaptive neuro-fuzzy hybrid stock market prediction system. It used technical analysis for feature extraction and decision tree for feature selection.

Proposed model – LASSO model Least Absolute Shrinkage and Selection Operator (LASSO) is based on linear regression. It is able to produce sparse solutions and performs very well when the number of features are less as compared to the number of observations. For a general linear regression model, the cost function 𝐽 w = 1 𝑁 𝑖=0 𝑁 ℎ x 𝑖 − 𝑦 𝑖 2 = 1 𝑁 Xw−y 2 For LASSO we add a l1-norm to the cost function 𝐽 w = 1 𝑁 𝑖=0 𝑁 ℎ x 𝑖 − 𝑦 𝑖 2 +𝜆 𝑗=1 𝑑 𝑤 𝑗 2 = 1 𝑁 Xw−y 2 + 𝜆 𝑗=1 𝑑 𝑤 𝑗 2

Data Preparation Research data utilized for prediction was gathered for Goldman Sachs Group, Inc. The total number of instances composed of 3686 trading days. The features taken into consideration are, Date Opening Price High Price Low Price Close Price Trading Volume The target variable is “Adjusted Close Price”.

Data Preparation Two kinds of analysis were performed on the data set: A comparison study for analyzing the model which we have proposed, with the Ridge regression model. The Ridge model incorporates regularization with the original least squares approach. For this, all the available instances were utilized. Analysis of our algorithm with Bayesian Regularized Artificial Neural Networks as proposed by J.N. Ticknor in [6]. Here, identical trading days (as taken in [6]) were taken as the instances.

Evaluation metrics The performance of LASSO and Ridge linear regression method was measured by computing RMSE and MAPE Root mean squared error(RMSE) RMSE = 𝑖=1 𝑛 ( 𝑦 𝑖 − 𝑝 𝑖 ) 2 𝑛 Mean Absolute Percentage Error(MAPE) MAPE = 𝑖=1 𝑛 | 𝑦 𝑖 − 𝑝 𝑖 | 𝑦 𝑖 𝑛 ×100%

Results

Evaluation on full data set between Ridge and LASSO Results Method Training RMSE Test RMSE Training MAPE Test MAPE Ridge 1.7648 3.2272 1.3028 1.8065 LASSO 1.1403 2.5401 0.9304 1.4726 Evaluation on full data set between Ridge and LASSO

Results LASSO Ridge

Bayesian Regularized ANN Results Method Training MAPE (%) Testing MAPE (%) Bayesian Regularized ANN 1.5235 1.3291 LASSO 0.1806 0.6869 Comparison with J.L. Ticknor’s method

Conclusion The results indicate that the proposed model outperforms Ridge linear regression model and Bayesian Regularized ANN model. The technique proposed here is a different approach which can provide traders a dependable technique for estimating future prices.

THANK YOU