Data Mining BS/MS Project Decision Trees for Stock Market Forecasting Presentation by Mike Calder.

Slides:



Advertisements
Similar presentations
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Advertisements

Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Part I Introduction to Data Mining by Tan,
Classification: Definition Given a collection of records (training set ) –Each record contains a set of attributes, one of the attributes is the class.
Indian Statistical Institute Kolkata
Risk & Return (Part 2) Lecture 06. EFFECTIVE ANNUAL RETURN:  The return on an investment expressed on a per-year, or “annualized,” basis.  Till now.
1 CHAPTER FOURTEEN FINANCIAL ANALYSIS OF COMMON STOCKS.
SLIQ: A Fast Scalable Classifier for Data Mining Manish Mehta, Rakesh Agrawal, Jorma Rissanen Presentation by: Vladan Radosavljevic.
Lecture Notes for Chapter 4 Introduction to Data Mining
ID3 Algorithm Abbas Rizvi CS157 B Spring What is the ID3 algorithm? ID3 stands for Iterative Dichotomiser 3 Algorithm used to generate a decision.
Stock Movement Prediction Deepathi Lingala Sathindra K. Kamepalli Sudhir K. V. Potturi.
More on Decision Trees. Numerical attributes Tests in nodes can be of the form x j > constant Divides the space into rectangles.
Mr. Perminous KAHOME, University of Nairobi, Nairobi, Kenya. Dr. Elisha T.O. OPIYO, SCI, University of Nairobi, Nairobi, Kenya. Prof. William OKELLO-ODONGO,
Statistics The systematic and scientific treatment of quantitative measurement is precisely known as statistics. Statistics may be called as science of.
© 2014 CY Lin, Columbia University E6893 Big Data Analytics – Lecture 4: Big Data Analytics Algorithms 1 E6893 Big Data Analytics: Financial Market Volatility.
Evaluation of Results (classifiers, and beyond) Biplav Srivastava Sources: [Witten&Frank00] Witten, I.H. and Frank, E. Data Mining - Practical Machine.
General Mining Issues a.j.m.m. (ton) weijters Overfitting Noise and Overfitting Quality of mined models (some figures are based on the ML-introduction.
Data Mining – Intro.
Ordinal Decision Trees Qinghua Hu Harbin Institute of Technology
Chapter 5 Data mining : A Closer Look.
Session 5: Demand forecasting
Data Mining BS/MS Project Clustering for Market Segmentation Presentation by Mike Calder.
Ensemble Learning (2), Tree and Forest
1 Machine Learning: Lecture 5 Experimental Evaluation of Learning Algorithms (Based on Chapter 5 of Mitchell T.., Machine Learning, 1997)
M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2009 COMP527: Data Mining Classification: Evaluation February 23,
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Data Mining BS/MS Project Bayesian Models for Estimating Software Quality Presentation by Mike Calder.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
A Comparison of Approaches to Investment Analysis John Favaro Proc. Fourth International Conference on Software Reuse, 1996, IEEE Computer Press, p
Algorithms: The Basic Methods Witten – Chapter 4 Charles Tappert Professor of Computer Science School of CSIS, Pace University.
Dr. Tucker Balch Associate Professor School of Interactive Computing Computational Investing, Part I 221: Intro to Machine Learning Find out how modern.
Computational Intelligence: Methods and Applications Lecture 19 Pruning of decision trees Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A data mining approach to the prediction of corporate failure.
Decision Trees Jyh-Shing Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.
Comparing Univariate and Multivariate Decision Trees Olcay Taner Yıldız Ethem Alpaydın Department of Computer Engineering Bogazici University
Machine Learning in Stock Price Trend Forecasting BY PRADEEP KUMAR REDDY MUSKU.
A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting Huang, C. L. & Tsai, C. Y. Expert Systems with Applications 2008.
IT Management Case # 8 - A Case on Decision Tree: Customer Churning Forecasting and Strategic Implication in Online Auto Insurance using Decision Tree.
Tanja Magoč, François Modave, Xiaojing Wang, and Martine Ceberio Computer Science Department The University of Texas at El Paso.
Pattern Discovery of Fuzzy Time Series for Financial Prediction -IEEE Transaction of Knowledge and Data Engineering Presented by Hong Yancheng For COMP630P,
MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.
1 CSI5388 Current Approaches to Evaluation (Based on Chapter 5 of Mitchell T.., Machine Learning, 1997)
ALGORITHMICTRADIN G 07/02/12. S&P 500 Futures Daily chart.
Konstantina Christakopoulou Liang Zeng Group G21
Stock Market Prediction Using Machine Learning By Conor Carey Advisors – Professor Cass and Professor Yaisawarng.
Soft Computing methods for High frequency tradin.
Classification and Prediction: Ensemble Methods Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
GRAHAM DODD INVESTOR RATIO. The strategy of selecting stocks that trade for less than their intrinsic values. Value investors actively seek stocks that.
Classification and Regression Trees
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
DECISION TREES Asher Moody, CS 157B. Overview  Definition  Motivation  Algorithms  ID3  Example  Entropy  Information Gain  Applications  Conclusion.
INTRODUCTION RSI- RELATIVE STRENGTH INDEX MOVING AVERAGE BOLLINGER BANDSSUMMARY.
Stock market forecasting using LASSO Linear Regression model
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
Data Mining – Clustering and Classification 1.  Review Questions ◦ Question 1: Clustering and Classification  Algorithm Questions ◦ Question 2: K-Means.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Data Mining With SQL Server Data Tools Mining Data Using Tools You Already Have.
DEMAND FORECASTING & MARKET SEGMENTATION. Why demand forecasting?  Planning and scheduling production  Acquiring inputs  Making provision for finances.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
The Value of Common Stocks
Some Lessons from Capital Market History
NBA Draft Prediction BIT 5534 May 2nd 2018
What is Pattern Recognition?
Machine Learning Week 1.
Regression Computer Print Out
Prepared by: Mahmoud Rafeek Al-Farra
Mean Reverting Asset Trading
Ninja Trader: Introduction to data mining in financial applications
Data Science introduction.
Presentation transcript:

Data Mining BS/MS Project Decision Trees for Stock Market Forecasting Presentation by Mike Calder

Decision Trees Used for stock market forecasting –Classification trees –Regression trees Analysts attempt to predict the value of a given stock at some point in the future The methods can also be used to predict trends in the stock market as a whole 2

Motivation Accurate predictions in the stock market allow investing companies to thrive Identifying attributes that correlate with success in the stock market may lead to finding causation –If causes of success can be controlled, the economy can be pushed in a good direction 3

Stock Market Challenges Training set can be very large –All stock data over a period of time Predicting attributes tend to be binarized –like we saw when using the ID3 algorithm The target attribute (increase/decrease in stock value) can be numeric or nominal 4

Sample Decision Tree 5 VAR1 – if a stock’s average true range (average max-min) is greater than the stock’s moving average (average of end-of-day prices) VAR3 – if a stock’s end-of-day price is greater than the 50-day moving average VAR4 – if a stock’s end-of-day price divided by the daily range (max-min) is > 0.5 VAR5 – if a stock’s auto-correlation (degree of correlation with respect to its previous values) return has been > 0 for 5 days Taken from (Trader, 2014)

Processing Used Correlation based feature selection Numeric attribute binarization Target regression calculation N-fold cross-validation (usually n=10) Decision tree pruning 6

Sample Results 7 The red lines show real data for the 4 stocks, black lines show the decision tree’s predictions (y-axis represents the increase/decrease in stock value by percentage) Taken from (Trader, 2014)

Additional Complexity Advanced techniques can be combined with decision tree construction –Hierarchical hidden Markov model (HHMM) has been combined with decision trees for stock trend prediction –Other machine learning algorithms have been used with decision tree methods to forecast stock market changes as well 8

References R. Trader. “Using CART for Stock Market Forecasting”. Data Science and Trading Strategies R. Trader. “Using CART for Stock Market Forecasting”. Data Science and Trading Strategies S. Tiwara. “Predicting future trends in stock market by decision tree rough-set based hybrid system with HHMM”. International Journal of Electronics and Computer Science Engineering S. Tiwara. “Predicting future trends in stock market by decision tree rough-set based hybrid system with HHMM”. International Journal of Electronics and Computer Science Engineering T. Zhang. “Stock Market Forecasting Using Machine Learning Algorithms”. Stanford University T. Zhang. “Stock Market Forecasting Using Machine Learning Algorithms”. Stanford University