P REDICTING THE D AILY R ETURNS USING F INANCIAL Q UANTITATIVE D ATA AND ASX A NNOUNCEMENTS Zhendong Zhao ( ) Supervisor: Mark Johnson
O UTLINE Motivations Framework & formulation Dataset & features Experiments
M OTIVATIONS – T O PREDICT THE DAILY RETURNS Previous works Use either textual or financial quantitative features Our work Use heterogeneous features (both textual and financial quantitative features) Textual Features (Announcements, Financial News, etc.) Financial Quantitative Features (past daily returns, past trading value, etc.) The daily Returns
E XAMPLES
T HE DAILY RETURNS ( LOG )
O UTLINE Motivations Framework & formulation Dataset & features Experiments
P ROPOSED F RAMEWORK A regression model using heterogeneous features Textual Features (Announcements, Financial News, etc.) Regression Algorithms The daily Returns Financial Quantitative Features (past daily returns, past trading value, etc.)
F ORMULATION
R EGRESSION S OLVER
O UTLINE Motivations Framework & formulation Dataset & features Experiments
D ATASET & F EATURES
O UTLINE Motivations Framework & formulation Dataset & features Experiments
E XPERIMENTS Objectives: to find 1. The best features; Combined vs. individual features; 2. The best textual features; Unigram vs. sentiment features; 3. The best regression solver; Equal vs. unequal penalty factors on quantitative features.
O BJECTIVE 1 -- T HE BEST FEATURES C OMBINED VS. INDIVIDUAL FEATURES Textual Features (Announcements, Financial News, etc.) Quantitative Features (Panel data) Regression Algorithms The Stock Returns
O BJECTIVE 1 -- T HE BEST FEATURES C OMBINED VS. INDIVIDUAL FEATURES
O BJECTIVE 2 -- T HE BEST TEXTUAL FEATURES U NIGRAM VS. SENTIMENT FEATURES Unigram features (all words in corpus) Huge size of vocabulary (100,000 features) but sparse for each document Sentiment features (negative, positive, uncertainty) Smaller size of features But may loss information
O BJECTIVE 2 -- T HE BEST TEXTUAL FEATURES U NIGRAM VS. SENTIMENT FEATURES
O BJECTIVE 3 -- T HE BEST REGRESSION SOLVER E QUAL VS. UNEQUAL PENALTY FACTORS Quantitative Features (dense) Textual Features (sparse) Quantitative Features (dense) Textual Features (sparse)
O BJECTIVE 3 -- T HE BEST REGRESSION SOLVER E QUAL VS. UNEQUAL PENALTY FACTORS
Q & A Thanks!