Published by Myron Taylor. Modified over 8 years ago.
1
Statistical Arbitrage. Ying Chen, Leonardo Bachega, Yandong Guo, Xing Liu. February 2010
2
Outline: Overview of the project. Improvements in the last week: speed up the data access; improve the PCA algorithm; use adjusted price in PNL calculation; take trading volume into account. Future work.
3
Framework (diagram labels): raw historical data from WRDS; data pre-processing (Python scripts); adjusted stock price series + indices; back-testing simulations (MATLAB scripts); market model (60-day returns, 252-day returns); PCA eigenportfolios; ETFs for industry sectors; residual process model; residuals as increments of an AR process; compute s-scores; current stock prices; signal trade orders.
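The "residuals as increments of an AR process" and "compute s-scores" steps can be illustrated with a minimal sketch. It assumes an AR(1) model for the cumulative residual and the usual definition of the s-score as the distance from the equilibrium level in units of the equilibrium standard deviation. The function name `s_score` and the fitting details are illustrative assumptions, not taken from the deck.

```python
import numpy as np

def s_score(residuals):
    """Return the s-score of the last point of the cumulative residual process.

    Assumption: the cumulative residual X_k follows an AR(1) model
    X_{k+1} = a + b * X_k + noise, with 0 < b < 1 (mean reversion).
    """
    x = np.cumsum(np.asarray(residuals, dtype=float))
    # Least-squares fit of X_{k+1} on X_k (slope b, intercept a).
    b, a = np.polyfit(x[:-1], x[1:], 1)
    if not 0.0 < b < 1.0:
        raise ValueError("fitted AR(1) is not mean-reverting")
    noise = x[1:] - (a + b * x[:-1])
    m = a / (1.0 - b)                                # equilibrium level
    sigma_eq = np.sqrt(noise.var() / (1.0 - b**2))   # equilibrium std. dev.
    return (x[-1] - m) / sigma_eq
```

A strongly negative s-score would suggest the residual is below its equilibrium level and a strongly positive one that it is above; the thresholds used to signal trade orders are not given in the slides.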
4
Code speedup. Data access tradeoff: always reading from disk is very slow; keeping everything in memory is not robust and can be slow. Approach: cache parts of the dataset in memory. Before/after chart (labels: "Fast code", "Same"): total speedup > 16 times.
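One way to cache parts of a dataset in memory is a small least-recently-used (LRU) cache in front of the slow loader. The sketch below is only an illustration of that idea: the class `ChunkCache`, the `fake_load` stand-in, and the choice of LRU eviction are assumptions, since the slides do not say how the cache was implemented.

```python
from collections import OrderedDict

class ChunkCache:
    """Keep at most `capacity` recently used chunks in memory (LRU policy)."""

    def __init__(self, load_chunk, capacity=4):
        self._load_chunk = load_chunk      # slow function, e.g. a disk read
        self._capacity = capacity
        self._chunks = OrderedDict()
        self.misses = 0                    # number of calls to the slow loader

    def get(self, key):
        if key in self._chunks:
            self._chunks.move_to_end(key)  # mark as most recently used
            return self._chunks[key]
        self.misses += 1
        chunk = self._load_chunk(key)
        self._chunks[key] = chunk
        if len(self._chunks) > self._capacity:
            self._chunks.popitem(last=False)  # evict least recently used
        return chunk


# Hypothetical stand-in for a disk read; a real loader would parse a file.
def fake_load(ticker):
    return [ticker] * 3

cache = ChunkCache(fake_load, capacity=2)
for ticker in ["AAA", "AAA", "BBB", "CCC"]:
    cache.get(ticker)
```

Bounding the cache size keeps memory use predictable, while repeated requests for the same chunk avoid the disk.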
5
PCA improvement (1/4). Suppose X is an n×p matrix with n samples and p features. Original algorithm: calculate the eigendecomposition of the correlation matrix, $X^T X = Q D Q^T$ (columns of X standardized; the factor $1/(n-1)$ does not change the eigenvectors). The matrix Q consists of the eigenvectors of the correlation matrix.
6
PCA improvement (2/4). Suppose X is an n×p matrix with n samples and p features. Substituted algorithm: use the singular value decomposition (SVD), $X = U \Sigma V^T$, to get the eigenvectors. Then V consists of the eigenvectors of the correlation matrix. This reduces the computational complexity by around 80%.
7
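PCA improvement (3/4). Proof: with $X = U \Sigma V^T$, we have $X^T X = V \Sigma^T U^T U \Sigma V^T = V \Sigma^T \Sigma V^T$. Since U and V are orthogonal ($U^T U = I$, $V^T V = I$), V consists of the eigenvectors of the correlation matrix, and $\Sigma^T \Sigma$ equals the diagonal matrix D.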
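The equivalence of the two routes can be checked numerically. The sketch below uses synthetic data and NumPy; the matrix sizes and variable names are illustrative assumptions, and it makes no attempt to reproduce the reported reduction in computation.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 252, 20                      # illustrative sizes, not the deck's
returns = rng.normal(size=(n, p))   # synthetic stand-in for stock returns

# Standardize columns so that X^T X / (n - 1) is the correlation matrix.
X = (returns - returns.mean(axis=0)) / returns.std(axis=0, ddof=1)

# Original route: eigendecomposition of the p x p correlation matrix.
C = X.T @ X / (n - 1)
eigvals, Q = np.linalg.eigh(C)            # eigenvalues in ascending order
eigvals, Q = eigvals[::-1], Q[:, ::-1]    # reorder to descending

# Substituted route: SVD of X; the rows of Vt are the eigenvectors.
U, s, Vt = np.linalg.svd(X, full_matrices=False)
svd_eigvals = s**2 / (n - 1)

# Eigenvectors agree up to sign: |q_i . v_i| should be 1 for every i.
alignment = np.abs(np.sum(Q * Vt.T, axis=0))
```

The squared singular values divided by n - 1 match the eigenvalues of the correlation matrix, and each column of Q matches the corresponding row of Vt up to a sign.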
8
PCA improvement (4/4). Note: if $v$ is an eigenvector of the correlation matrix $C$, then $-v$ is also an eigenvector, since if $C v = \lambda v$, then $C(-v) = \lambda(-v)$. The effect of the sign flip can be removed by the estimation.
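A short check of the sign ambiguity, together with one common way to make eigenvector signs reproducible, is sketched below. The convention used (flip each eigenvector so its entries sum to a non-negative number) and the helper name `fix_signs` are assumptions for illustration; the slides only state that the estimation removes the effect.

```python
import numpy as np

def fix_signs(V):
    """Flip each column so that its entries sum to a non-negative number."""
    signs = np.sign(V.sum(axis=0))
    signs[signs == 0] = 1.0      # leave columns with zero sum unchanged
    return V * signs

C = np.array([[2.0, 1.0],
              [1.0, 3.0]])       # small symmetric example matrix
w, V = np.linalg.eigh(C)         # columns of V are eigenvectors

# Both V and -V satisfy the eigenvector equation column by column.
both_valid = np.allclose(C @ V, V * w) and np.allclose(C @ (-V), (-V) * w)
```

With a fixed convention, two runs that return $v$ and $-v$ end up with the same eigenvector.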
9
Experiment result (Fig. 1): Top 50 eigenvalues of the correlation matrix of market returns computed on May 1, 2007, estimated using a 1-year window and a universe of 1590 stocks.
10
Experiment result 2: Value of the first eigenvector
11
Experiment result 2: Value of the second eigenvector
12
Experiment result 2: Value of the third eigenvector
13
Preliminary PNL experiment: Dec-13-1994 to Feb-12-1998
14
After correction
15
Taking trading volume into account. Problem: mean-reversion strategies are sensitive to the trading volume immediately before the signal is triggered. Modified returns: daily returns are rescaled using the average daily trading volume over a given trading window. Experiments: PCA/ETF with actual prices vs. using trading volume.
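The formula for the modified returns did not survive in this transcript. A minimal sketch of one common trading-time rescaling is given below, in which each daily return is multiplied by the trailing average daily volume divided by that day's volume. The exact formula, the window length, and whether the window includes the current day are assumptions here, as is the function name `volume_adjusted_returns`.

```python
import numpy as np

def volume_adjusted_returns(returns, volumes, window=10):
    """Scale each return by (trailing average volume) / (that day's volume).

    The first `window` entries are NaN because no full trailing window exists.
    """
    returns = np.asarray(returns, dtype=float)
    volumes = np.asarray(volumes, dtype=float)
    adjusted = np.full(returns.shape, np.nan)
    for t in range(window, len(returns)):
        avg_volume = volumes[t - window:t].mean()   # the `window` days before t
        adjusted[t] = returns[t] * avg_volume / volumes[t]
    return adjusted

example = volume_adjusted_returns(
    returns=[0.01, 0.01, 0.01, 0.01, 0.01],
    volumes=[100, 100, 100, 100, 200],
    window=3,
)
```

Under this rescaling, a return realized on unusually heavy volume is shrunk and one realized on light volume is amplified.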
16
Top 50 eigenvalues of the correlation matrix (trading time): computed from market returns on May 1, 2007, estimated using a 1-year window and a universe of 1590 stocks.
17
Top 50 eigenvalues of the correlation matrix (calendar time): computed from market returns on May 1, 2007, estimated using a 1-year window and a universe of 1590 stocks.
18
Value of the first eigenvector
19
Future work: Experiment on ETFs: associate each stock with one ETF; compare ETF with PCA. Take into account transaction fees, interest, and dividends. Calculate PCA using trading-time modified returns.
20
THANK YOU