Presentation is loading. Please wait.

Presentation is loading. Please wait.

© APT 2006 ICA And Hedge Fund Returns Dr. Andrew Robinson APT Program Trading Techniques and Financial Models for Hedge Funds June 27 th, 2007.

Similar presentations


Presentation on theme: "© APT 2006 ICA And Hedge Fund Returns Dr. Andrew Robinson APT Program Trading Techniques and Financial Models for Hedge Funds June 27 th, 2007."— Presentation transcript:

1 © APT 2006 ICA And Hedge Fund Returns Dr. Andrew Robinson APT Program Trading Techniques and Financial Models for Hedge Funds June 27 th, 2007

2 © APT 2006 Overview of Talk  Preliminaries – Why Use a Factor Model?  Model Specification & It’s consequences  ICA vs. PCA  Simulation-Based Tests  Sample Analysis  Conclusions

3 © APT 2006 Factor Models: Motivation  In practice, factor models are employed across all asset classes  Why?  Fundamental Issue: Lack of sufficient and reliable data. ‘The Curse of Dimensionality’  Other motivations:  Increased Flexibility in Scenario Analysis / Risk Attribution  Performance Analysis  Stress Testing  Computational Efficiency  Parsimonious Framework for Advanced Analysis  Derivative Pricing  Risk

4 © APT 2006 Long-short optimization on TOPIX Advantages of Factor Models For VAR: – Short history securities – Best of Breed Stress Testing – Superior correlation forecasts, – and… PREDICTEDREALIZED 99% VAR (Historical) 99% VAR (APT) % OF VIOLATIONS (Historical VAR) % OF VIOLATIONS (APT VAR) 20050601-0.17%-0.58%6.15%0.00% 20050831-0.17%-0.76%20.00%0.00% 20051130-0.17%-0.85%27.69%0.00% 20060301-0.16%-0.51%9.23%0.00% 20060531-0.12%-1.51%43.94%0.00% AVERAGE-0.16%-0.84%21.40%0.00% What’s Wrong With Historical VAR? 4

5 © APT 2006 Theoretical Interpretation Consider the standard expression for the efficient frontier (work in units of unit variance for simplicity) Consider the same expression, but in terms of the eigenvectors

6 © APT 2006 Model Specification  Consider the following returns-generating process:  Suppose that instead of observing the true latent variables, we observe another variable, namely:  Effect on  depends on assumptions made for dynamics of error term  But in any case:

7 © APT 2006 The Effects of Measurement Error  Suppose (in general things will be more complicated) that the measurement error is uncorrelated with the true latent variables…  Consider the asymptotic estimates of  :  In this case, one can show that:

8 © APT 2006 The Effect of Specification Error If a proxy is employed or a variable is dropped from a multivariate regression, all betas will be affected Relationship between omitted variables bias and measurement error. Compare, for simplicity, the vectors of biases in the cases of a single dropped and a single noisy regressor Where

9 © APT 2006 Extracting “True” Drivers The Arbitrage Pricing Theory But what are the “true” drivers of security market returns? Traditional statistical models employ techniques based on, for example, Principle Components Analysis or Factor Analysis In this case, latent variables have no clear interpretation Can we interpret them?

10 © APT 2006 Risk Attribution  We have argued that there are some advantages to employing a purely statistical framework.  These include:  The ability to avoid Specification Error  Flexibility for the incorporation of new asset classes/data  But what about interpretation?  Is there a way to express the results in terms of “real-world” variables?  There are many methodologies. Here we mention three:  Standard “Position-Based” Approach (and refinements) based on marginal risks  Transformation methods based on time series data  Transformation methods based on fundamental data

11 © APT 2006 Coordinate transformations

12 © APT 2006 Systematic Risk Breakdown

13 © APT 2006 Ambiguities in Principle Components? Eigenvalue Decomposition of Covariance Matrix: “Whitening” Transformation But under any orthogonal transformation, we have

14 © APT 2006 The “Cocktail Party” Problem

15 © APT 2006 Blind Source Separation Can we extract signals based only on the fact that we know they have certain properties? Fat Tails (kurtosis, higher moments) Autocorrelation Yes! Principle Components Analysis Independent Components Analysis Second Order Blind Identification Back and Weigend (1997) were the first to apply ICA to financial data Useful Adjunct to PCA Recent analyses (e.g. Korizis, et. al. (2007)) Have begun to reconsider application of modified, non-independent factorisation algorithms taking into account time series dynamics.

16 © APT 2006 Independent Component Analysis Technique employed for tackling the so-called ‘cocktail party problem’ Searching for a set of latent variables for whom the joint probability distribution ‘factorises’ Alternatively, we could formulate this in terms of the moments of the latent variables…

17 © APT 2006 ICA By Non-Gaussianity Κ = 11.3 Κ = 13.8Κ = 6.7 += Can we employ the central limit theorem to our advantage? Consider sums of indepdendent GARCH(1,1) processes

18 © APT 2006 ICs via Kurtosis? First step, pre-processing or “whitening” to generate z. Consider linear combinations of z, s, such that some measure of non-Gaussianity, namley f(s) is maximized. And optimize by gradient ascent for each signal…

19 © APT 2006 ICs via Negentropy Consider Entropy of a Random Variable Well known fact that this is maximal for Gaussian variable. As such, maximize “negentropy” Statistically-speaking negentropy provides “optimal” discrimination and detection of NonGaussianity

20 © APT 2006 Simple GARCH Test Case We generate 1000 returns employing the following two proceses With Variances Defined By and Apply mixing with random matrix M to generate observed signals: Can ICA extract the original signals?

21 © APT 2006 Return Distributions of Signals

22 © APT 2006 Distributions of Independent Components

23 © APT 2006 Scatter Plot

24 © APT 2006 GARCH Tests Extend this analysis in order to accommodate multiple GARCH series Try five independent GARCH times series and one Gaussian Apply Random Mixing To Generate Observed Signals Sources (Latent)Signals (Observed)

25 © APT 2006 GARCH Tests Compare Extracted and Original Signals Amplitudes and Signs Arbitrary Sources (Latent)Components (Extracted)

26 © APT 2006 GARCH Tests Compare Correlations Between Sources and Recovered Components IC1IC2IC3IC4IC5IC6 G1-0.050.01-0.06-0.040.00 G2-0.05-0.14-0.940.27-0.05-0.15 G30.08-0.01-0.31-0.940.11-0.01 G4-0.02-0.090.050.120.99-0.04 G50.06-0.990.070.00-0.060.03 G6-0.02-0.01-0.130.060.100.99

27 © APT 2006 Common Feature Extraction Generate Six Time Series With the g i defined as before and

28 © APT 2006 Common Feature Extraction N=0.1IC 1IC 2IC 3IC 4IC 5IC 6 g10.990.000.060.04-0.040.02 g20.05-0.94-0.31-0.100.010.00 g3-0.09-0.350.920.12-0.07-0.05 N=0.25IC 1IC 2IC 3IC 4IC 5IC 6 g1-0.98-0.010.07-0.040.04-0.02 g2-0.05-0.92-0.320.110.01-0.01 g30.11-0.360.90-0.110.080.09

29 © APT 2006 Common Feature Extraction N=0.5IC 1IC 2IC 3IC 4IC 5IC 6 g1-0.930.040.09-0.05-0.040.01 g2-0.040.86-0.370.11-0.050.06 g30.130.400.82-0.10-0.09-0.17 N=1.0IC 1IC 2IC 3IC 4IC 5IC 6 g1-0.780.070.050.03-0.050.08 g2-0.020.62-0.45-0.29-0.10-0.05 g30.130.520.530.27-0.130.07

30 © APT 2006 Interpretation of the Mixing Matrix Interpret Mixing Matrix, A, for N=0.5

31 © APT 2006 ICA-Based Clustering Very recently, there have been proposals for ICA-based clustering Yu & Wu (2006) employ a heuristic-based approach based on thresholded elements of the mixing matrix Instead, we employ the L1 norm: L1 preserves sensitivity to factor rotation! Combine with (standard) clustering algorithms (KMeans, etc…).

32 © APT 2006 Interpretation of the Mixing Matrix via Clustering Garch #1 Garch #2 Garch #3

33 © APT 2006 Reality Check Dataset We can confirm the results for a standard “reality check” dataset Whether by linkage-based techniques or K-Means, we obtain “correct” clusters Not employing time series information (in particular autocorrelation) Reduction in dimensionality…

34 © APT 2006 Does ICA work in all cases? Try regular signals without excess kurtosis – such as sinusoids Sources (Latent)Signals (Observed)

35 © APT 2006 Time Series-Based Methods Compare Results for two different methods of signal separation ICA-based techniques do not, by default, employ time series information Techniques such as SOBI (Second-Order Blind Identification) do, however. (see Korzis, et al. for a related study!) ICASOBI (AR(2))

36 © APT 2006 ICA and HFRX Index Returns As a first attempt, can apply ICA to returns of HFRX indices Looking for dependence on particular components (or combinations) LoadingsTime Series (Mar 2000 to Feb 2007)

37 © APT 2006 ICA-Based Clustering

38 © APT 2006 AutoCorrelation in HFRX Index Returns Do we see autocorrelation in the returns – is there a time structure in the return or are they temporally independent? ICA Doesn’t Consider Temporal Relations HF AutocorrelationsIC Autocorrelations

39 © APT 2006 Second-Order Blind Analysis What happens if we explicitly consider autocorrelations? Results for joint diagonalization of order 2 serial correlations… LoadingsTime Series (Mar 2000 to Feb 2007)

40 © APT 2006 SOBI-Based Clustering

41 © APT 2006 AutoCorrelation in HFRX Index Returns SOBI performs a joint decomposition based on the autocorrelation structure of the candidate signals HF AutocorrelationsSOBI Autocorrelations

42 © APT 2006 Conclusions Correct Specification of factor models is vital Interpretation of Statistical Factors and Higher-Order Effects Latent Variable Estimation and Study Blind Source Separation Non-Gaussianity Time-Series Models Future Directions Out of Sample Tests of Higher Moments Simulation / Risk Modelling

43 © APT 2006 References Hyvarinen, et al., Independent Component Analysis, Wiley, Ney York (2001). A.D. Back and A.S. Weigend, “A first application of independent component analysis to extracting structure from stock returns”, Int. Journal of Neural Systems, Vol. 8, No. 4, Aug. 1997, pp. 473-484. A. Beloucharnai, et al., “A blind source separation technique based on second order statistics”. IEEE Trans. On Signal Processing, Vol. 45, No. 2, pp. 434-444, (1997). H. C. Wu a; Philip L. H. Yu, “A robust and scalable clustering model for time series via independent component analysis.”, International Journal of Systems Science, Volume 37, No. 13, October 2006, pages 987 - 1001 Korizis, et. al., Smooth Component Extraction From a Set of Finanacial Financial Data Mixtures, Proceedings of the Fourth IASTED International Conference on Signal Processing, Pattern Recognition, and Applications (2007). E. Keogh and T. Folias, ‘‘The UCR Time Series Data Mining Archive’’, [http://www.cs.ucr.edu/eamonn/TSDMA/index.html]. Riverside CA. University of California – Computer Science & Engineering Department, 2002


Download ppt "© APT 2006 ICA And Hedge Fund Returns Dr. Andrew Robinson APT Program Trading Techniques and Financial Models for Hedge Funds June 27 th, 2007."

Similar presentations


Ads by Google