Duan Wang Center for Polymer Studies, Boston University Advisor: H. Eugene Stanley.

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Ordinary least Squares
Noise & Data Reduction. Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis - Spectrum.
Managerial Economics in a Global Economy
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Financial Applications of RMT Max Timmons May 13, 2013 Main Application: Improving Estimates from Empirical Covariance Matricies Overview of optimized.
Introduction Data and simula- tion methodology VaR models and estimation results Estimation perfor- mance analysis Conclusions Appendix Doctoral School.
ANTICIPATING CORRELATIONS Robert Engle Stern School of Business.
Capital Asset Pricing Model and Single-Factor Models
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter.
Chapter 10 Simple Regression.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
AN INTRODUCTION TO PORTFOLIO MANAGEMENT
Chapter 6 An Introduction to Portfolio Management.
Copyright © 2011 Pearson Prentice Hall. All rights reserved. Chapter 10 Capital Markets and the Pricing of Risk.
Chapter 11 Multiple Regression.
Topic 3: Regression.
Correlations and Copulas Chapter 10 Risk Management and Financial Institutions 2e, Chapter 10, Copyright © John C. Hull
1 CHAPTER 14 FORECASTING VOLATILITY II Figure 14.1 Autocorrelograms of the Squared Returns González-Rivera: Forecasting for Economics and Business, Copyright.
Tables, Figures, and Equations
Estimation Error and Portfolio Optimization Global Asset Allocation and Stock Selection Campbell R. Harvey Duke University, Durham, NC USA National Bureau.
Introduction to Regression Analysis, Chapter 13,
AN INTRODUCTION TO PORTFOLIO MANAGEMENT
Estimating High Dimensional Covariance Matrix and Volatility Index by making Use of Factor Models Celine Sun R/Finance 2013.
Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu March, 2010.
1 MBF 2263 Portfolio Management & Security Analysis Lecture 2 Risk and Return.
Structural Risk Models. Elementary Risk Models Single Factor Model –Market Model –Plus assumption residuals are uncorrelated Constant Correlation Model.
Understanding Multivariate Research Berry & Sanders.
Version 1.2 Copyright © 2000 by Harcourt, Inc. All rights reserved. Requests for permission to make copies of any part of the work should be mailed to:
Portfolio Management-Learning Objective
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
Some Background Assumptions Markowitz Portfolio Theory
Investment Analysis and Portfolio Management Chapter 7.
Empirical Financial Economics Asset pricing and Mean Variance Efficiency.
Lecture 10 The Capital Asset Pricing Model Expectation, variance, standard error (deviation), covariance, and correlation of returns may be based on.
Chapter 10 Capital Markets and the Pricing of Risk.
Chapter 10 Capital Markets and the Pricing of Risk
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Investment Analysis and Portfolio Management First Canadian Edition By Reilly, Brown, Hedges, Chang 6.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Finance 300 Financial Markets Lecture 3 Fall, 2001© Professor J. Petry
INVESTMENTS | BODIE, KANE, MARCUS Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin CHAPTER 8 Index Models.
Modern Navigation Thomas Herring MW 11:00-12:30 Room
CHAPTER 5 SIGNAL SPACE ANALYSIS
Robert Engle UCSD and NYU and Robert F. Engle, Econometric Services DYNAMIC CONDITIONAL CORRELATIONS.
Dynamic Models, Autocorrelation and Forecasting ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Lecture 3. Option Valuation Methods  Genentech call options have an exercise price of $80 and expire in one year. Case 1 Stock price falls to $60 Option.
Chapter 7 An Introduction to Portfolio Management.
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Single Index Model. Lokanandha Reddy Irala 2 Single Index Model MPT Revisited  Take all the assets in the world  Create as many portfolios possible.
Central limit theorem - go to web applet. Correlation maps vs. regression maps PNA is a time series of fluctuations in 500 mb heights PNA = 0.25 *
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Estimating standard error using bootstrap
Chapter 4 Demand Estimation
Linear Regression.
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
Saif Ullah Lecture Presentation Software to accompany Investment Analysis and.
Chapter 4 Demand Estimation
Basic Statistical Terms
Estimation Error and Portfolio Optimization
Estimation Error and Portfolio Optimization
Simple Linear Regression
Estimation Error and Portfolio Optimization
Chapter 13 Additional Topics in Regression Analysis
Estimation Error and Portfolio Optimization
Presentation transcript:

Duan Wang Center for Polymer Studies, Boston University Advisor: H. Eugene Stanley

Basic Concepts in Financial Time Series Random Matrix Theory ◦ Meaning of eigenvalues and eigenvectors ◦ Eigenvalue distribution for random correlation matrix ◦ Interpreting empirical eigenvalue distribution Extensions of Random Matrix Theory ◦ Autoregressive random matrix theory (ARRMT) ◦ Time-lag random matrix theory (TLRMT) ◦ Global factor model (GFM) (Application of RMT in portfolio optimization)

Price (S i,t ): index i at time t Return: Magnitude of return: Black Monday 2001 internet bubble subprime mortgage crisis 22% price drop

For time series {X t } and {Y t },  Cross-correlation  Autocorrelation  Time-lag Cross-correlation Properties: Example: highly correlated France and Italy indices Standard deviations <> Denotes average time lag

Simulate return autocorrelations: AR(1) Simulate magnitude autocorrelations: GARCH(1,1)

6 Stocks 6x6 Correlation matrix Symmetric matrix Main diagonal=1 Meaning of eigenvalues? Eigenvalue decomposition Explained by random matrix theory (RMT) Annual Return of 6 US stocks

 Original RMT  Aim: find existence of collective behavior  method: compare the eigenvalue distribution between (1) a symmetric random matrix and (2) a Hamiltonian matrix  RMT in Econophysics  Aim: find existence of cross-correlation in multiple time series  Method: compare the eigenvalue distribution of (1) a Wishart matrix (random correlation matrix) and (2) empirical cross-correlation matrix Wishart matrix: sample correlation matrix for uncorrelated time series

Cross-correlation distribution between 2 uncorrelated time series, each with length T=2000. Sample correlation distribution: C ij ~N(0,1/T) Gaussian distribution with mean zero and variance 1/T Sample cross-correlations between independent time series are not zero if time series are of finite length. Mathematically: C ij =0 Statistically: C ij falls in, with 95% probability. 95% confidence interval

Principal Components Linear combination of individual time series Orthogonal transformation of correlated time series Eigenvectors (Factor Loadings) Weight of each individual time series in a principal component Eigenvalues Variance of each principal component Measures the significance of a factor Not all of them are significant Compare them with the eigenvalues of Wishart matrix to find out the number of significant factors

Largest eigenvalue of uncorrelated time series Significant factors Eigenvalue distribution of Wishart matrix From: Plerou et. al., 1999, PRL., 2002, PRE. (422 stocks from S&P 500, 1737 daily returns) Largest eigenvalue (not shown): Average behavior (Bahavior of the whole market) Noise Other large eigenvalues: Subgroups (Sectors, industries, …)

Aim (1)Test significance of correlations in multiple time series (2)Find number of significant factors (3)Reduce noise in correlation matrix

Efficient Frontier Efficient Frontier H. Markowitz (1952)

Minimum Variance Portfolio: Cumulative Return After Applying RMT in constructing the minimum variance portfolio, we increased returns and reduced risk. μσ Markowitz RMT Portfolio: 404 stocks from S&P 500, rebalancing at the end of each year

AR(1) Model:

Variance Correlation distribution for different AR(1) Coefficient Variance of sample crosscorrelation

Eigenvalue distribution for different AR(1) Coefficient Autocorrelation in time series may influence the eigenvalue distribution for uncorrelated time series Eigenvalue to compare with empirical eigenvalues Noise detected by RMT Actual Noise

Aim Adjust RMT for autocorrelated time series

Histogram of AR(1) Coefficients Data: Change of air pressure in 95 US cities

RMT and ARRMT for Daily Air Pressure of 95 US Cities RMT 11 significant factors ARRMT 8 significant factors

(Podobnik, Wang et. al., 2010, EPL.)  Aim: extend RMT to time lag correlation matrix  Unsymmetrical matrices, eigenvalues are complex numbers  Use singular value decomposition (SVD) instead of eigenvalue  Largest singular value: strength of correlation Lag=0 GEMSFTJNJ GE MSFT JNJ Lag=1 GEMSFTJNJ GE MSFT JNJ Correlation matrix for lag=0 and lag=1

We find: (1)Short-range return cross-correlations (after time lag=2) (2)Long-range magnitude cross-correlation (scaling exponent=0.25) magnitudes returns

 Aim: explain cross-correlations using one single process.  Assumption: Each individual index fluctuates in response to one common process, the “global factor” M t. What is the global factor?  A linear combination of all individual index returns  The weight of each index return is calculated using statistical method. Global factor Measures dependence of R i,t on M t mean return unsystematic noise individual return Wang et. al., 2011, PRE.

 Variance of global factor---- Cross-correlation among individual indices (holds for both returns and magnitudes)  Autocorrelation of global factor---- Time lag cross- correlation among individual indices Conclusion: Cross-correlation among individual indices decay in the same pattern as the autocorrelation of the global factor. Return cross-correlation Magnitude cross-correlation A M = autocorrelation of global factor GFM explains the reason for the long range magnitude cross-correlations.

 Aim: Linear regression with unobservable M t  Basis: Least total squared errors  Calculation: Eigenvalue decomposition (C=U + DU) (1) The principal components are related with the eigenvectors. (2) M t = the first principal component  Procedure: (1) Find M t using eigenvalue decomposition. (2) Linear regression, find μ, b i, epsilon. Correlation matrix Eigenvalue matrix Unknown, linear combinition of R i,t Eigenvector matrix

Subprime mortgage crisisInternet bubble

 Only 3 eigenvalues above Wishart maximum (significant).  The largest eigenvalue (global factor) constitute 31% of total variance, and 75% of total variance of 3 significant principal components.  Conclusion: the single global factor is sufficient in explaining correlations. Eigenvalues above and below Wishart maximum. Wishart maximum Global factor 3 eigenvalues above Wishart maximum Insignificant principal components (eigenvalues below Wishart maximum)

 Define risk: volatility (time dependent standard deviation) volatility=expected standard deviation of a time series at time t given information till time t-1.  How to calculate risk: Apply GARCH to global factor  Historical risk and expectation of future risk Coefficients estimated using maximum likelihood (MLE) Forecasted using recursion with estimated coefficients

 PC1: American and EU countries  PC2: Asian and Pacific countries  PC3: Middle east countries Application: Multiple Global Factors 3 significant principal components (PCs) Wang et., al., working paper. Statistically, the world economy can be grouped as 3 sectors. Global factorSectors

(1)Large correlation between Western and Asian economies (2) Small correlation between Middle East Economy and the other 2 groups (3) Each group has its own financial crises. (4) The 2008 market crash influenced all 3 groups, indicating globalization (large volatility correlation).

 Duan Wang 

Eigenvectors: Implied Participation Number For Wishart matrix

Empirical Implied Participation Number 422 stocks from S&P 500, 1737 daily returns

Empirical Implied Participation Number First Eigenvector All positive, similar weights Indicating global factor Last Eigenvector Small number of participants Indicating small subgroup AVB EQR SPG VNO

Efficient Frontier H. Markowitz (1952)

Example: 404 stocks from S&P 500

Problem: in sample estimate of the weights does not minimize the out of sample error

Reason for Instability

RMT in Portfolio Optimization: Procedure

Realized Frontier: RMT Applied

Minimum Variance Portfolio: Cumulative Return After Applying RMT in constructing the minimum variance portfolio, we increased returns and reduced risk. μσ Markowitz RMT

Minimum Variance Portfolio: Turnover Rate

 Time-lag RMT (TLRMT)  Global Factor Model (GFM)  Variance Crosscorrelation Conditional Variance Adjusted Regression Model (CVARM) Conditional Heteroskedasticity Adjusted Regression Model (CHARM)

Variance

N=2000 T=8000

95 th Percentile

 RMT with existence of multicolinarity  RMT with N>T  (Show Figures Here!)