A Statistical Arbitrage Strategy

Slides:



Advertisements
Similar presentations
Cointegration and Error Correction Models
Advertisements

Nonstationary Time Series Data and Cointegration
COMM 472: Quantitative Analysis of Financial Decisions
Quantitative Stock Selection Portable Alpha Gambo Audu Preston Brown Xiaoxi Li Vivek Sugavanam Wee Tang Yee.
Use of Business Tendency Survey Results for Forecasting Industry Production in Slovakia Use of Business Tendency Survey Results for Forecasting Industry.
Nonstationary Time Series Data and Cointegration Prepared by Vera Tabakova, East Carolina University.
COINTEGRATION 1 The next topic is cointegration. Suppose that you have two nonstationary series X and Y and you hypothesize that Y is a linear function.
1 MF-852 Financial Econometrics Lecture 11 Distributed Lags and Unit Roots Roy J. Epstein Fall 2003.
Unit Roots & Forecasting
Time Series Analysis Materials for this lecture Lecture 5 Lags.XLS Lecture 5 Stationarity.XLS Read Chapter 15 pages Read Chapter 16 Section 15.
Regression with Time-Series Data: Nonstationary Variables
Introduction to Algorithmic Trading Strategies Lecture 3 Pairs Trading by Cointegration Haksun Li
Chapter 8 Linear Regression © 2010 Pearson Education 1.
STAT 497 APPLIED TIME SERIES ANALYSIS
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter.
Chapter 10 Simple Regression.
Asset Allocation Week 4. Asset Allocation: The Fundamental Question How do you allocate your assets amongst different assets ? –There are literally thousands.
Time Series Model Estimation Materials for this lecture Read Chapter 15 pages 30 to 37 Lecture 7 Time Series.XLS Lecture 7 Vector Autoregression.XLS.
Yale School of Management Hedge Funds and International Opportunities and Threats William N. Goetzmann Yale School of Management.
QA-3 FRM-GARP Sep-2001 Zvi Wiener Quantitative Analysis 3.
Hedge Funds Lecture 2: Risk Management & Portfolio Construction
ARIMA Forecasting Lecture 7 and 8 - March 14-16, 2011
FRM Zvi Wiener Following P. Jorion, Financial Risk Manager Handbook Financial Risk Management.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Copyright 2013 John Wiley & Sons, Inc. Chapter 8 Supplement Forecasting.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, Inc., All Rights Reserved. Capital Asset Pricing and Arbitrage Pricing Theory CHAPTER 7.
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Correlation and Regression Analysis
 Derek Crookes: ◦ Basic introduction to pairs Trading  Jonathan Roche: ◦ Highlights of a Pairs Trading Case Study ◦ Searching.
1 Using Correlated Stocks as a Leading Indicator: An Applied Pair Trading Strategy Don Bright & Darren Clifford
Time-Varying Volatility and ARCH Models
CH5 Overview 1. Agenda 1.History 2.Motivation 3.Cointegration 4.Applying the model 5.A trading strategy 6.Road map for strategy design 2.
CHAPTER 05 RISK&RETURN. Formal Definition- RISK # The variability of returns from those that are expected. Or, # The chance that some unfavorable event.
© 2008 Pearson Education Canada7.1 Chapter 7 The Stock Market, the Theory of Rational Expectations, and the Efficient Markets Hypothesis.
Pairs Trading Ch6 Pairs Selection in Equity Markets 郭孟鑫.
Version 1.2 Copyright © 2000 by Harcourt, Inc. All rights reserved. Requests for permission to make copies of any part of the work should be mailed to:
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
Chapter 13 CAPM and APT Investments
Lorenzo Coviello and Petros Mol
Financial Econometrics II Lecture 2. 2 Up to now: Tests for informational WFE assuming constant expected returns Autocorrelations Variance ratios Time.
CH7 Testing For Tradability 1. Agenda 1.Introduction 2.The linear relationship 3.Estimating the linear relationship: The multifactor approach 4.Estimating.
Chapter 06 Risk and Return. Value = FCF 1 FCF 2 FCF ∞ (1 + WACC) 1 (1 + WACC) ∞ (1 + WACC) 2 Free cash flow (FCF) Market interest rates Firm’s business.
Chapter 10 Capital Markets and the Pricing of Risk.
Portfolio Construction Strategies Using Cointegration M.Sc. Student: IONESCU GABRIEL Supervisor: Professor MOISA ALTAR BUCHAREST, JUNE 2002 ACADEMY OF.
TECHNICAL ANALYSIS. Technical analysis defined It is an art of identifying a trend reversal at a relatively early stage and ride on that trend until the.
Market Risk A financial firm’s market risk is the potential volatility in its income due to changes in market conditions such as interest rates, liquidity,
Yale School of Management 1 Pairs Trading: performance of a relative value arbitrage strategy Evan G. Gatev William N. Goetzmann K. Geert Rouwenhorst Yale.
Lecture 12 Time Series Model Estimation
Time Series Analysis Lecture 11
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
EC208 – Introductory Econometrics. Topic: Spurious/Nonsense Regressions (as part of chapter on Dynamic Models)
Previously Definition of a stationary process (A) Constant mean (B) Constant variance (C) Constant covariance White Noise Process:Example of Stationary.
Time Series Analysis PART II. Econometric Forecasting Forecasting is an important part of econometric analysis, for some people probably the most important.
Stationarity and Unit Root Testing Dr. Thomas Kigabo RUSUHUZWA.
Comments from Instructor: A detailed yet analytical paper, which puts class materials into good application, and takes one step further, if simple, to.
Econometric methods of analysis and forecasting of financial markets Lecture 4. Cointegration.
MODEL DIAGNOSTICS By Eni Sumarminingsih, Ssi, MM.
1 Lecture Plan : Statistical trading models for energy futures.: Stochastic Processes and Market Efficiency Trading Models Long/Short one.
Lecture 12 Time Series Model Estimation Materials for lecture 12 Read Chapter 15 pages 30 to 37 Lecture 12 Time Series.XLSX Lecture 12 Vector Autoregression.XLSX.
Momentum and Reversal.
Vera Tabakova, East Carolina University
Lecture 12 Time Series Model Estimation
Nonstationary Time Series Data and Cointegration
6 Efficient Diversification Bodie, Kane and Marcus
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
Momentum Effect (JT 1993).
From: Dynamic Dependence and Diversification in Corporate Credit*
CHAPTER 16 ECONOMIC FORECASTING Damodar Gujarati
Unit Roots 31/12/2018.
Presentation transcript:

A Statistical Arbitrage Strategy Pairs Trading A Statistical Arbitrage Strategy Chris Emmanuel Fua Christopher Melgaard James (Yi-Wei) Li

Background Developed in the 1980’s by a group of Quants at Morgan Stanley, who reportedly made over $50 million profit for the firm in 1987 A contrarian strategy that tries to profit from the principles of mean-reversion processes In theory, one could expand the strategy to include a basket of more than a pair of related stocks Chris

Main Idea Choose a pair of stocks that move together very closely, based on a certain criteria (i.e. Coke & Pepsi) Wait until the prices diverge beyond a certain threshold, then short the “winner” and buy the “loser” Reverse your positions when the two prices converge --> Profit from the reversal in trend Emmanuel

Example of a Pairs Trade Emmanuel

Investor Decisions Pair Selection Criteria Correlation (Parametric & Non-Parametric Spearman’s Rho) Dickey-Fuller Test Statistic (Cointegration) Trading Threshold (areas of consideration) Volatility of the Market Historical returns Cost of each transaction James

Normalization of Stock Data METHOD: Find pair that has maximal correlation Normalize price series, plot spread over 1 year “formation period” Generate optimal threshold non-parametrically: choose a threshold Ti=c*sd(spread), calculate profit for each Ti, choose Ti generating max profit Calculate profit by going $1 short on winner, $1 long on loser; close position when prices converge, i.e. spread=0 Normalize price series in 6 month “trading period” using mean and sd from formation period Plot spread using optimal threshold found from formation period, calculate profit Lower thresholds More transactions Higher transaction costs  Lower Returns Higher transaction costs Smaller Returns E

Chevron & Exxon Formation Period Corr=0.93 Trading Period Corr=0.96 Optimal Threshold=1.25*sd’s # Transactions=10 Returns=15% Win. E

Electronic Arts & GAP Formation Corr=0.12 Trading Corr=0.56 Optimal Threshold=1 sd # Transactions=0 (Open a position, but spread never returns to 0) Return= -0.04 Lose. E

Nike & McDonald’s Formation Corr=0.87 Trading Corr=0.02 #Transactions=1 Return= -0.05 Lose. Correlation is imperfect criteria for selecting pairs. E

Interesting result involving market volatility Formation Period Trading Period Pair Dates Corr. Optimal Threshold* #Trans Returns Exxon, Chevron Period 1 0.93 1 6 0.11 0.85 Period 2 1.75 0.05 0.69 Period 3 1.25 10 0.15 0.96 Nike, McDonald’s 0.87 1.5 2 -0.05 0.02 0.10 -0.02 0.29 4 0.04 Electronic Arts, GAP 0.12 -0.04 0.56 0.19 -0.03 -0.09 0.31 0.06 E Positive profits for all measured pairs in period 3. Period 3 includes January 2008, a very volatile month for the stock market. It seems that high market volatility allows the possibility for positive profits for uncorrelated pairs which would not generate such profits in low volatility periods, although this can surely work either way.

Cointegration If there exists a relationship between two non-stationary I(1) series, Y and X , such that the residuals of the regression are stationary, then the variables in question are said to be cointegrated Note: X and Y here are clearly not stationary, but they seem to move together. In fact, they are cointegrated --> (Y- β1X-β0 )should be stationary J

Application to Pairs Trading If we have two stocks, X & Y, that are cointegrated in their price movements, then any divergence in the spread from 0 should be temporary and mean-reverting. The important issues here are: 1) how to test for cointegration between prices and 2) estimating the constant Spread J

Testing For Cointegration Many Methods – most of them focus on testing whether the residuals of are stationary processes We use the Cointegrating Regression Dickey-Fuller Test, which essentially operates the following regression: Δut = φ ut-1 + et H0: φ = 0 => no cointegration* Ha: φ < 0 => cointegration* To obtain the cointegration factor estimates, we must regress the de-trended Yt on the de-trended Xt J * We must use critical values different from Gaussian ones due to non-symmetric properties of the Dickey-Fuller distribution

Results of Test NO PAIR OF PRICES ARE COINTEGRATED! No surprise there Alternative: take the “most cointegrated” pair & optimize thresholds as we did with normalized data Compare the results against normalized thresholds in the same time period J

Normalization Vs Cointegration Figure 5: Normalized strategy VS Cointegrated strategy LUV(Southwest Airlines) & PLL (Pall Corporation) Normalization Cointegration Correlation coef. /CRDF stat. over Formation Period 0.24 -0.52* Cointegration Factor N/A 0.43 Optimal SD Threshold over Formation Period 1.25 SDs 1.75 SDs Optimal Returns ~0% ~2% Number of Transactions over Trading Period 4 Returns ~5% ~13% J *CRDF statistic insignificant against the H0: The Time Series is not cointegrated (5% critical value = -3.39) **Fixed transaction costs implicit in both models

Trading Period Comparison Figure 6: Normalized LUV & PLL spread VS Cointegrated LUV & PLL spread J

Auto-Regressive Time Series Cointegration is an ideal construct for pairs trading But Dickey-Fuller Hypothesis Test is inconclusive Instead we can fit a time series to the spread data AR(1): Yt = β Yt-1 + εt Looking for a spread that produces an AR(1) with |β| < 1, so that will be stationary. C

Choosing thresholds with AR(1) For the interest of time, we are only going to focus our most cointegrated pair: LUV and PLL. We will fit an AR(1) to the data by estimating β and the standard deviation of each iid white noise εt. Then we will run one thousand simulations of this AR(1) model and estimate each of their optimal benchmarks The average of the optimal benchmarks from each simulation will serve as our estimate for the optimal benchmark in the formation period.

Results of AR(1) Thresholds AR(1) Coefficient estimate (μ-hat) 0.8605 Optimal Threshold estimate 1.046 SD of Optimal Threshold 0.2597 Number of Transactions 12 Returns over Trading Period 17.7%

Alternative Strategies Conditional correlation or some other measure of “relatedness”, such as Copulas Modeling the spread as GARCH processes Optimize profits w.r.t. certain global indicators (i.e. market volatility, industry growth, etc.) Factor Analysis on the spread J

Bibliography Gatev, Evan, William N. Goetzmann, and K. Geert Rouwenhorst, “Pairs Trading: Performance of a Relative-Value Arbitrage Rule,” Review of Financial Studies (2006): 797- 827. Vidyamurthy, Ganapathy, Pairs Trading: Quantitative Methods and Analysis (New Jersey: John Wiley & Sons, Inc., 2004). Wooldridge, Jefferey M., Introductory Econometrics, A Modern Approach, Third Edition (Ohio: Thomson South-Western, 2006).