Backtesting strategies based on multiple signals Robert Novy-Marx University of Rochester and NBER 1.

Slides:



Advertisements
Similar presentations
COMM 472: Quantitative Analysis of Financial Decisions
Advertisements

Quantitative Stock Selection Portable Alpha Gambo Audu Preston Brown Xiaoxi Li Vivek Sugavanam Wee Tang Yee.
Lecture 3 HSPM J716. Efficiency in an estimator Efficiency = low bias and low variance Unbiased with high variance – not very useful Biased with low variance.
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
AN INTRODUCTION TO PORTFOLIO MANAGEMENT
Information Analysis. Introduction How can we judge the information content of an idea? More concretely: –Is this a signal we can profitably deploy.
Portfolio Construction. Introduction Information analysis ignored real world issues. We now confront those issues directly, especially: –Constraints –Transactions.
Chapter Sampling Distributions and Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Probability Population:
AN INTRODUCTION TO PORTFOLIO MANAGEMENT
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Lecture 11 Implementation Issues – Part 2. Monte Carlo Simulation An alternative approach to valuing embedded options is simulation Underlying model “simulates”
Evaluation of portfolio performance
Presented by: Lauren Rudd
Chapter 10 Hypothesis Testing
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
Fundamentals of Hypothesis Testing: One-Sample Tests
Means Tests Hypothesis Testing Assumptions Testing (Normality)
Chapter 13 Alternative Models of Systematic Risk.
Portfolio Management-Learning Objective
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in heads half the time (i.e.,
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Methods 5.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
Investment Analysis and Portfolio Management First Canadian Edition By Reilly, Brown, Hedges, Chang 6.
The stock market, rational expectations, efficient markets, and random walks The Economics of Money, Banking, and Financial Markets Mishkin, 7th ed. Chapter.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
1 Objective Compare of two population variances using two samples from each population. Hypothesis Tests and Confidence Intervals of two variances use.
Capital Asset Pricing and Arbitrage Pricing Theory
Introductory Investment Analysis Part II Course Leader: Lauren Rudd January 12, weeks.
Financial Time Series Analysis with Wavelets Rishi Kumar Baris Temelkuran.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 The Stock Market, the Theory of Rational Expectations, and the Efficient Market Hypothesis Chapter 7.
Welcome to MM570 Psychological Statistics
12.3 Efficient Diversification with Many Assets We have considered –Investments with a single risky, and a single riskless, security –Investments where.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
1 Optimizing Decisions over the Long-term in the Presence of Uncertain Response Edward Kambour.
12-1. Copyright © 2005 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin 12 Return, Risk, and the Security Market Line.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7 An Introduction to Portfolio Management.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
Statistics (cont.) Psych 231: Research Methods in Psychology.
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
Introduction to Hypothesis Testing: The Binomial Test
Psych 231: Research Methods in Psychology
Market-Risk Measurement
Risk and Return in Capital Markets
Chapter 11 Risk ad Return in Capital markets.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Returns – Nominal vs. Real
Presentation transcript:

Backtesting strategies based on multiple signals Robert Novy-Marx University of Rochester and NBER 1

Multi-signal Strategies Proliferation in industry  E.g., MSCI Quality Index High ROE, low ROE vol., low leverage  “Smart beta” products RAFI: weight on sales, CF, BE, and dividends. Increasingly common in academia  Piotroski’s F-score (9 signals)  Asness et. al. Quality Score (21 signals) 2

Why the increased interest? Because finding “alpha” is hard  And they work great! Impressive backtest performance Too good?  Alpha should be hard to find Lots of smart people looking  Huge incentives to try And even to believe! 3

Issues Every choice has potential to bias results  Much bigger problem with multiple signals Not just which signals are used… But how they are used! Basic issue  Each signal is used so that it individually predicts positive in-sample returns Seems like a small thig—but it’s not! 4

Types of biases Snooping: in-sample aspect of data  guides strategy formation Two types to worry about:  Multiple testing bias Consider multiple strategies, show only best one  Overfitting E.g., Ex post MVE SRs always high  MVE strat buys “winners” and sell “losers” 5

Examples Bet on a series of fair coin flips What if you knew that there were: 1.More heads in the first (or second) half And could bet on just the early (or late) flips? 2.More heads than tails? What sorts of biases?  Do we account for these in finance? 6

First type: multiple testing (or selection)  Don’t really account for it, formally Do suspect (know) people look at more thing Second type: overfitting  Bet heads, not tails!  Account for it? One signal: Absolutely!  t 5% = 1.96 (not 1.65) Multiple signals: No! 7

Thought experiment? 8

Null hypothesis  “Signals” don’t predict differences in average returns E.g., monkeys selecting stocks by throwing darts at the WSJ Performance distribution  t-statistics ~ N(0,1) More or less  Excess kurtosis and heteroscedasticity 9

What if you diversify across the lucky monkeys?  Those with positive alpha Clearly “snooping”  Using in-sample aspect of data to form the strategy How does this bias the results?  Expected t-stat? 10

Get the average return Diversify across their risks Yields a high t-statistic:  Can also frame this in SRs 11

Same thing (essentially) happens if you use all the signals  But sign them so that they “predict” positive in-sample returns Standard statistics account for this…  If and only if N = 1! Again, strategy has high backtested SR  Question: expect high SR going forward? 12

Issues Combine things that backtest well  Get even better backtests Not surprising!  But what do the backtests mean? Biased?  Why? What biases?  If so, by how much? (Quantify!) Other intuitions? 13

Can address these Calculate empirical distributions  When signals are not informative  But multiple signals are used to select stocks Big boot-strapping exercise Derive theoretical distributions  In a simplified model Normal, homoscedastic returns  Use these to develop intuition 14

Strategy Construction 15

16 “Smart beta”Market

Signals Generate individually as pure noise!  Random normal variables Composite signals sum individual signals  Technical reason—mapping to theory Not important for the empirical work Cap multiplier is market equity  Essentially value-weighted strategies Again, not important 17

Best k-of-n strategies “Natural” construction  Investigate n signals  Pick the k “strongest” I.e., with most significant in-sample performance  Combine them how? Bootstrap for k ≤ n ≤ 100  Again, do it 10,000 times  Collect strategy t-statistics 18

Two Issues When k < n, selection bias  When k = 1 < n, multiple testing bias Well understood When k > 1, overfitting  Data snooping In-sample aspect of data used to form strategy  Pure overfitting only if k = n Interaction! 19

Special Cases 20 Overfitting only Multiple-testing only

Pure Selection 21

Pure Overfitting 22

Both Biases 23

General Case What sort of strategies should we worry about?  How do we think researchers design strategies in practice? 3-of-20?  How many signals did MSCI consider for its quality index? 5-of-100? 24

General Case 25

Model (theory) Strategies signal-weight stocks Returns normally dist. (assumption)  Equal volatilities  Uncorrelated Combine signals by averaging  Or weighted averaging  combined strat = portfolio of pure strats So can apply facts from portfolio theory 26

27

Best k-of-n strategies Yields t-statistic distributions: 28

Critical values Analytic for special cases:  k = 1  k = n, with signal-weighting Generally by numeric integration  Simple computationally But don’t provide much intuition  Also derive good analytic approximations Useful for comparative statics 29

Special Cases 30

Special Cases 31

General Cases 32

General Cases (Empirical) 33

General case, when k ~ n n =

General case, when k ~ n n = 40 35

General case, when k ~ n n = 20 36

Tension when increasing k Decreases vol.  improves performance Decreases average signal quality  lowers returns  impairs performance  Initially first effect dominates (esp. w/ large n) “Optimal” use of worst ~1/2 of signals:  Throw them away! Mean k/2-of-k t-stats. ~13% higher than k-of-k Mean k-of-2k t-stats. ~59% higher than k-of-k 37

Alternative Quantification Pure multiple-testing bias equivalence  How many single signals would you have to look at to get the same bias? That is, given any critical value τ (i.e., for some best k-of-n strategy), find n* s.t.  38

39

Approximate Power Law Best k-of-n strategy bias: Similar to those from a best 1-of-n k strategy!  Using analytic approximation, can show that log-n * roughly affine in log-n With slope ≈ k Can see this graphically 40

41

Conclusion View multi-signal claims skeptically  Multiple good signals  better performance when combined  Good backtested performance does NOT  any good signals “High tech” solution: use different tests “Low tech”: evaluate signals individually  Marginal power of each variable 42

General Approximation 43

How They Work Specify mean, S.D. of approx. normal  Combine with p-value  how far out in tail E.g., 5% crit.  mean + two standard deviations 44

General Approximation Where 45