Authored by Werner Antweiler and Murray Z. Frank

Slides:



Advertisements
Similar presentations
Ordinary least Squares
Advertisements

Decomposition Method.
Multiple Linear Regression Model
Information-based Trading, Price Impact of Trades, and Trade Autocorrelation Kee H. Chung Mingsheng Li Thomas H. McInish.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
PREDICTABILITY OF NON- LINEAR TRADING RULES IN THE US STOCK MARKET CHONG & LAM 2010.
The Role of Financial System in Economic Growth Presented By: Saumil Nihalani.
University of Missouri Southwind Finance Conference
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
McGraw-Hill/Irwin © 2007 The McGraw-Hill Companies, Inc., All Rights Reserved. Behavioral Finance and Technical Analysis CHAPTER 19.
CHAPTER 19 Behavioral Finance and Technical Analysis.
Financial Econometrics II Lecture 2. 2 Up to now: Tests for informational WFE assuming constant expected returns Autocorrelations Variance ratios Time.
Federico M. Bandi and Jeffrey R. Russell University of Chicago, Graduate School of Business.
Determinants of Credit Default Swap Spread: Evidence from the Japanese Credit Derivative Market.
Chapter 10 Capital Markets and the Pricing of Risk.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 21 The Simple Regression Model.
Transportation Planning Asian Institute of Technology
Analysis of Financial Data Spring 2012 Lecture 9: Volatility Priyantha Wijayatunga Department of Statistics, Umeå University
INTRODUCTION TO EQUITIES by Matt Ingram Invest Ed® All Rights Reserved Oklahoma Securities Commission July 2016.
1 Arbitrage risk and the book- to-market anomaly Ali, Hwang and Trombley JFE (2003)
Presenter: Siddharth Krishna Sinha Instructor: Jing Gao
Kian Guan LIM and Christopher TING Singapore Management University
Momentum and Reversal.
Investment Management
Chapter 14 Introduction to Multiple Regression
Behavioral Finance and Technical Analysis
Comparing Systems Using Sample Data
Market-Risk Measurement
Demand Estimation and Forecasting
Chapter 9 Audit Sampling: An Application to Substantive Tests of Account Balances McGraw-Hill/Irwin ©2008 The McGraw-Hill Companies, All Rights Reserved.
INTRODUCTION TO EQUITIES
Logistic Regression APKC – STATS AFAC (2016).
The Pricing of Stock Options Using Black-Scholes Chapter 12
Why Firms Adopt and Discontinue New-Issue Dividend Reinvestment Plans
Measuring Exposure To Exchange Rate Fluctuations
Risk and Return Fundamentals
Momentum Effect (JT 1993).
Multiple Regression Analysis and Model Building
Pin-Huang Chou Robin K. Chou Kuan-Cheng Ko National Central University
Role of Short-Sales Constraints
The Opening Bell Deviation Theory
The Opening Bell Deviation Theory
Competition, financial innovation and commercial
MICHAEL NEEL, University of Houston
Journal of Economics and Finance(2017) 41:311–329
Building a Trading Diary
Data Mining Lecture 11.
A Logit model of brand choice calibrated on scanner data
CHAPTER 16 ECONOMIC FORECASTING Damodar Gujarati
CHAPTER 29: Multiple Regression*
Li Wang Ross School of Business Ann Arbor, MI 48105
Ninja Trader: Introduction to data mining in financial applications
Logistic Regression.
Alexandros Dimitriadis
Corporate governance, chief executive officer compensation, and firm performance 刘铭锋
© 2017 by McGraw-Hill Education
Part III Exchange Rate Risk Management
Roberts and Sufi (2009) Here the concern is financial policies.
Lecture 7 Sampling and Sampling Distributions
Chapter 7: The Normality Assumption and Inference with OLS
Applied Economic Analysis
Statistics II: An Overview of Statistics
Regression Forecasting and Model Building
Even-Tov O Journal of Accounting & Economics, 2017, 64(1).
Momentum Momentum is used to indicate the speed with which prices are changing. When a change in direction occurs in a short-term trend, technicians say.
Measuring Exposure to Exchange Rate Fluctuations
Authored by Mingyi Hung, T.J. Wong, Tianyu Zhang
Presentation transcript:

Authored by Werner Antweiler and Murray Z. Frank Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards Authored by Werner Antweiler and Murray Z. Frank Presented by: Suisui Huang Both authors are at the Sauder School of Business, University of British Columbia

Agenda Research Results Preview Paper Structure My Opinions Message Board Data and Classification (Data pre-processing) Data Description Relating Message Boards and Stock Returns Volatility Trading Volume My Opinions Here’s my agenda today FIRST of all, we will have a preview of the research results There are three issues discussed in this paper

Research Results Preview Three specific issues considered Does the number of messages posted or the bullishness of these messages help to predict returns? Is disagreement among the messages associated with trading volume? Does the number of messages posted or the bullishness of the messages help to predict volatility?

Research Results Preview The First Issue Question: Does the number of messages posted or the bullishness of these messages help to predict returns? Result: An increase in the number of messages posted has a statistically significant but economically small negative return on the next day

Research Results Preview The Second Issue Question: Is disagreement among the messages associated with trading volume? Historical Researches (Opposite opinions): Traditional Hypothesis: Disagreement induces trading No-trade Theorem: Disagreement leads to a revision of market prices and beliefs Result: Disagreement induces contemporaneous trading but reduces the trading volume on the next day Traditional Hypothesis: Disagreement induces trading (Hirshleifer(1977),Diamond and Verrecchia(1981), Karpoff(1986), Harris and Raviv(1993) ) No-trade Theorem: Disagreement leads to a revision of market prices and beliefs (Milgrom and Stokey(1982))

Research Results Preview The Third Issue Question: Does the number of messages posted or the bullishness of the messages help to predict volatility? Model for volatility: Fractionally integrated realized volatility news response function Result: Message posting and trading volume help predict volatility

Message Board Data and Classification Message Data collection (Data Pre-processing Part 1) Source: Yahoo! Finance & Raging Bull message boards Object: 45 firms that together made up the Dow Jones Industrial Average (DIA) and the Dow Jones Internet Commerce Index (XLK). Format: 1.5 million text messages Bullish? Bearish? Neither? Those firms are fairly large and well-known to download messages Wrote a specialized software to store messages in the following format The problem here is that, for our research, we need to classify those messages into different categories. Is it bullish or bearish or neither? If we do this work manually it would take forever, that’s why we need to utilize a well-known machine learning model named “Naïve Bayes Classification” to automatically accomplish this classification in just a few minutes

Message Board Data and Classification Naïve Bayes Classification Method Key Assumption: Occurrences of words are independent of each other (Naïve) Procedure: Consider a stream of words Wi that are found either in a message of type T or itsanti-type T-prime Updating Rule: Though the assumption may be somewhat unrealistic, the model performs rather well in practice. It allow us to ignore grammatical structure Now consider Bayes' rule, updating our prior P(T|Wi-1) to posterior P(T|Wi) when we observe word Wi With mi: the number of occurrences of this word Wi in type T mi-prime: the number of occurrences of this word Wi in anti-type T-prime ni: the total number of words in type T ni-prime: the total number of words in type T-prime

Message Board Data and Classification Naïve Bayes Classification Method Procedures: Manually split 1000 messages into buy, sell and hold Run a software named rainbow, set ’Naïve Bayes’ for method, get 3 probabilities P(T|Wk )for each message k and each categories T (Buy, hold or sell) Choose the classification with the highest probability for message k In-sample Accuracy: In-sample accuracy is 18.1+65.9+4.1=88.1%

Message Board Data and Classification Aggregation of the Coded Messages (Data Pre-processing Part 2) Bullishness signal Bt: MtBuy : total number of messages in category ‘BUY’ during the time interval t. MtSell : total number of messages in category ‘SELL’ during the time interval t. Agreement Index At: Xi is an indicator variable that is one when message I is of type c and zero otherwise So it makes Bt the average of X1 The volatility can a representative of disagreement At=1-vol, which can naturally be a representative of agreement If we do not observe any messages in a given time period, we assume that the bullishness signal is neutral, namely B=0,then A=0 as well. Intuitively, periods without

Messages & Financial Data Data Description Messages & Financial Data Figure2 shows the weekly message posting over the full year 2000 message posting activity was reasonably stable over the year Figure3 shows the weekly trading volume over the full year 2000 There was a decline in the volume of stock trading over the year The trading volume is often elevated during the same week that message posting is elevated

Messages & Financial Data Data Description Messages & Financial Data Yahoo! Finance vs Raging Bull: More messages More bullishness Shorter messages DIA vs XLK: Lower losses Lower volatility Lower activity levels Less bullishness Higher level of coverage by WSJ This table reports a number of descriptive statistics for our 45 sample companies

Relating Message Boards and Stock Returns Time sequencing tests Significant negative relationship between the number of messages on day t and stock returns on day (t+1). Significant positive relationship between the number of messages on day t and stock returns on day (t+2). Results: The return predictability is statistically significant but very small in magnitude Difficult to take advantage of because potential gains would likely be offset by small transaction costs NWK is an indicator variable for a day being the first trading day after a weekend or holiday If the coefficient is significant then it would be indicated with a superscript a denotes significance at 99% level b 99.9% c 99.99%

Relating Message Boards and Volatility Fractionally integrated realized volatility news response function 1. What is Vi,t: 2. What is d (fractional integration parameter): When we are modeling volatility, we usually use only previous volatility as independent variables, however, in our case, our central interest is in understanding the extent to which the message boards have an impact on the realized volatility. That’s why we introduce A M and N here Google FIGARCH and GPH for more details *Log-periodogram method (Geweke and Porter-Hudak(1983)) Using estimates of the periodogram for company i at frequency points k=0,1,...,T^0.6, we get d=0.3097 Rt,i,d: Return of company i on day t and intra-day period d (15 mins) Vi,t: Volatility of company i on day t

Relating Message Boards and Volatility Fractionally integrated realized volatility news response function 3. What is I(ri,t-1<0): 4. What is L,A, M and N: L: Lag operator A: Agreement Index M: Number of messages N: Number of trades An indicator variable for yesterday’s return to be negative Leverage effect: When the coefficient is not 0, then there exists leverage effect. The sign of yesterday’s return, negative or positive, will have an impact on today’s volatility L is a lag operator. See white board for details.

Relating Message Boards and Volatility Time sequencing tests We can predict volatility using the message posting data More messages today imply significantly greater market volatility tomorrow Message posting activity has a more significant effect on market volatility than market volatility has on message posting

Relating Message Boards and Volatility Volatility Panel Regressions The number of messages is a predictive factor of volatility for both DIA and XLK firms The number of trades is a significant factor for the DIA firms, but not significant for the XLK firms Agreement among the posted messages is not significant

Relating Message Boards and Trading Volume Volume regressions WSJ’s effect If there is an article in today’s WSJ, trading is elevated. If there was an article in yesterday’s WSJ, then todays trading is depressed Message posting The message board posting volume remains a predictor of trading volume. The sign is positive and it is statistically significant Contemporaneous Regressions Although the magnitudes are economically smaller. The explanatory power of the volume regressions is quite high

Relating Message Boards and Trading Volume Contemporaneous Regressions Contemporaneous Regressions The Contemporaneous Regressions support for the hypothesis that disagreement produces trading

Relating Message Boards and Trading Volume Time sequencing tests Namely, greater disagreement will reduce the trading volume Greater agreement on a given day is followed by more trades on the next day “Sluggish traders hypothesis”: It takes a while for people to get around to completing their trades. Cao(2002) suggests that when a person learns that others have received the same signal that they have received, this will make them more willing to trade

Agreement Disagreement My Opinions Innovative model for volatility Interesting results found between message posting and return Test error for Naïve Bayes Method? R square for volume regressions relatively small Granger causality test An increase in the number of messages posted has a statistically significant but economically small negative return on the next day

Thanks for your time! Any questions?