Chapter 1 : Introduction and Review

Slides:



Advertisements
Similar presentations
Cointegration and Error Correction Models
Advertisements

Autocorrelation Functions and ARIMA Modelling
Ordinary least Squares
Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Time Series Building 1. Model Identification
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by means of inference statistical methods.
Objectives (BPS chapter 24)
STAT 497 APPLIED TIME SERIES ANALYSIS
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Linear Regression.
Diversification and Portfolio Management (Ch. 8)
Chapter 12 Simple Regression
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Simple Linear Regression
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
ARIMA Forecasting Lecture 7 and 8 - March 14-16, 2011
Economics 20 - Prof. Anderson
Non-Seasonal Box-Jenkins Models
Ch. 14: The Multiple Regression Model building
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Lecture 5 Correlation and Regression
Correlation & Regression
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
Linear Regression and Correlation
Regression Method.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Lecture 10 The Capital Asset Pricing Model Expectation, variance, standard error (deviation), covariance, and correlation of returns may be based on.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Chapter 10 Capital Markets and the Pricing of Risk.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Chapter 5: Regression Analysis Part 1: Simple Linear Regression.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Finance 300 Financial Markets Lecture 3 Fall, 2001© Professor J. Petry
Autocorrelation, Box Jenkins or ARIMA Forecasting.
Investments, 8 th edition Bodie, Kane and Marcus Slides by Susan Hine McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
The Box-Jenkins (ARIMA) Methodology
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
13 – 1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall. Forecasting 13 For Operations Management, 9e by Krajewski/Ritzman/Malhotra.
Portfolio risk and return
Time Series Analysis PART II. Econometric Forecasting Forecasting is an important part of econometric analysis, for some people probably the most important.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
1 CAPM & APT. 2 Capital Market Theory: An Overview u Capital market theory extends portfolio theory and develops a model for pricing all risky assets.
1 Simple Linear Regression Chapter Introduction In Chapters 17 to 19 we examine the relationship between interval variables via a mathematical.
The simple linear regression model and parameter estimation
Chapter 4 Basic Estimation Techniques
Linear Regression.
Ch8 Time Series Modeling
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
CHAPTER 29: Multiple Regression*
Chapter 3 Statistical Concepts.
Presentation transcript:

Chapter 1 : Introduction and Review 1.1 What is econometrics? 1.2 Review of linear regression 1.3 Review of univariate non-seasonal ARIMA models

1.1 What is econometrics? Econometrics is the main technique used in this course It is a branch in economics that brings together economic theory and statistics to study economic phenomena. Econometric theory is concerned with the development and extension of statistical techniques appropriate for economic data (e.g. VAR models, Cointegration analysis, etc.)

In this course, we are primarily concerned with applied econometrics. In recent years, a branch in econometrics, has emerged with emphasis on financial applications. Examples of econometric tools in terms of financial applicability : Test whether financial markets are weak-form efficient. Test whether the Capital Asset Pricing Model (CAPM) and the Abitrage Pricing Theory (APT) are appropriate models for the determination of returns on risky assets. Measuring and forecasting the volatility of bond or stock returns (e.g. GARCH models). Modelling long-term relationships between prices and exchange rates (cointegration analysis). Forecasting the correlation between the stock indices of two counties. and so on.

Economic Data vs. Financial Data Financial data and Economic data often differ in terms of their frequency, accuracy, seasonality and other properties. Lack of data is a serious problem for economic data analysis. For example, it might be that the data required on government budget deficits, on population figures, which are measured only on an annual basis. Measurement errors and data revisions are the two other major problems with economic data. Economic data are often estimated based on sample information.

These issues are rarely of concern in finance. Generally the prices and other entities recorded are those at which trades actually took place. Also, financial data are observed at much higher frequencies than economic data. Asset prices are often available at minute-by-minute frequency!!

On the other hand, financial data are often very “noisy” (i. e On the other hand, financial data are often very “noisy” (i.e. it is more difficult to separate the trends and other patterns from the random behaviour). Financial data are almost always not normally distributed, but most techniques in econometrics assume they are. High frequency data often contain additional patterns which are the result of the way the market works (e.g. clustering). These features need to be considered in the model-building process.

1.2 Review of Linear Regression Regression analysis is certainly the single most important technique in econometrics. A statistical technique that attempts to explain movements in one variable (dependent) as a function of movements in a set of other variables (explanatory).

Simple Linear Regression Model Yt = 0 + 1Xt + t ; t ~ N(0, 2) The ’s are the coefficients 0 is the constant (intercept) term in the regression equation. 1 is the slope coefficient. It measures the amount Y will change when X changes by 1 unit. 2 is the variance of t , the random disturbance term.

Examples of regressions used in Financial analysis: How asset returns vary with their level of market risk. Measuring the relationship between spot prices and market risk. Constructing an optimal hedge ratio.

Slope Coefficients Recall that we are trying to explain or predict changes in Y using X. 1 defines the relationship for us. For linear models, the slope is constant.

Linear Regression i.e., the dependent variable has a linear relationship with the explanatory variable cannot use linear regression methods if the equation is not linear in the coefficients

Expected Value of Yt 0 + 1Xt is the deterministic part of the regression equation. i.e. it is the value of Yt determined by a given value of Xt , which we assume to be non-stochastic. can also think of it as the expected value of Yt given Xt E(Yt | Xt) = 0 + 1Xt

Stochastic Error Term (t) Almost always the case that Xt alone cannot explain all the variations in Yt . Can add more variables. But there will still be some variation in Yt that cannot be explained by the model. Sources of error : random shocks, measurement errors etc.

Example : Capital Asset Pricing Model (CAPM) A fundamental idea of modern finance is that an investor needs a financial incentive to take a risk Said differently, the expected return on a risky asset, R, must exceed the return on a safe, or a risk free investment, Rf . Thus the expected excess return, R – Rf , should be positive.

According to the CAPM, the expected excess return on an asset is proportional to the expected excess return on a portfolio of all available assets (the “market portfolio”) i.e. R – Rf = (Rm – Rf ) So, a stock with  < 1 (> )1 has less (more) risk than the market portfolio and therefore has a lower (higher) expected excess return them the market portfolio. The ’s are usually estimated by least squares regression.

The table below gives the estimated ’s for six U.S. stocks Company Estimated  Kellogg (breakfast cercal) 0.24 Waste Management (waste disposal) 0.38 Sprint (long distance telephone) 0.59 Walmat (discount retailer) 0.89 Barnes and Noble (book retailer) 1.03 Best Buy (electronic equipment) 1.80 Microsoft (software) 1.83

Finding the “Best Fit” Recall that we are trying to estimate a linear relationship such that the line passing through the data best represents the underlying true relationship. Difference between the actual values and the estimated values is the residual. Best fit  minimize the residuals.

Ordinary Least Squares A regression estimation techniques that calculates estimates of the slope parameters in our linear regression model so as to minimize the squared residuals :

Goodness of Fit measure : R2 , Adjusted R2 Hypothesis testing : Significance of individual coefficients t test Overall significance of model F test Significance of linear restrictions partial F test Serial correlations Durbin-Watson test (1st order) Lagrange Multipler test (higher order)

1.3 Review of Univariate Non-seasonal ARIMA Models Univariate forecasts based on a statistical analysis of the past data. Differs from conventional regression methods in that the mutual dependence of the observations is of primary interest. Forecasts are linear functions of the sample observations.

Box-Jenkins Methodology Identification of the type of model to be used is a critical step. Differs from other univariate techniques in that a thorough study of the properties of a time series is carried out before applying a forecasting technique.

Principle of Parsimony Find efficiently parameterized models. A model with a large number of parameters will achieve a good historical fit, but post-sample forecasts are likely to be poor. By building a model based on past realizations of a time series we are implicitly assuming that there is some regularity in the process generating the series. One way to view such regularity is through the concept of stationarity. The use of Box-Jenkins modeling techniques requires a stationary process.

A stochastic process is a collection {Xt : t = 1, 2, …, T} of random variables ordered in time. Example : the error term in a linear regression model is assumed to be a stochastic process. A stochastic process is weakly stationary if for all t values E[Xt] =  var(Xt) = 2 cov(XtXt-k) = k  t i.e. its statistical properties do not change over time.

More precisely, a stationary series is one for which the mean and variance are constant across time and the covariance between current and lagged values of the series (autocovariances) depends only on the distance between the time points.

Autocorrelations Covariances are often difficult to interpret because they depend on the units of measurement of the data. Correlations, on the other hand, are scale-free. Thus, we can obtain the same information about the time series by computing the autocorrelations of a time series.

Autocorrelations Coefficient The autocorrelation coefficient between Xt and Xt-k is A graph of the autocorrelations is called a correlogram. Knowledge of the correlogram implies knowledge of the process which generated the series and vice versa.

Partial Autocorrelations Another important function in the Box-Jenkins methodology is the partial autocorrelation function. It measures the strength of the relationship between observations in a series controlling for the effect of the intervening time periods.

Box-Jenkins Methodology Identification. Determine, given a sample of time series observations, what the model of the [stationary] data is. Estimation. Estimate the parameters of the chosen model

Diagnosting checking and Model Selection t-test for the significance of individual coefficients Ljung Box-Pierce (Q) test for the correlations of the residuals Minimize the Akaike Information Criteria and the Schwarz Bayesian Criteria Forecasting with fitted model.

Guidelines for Box-Jenkins Identification Model Correlogram Partial Correlogram AR(p) Dies off Truncates after Lag p MA(q) Truncates after Lag q ARMA(p,q)

Applied Example (Box-Jenkins Modelling of Dow-Jones Industrial Index) DJI index is an index of 30 industrial firms’ stock prices dataset : dji.txt The values in this data set represent monthly averages of the end-of-day values for the index over the period January 1984 to February 1994

SAS Program: Example 1.2 data djm; infile 'd:\teaching\ms4221\dji.txt'; input date:monyy5. djiam @@; format date monyy5.; title 'Dow Jones Index Data'; title2 'Monthly Average'; proc print; run; symbol1 i=join; proc gplot data=djm; format date year4.; plot djiam*date/vminor=1; proc arima data=djm; identify var=djiam; identify var=djiam(1); estimate q=1 method=ml; estimate p=1 method=ml; estimate p=1 q=1 method=ml;