Principal Components The Basics Principal Components 1. 11/30/2018.

Slides:



Advertisements
Similar presentations
Multivariate Description. What Technique? Response variable(s)... Predictors(s) No Predictors(s) Yes... is one distribution summary regression models...
Advertisements

Canonical Correlation
Lesson 10: Linear Regression and Correlation
Structural Equation Modeling
Mutidimensional Data Analysis Growth of big databases requires important data processing.  Need for having methods allowing to extract this information.
Factor Analysis and Principal Components Removing Redundancies and Finding Hidden Variables.
Economics 173 Business Statistics Lecture 14 Fall, 2001 Professor J. Petry
Objectives (BPS chapter 24)
Response Surface Method Principle Component Analysis
Lecture 7: Principal component analysis (PCA)
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Overview This model takes advantage of the correlation between price movements in commodities future contracts and price movements in equities’ whose core.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Corporate Finance Introduction to risk Prof. André Farber SOLVAY BUSINESS SCHOOL UNIVERSITÉ LIBRE DE BRUXELLES.
Combining Individual Securities Into Portfolios (Chapter 4)
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Estimating High Dimensional Covariance Matrix and Volatility Index by making Use of Factor Models Celine Sun R/Finance 2013.
Factor Analysis Psy 524 Ainsworth.
Chapter 2 Dimensionality Reduction. Linear Methods
Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.
Empirical Financial Economics Asset pricing and Mean Variance Efficiency.
 Lecture #9.  The course assumes little prior applied knowledge in the area of finance.  References  Kristina (2010) ‘Investment Analysis and Portfolio.
A P STATISTICS LESSON 2 – 2 STANDARD NORMAL CALCULATIONS.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Portfolio risk and return measurement Module 5.2.
1 Dimension Reduction Examples: 1. DNA MICROARRAYS: Khan et al (2001): 4 types of small round blue cell tumors (SRBCT) Neuroblastoma (NB) Rhabdomyosarcoma.
Tests and Measurements Intersession 2006.
Advanced Correlational Analyses D/RS 1013 Factor Analysis.
Multiple Linear Regression Partial Regression Coefficients.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Techniques for studying correlation and covariance structure Principal Components Analysis (PCA) Factor Analysis.
Principle Component Analysis and its use in MA clustering Lecture 12.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Managing Portfolios: Theory
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
AP STATISTICS LESSON 3 – 3 (DAY 2) The role of r 2 in regression.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Types of risk Market risk
Optimal Risky Portfolios
REGRESSION G&W p
Information Management course
Exploring Microarray data
COMP 1942 PCA TA: Harry Chan COMP1942.
CHAPTER 8 Risk and Rates of Return
6 Efficient Diversification Bodie, Kane and Marcus
Introduction to Sampling Distributions
Kin 304 Regression Linear Regression Least Sum of Squares
Information Management course
Principal Component Analysis
BPK 304W Regression Linear Regression Least Sum of Squares
Dimension Reduction via PCA (Principal Component Analysis)
BPK 304W Correlation.
Slope of the regression line:
Types of risk Market risk
Introduction to Behavioral Statistics
Quality Control at a Local Brewery
Principal Component Analysis
No notecard for this quiz!!
AP STATISTICS LESSON 3 – 3 (DAY 2)
Descriptive Statistics vs. Factor Analysis
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Optimal Risky Portfolios
Principal Component Analysis (PCA)
Principal Component Analysis
Section 6.2 Prediction.
Figure 6.1 Risk as Function of Number of Stocks in Portfolio
Lecture 8: Factor analysis (FA)
3 basic analytical tasks in bivariate (or multivariate) analyses:
Marios Mattheakis and Pavlos Protopapas
Presentation transcript:

Principal Components The Basics Principal Components 1. 11/30/2018

Dimensionality Reduction One of the most fundamental questions in managing a portfolio is what risk is it exposed to, and what do these sources of risk look like. If we have a portfolio with 8,000 different securities, it may well be that there are just 20 sources of fundamental risk that it is exposed to. Principal Components 1. 11/30/2018

Dimensionality Reduction In the Treasury Complex, for example, it is long observed that only 3 factors are needed to account for all the risks in the entire yield curve. Litterman and Scheinkman use Principal Components on the bond yields, and identify the 3 factors: Level Slope Curvature (or Volatility) Principal Components 1. 11/30/2018

Principal Components Analysis We can use S-Plus to analyze the data and provide factor loadings, as well as report the relative importance of each factor. First set up the data so that it is as you want to analyze in Excel. Next launch S-plus, and under the “File” heading, select “Import Data” and then “From File.” Identify your excel file here. Principal Components 1. 11/30/2018

PCA 2 Under the Statistics Header, Select “Multivariate.” Then “Principal Components.” Principal Components 1. 11/30/2018

*** Principal Components Analysis *** Standard deviations: Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 0.02774643 0.009870496 0.001872219 0.0009389359 0.0004868663 The number of variables is 5 and the number of observations is 351 Principal Components 1. 11/30/2018

Component names: "sdev" "loadings" "correlations" "scores" "center" "scale" "n.obs" "terms" "call" "factor.sdev" "coef" Call: princomp(x = ~ ., data = kfbyo, scores = T, cor = F, na.action = na.exclude) Principal Components 1. 11/30/2018

Importance of components: Comp.1 Comp.2 Comp.3 Comp.4 Standard deviation 0.02774643 0.009870496 0.001872219 0.0009389359 Proportion of Variance 0.88295817 0.111738742 0.004020121 0.0010111083 Cumulative Proportion 0.88295817 0.994696910 0.998717032 0.9997281400 Comp.5 Standard deviation 0.0004868663 Proportion of Variance 0.0002718600 Cumulative Proportion 1.0000000000 Principal Components 1. 11/30/2018

Loadings: Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 X90.day 0.619 -0.348 0.447 0.398 0.370 X180.day 0.612 -0.292 -0.222 -0.529 -0.460 X5.Year 0.388 0.403 -0.713 0.207 0.368 X15.Year 0.261 0.615 0.289 0.353 -0.588 X25.Year 0.152 0.503 0.398 -0.628 0.413 Principal Components 1. 11/30/2018

The “Factors” We can use this analysis to construct the historical realizations of each of our factors. In this example, the first PC is equal to: .69 * y1 + .612 * y2 + .388 * y3 + .261 * y4 + .152 * y5. We can use this along with the actual yields to construct the “mimicking portfolio” for the first factor. Principal Components 1. 11/30/2018

Factor Regressions Next, when we regress each of the yields on the factor, we note that indeed the regression coefficient corresponds to the weight we put on that yield in constructing the factor. We also see the importance of each factor in explaining each yield. For example, we know that the first factor explains 88.3% of the total variance of the entire yield curve. Principal Components 1. 11/30/2018

Variance Decomposition This includes 95.8% of the 90-Day Bill yield, but only 41.0% of the 25-Year PO Strip yield. But note that during this sample, the variance (standard deviation) of the former is .0003 (1.8%) while the variance of the latter is only .00004 (0.66%). Principal Components 1. 11/30/2018