Principal Components Shyh-Kang Jeng

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Lesson 10: Linear Regression and Correlation
Covariance Matrix Applications
Inference for Regression
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Dimension reduction (1)
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Linear Regression and Correlation
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Correlation and Regression Analysis
Techniques for studying correlation and covariance structure
1 Multivariate Normal Distribution Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Lecture 5 Correlation and Regression
1 Statistical Tools for Multivariate Six Sigma Dr. Neil W. Polhemus CTO & Director of Development StatPoint, Inc.
Factor Analysis and Inference for Structured Covariance Matrices
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 2 Dimensionality Reduction. Linear Methods
Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.
Model Building III – Remedial Measures KNNL – Chapter 11.
1 Comparison of Several Multivariate Means Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute.
Principal Component Analysis Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
1 Multivariate Linear Regression Models Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
11 Comparison of Several Multivariate Means Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Interpreting Principal Components Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University L i n.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Sample Geometry and Random Sampling Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
1 Matrix Algebra and Random Vectors Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
EE4-62 MLCV Lecture Face Recognition – Subspace/Manifold Learning Tae-Kyun Kim 1 EE4-62 MLCV.
1 Inferences about a Mean Vector Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
1 Factor Analysis and Inference for Structured Covariance Matrices Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Principal Component Analysis
1 Canonical Correlation Analysis Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
1 Principal Components Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking and Multimedia.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Using the regression equation (17.6). Why regression? 1.Analyze specific relations between Y and X. How is Y related to X? 2.Forecast / Predict the variable.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Estimating standard error using bootstrap
Howard Community College
Factor and Principle Component Analysis
Exploring Microarray data
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Principal Components Analysis
Dimension Reduction via PCA (Principal Component Analysis)
Interpreting Principal Components
Measuring latent variables
Matrix Algebra and Random Vectors
Confidence Ellipse for Bivariate Normal Data
Principal Components Analysis
Multivariate Linear Regression Models
Chapter_19 Factor Analysis
Factor Analysis (Principal Components) Output
Multivariate Methods Berlin Chen
Determining Which Method to use
Principal Component Analysis
Principal Component Analysis
Measuring latent variables
Canonical Correlation Analysis
Comparison of Two Univariate Means
Presentation transcript:

Principal Components Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking and Multimedia

Concept of Principal Components x2 x1

Principal Component Analysis Explain the variance-covariance structure of a set of variables through a few linear combinations of these variables Objectives Data reduction Interpretation Does not need normality assumption in general

Principal Components

Result 8.1

Proof of Result 8.1

Result 8.2

Proof of Result 8.2

Proportion of Total Variance due to the kth Principal Component

Result 8.3

Proof of Result 8.3

Example 8.1

Example 8.1

Example 8.1

Geometrical Interpretation

Geometric Interpretation

Standardized Variables

Result 8.4

Proportion of Total Variance due to the kth Principal Component

Example 8.2

Example 8.2

Principal Components for Diagonal Covariance Matrix

Principal Components for a Special Covariance Matrix

Principal Components for a Special Covariance Matrix

Sample Principal Components

Sample Principal Components

Example 8.3

Example 8.3

Scree Plot to Determine Number of Principal Components

Example 8.4: Pained Turtles

Example 8.4

Example 8.4: Scree Plot

Example 8.4: Principal Component One dominant principal component Explains 96% of the total variance Interpretation

Geometric Interpretation

Standardized Variables

Principal Components

Proportion of Total Variance due to the kth Principal Component

Example 8.5: Stocks Data Weekly rates of return for five stocks X1: Allied Chemical X2: du Pont X3: Union Carbide X4: Exxon X5: Texaco

Example 8.5

Example 8.5

Example 8.6 Body weight (in grams) for n=150 female mice were obtained after the birth of their first 4 litters

Example 8.6

Comment An unusually small value for the last eigenvalue from either the sample covariance or correlation matrix can indicate an unnoticed linear dependency of the data set One or more of the variables is redundant and should be deleted Example: x4 = x1 + x2 + x3

Check Normality and Suspect Observations Construct scatter diagram for pairs of the first few principal components Make Q-Q plots from the sample values generated by each principal component Construct scatter diagram and Q-Q plots for the last few principal components

Example 8.7: Turtle Data

Example 8.7

Large Sample Distribution for Eigenvalues and Eigenvectors

Confidence Interval for li

Approximate Distribution of Estimated Eigenvectors

Example 8.8

Testing for Equal Correlation

Example 8.9

Monitoring Stable Process: Part 1

Example 8.10 Police Department Data *First two sample cmponents explain 82% of the total variance

Example 8.10: Principal Components

Example 8.10: 95% Control Ellipse

Monitoring Stable Process: Part 2

Example 8.11 T2 Chart for Unexplained Data

Example 8.12 Control Ellipse for Future Values *Example 8.10 data after dropping out-of-control case

Example 8.12 99% Prediction Ellipse

Avoiding Computation with Small Eigenvalues