SVD, PCA, AND THE NFL By: Andrew Zachary.

Slides:



Advertisements
Similar presentations
Principal Component Analysis (PCA) or Empirical Orthogonal Functions (EOFs) Arnaud Czaja (SPAT Data analysis lecture Nov. 2011)
Advertisements

Eigen Decomposition and Singular Value Decomposition
PCA Data. PCA Data minus mean Eigenvectors Compressed Data.
Chapter 28 – Part II Matrix Operations. Gaussian elimination Gaussian elimination LU factorization LU factorization Gaussian elimination with partial.
Covariance Matrix Applications
Tensors and Component Analysis Musawir Ali. Tensor: Generalization of an n-dimensional array Vector: order-1 tensor Matrix: order-2 tensor Order-3 tensor.
Dimensionality Reduction PCA -- SVD
M. A. Miceli “SVD and LS” - Stats in the Château - August 31 - September SVD and LS M.A. Miceli University of Rome I Stats in the Château Jouy-en-Josas.
1er. Escuela Red ProTIC - Tandil, de Abril, 2006 Principal component analysis (PCA) is a technique that is useful for the compression and classification.
Lecture 19 Singular Value Decomposition
1cs542g-term High Dimensional Data  So far we’ve considered scalar data values f i (or interpolated/approximated each component of vector values.
Principal Component Analysis
3D Geometry for Computer Graphics
Unsupervised Learning - PCA The neural approach->PCA; SVD; kernel PCA Hertz chapter 8 Presentation based on Touretzky + various additions.
Lecture 19 Quadratic Shapes and Symmetric Positive Definite Matrices Shang-Hua Teng.
Principal component analysis (PCA)
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 9(b) Principal Components Analysis Martin Russell.
Singular Value Decomposition (SVD) (see Appendix A.6, Trucco & Verri) CS485/685 Computer Vision Prof. George Bebis.
Face Recognition Using Eigenfaces
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Microarray analysis Algorithms in Computational Biology Spring 2006 Written by Itai Sharon.
Lecture 20 SVD and Its Applications Shang-Hua Teng.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Principal Component Analysis. Consider a collection of points.
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 9 Data Analysis Martin Russell.
Dimensionality reduction Kenneth D. Harris 24/6/15.
Techniques for studying correlation and covariance structure
Matlab for Scientific Programming A Brief Introduction Mark Levene Follow the links to learn more! Many features will be demonstrated.
Chapter 2 Dimensionality Reduction. Linear Methods
Presented By Wanchen Lu 2/25/2013
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
Eigen Decomposition Based on the slides by Mani Thomas Modified and extended by Longin Jan Latecki.
1 Dimension Reduction Examples: 1. DNA MICROARRAYS: Khan et al (2001): 4 types of small round blue cell tumors (SRBCT) Neuroblastoma (NB) Rhabdomyosarcoma.
Canonical Correlation Analysis and Related Techniques Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia.
Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu February, 2010.
Chapter 7 Multivariate techniques with text Parallel embedded system design lab 이청용.
Principal Components: A Mathematical Introduction Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University.
Introduction to Linear Algebra Mark Goldman Emily Mackevicius.
Reduces time complexity: Less computation Reduces space complexity: Less parameters Simpler models are more robust on small datasets More interpretable;
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete populations Katarzyna Bryc Postdoctoral Fellow, Reich Lab, Harvard.
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
Matrix Factorization & Singular Value Decomposition Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Xiaoying Pang Indiana University March. 17 th, 2010 Independent Component Analysis for Beam Measurement.
Lecture Note 1 – Linear Algebra Shuaiqiang Wang Department of CS & IS University of Jyväskylä
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
CS246 Linear Algebra Review. A Brief Review of Linear Algebra Vector and a list of numbers Addition Scalar multiplication Dot product Dot product as a.
Principal Component Analysis (PCA)
Exploring Microarray data
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Lecture: Face Recognition and Feature Reduction
Singular Value Decomposition
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Techniques for studying correlation and covariance structure
Covariance Vs Correlation Matrix
Recitation: SVD and dimensionality reduction
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Outline Singular Value Decomposition Example of PCA: Eigenfaces.
Digital Image Processing Lecture 21: Principal Components for Description Prof. Charlene Tsai *Chapter 11.4 of Gonzalez.
Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 – 14, Tuesday 8th November 2016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR)
Lecture 13: Singular Value Decomposition (SVD)
Eigen Decomposition Based on the slides by Mani Thomas and book by Gilbert Strang. Modified and extended by Longin Jan Latecki.
Principal Component Analysis
Canonical Correlation Analysis and Related Techniques
Eigen Decomposition Based on the slides by Mani Thomas
Lecture 20 SVD and Its Applications
Marios Mattheakis and Pavlos Protopapas
Outline Variance Matrix of Stochastic Variables and Orthogonal Transforms Principle Component Analysis Generalized Eigenvalue Decomposition.
Presentation transcript:

SVD, PCA, AND THE NFL By: Andrew Zachary

Introduction What is PCA? What is SVD? What is the NFL? Principal Component Analysis What is SVD? Singular Value Decomposition What is the NFL? National Football League

SVD 𝐴=𝑼𝚺 𝐕 ∗

Computing the SVD Matrices Consider, 𝐴 𝐴 𝑇 = 𝑈Σ 𝑉 ∗ 𝑈Σ 𝑉 ∗ 𝑇 = 𝑈Σ 𝑉 ∗ 𝑉Σ 𝑈 ∗ =𝑈 Σ 2 𝑈 ∗ 𝐴 𝐴 𝑇 𝑈=𝑈 Σ 2 And, 𝐴 𝑇 𝐴= 𝑈Σ 𝑉 ∗ 𝑇 𝑈Σ 𝑉 ∗ =𝑉Σ 𝑈 ∗ 𝑈Σ 𝑉 ∗ =𝑉 Σ 2 𝑉 ∗ 𝐴 𝑇 𝐴𝑉=𝑉 Σ 2

Eigenvalue Problem 𝑈=𝜆 Σ= 𝜎 1 0 0 𝜎 2 Where 𝜆 is the eigenvectors and 𝜎 1 and 𝜎 2 are the eigenvalues of 𝐴 𝐴 𝑇 𝑈=𝑈 Σ 2 Similarly, 𝑉=𝜆 Where 𝜆 is the eigenvectors and 𝜎 1 and 𝜎 2 are the eigenvalues of 𝐴 𝑇 𝐴𝑉 =𝑉 Σ 2

Covariance Matrix 𝐶 𝑋 = 1 𝑛−1 𝑋 𝑋 𝑇 Where X is a matrix of data and n is the number of data points. Elements from the covariance matrix Diagonals → Variance (dynamics) Off-diagonals → Covariance (redundancy)

PCA Ideal basis No Redundancy Diagonalization

NFL NFL Ranking Fantasy Ranking PCA Ranking Single stat Several Stats (Position Dependent) PCA Ranking All available NFL stats

NFL Correlation Matrix

Correlation vs. Covariance 𝜌 𝐴,𝐵 = 1 𝑁−1 𝑖=1 𝑁 𝐴 𝑖 − 𝜇 𝐴 𝜎 𝐴 𝐵 𝑖 − 𝜇 𝐵 𝜎 𝐵 , 𝑃𝑒𝑎𝑟𝑠𝑜𝑛 𝐶𝑜𝑒𝑓. 𝜌 𝐴,𝐵 = 𝑐𝑜𝑣 𝐴,𝐵 𝜎 𝐴 𝜎 𝐵 𝑅= 𝜌(𝐴,𝐴) 𝜌(𝐴,𝐵) 𝜌(𝐵,𝐴) 𝜌(𝐵,𝐵)

Singular Values PCA Projection: 𝑋 𝑝 =𝑈 Σ 𝑟𝑒𝑑𝑢𝑐𝑒𝑑

Results

Conclusions PCA is a powerful data analysis tool PCA is the ideal basis for a dataset SVD and Covariance Matrix equal PCA PCA has strong applications in sports

References “Data-Driven Modeling and Scientific Computation, Methods for Complex Systems & Big Data”, J. Nathan Kutz, 2013 “An Introductory application of Principal Components to Cricket Data”, Ananda B. W. Manage et. al., 2013