Lecture #9 The four fundamental subspaces. Outline SVD and its uses SVD: basic features SVD: key properties Examples: simple reactions & networks Genome-scale.

Slides:



Advertisements
Similar presentations
Lecture #8 Stoichiometric Structure. Outline Cofactors and carriers Bi-linear nature of reactions Pathways versus cofactors Basics of high energy bond.
Advertisements

3D Geometry for Computer Graphics
Topological Properties of the Stoichiometric Matrix
Mutidimensional Data Analysis Growth of big databases requires important data processing.  Need for having methods allowing to extract this information.
Lecture #10 Metabolic Pathways. Outline Glycolysis; a central metabolic pathway Fundamental structure (m x n = 20 x 21) Co-factor coupling (NAD, ATP,
Dimensionality reduction. Outline From distances to points : – MultiDimensional Scaling (MDS) Dimensionality Reductions or data projections Random projections.
Lecture #1 Introduction.
The (Right) Null Space of S Systems Biology by Bernhard O. Polson Chapter9 Deborah Sills Walker Lab Group meeting April 12, 2007.
Object Orie’d Data Analysis, Last Time Finished NCI 60 Data Started detailed look at PCA Reviewed linear algebra Today: More linear algebra Multivariate.
Lecture 19 Singular Value Decomposition
Lecture 7: Principal component analysis (PCA)
Principal Component Analysis
Mathematical Representation of Reconstructed Networks The Left Null space The Row and column spaces of S.
Computer Graphics Recitation 5.
3D Geometry for Computer Graphics
Singular Value Decomposition
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
3D Geometry for Computer Graphics
10-603/15-826A: Multimedia Databases and Data Mining SVD - part I (definitions) C. Faloutsos.
Lecture 20 SVD and Its Applications Shang-Hua Teng.
Ordinary least squares regression (OLS)
Lecture #11 Coupling Pathways. Outline Some biochemistry The pentose pathway; –a central metabolic pathway producing pentoses and NADPH Co-factor coupling.
Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka Virginia de Sa (UCSD) Cogsci 108F Linear.
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
SVD(Singular Value Decomposition) and Its Applications
Extreme Pathways introduced into metabolic analysis by the lab of Bernard Palsson (Dept. of Bioengineering, UC San Diego). The publications of this lab.
Summarized by Soo-Jin Kim
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
6.837 Linear Algebra Review Patrick Nichols Thursday, September 18, 2003.
MA2213 Lecture 5 Linear Equations (Direct Solvers)
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
Feature extraction 1.Introduction 2.T-test 3.Signal Noise Ratio (SNR) 4.Linear Correlation Coefficient (LCC) 5.Principle component analysis (PCA) 6.Linear.
CSE554AlignmentSlide 1 CSE 554 Lecture 5: Alignment Fall 2011.
CPSC 491 Xin Liu Nov 17, Introduction Xin Liu PhD student of Dr. Rokne Contact Slides downloadable at pages.cpsc.ucalgary.ca/~liuxin.
D. van Alphen1 ECE 455 – Lecture 12 Orthogonal Matrices Singular Value Decomposition (SVD) SVD for Image Compression Recall: If vectors a and b have the.
Chapter 10: The Left Null Space of S - or - Now we’ve got S. Let’s do some Math and see what happens. - or - Now we’ve got S. Let’s do some Math and see.
Principal Component vs. Common Factor. Varimax Rotation Principal Component vs. Maximum Likelihood.
Introduction to Linear Algebra Mark Goldman Emily Mackevicius.
Review of Linear Algebra Optimization 1/16/08 Recitation Joseph Bradley.
20. Lecture WS 2006/07Bioinformatics III1 V20 Extreme Pathways introduced into metabolic analysis by the lab of Bernard Palsson (Dept. of Bioengineering,
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
CMU SCS : Multimedia Databases and Data Mining Lecture #18: SVD - part I (definitions) C. Faloutsos.
Lecture #16 The Left Null Space of S. Outline 1.Definition 2.Convex basis – formation of non- negative pools 3.Alignment of the affine concentration space.
1. Systems of Linear Equations and Matrices (8 Lectures) 1.1 Introduction to Systems of Linear Equations 1.2 Gaussian Elimination 1.3 Matrices and Matrix.
Feature Extraction 主講人:虞台文. Content Principal Component Analysis (PCA) PCA Calculation — for Fewer-Sample Case Factor Analysis Fisher’s Linear Discriminant.
Factor Analysis Basics. Why Factor? Combine similar variables into more meaningful factors. Reduce the number of variables dramatically while retaining.
Instructor: Mircea Nicolescu Lecture 8 CS 485 / 685 Computer Vision.
Matrix Factorization & Singular Value Decomposition Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Feature Extraction 主講人:虞台文.
Chapter 61 Chapter 7 Review of Matrix Methods Including: Eigen Vectors, Eigen Values, Principle Components, Singular Value Decomposition.
Unsupervised Learning II Feature Extraction
Reduced echelon form Matrix equations Null space Range Determinant Invertibility Similar matrices Eigenvalues Eigenvectors Diagonabilty Power.
CS246 Linear Algebra Review. A Brief Review of Linear Algebra Vector and a list of numbers Addition Scalar multiplication Dot product Dot product as a.
Introduction to Vectors and Matrices
Principal Component Analysis (PCA)
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
ECE 3301 General Electrical Engineering
Back to Chapter 10: Sections
Structure from motion Input: Output: (Tomasi and Kanade)
Multivariate Analysis: Theory and Geometric Interpretation
Symmetric Matrices and Quadratic Forms
The Convex Basis of the Left Null Space of the Stoichiometric Matrix Leads to the Definition of Metabolically Meaningful Pools  Iman Famili, Bernhard.
Lecture 13: Singular Value Decomposition (SVD)
Introduction to Vectors and Matrices
Structure from motion Input: Output: (Tomasi and Kanade)
Lecture 20 SVD and Its Applications
Presentation transcript:

Lecture #9 The four fundamental subspaces

Outline SVD and its uses SVD: basic features SVD: key properties Examples: simple reactions & networks Genome-scale stoichiometric matrices Examples Tilting of basis vectors

SVD AND ITS USES

The Singular Value Decomposition (SVD) v x S=U  V T dx dt =Sv;; v VTVT  “stretches” x S U diagonal matrix VTvVTv linear combination of fluxes v VTVT  x U x =U( ) time derivatives are a linear combination ()

Singular Value Decomposition in Image Processing 5 values10 values303 values30 values52 valuesOriginal

Applications of Singular Value Decomposition Image processing Noise reduction Kinematics mRNA expression analysis

SVD: BASIC FEATURES

dx dt =Sv MATLAB: [U, S, V]= svd(A) Numerical check: ||A-USV T ||=0 ? dx dt =U  V T v

The Singular Values Diagonal entries in  The singular values  1,  2,…….  r largesmall singular value spectrum fractional singular values fi=fi= ii ii  r i=1 Fi=Fi=  i k=1 f k ; F r =1 cumulative fractional singular values

Orthonormal Basis Sets R(S) N(S) C(S) LN(S) S =U  V T x’x’ Dim =n Dim =m Dim (R) =r Dim(N) =n-r Dim(LN) =m-r Dim(C) = r v

SVD: KEY PROPERTIES

Property #1: Mode by Mode Reconstruction of S S=   i r i=1 =1=1 () ( ) + 22 () ( ) +…… =1=1 ( ) + 22 +…… m x n m x ll x n m x n ||||~1 ||u i ||=1 ||u j ||=1 =0 i≠j =1i=j definition of orthonormality  are scaling factors: i.e., S= ….

Property #2: S Maps the Right Singular Vector (v i ) to the Left Singular Vector (u i ) S = U  V T SV = U  (V T V) SV = U  S ( ) = ( )( ) |||| 0 0 k kk Sv k =  k u k ( )| = | (xV) =I U T U=I V T V=I |||| () ( ) =I U T =U -1 v k  k u k S m x m n x n Independent dimension in the Row space Independent dimension in the Columnspace Dimension in the Row space

Orthonormality and dynamic decoupling VTVT  x’x’ S U v 0 0 ( )(|) d(|) dt = x U T Decoupled motion

EXAMPLES

Example #1 m = n=2; r=1

Bounded Spaces

Example #2

Orthonormal basis for Column and Left Null Left Null Col 3 3

An Alternative Set of Vectors for the Left Null Space 1 ( ) Col l 1 and l 2 are convex basis vectors (0,1,1)

GENOME-SCALE STOICHIOMETRIC MATRICES

SVD of S: global view

Dynamic equation Flux drivers Motion of concentrations Mapping: from fluxes to concentration time derivatives

Mapping: chemical reaction interpretation

Systems rate equation Systems pseudo-elementary reactions: v_ki are ‘systems’ partition numbers Systems (eigen) reaction: u_ki are ‘systems’ stoichiometric coefficients Systemic chemical reactions

DECOMPOSITION OF THE CORE METABOLIC NETWORK IN E. COLI

Systemic reactions w/o biomass Translocation of a proton ATP synthesis Transhydrogenation and AcCoA charging

Systemic reactions w/ biomass Translocation of a proton And ATP synthesis Growth Transhydrogenation and AcCoA charging

DECOMPOSITION OF GENOME- SCALE MATRICES

The singular value spectrum

1st mode: high energy phosphate bonds Motion: stoichiometry Drivers: reactions

2 nd mode: NADPH redox metabolism Motion: stoichiometry Drivers: reactions

3 rd Mode: translocated proton Motion: stoichiometry Drivers: reactions

4 th mode Motion: stoichiometry Drivers: reactions

Angles as measures of similarity

ROTATING BASIS VECTORS

The effects of rotation: Rotation of the Basis Vectors for Col(S) Metabolites NADP, NADPH Q, QH 2 ATP,ADP

Interpreting the basis vectors Interpreting the variable loadings on the basis vectors can be hard due to the maximal variance characteristic of SVD. In order to gain biological insights from the principal components, the basis vectors can be rotated. – Rotation is just a change of basis. – There is no gain or loss of information From Barrett et al

Applying to Metabolic Networks Compute bases vectors for the subspaces of S Rotate the PC’s and interpret biochemical basis Identify Reaction and compound sets that define the basis

Basis Rotation Methods The two major categories 1.Orthogonal Rotations: maintain all PCs perpendicular to each other Examples: varimax, orthomax, quartimax 2.Oblique Rotations: Relax the orthogonality constraint gain simplicity in the interpretation. Allow PCs to be correlated Examples: promax, oblimin In MATLAB A=rotatefactors(B,’Method’,…)

Summary S=U  V T is the most fundamental decomposition of a matrix  has the singular values and gives the “effective” dimensionality of the mapping that S represents U and V T have orthonormal basis vectors for the four subspaces We may want oblique basis vectors to represent chemistry/biology

The end

Extras

Summary (detailed) SVD provides unbiased and decoupled information about all the fundamental subspaces of S simultaneously. The first r columns of the left singular matrix U contain a basis for the column space of S, and the remaining m-r columns contain a basis for the left null space. The first r columns of the right singular matrix r contain a basis for the row space of S and the remaining n-r columns contain a basis for the null space. The sets of basis vectors in U and V are orthonormal. The first r columns of U give systemic reactions, analogous to a single column of S, representing a single reaction. The corresponding column of V gives the combination of the reactions that drive a systemic reaction. Orthonormal basis vectors are mathematically convenient but not necessarily biologically or chemically meaningful.

Methods for Factor Rotation The two major categories 1.Orthogonal Rotations: maintain all PCs perpendicular to each other Examples: varimax, orthomax, quartimax 2.Oblique Rotations: Relax the orthogonality constraint Gain simplicity in the interpretation. Allow PCs to be correlated Examples: promax, oblimin In MATLAB A=rotatefactors(B,’Method’,…) CategoryMethod NameComments Orthnormal Quartimax Maximizes the sums of squares of the coefficients across the resultant vectors for each of the original variable Varimax Maximizes the sum of the variance of the loading vectors Equimax Spread the extracted variance evenly across the rotated factors Oblique Promax Uses an orthogonal solution as the basis for creating an oblique solution using a procrustes rotation Oblimax Maximizes the kurtosis of all the loadings across all variables and factors without consideration of the relative positon of the factors Direct Oblimin Similar to a quartimax approach, but minimizes and does away with reference vectors

APPLICATIONS OF FACTOR ROTATION TO METABOLIC NETWORKS Compute bases vectors for the subspaces of S Rotate the PC’s and interpret biochemical basis Identify Reaction and compound sets that define the basis

FACTOR ROTATION ON THE CORE E. COLI MODEL

Singular Value Spectrum of the core E. coli 14 Modes account for >50 % of the network. 43 out of 72 modes account for > 90% of the network. Modes FiFi F i : Cumulative fractional singular value

Rotation of the Basis Vectors for Col(S) 1 st Mode High Energy Phosphate Bonds H,ATP,H 2 O, ADP, P i ATP, ADP Before Rotation After Rotation NAD, NADH, CoA, NADPH, CO 2, NADP, NADPH NAD, NADH 3 rd Mode NAD Redox metabolism

Rotation of the Basis Vectors for Col(S) Metabolites NADP, NADPH Q, QH 2 ATP,ADP

Rotation of the Basis Vectors for LN(S) Modes Metabolites AMP,ADP,ATPNADH,NAD Upon rotation, the time invariant pools are clearly resolved CoA,SuccCoA NADH,NAD,NADPH, NADP, QH 2

ROTATION OF BASIS VECTORS AT THE GENOME-SCALE

Rotating the bases vectors of LN(S) for iAF1260 The LN(S) basis vectors correspond to time invariant pools The pools found are: – Amino acyl tRNAs – tRNAs – Charge Carriers (NADH. NAD) – Co-factor Pools – Apolipoprotein-lipoprotein Factor Loading

TILTING OF BASIS VECTORS

Tilting the Left Null Space of iAF1260 The basis vectors correspond to the time invariant pools – Amino acyl tRNAs – tRNAs – Charge carriers (NADH + NAD) – Other co-factor pools – Apolipoprotein-lipoprotein