Download presentation
Presentation is loading. Please wait.
Published byIsabella Hamilton Modified over 9 years ago
1
Object Orie’d Data Analysis, Last Time PCA Redistribution of Energy - ANOVA PCA Data Representation PCA Simulation Alternate PCA Computation Primal – Dual PCA vs. SVD (centering by means is key)
2
Primal - Dual PCA Toy Example 3: Random Curves, all in Dual Space: 1 * Constant Shift 2 * Linear 4 * Quadratic 8 * Cubic (chosen to be orthonormal) Plus (small) i.i.d. Gaussian noise d = 40, n = 20
3
Primal - Dual PCA Toy Example 3: Raw Data
4
Primal - Dual PCA Toy Example 3: Raw Data Similar Structure to e.g. 1 But Rows and Columns trade places And now cubics visually dominant (as expected)
5
Primal - Dual PCA Toy Example 3: Primal PCA Column Curves as Data Gaussian Noise Only 3 components Poly Scores (as expected)
6
Primal - Dual PCA Toy Example 3: Dual PCA Row Curves as Data Components as expected No Gram-Schmidt (since stronger signal)
7
Primal - Dual PCA Toy Example 3: SVD – Matrix-Image
8
Primal - Dual PCA Toy Example 4: Mystery #1
9
Primal - Dual PCA Toy Example 4: SVD – Curves View
10
Primal - Dual PCA Toy Example 4: SVD – Matrix-Image
11
Primal - Dual PCA Toy Example 4: Mystery #1 Structure: Primal - Dual Constant Gaussian Gaussian Linear Parabola Gaussian Gaussian Cubic Nicely revealed by Full Matrix decomposition and views
12
Primal - Dual PCA Toy Example 5: Mystery #2
13
Primal - Dual PCA Toy Example 5: SVD – Curves View
14
Primal - Dual PCA Toy Example 5: SVD – Matrix-Image
15
Primal - Dual PCA Toy Example 5: Mystery #2 Structure: Primal - Dual Constant Linear Parabola Cubic Gaussian Gaussian Visible via either curves, or matrices …
16
Primal - Dual PCA Is SVD (i.e. no mean centering) always “ better ” ? What does “ better ” mean??? A definition: Provides most useful insights into data Others???
17
Primal - Dual PCA Toy Example where SVD is less informative: Simple Two dimensional Key is subtraction of mean is bad I.e. Mean dir ’ n different from PC dir ’ ns And Mean Less Informative
18
Primal - Dual PCA Toy Example where SVD is less informative: Raw Data
19
Primal - Dual PCA PC1 mode of variation (centered at mean): Yields useful major mode of variation
20
Primal - Dual PCA PC2 mode of variation (centered at mean): Informative second mode of variation
21
Primal - Dual PCA SV1 mode of variation (centered at 0): Unintuitive major mode of variation
22
Primal - Dual PCA SV2 mode of variation (centered at 0): Unintuitive second mode of variation
23
Primal - Dual PCA Summary of SVD: Does give a decomposition I.e. sum of two pieces is data But not good insights about data structure Since center point of analysis is far from center point of data So mean strongly influences the impression of variation Maybe better to keep these separate???
24
Primal - Dual PCA Bottom line on: Primal PCA vs. SVD vs. Dual PCA These are not comparable: Each has situations where it is “ best ” And where it is “ worst ” Generally should consider all And choose on basis of insights See work of Lingsong Zhang on this …
25
Vectors vs. Functions Recall overall structure: Object Space Feature Space Curves (functions) Vectors Connection 1: Digitization Parallel Coordinates Connection 2: Basis Representation
26
Vectors vs. Functions Connection 1: Digitization: Given a function, define vector Where is a suitable grid, e.g. equally spaced:
27
Vectors vs. Functions Connection 1: Parallel Coordinates: Given a vector, define a function where And linearly interpolate to “ connect the dots ” Proposed as High Dimensional Visualization Method by Inselberg (1985)
28
Vectors vs. Functions Parallel Coordinates: Given, define Now can “ rescale argument ” To get function on [0,1], evaluated at equally spaced grid
29
Vectors vs. Functions Bridge between vectors & functions: Vectors Functions Isometry follows from convergence of: Inner Products By Reimann Summation
30
Vectors vs. Functions Main lesson: -OK to think about functions -But actually work with vectors For me, there is little difference But there is a statistical theory, and mathematical statistical literature on this Start with Ramsay & Silverman (2005)
31
Vectors vs. Functions Recall overall structure: Object Space Feature Space Curves (functions) Vectors Connection 1: Digitization Parallel Coordinates Connection 2: Basis Representation
32
Vectors vs. Functions Connection 2: Basis Representations: Given an orthonormal basis (in function space) E.g. –Fourier –B-spline –Wavelet Represent functions as:
33
Vectors vs. Functions Connection 2: Basis Representations: Represent functions as: Bridge between discrete and continuous:
34
Vectors vs. Functions Connection 2: Basis Representations: Represent functions as: Finite dimensional approximation: Again there is mathematical statistical theory, based on (same ref.)
35
Vectors vs. Functions Repeat Main lesson: -OK to think about functions -But actually work with vectors For me, there is little difference (but only personal taste)
36
PCA for shapes New Data Set: Corpus Callossum Data “ Window ” between right and left halves of the brain From a vertical slice MR image of head “ Segmented ” (ie. found boundary) Shape is resulting closed curve Have sample from n = 71 people Feature vector of d = 80 coeffic ’ ts from Fourier boundary representation (closed curve)
37
PCA for shapes Raw Data: Special thanks to Sean HoSean Ho View curves asmovie Modes of variation?
38
PCA for shapes PC1: Movie shows evolution along eigenvector Projections in bottom plot 2 Data Subclasses Schizophrenics Controls
39
PCA for shapes PC1 Summary (Corpus Callossum Data) Direction is “ overall bending ” Colors studied later (sub populations) An outlier??? Find it in the data? Case 2: could delete & repeat (will study outliers in more detail)
40
PCA for shapes Raw Data: This time with numbers So can identify outlier
41
PCA for shapes PC2: Movie shows evolution along eigenvector Projections in bottom plot
42
PCA for shapes PC2 Summary (Corpus Callossum Data) Rotation of right end “ Sharpening ” of left end “ Location ” of left end These are correlated with each other But independent of PC1
43
PCA for shapes PC3: Thin vs. fat Important mode of variation?
44
PCA for shapes Raw Data: Revisit to look for 3 modes Bending Endpts Thinning
45
PCA for shapes Raw Data: Medial Repr ’ n Heart is Medial Atoms Spokes imply boundary Modes of Variation?
46
PCA for shapes PC1 Summary (medial representation) From same data as above Fourier boundary rep ’ n But they look different Since different type of fitting was done Also, worst outlier was deleted Modes of variation?
47
PCA for shapes PC1: Overall Bending Same as for Fourier above Corr ’ d with right end fattening
48
PCA for shapes PC2: Rotation of ends Similar to PC2 of Fourier rep ’ n above
49
PCA for shapes PC3: Distortion of Curvature Different from PC2 of Fourier rep ’ n above
50
PCA for shapes PC3 Summary (medial representation) Systematic “ distortion of curvature ” This time different from above Fourier boundary PC3 Lesson: different rep ’ ns focus on different aspects of data I.e. not just differences in fitting But instead on features that are emphasized Thus choice of “ features ” is very important
51
PCA for shapes PC4: Fattening and Thinning? Relate to Fourier rep ’ n ???
52
PCA for shapes PC4 Summary (medial representation) more like fattening and thinning i.e. similar to Fourier boundary PC3 (view again below) but “ more local ” in nature an important property of M-reps
53
PCA for shapes PC3: Review this For Comparison with PC4 from M-reps
55
Cont. vs. discrete Need to say something about this
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.