Presentation is loading. Please wait.

Presentation is loading. Please wait.

Object Orie’d Data Analysis, Last Time PCA Redistribution of Energy - ANOVA PCA Data Representation PCA Simulation Alternate PCA Computation Primal – Dual.

Similar presentations


Presentation on theme: "Object Orie’d Data Analysis, Last Time PCA Redistribution of Energy - ANOVA PCA Data Representation PCA Simulation Alternate PCA Computation Primal – Dual."— Presentation transcript:

1 Object Orie’d Data Analysis, Last Time PCA Redistribution of Energy - ANOVA PCA Data Representation PCA Simulation Alternate PCA Computation Primal – Dual PCA vs. SVD (centering by means is key)

2 Primal - Dual PCA Toy Example 3: Random Curves, all in Dual Space: 1 * Constant Shift 2 * Linear 4 * Quadratic 8 * Cubic (chosen to be orthonormal) Plus (small) i.i.d. Gaussian noise d = 40, n = 20

3 Primal - Dual PCA Toy Example 3: Raw Data

4 Primal - Dual PCA Toy Example 3: Raw Data Similar Structure to e.g. 1 But Rows and Columns trade places And now cubics visually dominant (as expected)

5 Primal - Dual PCA Toy Example 3: Primal PCA Column Curves as Data Gaussian Noise Only 3 components Poly Scores (as expected)

6 Primal - Dual PCA Toy Example 3: Dual PCA Row Curves as Data Components as expected No Gram-Schmidt (since stronger signal)

7 Primal - Dual PCA Toy Example 3: SVD – Matrix-Image

8 Primal - Dual PCA Toy Example 4: Mystery #1

9 Primal - Dual PCA Toy Example 4: SVD – Curves View

10 Primal - Dual PCA Toy Example 4: SVD – Matrix-Image

11 Primal - Dual PCA Toy Example 4: Mystery #1 Structure: Primal - Dual Constant Gaussian Gaussian Linear Parabola Gaussian Gaussian Cubic Nicely revealed by Full Matrix decomposition and views

12 Primal - Dual PCA Toy Example 5: Mystery #2

13 Primal - Dual PCA Toy Example 5: SVD – Curves View

14 Primal - Dual PCA Toy Example 5: SVD – Matrix-Image

15 Primal - Dual PCA Toy Example 5: Mystery #2 Structure: Primal - Dual Constant Linear Parabola Cubic Gaussian Gaussian Visible via either curves, or matrices …

16 Primal - Dual PCA Is SVD (i.e. no mean centering) always “ better ” ? What does “ better ” mean??? A definition: Provides most useful insights into data Others???

17 Primal - Dual PCA Toy Example where SVD is less informative: Simple Two dimensional Key is subtraction of mean is bad I.e. Mean dir ’ n different from PC dir ’ ns And Mean Less Informative

18 Primal - Dual PCA Toy Example where SVD is less informative: Raw Data

19 Primal - Dual PCA PC1 mode of variation (centered at mean): Yields useful major mode of variation

20 Primal - Dual PCA PC2 mode of variation (centered at mean): Informative second mode of variation

21 Primal - Dual PCA SV1 mode of variation (centered at 0): Unintuitive major mode of variation

22 Primal - Dual PCA SV2 mode of variation (centered at 0): Unintuitive second mode of variation

23 Primal - Dual PCA Summary of SVD: Does give a decomposition I.e. sum of two pieces is data But not good insights about data structure Since center point of analysis is far from center point of data So mean strongly influences the impression of variation Maybe better to keep these separate???

24 Primal - Dual PCA Bottom line on: Primal PCA vs. SVD vs. Dual PCA These are not comparable: Each has situations where it is “ best ” And where it is “ worst ” Generally should consider all And choose on basis of insights See work of Lingsong Zhang on this …

25 Vectors vs. Functions Recall overall structure: Object Space Feature Space Curves (functions) Vectors Connection 1: Digitization Parallel Coordinates Connection 2: Basis Representation

26 Vectors vs. Functions Connection 1: Digitization: Given a function, define vector Where is a suitable grid, e.g. equally spaced:

27 Vectors vs. Functions Connection 1: Parallel Coordinates: Given a vector, define a function where And linearly interpolate to “ connect the dots ” Proposed as High Dimensional Visualization Method by Inselberg (1985)

28 Vectors vs. Functions Parallel Coordinates: Given, define Now can “ rescale argument ” To get function on [0,1], evaluated at equally spaced grid

29 Vectors vs. Functions Bridge between vectors & functions: Vectors  Functions Isometry follows from convergence of: Inner Products By Reimann Summation

30 Vectors vs. Functions Main lesson: -OK to think about functions -But actually work with vectors For me, there is little difference But there is a statistical theory, and mathematical statistical literature on this Start with Ramsay & Silverman (2005)

31 Vectors vs. Functions Recall overall structure: Object Space Feature Space Curves (functions) Vectors Connection 1: Digitization Parallel Coordinates Connection 2: Basis Representation

32 Vectors vs. Functions Connection 2: Basis Representations: Given an orthonormal basis (in function space) E.g. –Fourier –B-spline –Wavelet Represent functions as:

33 Vectors vs. Functions Connection 2: Basis Representations: Represent functions as: Bridge between discrete and continuous:

34 Vectors vs. Functions Connection 2: Basis Representations: Represent functions as: Finite dimensional approximation: Again there is mathematical statistical theory, based on (same ref.)

35 Vectors vs. Functions Repeat Main lesson: -OK to think about functions -But actually work with vectors For me, there is little difference (but only personal taste)

36 PCA for shapes New Data Set: Corpus Callossum Data “ Window ” between right and left halves of the brain From a vertical slice MR image of head “ Segmented ” (ie. found boundary) Shape is resulting closed curve Have sample from n = 71 people Feature vector of d = 80 coeffic ’ ts from Fourier boundary representation (closed curve)

37 PCA for shapes Raw Data: Special thanks to Sean HoSean Ho View curves asmovie Modes of variation?

38 PCA for shapes PC1: Movie shows evolution along eigenvector Projections in bottom plot 2 Data Subclasses Schizophrenics Controls

39 PCA for shapes PC1 Summary (Corpus Callossum Data) Direction is “ overall bending ” Colors studied later (sub populations) An outlier??? Find it in the data? Case 2: could delete & repeat (will study outliers in more detail)

40 PCA for shapes Raw Data: This time with numbers So can identify outlier

41 PCA for shapes PC2: Movie shows evolution along eigenvector Projections in bottom plot

42 PCA for shapes PC2 Summary (Corpus Callossum Data) Rotation of right end “ Sharpening ” of left end “ Location ” of left end These are correlated with each other But independent of PC1

43 PCA for shapes PC3: Thin vs. fat Important mode of variation?

44 PCA for shapes Raw Data: Revisit to look for 3 modes Bending Endpts Thinning

45 PCA for shapes Raw Data: Medial Repr ’ n Heart is Medial Atoms Spokes imply boundary Modes of Variation?

46 PCA for shapes PC1 Summary (medial representation) From same data as above Fourier boundary rep ’ n But they look different Since different type of fitting was done Also, worst outlier was deleted Modes of variation?

47 PCA for shapes PC1: Overall Bending Same as for Fourier above Corr ’ d with right end fattening

48 PCA for shapes PC2: Rotation of ends Similar to PC2 of Fourier rep ’ n above

49 PCA for shapes PC3: Distortion of Curvature Different from PC2 of Fourier rep ’ n above

50 PCA for shapes PC3 Summary (medial representation) Systematic “ distortion of curvature ” This time different from above Fourier boundary PC3 Lesson: different rep ’ ns focus on different aspects of data I.e. not just differences in fitting But instead on features that are emphasized Thus choice of “ features ” is very important

51 PCA for shapes PC4: Fattening and Thinning? Relate to Fourier rep ’ n ???

52 PCA for shapes PC4 Summary (medial representation) more like fattening and thinning i.e. similar to Fourier boundary PC3 (view again below) but “ more local ” in nature an important property of M-reps

53 PCA for shapes PC3: Review this For Comparison with PC4 from M-reps

54

55 Cont. vs. discrete Need to say something about this


Download ppt "Object Orie’d Data Analysis, Last Time PCA Redistribution of Energy - ANOVA PCA Data Representation PCA Simulation Alternate PCA Computation Primal – Dual."

Similar presentations


Ads by Google