Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Mining Course 0 Manifold learning Xin Yang. Data Mining Course 1 Outline Manifold and Manifold Learning Classical Dimensionality Reduction Semi-Supervised.

Similar presentations


Presentation on theme: "Data Mining Course 0 Manifold learning Xin Yang. Data Mining Course 1 Outline Manifold and Manifold Learning Classical Dimensionality Reduction Semi-Supervised."— Presentation transcript:

1 Data Mining Course 0 Manifold learning Xin Yang

2 Data Mining Course 1 Outline Manifold and Manifold Learning Classical Dimensionality Reduction Semi-Supervised Nonlinear Dimensionality Reduction Experiment Results Conclusions

3 Data Mining Course 2 What is a manifold?

4 Data Mining Course 3 Examples: sphere and torus

5 Data Mining Course 4 Why we need manifold?

6 Data Mining Course 5

7 6 Manifold learning Raw format of natural data is often high dimensional, but in many cases it is the outcome of some process involving only few degrees of freedom.

8 Data Mining Course 7 Manifold learning Intrinsic Dimensionality Estimation Dimensionality Reduction

9 Data Mining Course 8 Dimensionality Reduction Classical Method: Linear: MDS & PCA (Hastie 2001) Nonlinear: LLE (Roweis & Saul, 2000), ISOMAP (Tenebaum 2000), LTSA (Zhang & Zha 2004) -- in general, low dimensional coordinates lack physical meaning

10 Data Mining Course 9 Semi-supervised NDR Prior information Can be obtained from experts or by performing experiments Eg: moving object tracking

11 Data Mining Course 10 Semi-supervised NDR Assumption: Assuming the prior information has a physical meaning, then the global low dimensional coordinates bear the same physical meaning.

12 Data Mining Course 11 Basic LLE

13 Data Mining Course 12 Basic LTSA Characterized the geometry by computing an approximate tangent space

14 Data Mining Course 13 SS-LLE & SS-LTSA Give m the exact mapping data points. Partition Y as Our problem :

15 Data Mining Course 14 SS-LLE & SS-LTSA To solve this minimization problem, partition M as: Then the minimization problem can be written as

16 Data Mining Course 15 SS-LLE & SS-LTSA Or equivalently Solve it by setting its gradient to be zero, we get:

17 Data Mining Course 16 Sensitivity Analysis With the increase of prior points, the condition number of the coefficient matrix gets smaller and smaller, the computed solution gets less sensitive to the noise in and

18 Data Mining Course 17 Sensitivity Analysis The sensitivity of the solution depends on the condition number of the matrix

19 Data Mining Course 18 Inexact Prior Information Add a regularization term, weighted with a parameter

20 Data Mining Course 19 Inexact Prior Information Its minimizer can be computed by solving the following linear system:

21 Data Mining Course 20 Experiment Results “incomplete tire” --compare with basic LLE and LTSA --test on different number of prior points Up body tracking --use SSLTSA --test on inexact prior information algorithm

22 Data Mining Course 21 Incomplete Tire

23 Data Mining Course 22

24 Data Mining Course 23 Relative error with different number of prior points

25 Data Mining Course 24 Up body tracking

26 Data Mining Course 25 Results of SSLTSA

27 Data Mining Course 26 Results of inexact prior information algorithm

28 Data Mining Course 27 Conclusions Manifold and manifold learning Semi-supervised manifold learning Future work

29 Data Mining Course 28


Download ppt "Data Mining Course 0 Manifold learning Xin Yang. Data Mining Course 1 Outline Manifold and Manifold Learning Classical Dimensionality Reduction Semi-Supervised."

Similar presentations


Ads by Google