Download presentation
Presentation is loading. Please wait.
1
COMP 328: Midterm Review Spring 2010 Nevin L. Zhang Department of Computer Science & Engineering The Hong Kong University of Science & Technology http://www.cse.ust.hk/~lzhang/ Can be used as cheat sheet
2
Page 2 Overview l Algorithms for supervised learning n Decision trees n Naïve Bayes classifiers n Neural networks n Instance-based learning n Support vector machines l General issues regarding supervised learning n Classification error and confidence interval n Bias-Variance tradeoff n PAC learning theory
3
Supervised Learning Page 3
4
Decision Trees Page 4
5
Decision trees Page 5
6
Reduced-Error Pruning Page 6
7
Decision Trees l Issues with attributes n Continuous n Attributes with many values Use GainRatio instead of Gain n Missing values l Tree construction is a search process n Local minimum Page 7
8
Naïve Bayes Classifier Page 8 Can classify using this rule: But, joint too expensive to get
9
Naïve Bayes Classifier Page 9
10
Learning Naïve Bayes Classifier Page 10 l Laplace smoothing l Continuous attribute l When independence not true, double counting of evidence l Generalization: Bayesian networks
11
Neural Networks Page 11 For classification and regression
12
Neural Networks l Activation function n Step, sign n Sigmoid, tanh (hyperbolic tangent) Page 12
13
Neural Network/Properties l Perceptrons are linear classifier l Two-layer network with enough perceptron units can represent all Boolean functions l One layer with enough sigmoid units can approximate any functions well Page 13
14
Neural Network Page 14 l Converge only when linearly separable
15
Neural Network Page 15 l Adaline learning: Delta rule
16
Neural Network Page 16
17
Instance-Based Learning l Lazy learning n K-NN n Distance-weighted k-NN (kernel regression) n Locally weighted regression Page 17
18
Support Vector Machines Page 18
19
SVM Page 19
20
SVM Page 20
21
SVM Page 21
22
SVM l Data not linearly separable Page 22
23
SVM Page 23
24
Nonlinear SVM Page 24
25
Impact of σ and C Page 25
26
Classifier Evaluation l Relationship between Page 26
27
Algorithm Evaluation/Model Selection Page 27 l W hich learning algorithm to use? l Given algorithm, which model to use? (How many hidden units?)
28
Algorithm Evaluation/Model Selection Page 28
29
Bias-Variance Decomposition Page 29
30
Bias-Variance Tradeoff Page 30 For classification problem also
31
PAC Learning Theory l Probably approximate correct (PAC) l Relationship between Page 31
32
PAC Learning Theory Page 32
33
VC Dimension Page 33
34
Sample Complexity Page 34
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.