Download presentation
Presentation is loading. Please wait.
1
Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. Polyhedral Classifier for Target Detection A Case Study: Colorectal Cancer Murat Dundar, Matthias Wolf, Sarang Lakare, Marcos Salganicoff, Vikas C. Raykar Siemens Medical Solutions, Inc. USA Malvern, PA 19355
2
Page 2 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Computer Aided Diagnosis (CAD) for Colon Cancer Identify suspicious regions (candidates) Extract features for each candidate Classify candidates as a polyp or non-polyp
3
Page 3 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Challenges in Classification Extremely unbalanced between positive and negative classes Non-IID data Multiple instances Run-time limitations Multi-mode data
4
Page 4 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Multi-mode nature of CAD data The only ground truth available is the location of the polyp. All other candidates that are not pointing to a known polyp are pooled into the negative class. Variation among the different negatives is large.
5
Page 5 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. A CAD Example: Colorectal Cancer Polyps vs. common false positives Fold Stool Noise Rectal tube Sessile polyp Pedunculated polyp
6
Page 6 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Model class distribution by a mixture model, one mode for each subclass, then design a maximum a posteriori or maximum likelihood classifier Too few positives, too many features with redundancy! Robust estimation of model parameters for positive class is very difficult, if not impractical State-of-the-Art – Finite Mixture Models
7
Page 7 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. State-of-the-Art – Discriminative Techniques Pool all negative candidates into a single class and learn a binary classifier, i.e. polyps vs. negatives A kernel-based discriminative technique (SVM, RVM, KFD) can yield nonlinear decision boundaries suitable for classifying multi-mode data. Too few positive candidates, too many features with redundancy! Data can be easily overfit by a nonlinear classifier
8
Page 8 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. State-of-the-Art – One-Class Classifiers Omits the negative class, learns a model with positive samples only. Kernel-based and neural network implementation yield nonlinear decision boundaries suitable for classifying multi-mode data. Like other nonlinear classifiers susceptible to overfitting
9
Page 9 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. State-of-the-art in a Nutshell Linear classifiers less prone to overfitting not enough capacity to deal with multi-mode data Finite mixture models Parameter estimation is an issue! Discriminative & One-class Classifiers good capacity more prone to overfitting
10
Page 10 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. A Viable solution A series of linear classifiers one for each subclass of the negatives More capacity than a linear classifier, yet less prone to overfitting than a nonlinear classifier An unseen sample is classified as positive if all the classifier classifies it as positive
11
Page 11 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Training Multiple Linear Classifiers Train each classifier independently: Negative subclass k vs. Positives, for k=1,…,K. Inefficient! Potentially excessive penalization due to a misclassified positive sample
12
Page 12 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Proposed Approach Optimize classifiers jointly One classifier for each subclass of negative data Objective function is penalized once due to a misclassified positive sample Yields a polyhedral decision surface
13
Page 13 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. A Toy Example
14
Page 14 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Hyperplane Classifiers with Hinge Loss TP+ FP- ξ ξ
15
Page 15 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. i-th Positive example: -- “AND” i-th Negative example: Polyhedral Classifier with AND Framework If the hinge loss = 0, the example is correctly classified, If the hinge loss > 0, the example is mis-classified Let be the hinge loss of i-th example induced by the classifier k
16
Page 16 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Objective Function with the AND Framework Error on Negative Examples Error on Positive Examples Regularization to Control Complexity Convex Problem!
17
Page 17 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Incomplete Ground Truth for Subclasses AND algorithm assumes the subclass membership is known for all samples. Not Realistic! Annotate a small portion of the negatives identify potential subclasses pool training samples for each subgroup. Three different types of samples in the training data Positives Negatives with known and unknown subclass membership
18
Page 18 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Objective Function with the AND-OR Framework Error on Negative Examples with known subclasses Error on Positive Examples AND operation Regularization to Control Complexity Error on Negative Examples with unknown subclasses, OR operation Not Convex!
19
Page 19 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Each iteration contains K steps, and each step optimizes a single classifier Alternating Optimization Iterative Algorithm At the k-th step, Fix all classifiers (α’s) but the classifier k Minimize J(α 1,…, α k,… α K ) for optimal α k
20
Page 20 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Cascaded Design 12K T1T1 …. rejected candidates T2T2 T K-1 TKTK F1F1 F2F2 FKFK Candidates Training Sets: T1T1 T2T2 TKTK …. TP
21
Page 21 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Cascade Design with Sparse Linear Classifiers Setting P( k )=| k | yields K sparse classifiers, each with varying number of non-zero coefficients Run-time order does not change the outcome Start with the classifier that has the least number of nonzero coefficients Classify the sample, if negative reject, if positive pass it to the next classifier that requires computation of least number of additional features. Continue until all K classifiers are run
22
Page 22 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Experiments – Automatic Polyp Detection Data 98 numerical image features are computed, out of 1249 negatives, 177 are annotated, 9 subclasses are identified
23
Page 23 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. ROC plots
24
Page 24 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Run-time Performance 25 % gain in execution time over SVDD and RBF-SVM ClassifiersSens (at 3fp/vol)Time (t) Polyhedral84452 SVDD80595 Rbf-SVM60595 Linear-SVM45437
25
Page 25 July-08 Copyright © Siemens Medical Solutions, USA, Inc.; 2008. All rights reserved. CAD & Knowledge Solutions / Malvern, USA / IKMDundar et al. Conclusions Polyhedral classifier for multi-mode data AND framework when subclass information is fully available AND-OR framework when subclass information is partially available Cascade design as a by-product to speed-up online execution Thank you! Questions and Comments
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.