Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel.

Slides:



Advertisements
Similar presentations
SISTA seminar Feb 28, 2002 Preoperative Prediction of Malignancy of Ovarian Tumors Using Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1,
Advertisements

AIME03, Oct 21, 2003 Classification of Ovarian Tumors Using Bayesian Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1, J. A. K. Suykens.
ECG Signal processing (2)
Brief introduction on Logistic Regression
Learning Algorithm Evaluation
Data Mining Classification: Alternative Techniques
1 Statistical Modeling  To develop predictive Models by using sophisticated statistical techniques on large databases.
Supervised Learning Recap
Fei Xing1, Ping Guo1,2 and Michael R. Lyu2
PhD Hearing (Oct 15, 2003) Predictive Computer Models for Medical Classification Problems Predictive Computer Models for Medical Classification Problems.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Lecture 14 – Neural Networks
Introduction to Predictive Learning
Artificial Intelligence Statistical learning methods Chapter 20, AIMA (only ANNs & SVMs)
An Optimal Learning Approach to Finding an Outbreak of a Disease Warren Scott Warren Powell
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
1 Diagnosing Breast Cancer with Ensemble Strategies for a Medical Diagnostic Decision Support System David West East Carolina University Paul Mangiameli.
Biomedical Image Analysis and Machine Learning BMI 731 Winter 2005 Kun Huang Department of Biomedical Informatics Ohio State University.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Breast Cancer Diagnosis A discussion of methods Meena Vairavan.
Color-based Diagnosis: Clinical Images
PhD defense C. LU 25/01/ Probabilistic Machine Learning Approaches to Medical Classification Problems Probabilistic Machine Learning Approaches to.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Assessment of Model Development Techniques and Evaluation Methods for Binary Classification in the Credit Industry DSI Conference Jennifer Lewis Priestley.
UOG Journal Club: January 2013
This week: overview on pattern recognition (related to machine learning)
COMP3503 Intro to Inductive Modeling
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Prediction model building and feature selection with SVM in breast cancer diagnosis Cheng-Lung Huang, Hung-Chang Liao, Mu- Chen Chen Expert Systems with.
A Comparative Study on Variable Selection for Nonlinear Classifiers C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Reduced the 4-class classification problem into 6 pairwise binary classification problems, which yielded the conditional pairwise probability estimates.
Statistical Modeling with SAS/STAT Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 9, 2015.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Radial Basis Function Networks:
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Prediction of Malignancy of Ovarian Tumors Using Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I.
Empirical Research Methods in Computer Science Lecture 7 November 30, 2005 Noah Smith.
EMBC2001 Using Artificial Neural Networks to Predict Malignancy of Ovarian Tumors C. Lu 1, J. De Brabanter 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Jennifer Lewis Priestley Presentation of “Assessment of Evaluation Methods for Prediction and Classification of Consumer Risk in the Credit Industry” co-authored.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Support Vector Machines in Marketing Georgi Nalbantov MICC, Maastricht University.
Machine Learning 5. Parametric Methods.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
A Brief Introduction and Issues on the Classification Problem Jin Mao Postdoc, School of Information, University of Arizona Sept 18, 2015.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Stock market forecasting using LASSO Linear Regression model
Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.
Introduction Background Medical decision support systems based on patient data and expert knowledge A need to analyze the collected data in order to draw.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Canadian Bioinformatics Workshops
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Bootstrap and Model Validation
CSE 4705 Artificial Intelligence
Machine Learning – Classification David Fenyő
COMPARISON OF RISK OF MALIGNANCY INDICES AND ASSESSMENT OF DIFFERENT NEOPLASIAS IN THE ADNEXA (ADNEX) MODEL AS PREOPERATIVE MALİGNANCY EVALUATION METHODS.
Support Vector Machines (SVM)
Project 1 Binary Classification
UOG Journal Club: August 2013
Somi Jacob and Christian Bach
Parametric Methods Berlin Chen, 2005 References:
Using Clustering to Make Prediction Intervals For Neural Networks
ECE – Pattern Recognition Lecture 4 – Parametric Estimation
What is Artificial Intelligence?
Presentation transcript:

Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman 2 1 Department of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium, 2 Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium address:

Demographic, serum marker, color Doppler imaging and morphologic variables Visualizing the correlation between the variables and the relations between the variables and clusters. Biplot of Ovarian Tumor Data 1. Introduction  Ovarian masses is a common problem in gynecology. A reliable test for preoperative discrimination between benign and malignant ovarian tumors is of considerable help for clinicians in choosing appropriate treatments for patients.  In this study, we develop and evaluate several blackbox models, particularly multi-layer perceptrons (MLP) and least squares support vector machines (LS-SVMs), both within Bayesian evidence framework, to preoperatively predict malignancy of ovarian tumors. Model performance is accessed via Receiver Operating Characteristic (ROC) curve analysis. 2. Data o: benign case x: malignant case

ROC curves constructed by plotting the sensitivity (true positive rate) versus the1-specificity, or false positive rate, for varying probability cutoff level. visualization of the relationship between sensitivity and specificity of a test. Area under the ROC curves (AUC) measures the probability of the classifier to correctly classify events and nonevents.  Patient Data Unv. Hospitals Leuven 1994~ records, 25 features 32% malignant Univariate Analysis Preprocessing Multivariate Analysis PCA, Factor analysis Stepwise logistic regression Model Building Bayesian LS-SVM + sparse approxi. Bayesian MLP Model Evaluation ROC analysis: AUC Cross validation (temporal, random) Descriptive statistics Input Variable Selection Data Exploration Model Development Procedure of developing models to predict the malignancy of ovarian tumors Goal: find a model  With High sensitivity for malignancy and low false positive rate.  Providing probability of malignancy for individual. Bayesian LS-SVM (RBF, Linear) Forward Selection (Max. Evidence) 3. Methods

4. Bayesian MLPs and Bayesian LS-SVMs for classification LS-SVM Classifier (VanGestel,Suykens 2002) Computing posterior class probabilities solved in dual space Model evidence Bayesian Evidence Framework Inferences are divided into distinct levels. MLP Classifiers (Mackay 1992)

Computing posterior class probabilities for minimum risk decision making Incorporate the different misclassification costs into the class priors: e.g. Set the adjusted prior probability for malignant and benign class to: 2/3 and 1/3. 5. Experimental results RMI: risk of malignancy index = score morph × score meno × CA125  Training set : data from the first treated 265 patients  Test set : data from the latest treated 160 patients Performance from Temporal validation ROC curve on test set Performance on Test set Input variable selection The forward selection procedure tries to maximize the model evidence of LS-SVM given a certain type of kernel 10 variables were selected using RBF kernels. l_ca125, pap, sol, colsc3, bilat, meno, asc, shadows, colsc4, irreg

 The forward selection procedure which tries to maximize the evidence of LS-SVM model is able to identify the important variables.  The performance of LS-SVMs and MLPs are comparable.  Both models have the potential to give reliable preoperative prediction of malignancy of ovarian tumors.  A larger scale validation is needed. References 1. C. Lu, T. Van Gestel, et al. Preoperative prediction of malignancy of ovarian tumors using Least Squares Support Vector Machines (2002), submitted paper. 2. D. Timmerman, H. Verrelst, et al., Artificial neural network models for the preoperative discrimination between malignant and benign adnexal masses. Ultrasound Obstet Gynecol (1999). 3. J.A.K. Suykens, J. Vandewalle, Least Squares support vector machine classifiers, Neural Processing Letters (1999), 9(3). 4. T. Van Gestel, J.A.K. Suykens, et al., Bayesian framework for least squares support vector machine classifiers, Gaussian process and kernel fisher discriminant analysis, Neural Computation (2002), 15(5). 5. D.J.C. MacKay, The evidence framework applied to classification networks, Neural Computation (1992), 4(5). Performance from randomized cross-validation (30 runs)  randomly separating training set (n=265) and test set (n=160)  Stratified, #malignant : #benign ~ 2:1 for each training and test set.  Repeat 30 times Averaged Performance on 30 runs of validations 6. Conclusions