PhD Hearing (Oct 15, 2003) Predictive Computer Models for Medical Classification Problems Predictive Computer Models for Medical Classification Problems.

Slides:



Advertisements
Similar presentations
Generative Models Thus far we have essentially considered techniques that perform classification indirectly by modeling the training data, optimizing.
Advertisements

SISTA seminar Feb 28, 2002 Preoperative Prediction of Malignancy of Ovarian Tumors Using Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1,
AIME03, Oct 21, 2003 Classification of Ovarian Tumors Using Bayesian Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1, J. A. K. Suykens.
Data Mining Classification: Alternative Techniques
Computer vision: models, learning and inference Chapter 8 Regression.
CS Statistical Machine learning Lecture 13 Yuan (Alan) Qi Purdue CS Oct
An Overview of Machine Learning
Supervised Learning Recap
The Center for Signal & Image Processing Georgia Institute of Technology Kernel-Based Detectors and Fusion of Phonological Attributes Brett Matthews Mark.
Chapter 4: Linear Models for Classification
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Laboratory for Social & Neural Systems Research (SNS) PATTERN RECOGNITION AND MACHINE LEARNING Institute of Empirical Research in Economics (IEW)
Ai in game programming it university of copenhagen Statistical Learning Methods Marco Loog.
Microarrays: algorithms for knowledge discovery in oncology and molecular biology Frank De Smet Katholieke Universiteit Leuven Faculteit Toegepaste Wetenschappen.
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
x – independent variable (input)
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
General Information Course Id: COSC6342 Machine Learning Time: MO/WE 2:30-4p Instructor: Christoph F. Eick Classroom:SEC 201
Data Mining Techniques
PhD defense C. LU 25/01/ Probabilistic Machine Learning Approaches to Medical Classification Problems Probabilistic Machine Learning Approaches to.
B. RAMAMURTHY EAP#2: Data Mining, Statistical Analysis and Predictive Analytics for Automotive Domain CSE651C, B. Ramamurthy 1 6/28/2014.
Support Vector Machine Applications Electrical Load Forecasting ICONS Presentation Spring 2007 N. Sapankevych 20 April 2007.
This week: overview on pattern recognition (related to machine learning)
Introduction to variable selection I Qi Yu. 2 Problems due to poor variable selection: Input dimension is too large; the curse of dimensionality problem.
Prediction model building and feature selection with SVM in breast cancer diagnosis Cheng-Lung Huang, Hung-Chang Liao, Mu- Chen Chen Expert Systems with.
A Comparative Study on Variable Selection for Nonlinear Classifiers C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Reduced the 4-class classification problem into 6 pairwise binary classification problems, which yielded the conditional pairwise probability estimates.
Prediction of Malignancy of Ovarian Tumors Using Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I.
EMBC2001 Using Artificial Neural Networks to Predict Malignancy of Ovarian Tumors C. Lu 1, J. De Brabanter 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
Machine Learning CUNY Graduate Center Lecture 4: Logistic Regression.
Christopher M. Bishop, Pattern Recognition and Machine Learning.
Overview of the final test for CSC Overview PART A: 7 easy questions –You should answer 5 of them. If you answer more we will select 5 at random.
Sparse Kernel Methods 1 Sparse Kernel Methods for Classification and Regression October 17, 2007 Kyungchul Park SKKU.
Support Vector Machines in Marketing Georgi Nalbantov MICC, Maastricht University.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
Dimensionality Reduction in Unsupervised Learning of Conditional Gaussian Networks Authors: Pegna, J.M., Lozano, J.A., Larragnaga, P., and Inza, I. In.
Data Mining and Decision Support
Chapter 6. Classification and Prediction Classification by decision tree induction Bayesian classification Rule-based classification Classification by.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
NTU & MSRA Ming-Feng Tsai
Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel.
Machine Learning in CSC 196K
Developing outcome prediction models for acute intracerebral hemorrhage patients: evaluation of a Support Vector Machine based method A. Jakab 1, L. Lánczi.
Introduction Background Medical decision support systems based on patient data and expert knowledge A need to analyze the collected data in order to draw.
Support Vector Machines Optimization objective Machine Learning.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
1 CISC 841 Bioinformatics (Fall 2008) Review Session.
CSE 4705 Artificial Intelligence
BRAIN Alliance Research Team Annual Progress Report (Jul – Feb
Machine Learning for Computer Security
LECTURE 11: Advanced Discriminant Analysis
Sparse Kernel Machines
School of Computer Science & Engineering
Ch3: Model Building through Regression
Robust Full Bayesian Learning for Neural Networks
Parametric Methods Berlin Chen, 2005 References:
Feature Selection Methods
Machine Learning with Clinical Data
Linear Discrimination
Presentation transcript:

PhD Hearing (Oct 15, 2003) Predictive Computer Models for Medical Classification Problems Predictive Computer Models for Medical Classification Problems PhD progress report ( ~ ) Student : Chuan LU Promoters: Prof. Dr. Ir. Sabine Van Huffel Prof. Dr. Ir. Johan Suykens Advisers : Prof. Dr. Dirk Timmerman Prof. Dr. Ir. Joos Vandewalle Prof. Dr. Jan Beirlant

PhD Hearing (Oct 15, 2003) Overview PhD topic Doctoral Programme: courses, publication, meetings... Research Work Work plan and timing

PhD Hearing (Oct 15, 2003) PhD Topic The development, statistical analysis and clinical evaluation of a new class of predictive models which optimally extract information from patient data. The attention is focused on intelligence machine learning methods such as neural networks, kernel based algorithms, and their integration with Bayesian framework.

PhD Hearing (Oct 15, 2003) Joint Research Activities Classification of ovarian tumors logistic regression (LR) artificial neural networks (ANNs) Bayesian least squares support vector machines (LS-SVMs) Prediction of pregnancy of unknown location (PUL) LR, LS-SVMs, relevance vector machines (RVMs) Variable selection for medical classification problems: (Bayesian framework)

PhD Hearing (Oct 15, 2003) The Doctoral Programme ‘Direct Tuition’ – (520 h) Doctoral Courses Case Studies in Biomedical Data Processing (25x6=150 h) Phd training course: Longitudinal Data,Incomplete Data, and Causal Inference (18x1=18 h) Courses in master of statistics Basic Concepts of Statistical Modeling (60x4=240 h) Applied Statistical Models (60x4=240 h) Seminars Presentation at BioMed Seminar, SISTA (50 x 1 h) Presentation at SISTA seminar on Feb 28, 2002 (50x0.5 h)

PhD Hearing (Oct 15, 2003) The Doctoral Programme Other Study Activities and Achievements Publications (1) – with first authorship [1]‘Preoperative prediction of malignancy of ovarian tumors using least squares support vector machines’. Artificial Intelligence in Medicine, vol. 28, no. 3, Jul. 2003, pp (200 h) [2] ‘Using Artificial Neural Networks to Predict Malignancy of Ovarian Cancers’, in Proc. Of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society - EMBC2001, Istanbul, Turkey, Oct. 2001, CD-ROM. (100 h) [3] ‘Classification of ovarian tumor using Bayesian least squares support vector machines’, accepted for publication in the 9th Conference on Artificial Intelligence in Medicine Europe (AIME 03), Oct 18-22, Cyprus. ( h) [4] ‘Bayesian Least Squares Support Vector Machines for Classification of Ovarian Tumors’, Internal Report , ESAT-SISTA, K.U.Leuven (Leuven, Belgium), 2002.

PhD Hearing (Oct 15, 2003) The Doctoral Programme Publications (2) – with coauthorship [1] ‘Prediction of mental development of preterm newborns at birth time using LS- SVM’, in Proc. of ESANN'02, Bruges, Belgium, Apr. 2002, pp [2] ‘Color Doppler and Gray-Scale Ultrasound Evaluation of the Postpartum uterus’, Ultrasound Obstet. Gynecol., vol. 20, 2002, pp [3] ‘Prospective evaluation of blood flow in the myometrium and in the uterine arteries in the puerperium’, Jan 2003, accepted for publication in Ultrasound in Obstetrics and Gynecology. [4] ‘Subjective use of serum human chorionic gonadotrophin and progesterone levels for the investigation of pregnancies of unknown location : analysis of interuser variability and experience’, [5] ‘Can ectopic pregnancies be predicted using serum hormone levels ?’, submitted, [6] ‘A novel neural approach to inverse problems with discontinuities (the GMR neural network)’, in IJCNN'03, Portland, Oregon, Jul. 2003, pp [7] ‘Direct Torque control of induction motors by use of the GMR Neural network’, in IJCNN'03, Portland, Oregon, July 20-24, 2003, pp

PhD Hearing (Oct 15, 2003) The Doctoral Programme Other Study Activities and Achievements Participation in Scientific Meetings 9th Annual Meeting of the Belgian Statistical Society-BSS2001, Oostende, Oct 2001, Poster Presentation: ‘Prediction of Malignancy of Ovarian Tumors Using Logistic Regression and Artificial Neural Network Models’ the Advanced NATO Study Institute on Learning Theory and Practice (NATO-ASI LTP 2002), Leuven, Belgium, July 8-19, 2002, Poster Presentation: ‘Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors’. 10th Annual Meeting of the Belgian Statistical Society-BSS2002, Kerkrade, The Netherlands, October 18-19, 2002 Poster Presentation: ‘Comparative study on variable selection for nonlinear classifiers ’ Poster presentation at study day of IAP network 2001, 2002 and Belgian Day of Biomedical Engineering in Brussels,, October 17th 2003, Poster presentation: ‘Variable selection using linear sparse Bayesian models for medical classification problems’.

PhD Hearing (Oct 15, 2003) The Doctoral Programme Other Study Activities and Achievements Supervision of licentiate thesis (2x =400 h) Thesis supervision for Master of applied statistics: ‘Mathematical models for predicting the evolution of a pregnancy of unknown location’, 2003 Thesis supervision for Master of applied statistics: ‘Prediction of pregnancy evolution and ectopic pregnancy’, Thesis supervision for ERASMUS student from university de Picardie Jules Verne, France, 2003, Approches neuronales pour la résolution de problèmes inverses avec discontinuités

PhD Hearing (Oct 15, 2003) Research - Building blocks Explorative data analysis (EDA) Probabilistic modeling techniques Variable selection Applications

PhD Hearing (Oct 15, 2003) Explorative Data Analysis Gain insights into a data set: structure, imprtant variables, outiers, and the model suggested by the data. Techniques: scatterplots, boxplots, histograms, PCA, FA, CCA, biplots, etc. New nonlinear techniques in EDA: kernel PCA, kernel CCA, and nonlinear biplots. Uncovering the nonlinear structure of the data, aid in nonlinear modeling such as LS-SVM.

PhD Hearing (Oct 15, 2003) Explorative Data Analysis Fig. Biplot of Ovarian Tumor data. The observations are plotted as points (o - benign, x - malignant), the variables are plotted as vectors from the origin. - visualization of the correlation between the variables - visualization of the relations between the variables and clusters.

PhD Hearing (Oct 15, 2003) Explorative Data Analysis Fig. Nonlinear Biplot for kernel PCA with RBF kernels Data projected onto pairs of PCs (PCs with the largest correlation with y were selected for visualization), computed by kernel trick. Approximate decision boundary: ridge regression (y=  1) using pairs of PCs For kth variable, pseudosamples generated by: setting data mean as starting point, varying the value of variable k while fixing the others. Variable trajectory: tracing the projection of the pseudosample onto the pairs of PCs.

PhD Hearing (Oct 15, 2003) Probabilistic Modeling Probabilistic modeling needed in medical Decis. Supp.  the uncertainty and different mis. class. cost. Traditional statistical linear probabilistic classifiers: Linear discriminant analysis (LDA) Logistic regression (LR) Bayesian MLPs Bayesian + Kernel based modeling: Bayesian LS-SVM classifiers (Suykens 1999, 2001, 2002) Sparse Bayesian modeling and relevance vector machines (RVMs) (Tipping 2001,2003)

PhD Hearing (Oct 15, 2003) Recipe Goal: Linear model y=w T x  Nonlinear model Dealing with uncertainty Model selection Sparseness Ingredients: Kernel trick: x   (x) higher dim. feature space Bayesian framework

PhD Hearing (Oct 15, 2003) Bayesian Inference Find the maximum a posterior (MAP) estimates of model parameters w MP and b MP, using conventional LS-SVM training. The posterior probability of the parameters can be estimated via marginalization using Gaussian probability at w MP, b MP Assuming a uniform prior p(H j ) over all model, rank the model by the evidence p(D|H j ) evaluated using Gaussian approximation.

PhD Hearing (Oct 15, 2003) Variable selection Importance in medical classification problems economics of data acquisition accuracy and complexity of the classifiers gain insights into the underlying medical problem. Filter approaches: filter out irrelevant attributes before induction occurs Wrapper approaches: focus on finding attributes that are useful for performance for a specific type of model, rather than necessarily finding the relevant ones.

PhD Hearing (Oct 15, 2003) Variable selection Heuristic search: forward, backward, stepwise hill-climbing, branch and bound… Variable selection criteria: Correlation, fisher score, mutual information Evidence in Bayesian framework Classification performance, e.g. AUC Sensitivity analysis: change in the objective function J by removing variable i: DJ(i) Statistical chi-square test

PhD Hearing (Oct 15, 2003) Variable selection We focus on evidence (marginal likelihood) based method within the Bayesian framework Forward / stepwise selection Bayesian LS-SVM Sparse Bayesian models Accounting for uncertainty in variable selection

PhD Hearing (Oct 15, 2003) Application - Ovarian tumor classification Problem develop a reliable diagnostic tool to discriminate preoperatively between benign and malignant tumors. assist clinicians in choosing the appropriate treatment. Data (from IOTA project) Patient data collected at Univ. Hospitals Leuven, Belgium, 1994~ records, 25 features. 291 benign tumors, 134 (32%) malignant tumors.

PhD Hearing (Oct 15, 2003) Application - Ovarian tumor classification Forward variable selection based on Bayesian LS-SVM Evolution of the model evidence 10 variables were selected based on the training set (first treated 265 patient data) using RBF kernels.

PhD Hearing (Oct 15, 2003) Application - Ovarian tumor classification  Predictive power of the models given the selected variables ROC curves on test Set (data from 160 newest treated patients)

PhD Hearing (Oct 15, 2003) Application - Ovarian tumor classification  Performance on test set with rejection based on, e.g.,  The rejected patients need further examination by human experts  Posterior probability essential for medical decision making

PhD Hearing (Oct 15, 2003) Application- binary cancer classification based on microarray data  Variable selection using linear sparse Bayesian logit model, LOO CV accuracy cancerno. samplesno. genestask leukemia subtypes colon622000disease/normal

PhD Hearing (Oct 15, 2003) Application- brain tumor multiclass classification based on MRS spectra data  4 types of brain tumors, 205x138 magnitude value  Variable selection using linear sparse Bayesian logit model, 30 runs of random CV accuracy

PhD Hearing (Oct 15, 2003) Work Plan and Timing Nov - Overview paper on Linear and nonlinear preoperative classification of ovarian tumors (chapter proposal accepted for edited book "Knowledge Based Intelligent System for Health Care.") Dec – Jan 2003, paper on variable selection Feb 2004 – model averaging Model evaluation using IOTA data. April 2004 – writing draft of thesis Sept Defense