Prediction of Sickle Cell Anemia Patient’s Response to Hydroxyurea Treatment Using ARTMAP Network Hongyu Xu, Faramarz Valafar, Marko Vuskovic Department.

Slides:



Advertisements
Similar presentations
ONLINE ARABIC HANDWRITING RECOGNITION By George Kour Supervised by Dr. Raid Saabne.
Advertisements

Sickle cell anemia Jordan Williams.
An Overview of Machine Learning
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Department of Computer Science San Diego State University
Lesson 8: Machine Learning (and the Legionella as a case study) Biological Sequences Analysis, MTA.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Machine Vision and Dig. Image Analysis 1 Prof. Heikki Kälviäinen C50A6100 Lectures 12: Object Recognition Professor Heikki Kälviäinen Machine Vision and.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Classification of Prehensile EMG Patterns With Simplified Fuzzy ARTMAP Networks Marko Vuskovic and Sijiang Du Department of Computer Science San Diego.
Lecture #1COMP 527 Pattern Recognition1 Pattern Recognition Why? To provide machines with perception & cognition capabilities so that they could interact.
Chapter 4 (part 2): Non-Parametric Classification
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
CS Instance Based Learning1 Instance Based Learning.
Aula 4 Radial Basis Function Networks
Introduction to Artificial Neural Network and Fuzzy Systems
Hazırlayan NEURAL NETWORKS Radial Basis Function Networks II PROF. DR. YUSUF OYSAL.
Introduction to Data Mining Engineering Group in ACL.
Copyright 2003 by Hongyu Xu Mahalanobis Distance Based ARTMAP Networks Master Thesis Supervisor: Marko Vuskovic Hongyu Xu Department of Computer Science.
JM - 1 Introduction to Bioinformatics: Lecture VIII Classification and Supervised Learning Jarek Meller Jarek Meller Division.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Bayesian Networks. Male brain wiring Female brain wiring.
ECE 8443 – Pattern Recognition LECTURE 03: GAUSSIAN CLASSIFIERS Objectives: Normal Distributions Whitening Transformations Linear Discriminants Resources.
Matlab Matlab Sigmoid Sigmoid Perceptron Perceptron Linear Linear Training Training Small, Round Blue-Cell Tumor Classification Example Small, Round Blue-Cell.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Introduction: Olfactory Physiology Organic Chemistry Signal Processing Pattern Recognition Computational Learning Electronic Nose Chemical Sensors.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Other NN Models Reinforcement learning (RL)
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Lecture 2: Statistical learning primer for biologists
1 Unsupervised Learning and Clustering Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Introduction to Pattern Recognition (การรู้จํารูปแบบเบื้องต้น)
Speech Lab, ECE, State University of New York at Binghamton  Classification accuracies of neural network (left) and MXL (right) classifiers with various.
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
ECE 471/571 - Lecture 19 Review 11/12/15. A Roadmap 2 Pattern Classification Statistical ApproachNon-Statistical Approach SupervisedUnsupervised Basic.
Sickle Cell Anemia Danny Gardner and Merline Maxi 1/28/10 Period 9/10.
COSC 4426 AJ Boulay Julia Johnson Artificial Neural Networks: Introduction to Soft Computing (Textbook)
Fuzzy Pattern Recognition. Overview of Pattern Recognition Pattern Recognition Procedure Feature Extraction Feature Reduction Classification (supervised)
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Neural Networks for EMC Modeling of Airplanes Vlastimil Koudelka Department of Radio Electronics FEKT BUT Metz,
Sickle Cell Anemia Danny Gardner and Merline Maxi 1/28/10 Period 9/10.
SICKLE CELL ANEMIA. CODOMINANCE Codominance means both alleles are expressed equally Sickle Cell Disease is caused by a Codominant allele –Affects 1 out.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Mixture Densities Maximum Likelihood Estimates.
PatReco: Introduction Alexandros Potamianos Dept of ECE, Tech. Univ. of Crete Fall
Gaussian Mixture Model classification of Multi-Color Fluorescence In Situ Hybridization (M-FISH) Images Amin Fazel 2006 Department of Computer Science.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Neural network based hybrid computing model for wind speed prediction K. Gnana Sheela, S.N. Deepa Neurocomputing Volume 122, 25 December 2013, Pages 425–429.
Deep Learning Amin Sobhani.
Nonparametric Density Estimation – k-nearest neighbor (kNN) 02/20/17
Special Topics In Scientific Computing
CSE P573 Applications of Artificial Intelligence Neural Networks
Course Outline MODEL INFORMATION COMPLETE INCOMPLETE
Neuro-Computing Lecture 4 Radial Basis Function Network
CSE 573 Introduction to Artificial Intelligence Neural Networks
Introduction to Radial Basis Function Networks
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Multivariate Methods Berlin Chen
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Multivariate Methods Berlin Chen, 2005 References:
Hairong Qi, Gonzalez Family Professor
Hairong Qi, Gonzalez Family Professor
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Review of Statistical Pattern Recognition
ECE – Pattern Recognition Lecture 10 – Nonparametric Density Estimation – k-nearest-neighbor (kNN) Hairong Qi, Gonzalez Family Professor Electrical.
ECE – Pattern Recognition Lecture 4 – Parametric Estimation
Presentation transcript:

Prediction of Sickle Cell Anemia Patient’s Response to Hydroxyurea Treatment Using ARTMAP Network Hongyu Xu, Faramarz Valafar, Marko Vuskovic Department of Computer Science San Diego State University San Diego, CA METMBS 2003 Las Vegas, June 24, 2003 The full paper and these slides are available at: Control/Neuromuscular.htm

2 Contents Sickle cell anemia Data and data preprocessing Linear dependency of features Feature selection Data labeling MART clustering algorithm MART classification algorithm Results Conclusion

3 Sickle Cell Anemia Sickle cell anima is a genetic disorder, caused by single point mutation in the beta globin gene that changes from CCTGAGG to CCTGTGG. The molecules of sickle cell hemoglobin adhere to each other and distort red blood cells (RBC) into sickle shape. They stick in narrow blood vessels, blocking the flow of blood. Sickle cell patients experience severe painful crises. Many sickle cell patients die before the age of 20. In the United States, about 1 in 500 African Americans develops sickle cell anima [5]. In Africa, about 1 in 100 individuals develop the disease. In 1983, a drug called hydroxyurea (HU) was first used on sickle cell patients. The patients who responded to HU treatment positively experienced less pain and their life span were prolonged, but HU can also be quite toxic.

4 Patient Features Note: The data used in this research is obtained from University of Georgia, Structural Genomics Group. Dr. Homayoun Valafar was responsible for the data collection and preprocessing.

5 Excerpt from patient’s data 1.0e+004 *

6 Data Preprocessing Normalization Log transformation Treatment of incomplete features

7 Patient Data (after log transform)

Linear Dependency of Features

Linear Dependency of Features (Cont.)

10 SBAN SBEN SCAM SSEN

11 Before removal: After removal:

12 Feature Selection

13 Feature Selection (Cont.)

14 Data Labeling

15 Data Labeling (Cont.)

16 Representation of Patient’s Data in Reduced Feature Space Double rule

17 Representation of Patient’s Data in Reduced Feature Space (Cont.) 15% rule

18 Approaches in Pattern Recognition Bayes ’ Classifier Neural networks Single layer Perceptrons Probability density estimation Parzen window Multilayer Perceptrons K-nearest neighbor Mixture model Feed forward ART Recurrent Basis functions Maximum liklihood Bayesian inference Pattern recognition Pattern recognition MART K-mean SOM Mixture model Radial Basis Function

19 ART Networks Grossberg, 1976 Unsupervised ART Learning Fuzzy ART Carpenter, Grossberg, etal,1991 ARTMAP Carpenter, Grossberg, etal,1991 Fuzzy ARTMAP Carpenter, Grossberg, etal,1991 Gaussian ARTMAP Williamson,1992 ART1, ART2 Carpenter & Grossberg, 1987 Supervised ART Learning Simplified ART Baraldi and Alpaydin, 1998 Simplified ARTMAP Kasuba, 1993 Mahalanobis distance based ARTMAP Vuskovic & Du, 2001 Vuskovic, Xu & Du, 2002

20 MART clustering Algorithm

21 MART clustering Algorithm (Cont.)

22 MART Functions

23 MART Classification Algorithm The trained network is a Gaussian mixture model. Each class maps to one or more clusters. The class probability is proportional to the sum of posterior probabilities of individual clusters of the same class. The prediction is class that yields the maximum class probability. Class conditional pdf of x given cluster j Prior probability of cluster j Posterior probability

24 Results

25 Conclusion MART has shown superior performance in various benchmarks, which has inspired us to apply MART to sickle cell anemia patients data. MART achieved 96.82% accuracy for predicting responders to HU treatment and give 92.59% global accuracy. Removal of linear dependency of features has improved the numerical stability of the algorithms. Reduction of the feature space from 23 to only 3 features has considerably improved the performance (decreased the numerical complexity and even increased the accuracy) In the future we plan to explore other labeling methods. We also plan to investigate more data preprocessing methods, which include both linear and nonlinear transformations.