FCM WIZARD IN ACTION SCENARIO: HIV DRUG RESISTANCE PREDICTION Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen.

Slides:



Advertisements
Similar presentations
© Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems Introduction.
Advertisements

Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Unsupervised Learning
SVM—Support Vector Machines
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Particle swarm optimization for parameter determination and feature selection of support vector machines Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen,
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
x – independent variable (input)
Decision Tree Algorithm
Final Project: Project 9 Part 1: Neural Networks Part 2: Overview of Classifiers Aparna S. Varde April 28, 2005 CS539: Machine Learning Course Instructor:
Neural Networks Chapter Feed-Forward Neural Networks.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
1 An Excel-based Data Mining Tool Chapter The iData Analyzer.
CS Instance Based Learning1 Instance Based Learning.
Chapter 5 Data mining : A Closer Look.
Gardening Simulation Created by: Sherry Burrill & Laura Hurlbirt OLIT 533 Computer Simulations Dr. Dennis Lester The University of New Mexico Inspired.
Data Mining Techniques
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Presentation on Neural Networks.. Basics Of Neural Networks Neural networks refers to a connectionist model that simulates the biophysical information.
Bayesian parameter estimation in cosmology with Population Monte Carlo By Darell Moodley (UKZN) Supervisor: Prof. K Moodley (UKZN) SKA Postgraduate conference,
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
WEKA - Explorer (sumber: WEKA Explorer user Guide for Version 3-5-5)
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
StAR web server tutorial for ROC Analysis. ROC Analysis ROC Analysis: This module allows the user to input data for several classifiers to be tested.
Machine Learning CSE 681 CH2 - Supervised Learning.
NEURAL NETWORKS FOR DATA MINING
Fuzzy Cognitive Maps Y. İlker TOPCU, Ph.D twitter.com/yitopcu.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Chapter 8 The k-Means Algorithm and Genetic Algorithm.
Experimental Evaluation of Learning Algorithms Part 1.
1 IE 607 Heuristic Optimization Particle Swarm Optimization.
Decision Trees. Decision trees Decision trees are powerful and popular tools for classification and prediction. The attractiveness of decision trees is.
1 1 Slide Using Weka. 2 2 Slide Data Mining Using Weka n What’s Data Mining? We are overwhelmed with data We are overwhelmed with data Data mining is.
FALWEB Fuzzy Aggregated Linkages Within Environmental Bounds  Create and edit FCMs through the use of a square matrix with zeroes along the diagonal 
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 25 Nov 4, 2005 Nanjing University of Science & Technology.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
Lecture 2: Statistical learning primer for biologists
Clustering Instructor: Max Welling ICS 178 Machine Learning & Data Mining.
FCM WIZARD IN ACTION SCENARIO: ECO INDUSTRIAL PARK MODELING Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen.
An Exercise in Machine Learning
Scenario: Coconut Yield Management
Data Mining and Decision Support
***Classification Model*** Hosam Al-Samarraie, PhD. CITM-USM.
FCM WIZARD IN ACTION SCENARIO: AUTISM PREDICTION IN CHILDREN Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen.
1 Classification: predicts categorical class labels (discrete or nominal) classifies data (constructs a model) based on the training set and the values.
FCM WIZARD IN ACTION SCENARIO: RISK MANAGEMENT OF IT PROJECTS Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen.
WEKA's Knowledge Flow Interface Data Mining Knowledge Discovery in Databases ELIE TCHEIMEGNI Department of Computer Science Bowie State University, MD.
A field of study that encompasses computational techniques for performing tasks that require intelligence when performed by humans. Simulation of human.
HIV Mutation Classifier HIV Mutation Classifier Hannah Bier’s Project Proposal.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Classification COMP Seminar BCB 713 Module Spring 2011.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Machine Learning with Spark MLlib
COGNITIVE APPROACH TO ROBOT SPATIAL MAPPING
Compact Bilinear Pooling
Rule Induction for Classification Using
Data Mining (and machine learning)
Advanced Analytics Using Enterprise Miner
Weka Package Weka package is open source data mining software written in Java. Weka can be applied to your dataset from the GUI, the command line or called.
Classification and Prediction
CSCI N317 Computation for Scientific Applications Unit Weka
Lecture 10 – Introduction to Weka
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Perceptron Learning Rule
Presentation transcript:

FCM WIZARD IN ACTION SCENARIO: HIV DRUG RESISTANCE PREDICTION Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen VANHOOF

Scenario description This FCM was conceived for studying the drug resistance of the protease protein when single or multiple mutations take place. The scenario is a classification problem where the instances are protein mutations described by the amino acid on each position and a binary class denoting “Resistance” or “Susceptibility” of the mutation to an HIV drug. Concepts (graph nodes) represent the attributes (descriptors) in the prediction problem: the amino acid contact energy of each protease position associated to drug resistance. Decision concept represents the Resistance class. Dataset source: HIV protease resistance to Indinavir (IDV) See for further information. G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Create a new map by using the menu File |New Map. The canvas is immediately cleaned and ready to be used in a new map. Designing the FCM model G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Design a FCM Add concepts and causal links between them by drag and dropping from the source concept to the target one. Concepts represent protein positions and R denotes the resistance class. For this scenario experts do not know the causal influence, therefore we keep a random value for now. G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Take into account the cut-off for the decision concept R in order to perform a classification. In this scenario the cut-off for resistance is taken from literature and normalized respect to the maximum resistance value in historical data (e.g. for IDV the cut-off is 0.006). Design a FCM Customize concepts and define the ranges for the classes using the cut-off for the decision concept. G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

In this scenario the model is computed from a dataset, and therefore we do not know about the causal relations among protein positions. In such cases a learning algorithm for learning the matrix may be applied (Run | Learning Methods). Learning of causal weights G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

The tool includes unsupervised and supervised learning algorithms. Set up the parameters for the chosen algorithm and browse for the historical dataset source (in ARFF format). Learning of causal weights G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

During the learning progress the statists (e.g. the error curve) are updated. The right panel shows additional statistics related to the knowledge base and the learning process. Learning of causal weights It can be noticed that 98% of instances were correctly classified in the evaluation so far! Now we are able to perform simulation on the FCM and classify new instances of the problem! G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Set initial values for concepts and trigger the inference Run | Run Inference Process. Next a plotter will be showed illustrating the inference process in detail. FCM Inference G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Moreover, it is possible to exploit the learned model by classifying new instances (i.e. mutations) by using the predefined decision classes (i.e. Susceptible and Resistant). Classify a new instance G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Set the values for each attribute concept of the new instance. You can see in the graphic the convergence behavior of the values for each concept during the simulation. Classify a new instance However, since the map was designed from historical data, some concepts can be still superfluous or contradictory. These are the values of concepts after inference process. The label-value ‘0’ denotes that this mutation’s class value is “Susceptible”. G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

The algorithms to optimize the map topology reduce unnecessary concepts and improve the global interpretability. They are based on population-based metaheuristics. Topology optimization G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

First the expert needs to select the proper search method (optimizer) and next configure the specific parameters. Default parameters have been carefully studied in the literature. Topology optimization G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

The runtime plotter shows the error evaluation in blue. Meanwhile the topology is updated on the canvas showing the reduced FCM-based system, which normally is more interpretable. Topology optimization After optimization we obtain 7 concepts from the original 18 (41%), representing the same scenario without loosing accuracy in the classification! G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

Analyzing the convergence of the resistance concept for each instance in the dataset we observe that the inference of the map is not stable, and the results could be compromised. Visualize system convergence This option Run | Convergence Plotter allows to examine the stability. We can improve the map convergence by learning the correct parameter for the transformation function on each concept, without affecting the causal weights and the classification ability of the FCM learned before. G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

It should be remarked that this algorithm is only useful for sigmoid-based FCM since it adjusts the slope of each transfer function to improve the convergence. Improve system convergence G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

First the expert needs to select the proper search method (Swarm Intelligence based algorithms) and next configure the parameters, although default values are provided. Improve system convergence G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

After system convergence have been improved, you can check again the stability of the modeled system, for each instance stored in the training dataset. Improve system convergence G. NÁPOLES, I. GRAU, R. BELLO, R. GRAU, TWO-STEPS LEARNING OF FUZZY COGNITIVE MAPS FOR PREDICTION AND KNOWLEDGE DISCOVERY ON THE HIV-1 DRUG RESISTANCE, EXPERT SYSTEMS WITH APPLICATIONS 41 (2014) 821–830.

FCM WIZARD IN ACTION SCENARIO: HIV DRUG RESISTANCE Main developers: Gonzalo NÁPOLES and Isel GRAU Supervisors: Elpiniki PAPAGEORGIOU and Koen VANHOOF