Rule Extraction from trained Neural Networks Brian Hudson, Centre for Molecular Design, University of Portsmouth.

Slides:



Advertisements
Similar presentations
Lazy Paired Hyper-Parameter Tuning
Advertisements

Machine Learning Instance Based Learning & Case Based Reasoning Exercise Solutions.
Rule extraction in neural networks. A survey. Krzysztof Mossakowski Faculty of Mathematics and Information Science Warsaw University of Technology.
Comparison of Data Mining Algorithms on Bioinformatics Dataset Melissa K. Carroll Advisor: Sung-Hyuk Cha March 4, 2003.
Application of Stacked Generalization to a Protein Localization Prediction Task Melissa K. Carroll, M.S. and Sung-Hyuk Cha, Ph.D. Pace University, School.
Chessmen Position Recognition Using Artificial Neural Networks Jun Hou Dec. 8, 2003.
Heterogeneous Forests of Decision Trees Krzysztof Grąbczewski & Włodzisław Duch Department of Informatics, Nicholas Copernicus University, Torun, Poland.
High Throughput Computing and Protein Structure Stephen E. Hamby.
Gene Prediction Methods G P S Raghava. Prokaryotic gene structure ORF (open reading frame) Start codon Stop codon TATA box ATGACAGATTACAGATTACAGATTACAGGATAG.
Lesson learnt from the UCSD datamining contest Richard Sia 2008/10/10.
When to use Data Mining. Introduction An important question that should be answered before you commence any data mining project is whether data mining.
Rule Extraction From Trained Neural Networks Brian Hudson University of Portsmouth, UK.
Biological Data Mining A comparison of Neural Network and Symbolic Techniques
Neural Networks. R & G Chapter Feed-Forward Neural Networks otherwise known as The Multi-layer Perceptron or The Back-Propagation Neural Network.
Bio 465 Summary. Overview Conserved DNA Conserved DNA Drug Targets, TreeSAAP Drug Targets, TreeSAAP Next Generation Sequencing Next Generation Sequencing.
Lazy Learning k-Nearest Neighbour Motivation: availability of large amounts of processing power improves our ability to tune k-NN classifiers.
Data Mining.
1 Data Integration and Extraction over Molecular Biological Data Cui Tao supported by NSF.
Neural Networks Chapter Feed-Forward Neural Networks.
Artificial Neural Networks (ANNs)
Biological Data Mining A comparison of Neural Network and Symbolic Techniques
Generating Robust and Consensus Clusters from Gene Expression Data Allan Tucker a, Stephen Swift a, Xiaohui Liu a, Nigel Martin b, Christine Orengo c,
Results Comparison with existing approaches on benchmark datasets Evaluation on a uveal melanoma datasetEvaluation on the two-spiral dataset Evaluation.
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Introduction to Data Mining Engineering Group in ACL.
Artificial Neural Networks for Secondary Structure Prediction CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (slides by J. Burg)
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Prognostic Modelling and Profiling of Breast Cancer Patients after Surgery Ian Jarman School of Computer and Mathematical Sciences Liverpool John Moores.
Basic Data Mining Techniques
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Cost-Sensitive Bayesian Network algorithm Introduction: Machine learning algorithms are becoming an increasingly important area for research and application.
C. Benatti, 3/15/2012, Slide 1 GA/ICA Workshop Carla Benatti 3/15/2012.
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Artificial Neural Networks An Introduction. What is a Neural Network? A human Brain A porpoise brain The brain in a living creature A computer program.
Artificial Intelligence in Game Design N-Grams and Decision Tree Learning.
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
1 CSC 9010 Spring Paula Matuszek CSC 9010 ANN Lab Paula Matuszek Spring, 2011.
Soft Computing Lecture 19 Part 2 Hybrid Intelligent Systems.
Fuzzy Systems Michael J. Watts
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A fast nearest neighbor classifier based on self-organizing incremental neural network (SOINN) Neuron.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
CHAPTER 8 DISCRIMINATIVE CLASSIFIERS HIDDEN MARKOV MODELS.
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Splice Site Recognition in DNA Sequences Using K-mer Frequency Based Mapping for Support Vector Machine with Power Series Kernel Dr. Robertas Damaševičius.
Biological Data Mining A comparison of Neural Network and Symbolic Techniques
Application of Data Mining Techniques on Survey Data using R and Weka
Neural Network Application for Fault Analysis
Dr Matthew Berryman.  Total systems intervention (Flood & Jackson): ◦ An umbrella framework for guiding the choice of systems methodologies (system dynamics,
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
A New Generation of Artificial Neural Networks.  Support Vector Machines (SVM) appeared in the early nineties in the COLT92 ACM Conference.  SVM have.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
Intrusion Detection using Deep Neural Networks
ANN-based program for Tablet PC character recognition
Fuzzy Systems Michael J. Watts
An Artificial Intelligence Approach to Precision Oncology
Date of download: 10/24/2017 Copyright © ASME. All rights reserved.
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Generalization ..
Prepared by: Mahmoud Rafeek Al-Farra
Machine Learning Interpretability
Iris Authentication: System
Somi Jacob and Christian Bach
Bioinformatics 김유환, 문현구, 정태진, 정승우.
Modeling of Spliceosome
Using Clustering to Make Prediction Intervals For Neural Networks
Presentation transcript:

Rule Extraction from trained Neural Networks Brian Hudson, Centre for Molecular Design, University of Portsmouth

Artificial Neural Networks  Advantages High accuracy Robust Noisy data  Disadvantages Lack of comprehensibilty

Rule Extraction  Rule extraction from trained Neural Networks  High fidelity to original network  TREPAN features Best-first tree growing Sampling query instances M of N rules

Bioinformatics applications  Black box solutions Neural Networks Hidden Markov models  Good test for TREPAN methodology

Gene Splicing  Well known bioinformatics problem  For details & links see

The “answer” is known  Donor sequence C/G A G | G T A/G A G T  Acceptor sequence C/T C/T C/T C/T C/T C/T C/T C/T C/T C/T A G |G

EBI clean dataset  Tidied up dataset generated at EBI  Donors training set 567 real & 943 unreal test set 229 real & 373 unreal  Acceptors training set 637 real & 468 unreal test set 273 real & 213 unreal

Summary of results

TREPAN tree for donors 3 of {p-2 =A, p-1=G, p+3=A, p+4=A, p+5=G} REAL 869/74 UNREAL 43/533 Network : 28x10x1 Training : 92.25% Testing : 90.7% C/G A G | G T A/G A G T

C5 tree for donors (part) p5=G p3=C or p3=T => FALSE p3=A p2=G => REAL p2=A p4=A or p4=G => REAL p4=C or p4=T => FALSE p2=C p4=A => REAL else => FALSE p2=T p6=A or p6=G => FALSE p6=C or p6=T => REAL p3=G p4=T => FALSE p4=C p6=T => REAL else => FALSE p4=A p2=C or p2=G or p2=T => REAL p2=A p-3=T => FALSE else => REAL p4=G p2=A or p2=C or p2=T => FALSE p2=G p1=A or p1=C => REAL p1=G or p1=T => FALSE

TREPAN tree for acceptors 1 of {p-3 =G, p-5=G} UNREAL 26/190 {p-3 =A} UNREAL 25/95 REAL 571/153 Network : 40x13x1 Training : 80.2% Testing : 80.9% UNREAL 13/32 2 of {p+1!=G, p-5=G}

Conclusions  Reasonable prediction rate  ‘explains’ predictions of ANN  comprehensible rules more suited to bioinformatics?

Acknowledgements BBSRC/EPSRC Dave Whitley (CMD) Tony Browne (LGU) Martyn Ford (CMD)