Kim Kaminsky Gary D. Boetticher Department of Computer Science

Slides:



Advertisements
Similar presentations
ADBIS 2007 Aggregating Multiple Instances in Relational Database Using Semi-Supervised Genetic Algorithm-based Clustering Technique Rayner Alfred Dimitar.
Advertisements

The Assessment and Application of Lineage Information in Genetic Programs for Producing Better Models Gary D. Boetticher Univ. of Houston.
Better Software Defect Prediction Using Equalized Learning With Machine Learners Kim Kaminsky Gary D. Boetticher Department of Computer Science University.
Nearest Neighbor Sampling for Better Defect Prediction Gary D. Boetticher Department of Software Engineering University of Houston - Clear Lake Houston,
Naïve-Bayes Classifiers Business Intelligence for Managers.
School of Applied Technology, Dep. Of Computer Engineering, T.E.I of Epirus A-Class: a novel classification method I.Tsoulos, A. Tzallas, E. Glavas.
Rachel Harrison, Oxford Brookes University Daniel Rodríguez, Univ of Alcala José Riquelme, Univ of Seville Roberto Ruiz, Pablo de Olavide University.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
GP Applications Two main areas of research Testing genetic programming in areas other techniques have been applied to. Applying genetic programming to.
Automated Software Maintainability through Machine Learning by Eric Mudge.
Genetic Algorithms Sushil J. Louis Evolutionary Computing Systems LAB Dept. of Computer Science University of Nevada, Reno
Institute of Intelligent Power Electronics – IPE Page1 Introduction to Basics of Genetic Algorithms Docent Xiao-Zhi Gao Department of Electrical Engineering.
The GDB Cup: Applying “Real World” Financial Data Mining in an Academic Setting Gary D. Boetticher University of Houston - Clear Lake Houston, Texas, USA.
Applying Genetic Algorithms to Decision Making in Autonomic Computing Systems Authors: Andres J. Ramirez, David B. Knoester, Betty H.C. Cheng, Philip K.
Mining Sequence Patterns from Wind Tunnel Experimental Data Zhenyu Liu †, Wesley W. Chu †, Adam Huang ‡, Chris Folk ‡, Chih-Ming Ho ‡
Genetic Algorithms Learning Machines for knowledge discovery.
A Comparative Analysis of Software Refinement Techniques Ion IVAN Adrian VISOIU.
Math – Getting Information from the Graph of a Function 1.
Engineering Applications of Artificial Intelligence,
A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.
Automating the Lee Model. Major Components Simulator code –Verifying outputs –Verifying model equations –Graphical User interface Auto-tuning the model.
Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.
Soft Computing Lecture 18 Foundations of genetic algorithms (GA). Using of GA.
Using Machine Learning to Predict Project Effort: Empirical Case Studies in Data-starved Domains Gary D. Boetticher Department of Software Engineering.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Investigation of the Effect of Neutrality on the Evolution of Digital Circuits. Eoin O’Grady Final year Electronic and Computer Engineering Project.
Lecture 4 Software Metrics
Evolutionary Art with Multiple Expression Programming By Quentin Freeman.
Math – What is a Function? 1. 2 input output function.
Project 2: Classification Using Genetic Programming Kim, MinHyeok Biointelligence laboratory Artificial.
Modeling Instruction in Physics By Kelly Forest. Experimentation First 2-3 weeks – Learn how to run an experiment -Volume of water in a glass container.
Alice E. Smith and Mehmet Gulsen Department of Industrial Engineering

Evolving Strategies for the Prisoner’s Dilemma Jennifer Golbeck University of Maryland, College Park Department of Computer Science July 23, 2002.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
George Yauneridge.  Machine learning basics  Types of learning algorithms  Genetic algorithm basics  Applications and the future of genetic algorithms.
GENETIC ALGORITHM By Siti Rohajawati. Definition Genetic algorithms are sets of computational procedures that conceptually follow steps inspired by the.
VISUALIZATION TECHNIQUES UTILIZING THE SENSITIVITY ANALYSIS OF MODELS Ivo Kondapaneni, Pavel Kordík, Pavel Slavík Department of Computer Science and Engineering,
UC Marco Vieira University of Coimbra
Evolutionary Computation Evolving Neural Network Topologies.
Chapter 12 Case Studies Part B. Control System Design.
Implementation of Genetic Algorithms into SNORT, a Network Intrusion Detection System By Brian E. Lavender March 21, 2010 Advisor: Dr. Scott Gordon Department.
CS623: Introduction to Computing with Neural Nets (lecture-18) Pushpak Bhattacharyya Computer Science and Engineering Department IIT Bombay.
Genetic Algorithm (Knapsack Problem)
Inductive model evolved from data
-A introduction with an example
Applying Combinatorial Testing to Data Mining Algorithms
Selected Topics in CI I Genetic Programming Dr. Widodo Budiharto 2014.
Software Metrics 1.
Some tools and a discussion.
Bulgarian Academy of Sciences
Rule Induction for Classification Using
HyperNetworks Engın denız usta
Particle Swarm Optimization
Presented by: Dr Beatriz de la Iglesia
MultiRefactor: Automated Refactoring To Improve Software Quality
How to Predict More with Less: Defect Prediction Using Machine Learners in an Implicitly Data Starved Domain Kim Kaminsky Gary D. Boetticher Department.
Representation and Evolution of Lego-based Assemblies
Regularization of Evolving Polynomial Models
Network Intrusion Detection Using GA
Understanding the Human Estimator
A Unifying View on Instance Selection
Task-Based Design Optimization of Modular Robots
EE368 Soft Computing Genetic Algorithms.
Functions Rules and Tables.
7.1 – Functions of Several Variables
Basics Prof. Hsin-Mu (Michael) Tsai (蔡欣穆)
Physics-guided machine learning for milling stability:
FREERIDE: A Framework for Rapid Implementation of Datamining Engines
Complexity as Fitness for Evolved Cellular Automata Update Rules
Presentation transcript:

Kim Kaminsky Gary D. Boetticher Department of Computer Science Building a Genetically Engineerable Evolvable Program (GEEP) Using Breadth-Based Explicit Knowledge for Predicting Software Defects Kim Kaminsky Gary D. Boetticher Department of Computer Science University of Houston - Clear Lake Houston, Texas, USA

Key Concept: Duality in Research

A Genetically Engineerable Evolvable Program (GEEP) Genetic Program Process - 1 Fitness Value = Model performance on data. 2 (of many) Chromosomes Data + A B * - 3 D 888 out of 1000 913 out of 1000

A Genetically Engineerable Evolvable Program (GEEP) Genetic Program Process - 2 Mutation 2 Chromosomes Crossover + B - 3 D * A + A B + B - D 3.1 * A - 3 D

A Genetically Engineerable Evolvable Program (GEEP) GEEP uses Domain Specific Equations Pythag. Thm. Cosine

Breadth-Based Explicit Knowledge Akiyama D = 4.86 + 0.018 * L D = 0.12 * C – 0.84 Compton D = 0.069 + 0.00156*L + 0.00000047*L2 Gaffney D = 4.2 + 0.0015*(L)4/3 Halstead D = Volume / 3,000 Lipow D = L * (0.000844 + (0.0007842) ln2* L + (0.00001546)ln2L)

The GEEP Experiments Akiyama Compton Gaffney Halstead No External All Lipow All Models No External Knowledge

} } NASA Data Description NASA Defect Data (KC2 Dataset) 379 Unique tuples } Input: Product Metrics (Size, Complexity, Vocaulary) } Output: Defect Count

Experiment Configuration 30 Trials 128 Generations Max. 512 Chromosomes Initial Maximum Tree Height = 6 Fitness = 1 - Standard error

Experimental Results Akiyama Compton Gaffney Halstead No External All Lipow All Models 0.0169 0.0009 0.0012 0.0003 0.0823 0.1626 No External Knowledge 0.0006 8.023E-06 0.0246 1.733E-06

Discussion

Conclusions Feasibility of leveraging explicit knowledge Data Mining Algorithm Mining

Future Directions Apply to other NASA Defect Datasets Equation Reduction ((((Lipow((Compton((9 + 8) ^ ( iv(g) - 9)) + d )) + (AkiyamaLoc((Compton(Gaffney(Lipow(9) ^ lOBlank )) ^ Lipow(b) ^ Lipow(((6 * AkiyamaLoc ((Gaffney((6 * AkiyamaLoc((Lipow(9)) + Lipow((Compton(AkiyamaComp((( Halstead((Compton(( Halstead(Lipow(9) + AkiyamaLoc( AkiyamaLoc(l)) - Halstead(AkiyamaComp( iv(g) ) * Halstead(Compton( AkiyamaComp( AkiyamaLoc(Compton(3)) ^ AkiyamaLoc(Halstead( AkiyamaComp( iv(g) ) ^ Gaffney((lOComment ^ (Gaffney(Lipow(2)) ^ (Compton(6) ^ AkiyamaLoc((6 - 10)) + Compton(Gaffney(9) * Compton(Lipow(iv(g))) ^ Lipow(((6 * AkiyamaLoc((Gaffney(Lipow(((6 * AkiyamaLoc((Gaffney( Lipow((Compton(AkiyamaComp(AkiyamaLoc(Compton(AkiyamaLoc(l) ^ Gaffney(((Halstead((AkiyamaLoc(AkiyamaLoc((3 ^ lOBlank)) - 7)) * Halstead(AkiyamaComp( iv(g))^ (Gaffney(Lipow(2)) ^ (Compton(6) ^ AkiyamaLoc((6 - 10)) + Compton(3) ^ Gaffney(10))) + Compton(l) * Compton(l ))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))) Performance Improvement