Replacement Method and Enhanced Replacement Method Versus the Genetic Algorithm Approach for the Selection of Molecular Descriptors in QSPR/QSAR Theories.

Slides:



Advertisements
Similar presentations
The simplex algorithm The simplex algorithm is the classical method for solving linear programs. Its running time is not polynomial in the worst case.
Advertisements

Biologically Inspired Computing: Operators for Evolutionary Algorithms
Copyright © 2011 Ramez Elmasri and Shamkant Navathe Algorithms for SELECT and JOIN Operations (8) Implementing the JOIN Operation: Join (EQUIJOIN, NATURAL.
Exact and heuristics algorithms
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
« هو اللطیف » By : Atefe Malek. khatabi Spring 90.
Tuesday, May 14 Genetic Algorithms Handouts: Lecture Notes Question: when should there be an additional review session?
Genetic Algorithms Representation of Candidate Solutions GAs on primarily two types of representations: –Binary-Coded –Real-Coded Binary-Coded GAs must.
Genetic Algorithms as a Tool for General Optimization Angel Kuri 2001.
Non-Linear Problems General approach. Non-linear Optimization Many objective functions, tend to be non-linear. Design problems for which the objective.
Feature Selection for Regression Problems
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
Data classification based on tolerant rough set reporter: yanan yean.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
MAE 552 Heuristic Optimization Instructor: John Eddy Lecture #16 3/1/02 Taguchi’s Orthogonal Arrays.
COMP305. Part II. Genetic Algorithms. Genetic Algorithms.
Game of Life Changhyo Yu Game of Life2 Introduction Conway’s Game of Life  Rule Dies if # of alive neighbor cells =< 2 (loneliness) Dies.
D Nagesh Kumar, IIScOptimization Methods: M1L4 1 Introduction and Basic Concepts Classical and Advanced Techniques for Optimization.
Carmine Cerrone, Raffaele Cerulli, Bruce Golden GO IX Sirmione, Italy July
Using Simulated Annealing and Evolution Strategy scheduling capital products with complex product structure By: Dongping SONG Supervisors: Dr. Chris Hicks.
Genetic Programming. Agenda What is Genetic Programming? Background/History. Why Genetic Programming? How Genetic Principles are Applied. Examples of.
Universidad de los Andes-CODENSA The Continuous Genetic Algorithm.
Genetic Programming.
1 Reasons for parallelization Can we make GA faster? One of the most promising choices is to use parallel implementations. The reasons for parallelization.
Genetic Algorithm.
Evolutionary Intelligence
Efficient Model Selection for Support Vector Machines
Simpson Rule For Integration.
© Negnevitsky, Pearson Education, Lecture 10 Evolutionary Computation: Evolution strategies and genetic programming Evolution strategies Evolution.
1 The Euclidean Non-uniform Steiner Tree Problem by Ian Frommer Bruce Golden Guruprasad Pundoor INFORMS Annual Meeting Denver, Colorado October 2004.
CS 484 – Artificial Intelligence1 Announcements Lab 3 due Tuesday, November 6 Homework 6 due Tuesday, November 6 Lab 4 due Thursday, November 8 Current.
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Genetic Algorithms Michael J. Watts
Department of Electrical Engineering, Southern Taiwan University Robotic Interaction Learning Lab 1 The optimization of the application of fuzzy ant colony.
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
The Integers. The Division Algorithms A high-school question: Compute 58/17. We can write 58 as 58 = 3 (17) + 7 This forms illustrates the answer: “3.
Applying Genetic Algorithm to the Knapsack Problem Qi Su ECE 539 Spring 2001 Course Project.
DYNAMIC FACILITY LAYOUT : GENETIC ALGORITHM BASED MODEL
Genetic algorithms (GA) for clustering Pasi Fränti Clustering Methods: Part 2e Speech and Image Processing Unit School of Computing University of Eastern.
Kanpur Genetic Algorithms Laboratory IIT Kanpur 25, July 2006 (11:00 AM) Multi-Objective Dynamic Optimization using Evolutionary Algorithms by Udaya Bhaskara.
1 Chapter 6 General Strategy for Gradient methods (1) Calculate a search direction (2) Select a step length in that direction to reduce f(x) Steepest Descent.
CS 8625 High Performance Computing Dr. Hoganson Copyright © 2003, Dr. Ken Hoganson CS8625 Class Will Start Momentarily… CS8625 High Performance.
EE459 I ntroduction to Artificial I ntelligence Genetic Algorithms Practical Issues: Selection.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Neural Networks And Its Applications By Dr. Surya Chitra.
Application of the GA-PSO with the Fuzzy controller to the robot soccer Department of Electrical Engineering, Southern Taiwan University, Tainan, R.O.C.
1 Contents 1. Basic Concepts 2. Algorithm 3. Practical considerations Genetic Algorithm (GA)
Sorting Algorithms Written by J.J. Shepherd. Sorting Review For each one of these sorting problems we are assuming ascending order so smallest to largest.
Genetic Algorithm Dr. Md. Al-amin Bhuiyan Professor, Dept. of CSE Jahangirnagar University.
Agenda  INTRODUCTION  GENETIC ALGORITHMS  GENETIC ALGORITHMS FOR EXPLORING QUERY SPACE  SYSTEM ARCHITECTURE  THE EFFECT OF DIFFERENT MUTATION RATES.
Genetic Algorithms. Overview “A genetic algorithm (or GA) is a variant of stochastic beam search in which successor states are generated by combining.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
The normal approximation for probability histograms.
Genetic Algorithms for clustering problem Pasi Fränti
Breeding Swarms: A GA/PSO Hybrid 簡明昌 Author and Source Author: Matthew Settles and Terence Soule Source: GECCO 2005, p How to get: (\\nclab.csie.nctu.edu.tw\Repository\Journals-
Estimation of Distribution Algorithm and Genetic Programming Structure Complexity Lab,Seoul National University KIM KANGIL.
Genetic Algorithm(GA)
Evolutionary Design of the Closed Loop Control on the Basis of NN-ANARX Model Using Genetic Algoritm.
 Presented By: Abdul Aziz Ghazi  Roll No:  Presented to: Sir Harris.
Hirophysics.com The Genetic Algorithm vs. Simulated Annealing Charles Barnes PHY 327.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Balancing of Parallel Two-Sided Assembly Lines via a GA based Approach

Example: Applying EC to the TSP Problem
6.5 Taylor Series Linearization
Local Search Algorithms
Biologically Inspired Computing: Operators for Evolutionary Algorithms
Presentation transcript:

Replacement Method and Enhanced Replacement Method Versus the Genetic Algorithm Approach for the Selection of Molecular Descriptors in QSPR/QSAR Theories Andrew G. Mercader, Pablo R. Duchowicz, Francisco M. Ferna´ndez, and Eduardo A. Castro J. Chem. Inf. Model. 2010, 50, 1542–1548 By : Atefeh Malek. khatabi Atefeh Malek. khatabi

These methods avoid the impracticable full search for optimal variables in large sets of molecular descriptors. A full search (FS) of the optimal variables is impractical because it requires D!/(D - d)!d! linear regressions. If D=140, d=7 1.8*10^11 Present results for 10 different experimental databases Introduction:

RM is a rapidly convergent iterative algorithm that produces linear regression models with small S in remarkably short computer times. For first time Pablo R. Duchowicz and et al use this method in Replacement Method Replacement Method Step 1: We choose an initial set of descriptors dk at random, replace one of the descriptors, say Xki, with all the remaining D - d descriptors, one by one, and keep the set with the smallest value of S. Pablo R. Duchowicz a, Eduardo A. Castro a, Francisco M. Ferna´ndez a, Maykel P. Gonzalez, Chemical Physics Letters 412 (2005) 376–380

……...’… N D d dn N d d dk dki D-d (D-d)+1 N Replace with each D-d one by one Sought for smallest value of S Model

Step 2: Descriptor with the greatest standard deviation in its coefficient is chosen (the one changed previously is not considered) and substituted with all the remaining D - d descriptors, one by one. Repeat for each of dk descriptors Coefficient For First descriptor dki dk (D-d+1) Calculate standard deviation of selected descriptors

S1 S4 S5 S6 S10 S1 S4 S5 S6 S7 S8 S9 S Smin S6 623 Example: 5 10

S2 S4 S5 S1 S10 Smin S9 693 S2 S4 S5 S1 S7 S8 S9 S If S2 be min No replace If S9 be min Replace with discriptor2 3 8

S3 S4 S5 S1 S10 Smin S5 695 S3 S4 S5 S1 S7 S8 S2 S

Give the optimal set after two cycle

Standard deviation vs number of steps for the RM

ERM: Enhanced Replacement Method ERM: The ERM is a three step combination of two algorithms: first the RM already described above, then a modified RM (MRM), and finally a RM is used again. MRM follows the same strategy as RM except that in each step the descriptor with the largest error is substituted even if that substitution is not accompanied by a smaller value of S (the next smallest value of S is chosen). The main difference in MRM is that it adds some sort of noise that prevents the selected model to stay in a local minimum of S.

S2 S4 S5 S1 S10 Smin S9 693 S2 S4 S5 S1 S7 S8 S9 S If S2 be min No replace If S9 be min Replace with discriptor2

Standard deviation vs number of steps for the ERM

Standard deviation vs population number for GA with number of individuals = 5, generation gap = 0.9, single-point crossover probability = 0.6, and mutation probability = 0.7/d. Result of GA with different number of individuals

Standard deviation vs population number for GA with number of individuals = 20, generation gap = 0.9, single-point crossover probability = 0.6, and mutation probability = 0.7/d.

Standard deviation vs population number for GA with number of individuals = 100, generation gap = 0.9, single-point crossover probability = 0.6, and mutation probability = 0.7/d.

It should to be kept in mind that since the GA is a nondeterministic methodology, then its results may change for different runs using exactly the same initial conditions.

Table 3 also suggests that the GA is better than the RM in 51.4% of the cases, the latter approach is preferable in 31.4% of the cases and both methods produce similar results in 17.1% of the cases. However, it should be kept in mind that the RM is a much simpler algorithm. It is also clear that the ERM is preferable to the RM.

Table 4 shows that both the RM and ERM lead to models with values of S that are smaller or similar to those of GA.

The results suggested that ERM was preferable to GA. To the quality of the results, we should add the fact that the ERM is much simpler than the GA. We have also shown that although the GA is slightly better than the RM, simplicity and lower computational cost make the latter more attractive. Finally, it is worth mentioning the three methods RM, ERM, and GA can be used under different conditions as alternative strategies for the construction of models for chemical properties and activities from quite large pools of descriptors for molecular structure. CONCLUSIONS:

Thanks