PUNCH: An Evolutionary Algorithm for Optimizing Bit Set Selection Adam J. Ruben, Stephen J. Freeland, Laura F. Landweber DNA7, 2001 Summarized by Dongmin.

Slides:



Advertisements
Similar presentations
Crew Pairing Optimization with Genetic Algorithms
Advertisements

Random Testing Tor Stålhane Jonas G. Brustad. What is random testing The principle of random testing is simple and can be described as follows: 1.For.
1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Tetris – Genetic Algorithm Presented by, Jeethan & Jun.
Student : Mateja Saković 3015/2011.  Genetic algorithms are based on evolution and natural selection  Evolution is any change across successive generations.
Multiple Sequence Alignment
VK Dice By: Kenny Gutierrez, Vyvy Pham Mentors: Sarah Eichhorn, Robert Campbell.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
21-May-15 Genetic Algorithms. 2 Evolution Here’s a very oversimplified description of how evolution works in biology Organisms (animals or plants) produce.
Non-Linear Problems General approach. Non-linear Optimization Many objective functions, tend to be non-linear. Design problems for which the objective.
1 Lecture 8: Genetic Algorithms Contents : Miming nature The steps of the algorithm –Coosing parents –Reproduction –Mutation Deeper in GA –Stochastic Universal.
Optimatization of a New Score Function for the Detection of Remote Homologs Kann et al.
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
This material in not in your text (except as exercises) Sequence Comparisons –Problems in molecular biology involve finding the minimum number of edit.
Genetic Algorithms Learning Machines for knowledge discovery.
Jan 6-10th, 2007VLSI Design A Reduced Complexity Algorithm for Minimizing N-Detect Tests Kalyana R. Kantipudi Vishwani D. Agrawal Department of Electrical.
Sequence comparisons June 23, 2009 Learning objectives-Understand the concept of sliding window programs. Understand difference between identity, similarity.
7/2/2015Intelligent Systems and Soft Computing1 Lecture 9 Evolutionary Computation: Genetic algorithms Introduction, or can evolution be intelligent? Introduction,
(C) 2001 SNU CSE Biointelligence Lab Incremental Classification Using Tree- Based Sampling for Large Data H. Yoon, K. Alsabti, and S. Ranka Instance Selection.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2004.
Protein Sequence Comparison Patrice Koehl
Chapter 6: Transform and Conquer Genetic Algorithms The Design and Analysis of Algorithms.
Motif finding: Lecture 1 CS 498 CXZ. From DNA to Protein: In words 1.DNA = nucleotide sequence Alphabet size = 4 (A,C,G,T) 2.DNA  mRNA (single stranded)
Space-Filling DOEs Design of experiments (DOE) for noisy data tend to place points on the boundary of the domain. When the error in the surrogate is due.
Trees, Stars, and Multiple Biological Sequence Alignment Jesse Wolfgang CSE 497 February 19, 2004.
Genetic Algorithm.
Evolutionary Intelligence
© Negnevitsky, Pearson Education, CSC 4510 – Machine Learning Dr. Mary-Angela Papalaskari Department of Computing Sciences Villanova University.
Program Performance & Asymptotic Notations CSE, POSTECH.
CSE 6406: Bioinformatics Algorithms. Course Outline
CS 484 – Artificial Intelligence1 Announcements Lab 3 due Tuesday, November 6 Homework 6 due Tuesday, November 6 Lab 4 due Thursday, November 8 Current.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Learning Objectives for Section 6.4 The student will be able to set up and solve linear programming problems.
Multiplication of signed-operands
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Basic Concepts in Number Theory Background for Random Number Generation 1.For any pair of integers n and m, m  0, there exists a unique pair of integers.
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
Business Mathematics MTH-367 Lecture 15. Chapter 11 The Simplex and Computer Solutions Methods continued.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Chapter 3 Computational Molecular Biology Michael Smith
© Negnevitsky, Pearson Education, Lecture 9 Evolutionary Computation: Genetic algorithms Introduction, or can evolution be intelligent? Introduction,
Biomolecular Computation in Virtual Test Tubes 7 th International Meeting on DNA Based Computers, p75-83, June 10-13, 2001 Max Garzon, Chris Oehmen Summarized.
Artificial Intelligence Chapter 4. Machine Evolution.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
 Negnevitsky, Pearson Education, Lecture 9 Evolutionary Computation: Genetic algorithms n Introduction, or can evolution be intelligent? n Simulation.
Genetic Algorithms What is a GA Terms and definitions Basic algorithm.
Homework #2: Functions and Arrays By J. H. Wang Mar. 20, 2012.
EE749 I ntroduction to Artificial I ntelligence Genetic Algorithms The Simple GA.
Chapter 9 Processing Lists with Arrays. Class 9: Arrays Understand the concept of random numbers and how to generate random numbers Describe the similarities.
1 CSCD 326 Data Structures I Hashing. 2 Hashing Background Goal: provide a constant time complexity method of searching for stored data The best traditional.
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
Sequence Alignment.
EE459 I ntroduction to Artificial I ntelligence Genetic Algorithms Practical Issues: Selection.
Doug Raiford Phage class: introduction to sequence databases.
Binary Arithmetic for DNA Computers R. Barua and J. Misra Preliminary Proceedings of the Eighth International Meeting on DNA Based Computers, pp ,
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
Genetic Algorithm Dr. Md. Al-amin Bhuiyan Professor, Dept. of CSE Jahangirnagar University.
Ch 20. Parameter Control Ch 21. Self-adaptation Evolutionary Computation vol. 2: Advanced Algorithms and Operators Summarized and presented by Jung-Woo.
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
Chapter 15 Running Time Analysis. Topics Orders of Magnitude and Big-Oh Notation Running Time Analysis of Algorithms –Counting Statements –Evaluating.
DNASequenceGenerator: A Program for the construction of DNA sequences Udo Feldkamp, Sam Saghafi, Wolfgang Banzhaf, Hilmar Rauhe DNA7 pp Summarized.
Surface Defect Inspection: an Artificial Immune Approach Dr. Hong Zheng and Dr. Saeid Nahavandi School of Engineering and Technology.
Biointelligence Lab School of Computer Sci. & Eng. Seoul National University Artificial Intelligence Chapter 8 Uninformed Search.
T. P. Runarsson, J. J. Merelo, U. Iceland & U. Granada (Spain) NICSO 2010 Adapting Heuristic Mastermind Strategies to Evolutionary Algorithms.
Chapter 14 Genetic Algorithms.
MAE 552 Heuristic Optimization
C.-S. Shieh, EC, KUAS, Taiwan
On Template Method for DNA Sequence Design
Boltzmann Machine (BM) (§6.4)
Algorithm Course Algorithms Lecture 3 Sorting Algorithm-1
Presentation transcript:

PUNCH: An Evolutionary Algorithm for Optimizing Bit Set Selection Adam J. Ruben, Stephen J. Freeland, Laura F. Landweber DNA7, 2001 Summarized by Dongmin Kim

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Introduction A randomly selected bit set may experience some physical problems during computation Currently no method exists for finding the “best” bit set. PUNCH (Princeton University Nucleotide Computing Heuristic):  It is compatible with various current DNA computing methods  It is useful in selecting bit sets  One should be able to test the limits of current methodology  One suggest new techniques for DNA computing that may works better

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Program Architecture (1) The primary functional unit is a three-dimensional array  N : the number of bits in the problem  B : the number of nucleotides in each bit  V : the number of variations on each bit set Algorithm  Initialize  Generate random integers between 0 and 3 and fills the array  Assess  Each bit set is assigned the maximum possible score  Make a one-dimensional string from a two dimensional array  A window (size w) fills itself with the first w nucleotides in the string and scans the entire string, each time it encounters an identical sequences, subtracts w from the current score  Repeat this scanning with window which filled by the reverse of their complements

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Program Architecture (2) Algorithm (cont’d)  Pick and fill with best  Select the bit set with the highest score, and fills entire three- dimensional array with that bit set  Mutate  m represents a mutation rate, mBN(V – 1) bases are randomly selected and reassigned  Repeat  Repeat this procedure until each generation fails to improve r times Modifications  For certain protocols, the bits are treated as one linear string. since ignoring inter-bit combinations, the maximum possible score becomes

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Mutation Rate and comparison to Monte Carlo Sequences Best mutation rate is m = 1 / BN Comparison with randomly generated sequences

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) “ Computing on Surfaces ” Using PUNCH There are two ways to test the usefulness of the scoring function  Perform an experiment using the bits it suggests  Assess the bits used in an existing, successful experiment Perform three analysis  10,000 randomly generated sequence, the exact sequences Liu et al., sequences optimized by PUNCH

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Introducing other scoring routine Base Equality  Now, scoring strategy tends to minimize variation where it matters least for avoiding reverse-complement penalties.  Thus, add routine counts the number of each base in a given bit set, divides it by the total number of bases in the set, and get the absolute value of its deviance from Total the deviance of all four bases, subtracts the result from 1, and use this value for the coefficient of score. Thus, having equal amounts of each base is rewarded Folding Score  Similarity to the reverse complements can be replaced with the scores based on single-stranded DNA folding energy  Vienna RNA package

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Comparison to another Bit Set Perform three analysis again  10,000 random sequences, Faulhammer et al. sequences, PUNCH sequences.  Now, the base equality scoring routine penalizes three-base alphabet Omitting Base Equality

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Multi-Base System Introduce a new constant “bases” capable of being defined such that 2 < bases < 9. PUNCH can run as though there are only “bases” bases in existence It can work with more than four bases using unnatural residues as dipropylglycine and dibutylglycine

© 2001 SNU CSE Artificial Intelligence Lab (SCAI) Future Directions Modeling Sexual Evolution Flexibility  Adaptability to various criteria  Definable constants Scalability  PUNCH can help identify which experiments may not be realistic on a larger scale