Using a Genetic Algorithm for Approximate String Matching on Genetic Code Carrie Mantsch December 5, 2003.

Slides:



Advertisements
Similar presentations
Biologically Inspired Computing: Operators for Evolutionary Algorithms
Advertisements

Parallel BioInformatics Sathish Vadhiyar. Parallel Bioinformatics  Many large scale applications in bioinformatics – sequence search, alignment, construction.
Heuristics CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Genetic Algorithms By: Anna Scheuler and Aaron Smittle.
Genetic Algorithms Contents 1. Basic Concepts 2. Algorithm
Genetic Algorithms Representation of Candidate Solutions GAs on primarily two types of representations: –Binary-Coded –Real-Coded Binary-Coded GAs must.
1 Wendy Williams Metaheuristic Algorithms Genetic Algorithms: A Tutorial “Genetic Algorithms are good at taking large, potentially huge search spaces and.
Sequence Similarity Searching Class 4 March 2010.
Sequence Alignment Storing, retrieving and comparing DNA sequences in Databases. Comparing two or more sequences for similarities. Searching databases.
Heuristic alignment algorithms and cost matrices
Sequencing and Sequence Alignment
Finding approximate palindromes in genomic sequences.
Data Mining CS 341, Spring 2007 Genetic Algorithm.
Introduction to Genetic Algorithms Yonatan Shichel.
Genetic Algorithm for Variable Selection
Learning from Experience: Case Injected Genetic Algorithm Design of Combinational Logic Circuits Sushil J. Louis Genetic Algorithm Systems Lab(gaslab)
Artificial Intelligence Genetic Algorithms and Applications of Genetic Algorithms in Compilers Prasad A. Kulkarni.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2002.
Basic concepts of Data Mining, Clustering and Genetic Algorithms Tsai-Yang Jea Department of Computer Science and Engineering SUNY at Buffalo.
Sequence Alignment III CIS 667 February 10, 2004.
Local Search and Stochastic Algorithms
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2004.
Blast heuristics Morten Nielsen Department of Systems Biology, DTU.
Chapter 6: Transform and Conquer Genetic Algorithms The Design and Analysis of Algorithms.
Genetic Algorithms Overview Genetic Algorithms: a gentle introduction –What are GAs –How do they work/ Why? –Critical issues Use in Data Mining –GAs.
Genetic Algorithms: A Tutorial
Genetic Algorithm.
An Approach of Artificial Intelligence Application for Laboratory Tests Evaluation Ş.l.univ.dr.ing. Corina SĂVULESCU University of Piteşti.
SOFT COMPUTING (Optimization Techniques using GA) Dr. N.Uma Maheswari Professor/CSE PSNA CET.
Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins.
BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.
Lecture 8: 24/5/1435 Genetic Algorithms Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Genetic Algorithms Michael J. Watts
Hugh E. Williams and Justin Zobel IEEE Transactions on knowledge and data engineering Vol. 14, No. 1, January/February 2002 Presented by Jitimon Keinduangjun.
Genetic algorithms Charles Darwin "A man who dares to waste an hour of life has not discovered the value of life"
Genetic Algorithms Genetic algorithms imitate a natural optimization process: natural selection in evolution. Developed by John Holland at the University.
More on Heuristics Genetic Algorithms (GA) Terminology Chromosome –candidate solution - {x 1, x 2,...., x n } Gene –variable - x j Allele –numerical.
Fuzzy Genetic Algorithm
1 “Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for optimal combinations of things, solutions.
Derivative Free Optimization G.Anuradha. Contents Genetic Algorithm Simulated Annealing Random search method Downhill simplex method.
Applications of Genetic Algorithms TJHSST Computer Systems Lab By Mary Linnell.
Genetic Algorithms Czech Technical University in Prague, Faculty of Electrical Engineering Ondřej Vaněk, Agent Technology Center ZUI 2011.
Genetic Algorithms What is a GA Terms and definitions Basic algorithm.
Genetic Algorithms Abhishek Sharma Piyush Gupta Department of Instrumentation & Control.
Chapter 12 FUSION OF FUZZY SYSTEM AND GENETIC ALGORITHMS Chi-Yuan Yeh.
Genetic Algorithms. The Basic Genetic Algorithm 1.[Start] Generate random population of n chromosomes (suitable solutions for the problem) 2.[Fitness]
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Lecture 15 Algorithm Analysis
Doug Raiford Phage class: introduction to sequence databases.
1 Contents 1. Basic Concepts 2. Algorithm 3. Practical considerations Genetic Algorithm (GA)
1 Chapter 3 GAs: Why Do They Work?. 2 Schema Theorem SGA’s features: binary encoding proportional selection one-point crossover strong mutation Schema.
Agenda  INTRODUCTION  GENETIC ALGORITHMS  GENETIC ALGORITHMS FOR EXPLORING QUERY SPACE  SYSTEM ARCHITECTURE  THE EFFECT OF DIFFERENT MUTATION RATES.
Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for?  Basic.
Overview Last two weeks we looked at evolutionary algorithms.
Genetic Algorithms. Solution Search in Problem Space.
Genetic Algorithm(GA)
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Genetic Algorithms and Evolutionary Programming A Brief Overview.
 Presented By: Abdul Aziz Ghazi  Roll No:  Presented to: Sir Harris.
9/6/07BCB 444/544 F07 ISU Dobbs - Lab 3 - BLAST1 BCB 444/544 Lab 3 BLAST Scoring Matrices & Alignment Statistics Sept6.
Using GA’s to Solve Problems
Author :Shigeomi HARA Hiroshi DOUZONO Yoshio NOGUCHI
USING MICROBIAL GENETIC ALGORITHM TO SOLVE CARD SPLITTING PROBLEM.
Bioinformatics: The pair-wise alignment problem
Genetic Algorithms: A Tutorial
EE368 Soft Computing Genetic Algorithms.
Traveling Salesman Problem by Genetic Algorithm
Genetic Algorithms: A Tutorial
GA.
Presentation transcript:

Using a Genetic Algorithm for Approximate String Matching on Genetic Code Carrie Mantsch December 5, 2003

Outline Problem Statement Current Techniques GA Motivation My Algorithm Results Extension Possibilities

Problem Statement The problem is to search and align strands of DNA using a genetic algorithm.

Current Techniques Approximate string matching –Usually meant for smaller strings –Many are set up for k mismatches 2 DNA strands of size 90 and 85 –Allowing for 5 gaps in the second strand gives almost 44 million possible alignments

Current Techniques (cont.) Needleman-Wunsch –Gap penalty -1 –Match bonus +1 –Mismatch 0 Not practical if the sequence starts in the middle – Counts the gaps at the beginning and end as penalties.

Current Techniques (cont.) BLAST (Basic Local Alignment Search Tool) and FASTA –Use domain specific knowledge

GA Motivation Alien DNA Junk DNA Extendable to similar text searches without domain specific knowledge

My Algorithm The population –Bit strings of 0’s and 1’s –0’s are spaces, 1’s mean a letter is placed there –The number of 1’s stays constant as the number of letters in the smaller search string

My Algorithm (cont.) Breeding –Rank based selection Crossover –The common place markers are kept the same –The rest of the place markers are split evenly between the two children

My Algorithm (cont.) Mutation –If the amount of gaps is less than one tenth of the small string size add a gap –Otherwise delete a gap

Results The target match

Results (cont.) Ran for 50 generations Different random numbers for the same number of generations give best fitness values between about 32 and 67 (optimal fitness - 90)

Extension Possibilities Better representation of population Be able to alter fitness evaluation to be more specific to different problems Ability to add domain specific knowledge Parallel searching

Questions?