Discovery of RNA Structural Elements Using Evolutionary Computation Authors: G. Fogel, V. Porto, D. Weekes, D. Fogel, R. Griffey, J. McNeil, E. Lesnik,

Slides:



Advertisements
Similar presentations
Biologically Inspired Computing: Operators for Evolutionary Algorithms
Advertisements

CS6800 Advanced Theory of Computation
Genetic Algorithms Contents 1. Basic Concepts 2. Algorithm
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
1 “INTRODUCTION TO BIOINFORMATICS” “SPRING 2005” “Dr. N AYDIN” Lecture 4 Multiple Sequence Alignment Doç. Dr. Nizamettin AYDIN
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
Finding Compact Structural Motifs Presented By: Xin Gao Authors: Jianbo Qian, Shuai Cheng Li, Dongbo Bu, Ming Li, and Jinbo Xu University of Waterloo,
Genetic Algorithms and Their Applications John Paxton Montana State University August 14, 2003.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – The Transcription.
Artificial Intelligence Genetic Algorithms and Applications of Genetic Algorithms in Compilers Prasad A. Kulkarni.
Fuzzy K means.
RiboSearch Ben Daniel Ariel Kirshner Naomi
CECS Introduction to Bioinformatics University of Louisville Spring 2003 Dr. Eric Rouchka Lecture 3: Multiple Sequence Alignment Eric C. Rouchka,
Sequencing a genome and Basic Sequence Alignment
Genetic Programming.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Genetic Algorithm.
CSE 6406: Bioinformatics Algorithms. Course Outline
Introduction to Genetic Algorithms and Evolutionary Computation
Good solutions are advantageous Christophe Roos - MediCel ltd Similarity is a tool in understanding the information in a sequence.
Genetic algorithms Prof Kang Li
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Hugh E. Williams and Justin Zobel IEEE Transactions on knowledge and data engineering Vol. 14, No. 1, January/February 2002 Presented by Jitimon Keinduangjun.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
Sequencing a genome and Basic Sequence Alignment
Applying Genetic Algorithm to the Knapsack Problem Qi Su ECE 539 Spring 2001 Course Project.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
1/27 Discrete and Genetic Algorithms in Bioinformatics 許聞廉 中央研究院資訊所.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Construction of Substitution Matrices
PreDetector : Prokaryotic Regulatory Element Detector Samuel Hiard 1, Sébastien Rigali 2, Séverine Colson 2, Raphaël Marée 1 and Louis Wehenkel 1 1 Department.
Bioinformatics Multiple Alignment. Overview Introduction Multiple Alignments Global multiple alignment –Introduction –Scoring –Algorithms.
Motifs BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin Edward Marcotte/Univ. of Texas/BCH364C-391L/Spring.
2005MEE Software Engineering Lecture 11 – Optimisation Techniques.
A New Evolutionary Approach for the Optimal Communication Spanning Tree Problem Sang-Moon Soak Speaker: 洪嘉涓、陳麗徽、李振宇、黃怡靜.
Overview of Bioinformatics 1 Module Denis Manley..
1 Genetic Algorithms K.Ganesh Introduction GAs and Simulated Annealing The Biology of Genetics The Logic of Genetic Programmes Demo Summary.
Genetic Algorithms What is a GA Terms and definitions Basic algorithm.
COT 6930 HPC and Bioinformatics Multiple Sequence Alignment Xingquan Zhu Dept. of Computer Science and Engineering.
Chapter 12 FUSION OF FUZZY SYSTEM AND GENETIC ALGORITHMS Chi-Yuan Yeh.
Biologically inspired algorithms BY: Andy Garrett YE Ziyu.
Genetic Algorithms. The Basic Genetic Algorithm 1.[Start] Generate random population of n chromosomes (suitable solutions for the problem) 2.[Fitness]
Waqas Haider Bangyal 1. Evolutionary computing algorithms are very common and used by many researchers in their research to solve the optimization problems.
Sequence Alignment.
Construction of Substitution matrices
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
Step 3: Tools Database Searching
Contribution of second order evolution to evolutionary algorithms Virginie LEFORT July 11 th.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – the Transcription.
An Introduction to Genetic Algorithms Lecture 2 November, 2010 Ivan Garibay
Motif identification with Gibbs Sampler Xuhua Xia
Genetic Algorithms. Underlying Concept  Charles Darwin outlined the principle of natural selection.  Natural Selection is the process by which evolution.
Genetic Algorithm Dr. Md. Al-amin Bhuiyan Professor, Dept. of CSE Jahangirnagar University.
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
1 Comparative Study of two Genetic Algorithms Based Task Allocation Models in Distributed Computing System Oğuzhan TAŞ 2005.
DNA SEQUENCE ALIGNMENT FOR PROTEIN SIMILARITY ANALYSIS CARL EBERLE, DANIEL MARTINEZ, MENGDI TAO.
EVOLUTIONARY SYSTEMS AND GENETIC ALGORITHMS NAME: AKSHITKUMAR PATEL STUDENT ID: GRAD POSITION PAPER.
Stochastic Local Search Algorithms for DNA word Design Dan C Tulpan, Holger H Hoos, and Anne Condon Summerized by Ji-Eun Yun.
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Hirophysics.com The Genetic Algorithm vs. Simulated Annealing Charles Barnes PHY 327.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Bioinformatics Overview
Predict Protein Sequence by Fuzzy-Association Rules
Finding Functionally Significant Structural Motifs in Proteins
Bioinformatics Vicki & Joe.
EE368 Soft Computing Genetic Algorithms.
Exploring Genetic Algorithms Through the Iterative Prisoner's Dilemma
Presentation transcript:

Discovery of RNA Structural Elements Using Evolutionary Computation Authors: G. Fogel, V. Porto, D. Weekes, D. Fogel, R. Griffey, J. McNeil, E. Lesnik, D. Ecker, R. Sampath, Natural Selection Inc. and Ibis Therapeutics Presenter: Elena Zheleva April 2, 2004

Introduction  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Problem Statement  Computational Biology problem: given a RNA secondary structure description, search for similar secondary structures  Currently, exhaustive search techniques are used to narrow down search space  Authors focus on presentation and set of operators to search via evolution

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Background  RNA (ribonucleic acid) directs middle steps of protein production single-stranded, certain parts are folded  RNA Secondary Structure - accounts for diverse functional activities

Background  RNA Secondary Structure: Recurs in multiple genes within a single organism Recurs in across the same gene in several organism  Why a computational tool for RNA secondary structure search? Discover new structures Improve understanding of functional and regulatory relationships amongst related RNAs

Background – RNAMotif  RNAMotif: mines nucleotide sequence databases for repeating structure motifs  RNAMotif Input: descriptor contains details about pairing information, length, sequence

Background - RNAMotif  RNAMotif Output: list of real structures   RNAMotif may return a very high number of motifs when descriptor is more flexible  Input to the EA: RNAMotif Output

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Evolutionary Computation Population Initialization  P parent bins  B – bin size  Bin = a contending solution  Each bin contains structures from different organisms  Structures chosen at random from RNAMotif Output file Figure 1

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Evolutionary Computation Variation  P parent bins are copied to O offspring bins  Variables: operator, number of times to apply it  Variation Operator 1: structure replacement within a specified organism Replacement taken from RNAMotif Output File Local – neighboring replacement structure Global – random replacement structure Example: P organisms = {H. Sapiens, S. Scrofa, E. Coli, G. Gallus}

Evolutionary Computation Variation  Variation Operator 2: Structure replacement from different organisms Variable: # of structures to be replaced Example: # = 2 P organisms = {H. Sapiens, S. Scrofa, E. Coli, G. Gallus} O organisms = {H. Sapiens, C. Griseus, E. Coli, S. Scrofa}

Evolutionary Computation Variation  Variation Operator 3: random single-point bin recombination Generates a second parent from RNAMotif output and applies single-point bin recombination Chooses randomly one of the two offsprings Example: P = {H, S, E, G} P = {D, E, O, B} O = {H, S, E, B} O = {D, E, O, G}  Variation Operator 4: random multi-point bin recombination

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Evolutionary Computation Fitness  Fitness Function Scoring Components: Structure nucleotide sequence similarity Structure length similarity Structure thermodynamic stability similarity  These measures are applied pairwise by each structural component and summed into a final bin score

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Evolutionary Computation Selection  Selection: For every bin in population, A set of R rival bins is randomly selected Calculate score = # rivals with lower fitness  Lower bins are removed  Iterations continue until number of generations (G) or CPU time is satisfied, or until expected change of fitness/gen  0

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Results  Experiment 1: 7.6x10 possible bins Exhaustive search: 125 days  EA examined ~10 bins before converging < 3 minutes 8 4

Results

 To test the utility of this method Run on newly discovered genomes (S. Pyogenes) Compare to database which has an alignment for this RNA secondary structure for previously discovered genomes (S. Mutans) Found similar sequence and structure to close organisms

Outline  Problem Statement  Background  Evolutionary Computation Population initialization Variation Fitness Selection  Results  Conclusion

Conclusion  Evolutionary Algorithm can be applied to find RNA secondary structures over a wide range of organisms  Converges quickly and reliably  Algorithm comes up with a solution which contains information about structural elements for different organisms/genomes