1 Artificial Regulatory Network Evolution Yolanda Sanchez-Dehesa 1, Loïc Cerf 1, José-Maria Peña 2, Jean-François Boulicaut 1 and Guillaume Beslon 1 1.

Slides:



Advertisements
Similar presentations
Approaches, Tools, and Applications Islam A. El-Shaarawy Shoubra Faculty of Eng.
Advertisements

Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Evidence for Complex Causes
Integrating Genomes D. R. Zerbino, B. Paten, D. Haussler Science 336, 179 (2012) Teacher: Professor Chao, Kun-Mao Speaker: Ho, Bin-Shenq June 4, 2012.
The Structure and Plasticity of the Phenotype as a Network Phenomenon George Kampis Basler Chair Spring 2007, ETSU, Johnson City, TN.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Petri net modeling of biological networks Claudine Chaouiya.
Challenges to the Modern Synthesis Stephen Jay Gould, Nils Eldredge,
Network Morphospace Andrea Avena-Koenigsberger, Joaquin Goni Ricard Sole, Olaf Sporns Tung Hoang Spring 2015.
Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Adaptive evolution of bacterial metabolic networks by horizontal gene transfer Chao Wang Dec 14, 2005.
Evolutionary Algorithms Simon M. Lucas. The basic idea Initialise a random population of individuals repeat { evaluate select vary (e.g. mutate or crossover)
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
An Application of Bendixson-Boincare Theorem MAT 574- Fall 2003 Arizona State University Math & Stat Dept.
Chp 3 Genomics, Proteomics, and Related Approaches to Physiology.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
12.4 Gene Regulation and Mutation
Regulatory factors 1) Gene copy number 2) Transcriptional control 2-1) Promoters 2-2) Terminators, attenuators and anti-terminators 2-3) Induction and.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
CSE 6406: Bioinformatics Algorithms. Course Outline
Synthetic biology: New engineering rules for emerging discipline Andrianantoandro E; Basu S; Karig D K; Weiss R. Molecular Systems Biology 2006.
Gene Expression and Gene Regulation. The Link between Genes and Proteins At the beginning of the 20 th century, Garrod proposed: – Genetic disorders such.
6/22/04 The RBF-Gene Model – GECCO'04 The RBF-Gene Model A bio-inspired genetic algorithm with a self-organizing genome Virginie LEFORT, Carole KNIBBE,
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Modeling of complex systems: what is relevant? Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS)
Jeffrey Zheng School of Software, Yunnan University August 4, nd International Summit on Integrative Biology August 4-5, 2014 Chicago, USA.
Abstract ODE System Model of GRNs Summary Evolving Small GRNs with a Top-Down Approach Javier Garcia-Bernardo* and Margaret J. Eppstein Department of Computer.
 Scientific evidence shows that life on Earth had one origin or multiple origins?
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Modeling Genetic Network: Boolean Network Yongyeol Ahn KAIST.
Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
09/20/04 Introducing Proteins into Genetic Algorithms – CSIMTA'04 Introducing “Proteins” into Genetic Algorithms Virginie LEFORT, Carole KNIBBE, Guillaume.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Modelling from Sequence to Gene Regulatory Network to Phenotype.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Contribution of second order evolution to evolutionary algorithms Virginie LEFORT July 11 th.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
A Genetic Algorithm for Designing Materials: Gene A. Tagliarini Edward W. Page M. Rene Surgi.
Organic Evolution and Problem Solving Je-Gun Joung.
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
Dynamical Modeling in Biology: a semiotic perspective Junior Barrera BIOINFO-USP.
High throughput biology data management and data intensive computing drivers George Michaels.
Center for Bioinformatics and Genomic Systems Engineering Bioinformatics, Computational and Systems Biology Research in Life Science and Agriculture.
Individuals in a population may evolve. A.True B.False False! Individuals do NOT evolve; POPULATIONS do!
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
Bioinformatics Overview
High-throughput data used in bioinformatics
The Transcriptional Landscape of the Mammalian Genome
Optimization Based Design of Robust Synthetic
Control of Gene Expression
Building and Analyzing Genome-Wide Gene Disruption Networks
26.5 Molecular Clocks Help Track Evolutionary Time
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Lesson 1: Evolution.
Modelling Structure and Function in Complex Networks
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
INTRODUCTION TO MOLECULAR GENETICS
Presentation transcript:

1 Artificial Regulatory Network Evolution Yolanda Sanchez-Dehesa 1, Loïc Cerf 1, José-Maria Peña 2, Jean-François Boulicaut 1 and Guillaume Beslon 1 1 : LIRIS Laboratory, INSA-Lyon, France 2 : DATSI, Facultad de Informatica, Universidad Politécnica de Madrid, Spain MLSB 07 Evry, September 24th

2 Context: from data to knowledge Large kinetic transcriptome data sets are announced We need to design NOW the related data mining algorithms genes samples time Genetic Network (GN) inference

3 Problems Just a few real data sets are available Today, benchmarking is performed on: –Randomly generated data –Synthetic data w.r.t. models from other fields –Data from GN generators biased by topology

4 Approach Can we use simulation to build biologically plausible GNs and thus more relevant kinetic data sets? GNs are built by an evolutionary process We propose to use artificial evolution to generate plausible GNs

5 Biologically plausible GN To obtain plausible GNs we must respect biological bases of network evolution: –GNs are derived from a genome sequence and a proteome component –Mutation of the genetic sequence –Selection on the phenotype We have developed the RAevol Model

6 Based on the Aevol* Model Studying robustness and evolvability in artificial organisms: –Artificial genome, non-coding sequences, variable number of genes –Genome: circular double-strand binary string –Mutation/selection process * C. Knibbe PhD, INSA-Lyon, October 2006

7 Biological function Protein expression ProteomePhenotypeGenome Biological function Protein expression transcription translation protein interactions Objective function fonctionalities Fuzzy function mw H = B.h Mutations switch indels rearrangements Selection Reproduction Protein expression Biological function Ævol : Artificial Evolution Metabolic error

8 From Aevol to RAevol Interesting properties of Aevol to understand genome evolution: –See C. Knibbe, A long-term evolutionary pressure on the amount of non-coding DNA (2007). Molecular Biology and Evolution, in press. doi: /molbev/msm165 We need to add a regulatory process  RAevol

9 Biological function Expression level ProteomePhenotypeGenome transcription translation protein interactions Fuzzy logic function MutationsSelection Reproduction Expression level Biological function RÆvol : Artificial Evolution Metabolic error The phenotype becomes a dynamic function … Regulation mw H = C.h Banzhaf03:On Evolutionary Design, Embodiment and Artificial Regulatory Networks

10 Experimental setup Simulations: 1000 individuals, mutation rate , generations Organisms must perform 3 metabolic functions The incoming of an external signal (protein) triggers an inhibition process Biological function External protein Individual life

11 First results The metabolic network mainly grows during the 5000 first generations  GN grows likely Transcription factors appear after generations  GN grows independently from metabolism

12 First results First phase: quasi-normal distribution Second phase: multimodal distribution, strong links (mainly inhibitory) … Regulatory Links Values

13 Conclusion and perspectives RAevol generates plausible GNs (protein-gene expression levels) along evolution Studying the generation of kinetic transcriptome data sets is ongoing protein gene Time (evolution) gene expression Time (life) Towards more realistic benchmarks for data mining algorithms

14 Open issues Systematic experiments  effect of mutation rates  effect of environment stability Study the network topology  Compare the network topology with real organisms…  Do frequent motifs/modules appear in the network ?

15 The Aevol Model Interesting properties of the Aevol Model: –Transcription/translation process  Different RNA production levels –Explicit (abstract) proteome  interactions between proteins and genetic sequence –Variable gene number  Variable network size –Complex mutational process (mutations, InDel, rearrangements, …)  Different topology emergence