Languages & Notations for Systems Biology Luca Cardelli Microsoft Research Executive Summary http://www.luca.demon.co.uk/BioComputing.htm http://research.microsoft.com/bioinfo.

Slides:



Advertisements
Similar presentations
Robustness Analysis and Tuning of Synthetic Gene Networks Grégory Batt 1 Boyan Yordanov 1 Calin Belta 1 Ron Weiss 2 1 Centers for Information and Systems.
Advertisements

Signals and Systems March 25, Summary thus far: software engineering Focused on abstraction and modularity in software engineering. Topics: procedures,
François FagesLyon, Dec. 7th 2006 Biologie du système de signalisation cellulaire induit par la FSH ASC 2006, projet AgroBi INRIA Rocquencourt Thème “Systèmes.
Modelling Cell Signalling Pathways in PEPA
François Fages MPRI Bio-info 2006 Formal Biology of the Cell Modeling, Computing and Reasoning with Constraints François Fages, Constraints Group, INRIA.
Simulation of Prokaryotic Genetic Circuits Jonny Wells and Jimmy Bai.
Biological pathway and systems analysis An introduction.
Robustness analysis and tuning of synthetic gene networks February 15, 2008 Eyad Lababidi Based on the paper “Robustness analysis and tuning of synthetic.
Models and methods in systems biology Daniel Kluesing Algorithms in Biology Spring 2009.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Marc Riedel Synthesizing Stochasticity in Biochemical Systems Electrical & Computer Engineering Jehoshua (Shuki) Bruck Caltech joint work with Brian Fett.
Petri net modeling of biological networks Claudine Chaouiya.
XML Documentation of Biopathways and Their Simulations in Genomic Object Net Speaker : Hungwei chen.
Dynamic Modeling Of Biological Systems. Why Model? When it’s a simple, constrained path we can easily go from experimental measurements to intuitive understanding.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Computational Modeling of Genome-wide Transcriptional Regulation Center for Comparative Genomics and Bioinformatics, PSU, UP, 2005 Frank Pugh Department.
Programming Languages for Biology Bor-Yuh Evan Chang November 25, 2003 OSQ Group Meeting.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
Computational Systems Biology Prepared by: Rhia Trogo Rafael Cabredo Levi Jones Monteverde.
Computational Models in Systems Biology Karan Mangla 22 nd April, 2008.
Exploration Session Week 8: Computational Biology Melissa Winstanley: (based on slides by Martin Tompa,
BASIC BIOCHEMISTRY MLAB Introduction. INTRODUCTION TO BASIC BIOCHEMISTRY Biochemistry can be defined as the science concerned with the chemical.
Shankar Subramaniam University of California at San Diego Data to Biology.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
Converting Macromolecular Regulatory Models from Deterministic to Stochastic Formulation Pengyuan Wang, Ranjit Randhawa, Clifford A. Shaffer, Yang Cao,
Benjamin Gamble. What is Time?  Can mean many different things to a computer Dynamic Equation Variable System State 2.
Lecture 4: Metabolism Reaction system as ordinary differential equations Reaction system as stochastic process.
Finish up array applications Move on to proteomics Protein microarrays.
Reconstruction of Transcriptional Regulatory Networks
Combinatorial State Equations and Gene Regulation Jay Raol and Steven J. Cox Computational and Applied Mathematics Rice University.
Hybrid Petri net representation of Gene Regulatory Network
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
NY Times Molecular Sciences Institute Started in 1996 by Dr. Syndey Brenner (2002 Nobel Prize winner). Opened in Berkeley in Roger Brent,
Prokaryotes vs Eukaryotes
The Cognate interaction Genomic arrays A new era for modeling the immune response Benoit Morel.
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Dynamical Modeling in Biology: a semiotic perspective Junior Barrera BIOINFO-USP.
Traffic Simulation L2 – Introduction to simulation Ing. Ondřej Přibyl, Ph.D.
Sub-fields of computer science. Sub-fields of computer science.
MCB 7200: Molecular Biology
Modelling of biomolecular networks
University of California at San Diego
Whole-cell models: combining genomics and dynamical modeling
What contribution can automated reasoning make to e-Science?
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Cell Communication.
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Higher Biology Metabolism & Enzymes Mr G R Davidson.
A Compositional Approach to the Stochastic Dynamics of Gene Networks
Presentation Topic Cloning Vector and its Types Presented By
University of California at San Diego
1 Department of Engineering, 2 Department of Mathematics,
Languages for Systems Biology
Genomes and Their Evolution
Cell Communication.
Cell Communication.
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Summary of the Standards of Learning
You will be given the answer. You must give the correct question.
Cell Communication.
Review Warm-Up What is the Central Dogma?
Unit III Information Essential to Life Processes
Languages for Systems Biology
The Study of Biological Information
Cell Communication.
Cell Communication.
Review Warm-Up What is the Central Dogma?
Computational Biology
Presentation transcript:

Languages & Notations for Systems Biology Luca Cardelli Microsoft Research Executive Summary http://www.luca.demon.co.uk/BioComputing.htm http://research.microsoft.com/bioinfo

Structural Architecture Eukaryotic Cell (10~100 trillion in human body) Nuclear membrane Mitochondria Membranes everywhere Golgi Vesicles E.R. Plasma membrane (<10% of all membranes) 11/23/2018

Functional Architecture Regulation The Virtual Machines of Biochemistry Gene Machine Biochemical Networks - The Protein Machine Gene Regulatory Networks - The Gene Machine Transport Networks - The Membrane Machine Nucleotides Systems Biology “We (kind of) understand the components; but how does the system work?” Makes proteins, where/when/howmuch Holds genome(s), confines regulators Signals conditions and events Directs membrane construction and protein embedding Model Integration Different time and space scales. Holds receptors, actuators hosts reactions Protein Machine Membrane Machine Implements fusion, fission Aminoacids Phospholipids Metabolism, Propulsion Signal Processing Molecular Transport Confinement Storage Bulk Transport 11/23/2018

EU Commission, Health Research Report on Computational Systems Biology General Modelling Requirements Research projects should focus on integrated modelling of several cellular processes leading to as complete an understanding as possible of the dynamic behaviour of a cell. Several projects may be required to develop modules (metabolism, signalling, trafficking, organelles, cell cycle, gene expression, replication, cytoskeleton) in model organisms. This modelling should involve realistic analysis of experimental data, including a wide range of data for transcriptomics, proteomics and functional genomics, and interactions with cellular pathways including signal transduction, regulatory cascades, metabolic pathways etc. It should involve: Coherent, high-quality, quantitative, heterogeneous and dynamic data sets as a basis for novel model constructions to advance from analytical to predictive modelling. Experimental functional analysis tools (in-situ proteomics, protein-protein interactions, metabolic fluxes, etc) 11/23/2018

Challenges for Formal Notations in Biology Describe biological systems precisely For analysis (discovering principles of operation) For simulation (drug development, etc.) For engineering (optimizing output, etc.) New working hypothesis: Describe these complex deeply-layered systems as if they were software systems. I.e. code them up in some analyzable language or notation. Claim (to be validated): modularity and compositionality advantages, just as in software, for scaling-up, w.r.t. traditional methods (chemical equations, differential equations). 11/23/2018

Biochemical Process Notations Chemical reactions is a process calculus! A long, long, flat list of thousands of reactions… highly concurrent and nondeterministic. But there is also structure and modularity in biochemistry. Representing structure Process calculi are the modular representation of discrete concurrent processes. They can be seen as an input language for Petri Nets or for Continuous Time Markov Chains. Just like a sequence of assignments and goto’s is a programming language. There are better (yes?) programming languages. But no ordinary programming language has that level of concurrency and nondeterminism. Let’s take a look at the high-level process notations of biochemistry (mostly diagrams and pictures) … 11/23/2018

1: The Protein Machine Pretty close to the atoms. On/Off switches cf. BioCalculus [Kitano&Nagasaki], k-calculus [Danos&Laneve] On/Off switches Each protein has a structure of binary switches and binding sites. But not all may be always accessible. Inaccessible Protein Inaccessible Binding Sites Switching of accessible switches. - May cause other switches and binding sites to become (in)accessible. - May be triggered or inhibited by nearby specific proteins in specific states. Binding on accessible sites. May cause other switches and binding sites to become (in)accessible. - May be triggered or inhibited by nearby specific proteins in specific states. 11/23/2018

Molecular Interaction Maps http://www.cds.caltech.edu/~hsauro/index.htm JDesigner Taken from Kurt W. Kohn 11/23/2018

“External Choice” The phage lambda switch 2. The Gene Machine Pretty far from the atoms. cf. Hybrid Petri Nets [Matsuno, Doi, Nagasaki, Miyano] Positive Regulation Negative Regulation Transcription Input Output1 Output2 Input Output Coding region Gene (Stretch of DNA) “External Choice” The phage lambda switch Regulatory region Regulation of a gene (positive and negative) influences transcription. The regulatory region has precise DNA sequences, but not meant for coding proteins: meant for binding regulators. Transcription produces molecules (RNA or, through RNA, proteins) that bind to regulatory region of other genes (or that are end-products). Human (and mammalian) Genome Size 3Gbp (Giga base pairs) 750MB @ 4bp/Byte (CD) Non-repetitive: 1Gbp 250MB In genes: 320Mbp 80MB Coding: 160Mbp 40MB Protein-coding genes: 30,000-40,000 M.Genitalium (smallest true organism) 580,073bp 145KB (eBook) E.Coli (bacteria): 4Mbp 1MB (floppy) Yeast (eukarya): 12Mbp 3MB (MP3 song) Wheat 17Gbp 4.25GB (DVD) 11/23/2018

Gene Regulatory Networks http://strc.herts.ac.uk/bio/maria/NetBuilder/ Taken from Eric H Davidson Or And Gate Amplify Sum DNA Begin coding region NetBuilder 11/23/2018

3. The Membrane Machine Q Q P Q P Q Q Q P P P Q P Q P P Very far from the atoms. Zero case P Q Q Q Pino Exo P Q P Q Endo One case Endo R Q Phago R Q Arbitrary subsystem Fusion Fission P Q Zero case P P Drip Mate P Q P Q Mito Mito One case P P R Bud R Arbitrary subsystem Fusion Fission 11/23/2018

Membrane Transport Algorithms LDL-Cholesterol Degradation Protein Production and Secretion Viral Replication Taken from MCB p.730 11/23/2018

Promising Techniques and Technologies 11/23/2018

Stochastic Simulation Basic algorithm: Gillespie Exact (i.e. based on physics) stochastic simulation of chemical kinetics. Can compute concentrations and reaction times for biochemical networks. Stochastic Process Calculi BioSPi [Shapiro, Regev, Priami, et. al.] Stochastic process calculus based on Gillespie. BioAmbients [Regev, Panina, Silverma, Cardelli, Shapiro] Extension of BioSpi for membranes. Stochastic Highwire? [Merdith] Case study: Lymphocytes in Inflamed Blood Vessels [Lecaa, Priami, Quaglia] Original analysis of lymphocyte rolling in blood vessels of different diameters. Case study: Lambda Switch [Celine Kuttler, IRI Lille] Model of phage lambda genome (well-studied system). Case study: VICE [U. Pisa] Minimal prokaryote genome (180 genes) and metabolism of whole VIrtual CEll, in stochastic p-calculus, simulated under stable conditions for 40K transitions. More traditional approaches Charon language [UPenn] Hybrid systems: continuous differential equations + discrete/stochastic mode switching. Etc. 11/23/2018

“Program” Analysis Causality Analysis Control Flow Analysis Biochemical pathways, (“concurrent traces” such as the one here), are found in biology publications, summarizing known facts. This one, however, was automatically generated from a program written in BioSpi by comparing traces of all possible interactions. [Curti, Priami, Degano, Baldari] One can play with the program to investigate various hypotheses about the pathways. Control Flow Analysis Flow analysis techniques applied to process calculi. Overapproximation of behavior used to answer questions about what “cannot happen”. Analysis of positive feedback transcription regulation in BioAmbients [Flemming Nielson]. 11/23/2018

Modelchecking Temporal: NuSMV [Chabrier-Rivier Chiaverini Danos Fages Schachter] Analysis of mammalian cell cycle (after Kohn) in CTL. E.g. is state S1 a necessary checkpoint for reaching state S2? Quantitative: Simpathica/xssys [Antioniotti Park Policriti Ugel Mishra] Quantitative temporal logic queries of human Purine metabolism model. Stochastic: Spring [Parker Normal Kwiatkowska] Designed for stochastic (computer) network analysis Discrete and Continuous Markov Processes. Process input language. Modelchecking of probabilistic queries. Eventually(Always (PRPP = 1.7 * PRPP1) implies steady_state() and Eventually(Always(IMP < 2 * IMP1)) and Eventually(Always(hx_pool < 10*hx_pool1))) 11/23/2018

What Process Calculi Do For Us We can write things down We can modularly describe high structural and combinatorial complexity (“do programming”). Software teaches us that large and deep systems, even well engineered ones where each component is rigidly defined, eventually exhibit “emergent behavior” (damn!). We can calculate and analyze Directly support simulation. Support analysis (e.g. control flow, causality, nondeterminism). Support state exploration (modelchecking). This was invented to discover “emergent behavior” (=bugs) in software and hardware systems. Should have interesting large-scale applications in biology. We can reason Suitable equivalences on processes induce algebraic laws. We can relate different abstraction levels and behaviors. We can use equivalences for state minimization (symmetries). Disclaimers Some of these technologies are basically ready (small-scale stochastic simulation and analysis, medium-scale nondeterministic and stochastic modelchecking). Others need to scale up significantly to be really useful. This is (has been) the challenge for computer scientists. 11/23/2018

END “The problem of biology is not to stand aghast at the complexity but to conquer it.” - Sydney Brenner “Although the road ahead is long and winding, it leads to a future where biology and medicine are transformed into precision engineering.” - Hiroaki Kitano. 11/23/2018