Languages for Systems Biology

Slides:



Advertisements
Similar presentations
Simulation of Prokaryotic Genetic Circuits Jonny Wells and Jimmy Bai.
Advertisements

Prof. Drs. Sutarno, MSc., PhD.. Biology is Study of Life Molecular Biology  Studying life at a molecular level Molecular Biology  modern Biology The.
Cell Structure and Function Chapter 3 Basic Characteristics of Cells Smallest living subdivision of the human body Diverse in structure and function.
DNA/RNA & Molecular Basis of Evolution ASTR 1420 Lecture 4 Sections 5.2, 5.4.
Models and methods in systems biology Daniel Kluesing Algorithms in Biology Spring 2009.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Marc Riedel Synthesizing Stochasticity in Biochemical Systems Electrical & Computer Engineering Jehoshua (Shuki) Bruck Caltech joint work with Brian Fett.
Petri net modeling of biological networks Claudine Chaouiya.
XML Documentation of Biopathways and Their Simulations in Genomic Object Net Speaker : Hungwei chen.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Exploration Session Week 8: Computational Biology Melissa Winstanley: (based on slides by Martin Tompa,
BASIC BIOCHEMISTRY MLAB Introduction. INTRODUCTION TO BASIC BIOCHEMISTRY Biochemistry can be defined as the science concerned with the chemical.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
CSE 6406: Bioinformatics Algorithms. Course Outline
Synthetic biology: New engineering rules for emerging discipline Andrianantoandro E; Basu S; Karig D K; Weiss R. Molecular Systems Biology 2006.
Microbial Models I: Genetics of Viruses and Bacteria 7 November, 2005 Text Chapter 18.
Genetics: Chapter 7. What is genetics? The science of heredity; includes the study of genes, how they carry information, how they are replicated, how.
Reconstruction of Transcriptional Regulatory Networks
Chapter 11 Phage strategies.
Noneukaryotic Genetic Information
A Biology Primer Part IV: Gene networks and systems biology Vasileios Hatzivassiloglou University of Texas at Dallas.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
NY Times Molecular Sciences Institute Started in 1996 by Dr. Syndey Brenner (2002 Nobel Prize winner). Opened in Berkeley in Roger Brent,
Central dogma: the story of life RNA DNA Protein.
The Cognate interaction Genomic arrays A new era for modeling the immune response Benoit Morel.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
Dynamical Modeling in Biology: a semiotic perspective Junior Barrera BIOINFO-USP.
Genes in ActionSection 2 Section 2: Regulating Gene Expression Preview Bellringer Key Ideas Complexities of Gene Regulation Gene Regulation in Prokaryotes.
High throughput biology data management and data intensive computing drivers George Michaels.
The genome of prokaryotes and eukaryotes- nuclear and extranuclear genetic organization.
Chapter 7: The Blueprint of Life, from DNA to Protein.
From the double helix to the genome
Topics to be covers Basic features present on plasmids
MCB 7200: Molecular Biology
Cells & Cell Organelles
ZOOLOGY—STUDY OF ANIMALS
University of California at San Diego
Inside of an Animal and Plant Cell
Themes of Biology Chapter 1
The Basics of Molecular Biology
Course in Molecular Biology
Computer Hardware Introduction.
Molecules to Metabolism
Scientists have concluded the following about the cell theory
University of California at San Diego
Structure and Function
Languages & Notations for Systems Biology Luca Cardelli Microsoft Research Executive Summary
Genomes and Their Evolution
Cellular Metabolism Metabolic processes – all chemical reactions that occur in the body Cellular metabolism- refers to all of the chemical processes that.
Introduction to Gene Expression
Structure and Function
Summary of the Standards of Learning
Introduction to Biochemistry. Objectives  Know what biochemistry is and its principle.  Know the components of a cell and its major types of bio- molecules.
UNIFYING AND COMMON THEMES IS THE STUDY OF LIFE
Unit III Information Essential to Life Processes
Genetics: From Genes to Genomes
Languages for Systems Biology
The Study of Biological Information
Introduction to Biochemistry
what do you mean I’m Hormonal?
Computational Biology
Cells, tissues and organs
Chapter 2 - Cellular activity
CELL STRUCTURE AND FUNCTION
Presentation transcript:

Languages for Systems Biology Luca Cardelli Microsoft Research Cambridge UK http://www.luca.demon.co.uk/BioComputing.htm http://research.microsoft.com/bioinfo Languages for Systems Biology

50 Years of Molecular Cell Biology Genes are made of DNA Store digital information as sequences of 4 different nucleotides Direct protein assembly through RNA and the Genetic Code Proteins (>10000) are made of amino acids Process signals Activate genes Move materials Catalyze reactions to produce substances Control energy production and consumption Bootstrapping still a mystery DNA, RNA, proteins, membranes are today interdependent. Not clear who came first Separation of tasks happened a long time ago Not understood, not essential 11/23/2018

Towards Systems Biology Biologists now understand many of the cellular components A whole team of biologists will typically study a single protein for years When each component and each reaction is understood, the system is understood (?) But this has not led to understand how “the system” works Behavior comes from complex chains of interactions between components Predictive biology and pharmacology still rare Synthetic biology still unreliable New approach: try to understand “the system” Experimentally: massive data gathering and data mining (e.g. Genome projects) Conceptually: modeling and analyzing networks (i.e. interactions) of components What kind of a system? Just beyond the basic chemistry of energy and materials processing… Built right out of digital information (DNA) Based on information processing for both survival and evolution Can we fix it when it breaks? Really becomes: How is information structured and processed? 11/23/2018

Size DVD Pentium II Gate E. Coli 1 micron 1 micron  = 0.25 micron in Pentium II Pentium II Gate E. Coli 1 micron 11/23/2018

Performance Pentium II E. Coli DVD ~3 million transistors ~1/4 megabyte of memory ~100 million operations per second ~1 million macromolecules ~1 megabyte of static genetic memory ~1 million amino-acids per second 4700 megabytes of memory 1.385 megabytes per second Comparison courtesy of Eric Winfree 11/23/2018

Aims Modeling biological systems. By adapting paradigms and techniques developed for modeling information-processing systems. Because they have some similar features: Deep layering of abstractions. Complex composition of simpler components. Discrete (non-linear) evolution. Digital coding of information. Reactive information-driven behavior. Very high degree of concurrency. “Emergent behavior” (not obvious from part list). 11/23/2018

EU Commission, Health Research Report on Computational Systems Biology General Modelling Requirements Research projects should focus on integrated modelling of several cellular processes leading to as complete an understanding as possible of the dynamic behaviour of a cell. Several projects may be required to develop modules (metabolism, signalling, trafficking, organelles, cell cycle, gene expression, replication, cytoskeleton) in model organisms. This modelling should involve realistic analysis of experimental data, including a wide range of data for transcriptomics, proteomics and functional genomics, and interactions with cellular pathways including signal transduction, regulatory cascades, metabolic pathways etc. It should involve: Coherent, high-quality, quantitative, heterogeneous and dynamic data sets as a basis for novel model constructions to advance from analytical to predictive modelling. Experimental functional analysis tools (in-situ proteomics, protein-protein interactions, metabolic fluxes, etc) 11/23/2018

Methods Applying techniques unique to Computing: Model Construction (writing things down precisely) Studying the notations used in systems biology. Devising formal languages to reflect them. Studying their dynamics (semantics). Model Validation (using models for postdiction and prediction) Stochastic Simulation Stochastic = Quantitative concurrent semantics. Based on compositional descriptions. “Program” Analysis Control flow analysis Causality analysis Modelchecking Standard, Quantitative, Probabilistic 11/23/2018

Structural Architecture Eukaryotic Cell (10~100 trillion in human body) Nuclear membrane Mitochondria Membranes everywhere Golgi Vesicles E.R. Plasma membrane (<10% of all membranes) 11/23/2018

Functional Architecture The Abstract Machines of Systems Biology Regulation Gene Machine The “hardware” (biochemistry) is fairly well understood. But what is the “software” that runs on these machines? Notations already used in Biology Nucleotides Biochemical toolkits Makes proteins, where/when/howmuch Holds genome(s), confines regulators Signals conditions and events Directs membrane construction and protein embedding Model Integration Different time and space scales P Q Holds receptors, actuators hosts reactions Protein Machine Machine Membrane Implements fusion, fission Aminoacids Phospholipids Phospholipids Metabolism, Propulsion Signal Processing Molecular Transport Confinement Storage Bulk Transport 11/23/2018

1: The Protein Machine Pretty close to the atoms. On/Off switches cf. BioCalculus [Kitano&Nagasaki], k-calculus [Danos&Laneve] On/Off switches Each protein has a structure of binary switches and binding sites. But not all may be always accessible. Inaccessible Protein Inaccessible Binding Sites Switching of accessible switches. - May cause other switches and binding sites to become (in)accessible. - May be triggered or inhibited by nearby specific proteins in specific states. Binding on accessible sites. May cause other switches and binding sites to become (in)accessible. - May be triggered or inhibited by nearby specific proteins in specific states. 11/23/2018

Molecular Interaction Maps http://www.cds.caltech.edu/~hsauro/index.htm The p53-Mdm2 and DNA Repair Regulatory Network JDesigner Taken from Kurt W. Kohn 11/23/2018

“External Choice” The phage lambda switch 2. The Gene Machine Pretty far from the atoms. cf. Hybrid Petri Nets [Matsuno, Doi, Nagasaki, Miyano] Positive Regulation Negative Regulation Transcription Input Output1 Output2 Input Output Coding region Gene (Stretch of DNA) “External Choice” The phage lambda switch Regulatory region Regulation of a gene (positive and negative) influences transcription. The regulatory region has precise DNA sequences, but not meant for coding proteins: meant for binding regulators. Transcription produces molecules (RNA or, through RNA, proteins) that bind to regulatory region of other genes (or that are end-products). Human (and mammalian) Genome Size 3Gbp (Giga base pairs) 750MB @ 4bp/Byte (CD) Non-repetitive: 1Gbp 250MB In genes: 320Mbp 80MB Coding: 160Mbp 40MB Protein-coding genes: 30,000-40,000 M.Genitalium (smallest true organism) 580,073bp 145KB (eBook) E.Coli (bacteria): 4Mbp 1MB (floppy) Yeast (eukarya): 12Mbp 3MB (MP3 song) Wheat 17Gbp 4.25GB (DVD) 11/23/2018

Gene Regulatory Networks http://strc.herts.ac.uk/bio/maria/NetBuilder/ Taken from Eric H Davidson Or And Gate Amplify Sum DNA Begin coding region NetBuilder 11/23/2018

3. The Membrane Machine P Q P Q P Q P Q Very far from the atoms. Mate Mito Fusion Fission P Q Exo P Q P Q Endo Fusion Fission 11/23/2018

Membrane Transport Algorithms LDL-Cholesterol Degradation Protein Production and Secretion Viral Replication Taken from MCB p.730 11/23/2018

Process Calculi Today we represent, store, and analyze: Gene sequence data Protein structure data Metabolic network data … In the long run, how can we represent, store, and analyze biological processes? We want to do better than informal “circuit diagrams”, or huge list of chemical reactions. Scalable, precise, dynamic, highly structured, maintainable representations for systems biology. Process Calculi General formal framework for the description and analysis of highly concurrent interacting processes. 11/23/2018

Conclusions Identifying the architecture Modeling the system Emphasis on architecture, not components Modeling the system Information-oriented language-based models Analyzing the model Exploiting techniques unique to computing Perturbing, predicting, engineering “The data are accumulating and the computers are humming, what we are lacking are the words, the grammar and the syntax of a new language…” D. Bray (TIBS 22(9):325-326, 1997) “Although the road ahead is long and winding, it leads to a future where biology and medicine are transformed into precision engineering.” Hiroaki Kitano. 11/23/2018