10-Jun-15Anatoly Sorokin Existing Standards in Systems Biology Anatoly Sorokin Computation Systems Biology Group University of Edinburgh.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

ChEBI and SABIO-RK: Association of chemical compound information and reaction kinetics data Ulrike Wittig Scientific Databases and Visualization Group.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Look for other standards: - BioPAX (composition) - CellML (aggregation) - Other SBML L3 extensions (modules - rule-based modeling)
Database Systems: Design, Implementation, and Management Tenth Edition
SBML2Murphi: a Translator from a Biology Markup Language to Murphy Andrea Romei Ciclo di Seminari su Model Checking Dipartimento di Informatica Università.
CellDesigner Tutorial Laurence Calzone, Andrei Zinovyev UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
BiNoM Tutorial Andrei Zinovyev, Laurence Calzone UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
OASIS Reference Model for Service Oriented Architecture 1.0
Ontology Notes are from:
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
TC3 Meeting in Montreal (Montreal/Secretariat)6 page 1 of 10 Structure and purpose of IEC ISO - IEC Specifications for Document Management.
Petri net modeling of biological networks Claudine Chaouiya.
Computational Biology, Part 19 Cell Simulation: Virtual Cell Robert F. Murphy, Shann-Ching Chen, Justin Newberg Copyright  All rights reserved.
XML Documentation of Biopathways and Their Simulations in Genomic Object Net Speaker : Hungwei chen.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
UML CASE Tool. ABSTRACT Domain analysis enables identifying families of applications and capturing their terminology in order to assist and guide system.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
The bioinformatics of biological processes The challenge of temporal data Per J. Kraulis CMCM, Tartu University.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
4 th NeuroML Development Workshop & BrainScaleS CodeJam, Edinburgh, March NeuroML: Where are we at? Padraig Gleeson Department.
2 1 Chapter 2 Data Model Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Lecture 3: Pathway Generation Tool I: CellDesigner: A modeling tool of biochemical networks Y.Z. Chen Department of Pharmacy National University of Singapore.
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
BioUML Fedor Kolpakov Institute of Systems Biology (spin-off of DevelopmentOnTheEdge.com) Laboratory of Bioinformatics, Design Technological Institute.
Introduction to MDA (Model Driven Architecture) CYT.
EBI is an Outstation of the European Molecular Biology Laboratory. BioModels Database, a public model- sharing resource In silico systems biology: network.
Converting Macromolecular Regulatory Models from Deterministic to Stochastic Formulation Pengyuan Wang, Ranjit Randhawa, Clifford A. Shaffer, Yang Cao,
How Ontologies Add Value BioPAX: Biological Pathway Data Exchange Ontology Joanne Luciano BioPAX Workgroup (biopax.org)biopax.org BioPathways Consortium.
Needs to be described: Objects – Have multiple states – Consist of other objects, that may have multiple states – May be connected with each other via.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
What is MOF? The Meta Object Facility (MOF) specification provides a set of CORBA interfaces that can be used to define and manipulate a set of interoperable.
BioPAX - Biological Pathway Data Exchange Format Tutorial Nadia Anwar BioPAX Workshop November 2009.
Semantic Web for Life Sciences Workshop Session VII: Semantic Aggregation, Integration, and Inference Moderator: Joanne Luciano October, Cambridge,
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Chapter 7 System models.
System models l Abstract descriptions of systems whose requirements are being analysed.
Modified by Juan M. Gomez Software Engineering, 6th edition. Chapter 7 Slide 1 Chapter 7 System Models.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Scientific Applications of XML Arvind Hulgeri, Shantanu Godbole
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Modelling epithelial transport David P. Nickerson¹, Kirk L. Hamilton², Peter J. Hunter¹ ¹Auckland Bioengineering Institute, Auckland, New Zealand ²Department.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Sharing Models. How Can I Exchange Models? SBML (Systems Biology Markup Language): de facto standard for representing cellular networks. A large number.
RoSBM Registry of Standard Biological Models Barry Canton (MIT) Vincent Rouilly (Imperial College) Registry Workshop November 2007,Boston.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
ESDI Workshop on Conceptual Schema Languages and Tools
Mining the Biomedical Research Literature Ken Baclawski.
Systems Biology Markup Language Ranjit Randhawa Department of Computer Science Virginia Tech.
Composition in Modeling Macromolecular Regulatory Networks Ranjit Randhawa September 9th 2007.
A database of biological pathways and processes (borrowed from a presentation created by Steve Jupe)
BioPAX The Birth of A Data Exchange Language for Biological Pathways Joanne Luciano BioPAX Group 7 th International Annual Bio-Ontologies.
IGEM 2008 Tutorial Modeling. What? Model A model in science is a physical, mathematical, or logical representation of a system of entities, phenomena,
 To explain why the context of a system should be modelled as part of the RE process  To describe behavioural modelling, data modelling and object modelling.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
IGEM 2009 Tutorial Modelling. What? Model A model in science is a symplified physical, mathematical, or logical representation of a system of entities,
Systems Biology Markup Language (SBML)
Model Curation Edmund J. Crampin Auckland Bioengineering Institute
Data Exchange & Public Reference Data
The Pathway Tools Schema
Metadata Framework as the basis for Metadata-driven Architecture
Presentation transcript:

10-Jun-15Anatoly Sorokin Existing Standards in Systems Biology Anatoly Sorokin Computation Systems Biology Group University of Edinburgh

Standard is decade of standards in biology –31 MIBI standard –56 OBO ontologies –About 80 exchange formats Scope of interest Language Controlled vocabulary

Standards and Languages CML – description of chemical structure MathML – representation of mathematical formulas PSI – standard description of protein interaction data AnatML – language to describe interaction at organ level GeneOntology – standard and ontology to describe gene function and regulation

Standards for Computational System Biology BioPAX – language for database of biological networks exchange SBML – language of biochemical model exchange CellML – language to describe mathematical models SBGN – visual language for biological model description

MI standards Reporting guidelines specify the minimum amount of meta data (information) and data required to meet a specific aimmeta data Aim is to provide enough meta data and data to enable the unambiguous reproduction and interpretation of an experiment. Normally informal human readable specifications that inform the development of formal data models (e.g. XML or UML), data exchange formatsXMLUML 10-Jun-15Anatoly Sorokin

Exchange format Strict structure to exchange data of model Mainly XML Well defined meta-model, often supported by software API 10-Jun-15Anatoly Sorokin

Ontologies “ontology deals with questions concerning what entities exist or can be said to exist, and how such entities can be grouped, related within a hierarchy, and subdivided according to similarities and differences” Wikipediaentitieshierarchy Often used as controlled vocabulary and description support framework GeneOntology 10-Jun-15Anatoly Sorokin

BioPAX “Biological PAthway eXchange - A data exchange ontology and format for biological pathway integration, aggregation and inference”

BioPAX Goals BioPAX = Biological PAthway eXchange Data exchange format for pathway data Include support for these pathway types: –Metabolic pathways –Signaling pathways –Protein-protein, molecular interactions –Gene regulatory pathways –Genetic interactions Accommodate representations used in existing databases such as BioCyc, BIND, WIT, aMAZE, KEGG, Reactome, etc. PathwayCommons – collection of pathways in BioPAX –

BioPAX BioPAX ontology and format in OWL (XML) Ontology built using GKB Editor and Protégé Semantic mapping still an issue Level 1 represents metabolic pathway data Level 2 adds support for molecular interactions, post-translational modifications, experimental description from PSI-MI model (Backwards compatible) Level 3 adds support for generics, protein states, rearrange reaction representation

BioPAX Ontology: Overview

BioPAX Ontology: Top Level Pathway –A set of interactions –E.g. Glycolysis, MAPK, Apoptosis Interaction –A set of entities and some relationship between them –E.g. Reaction, Molecular Association, Catalysis Physical Entity –A building block of simple interactions –E.g. Small molecule, Protein, DNA, RNA Entity Pathway Interaction Physical Entity Subclass (is a) Contains (has a)

BioPAX Ontology: Interactions Interaction Control Conversion CatalysisBiochemicalReaction ComplexAssembly ModulationTransport TransportWithBiochemicalReaction Physical Interaction

BioPAX Ontology: Physical Entities PhysicalEntity ComplexRNA ProteinSmall Molecule DNA

BioPAX and other standards BioPAX PSI-MI 2 SBML, CellML Genetic Interactions Molecular Interactions Pro:Pro All:All Interaction Networks Molecular Non-molecular Pro:Pro TF:Gene Genetic Regulatory Pathways Low Detail High Detail Database Exchange Formats Simulation Model Exchange Formats Rate Formulas Metabolic Pathways Low Detail High Detail Biochemical Reactions Small Molecules Low Detail High Detail

Simulation-related standards 10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Exchange format Ontology implements Makes sense of

SBML “The Systems Biology Markup Language (SBML) is a computer-readable format for representing models of biochemical reaction networks. SBML is applicable to metabolic networks, cell-signaling pathways, regulatory networks, and many others. ”

SBML –Reaction container for rate law –Species reactants, products, or modifiers of reaction –Compartment container for species –Parameter, Rule, Event

Characteristics of SBML Many top-level types, little nesting –Units, Compartment, Species, Parameter, Reaction, Rule, Function, Event Non-modular structure –Next SBML ‘Level’ (3) will introduce modularity Emphasis on reactions Some math implicit –Explicit rate equations; implicit integration –Implicit concentration conversion between compartments Compartments are physical containers for species –Spatial dimensions (volume, surface)

Structure of SBML

Note field of SBase intended to store information for human to read Annotation field of SBase provide a container for software-generated annotations that are not intended to be seen by humans The id field is usually required for most structures and is used to identify a component within the model definition. The name field is optional and provide a human- readable label for the component.

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

MIRIAM Model description require extra information –Biological Description of elements of model –Mathematical Definition of math concepts –Referential Author name Paper reference etc Jun-15Anatoly Sorokin

Reference correspondence The model must be encoded in a public, standardized, machine- readable format (SBML, CellML, GENESIS...) The model must comply with the standard in which it is encoded! The model must be clearly related to a single reference description. If a model is composed from different parts, there should still be a description of the derived/combined model. The encoded model structure must reflect the biological processes listed in the reference description. The model must be instantiated in a simulation: All quantitative attributes have to be defined, including initial conditions. When instantiated, the model must be able to reproduce all results given in the reference description within an epsilon (algorithms, round-up errors) 10-Jun-15Anatoly Sorokin

Attribution annotation The model has to be named. A citation of the reference description must be joined (completecitation, unique identifier, unambigous URL). The citation should permit to identify the authors of the model. The name and contact of model creators must be joined. The date and time of creation and last modification should be specified. An history is useful but not required. The model should be linked to a precise statement about the terms of distribution. MIRIAM does not require “freedom of use” or “no cost”. 10-Jun-15Anatoly Sorokin

External resource annotation The annotation must permit to unambiguously relate a piece of knowledge to a model constituent. The referenced information should be described using a triplet {data-type, identifier, qualifier} –The data-type should be written as a Unique Resource Identifier (URI) –The identifier is analysed within the framework of the data-type. –Data-type and Identifier can be combined in a single URI urn:lsid:myResource.org:myIdentifier –Qualifiers (optional) should refine the link between the model constitutent and the piece of knowledge: “has a”, “is version of”, “is homolog to” etc. 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

SBO Part of OBO Foundry Assign meanings to mathematical elements of SBML Allows automatic validation of semantic consistency of math part of model 10-Jun-15Anatoly Sorokin

SBO Types and roles of reaction participants, including terms like “substrate”, “catalyst” etc., but also “macromolecule”, or “channel”. Parameter used in quantitative models. This vocabulary includes terms like “Michaelis constant”, “forward unimolecular rate constant”etc. A term may contain a precise mathematical expression stored as a MathML lambda function. The variables refer to other parameters. Mathematical expressions. Examples of terms are “mass action kinetics”, “Henri-Michaelis-Menten equation” etc. A term may contain a precise mathematical expression stored as a MathML lambda function. The variables refer to the other vocabularies. Modelling framework to precise how to interpret the rate-law. E.g. “continuous modelling”, “discrete modelling” etc. Event type, such as “catalysis” or “addition of a chemical group”. 10-Jun-15Anatoly Sorokin

SBO 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

MIASE Minimum Information About a Simulation Experiment –What base model to use & which modifications to apply –What simulation task to run on those models (algorithms, see KiSAO; simulation parameters) –How to post-process the numerical results and to present them Subset of MISE bould be encoded in SED-ML 10-Jun-15Anatoly Sorokin

SED-ML Encode MIASE requirements 10-Jun-15Anatoly Sorokin

Description of models 10-Jun-15Anatoly Sorokin

Description of models 10-Jun-15Anatoly Sorokin

Simulations 10-Jun-15Anatoly Sorokin

Simulations 10-Jun-15Anatoly Sorokin

Simulation task 10-Jun-15Anatoly Sorokin

Data generation 10-Jun-15Anatoly Sorokin

Data generation 10-Jun-15Anatoly Sorokin

Production of results 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

KiSAO Kinetic Simulation Algorithm Ontology – Classification of simulation algorithms & methods –Definition, literature references –Relations between different simulation algorithms & methods srv/kisao/index.html 10-Jun-15Anatoly Sorokin

KiSAO 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

SBRML Systems Biology Results Markup Language A new markup language for specifying the results from operations on SBML models index.php?page=SBRML 10-Jun-15Anatoly Sorokin

SBRML 10-Jun-15Anatoly Sorokin

SBRML 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin

Dimension example 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin

Dimension example 10-Jun-15Anatoly Sorokin

10-Jun-15Anatoly Sorokin Model Simulation Result ? SED-MLSBRML Minimal Requirements Data model Ontology implements Makes sense of

TEDDY The TErminology for the Description of DYnamics (TEDDY) project aims to provide an ontology for dynamical behaviours, observable dynamical phenomena, and control elements of bio- models and biological systems in Systems Biology and Synthetic Biology Jun-15Anatoly Sorokin

TEDDY top-level structure Temporal Behaviour (concrete behaviours of a model, more or less the same as trajectories): –Oscillation, Steady State, Fixed Point, Cycle,... Behaviour Characteristic (properties to characterise concrete behaviours): –Period, Amplitude,... Behaviour Diversification (system properties describing the ability of systems to exhibit different behaviours): –Bifurcation, Bi-Stability Functional Motif (structural features of a system necessary for specific function): – Negative Feedback, FFL, Jun-15Anatoly Sorokin

TEDDY 10-Jun-15Anatoly Sorokin

Questions 10-Jun-15Anatoly Sorokin