1 BioUML - extensible workbench for systems biology Laboratory of Bioinformatics, Novosibirsk, Russia. www.itcsoftware.com 2004 -2010 ITC Software All.

Slides:



Advertisements
Similar presentations
EcoTherm Plus WGB-K 20 E 4,5 – 20 kW.
Advertisements

Números.
University Paderborn 07 January 2009 RG Knowledge Based Systems Prof. Dr. Hans Kleine Büning Reinforcement Learning.
1 A B C
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
PDAs Accept Context-Free Languages
ALAK ROY. Assistant Professor Dept. of CSE NIT Agartala
EuroCondens SGB E.
Reinforcement Learning
Slide 1Fig 26-CO, p.795. Slide 2Fig 26-1, p.796 Slide 3Fig 26-2, p.797.
Slide 1Fig 25-CO, p.762. Slide 2Fig 25-1, p.765 Slide 3Fig 25-2, p.765.
Chapter 7 System Models.
Sequential Logic Design
Copyright © 2013 Elsevier Inc. All rights reserved.
Addition and Subtraction Equations
David Burdett May 11, 2004 Package Binding for WS CDL.
Create an Application Title 1Y - Youth Chapter 5.
Add Governors Discretionary (1G) Grants Chapter 6.
CALENDAR.
CHAPTER 18 The Ankle and Lower Leg
The 5S numbers game..
突破信息检索壁垒 -SciFinder Scholar 介绍
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
Break Time Remaining 10:00.
The basics for simulations
PP Test Review Sections 6-1 to 6-6
1 IMDS Tutorial Integrated Microarray Database System.
MM4A6c: Apply the law of sines and the law of cosines.
2013 Fox Park Adopt-A-Hydrant Fund Raising & Beautification Campaign Now is your chance to take part in an effort to beautify our neighborhood by painting.
Regression with Panel Data
Operating Systems Operating Systems - Winter 2012 Chapter 2 - Processes Vrije Universiteit Amsterdam.
Lexical Analysis Arial Font Family.
Dynamic Access Control the file server, reimagined Presented by Mark on twitter 1 contents copyright 2013 Mark Minasi.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Copyright © [2002]. Roger L. Costello. All Rights Reserved. 1 XML Schemas Reference Manual Roger L. Costello XML Technologies Course.
Progressive Aerobic Cardiovascular Endurance Run
Biology 2 Plant Kingdom Identification Test Review.
Chapter 1: Expressions, Equations, & Inequalities
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Facebook Pages 101: Your Organization’s Foothold on the Social Web A Volunteer Leader Webinar Sponsored by CACO December 1, 2010 Andrew Gossen, Senior.
1 Termination and shape-shifting heaps Byron Cook Microsoft Research, Cambridge Joint work with Josh Berdine, Dino Distefano, and.
When you see… Find the zeros You think….
Midterm Review Part II Midterm Review Part II 40.
Before Between After.
Slide R - 1 Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Prentice Hall Active Learning Lecture Slides For use with Classroom Response.
12 October, 2014 St Joseph's College ADVANCED HIGHER REVISION 1 ADVANCED HIGHER MATHS REVISION AND FORMULAE UNIT 2.
: 3 00.
5 minutes.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Static Equilibrium; Elasticity and Fracture
Converting a Fraction to %
Clock will move after 1 minute
Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.
Select a time to count down from the clock above
1.step PMIT start + initial project data input Concept Concept.
WARNING This CD is protected by Copyright Laws. FOR HOME USE ONLY. Unauthorised copying, adaptation, rental, lending, distribution, extraction, charging.
9. Two Functions of Two Random Variables
A Data Warehouse Mining Tool Stephen Turner Chris Frala
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Introduction Embedded Universal Tools and Online Features 2.
Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.
BioUML Fedor Kolpakov Institute of Systems Biology (spin-off of DevelopmentOnTheEdge.com) Laboratory of Bioinformatics, Design Technological Institute.
Comprehensive model for formalized description, visualization and simulation of biological systems Fedor A. Kolpakov Biosoft.Ru,
1 BioUML - Biological Universal Modeling Language Biosoft.Ru, Novosibirsk, Russia. Laboratory of Bioinformatics, Digital Design Technologies.
Presentation transcript:

1 BioUML - extensible workbench for systems biology Laboratory of Bioinformatics, Novosibirsk, Russia ITC Software All rights reserved.

2 Schedule 9.30 – 9.40Introduction 9.40 – 10.00Visual modeling, graphical notations – 10.20BioUML main concepts and architecture overview – 11.00Demonstration of BioUML workbench from the customer view point – 11.15Coffee break – 11.30Simulations with BioUML. MATLAB plug-in, Java plug-in – 11.45Usage of BioUML in real life projects – Cyclonet and Biopath (codename) projects – 12.15Current work and further development – 12.30Discussion

3 The challenge "We now have unprecedented ability to collect data about nature but there is now a crisis developing in biology, in that completely unstructured information does not enhance understanding. We need a framework to put all of this knowledge and data into that is going to be the problem in biology. We've reached the stage where we can't talk to each other we've all become highly specialized. We need a framework, a framework where people can come back to us and say, 'Yes, I understand.' Driving toward that framework is really the big challenge. Sydney Brenner Sydney Brenner, 2002 Nobel Prize winner

4

5 Main BioUML concepts and ideas Visual modeling Plug-in based architecture (Eclipse platform runtime from IBM). Meta model – problem domain neutral level of abstraction that describes system as compartmentalized graph. It is also common contract for all parties (software developers, problem domains specialists, annotators, mathematicians). Diagram type concept – formally defines graphical notation and provides its incorporation into BioUML workbench. Module concept - allows developer to incorporate databases on biological pathways into BioUML framework taking into account database peculiarities.

6 Visual Modeling The problem of modeling and simulating of complex systems can be significantly simplified for customers by using computer systems providing visual modeling. These visual depictions offer alternative syntax to completely and formally specify models. A number of visual syntaxes were developed and implemented in computer systems for electrical engineering and computer science. The most known graphical language for computer science is UML – Unified Modeling Language.

7 The OMG specification states: "The Unified Modeling Language (UML) is a graphical language for visualizing, specifying, constructing, and documenting the artifacts of a software-intensive system. The UML offers a standard way to write a system's blueprints, including conceptual things such as business processes and system functions as well as concrete things such as programming language statements, database schemas, and reusable software components."

8 UML diagrams use case diagram class diagram behavior diagrams: –statechart diagram –activity diagram –interaction diagrams: sequence diagram collaboration diagram implementation diagrams: –component diagram –deployment diagram

9 UML use case diagram

10 UML class diagram

11 UML state chart diagram

12 UML activity diagram

13 UML sequence diagram

14 UML collaboration diagram

15 If we will consider UML architecture from developer view point then we will note: 1) UML was really designed for modeling software systems and are hardly suitable for other problem domains. 2) UML has complicated structure that is quite hard for implementation. OMG specification is more the 700 pages. 3)UML was not designed for visual modeling and simulation of dynamics of complex systems. That is why we need new language for modeling biological systems and we called this language BioUML.

16 Graphical notations for biological pathways

17 Some graphical notations for biological pathways Kohn K.W. (1999). Molecular Interaction Map of the Mammalian Cell Cycle Control and DNA Repair Systems. Mol. Biol.Cell. 10, Kitano H. (2003). A graphical notation for biochemical networks. BIOSILICO Vol. 1. No. 5. R. Maimon and S. Browning (2001). Diagrammatic Notation and Computational Grammar for Gene Networks. Proceedings of the International Conference on Systems Biology Cook D.L. et al. (2001). A basis for a visual language for describing, archiving and analyzing functional models of complex biological systems. Genome Biol. 2. RESEARCH Database specific notations: - KEGG/Metabolic pathways; GeneNet system; TRANSPATH …

18 Kohn K.W. (1999). Molecular Interaction Map of the Mammalian Cell Cycle Control and DNA Repair Systems. Mol. Biol.Cell. 10,

19

20 Representation of multimolecular complexes: stimulatory and inhibitory complexes of E2F1, DP1, and pRb. (a) E2F1:DP1 dimer; (b) E2F1:DP1:pRb trimer; (c) E2F1:DP1 bound to promoter element E2 (transcriptional activation shown); (d) E2F1:DP1:pRb bound to E2 (transcriptional inhibition shown). Note that the promoter element can be occupied either by E2F1:DP1 or by E2F1:DP1:pRb (alternative binding represented by interaction lines joined at an acute angle).

21

22 Kitano H. (2003). A graphical notation for biochemical networks. BIOSILICO Vol. 1. No. 5.

23

24

25 R. Maimon and S. Browning. Diagrammatic Notation and Computational Grammar for Gene Networks. Proceedings of the International Conference on Systems Biology

26

27 Formal description and modeling of biological systems require coordinated efforts of different group of researchers: programmers - they should provide computer tools for this task. problem domain experts - they should specify what and how should be described. experimenters and annotators - they should describe corresponding data following to these rules. mathematicians - they should provide methods for models analysis and simulations. BioUML architecture separates these tasks so they can be effectively solved by corresponding group of researchers and provides simple contract how these groups and corresponding software parts should communicate.

28 p53, The gatekeeper of death:

29 Cook D.L. et al. (2001). A basis for a visual language for describing, archiving and analyzing functional models of complex biological systems. Genome Biol. 2. RESEARCH 0012.

30

31

32 KEGG - metabolic pathways

33 KEGG - signaling pathways

34 GeneNet system A chemical formalism was employed as a basis for describing the events occurring in biological pathways 2 types of relationships between entities: reaction - the interaction between the entities that leads to the appearance of new entity regulatory event - the effect of an entity on a certain reaction

35 GeneNet – antiviral responce

36 TRANSPATH database

37 TRANSPATH – p53 pathway

38 BioUML architecture

39 Plug-in based architecture Plug-in - plugin.xml - Java jar files A plug-in is the smallest unit of BioUML workbench function that can be developed and delivered separately into BioUML workbench. A plug-in is described in an XML manifest file, called plugin.xml. The parsed contents of plug-in manifest files are made available programmatically through a plug-in registry API provided by Eclipse runtime. - extension points are well-defined function points in the system where other plug-ins can contribute functionality. - extension is a specific contribution to an extension point. Plug-ins can define their own extension points, so that other plug-ins can integrate tightly with them. Plug-in - plugin.xml - Java jar files Plug-in - plugin.xml - etc. Eclipse platform runtime

40

41

42

43 <plugin id="ru.biosoft.plugins.javascript version="0.7.4"> Plugin.xml example (1)

44 <function name="help" class="ru.biosoft.plugins.javascript.Global" method="help" varargs="true"> <argument name="obj" type="Object" obligatory="true description="%help.arg_1"/> Plugin.xml example (2)

45 help.descr = Returns description for the specified JavaScript function \ or host object.\nThis description should be loaded using \ 'biouml.pligins.javascript.function' or \ \n'biouml.pligins.javascript.hostObject' extension point. help.arg_1 = JavaScript object of function name. help.ex_1c = help() help.ex_1d = will print description of 'help' function itself. help.ex_2c = help(print) help.ex_2d = will print description of 'print' function. help.ex_3c = help(sbw) help.ex_3d = will print description of sbw host object. Plugin.properties example

46 BioUML meta model The core of BioUML workbench is meta model. Unlike UML meta mode BioUML meta model is problem domain neutral and provides an abstract layer for comprehensive formal description of wide range of biological and other complex systems. Content of databases on biological pathways or SBML models are expressed in terms of meta model and then can be used by other workbench plug-ins.

47 A B eq1 eq2 R1 C eq3 eq4 R2 Соответствующая ему математическая модель Пример двух последовательных химических реакций

48 A B -k1[A] R1 C -k2[B]K2[B] R System structure is described as a graph Mathematical model of the system Description of system components in the database ID A CC..... // ID R1 A->B... // ID B CC..... // ID R2 B->C... // ID C CC..... // A B -k1[A] R1 C -k2[B] K2[B] R Example of formalized description of System from two chemical reactions

49 Main BioUML meta-model classes to describe system structure as compartmentalized graph

50

51

52 Diagram type concept Diagram type defines: what system components can be shown in the diagram; diagram view builder - it is used to generate view for each diagram element taking into account problem domain peculiarities; semantic controller - provides semantic integrity of the diagram during its editing; filters – hide or highlight diagram elements according to some selection criteria.

53

54 Module Concept The module concept allows to developer define new diagram types and incorporate other databases on biological pathways into BioUML framework. The module defines mapping of database content into diagram elements and diagram types that can be used with the database. Module also provides query engine that can be used by BioUML workbench to find interactiong components of the system.

55 Modules standard BioUML module for biological pathways; module for models in SBML format; module for models in CellML format (only biochemical models); GeneNet database module; module for KEGG/Pathways datbase (draft); module for TRANSPATH database (draft); GeneOntology (under development).

56 Standard BioUML module for biological pathways The module defines most common biological data types (gene, protein, RNA, substance, reaction, etc.), they mapping into simple text database and three diagram types for description of biological pathways on several semantic levels: 1. Semantic network (ontology) - this diagram type is used to describe semantic relationships between system components, system states, and related problem domain concepts. 2. Pathway diagram type is used for formalized description of biological pathway structure. This diagram type uses GeneNet graphical notation. 3. Pathway simulation diagram type is extension of pathway structure diagram, where variables are associated with graph nodes and differential equations with graph edges. This allows to BioUML workbench automatically generate mathematical model of the system and simulate its dynamics.

57

58

59

60 Suggested approach can be applied for modeling biological systems using: –Systems of ordinary differential equations –Systems of algebra-differential equations –State and transition diagrams –Hybrid models –Boolean and logical networks –Petri nets –Markov chains –Stochastic models –… Limitations –Spatial models –PDE –…

61

62 Formal description and modeling of biological systems require coordinated efforts of different group of researchers: programmers - they should provide computer tools for this task. problem domain experts - they should specify what and how should be described. experimenters and annotators - they should describe corresponding data following to these rules. mathematicians - they should provide methods for models analysis and simulations. BioUML architecture separates these tasks so they can be effectively solved by corresponding group of researchers and provides simple contract how these groups and corresponding software parts should communicate.

63 BioUML live demonstration

64 Biological databases Data search and retrieving Visual modeling Automated code generation for model simulation of model behavior Formal description of structure of biological system MATLAB codeJava code Simulating using MATLAB. JMatLink allows to BioUML workbench to start MATLAB and retrieve simulations results Java simulation plug-in. Contains ODE solvers ported from odeToJava and methods for hybrid models support. … code

65 Cyclonet database

66 Cyclonet database Using BioUML workbench we are developing Cyclonet database – a database on cell cycle regulation in eukaryotes. Using BeanExplorer Enterprise Edition database content is available via the Internet. The database contains information about cell cycle specific genes, proteins, protein complexes and their interactions, diagrams of cell cycle regulation for vertebrates, models of cell cycle and results of their analyses, microarray data, literature references and other related resources.

67 Cyclonet database content Data: for semantic networks - concepts - relations for pathway diagrams - cells - compartments - genes - proteins - RNAs - substances - reactions -for simulation - constants - equations Dictionaries: - databases - relation types - species - units - constants Publications Resources: - formats - software - sites, etc. DiagramsMicroarray data Analysis results EnSmart GeneOntology TRANSPATH TRANSFAC PASS

68 BioUML workbench Servlet container: Tomcat Cyclonet system architecture Cyclonet MySQL database Web browser JDBC BeanExplorer Enterprise Edition Client side: Server side: Cyclonet module GeneOntology MySQL database

69

70

71 Biopath (codename) database Purpose of this project is create formal description of pathogeneses of chronic respiratory diseases and optimization of their treatment. The work is performed jointly with the Institute of Medical Problems of North (Krasnoyarsk) and the Institute of Pulmonology (Moscow).

72 Graph search engine –Graph search engine completion and testing –adaptation of new graph layout library Complete support of SBML level 2 –Time delay –Algebraic rules –Units –SBML semantic tests Formula editor State and transition diagrams. Current work

73 Force directed graph layout (takes nodes size into account) (lay-out time ~50ms, 300 iterations)

74

75

76

77 Library of predefined kinetic laws Stochastic modeling BioPAX support New diagram type for gene regulatory regions TRANSPATH support Microarray data support Further steps

78 Availability BioUML workbench (including source code) is freely available at Cyclonet database } see site Biopath database } for details

79 Acknowledgments Part of this work was partially supported by following grants: Volkswagen-Stiftung (I/75941), INTAS Nr and RFBR Nr а. Author is grateful to for useful comments, discussions and technical support Alexander Kel Sergey Zhatchenko Software developers Annotators Mikhail Puzanov Igor Tyazhev Ruslan Sharipov Vasiliy Hudyakov Vlad Zhvaleev Elena Cheremushkina Alexandr Koshukov Oleg Onegov Artem Shaidukov

80 We hope you enjoyed our presentation ! ITC Software Tel