Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc. Christof Francke


Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc
Christof Francke
In silico reconstruction of the metabolic pathways of Lactobacillus plantarum: comparing predictions of nutrient requirements with growth experiments. Teusink, van Enckevort, Francke, Wiersma, Wegkamp, Smid and Siezen (2005) Appl. Environ. Microbiol. 71:

What do we mean by reconstruction?
the collection and visualization of all potential physiologically relevant cellular processes
Reconstructing the metabolic network of a bacterium from its genome. Francke, Siezen and Teusink, Trends in Microbiol :

Why do we want to do it?
It serves to sort the individual proteins, and thus the potential molecular functions, into a context (such as pathways or protein complexes) and as such:
- allows for improved functional annotation
- provides a platform to visualize and analyze 'omics' data
- yields a network whose topology can be studied
- can be converted to a model (metabolic engineering)

How do we annotate?
The attribute 'function' is ambiguous.
context independent (molecular function or properties)
- catalyze certain reactions
- interact with certain proteins
- bind to a specific DNA sequence
context dependent (role)
- act in a certain pathway
- be a member of a certain protein complex(es)
- act as a transcription factor

We are interested in lactic acid bacteria
(2003) Proc Natl Acad Sci USA 100, 1990 ####
annotation database PlantDB

recovering gene - protein - reaction - pathway relations: the construction of LacplantCyc
Pathway Tools (SRI) uses the gene annotation (EC numbers) and a reference database (MetaCyc) to arrive automatically at an encyclopedia of:
- genes connected to proteins
- proteins connected to reactions
- reactions connected in pathways
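The gene - protein - reaction - pathway chain on this slide can be illustrated with a small Python sketch. This is not how Pathway Tools is implemented. The gene names, reaction identifiers and pathway name are hypothetical toy data standing in for the genome annotation and for MetaCyc, and the protein layer is collapsed into the gene for brevity.

```python
from collections import defaultdict

# Hypothetical toy annotation: gene -> EC numbers assigned during annotation.
gene_ec = {
    "geneA": ["1.1.1.27"],   # L-lactate dehydrogenase
    "geneB": ["2.7.1.40"],   # pyruvate kinase
    "geneC": [],             # no EC number assigned
}

# Hypothetical stand-in for the reference database: EC number -> reaction id,
# and pathway -> ordered list of reaction ids.
ec_reaction = {"1.1.1.27": "RXN-LDH", "2.7.1.40": "RXN-PYK", "4.1.2.13": "RXN-FBA"}
pathway_reactions = {"toy-pathway": ["RXN-FBA", "RXN-PYK", "RXN-LDH"]}


def link_genes_to_pathways(gene_ec, ec_reaction, pathway_reactions):
    """Return pathway -> {reaction -> sorted list of genes annotated with its EC number}."""
    reaction_genes = defaultdict(set)
    for gene, ec_numbers in gene_ec.items():
        for ec in ec_numbers:
            reaction = ec_reaction.get(ec)
            if reaction is not None:
                reaction_genes[reaction].add(gene)
    return {
        pathway: {r: sorted(reaction_genes.get(r, ())) for r in reactions}
        for pathway, reactions in pathway_reactions.items()
    }


result = link_genes_to_pathways(gene_ec, ec_reaction, pathway_reactions)
print(result)
```

A reaction that ends up with an empty gene list is a candidate gap in the pathway. A gene without an EC number (geneC here) never enters the network, which is one reason manual curation is needed.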

initial automatic reconstruction: some remarks
- presence of pathways
- gaps in pathways
- same reactions and pathways
Are the assignments correct, and which functions have not been retrieved?
- are these numbers correct?
- manual changes are not correctly incorporated

the actual labour: curation

What we have done: consult reference databases

What we have done: add information that is not recovered from MetaCyc.
Transporters are not recovered by Pathway Tools.

What we have done: add information that is not recovered from MetaCyc.
- include newly discovered and/or organism-specific reactions and pathways
- add information on complex formation
[Figure: phosphoryl (~P) transfer scheme; labels: PEP, pyruvate, EI, HPr, 170, dak1, dak2, dihydroxyacetone]

What we have done: evaluation of the attributed molecular function for each individual case
Do we trust the gene - protein - reaction association when we consider the similarity between the sequence of the gene product and the sequence of a protein with the specified molecular function (evidence based on experiment)?
- determine orthology (use phylogeny and gene context to determine the evolutionary relationship)
- check experimental evidence

The evaluation of the attributed molecular function: improved annotation of homologous proteins
the use of phylogeny and orthologous relations
L. plantarum has four homologs annotated as maltose phosphorylase
[Figure labels: experimental: trehalose phosphorylase; map4; experimental: kojibiose phosphorylase; map1*]
* there are slight but significant differences in the alignment of cluster 1, which might point to slightly altered specificity

The evaluation of the attributed molecular function: improved annotation of homologous proteins
the use of gene context and metabolic context
[Figure labels: map3; experimental: maltose phosphorylase; map2; experimental: maltose phosphorylase; active; passive]

the evaluation of gaps in pathways: are genes really missing?
an example: tetrahydrofolate synthesis by Lactobacillus plantarum

the evaluation of gaps in pathways: track missing genes

the evaluation of gaps in pathways: the use of knowledge on physiology
Validation: no tetrahydrofolate detectable without addition of p-aminobenzoate to the medium
[Figure labels: absent; predicted growth dependence]
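The gap evaluation on these slides starts from a list of pathway reactions to which no gene has been assigned. A minimal sketch of that first step is given below. The pathway names, reaction identifiers and gene names are hypothetical toy data, not entries from LacplantCyc.

```python
def find_pathway_gaps(pathway_reactions, reaction_genes):
    """Return pathway -> list of reactions that have no associated gene."""
    gaps = {}
    for pathway, reactions in pathway_reactions.items():
        missing = [r for r in reactions if not reaction_genes.get(r)]
        if missing:
            gaps[pathway] = missing
    return gaps


# Hypothetical toy data: two pathways, one of which has a reaction without a gene.
pathway_reactions = {
    "toy-cofactor-synthesis": ["RXN-1", "RXN-2", "RXN-3"],
    "toy-sugar-breakdown": ["RXN-4", "RXN-5"],
}
reaction_genes = {
    "RXN-1": ["geneA"],
    "RXN-3": ["geneC"],
    "RXN-4": ["geneD"],
    "RXN-5": ["geneE", "geneF"],
}

gaps = find_pathway_gaps(pathway_reactions, reaction_genes)
print(gaps)
```

Each reported gap then needs the manual follow-up described on the slides: either the gene is tracked down, or physiological knowledge (such as a growth dependence on an added nutrient) confirms that the step is truly absent.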

the evaluation of reactions and pathways: the use of knowledge on physiology
TCA cycle: Lactobacilli do not have a TCA cycle and therefore do not produce succinyl-CoA
==> In all reactions in which succinyl-CoA is used as a substrate, it has to be replaced by acetyl-CoA
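A curation rule of this kind can be applied to a reaction table in bulk. The sketch below assumes a hypothetical in-memory representation (reaction id mapped to substrate and product lists) and made-up compound names. It only touches substrates, as the slide states, and leaves the original table unchanged.

```python
def replace_substrate(reactions, old, new):
    """Return a copy of the reaction table with `old` replaced by `new` among substrates."""
    updated = {}
    for reaction_id, reaction in reactions.items():
        substrates = [new if s == old else s for s in reaction["substrates"]]
        updated[reaction_id] = {
            "substrates": substrates,
            "products": list(reaction["products"]),
        }
    return updated


# Hypothetical toy reactions; only RXN-X uses succinyl-CoA as a substrate.
reactions = {
    "RXN-X": {"substrates": ["succinyl-CoA", "compound-A"], "products": ["compound-B"]},
    "RXN-Y": {"substrates": ["compound-C"], "products": ["compound-D"]},
}

curated = replace_substrate(reactions, "succinyl-CoA", "acetyl-CoA")
print(curated["RXN-X"]["substrates"])
```

Returning a new table instead of editing in place keeps the automatic reconstruction available for comparison with the curated version.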

cleaning up the database: the removal of redundant pathways

a comparison of the automatic and curated LacplantCyc

using LacplantCyc
- inconsistencies between observed nutrient requirements and pathway predictions may lead to new insights about regulation
further research needed
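Finding such inconsistencies amounts to comparing two sets: the nutrients the pathway predictions say are required and the nutrients found to be required in growth experiments. The sketch below uses hypothetical placeholder nutrient names, not results from the study.

```python
def compare_requirements(predicted, observed):
    """Split nutrient requirements into consistent and inconsistent groups."""
    predicted, observed = set(predicted), set(observed)
    return {
        "consistent": sorted(predicted & observed),
        "predicted_only": sorted(predicted - observed),  # predicted but not observed
        "observed_only": sorted(observed - predicted),   # observed but not predicted
    }


# Hypothetical placeholder data.
predicted_required = ["nutrient-1", "nutrient-2", "nutrient-3"]
observed_required = ["nutrient-2", "nutrient-3", "nutrient-4"]

report = compare_requirements(predicted_required, observed_required)
print(report)
```

The two inconsistent groups point in different directions: a requirement that is only predicted suggests a missed gene or alternative route, while one that is only observed suggests that a predicted pathway is not functional under the tested conditions, for example because of regulation.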

using LacplantCyc
- to visualize -omics data
- to compare the metabolic network between different species
we need improved visualization

remarks
About the use of Cyc to visualize 'omics' data
- it preferably requires
  * an interactive overview with more information
  * the possibility of having multiple selectable overviews
  * colouring of the genes instead of the reactions

using LacplantCyc
- to help the reconstruction of the metabolic network of a related species through orthologous relationships between proteins
- to serve as the starting point for making a metabolic model (constraint-based modeling)
Accelerating the reconstruction of genome-scale metabolic networks. Notebaart, van Enckevort, Francke, Siezen and Teusink, in preparation
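The first use, carrying a curated reconstruction over to a related species, can be sketched as a lookup through ortholog pairs. This is an illustration under simple assumptions, not the method of the cited manuscript. The gene names, reaction identifiers and the ortholog table are hypothetical.

```python
def transfer_associations(reference_gene_reactions, orthologs):
    """Copy gene -> reaction associations to the orthologous genes of a target species."""
    transferred = {}
    for ref_gene, reactions in reference_gene_reactions.items():
        for target_gene in orthologs.get(ref_gene, []):
            transferred.setdefault(target_gene, set()).update(reactions)
    return {gene: sorted(rxns) for gene, rxns in transferred.items()}


# Hypothetical curated associations in the reference organism.
reference_gene_reactions = {
    "ref_gene1": ["RXN-1"],
    "ref_gene2": ["RXN-2", "RXN-3"],
    "ref_gene3": ["RXN-4"],
}
# Hypothetical ortholog table: reference gene -> genes in the target species.
orthologs = {"ref_gene1": ["tgt_geneA"], "ref_gene2": ["tgt_geneB"]}

draft = transfer_associations(reference_gene_reactions, orthologs)
print(draft)
```

Reference genes without an ortholog (ref_gene3 here) contribute nothing to the draft, so their reactions show up as candidate gaps in the target species.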

remarks
About the use of:
Cyc as a source of gene-reaction-pathway association information to be used in other applications
- requires easy export of these associations
Cyc as a starting point for modeling
- requires balanced reactions, detailed and correct molecular information on compounds, and balance checks
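A balance check of the kind asked for here can be sketched as an element count over a reaction's stoichiometry. The data layout (compound mapped to a signed coefficient, negative for consumed compounds) and the function name are assumptions made for this illustration. The example reaction, glucose to two lactic acid, is used only because its formulas are simple.

```python
from collections import Counter

# Elemental composition per compound (neutral forms).
formulas = {
    "glucose": {"C": 6, "H": 12, "O": 6},
    "lactic_acid": {"C": 3, "H": 6, "O": 3},
}


def element_imbalance(stoichiometry, formulas):
    """Return element -> net count for a reaction; an empty dict means balanced.

    `stoichiometry` maps compound -> coefficient, negative for substrates
    and positive for products.
    """
    totals = Counter()
    for compound, coefficient in stoichiometry.items():
        for element, count in formulas[compound].items():
            totals[element] += coefficient * count
    return {element: net for element, net in totals.items() if net != 0}


balanced = element_imbalance({"glucose": -1, "lactic_acid": 2}, formulas)
unbalanced = element_imbalance({"glucose": -1, "lactic_acid": 1}, formulas)
print(balanced, unbalanced)
```

A check like this only works when every compound carries a complete and correct formula, which is the "detailed and correct molecular information on compounds" the slide asks for. Charge could be checked in the same way by treating it as one more element.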

final remarks
Pathway Tools is very nice to quickly connect reaction and pathway information to a gene which has been annotated with an EC-code. However:
- Generation of a reliable reconstruction requires a lot of work, and the implementation of changes is not always straightforward (problems with certain frames) and requires a lot of steps.
- Better control over the editor of individual pathways and the pathway overview would be an important asset.
- Application of automatic procedures after curation unfortunately destroys the changes that were carefully implemented.
- Multiple editors with straightforward import and export functions would enhance the usefulness.

acknowledgements
Frank van Enckevort, Christof Francke, Richard Notebaart, Roland Siezen, Eddy Smid, Bas Teusink, Arno Wegkamp, Anne Wiersma
LacplantCyc can be found at