Using Pathway-tools for phenotype- directed curation Jeremy Zucker Broad Institute of MIT and Harvard Boston University.

Slides:



Advertisements
Similar presentations
SRI International Bioinformatics 1 A BRG Biofuels Metabolic Engineering Project Bioinformatics Research Group SRI International
Advertisements

Annotation of Gene Function …and how thats useful to you.
Unbalanced Reactions by Markus Krummenacker Q
SRI International Bioinformatics Comparative Analysis Q
Decision trees for hierarchical multilabel classification A case study in functional genomics.
The Eli and Edythe L. Broad Institute A Collaboration of Massachusetts Institute of Technology, Harvard University and affiliated Hospitals, and Whitehead.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Microarray Data Analysis Day 2
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Metabolic functions of duplicate genes in Saccharomyces cerevisiae Presented by Tony Kuepfer et al
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Computational tools for whole-cell simulation Cara Haney (Plant Science) E-CELL: software environment for whole-cell simulation Tomita et al Bioinformatics.
Prediction of Therapeutic microRNA based on the Human Metabolic Network Ming Wu, Christina Chan Bioinformatics Advance Access Published January 7, 2014.
SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.
The Eli and Edythe L. Broad Institute A Collaboration of Massachusetts Institute of Technology, Harvard University and affiliated Hospitals, and Whitehead.
Generalized Protein Parsimony and Spectral Counting for Functional Enrichment Analysis Nathan Edwards Department of Biochemistry and Molecular & Cellular.
Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International
Interoperation of Molecular Biology Databases Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International Menlo Park, CA
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Pathways Gene A Gene B Enzyme A Enzyme B Substrate  intermediate  product Most often the final product of the biochemical pathway is something.
Experimental and computational assessment of conditionally essential genes in E. coli Chao WANG, Oct
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
GTL Facilities Characterization and Imaging of Molecular Machines Lee Makowski.
Update on The Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org MetaCyc.org.
From Databases to Dynamics Dr. Raquell M Holmes Center for Computational Science Boston University.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
A highly abbreviated introduction to proteomics
1 SRI International Bioinformatics BioCyc Tutorial Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org,
ComPath Comparative Metabolic Pathway Analyzer Kwangmin Choi and Sun Kim School of Informatics Indiana University.
1 SRI International Bioinformatics EcoCyc, MetaCyc, and the Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International.
1 SRI International Bioinformatics Large-Scale Metabolic Network Alignment: MetaCyc and KEGG Tomer Altman Bioinformatics Research Group SRI International.
Data Content of the BioCyc Databases. BioCyc Tier 1 Databases.
Lecture #23 Varying Parameters. Outline Varying a single parameter – Robustness analysis – Old core E. coli model – New core E. coli model – Literature.
Functional Genomic Hypothesis Generation and Experimentation by a Robot Scientist King et al, Nature : Presented by Monica C. Sleumer February.
The BioCyc Collection of Pathway/Genome Databases Alexander Shearer Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
From genomes to Pathway tools and from Pathway Tools to Metabolic Models Jeremy Zucker Broad Institute of MIT and Harvard.
Organizing information in the post-genomic era The rise of bioinformatics.
MetaCyc and AraCyc: Plant Metabolic Databases Hartmut Foerster Carnegie Institution.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
An overview of Bioinformatics. Cell and Central Dogma.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Jobs, Careers, Internships, Senior Projects and Research Computer Application Development K-12 education Industrial Training Bioinformatics Validation.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Metabolic pathway alteration, regulation and control (3) Xi Wang 01/29/2013 Spring 2013 BsysE 595 Biosystems Engineering for Fuels and Chemicals.
Flexibility in energy metabolism supports hypoxia tolerance in Drosophila flight muscle: metabolomic and computational systems analysis Jacob Feala 1,2.
Building and Refining AraCyc: Data Content, Sources, and Methodologies Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
The Pathway Tools FBA Module
Building Metabolic Models
A Community Effort to Model the Human Microbiome
Maintaining the EcoCyc
Cross-Organism RouteSearch
Overview of Microbial Pathway and Genome Databases
Overview of the Pathway Tools FBA Module
Relationship between Genotype and Phenotype
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Presentation transcript:

Using Pathway-tools for phenotype- directed curation Jeremy Zucker Broad Institute of MIT and Harvard Boston University

GENOME SEQUENCE TO METABOLIC RECONSTRUCTION

Genome sequence to metabolic model PathwaysLiterature Nutrient media (Vogels) NeurosporaCyc ElementsMetadata Complexes Reactions Transporters Biomass composition

EFICAz2 predicts enzymes … Decision tree Databases HMMs FDR SVM 9934 protein sequences 1993 enzymes 1770 reactions BMC Bioinformatics 2009, 10:107

Protein Complex editor 182 reactions with isozymes or complexes 56 complexes experimentally validated through literature search 2-oxoisovalerate alpha subunit 2-oxoisovalerate beta subunit … fatty acid synthase beta subunit dehydratase fatty acid synthase alpha subunit reductase Identify multiple genes of reaction Allow curator to validate potential complexes 2-oxoisovalerate complex Present all possible combinations of complexes Fatty acid synthase complex …

Transport inference parser (TIP) 9934 free-text Protein annotations 442 transport reactions MFS glucose transporter ATP synthase … sucrose transporter Filter proteins for transporters Infer multimeric complex Infer substrate Infer energy-coupling mechanism … Bioinformatics (2008) 24 (13): i259-i267.

Pathologic predicts pathways 1770 enzyme- catalyzed reactions 893 reactions assigned to 265 Pathways … … X = #rxns in metacyc pwy Y = #rxns with enzyme evidence Z = #unique rxns in pwy P(X|Y|Z) = prob of pwy in Neurospora Science 293:2040-4, 2001.

Literature curation validates pathways … 523 citations associated with 265 pathways 56 complexes 524 genes …

PHENOTYPE-DIRECTED CURATION

Metabolic Model + Steady State Assumption = Predict Rate of Biomass Production Nutrient Uptake Biomass Components Knockout Phenotype Prediction

Nutrient Uptake Biomass Components Simulate KO by removing reaction Are any biomass components blocked when applying FBA? – No: non-essential gene – Yes: essential gene

Why is it useful?

Phenotype-directed curation

461 curated minimal media viable and essential genes

Minimal media gene essentiality training set Predicted GrowthNo-growth ObservedGrowth103 (TP)0 (FN) No-growth2 (FP)46 (TN) SensitivityTP/(TP+FN)100% SpecificityTN/(TN+FP)95.8%

105 auxotroph supplemental rescue phenotypes

Supplemental rescue training set Predicted GrowthNo-growth ObservedGrowth74 (TP)3 (FN) SensitivityTP/(TP+FN)96.1%

Final metabolic reconstruction 224 Pathways523 Citations Nutrient media (Vogels) NeurosporaCyc ElementsMetadata 56 Complexes 1650 Reactions 442 Transporters Biomass composition

Independent test set validation

Minimal media gene essentiality test set Predicted GrowthNo-growth ObservedGrowth274 (TP)20 (FN) No-growth1 (FP)15 (TN) Viable predicted/observed (sensitivity): 93.2% Essential predicted/observed (specificity): 93.8%

Comparison of model organisms under minimal media Yeast E.coli iJO Neurospora Viable Predicted/ Observed 691/772=89.5%1079/1159=91.2%274/294=93.2% Essential Predicted/Observed 49/152=32.3%168/207=81.2%15/16=93.8% [1] Dobson et al. BMC Systems Biology 2010, 4:145 [2] Molecular Systems Biology 7 Article number: 535

Supplemental rescue test set Predicted GrowthNo-growth ObservedGrowth23 (TP)5 (FN) Rescues predicted/observed (sensitivity): 82.1% trp-1,3 rescued by tryptophan nic-3 rescued by nicotinamide cys-11 rescued by sulfate, thiosulfite, methionine, cysteine

MULTIPLE KNOCKOUTS

Synthetic Lethality

Pathway interpretation

Synthetic lethal rescues computationally predicted

Pathway interpretation

Phenotype arrays

Acknowledgements Boston University James Galagan Jonathan Dreyfuss Andrew Krueger OHSU Heather Hood Texas A&M Joe Sturino Matt Sachs UC Riverside Kathy Borkovich Jacqueline Servin University of Leeds Alan Radford SRI Peter Karp Mario Latendresse Broad Institute Christian Stolte Citizen Scientist Linda Ocasio