Kate dreher biocurator / plant molecular biologist The Carnegie Institution for Science Stanford, CA Introduction to the Plant Metabolic Network: Data.

Slides:



Advertisements
Similar presentations
Bienvenidos al PMN! Kate Dreher curator PMN/TAIR.
Advertisements

How pathway databases were created and curated Peifen Zhang Plant Metabolic Network (PMN)
TAIR: Bringing together data for the global plant biology community kate dreher curator TAIR/PMN.
The Arabidopsis Information Resource (TAIR)
Arabidopsis as a model for plant development Eva Huala.
Kate Dreher AraCyc, TAIR, PMN Carnegie Institution for Science
El PMN: Tu amigo en el metabolismo de plantas Kate Dreher curator PMN/AraCyc/TAIR.
Part I: Tips and techniques from curators Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
GDR, the Genome Database for Rosaceae, in Chado and Tripal Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Sushan Ru, Ping Zheng,
The Plant Metabolic Network: PlantCyc, AraCyc, and NEW Metabolic Pathway Databases for Plant Research *K. Dreher, P. Zhang, L. Chae, R.A. Nilo Poyanco,
Introduction to the Plant Metabolic Network: 18 Databases and Omics-Level Tools for Analysis and Discovery kate dreher The Carnegie Institution for Science.
Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Gene Expression Resources Available from MaizeGDB Kokulapalan Wimalanathan 1, Jack Gardiner 4 5, Bremen Braun 2, Ethalinda KS Cannon 4, Mary Schaeffer.
Update on The Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org MetaCyc.org.
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
Accessing the Data You Need at the Plant Metabolic Network kate dreher biocurator PMN The Carnegie Institution for Science Stanford, CA.
SRI International Bioinformatics 1 Searching BioCyc Ron Caspi.
TAIR resources for plant biology research kate dreher curator TAIR/PMN.
1 SRI International Bioinformatics BioCyc Tutorial Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org,
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Data Content of the BioCyc Databases. BioCyc Tier 1 Databases.
Copyright OpenHelix. No use or reproduction without express written consent1.
New data and tools at TAIR (The Arabidopsis Information Resource)
Bioinformatics Dr. Víctor Treviño BT4007
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Accessing information in plant metabolic pathway databases at the PMN, Gramene, and SGN Part I: Contents, Search Strategies, and Data Sharing Opportunities.
TAIR/Gramene/SGN Workshop I ASPB Meeting July 08, 2007 Chicago, IL Metabolic Databases.
TAIR Workshop Model Organism Databases and Community Annotation Plant and Animal Genome XVI Conference, San Diego January 13, 2008.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
MMAP: mouse Metabolomics Analysis Platform Preeti Bais 09/09/2014.
SRI International Bioinformatics 1 Object Groups & Enrichment Analysis Suzanne Paley Pathway Tools Workshop 2010.
Copyright OpenHelix. No use or reproduction without express written consent1.
PlantCyc, AraCyc, PoplarCyc and more... Building databases and connecting to researchers at the Plant Metabolic Network kate dreher curator PMN/TAIR.
MetaCyc and AraCyc: Plant Metabolic Databases Hartmut Foerster Carnegie Institution.
Community Interactions: Feedback, Support and Curation Eva Huala The Arabidopsis Information Resource (TAIR)
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
© 2014 SRI International About OMICS Group OMICS Group International is an amalgamation of Open Access publications and worldwide international science.
Combining Computational Prediction and Manual Curation to Create Plant Metabolic Pathway Databases Peifen Zhang Carnegie Institution For Science Department.
Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman Bioinformatics Research Group SRI, International February 1, 2008.
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
Copyright OpenHelix. No use or reproduction without express written consent1.
A collaborative tool for sequence annotation. Contact:
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
PlantCyc, AraCyc, PoplarCyc and more... Building databases with YOUR help at the Plant Metabolic Network kate dreher curator PMN/TAIR.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Building and Refining AraCyc: Data Content, Sources, and Methodologies Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
Getting GO: how to get GO for functional modeling Iowa State Workshop 11 June 2009.
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
Module 5: Future 1 Canadian Bioinformatics Workshops
Copyright OpenHelix. No use or reproduction without express written consent1 1.
2006 ICAR: TAIR workshop Organizers: Katica Ilic and Peifen Zhang Location: Reception Room, 4th floor A general overview of TAIR website and demonstration.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
Cheminformatics and Metabolism Team The EBI Enzyme Portal.
An Advanced Web Query Interface for Biological Databases
The Pathway Tools Software and BioCyc Database Collection
Annotation Presentation
SRI Bioinformatics Research Group
Part II SeqViewer AraCyc Help
Presentation transcript:

kate dreher biocurator / plant molecular biologist The Carnegie Institution for Science Stanford, CA Introduction to the Plant Metabolic Network: Data and Tools for Analysis and Discovery

Free access to high quality, curated data promotes beneficial research on plant metabolism  Plants provide crucial benefits to humanity and the ecosystem  A better understanding of plant metabolism may contribute to: More nutritious foods More pest-resistant plants More stress-tolerant crops Higher photosynthetic capacity and higher yield in agricultural and biofuel crops... many more applications  These efforts benefit from access to high quality plant metabolism data

PMN provides data and analysis tools:

The Plant Metabolic Network has multiple goals  Support research, breeding, and education  Transform published results into data-rich metabolic pathways Capture information on Enzymes, Reactions, and Compounds  Create and deploy improved methods for predicting enzyme function and metabolic capacity using plant genome sequences  Provide public resources: PlantCyc AraCyc 16 additional species-specific databases  Facilitate data analysis

Numerous collaborators contribute to the Plant Metabolic Network  SRI International – BioCyc project Provide Pathway Tools Software Maintain and update MetaCyc  Other collaborators / contributors include: MaizeGDB GoFORSYS SoyBase Sol Genomics Network (SGN) / Boyce Thompson Institute Cosmoss Gramene PlantMetabolomics group... and more SoyBase Editorial Board Community Submissions

17 PMN species are phylogenetically and "functionally" diverse Eudicots Cereals Basal land plants Basal land plants Green alga Major crops Model species Legume Woody and herbaceous species Annuals and perennials C3 and C4 species

PlantCyc provides access to important pathways not found in species-specific databases  PlantCyc contains over 1000 pathways and information from over 400 plant species 1050 pathways 1050 pathways 476 pathways (average) 476 pathways (average)

PlantCyc provides access to numerous specialized metabolic pathways  Many PlantCyc-specific metabolic pathways produce or break-down specialized ("secondary") metabolites  Many of the enzymes in these pathways have experimental evidence Caffeine biosynthesis I (Caffea arabica) Morphine biosynthesis (Papaver somniferum) Taxol biosynthesis (cancer drug) (Taxus brevifolia) Raspberry ketone biosynthesis (Rubus idaeus) Vicianin bioactivation (defense compound) (Vicia sativa) Alliin degradation (garlic odor) (Allium sativum)

The PMN developed a new method to predict enzyme functions  A high-confidence Reference Protein Sequence Database was built RPSD 2.0 contains 34,269 enzymes and 82,216 non-enzymes  The Ensemble Enzyme Prediction Pipeline (E2P2) uses the RPSD to predict functions based on protein sequence It assigns protein functions using standard EC (Enzyme Commission) numbers AND using specific reaction IDs Many reactions are not associated with full EC numbers but are associated with specific reaction IDs E2P2 can predict a higher number of enzyme functions E2P2 may help to fine-tune predictions among large enzyme families

Evidence codes enhance transparency  Evidence codes clearly define when enzymes are assigned to reactions based on computational or experimental evidence  Users can check references for computational prediction methods and original experiments in journal articles

A simple search connects you to a wealth of information AT3G16785

Quickly move between genes, enzymes, reactions, compounds, and pathways

Pathways may look deceptively simple at first glance

Pathway Enzyme Gene Compound Evidence Codes Regulation Upstream pathway The PMN provides valuable pathway context information Reaction

Curated summaries provide background information, cite papers, and link to other PMN pages

Different search tools help meet the needs of many users from high school biology students to bioinformaticians  Quick Search  Detailed search menus for for Compounds, Pathways, and Enzymes  Advanced Search (queryable with BioVelo language)  BLAST against PMN-specific enzyme sets  Comparative analysis  JavaCyc and PerlCyc  Complete downloadable files - Biopax and other file formats  Groups - for custom dataset building

"Groups" facilitate data retrieval, enrichment, and sharing

Transform your search results into a rich data set

Work with groups

Different properties and transformations are available for enzymes, reactions, pathways, and compounds

Groups can be downloaded and shared

Omics datasets can be displayed on a metabolic map

A simple tab-delimited text file should IDs/names and data value(s) ID (Gene ID) (enzyme name) (compound name) etc. ID (Gene ID) (enzyme name) (compound name) etc. Numerical data: (Relative values) (Absolute levels) Numerical data: (Relative values) (Absolute levels)

Results can be examined at different levels and connect to relevant pages

Additional analysis tools and resources are available

The PMN actively interacts with its users  We answer questions!  We make suggested corrections!  We make public presentations and meet with individuals Booth 200 Poster  We ask for your help  We thank you publicly CHOCOLATE !! (Theobroma cacao)

The PMN is working on additional enhancements  Continue to improve our enzyme annotation pipeline  Increase coverage of secondary metabolism  Add transcription factors that affect enzyme expression  Predict plant metabolic pathways for additional species

Plant metabolic NETWORKING  Please use our data  Please use our tools  Please help us to improve our databases!  Please contact us if we can be of any help!

PMN Acknowledgements Curator: - kate dreher Post-docs: - Lee Chae - Ricardo Nilo Poyanco - Chuan Wang Interns - Caryn Johansen Tech Team Members: - Bob Muller (Manager) - Larry Ploetz (Sys. Admin) - Shanker Singh - Bill Nelson Rhee Lab Members: - Flavia Bossi - Jue Fan - Hye-in Nam - Taehyong Kim - Meng Xu Peifen Zhang (Director and curator) Sue Rhee (PI) Collaborators: SRI - Peter Karp - Ron Caspi - Hartmut Foerster - Suzanne Paley - SRI Tech Team MaizeGDB - Mary Schaeffer - Lisa Harper - Jack Gardiner - Taner Sen ChlamyCyc - Patrick May - Dirk Walther - Lukas Mueller (SGN) - Gramene and MedicCyc DOE PMN Alumni: - A. S. Karthikeyan (curator) - Christophe Tissier (curator) - Hartmut Foerster (curator) - Eva Huala (co-PI) - Tam Tran (intern) - Varun Dwaraka (intern) - Damian Priamurskiy (intern) - Ricardo Leitao (intern) - Michael Ahn (intern) - Purva Karia (intern) - Anuradha Pujar (SGN curator) Tech Team Alumni - Anjo Chi - Cynthia Lee - Tom Meyer - Vanessa Kirkup - Chris Wilks - Raymond Chetty

 Please use our data  Please use our tools  Sue Rhee (PI) Peifen Zhang (Director) We're here to help... Booth 200 Poster 7029 Booth 200 Poster 7029 Thank you!!