Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.

Slides:



Advertisements
Similar presentations
Bienvenidos al PMN! Kate Dreher curator PMN/TAIR.
Advertisements

How pathway databases were created and curated Peifen Zhang Plant Metabolic Network (PMN)
Annotation of Gene Function …and how thats useful to you.
The Arabidopsis Information Resource (TAIR)
Kate Dreher AraCyc, TAIR, PMN Carnegie Institution for Science
El PMN: Tu amigo en el metabolismo de plantas Kate Dreher curator PMN/AraCyc/TAIR.
Part I: Tips and techniques from curators Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
SRI International Bioinformatics Comparative Analysis Q
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of a different aspect of the cellular.
The Plant Metabolic Network: PlantCyc, AraCyc, and NEW Metabolic Pathway Databases for Plant Research *K. Dreher, P. Zhang, L. Chae, R.A. Nilo Poyanco,
Abstract BarleyBase ( is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression.
Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Genomics, Proteomics and Metabolomics. Genomics l The complete set of DNA found in each cell is known as the genome l Most crop plant genomes have billions.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Update on The Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org MetaCyc.org.
Creating a … Community Database Organism-Specific Database Model-Organism Database.
Accessing the Data You Need at the Plant Metabolic Network kate dreher biocurator PMN The Carnegie Institution for Science Stanford, CA.
TAIR resources for plant biology research kate dreher curator TAIR/PMN.
Ch10. Intermolecular Interactions and Biological Pathways
1 SRI International Bioinformatics BioCyc Tutorial Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org,
SRI International Bioinformatics 1 Pathway Tools: Recent Developments GMOD Meeting, June 2006.
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix.
Accessing information in plant metabolic pathway databases at the PMN, Gramene, and SGN Part I: Contents, Search Strategies, and Data Sharing Opportunities.
TAIR/Gramene/SGN Workshop I ASPB Meeting July 08, 2007 Chicago, IL Metabolic Databases.
TAIR Workshop Model Organism Databases and Community Annotation Plant and Animal Genome XVI Conference, San Diego January 13, 2008.
The BioCyc Collection of Pathway/Genome Databases Alexander Shearer Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
SRI International Bioinformatics 1 Object Groups & Enrichment Analysis Suzanne Paley Pathway Tools Workshop 2010.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
PlantCyc, AraCyc, PoplarCyc and more... Building databases and connecting to researchers at the Plant Metabolic Network kate dreher curator PMN/TAIR.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
MetaCyc and AraCyc: Plant Metabolic Databases Hartmut Foerster Carnegie Institution.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Cellular Overview and Omics Viewer. SRI International Bioinformatics The Cellular Overview Diagram A way to quickly visualize an organism’s metabolism.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
Pathway Database Pathway Comparison Expression Viewer Discovery. Pankaj Jaiswal Oregon State University 1.
Combining Computational Prediction and Manual Curation to Create Plant Metabolic Pathway Databases Peifen Zhang Carnegie Institution For Science Department.
Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)
A collaborative tool for sequence annotation. Contact:
An approach to carry out research and teaching in Bioinformatics in remote areas Alok Bhattacharya Centre for Computational Biology & Bioinformatics JAWAHARLAL.
PlantCyc, AraCyc, PoplarCyc and more... Building databases with YOUR help at the Plant Metabolic Network kate dreher curator PMN/TAIR.
Introduction to biological molecular networks
Building and Refining AraCyc: Data Content, Sources, and Methodologies Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
High throughput biology data management and data intensive computing drivers George Michaels.
Comparative Analysis in BioCyc
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
Overview of Microbial Pathway and Genome Databases
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Incremental PathoLogic
Network biology An introduction to STRING and Cytoscape
Welcome to Gramene’s RiceCyc (Pathways) Tutorial
The Omics Dashboard.
Part II SeqViewer AraCyc Help
Presentation transcript:

Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang

Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang The Plant Metabolic Network (PMN)

Outline Introduction to PMN Query and retrieve the integrated information about pathways, metabolites, enzymes and genes Analyze sample gene expression data to identify changes in Arabidopsis metabolic pathways Behind the scene, how we create and curate a pathway database

encyclopedia

PMN is A network of plant metabolic pathway databases and database curation community –A plant reference database, PlantCyc Pathways, enzymes and genes consolidated from all plant species –A collection of single-species pathway databases Pathway Genome Databases (PGDB) Pathways, enzymes and genes in a particular species –A community for data curation Curators at databases (PMN, Gramene, SGN etc) Researchers in the plant biochemistry field

Comparison between plantcyc and single-species PGDBs Plantcyc –Comprehensive collection of pathways for all plants –Representative collection of all known enzymes in plants A PGDB –Comprehensive collection of pathways in a particular species –Comprehensive collection of enzymes, known or predicted, in that species

The same underlying software, Pathway Tools Data creation, storage and exchange User interface

Major data types in PMN databases Pathway –Diagram, summary, evidence, species Reaction –Equation, EC number Compound –Synonyms, structure Enzyme –Assignments to reaction/pathway, evidence, summary, properties (inhibitor, Km etc) Gene

Currently available PGDBs PGDBSpeciesHostStatus AraCycArabidopsisTAIR/PMNSubstantial curation RiceCycRiceGrameneSome curation SorghumGrameneNo curation MedicCycMedicagoNoble Foundationsome curation LycoCycTomatoSGNsome curation PotatoSGNNo curation PepperSGNNo curation TobaccoSGNNo curation PetuniaSGNNo curation CoffeeSGNNo curation

General query use cases How does a cell make or metabolize XXX? My gene is predicted to be a XXX, what does the enzyme do? What are the other genes involved in the same biochemical process as my gene? What is known and unknown of XXX pathway?

caffeine

Outline Introduction to PMN Query and retrieve the integrated information about pathways, metabolites, enzymes and genes Analyze sample gene expression data to identify changes in Arabidopsis metabolic pathways Behind the scene, how we create and curate a pathway database

Omics Viewer

Omics Viewer use cases Visualize and interpret large scale omics data in a metabolism context –Gene expression data –Proteomic data –Metabolic profiling data –Reaction flux data

Step one: prepare data input file

Step two: choose which data to be displayed

Last, choose color scheme and display type

Red: enhanced expression over my threshold Yellow: repressed expression over my threshold Blue: not significant under my threshold

Close-up to a pathway of interest

Generate a table of individual pathways exceeding certain threshold

Outline Introduction to PMN Query and retrieve the integrated information about pathways, metabolites, enzymes and genes Analyze sample gene expression data to identify changes in Arabidopsis metabolic pathways Behind the scene, how we create and curate a PGDB

Create PGDBs, why Huge sequence data are generated from genome and EST projects Put individual genes into the context of metabolic network Use the network to –discover missing enzymes –visualize and analyze large experimental data sets –design metabolic engineering –conduct comparative and evolutionary studies

Create PGDBs, how Manual extraction of pathways from the literature, assigning genes/enzymes to pathways Computational assigning genes/enzymes to reference pathways, manual validation/correction and further curation

Create PGDBs, how Annotated sequences, molecular function A reference database (such as MetaCyc and PlantCyc) PathoLogic (Pathway Tools software)

PathoLogic ANNOTATED GENOME AT1G69370 chorismate mutase prephenate aminotransferase arogenate dehydratase chorismateprephenateL-arogenateL-phenylalanine Gene calls Gene functions DNA sequences AT1G69370 chorismate mutase MetaCyc PGDB

New PGDB pipeline in PMN Prioritization –Available sequences, economic impact High priority –Poplar, Soybean, Maize, Wheat Others –Cotton, Grape, Sugarcane, Sunflower, Switchgrass…

A quality database requires manual validation and curation

Validation: prune false-positive predictions Pathways not operating in plants or not in a target species –glycogen biosynthesis –C4 photosynthesis –caffeine biosynthesis Pathways operating via a different route –Phenylalanine biosynthesis in bacteria v.s. in plants

Validation: add evidence and literature support Molecular data, enzymes and genes Radio tracer experiments Expert hypothesis (paper chemistry) Pure computational prediction

Curation: correct pathway diagrams

Curation: correct gene/enzyme assignments to reaction/pathway UGT89C1

Further curation Add missing or new pathways Add missing or new enzymes Add detailed literature information about a pathway, an enzyme etc

Community curation Adopt a newly created PGDB by a genome database Participate as a lab/group Participate as an individual Contact us:

Thank you!