Slide-1 ONTOLOGY DEVELOPMENT AND INTEGRATION Tutorial exercise: A preview.



Slide-2 What’s in a name/vocabulary?
How do we define “cell”?
– the basic structural and functional unit of all living organisms
– a device that delivers an electric current as the result of a chemical reaction
– a room where a prisoner is kept
– any small compartment (e.g. cells of a honeycomb)
– a small unit serving as part of or as the nucleus of a larger political movement
A cell can be a whole organism or a part of it.
Source: GO teaching resources

Slide-3 What is an Ontology?
The problem:
– Vast amounts of biological data
– Different names/terms for the same concepts, so cross-species comparison is difficult
A (part of the) solution:
– Ontology: “a controlled vocabulary that can be applied to either all organisms or at least within a kingdom/sub-class/family even as knowledge of phenotypes and the associated gene and their roles in cells is accumulating and changing”
An ontology is a glossary of keywords arranged in a structured order or a network based on the biological concepts.
Source: GO teaching resources

Slide-4 What is an Ontology?
– NOT a system of nomenclature or a list of gene products/phenotypes. It doesn’t attempt to cover all aspects of biology or evolutionary relationships.
– NOT a dictated standard.
– NOT a way to unify databases. It allows users to query different databases using the same keywords and query strings, provided those databases have implemented the commonly adopted ontologies.
Source: GO teaching resources

Slide-5 In Gramene we have ontologies describing three different types of biological concepts.
Gene Ontology (GO) to describe a protein/gene's biochemical property
– Molecular Function (e.g. transporter, enzyme)
– Role in a Biological Process (e.g. photosynthesis, defense response)
– Localization in a Cellular Component (e.g. plastid, cell wall)
Plant Ontology (PO) to describe a protein/gene/phenotype expression
– In a Plant Structure (e.g. panicle, flower, xylem, phloem)
– At a Growth Stage (e.g. germination, embryo development)
Trait Ontology (TO) to describe the observable feature assayed to determine the phenotype
– Plant traits (e.g. leaf color, plant height, disease resistance)
How does it work?

Slide-6 Anatomy of an ontology
Ontology terms are composed of
– Term name
– Unique ID
– Definition (more than 75% of terms defined)
– Synonyms (optional)
– Database references (optional)
– Relationships to other terms in the same ontology
Gene Ontology terms (from GO consortium)
400+ Trait Ontology terms (from Gramene)
400+ plant structure terms (from PO consortium)
200+ cereal plant growth stages terms (from Gramene)
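The term fields listed on this slide can be pictured as a small record type. The sketch below is only an illustration, not Gramene's or the GO consortium's actual schema: the class name `OntologyTerm`, its field names, and the IDs `EX:0001` and `EX:0002` are hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class OntologyTerm:
    """One ontology term, holding the fields listed on the slide."""
    term_id: str                       # unique ID
    name: str                          # term name
    definition: Optional[str] = None   # not every term has a definition
    synonyms: List[str] = field(default_factory=list)   # optional
    xrefs: List[str] = field(default_factory=list)      # optional database references
    # (relationship type, parent term ID) pairs within the same ontology
    relationships: List[Tuple[str, str]] = field(default_factory=list)


# Hypothetical IDs: "EX:0001" and "EX:0002" are placeholders, not real accessions.
stem = OntologyTerm(
    term_id="EX:0002",
    name="stem",
    synonyms=["culm"],
    relationships=[("part_of", "EX:0001")],
)
print(stem.name, stem.synonyms, stem.relationships)
```

Optional fields default to empty values, which mirrors the slide's note that synonyms and database references are optional and that some terms have no definition yet.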

Slide-7
Instance of (is a, type of): Used to describe the relationship between a child term that represents a specific type of a more general parent term. For example: a caryopsis is a type of fruit; a panicle is an inflorescence.
Part of: Used to indicate the relationship between a child term that is a part of the parent term. For example: the ectocarp is a part of the pericarp, which in turn is part of the fruit.
Develops from (used only in plant structure ontology): Used to describe the relationship between a child term that develops from its parent term. For example: the root hair develops from the trichoblast.
Each 'child term' has a unique relationship to its 'parent term'.
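As a rough sketch of how these three relationship types can be stored and followed, the example below keeps each child term with its typed edge to a parent and walks upward. The edges come from the examples on this slide; the names `EDGES` and `ancestors` and the relationship labels `is_a`, `part_of`, and `develops_from` are illustrative assumptions, not the format used by Gramene or the PO consortium.

```python
# Child term -> list of (relationship type, parent term), from the slide's examples.
EDGES = {
    "caryopsis": [("is_a", "fruit")],
    "panicle": [("is_a", "inflorescence")],
    "ectocarp": [("part_of", "pericarp")],
    "pericarp": [("part_of", "fruit")],
    "root hair": [("develops_from", "trichoblast")],
}


def ancestors(term, edges=EDGES):
    """Return every term reachable by following edges upward from `term`."""
    seen = []
    stack = [term]
    while stack:
        current = stack.pop()
        for _relationship, parent in edges.get(current, []):
            if parent not in seen:
                seen.append(parent)
                stack.append(parent)
    return seen


print(ancestors("ectocarp"))  # ['pericarp', 'fruit']
```

Because the ectocarp is part of the pericarp, which is part of the fruit, the walk from "ectocarp" reaches both parents.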

Slide-8 Ontology Structure: Plant structure example
[Figure: generic tree of plant structure terms: plant structure, inflorescence, flower, tissue, organ, tapetum, stamen, anther, pollen, shoot, floral organ, sepal, petal]
In a generic tree one can see that the terms are related, but the type of relationship is not apparent.

Slide-9 Ontology Structure: Plant Structure example
[Figure: ontology tree of plant structure terms (plant structure, inflorescence, flower, tissue, organ, tapetum, stamen, anther, pollen, shoot, floral organ, sepal, petal) connected by "part of" and "instance of" relationships]
In an ontology tree the relationships between the terms become more apparent because they are based on the biological information.

Slide-10 Ontology Structure: Cellular component example from GO
[Figure: generic tree of cellular component terms: cell, mitochondria, membrane, plastid, chloroplast, mitochondrial membrane, chloroplast membrane]
Similarly, in a generic tree one can see that the cellular component terms are related, but it is NOT clear how they are related. This matters because a user who does not know all the detailed components of an organelle will not be able to find all the appropriate annotations to a parent organelle. The information remains scattered, with no single way to find or cluster it all.

Slide-11 Ontology Structure: Cellular component example from GO
[Figure: ontology tree of cellular component terms (cell, mitochondria, membrane, plastid, chloroplast, mitochondrial membrane, chloroplast membrane) connected by "part of" and "instance of" relationships]
Whereas if the relationship types are established, it is easy to browse up or down the tree based on biological knowledge. Finer components sit lower in the tree, and gross-level components appear as we move upward.

Slide-12 Ontology Structure: Molecular function example from GO
[Figure: tree connected by "instance of" relationships: molecular function, enzyme activity, ligase activity, hydrolase activity, glutamate-ammonia ligase activity, alpha-amylase activity]

Slide-13 How does an ontology help find your favorite gene/phenotype?
[Figure: molecular function tree connected by "instance of" relationships, with the genes associated to each term]
– Glutamate-ammonia ligase activity: GS1
– Alpha-amylase activity: Amy1, Amy2, Amy3
– Hydrolase activity: OSA1, SAP2, Amy1, Amy2, Amy3
– Ligase activity: GS1
– Enzyme activity: GS1, OSA1, SAP2, Amy1, Amy2, Amy3
As one moves upwards in a tree, the associations (based on annotations) from the children terms are accumulated by the parent terms based on their relationship. Thus you see two types of associations:
– Direct associations: exact, finer-level associations to an ontology term, e.g. the Amy genes are directly associated to alpha-amylase activity.
– Indirect associations: associations accumulated by the parents from their children terms, e.g. the Amy genes are indirectly associated to hydrolase activity, because alpha-amylase activity is an instance of hydrolase activity.
Thus if a user enters the ontology search/browse using hydrolase activity, the results will return not only the OSA1 and SAP2 genes that are directly associated to this term but also the indirectly associated Amy genes. From this point onwards the user can find finer-level annotations by going downwards in the tree, or get collective information at the gross level by going upwards.
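The accumulation of direct associations into indirect ones can be sketched in a few lines. This is a simplified illustration under stated assumptions, not Gramene's implementation: each term is given a single parent (real ontologies allow several), the hierarchy is as read from the slide's figure, and the names `IS_A`, `DIRECT`, and `all_associations` are made up for the example.

```python
# Child term -> parent term ("instance of"), as read from the slide's figure.
# Assumption: one parent per term, which is a simplification.
IS_A = {
    "alpha-amylase activity": "hydrolase activity",
    "glutamate-ammonia ligase activity": "ligase activity",
    "hydrolase activity": "enzyme activity",
    "ligase activity": "enzyme activity",
    "enzyme activity": "molecular function",
}

# Direct associations: genes annotated exactly to a term.
DIRECT = {
    "alpha-amylase activity": {"Amy1", "Amy2", "Amy3"},
    "hydrolase activity": {"OSA1", "SAP2"},
    "glutamate-ammonia ligase activity": {"GS1"},
}


def all_associations(is_a, direct):
    """Return direct plus indirect associations for every term."""
    total = {term: set(genes) for term, genes in direct.items()}
    for term, genes in direct.items():
        parent = is_a.get(term)
        while parent is not None:
            # Each ancestor accumulates the genes of its descendants.
            total.setdefault(parent, set()).update(genes)
            parent = is_a.get(parent)
    return total


totals = all_associations(IS_A, DIRECT)
print(sorted(totals["hydrolase activity"]))
# ['Amy1', 'Amy2', 'Amy3', 'OSA1', 'SAP2']
```

Querying "hydrolase activity" returns its two direct genes plus the three Amy genes inherited from alpha-amylase activity, and "enzyme activity" collects all six genes.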

Slide-14 Ontology Structure: Biological process example with associations

Slide-15 How to search or browse ontologies on the Gramene website at ? Please follow the instructions / pointers in the following slides.

Slide-16 Browsing the Ontology Database
1. Click “Ontology” on the Gramene navigation bar
2. Click on “Current Ontologies”
3. Click on “BROWSE” to navigate through the desired ontology type.

Slide-17 Click “Ontology” on the Gramene navigation bar Select “Gene Ontology” Type your query e.g. search for function alpha-amylase Searching the Gene Ontology (GO) Database

Slide-18 Accession for the Ontology term. Select to view detailed information. Exact ontology term Synonyms (if any) Definition of the term Gene Ontology (GO) search results

Slide-19 The lineage of alpha-amylase activity as a molecular function Term-term relationship [i]: IS A (instance/type of) Number of gene products listed in the database associated with this activity Features of a GO term Exact ontology term Definition of the term Expandable tree Click on term to expand.

Slide-20 GO Associations
– Evidence: suggests the type of experiments carried out to ascertain the function.
– Gene symbol (allows alphabetical sorting)
– Protein/gene name. Links to the Gramene Protein Database.
– Children terms in the tree following the primary vocabulary term for which the protein function was annotated
– Download the whole list
– Click here to find functional homologs from other model organisms. Links to source: the Gene Ontology website

Slide-21 Select “Plant structure (PO)” Type your query e.g. search for the plant part culm Searching Plant Ontology (PO): Plant structure culm

Slide-22 Accession for the Ontology term. Select to view detailed information. Exact ontology term Synonyms (if any) Definition of the term Plant Ontology (PO) search results Culm is a synonym for Stem
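The search result above shows a query for a synonym (culm) resolving to its primary term (stem). A minimal sketch of that lookup follows. It assumes exact, case-insensitive matching on names and synonyms; the actual Gramene search may match differently, and `TERMS` and `search` are hypothetical names.

```python
# Tiny illustrative vocabulary; only the stem/culm synonym comes from the slide.
TERMS = [
    {"name": "stem", "synonyms": ["culm"]},
    {"name": "shoot", "synonyms": []},
]


def search(query, terms=TERMS):
    """Return primary term names whose name or synonym equals the query."""
    wanted = query.strip().lower()
    hits = []
    for term in terms:
        candidates = [term["name"]] + term["synonyms"]
        if any(wanted == candidate.lower() for candidate in candidates):
            hits.append(term["name"])
    return hits


print(search("culm"))  # ['stem']
```

Returning the primary term name rather than the matched synonym keeps annotations attached to one canonical term, whichever synonym the user types.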

Slide-23 Features of a PO term
Stem is a PART OF “Shoot”
Number of mutants associated with this plant part
Download/Display all the phenotypes associated with “stem”

Slide-24 Mutant gene symbol (allows alphabetical sorting) Mutant gene name. Links to the Gramene Mutant Database. Children terms in the tree following the Primary vocabulary term for which the mutant gene was annotated PO Associations

Slide-25 Select “Growth stage (GRO)” Type your query e.g. search for plant growth stage germination Searching Plant Ontology: Growth stages Follow the search results by selecting the term e.g.“germination” in rice (GRO: ). Display / download all associations to view associated phenotypes.

Slide-26 B. Select “Trait (TO)” A. Type your query e.g. search for plant trait plant height Searching the Trait Ontology (TO) Database

Slide-27 Accession for the Ontology term. Select to view detailed information. TO search results

Slide-28 The ontology tree suggests the higher class of trait/category e.g. stature or vigor Number of mutants associated with this trait. Download the list of phenotypes associated with trait plant height TO Features and Associations

Slide-29 How are associations built in an annotation process?
The following slides will guide you through the methodologies used by Gramene on associating
– Gene products to Gene Ontology terms for molecular function, biological process and localization (expression) in a cellular component.
– Phenotypes to the plant ontology terms where (plant part) and when (growth stage) the phenotype is expressed.

Slide-30 Annotation-I: How are associations built in an annotation process? Manual vs. Electronic (computed)
[Figure labels:] Published report -PubMed -BIOSIS -Others Plant Ontology Anatomy & growth stages Phenotypes Mutants QTL Trait Ontology Electronic Curation information Sequence similarity ClustalW / BLAST Traceable author statement Predictions/identification Gene Ontology mapping Gramene & InterPro (EBI) Pfam PROSITE PROTOMAP Transmembrane helices Cellular localization Predictions based on HMM Physicochemical properties ProDom 3D-Structural alignments DBXref / References Gene Ontology Molecular function Biological process Cellular localization Gene products Protein Sequences Computed Manual

Slide-31 How are associations built as part of annotation exercise?
From abstract (manual/computed)
– Function: oxidase enzyme; annotated as oxidoreductase activity, GO:
– Process: biosynthesis of gibberellin; annotated as gibberellic acid biosynthesis, GO:
– Trait: reduced height; annotated as plant height, TO:
From manual curation and further reading
– Function: oxidase enzyme; annotated as gibberellin 20-oxidase activity **, GO:
– Process: biosynthesis of gibberellin; annotated as gibberellic acid biosynthesis, GO:
– Trait: reduced height; annotated as culm length **, TO:
** Annotations were modified based on further reading

Slide-32 Ontology Exercise TRY ON YOUR OWN! Make your own assertions about which ontology terms from the GO, PO or TO vocabularies appropriately match the function and phenotype traits associated with the PLASTOCHRON1 gene.

Slide-33 Ontology Exercise TRY ON YOUR OWN ! For clues, please see the underlined portions of the text, to make your own assertions on the use of either the GO, PO or TO vocabularies.

Slide-34 Ontology Annotation includes various experimental evidence codes suggesting how the ontology term to gene/phenotype association was made.
ISS: Inferred from Sequence/Structural Similarity
IDA: Inferred from Direct Assay
IPI: Inferred from Physical Interaction
TAS: Traceable Author Statement
NAS: Non-traceable Author Statement
IMP: Inferred from Mutant Phenotype
IGI: Inferred from Genetic Interaction
IEP: Inferred from Expression Pattern
IC: Inferred by Curator
ND: No Data available
IEA: Inferred from Electronic Annotation
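One way to use these codes is as a lookup table for separating computed annotations from the rest. The sketch below treats IEA as the only electronic code on this slide, which is an assumption for illustration. The gene names `GeneA`, `GeneB`, and `GeneC` and their codes are hypothetical, not real annotations.

```python
# Evidence codes and their meanings, as listed on the slide.
EVIDENCE_CODES = {
    "ISS": "Inferred from Sequence/Structural Similarity",
    "IDA": "Inferred from Direct Assay",
    "IPI": "Inferred from Physical Interaction",
    "TAS": "Traceable Author Statement",
    "NAS": "Non-traceable Author Statement",
    "IMP": "Inferred from Mutant Phenotype",
    "IGI": "Inferred from Genetic Interaction",
    "IEP": "Inferred from Expression Pattern",
    "IC": "Inferred by Curator",
    "ND": "No Data available",
    "IEA": "Inferred from Electronic Annotation",
}

# Assumption for this sketch: IEA is the only computed (electronic) code.
ELECTRONIC_CODES = {"IEA"}


def split_by_evidence(annotations):
    """Split (gene, code) pairs into non-electronic and electronic gene lists."""
    other, electronic = [], []
    for gene, code in annotations:
        if code not in EVIDENCE_CODES:
            raise ValueError(f"unknown evidence code: {code}")
        (electronic if code in ELECTRONIC_CODES else other).append(gene)
    return other, electronic


# Hypothetical annotations, for illustration only.
example = [("GeneA", "IDA"), ("GeneB", "IEA"), ("GeneC", "IMP")]
curated, computed = split_by_evidence(example)
print(curated, computed)  # ['GeneA', 'GeneC'] ['GeneB']
```

Rejecting unknown codes early keeps a mistyped code from silently landing in either group.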

Slide-35 What else can YOU do? Send us your review of the terms, definitions and relationships to ensure accuracy. Suggest new terms, definitions, or improvements to the structures. Use the terms in describing data in publications and databases. If your project on cereal plants is generating data sets that may require these kinds of annotations and associations, please feel free to reach us at We will be happy to help guide you through the annotation process and if necessary in setting up an Ontology database.

Slide-36 Thank you for using this tutorial. We appreciate your comments or suggestions. Please click here to send your feedback. If you have questions, please browse the Frequently Asked Questions (FAQ). You can also reach us by sending at