The Gene Ontology Project: Content for the Semantic Web.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

1 Gene Ontology and Functional Annotation Donghui Li ASPB Plant Biology, June 29, 2008, Merida.
Annotation of Gene Function …and how thats useful to you.
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
An Introduction to the Gene Ontology (GO)
24th Feb 2006 Jane Lomax Gene Ontology tutorial Talk:Using the Gene Ontology (GO) for Expression Analysis Practical:Onto-Express analysis tool Talk: GO.
25th June 2007 Jane Lomax Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI.
Www. GeneOntology.org Gene Ontology Collaboration.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
“Biomedical computing is entering an age where creative exploration of huge amounts of data will lay the foundation of hypotheses. Much work must still.
Gene Ontology John Pinney
Welcome to mini-symposium on ontologies for biological sample description EMBL-EBI Wellcome Trust Genome Campus Deceber 5, 2001.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
Terry F. Hayamizu Mouse Genome Informatics, The Jackson Laboratory M OUSE A NATOMY O NTOLOGIES AND GXD.
Extending to the GO model OBO open biology ontologies aka - extended go - (ego)
1 Using Gene Ontology. 2 Assigning (or Hypothesizing About) Biological Meaning to Clusters What do you want to be able to to? –Identify over-represented.
COG and GO tutorial.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Internet tools for genomic analysis: part 2
Mouse Genome Informatics November 2008 Paul Szauter MGI User Support.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Using The Gene Ontology: Gene Product Annotation.
Gene Ontology (GO) Project
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
The European Bioinformatics Institute MGED ontology for consistent annotation of microarray experiments Manchester Bioinformatics Week Ontologies Workshop1.
Gene Ontology Overview and Perspective Lung Development Ontology Workshop.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Gene Ontology Consortium
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Gene Onotology Part 1: what is the GO? Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse Genome Informatics.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
The Plant Ontology Consortium Lincoln Stein 1, Susan McCouch 2, Elizabeth Kellogg 3, Seung Rhee 4, Pankaj Jaiswal 2, Doreen Ware 1, Peter Stevens 5 1 Cold.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Generating Useful Information in Toxicogenomics: Focused Efforts: Microarray Standards Feb. 6, 2003, The National Academies Chris Stoeckert, Ph.D. Center.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
Gene Ontology Consortium
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Organization Challenges for Building Biomedical Ontologies Talk by Jennifer Clark, Slides by Michael Ashburner and Suzanna Lewis
Development and Use of Controlled Vocabularies at the Arabidopsis Information Resource (TAIR) Sue Rhee Carnegie Institution Dept. Plant Biology
MAPPING OF SEQUENCES TO GENE ONTOLOGY. GO consortium.
Describing Bioinformatic Metadata at EBI James Malone
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Protein databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen and from CSC bio-opas
Gene Ontology TM (GO) Consortium
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Annotating with GO: an overview
GO : the Gene Ontology & Functional enrichment analysis
Department of Genetics • Stanford University School of Medicine
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Ontologies in Bioinformatics
Presentation transcript:

The Gene Ontology Project: Content for the Semantic Web

Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary terms (annotation) Develop tools: to query and modify the vocabularies and annotations annotation tools for curators GO Project Goals

GO provides two bodies of data: Terms with definitions and cross- references Gene product annotations with supporting data GO Data

Molecular Function elemental activity or task nuclease, DNA binding, transcription factor Biological Process broad objective or goal mitosis, signal transduction, metabolism Cellular Component location or complex nucleus, ribosome, origin recognition complex The Three Ontologies

DAG Structure Directed acyclic graph: each child may have one or more parents

is-a subclass; a is a type of b part-of physical part of (component) subprocess of (process) Relationship Types

Every path from a node back to the root must be biologically accurate The True Path Rule

ID Text string Definition with source Synonyms (optional) Cross-references (optional) GO Terms: Associated Data

Enzyme Commission (EC) Transport Commission (TC) University of Minnesota Biocatalysis/ Biodegradation Database (UM-BBD) MetaCyc GO Terms: Cross-References

Association between gene product and applicable GO terms Provided by member databases Made by manual or automated methods GO Annotation

Database object: gene or gene product GO term ID Reference publication or computational method Evidence supporting annotation GO Annotation: Data

DAG Structure Annotate to any level within DAG

Improve coverage: Developmental processes Physiological processes Relational database Support ontology development for additional domains of biology The Future of GO:

Names of gene products Protein domains Protein sequence features Phenotypes; diseases Anatomical terms (except as part of terms generated by cross-products) Terms outside the Scope of GO

Global Open Biology Ontologies Umbrella site for shared genomics and proteomics vocabularies Present incarnation: subdirectory within GO repository: ftp://ftp.geneontology.org/pub/go/gobo/README The GOBO Proposal

FlyBase & Berkeley Drosophila Genome Project WormBase Saccharomyces Genome Database DictyBase Mouse Genome Informatics Compugen, Inc The Arabidopsis Information Resource Swiss-Prot/TrEMBL/InterPro Pathogen Sequencing Unit (Sanger Institute) PomBase (Sanger Institute) Rat Genome Database Genome Knowledge Base (CSHL) The Institute for Genomic Research The Gene Ontology Consortium is supported by NHGRI grant HG02273 (R01). The Gene Ontology project thanks AstraZeneca for financial support. The Stanford group acknowledges a gift from Incyte Genomics.

Conference: Standards and Ontologies for Functional Genomics (SOFG) Towards unified ontologies for describing biology and biomedicine 17 – 20 November 2002 Hinxton Hall Conference Centre Hinxton, Cambridge, UK

First Standards and Ontologies for Functional Genomics (SOFG) Keynote Speakers Ken Buetow, NCI, USA Win Hide, SANBI, South Africa Peter Karp, SRI International, USA November 2002, Hinxton, UK

Aims and Objectives Bring together scientists developing standards and ontologies, both biologists, bioinformaticians and computer scientists

Topics Introduction to Ontologies Tools for building ontologies Go and related ontologies Species specific ontologies Implementation Inter-ontology mapping Ontologies for pathology, toxicology Chemical ontologies

Structure 3 keynote speakers ~20 invited talks 10 short talks selected from poster abstracts Panel discussion Parallel working groups/tutorials

Programme Committee Michael Ashburner, University of Cambridge, UK (Chair) Cathy Ball, Stanford University, USA Mike Bittner, NHGRI, USA Alvis Brazma, EMBL-EBI, UK Catherine Brooksbank, EMBL-EBI, UK Duncan Davidson, MRC HGU, Edinburgh, UK Liz Ford, EMBL-EBI, UK Midori Harris, EMBL-EBI, UK Victor Markowitz, Gene Logic, USA Helen Parkinson, EMBL-EBI, UK John Quackenbush, TIGR, USA Martin Ringwald, The Jackson Laboratories, USA Steffen Schulze-Kremer, RZPD, Germany Paul Spellman, U.C. Berkeley, USA Robert Stevens, University of Manchester, UK Chris Stoeckert, University of Pennsylvania, USA

URL

chitin metabolism: before revision The True Path Rule chitin biosynthesis cuticle synthesis chitin catabolism cell wall biosynthesis chitin metabolism

chitin metabolism: after revision The True Path Rule

chitin metabolism: after revision The True Path Rule cuticle synthesischitin metabolism cuticle chitin biosynthesis chitin biosynthesiscuticle chitin metabolism

Open source Can be instantiated in DAML+OIL or GO syntax Orthogonal Shared ID space Defined terms GOBO Criteria

hexose glucose fructose DAG Cross-Products metabolism biosynthesis catabolism hexose metabolism hexose biosynthesis glucose biosynthesis fructose biosynthesis hexose catabolism glucose catabolism fructose catabolism glucose metabolism... etc.

gene gene_attribute gene_structureSO gene_variationME gene_product gene_product_attribute molecular_functionGO protein_familyINTERPRO phenotype mutant phenotype anatomy For complete current draft see ftp://ftp.geneontology.org/pub/go/gobo/README Some GOBO Ontologies

Not a way to unify biological databases Not a dictated standard Does not define evolutionary relationships Additional ontologies needed to model biology and experimentation What GO is NOT:

DAG Structure Annotate to any level within DAG mitosis S.c. NNF1 mitotic chromosome condensation S.c. BRN1, D.m. barren

Using GO Annotation: Example Workflow text

ID definition cross-reference synonyms

Using GO Annotation: Example Workflow