Gene Ontology Overview and Perspective Lung Development Ontology Workshop.

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

Judith A. Blake, David P. Hill, Barry Smith BioOntologies SIG: Vienna July 20, 2007 Gene Ontology Annotations: What they mean and where they come from.
Applications of GO. Goals of Gene Ontology Project.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
Microarray Data Analysis Day 2
The Gene Ontology and Immune System Processes Alexander D. Diehl 6/11/12.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Gene Ontology John Pinney
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
COG and GO tutorial.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
BI class 2010 Gene Ontology Overview and Perspective.
Comprehensive Annotation System for Infectious Disease Data Alexander Diehl University at Buffalo/The Jackson Laboratory IDO Workshop /9/2010.
CACAO - Penn State Gene Function and Gene Ontology January 2011
Methods for Creating GO Annotations Emily Dimmer European Bioinformatics Institute Wellcome Trust Genome Campus Cambridge UK.
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
A Common Language for Annotation of Genes from Yeast, Flies and Mice The Gene Ontologies …and Plants and Worms …and Humans …and anything else!
Mouse; NOTC1; Q01705 Function: protein binding Process: Notch signaling pathway Process: positive regulation of transcription from RNA polymerase II promoter.
Using The Gene Ontology: Gene Product Annotation.
Gene Ontology (GO) Project
Curatorial Procedures at Mouse Genome Informatics with an Emphasis on Expression Data Constance M. Smith The Jackson Laboratory Bar Harbor, ME.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
SPH 247 Statistical Analysis of Laboratory Data 1May 14, 2013SPH 247 Statistical Analysis of Laboratory Data.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Gene Ontology Consortium
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Gene expression analysis
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Integrating the Cell Cycle Ontology with the Mouse Genome Database David R. Smith Mary Dolan Dr. Judith Blake.
Gene Onotology Part 1: what is the GO? Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse Genome Informatics.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Increasing GO Annotation Through Community Involvement Fiona McCarthy*, Nan Wang*, Susan Bridges** and Shane Burgess** GO.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
A Common Language for Annotation of Genes from Yeast, Flies and Mice The Gene Ontologies …and Plants and Worms …and Humans …and anything else!
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
Gene Ontology Consortium
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
An example of GO annotation from a primary paper GO Annotation Camp, July 2006 PMID:
Gene Ontology TM (GO) Consortium
Towards a unified MOD resource: An Overview
Annotating with GO: an overview
Introduction to the Gene Ontology
Mental Functioning and the Gene Ontology
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Gene expression analysis
QTL Annotation in MGI Susan M Bello, Ira Lu, Cynthia L Smith, Janan T Eppig, and the Mouse Genome Informatics Group.
Browsing the GO at MGI Harold Drabkin, Ph.D. Senior Scientific Curator
Insight into GO and GOA Angelica Tulipano , INFN Bari CNR
Presentation transcript:

Gene Ontology Overview and Perspective Lung Development Ontology Workshop

2  what kinds of things exist?  what are the relationships between these things? eye part_of sclera is_a sense organ develops from Optic placode A biological ontology is: A (machine) interpretable representation of some aspect of biological reality

3 Gene Ontology (GO) Consortium Formed to develop a shared language adequate for the annotation of molecular characteristics across organisms; a common language to share knowledge. Seeks to achieve a mutual understanding of the definition and meaning of any word used; thus we are able to support cross- database queries. Members agree to contribute gene product annotations and associated sequences to GO database; thus facilitating data analysis and semantic interoperability.

4 Gene Ontology widely adopted AgBase

5 Molecular Function = elemental activity/task  the tasks performed by individual gene products; examples are carbohydrate binding and ATPase activity Biological Process = biological goal or objective  broad biological goals, such as mitosis or purine metabolism, that are accomplished by ordered assemblies of molecular functions Cellular Component = location or complex  subcellular structures, locations, and macromolecular complexes; examples include nucleus, telomere, and RNA polymerase II holoenzyme GO represents three biological domains

Terms are defined graphically relative to other terms

7 Build and maintain logically rigorous and biologically accurate ontologies Comprehensively annotate reference genomes Support genome annotation projects for all organisms Freely provide ontologies, annotations and tools to the research community The Gene Ontology (GO) 1. Build and maintain logically rigorous and biologically accurate ontologies 2. Comprehensively annotate reference genomes 3. Support genome annotation projects for all organisms 4. Freely provide ontologies, annotations and tools to the research community

8 The GO is still developing daily both in ontological structures and in domain knowledge Ontology development workshops focus on specific domains needing revision and bring together ontology developers and domain experts Currently running ~2 workshops / year 1. Metabolism and cell cycle (Aug, 2004) 2. Immunology and defense response (Nov05, Apr06) 3. Early CNS development (June, 2006) 4. Peripheral nervous system development (Feb, 2007) 5. Blood Pressure Regulation (June, 2007) 6. Muscle Development (July, 2007) Building the ontologies

9 725 new terms related to immunology 127 new terms added to cell type ontology Building the ontology: Immune System Process Red part_of Blue is_a Alex Diehl

10 P05147 PMID: GO: IDA P05147GO: IDA PMID: GO Term Reference Evidence Annotating Gene Products using GO Gene Product

11 Annotations are assertions There is evidence that this gene product can be best classified using this term The source of the evidence and other information is included There is agreement on the meaning of the term

12 Annotations are the connections between genomic information and the GO. Experiments provide the data that enables us to annotate gene products with terms from the ontologies. Annotations for App: amyloid beta (A4) precursor proteinApp Annotations are assertions

13 NO Direct Experiment Inferred from evidence Direct Experiment in organism We use evidence codes to describe the basis of the annotation IDA: Inferred from direct assay IPI: Inferred from physical interaction IMP: Inferred from mutant phenotype IGI: Inferred from genetic interaction IEP: Inferred from expression pattern IEA: Inferred from electronic annotation ISS: Inferred from sequence or structural similarity TAS: Traceable author statement NAS: Non-traceable author statement IC: Inferred by curator RCA: Reviewed Computational Analysis ND: no data available

14 GO Annotation Stats: I GO Annotations Total manual GO annotations - 388,633 Total proteins with manual annotations – 80,402 Contributing Groups (including MGI): - 19 Total Pub Med References – 346,002 Total number predicted annotations – 17,029,553 Total number taxa – 129,318 Total number distinct proteins – 2,971,374 April 24, 2007

15 Now we can query across all annotations based on shared biological activity. Annotations of gene products to GO are genome specific

16 GO is a functional annotation system of great utility to the data-driven biologist

17 GO enables genomic data analysis Microarrays allow biologists to record changes in gene function across entire genomes Result: Vast amounts of gene expression data desperately needing cataloging and tagging Many data analysis tools use GO graph structure to statistically evaluate clusters of co-expressed genes based on shared functional annotations  680 pub (of 1517) on GO list  46 microarray tools contributed

18 OCT 13, 2006 Cancer Genome Projects GO supports functional classifications

19 GO is wildly successful FIGURE 3. Representative cell-type-specific genes and corresponding molecular functions. Nature: January 2007

20 Comprehensively annotate Reference Genomes Saccharomyces cerevisiae Schizosaccharomyces pombe Arabidopsis thaliana Human Mouse Fly Rat Chicken Zebrafish Worm Dicty E.coli

21 Priority genes: those implicated in human diseases Determine orthologs/homologs in reference genomes For these genes, comprehensively curate biomedical literature Reference Genome Annotation Project Mary Dolan

22 Shared annotation focus = Coordinated attention to ontology structure Orthology/homology set across primary model organisms Reference ID mappings including associations of sequences, gene/proteins, and human diseases Ultimately, transparent access to comprehensive information about genes among the primary data providers Reference Genome Development Projects

23 1. Verifying and maintaining domain representations in the ontology that reflect best knowledge of the real world. - Depends on the involvement of biologists (domain experts) - Difficult to automate - Must accommodate continuing changes in what we think we understand about biological systems 2. Providing comprehensive annotations, where experimental evidence is available, for all genes - Dependant on the quality of annotations from experimental literature - Combines manual curation by highly-trained scientists supplemented by computational inference prediction annotations - Comprehensiveness may depend on changes in biomedical publishing Ongoing Challenges for the GO Consortium

24 acknowledgements MGI Carol Bult Janan Eppig Jim Kadin Joel Richardson Martin Ringwald Lois Maltais TBK Reddy Monica McAndrews-Hill Nancy Butler GO Michael Ashburner (Cambridge) J. Michael Cherry (Stanford) Suzanna Lewis (LBNL) Rex Chisholm (NWU) David Hill (Jackson Lab) Midori Harris (EBI) Chris Mungall (LBNL) Jane Lomax (EBI) Eurie Hong (Stanford) Jen Clark (EBI) MGI Alex Diehl Mary Dolan Harold Drabkin David Hill Li Ni Dmitry Sitnikov

25 Gene Ontology MGI projects are supported by NIH [NHGRI, NICH, and NCI]. Bar Harbor, Maine, USA Mouse Genome Informatics GO Consortium is supported by NIH-NHGRI and by the European Union RTD Programme PRO is supported by NIGMS Corpora is supported by NLM