Relating Animal Model Phenotypes to Human Disease Genes Project Goals: To develop methods and syntax for describing phenotypes using ontologies To compare.

Slides:



Advertisements
Similar presentations
Mouse Phenotype Ontology George Gkoutos. Phenotype Annotation Traditional phenotypic descriptions are captures as free text Information retrieval based.
Advertisements

Homology.
More than one way to dissect an animal Melissa Haendel ZFIN Scientific Curator.
Linking Animal Models to Human Diseases Supported by NIH P41 HG and U54 HG the University of Oregon, Eugene, OR.
Linking ontologies to one another and to the Cell Ontology with the COBrA ontology editor Jonathan Bard & Stuart Aitken Biomedical Science & Informatics.
Homology Review Human arm Lobed-fin fish fin Bat wing Bird wing Insect wing Homologous forelimbs not homologous as forelimbs or wings Definition: Structures.
The problem How to integrate the massive amounts of data on Drosophila neurobiology to explore anatomy, formulate hypotheses and find reagents?
Ontologies in the Fish Tank: Using the Zebrafish Anatomy Ontology with other OBO Ontologies to Annotate Expression and Phenotype Yvonne Bradford*, Ceri.
Linking Animal Models to Human Diseases Supported by NIH P41 HG and U54 HG the University of Oregon, Eugene, OR
PATO An Ontology of Phenotypic Qualities
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Iowa State University Animal Science Department Bioinformatics & Computational Biology Program - 01/16/06 1 Overview of Animal Trait Ontology and PATO.
Multiscale Information Modelling for Heart Morphogenesis Tariq Abdulla 13 th IMEKO TC1-TC7 Joint Symposium 02/09/2010.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Developmental Neurobiology Fall 2005 Credit: Two Time: 1:30 PM-3:30 PM, Tuesday Place: 316, 3rd Floor, Nursing Building Instructors: 范明基 ( ), 簡正鼎.
PATO & Phenotypes: From model organisms to clinical medicine Suzanna Lewis September 4th, 2008 Signs, Symptoms and Findings Workshop First Steps Toward.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Comparative Genomics of the Eukaryotes
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
The Plant Ontology: Linking Phenotypes and Genomics Across Plant Taxa Laurel D. Cooper* 1, Ramona L. Walls 2, Justin Elser 1, Justin Preece 1, Dennis W.
July 2015 CSHL Data analysis: GO tools and YeastMine, use-case examples.
Phenoscape: Connectin g evolutionary phenotypes to genes Paula Mabee Hilmar Lapp, Todd Vision, Monte Westerfield Jim Balhoff, Wasila Dahdul, Peter Midford.
The National Center for Biomedical Ontology Stanford – Berkeley Mayo – Victoria – Buffalo UCSF – Oregon – Cambridge.
An (OBO) ontology is NOT a model of language, it is a model of reality. Words are ambiguous – especially in isolation. Take the word 'wing' what type of.
CANDID: A candidate gene identification tool Janna Hutz March 19, 2007.
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
Inferring Function From Known Genes Naomi Altman Nov. 06.
$100 $400 $300 $200 $400 $200 $100$100 $400 $200$200 $500$500 $300 $200 $500 $100 $300 $100 $300 $500 $300 $400$400 $500.
Cell Ontology 2.0 Elimination of multiple is_a inheritance through instantiation of relationships to terms in outside ontologies, such as the GO cellular.
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
Phenote: new developments and new communities. A basic screen shot Entries are sorted by 'entity'
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Sunday, July 22, 2012 Plan Areas of coverage: high-level neurological system process, inc. sensory perception, sensory processing, cognition transmission.
The “über-ontology” (Uberon) Melissa Häendel, Chris Müngall, George Gkoütos Cell Ontology Workshop May, 2010.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
Copyright OpenHelix. No use or reproduction without express written consent1.
Phenote Mark Gibson Berkeley Bioinformatics and Ontology Project (BBOP) National Center for Biomedical Ontologies(NCBO) Lawrence Berkeley National Lab.
P HYLO P AT : AN UPDATED VERSION OF THE PHYLOGENETIC PATTERN DATABASE CONTAINS GENE NEIGHBORHOOD Presenter: Reihaneh Rabbany Presented in Bioinformatics.
About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,
This tutorial will describe how to navigate the section of Gramene that provides descriptions of alleles associated with morphological, developmental,
Expanding species-specific anatomy ontologies to include the cell ontology Melissa Haendel (1), Ceri Van Slyke (1), Chris Mungall (2), Peiran Song (1),
University of California, San Diego Ontology-based annotation of multiscale imaging data: Utilizing and building the Neuroscience Information Framework.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
Phenote Mark Gibson Berkeley Bioinformatics and Ontology Project (BBOP) National Center for Biomedical Ontologies(NCBO) Lawrence Berkeley National Lab.
OBO Foundry Workshop 2009 Cell Ontology (CL) Preliminary review.
Phenotype And Trait Ontology (PATO) and plant phenotypes
Gene Ontology Consortium The Pathogen Group Schizosaccharomyces pombe Genome Sequencing Project DictyBase.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 of 28 Evaluating Genes and Transcripts (“Genebuild”)
Anatomy Ontologies & Potential Users: Bridging the Gap Ravensara Travillian European Bioinformatics Institute
Biocomputational Languages December 1, 2011 Greg Antell & Khoa Nguyen.
The Vertebrate Bridging Ontology (VBO) Ravensara Travillian, James Malone, Chao Pang, John Hancock, Peter W.H. Holland, Paul Schofield, and Helen Parkinson.
Linking Animal Models and Human Diseases
A statistical method for comparing phenotypes in the OBD
Hub Updates for Year 3 Carl Kesselman.
Phenote Mark Gibson Berkeley Bioinformatics and Ontology Project (BBOP) National Center for Biomedical Ontologies(NCBO) Lawrence Berkeley National Lab.
The Teleost Anatomy Ontology: computable evolutionary morphology for teleost fishes Wasila Dahdul University of South Dakota & National Evolutionary Synthesis.
The Common Anatomy Reference Ontology (CARO) and queries across species Melissa Haendel ZFIN.
The Human-Mouse: Disease Connection in MGI (BETA)
Phenoscape Data Jamboree 2
Mental Functioning and the Gene Ontology
Java-based curation tool with a spreadsheet-like interface
Basic Local Alignment Search Tool (BLAST)
Presentation transcript:

Relating Animal Model Phenotypes to Human Disease Genes Project Goals: To develop methods and syntax for describing phenotypes using ontologies To compare consistency among annotators using these methods To query for similar phenotypes within and across species To provide use cases and feedback for tool development Driving Biological Project Update, May 2008

Yvonne Bradford Melissa Haendel Kevin Schaper Erik Segerdell Amy Singer Sierra Taylor Michael Ashburner Rachel Drysdale George Gkoutos David Sutherland Mark Gibson Suzi Lewis Chris Mungall Nicole Washington BBOP

HumansAnimal models Mutant Gene Mutant or missing Protein Mutant Phenotype (disease) Mutant Gene Mutant or missing Protein Mutant Phenotype (disease model) Currently there is no easy way to connect mutant phenotypes to candidate human disease genes

HumansAnimal models Mutant Gene Mutant or missing Protein Mutant Phenotype (disease) Mutant Gene Mutant or missing Protein Mutant Phenotype (disease model) Sequence analysis (BLAST) can connect animal genes to human genes

HumansAnimal models Mutant Gene Mutant or missing Protein Mutant Phenotype (disease) Mutant Gene Mutant or missing Protein Mutant Phenotype (disease model) Shared ontologies and syntax can connect mutant phenotypes to candidate human disease genes

Information retrieval is not straightforward. OMIM Query# of records Large bones251 Large bone713 Enlarged bones75 Enlarged bone136 Big bone16 Huge bone4 Massive bone28 Hyperplastic bones8 Hyperplastic bone34 Bone hyperplasia122 Increased bone growth543 Disease descriptions in text-based resources

Relating Animal Model Phenotypes to Human Disease Genes Project Goals: To develop methods and syntax for describing phenotypes using ontologies To compare consistency among annotators using these methods To query for similar phenotypes within and across species To provide use cases and feedback for tool development Driving Biological Project Update, May 2008

Annotate phenotypes of orthologous genes in human, zebrafish, and Drosophila using PATO and EQ syntax Triple blind annotation of human phenotypes Compare annotations + + ZFIN BBOP FlyBase OBD

Curate data from zebrafish publications for mutant and morphant phenotypes into ZFIN using EQ syntax Zebrafish Curate data from OMIM entries for gene and related diseases into PBD using EQ syntax Results: Number of annotations added to OBD Human

Curate data from zebrafish publications for mutant and morphant phenotypes into ZFIN and OBD using EQ syntax 4,355 genes and genotypes into OBD 17,782 EQ annotations into OBD Zebrafish Curate data from OMIM entries for gene and related diseases into OBD using EQ syntax 10 genes 677 EQ annotations from ZFIN 314 EQ annotations from Flybase 507 EQ annotations from BBOP Results: Number of annotations added to OBD Human

Tests of the method 1.How consistently do curators use the ontologies and EQ syntax? 2.Can the phenotype annotations for one mutation be used to retrieve annotations to another allele of the same gene? 3.Given a human phenotype, can we retrieve similar phenotypes from mutations in model organisms? Are these mutations in homologous genes? 4.Given a model organism phenotype, can we find other known (or unknown) pathway members with similar phenotypes? 5.Do zebrafish paralogs have phenotypes that are complementary to their mammalian ortholog?

~10% of the EQ statements for 1 gene

We need a quantitative method to calculate the similarity of annotations. ~10% of the EQ statements for 1 gene

E = facial ganglion + Q = morphology Query: cranial ganglion ganglion shape epibranchial ganglion is_a influences Gene variant Similar annotations have the same or more general entity types is_a convexfolded structuresize wavy is_a Similar annotations have the same or more specific PATO qualities Similarity of phenotype annotations is calculated by reasoning across the ontologies

Similarity between two genotypes is weighted by: the total number and rarity of similar annotations, and the degree of relatedness within each ontology. E = facial ganglion + Q = morphology Query: cranial ganglion ganglion shape epibranchial ganglion is_a influences Gene variant Similar annotations have the same or more general entity types is_a convexfolded structuresize wavy is_a Similar annotations have the same or more specific PATO qualities Similarity of phenotype annotations is calculated by reasoning across the ontologies

At the intersection of different curators’ term choices, colors merge, with white indicating consensus among all 3. Curator 1Curator 2Curator 3 How consistently do curators use the ontologies and EQ syntax?

subsumption GeneGenotypeQuality Anatomical Entity Morphology Organ Shape Part of organ Thickness Cranial nerve Increased thickness Vestibulo- cochlear nerve Increased thickness Trigeminal nerve Perforated Wall of heart Deformed Retina GeneAA A A Gene BB B B p value e-43.2e-56.1e-62.0e-71.9e-6 Unlikely to match by chance Subsumption in similarity scoring

EYA1SOX10SOX9PAX # Annotations Average annotation consistency among curators congruence total annotations similar annotations We can quantitate similarity and, thus, optimize consistency.

Results: best practices for phenotype curation to ensure curator consistency 1.Use the same set of ontologies 2.Use the same ID format 3.Use the same Phenote configuration 4.Constrain post-composition of entity terms to the same type and relation 5.Annotate both the anatomical entity and the process where applicable 6.Annotate to a more general term in the PATO hierarchy when the correct term is unavailable, rather than not making an annotation 7.For OMIM, annotate both the general description as well as specific alleles

Relating Animal Model Phenotypes to Human Disease Genes Project Goals: To develop methods and syntax for describing phenotypes using ontologies To compare consistency among annotators using these methods To query for similar phenotypes within and across species To provide use cases and feedback for tool development Driving Biological Project Update, May 2008

Can the phenotype annotations for one mutation be used to retrieve annotations to another allele of the same gene? Example: A search for phenotypes similar to each human EYA1 allele returns other human EYA1 alleles (The smaller the number, the more similar) Allele number

Given a human phenotype, can we retrieve similar phenotypes from model organisms? Are these due to mutations in orthologous genes? A query for phenotypes similar to: Human EYA1 variant OMIM: MP:deafness = E = Sensory perception of sound Q = absent returns: Mouse Eya1 bor/bor and Eya1 tm1Rilm/tm1Rilm E = Sensory perception of sound Q = decreased

Given a human phenotype, can we retrieve similar phenotypes from model organisms? Are these due to mutations in orthologous genes? A query for phenotypes similar to: Human EYA1 variant OMIM: MP:deafness = E = Sensory perception of sound Q = absent returns: Mouse Eya1 bor/bor and Eya1 tm1Rilm/tm1Rilm E = Sensory perception of sound Q = decreased These similarities are based on the same GO term entity. Anatomical cross-species queries require classification of anatomical structures, in the different species, based on function and/or homology.

inheres_in OMIM: nn EYA1 variant human:EYA1 PATO:decreased thickness Homologene: EYA1 FMA:cochlear nerve inheres_in MP:abnormal vestibulocochlear nerve morphology mouse:EYA1 zebrafish:eya1 ZFA:cranial nerve VIII ZFIN:geno-nnn Eya1 variant nn MGI:nnn eya1 variant nn PATO:decreased thickness NCBO ZFIN MGI Currently, different terms are often used to describe anatomy in different species

inheres_in OMIM: nn EYA1 variant human:EYA1 PATO:decreased thickness FMA:vestibulocochlear nerve Homologene: EYA1 FMA:cochlear nerve inheres_in MP:abnormal vestibulocochlear nerve morphology mouse:EYA1 zebrafish:eya1 ZFA:cranial nerve VIII MA:vestibulocochlear VIII nerve ZFIN:geno-nnn Eya1 variant nn MGI:nnn eya1 variant nn PATO:morphology = PATO:decreased thickness NCBO ZFIN MGI homology annotations mammalian phenotype decomposed into EQ Future queries across species will utilize homology annotations

Human and zebrafish SOX9 annotations are similar Human, SOX9 (Campomelic dysplasia) Zebrafish, sox9a (jellyfish) Scapula: hypoplasticScapulocorocoid: aplastic Lower jaw: decreased size Cranial cartilage: hypoplastic Heart: malformed or edematousHeart: edematous Phalanges: decreased lengthPectoral fin: decreased length Long bones: bowedCartilage development: disrupted

Human, SOX9 (Campomelic dysplasia) Zebrafish, sox9a (jellyfish) Scapula: hypoplasticScapulocorocoid: aplastic Lower jaw: decreased size Cranial cartilage: hypoplastic Heart: malformed or edematousHeart: edematous Phalanges: decreased lengthPectoral fin: decreased length Long bones: bowedCartilage development: disrupted Curation of mutant phenotypes and human diseases using common ontologies & syntax can provide candidate genes and animal models of disease Human and zebrafish SOX9 annotations are similar

Similarity search for zebrafish shha t4/t4 identifies pathway members GenotypeCongruence# of allelesFunction disp1 ty60 9.6E-196regulates long range Shh signaling gli1 ts E-132downstream transcriptional repressor wnt5b te1c 4.3E-115downstream target gene smo hi1640Tg 4.1E-114membrane protein mediates Shh intracellular signaling pathway hhip hu540a 3.1E-52binds Shh in membrane Gli1 Disp1 Given a model organism phenotype, can we find other known (or unknown) pathway members based on similar phenotypes?

Do zebrafish paralogs have phenotypes that are complementary to their mammalian ortholog? somite neural plate nervous system motor axons peripheral ectoderm development Ortholog = Paralog aParalog b+ ? The paralog phenotypes are somewhat complementary, and somewhat overlapping. Mouse Notch1 -/- Zebrafish notch1a -/- Zebrafish notch1b MO somite formationsomite neural tubeneural plate nervous system motor axons peripheral ectoderm development myocardium pericardial sacs cardiac muscle angiogenesisblood vessel morphogenesis intersegmental vessel endothelial cell migration notochord

Relating Animal Model Phenotypes to Human Disease Genes Project Goals: To develop methods and syntax for describing phenotypes using ontologies To compare consistency among annotators using these methods To query for similar phenotypes within and across species To provide use cases and feedback for tool development Driving Biological Project Update, May 2008

Unanticipated outcomes for ZFIN: Phenote is now used by ZFIN collaborators to submit data Phenote checks versioning of the ontologies, eliminating the need to update templates for researchers Phenote has a familiar spreadsheet feel Phenote supports reading and writing in multiple file formats Phenote provided template for ZFIN curator interface development Results of tool development