Phenoscape: Connecting evolutionary phenotypes to genes. Paula Mabee, Hilmar Lapp, Todd Vision, Monte Westerfield, Jim Balhoff, Wasila Dahdul, Peter Midford.


Phenoscape: Connecting evolutionary phenotypes to genes. Paula Mabee, Hilmar Lapp, Todd Vision, Monte Westerfield, Jim Balhoff, Wasila Dahdul, Peter Midford. kb.phenoscape.org

Difficult to synthesize morphology across studies

Difficult to relate to genetics & development. Species: Cyprinus carpio, Pangio anguillaris, Nemacheilus fasciatus, Catostomus commersoni, Gyrinocheilus aymonieri, Phenacogrammus interruptus

Phenotypic differences have their basis in changes in genetic control over development

Development understood from study of model organisms Chen & Mayden, 2010

Zebrafish (ntla). Model for vertebrate development. Mutagenesis/gene knock-down. Mutant phenotypes, genes. ZFIN resource.

Problem: Species phenotypes, genes at evolutionary scale. Pimelodus maculatus (photo: J. Lundberg, ANSP 2002)

Zebrafish -- Human: Conservation of gene sequence & function and phenotype. slc24a5 is involved in pigmentation similarity between fish and humans (Lamason et al., 2005).

Fig. 1, Washington et al., 2010 Translational medicine

Translational biodiversity? Catfish: Ameiurus nebulosus, Trogloglanis pattersoni. EQ: eye absent; gene? (Fig. 1, Washington et al., 2010)

Phenoscape (kb.phenoscape.org), 2007
Goal: To prototype a curated, ontology-based evolutionary phenotype database that maps to genetic databases
Devo-evo synthesis; candidate gene discovery
Enable data mining and discovery for broad-scale evolutionary patterns
Ostariophysan fishes (ToL web; 25 July 2009)

User needs drove KB development:
Search for candidate genes underlying evolutionary morphology
Search for taxa with particular morphologies
Aggregate morphological data across studies

Requirements (generic): 1. Ontologies 2. Curation 3. Database

1. Ontology: terms and relationships
Terms: replacement bone, basihyal bone, basihyal cartilage, basihyal element, ventral hyoid arch, pharyngeal arch cartilage
Relationships: is_a, part_of, develops_from
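The idea of an ontology as terms connected by typed relationships can be sketched as a small graph. This is a minimal illustration, not the Teleost Anatomy Ontology itself: the term and relation names come from the slide, but which relation connects which pair of terms is an assumption, because the diagram did not survive in the transcript. The helper names `build_index` and `ancestors` are hypothetical.

```python
# Edges as (subject, relation, object). The pairing of terms with relations is
# an ASSUMPTION reconstructed from the slide's term list, not a TAO export.
EDGES = [
    ("basihyal bone", "is_a", "replacement bone"),
    ("basihyal bone", "is_a", "basihyal element"),
    ("basihyal bone", "part_of", "ventral hyoid arch"),
    ("basihyal bone", "develops_from", "basihyal cartilage"),
    ("basihyal cartilage", "is_a", "pharyngeal arch cartilage"),
    ("basihyal cartilage", "is_a", "basihyal element"),
    ("basihyal cartilage", "part_of", "ventral hyoid arch"),
]


def build_index(edges):
    """Group edges as {subject: {relation: {objects}}} for fast lookup."""
    index = {}
    for subj, rel, obj in edges:
        index.setdefault(subj, {}).setdefault(rel, set()).add(obj)
    return index


def ancestors(index, term, relation="is_a"):
    """Return every term reachable from `term` by following `relation` transitively."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for parent in index.get(current, {}).get(relation, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


INDEX = build_index(EDGES)
```

Calling `ancestors(INDEX, "basihyal bone")` walks the is_a edges, and passing `"develops_from"` or `"part_of"` as the third argument follows a different relation over the same graph.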

Teleost Anatomy Ontology Dahdul, W. M., J. G. Lundberg, P. E. Midford, J. P. Balhoff, H. Lapp, T. J. Vision, M. A. Haendel, M. Westerfield, and P. M. Mabee. The Teleost Anatomy Ontology: Anatomical representation for the genomics age. Systematic Biology (Cover art: Kyle Luckenbill)

Ontologies used by Phenoscape:
Zebrafish Anatomy Ontology (2196 terms; 310 skeletal)
Teleost Taxonomy Ontology (36,060 terms; 38,000 synonyms)
Taxonomic Rank Ontology (8->31 terms)
Teleost Anatomy Ontology (2371 terms; 618 skeletal)
Phenotype and Trait Ontology (PATO) (1,075 terms)
Spatial Ontology (106 terms)
Evidence Code Ontology
6 Jan 2009

2. Curation (Dahdul et al., 2010 PLoS ONE)
Curated 4,732 characters in 2,474 species from 52 papers
501,862 taxon phenotype annotations
From ZFIN: 21,829 phenotype annotations about 3,893 genes
Workflow:
1. Students: gather publications (scan hard copies, produce OCR PDFs)
2. Students: Manual entry of free-text character descriptions, matrix, taxon list, specimens and museum numbers using Phenex
3. Character annotation by experts: Entry of phenotypes using Phenex
4. Phenoscape Knowledgebase: OBD, data services, web application
Curators: Wasila Dahdul, Miles Coburn, Jeff Engemen, Terry Grande, Eric Hilton, John Lundberg, Paula Mabee, Richard Mayden, Mark Sabaj Pérez
~5 person years

5K Karacter Challenge Ontology boot camp, NESCent 2005 Phenoscape Data Roundup photo: NESCent photo: Monte Westerfield Buffalo Roundup, SD

Entity-Quality Model for Taxon Phenotypes
Character: ethmoid plate form -> Entity (TAO): ethmoid cartilage
State: rounded -> Quality (PATO): round
Entity + Quality = Phenotype
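The Entity-Quality model can be sketched as a small data structure: a free-text character and state are mapped to an anatomy term and a quality term, and the pair is the phenotype. This is only an illustration under stated assumptions. In the project the mapping is made by expert curators using Phenex, not by string lookup; the `ENTITY_TERMS` and `QUALITY_TERMS` tables and the `annotate` helper are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical lookup tables standing in for a curator's judgment.
ENTITY_TERMS = {"ethmoid plate": "ethmoid cartilage"}   # free text -> TAO term
QUALITY_TERMS = {"rounded": "round"}                    # free text -> PATO term


@dataclass(frozen=True)
class EQPhenotype:
    entity: str   # anatomy ontology term (TAO)
    quality: str  # quality ontology term (PATO)

    def expression(self) -> str:
        """Render the phenotype in the 'Q that inheres_in some E' form used on the slides."""
        return f"{self.quality} that inheres_in some {self.entity}"


def annotate(character_text: str, state_text: str) -> EQPhenotype:
    """Map a free-text character and state to an EQ phenotype (toy lookup)."""
    for phrase, term in ENTITY_TERMS.items():
        if phrase in character_text:
            return EQPhenotype(entity=term, quality=QUALITY_TERMS[state_text])
    raise KeyError(f"no entity term known for character: {character_text!r}")
```

For the slide's example, `annotate("ethmoid plate form", "rounded").expression()` yields the class expression that appears in the taxon phenotype annotations.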

Taxon phenotype annotations
Brachyplatystoma capapretum exhibits some (round that inheres_in some ethmoid cartilage)
Brachyplatystoma capapretum: Taxon ontology term
ethmoid cartilage: Anatomy ontology term
round: Phenotypic Quality ontology term
inheres_in: Links a quality to the entity that is its bearer

Gene phenotype annotations
Import from ZFIN data in EQ format
tfap2a ts213/ts213 influences some (split that inheres_in some ethmoid cartilage)

Brachyplatystoma capapretum exhibits some (round that inheres_in some ethmoid cartilage)
tfap2a ts213/ts213 influences some (split that inheres_in some ethmoid cartilage)
Graph terms: round, split, shape, ethmoid cartilage, cartilage, chondrocranium, olfactory region, Brachyplatystoma, Pimelodidae, tfap2a, sequence-specific DNA binding transcription factor activity
Graph relations: is_a, part_of, inheres_in, variant_of, has_function
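Because the taxon annotation and the gene annotation share an entity, and both qualities are kinds of shape, a query over the shared ontology terms can bring them together. The sketch below is a toy version of that reasoning step, not the knowledgebase's implementation. The `Annotation` class, the `QUALITY_IS_A` table, and the `affecting` function are hypothetical names; the two annotations and the "round is_a shape" / "split is_a shape" links are read from the slide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    subject: str   # taxon or genotype
    relation: str  # "exhibits" for taxa, "influences" for genes
    quality: str   # PATO term
    entity: str    # anatomy term


# One-level is_a table for qualities (assumed to be enough for this example).
QUALITY_IS_A = {"round": "shape", "split": "shape"}

ANNOTATIONS = [
    Annotation("Brachyplatystoma capapretum", "exhibits", "round", "ethmoid cartilage"),
    Annotation("tfap2a ts213/ts213", "influences", "split", "ethmoid cartilage"),
]


def affecting(annotations, entity, quality_class):
    """Return annotations on `entity` whose quality is `quality_class` or a direct subclass of it."""
    return [
        a for a in annotations
        if a.entity == entity
        and (a.quality == quality_class or QUALITY_IS_A.get(a.quality) == quality_class)
    ]
```

Querying `affecting(ANNOTATIONS, "ethmoid cartilage", "shape")` returns both the taxon and the mutant genotype, while asking for the more specific `"round"` returns only the taxon.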

Knowledgebase architecture

Phenoscape Knowledgebase (kb.phenoscape.org)
501,862 taxon phenotypes
21,829 gene phenotypes for 3,893 genes
Iterative user testing of interface

KB inferred candidate genes
Phenotypic profile for Siluriformes includes: Scales absent; Basihyal absent

[Chart: PATO qualities used in phenotypes. Categories: position, composition, count, shape, structure, size]

Siluriform synapomorphies: Scales absent; Basihyal absent
Ictalurus punctatus (Copyright © Jean Ricardo Simões Vitule, All Rights Reserved)
Ictalurus punctatus (Photo: Richard Edmunds)

Zebrafish mutant phenotypes
eda: scale loss (Harris et al., 2007)
brpf1: basihyal loss (Laue et al., 2008)

In silico prediction of candidate genes
Scales absent: eda?
Basihyal absent: brpf1?
Ictalurus punctatus (Copyright © Jean Ricardo Simões Vitule, All Rights Reserved)
Ictalurus punctatus (Photo: Richard Edmunds)
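The candidate-gene step amounts to matching a taxon's phenotypic profile against zebrafish mutant phenotypes that share the same entity and quality. The sketch below shows that matching on the two examples from the slides. It rests on assumptions: the data tables are hand-typed from the slides, "scale loss" and "basihyal loss" are encoded as the quality "absent", and `candidate_genes` is a hypothetical function, not the knowledgebase's query interface.

```python
# Phenotypes as (entity, quality) pairs, hand-typed from the slides.
TAXON_PHENOTYPES = {
    "Siluriformes": [("scale", "absent"), ("basihyal", "absent")],
}

# Zebrafish mutant phenotypes; encoding "loss" as "absent" is an ASSUMPTION.
GENE_PHENOTYPES = {
    "eda": [("scale", "absent")],
    "brpf1": [("basihyal", "absent")],
}


def candidate_genes(taxon):
    """Map each phenotype of `taxon` to the genes whose mutants show the same phenotype."""
    result = {}
    for phenotype in TAXON_PHENOTYPES.get(taxon, []):
        genes = sorted(
            gene for gene, phenos in GENE_PHENOTYPES.items() if phenotype in phenos
        )
        if genes:
            result[phenotype] = genes
    return result
```

For `"Siluriformes"` this proposes eda for absent scales and brpf1 for the absent basihyal. These are hypotheses to be tested in the wet lab, not conclusions.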

Wet lab test (Richard Edmunds) Lack of eda expression in the epidermis supports Phenoscape KB hypothesis Ictalurus punctatus eda

Wet lab test (Richard Edmunds): Lack of brpf1 expression in the basihyal supports Phenoscape KB hypothesis. Ictalurus punctatus, 78 hpf and 86 hpf

E.g., distribution of all characters across anatomical systems in taxa 25 July 2008

E.g., distribution of skeletal characters in broad regions across taxa Image from Sabaj-Perez 25 July 2008

Summary
Semantic framework and reasoning tools provide:
Powerful queries not previously possible for evolutionary phenotype data
Meaningful integration with model organism phenotypic and genetic data
Makes data accessible for a broad group of researchers and creates opportunities for new and synthetic research

Phenoscape as a resource
Ontologies (anatomy, quality, taxon) + EQ data for any taxon
Reasoning across EQ data types, uniting multiple studies, genes, etc.
Your data?

Acknowledgments
Phenoscape Personnel & PIs: P. Mabee, M. Westerfield, T. Vision, H. Lapp, C. Kothari, W. Dahdul, P. Midford, R. Edmunds
Phenoscape curators & workshop participants
Berkeley Bioinformatics & Ontologies Project (BBOP): C. Mungall, S. Lewis
National Evolutionary Synthesis Center (NESCent)
NSF (DBI )