Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.

Slides:



Advertisements
Similar presentations
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
Advertisements

Developing an application ontology for biomedical resource annotation and retrieval: challenges and lessons learned C. Torniai, M. Brush, N. Vasilevsky,
More than one way to dissect an animal Melissa Haendel ZFIN Scientific Curator.
Representing the Immune Epitope Database in OWL Jason A. Greenbaum 1, Randi Vita 1, Laura Zarebski 1, Hussein Emami 2, Alessandro Sette 1, Alan Ruttenberg.
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Goal and Status of the OBO Foundry Barry Smith. 2 Semantic Web, Moby, wikis, crowd sourcing, NLP, etc.  let a million flowers (and weeds) bloom  to.
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Revising the Cell Ontology Terrence Meehan, Christopher Mungall, Alexander Diehl The Jackson Laboratory Lawrence Berkeley National Laboratory University.
Ontology Notes are from:
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
The Cell Line Ontology Sirarat Sarntivijai, Zuoshuang Xiang, Terrence F Meehan, Alexander D Diehl, Uma Vempati, Stephan Schurer, Chao Pang, James Malone,
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Abstract The Cell Ontology (CL) is a candidate OBO Foundry 1 ontology for the representation of in vivo cell types. As part of our work in redeveloping.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
Biological Ontologies Neocles Leontis April 20, 2005.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
How to Organize the World of Ontologies Barry Smith 1.
Comprehensive Annotation System for Infectious Disease Data Alexander Diehl University at Buffalo/The Jackson Laboratory IDO Workshop /9/2010.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
Building the Ontology Landscape for Cancer Big Data Research Barry Smith May 12, 2015.
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Ontology Development and Usage for Protozoan Parasite Research John A. Miller and Alok Dhamanaskar Collaborators: Michael E. Cotterell, Chaitanya Guttula,
The MGED Society Facilitating Data Sharing and Integration with Standards CTSA Omics Data Standards Working Group Chris Stoeckert Dept. of Genetics and.
Ontology of Sensors: Some Examples from Biology
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
OBI Tutorial Overview 8:30 AM Introduction To OBI – Presenter: Christian Stoeckert – Content: Overview of.
Gene Expression Data Annotation – an application of the cell type ontology Helen Parkinson, PhD 19 May 2010.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Leveraging Ontologies for Human Immunology Research Barry Smith, Alexander Diehl, Anna- Maria Masci Presented at Leveraging Standards and Ontologies to.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Integrating the Cell Cycle Ontology with the Mouse Genome Database David R. Smith Mary Dolan Dr. Judith Blake.
University of Michigan Medical School 1 Towards a Semantic Web application: Ontology-driven ortholog clustering analysis Yu Lin, Zuoshuang Xiang, Yongqun.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
RADical microarray data: standards, databases, and analysis Chris Stoeckert, Ph.D. University of Pennsylvania Yale Microarray Data Analysis Workshop December.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Protein Information Resource Protein Information Resource, 3300 Whitehaven St., Georgetown University, Washington, DC Contact
2 3 where in the body ? where in the cell ?
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
Mining the Biomedical Research Literature Ken Baclawski.
The Ontology for Biobanking Chris Stoeckert, Jie Zheng, and Mathias Brochhausen University of Pennsylvania and University of Arkansas for Medical Sciences.
Need for common standard upper ontology
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
Habitat-Lite & EnvO Jin Mao Postdoc, School of Information, University of Arizona Nov. 20, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Describing Bioinformatic Metadata at EBI James Malone
TRANSITION FROM BFO 1.1 TO BFO 2.0 (OWL FORMAT) Jie Zheng Department of Genetics University of Pennsylvania May 13 th, 2013.
Suggestions for Galaxy Workflow Design Using Semantically Annotated Services Alok Dhamanaskar, Michael E. Cotterell, Jessica C. Kissinger, and John Miller.
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Big Data that might benefit from ontology technology, but why this usually fails Barry Smith National Center for Ontological Research 1.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Of 24 lecture 11: ontology – mediation, merging & aligning.
The Cardiovascular Disease Ontology (CVDO) Mercedes Arguello Casteleiro 1, Julie Klein 2 and Robert Stevens 1 1 School of Computer Science, University.
Bio-ontologies SIG in conjunction with ISMB July Boston, USA
Exploiting semantic technologies to build an application ontology
Doron Goldfarb & Yann LE FRANC
Ontology of biomedical investigations (OBI)
OBO Foundry Update: April 2010
Service-enabling Biomedical Research Enterprise
Presentation transcript:

Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations Jie Zheng, Elisabetta Manduchi and Christian J. Stoeckert Jr Department of Genetics, Perelman School of Medicine, University of Pennsylvania ICBO July 2013, Montreal

Computational Biology and Informatics Laboratory Beta Cell Genomics Database A functional genomics resource focused on pancreatic beta cell research supporting a consortium of 62 investigators and their groups 128 studies (version 4.1) addressing the biology of beta cells, aspects of diabetes, and the production of functional beta cells from – embryonic stem cells – mature cells of other types such as exocrine cells

Computational Biology and Informatics Laboratory Desired Features of A Beta Cell Genomics Ontology Support semantic annotation of beta cell studies with enough granularity covering both biological and experimental aspects – Specimen characteristics, species, strain, anatomical entity, cell type, etc. – Assay, protocol, data analysis methods, etc. Enable queries of increasing complexity (competency questions) – Find gene expression data of endocrine cells – Find studies using cells which develop from either mesoderm or endoderm – Find high throughput sequencing gene expression data in samples obtained during the embryo stage from mouse strains with genetic background C57BL/6J Enable knowledge discovery based on computable definitions – Automated cell type classification based on cell phenotype/functions and/or genetic signatures using reasoners Leverages existing efforts covering the domains of investigations, cells, anatomy, proteins, and genes – OBO Foundry ontologies

Computational Biology and Informatics Laboratory OBO Foundry Reference Ontologies Shared common upper level ontology, Basic Formal Ontology (BFO) and common relations Orthogonal interoperable ontologies – reuse existing terms defined in OBO Foundry ontologies Each reference ontology covers a specific domain: – Cell type ontology (CL) : cell type – Gene ontology (GO): biological process, molecular function, cell components – Protein ontology (PR): protein (cross species) – Uber anatomy ontology (UBERON): cross-species anatomy – Ontology for biomedical investigations (OBI): all aspects of an experiments Facilitate ontology integration

Computational Biology and Informatics Laboratory Motivation for Developing An Application Ontology for Beta Cell Genomics Research No single OBO Foundry ontology can meet our needs No ontology available covers enough granularity needed by beta cell genomics research Typical use of disconnected multiple ontologies loses semantic power

Computational Biology and Informatics Laboratory Principles of Beta Cell Genomics Ontology (BCGO) Development Reuse terms existing in the OBO Foundry ontologies if possible Reuse existing ontology design patterns Use OBI as the ontology framework and integrate subsets of other OBO Foundry ontologies into it Enrich the ontology with additional axioms when needed

Computational Biology and Informatics Laboratory Ontology for Biomedical Investigations (OBI) Cover all aspects of an investigation Contains classes that connect OBI with other OBO Foundry reference ontologies, such as CL, UBERON, and GO, and serve as the parent of referenced external terms gross anatomical entity cellular_component molecular entity material entity specimen Cell cultured cell data transformation biological_process processassay data item measurement unit label information content entity protocol OBI UBERON GO CL CLO UO ChEBI... subClass of is a

Computational Biology and Informatics Laboratory Development of BCGO 1.Identification of terms defined in OBO Foundry Ontologies 2.Extraction of terms from OBO Foundry ontologies 3.Integration of terms from different OBO Foundry ontologies 4.Enrichment of BCGO by adding additional terms and axioms

Computational Biology and Informatics Laboratory Step 1: Identification of Terms Defined in OBO Foundry Ontologies 1.Draw terms from the MO to OBI mapping list – Beta Cell Genomics Database was annotated using multiple controlled vocabularies and ontologies including the MGED Ontology (MO) 2.Bioportal Annotation Tool – High accuracy (>95%) – May not include the latest version of ontologies 3.Bioportal Search Tool – Includes partial and exact matches of input text – Requires more manual review as compared to the Bioportal Annotation Tool

Computational Biology and Informatics Laboratory Most Terms Needed Could Be Matched to Small Subsets of Many Ontologies 852 terms used in the Beta Cell Genomics database 644 terms were matched to 543 ontology terms Mapped terms defined in 24 OBO Foundry ontologies including BFO and IAO *: application ontology BTO: BRENDA tissue / enzyme source CARO: Common Anatomy Reference Ontology EnVO: Environment Ontology ERO: eagle-i resource ontology FMA: Foundational Model of Anatomy GAZ: Gazetteer MP: Mammalian Phenotype OGMS: Ontology for General Medical Science RS: Rat Strain ontology SO: Sequence types and features SWO: Software Ontology EFO: Experimental Factor Ontology ChEBI: Chemical entities of biological interest CLO: cell line ontology NCBITaxon: NCBI organismal classification PR: protein ontology UO: Units of measurement PATO: Phenotypic quality

Computational Biology and Informatics Laboratory Step 2: Extraction of Terms from OBO Foundry Ontologies Ontodog tool: OBI subset extraction – Generates a community view including all related terms and axioms Reference: Zheng et al. International Conference on Biomedical Ontology (ICBO), Graz, Austria, July 2012 OntoFox tool for extracting terms from all other OBO Foundry ontologies – Option 1: MIREOT – Option 2: include minimal intermediate ontology terms – Option 3: all related terms and axioms Reference: Xiang et al. (2010) BMC Research Notes, 3:175

Computational Biology and Informatics Laboratory Extraction Option 1 Applied when five or less terms in an ontology were used by BCGO MIREOT: minimum information to reference an external ontology term Reference: Courtot et al. (2011) Applied Ontology, 6:23 – IRI of the term – IRI of the source ontology – IRI of the term parent in the target ontology – Can be done manually

Computational Biology and Informatics Laboratory Extraction Option 2 Keep hierarchical structure with minimal intermediates Example: reference human, mouse, rat in NCBITaxon … 14 intermediate classes MIREOT Include all intermediate classes Include computed intermediate classes Option 2

Computational Biology and Informatics Laboratory Extraction Option 3 Reuse logical axioms of terms defined in source ontologies Example – ontology design pattern of cell in CL Meehan et al. BMC Bioinformatics 2011, 12:6

Computational Biology and Informatics Laboratory Summary of Extraction Methods And Results

Computational Biology and Informatics Laboratory Step 3: Integration of Terms Extracted From Different OBO Ontologies (1) Import retrieved terms into OBI subset (BCGO community view) under corresponding parent classes ontology OntoFox output file subClass of is a gross anatomical entity cellular_component molecular entity material entity specimen Cell cultured cell data transformation biological_process processassay data item measurement unit label information content entity protocol Beta Cell Genomics view of OBI subset of UBERON subset of GO subset of CL subset of CLO subset of UO subset of ChEBI... terms of interest In other OBO Foundry ontologies Subset of OBI - Using OWL:imports - Keep retrieved terms belong to same source ontology in one OWL file - Contains 2389 classes

Computational Biology and Informatics Laboratory Step 3: Integration of Terms Extracted From Different OBO Ontologies (2) To avoid inconsistencies caused by integrating terms from different paths we remove textual and logical definitions of terms referenced to external ontologies PATO terms retrieved from OBI PATO deprecated Removal of definitions of PATO terms in retrieved OBI subset Retrieval of definitions from PATO

Computational Biology and Informatics Laboratory Summary of Extraction Methods And Results

Computational Biology and Informatics Laboratory Step 4: Enrichment of BCGO 208 terms that could not be matched to OBO Foundry ontologies 42 new terms have been added into BCGO Example – ‘insulin-expressing mature beta cell’ Meehan et al. BMC Bioinformatics 2011, 12:6 insulin-expressing mature beta cell mature insulin islet of Langerhans insulin secretion detection of glucose type B pancreatic cell insulin secretion islet of Langerhans

Computational Biology and Informatics Laboratory Ontology Validation Annotation: 83% terms covered by BCGO Competency questions can be answered: Find gene expression data of endocrine cells Find studies using cells which develop from either mesoderm or endoderm Find high throughput sequencing gene expression data in samples obtained during the embryo stage from mouse strains with genetic background C57BL/6J Automated cell type classification: ongoing

Computational Biology and Informatics Laboratory Challenges OBO Foundry ontologies use different versions of upper level ontology – BFO Inconsistent representation of the same entities in different OBO Foundry ontologies – Example, ‘cell line cell’, alignment work has been done by CL, CLO and OBI developers – Resolution: Alignment work presented in the ICBO poster session with title ‘Alignment of Cultured Cell Modeling Across OBO Foundry Ontologies: Key Outcomes and Insights’ by Dr. Matthew Brush

Computational Biology and Informatics Laboratory Summary BCGO is available on: All related documents are available on: Development of a cross-domain application ontology – based on the OBI framework – reuse existent reference ontologies and ontology design patterns The approach should be generally applicable when using interoperable source ontologies Orthogonal interoperable OBO Foundry ontologies facilitate ontology integration

Computational Biology and Informatics Laboratory Acknowledgements Emily Greenfest-Allen Matthew Brush And OBI, CLO, CL developers Oliver He and Allen Xiang NIH grant 1R01GM and by 5 U01 DK

Computational Biology and Informatics Laboratory Questions?

Computational Biology and Informatics Laboratory Advantages Of Using OntoFox Provide many different options for ontology terms extractions Backend RDF store contains all OBO Foundry ontologies and reload daily if updated Input settings can be saved as a text format file and can be reused