Developing an application ontology for biomedical resource annotation and retrieval: challenges and lessons learned C. Torniai, M. Brush, N. Vasilevsky,

Slides:



Advertisements
Similar presentations
BioPortal Status and Plans September 2011 Ray Fergerson NCBO Project Director Stanford University 1.
Advertisements

1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
1 Ontolog Open Ontology Repository Review 19 February 2009.
Upper Ontology Summit Tuesday March 14 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center.
Improving Learning Object Description Mechanisms to Support an Integrated Framework for Ubiquitous Learning Scenarios María Felisa Verdejo Carlos Celorrio.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
From Ontology Design to Deployment Semantic Application Development with TopBraid Holger Knublauch
Upper Ontology Summit Wednesday March 15 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.
Looking ahead: caGrid community requirements in the context of caGrid 2.0 Lawrence Brem 7 February 2011.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Ontology Notes are from:
L4-1-S1 UML Overview © M.E. Fayad SJSU -- CmpE Software Architectures Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I.
How can Computer Science contribute to Research Publishing?
How to Organize the World of Ontologies Barry Smith 1.
1 Objective of today’s lesson S oftware engineering occurs as a consequence of a process called system engineering. Instead of concentrating solely on.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Course Instructor: Aisha Azeem
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Architectural Design.
UML - Development Process 1 Software Development Process Using UML (2)
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Chapter 6 System Engineering - Computer-based system - System engineering process - “Business process” engineering - Product engineering (Source: Pressman,
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
An Introduction to Software Architecture
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Save time. Reduce costs. Find and reuse interoperability solutions on Joinup for developing European public services Nikolaos Loutas
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Architecture for a Database System
Emerging Semantic Web Commercialization Opportunities Ken Baclawski Northeastern University.
The Chronious Ontology Suite: Methodology and Design Principles Luc Schneider[1], Mathias Brochhausen[1,2] [1] Institute for Formal Ontology and Medical.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Systems Analysis and Design in a Changing World, 3rd Edition
Open Terminology Portal (TOP) Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer Institute, Center for Biomedical Informatics.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
GREGORY SILVER KUSHEL RIA BELLPADY JOHN MILLER KRYS KOCHUT WILLIAM YORK Supporting Interoperability Using the Discrete-event Modeling Ontology (DeMO)
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Core 2: Bioinformatics NCBO-Berkeley. Core 2 Specific Aims 1.Apply ontologies  Software toolkit for describing and classifying data 2.Capture, manage,
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
PRO and the NIF / ImmPort Antibody Registries Alexander Diehl Protein Ontology Workshop 6/18/14.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
APPLICATION OF ONTOLOGIES IN CANCER NANOTECHNOLOGY RESEARCH Faculty of Engineering in Foreign Languages 1 Student: Andreea Buga Group: 1241E – FILS Coordinating.
Software Engineering Lecture 10: System Engineering.
Big Data that might benefit from ontology technology, but why this usually fails Barry Smith National Center for Ontological Research 1.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
Upper Ontology Summit The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center for Ontological Research National.
Semantic Media Wiki Open Terminology Development - Initial Steps - Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer.
CCNT Lab of Zhejiang University
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Outline Pursue Interoperability: Digital Libraries
OBO Foundry Principles
An ecosystem of contributions
OBI – Standard Semantic
NCBO CTSA workshop, Baltimore
An ontology for e-Research
An Introduction to Software Architecture
Chapter 6: Architectural Design
Presentation transcript:

Developing an application ontology for biomedical resource annotation and retrieval: challenges and lessons learned C. Torniai, M. Brush, N. Vasilevsky, E. Segerdell, M. Wilson, T. Johnson, K. Corday, C. Shaffer and M. Haendel

 eagle-i project  Aims  Ontology role  eagle-i ontology  Requirements  Implementation  Implementation choices  Challenges Outline c o n s o r t i u m

eagle-i NIH funded pilot project working to make scientific resources more visible via a federated network of nine institutional repositories  Index invisible resources reagents, protocols, techniques, instruments, expertise, organisms, software, training, human studies, biological specimens, etc.  Ontology-driven approach to research resource annotation and discovery  Facilitate development of shared semantic entities that can be referenced in publications, databases, experiments, etc. c o n s o r t i u m

1)Represent collected resource information 2)Use the set of ontologies to control the data collection and search applications user-interface (UI) and logic 1)Build a set of ontologies that are reusable and interoperable with other ontologies and existing efforts for representing biomedical entities Ontology development drivers

c o n s o r t i u m Ontology role in eagle-i architecture eagle-i ontologies Federated Network Repositories (RDF) NIF, PubMed Entrez Gene Search Application Data Collection Application Resource information collection

Ontology/MethodScope/Purpose Basic Formal Ontology (BFO)Upper ontology Information Artifact Ontology (IAO) Ontology metadata Relation Ontology (RO)Common properties Minimum Information to Represent External Ontology Terms (MIREOT) Reuse classes and properties from external ontologies Implementation c o n s o r t i u m

Ontology layers Goal: to decouple research resources representation from information used for application appearance and behavior Application specific module – Classes, annotation properties and individuals required to drive the UIs eagle-i core ontology – Classes and properties used to represent information about biomedical research resources MIREOT files – Externally sourced classes and properties

c o n s o r t i u m eagle-i core and MIREOTed sources eagle-i core ontology: 1283 classes, 56 object properties, and 61 data properties. External OntologiesPurpose/subsetsClasses Ontology of Biomedical Investigations (OBI) research material entities, processes, devices, roles 509 NCBI TaxonomyOrganisms taxa192 VIVO ontology people, organization, publications 20 Ontology of Clinical Research (OCRe) human study designs and facets 19 Biomdedical Resource Ontology (BRO) instruments13

Application-specific module Contains properties and classes required to drive the UIs of the data collection and search applications – UI Annotation Definition file – Definition of UI annotation properties and sets of values for these properties – UI Annotations file – Holds annotations made on eagle-I core and MIREOTed classes and properties

c o n s o r t i u m Examples of annotation values and use LabelDescriptionExample Primary Resource Type Denotes classes for which instances are collected ‘instrument’, ‘biospecimen’, ‘protocol’ Data Model Exclude Denotes classes or properties that are not included in the model used for the data tool or the search tool UIs BFO classes such ‘continuant’ or ‘occurrent’ or RO relations such ‘precedes’ Embedded Class Denotes a class for which instances can only be created in the context of an embedding class ‘antibody immunogen’ created within ‘antibody’, ‘construct insert’ created within ‘plasmid’

Additional application-specific properties Property Label DescriptionExampleProperty Type eagle-i domain constraint Used to specify the domain of an imported property. Each annotation will contain the URI of one class Value set to “OBI_ ” (‘organization’) for RO property ‘location_of’’ Data Property eagle-i range constraint Used to specify the range of an imported property. Each annotation will contain the URI of one class Value set to “ERO_ ” (‘instrument’) for RO property ‘located_in’ Data Property eagle-i preferred label Defines the value of preferred label to display in the data collection tool and search UIs Capitalized ‘Organization’ for OBI_ (‘organization’) Annotation Property

Classes annotated with ‘primary resource type ’ Construct insert is an example of a resource annotated as an ‘embedded class’, ‘eagle-i preferred definition’ is used for tooltips ‘eagle-i preferred label’ is used for the display name Property annotated as ‘’primary property’ Technique is annotated as ‘referenced taxonomy’ Data Collection Application

c o n s o r t i u m  Reuse of existent ontologies  Ontology Layers  Application-specific module  Community coordination and alignment  Best practices and tools Challenges and benefits

c o n s o r t i u m  BFO and the relation ontology (RO)  OBO Foundry orthogonality principle Advantages – Integration with other ontologies – Ease the design process – Data integration and publication (Linked Open Data) Challenges – Need to exclude some classes (continuant, occurrent) from UI visualization after the inferred module has computed – Domain and Range in RO not specified or not specific enough for an application – Not all relevant ontologies are built using BFO and RO Reuse of existent ontologies

c o n s o r t i u m Advantages – Effective means to drive an application UI while maintaining interoperability with external ontologies and data sources – Facilitate parallel concurrent development Challenges – Keeping the annotations current with the core module – Risk of excessive proliferation of annotation properties as quick way to simplify application development complexity Ontology layers

c o n s o r t i u m Requirements for bridging the gap between an application and domain-specific ontologies – Application-specific labels and definitions – Exclusion of sets of classes and properties from the model used by the application – Restriction of domain and range for some imported properties – Definition of display order of object and data properties at class level Application-specific module

c o n s o r t i u m Commitment to collaboration with similar efforts aimed at resource modeling – Aligned high level models with NIF, RDS, VIVO – Service, instrument (device) implemented in OBI and reused by NIF and eagle-i – Coordinated representation of reagents, biospecimens, and genotype information (in progress) Challenges – Process is time consuming and it requires extra implementation efforts Implement and import back from reference ontologies – Application ontologies have peculiar requirements Example: Service hierarchy in eagle-i based on type of process rather than input and output of the process (OBI) Community coordination

c o n s o r t i u m  Reusing/referencing existent ontologies – Ontofox, OWL module extractor, NCBO extractor service  Have tools integrated in ontology editors (Protégé) – Effective methods for managing and syncing MIREOTed terms  Have several “community views” or ‘slims’ that could be directly imported with different level of complexity Best practices and tools

c o n s o r t i u m  Developing an ontology-driven application has been an important benchmark for usage of biomedical ontologies  We have designed a layered set of ontologies, consisting of a broadly applicable core ontology and application- specific module – Requirements and principles to inform a general design pattern  Future steps  Refining, documenting and sharing requirements and lessons learned  Engage in efforts addressing the issues we have experienced Conclusion

c o n s o r t i u m Thank you eagle-i core module: eagle-i search: Carlo Torniai Acknowledgments: Ted Bashor, Rob Frost, Larry Stone and Daniela Bourges Project funded through NIH/NCRR ARRA award #U24RR029825