An Advanced Strategy for Integration of Biological Measurement Data Hiroshi Masuya 1, Georgios V. Gkoutos 2, Nobuhiko Tanaka 1, Kazunori Waki 1, Yoshihiro.

Slides:



Advertisements
Similar presentations
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Advertisements

Chapter 6: Modeling and Representation Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Ontology Notes are from:
Iowa State University Animal Science Department Bioinformatics & Computational Biology Program - 01/16/06 1 Overview of Animal Trait Ontology and PATO.
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
Chapter 6: Modeling and Representation Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Reconciliation between BFO and DOLCE leads to YATO ー On quality description ー Riichiro Mizoguchi ISIR, Osaka University.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
From Knowledge Engineering to Ontological Engineering -- A New Trend in Knowledge Model Building for ITSs -- Riichiro Mizoguchi ISIR, Osaka University.
Review Session Monday, November 8 Shantz 242 E (the usual place) 5:00-7:00 PM I’ll answer questions on my material, then Chad will answer questions on.
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
Crosscutting Concepts and Disciplinary Core Ideas February24, 2012 Heidi Schweingruber Deputy Director, Board on Science Education, NRC/NAS.
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Developing an OWL-DL Ontology for Research and Care of Intracranial Aneurysms – Challenges and Limitations Holger Stenzhorn, Martin Boeker, Stefan Schulz,
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Knowledge representation
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
1 Inside Theory-Aware Authoring System Riichiro Mizoguchi, Yusuke Hayashi Osaka University Jacqueline Bourdeau Tele University.
An Ontology-Based Approach for Sharing Digital Resources in Teacher Education 7 th International Workshop on Ontologies and Semantic Web for E-Learning.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
The Chronious Ontology Suite: Methodology and Design Principles Luc Schneider[1], Mathias Brochhausen[1,2] [1] Institute for Formal Ontology and Medical.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Chapter 6: Modeling and Representation Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Sunday, July 22, 2012 Plan Areas of coverage: high-level neurological system process, inc. sensory perception, sensory processing, cognition transmission.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Description of some multimedia ontologies Rapha ë l Troncy Thursday 1 st of December, 2005.
An International Centre for Mouse Genetics EuroPhenome and the International Mouse Phenotyping Consortium John Hancock MRC Harwell.
SSO: THE SYNDROMIC SURVEILLANCE ONTOLOGY Okhmatovskaia A, Chapman WW, Collier N, Espino J, Conway M, Buckeridge DL Ontology Description The SSO was developed.
Discovering Descriptive Knowledge Lecture 18. Descriptive Knowledge in Science In an earlier lecture, we introduced the representation and use of taxonomies.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
2005/11/04 1 A Framework for Organizing Role Concepts in Ontology Development Tool: Hozo Sunagawa, E., Kozaki, K., Kitamura, Y., Mizoguchi, R. I.S.I.R.,
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
02. November 2007 Florian Probst Data and Knowledge Modelling for the Geosciences - Chris Date Seminar - e-Science Institute, Edinburgh Semantic Reference.
A Semantic-Web Representation of Clinical Element Models
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
Text Mining & NLP based Algorithm to populate ontology with A-Box individuals and object properties Alexandre Kouznetsov and Christopher J. O. Baker, University.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Towards a Top-Domain Ontology for Linking Biomedical Ontologies Holger Stenzhorn a,b Elena Beißwanger c Stefan Schulz a a Department of Medical Informatics,
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
Phenotype And Trait Ontology (PATO) and plant phenotypes
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Higgs bosons, mars missions, and unicorn delusions: How to deal with terms of dubious reference in scientific ontologies Stefan Schulz a,b Mathias Brochhausen.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Service-Oriented Computing: Semantics, Processes, Agents
Comparative Data Analysis Ontology (CDAO)
Development of the Amphibian Anatomical Ontology
Mental Functioning and the Gene Ontology
Lecture 02 The Basics pf Creating a GIS Map
Functional Annotation of the Horse Genome
Ontology Reuse In MBSE Henson Graves Abstract January 2011
Service-Oriented Computing: Semantics, Processes, Agents
CCO: concept & current status
Service-Oriented Computing: Semantics, Processes, Agents
INTRODUCTION TO STATISTICS
Versioning in Adaptive Hypermedia
Presentation transcript:

An Advanced Strategy for Integration of Biological Measurement Data Hiroshi Masuya 1, Georgios V. Gkoutos 2, Nobuhiko Tanaka 1, Kazunori Waki 1, Yoshihiro Okuda 3, Tatsuya Kushida 3, Norio Kobayashi 4, Koji Doi 4, Kouji Kozaki 5, Robert Hoehndorf 1, Shigeharu Wakana 1, Tetsuro Toyoda 4 and Riichiro Mizoguchi 5 ICBO 2011 July 28-30, 2011 Buffalo, New York, 1: RIKEN BioResource Center, Tsukuba, Japan 2: Department of Genetics, University of Cambridge, UK 3: NalaPro Technologies, Inc, Tokyo, Japan 4: RIKEN BASE, Yokohama Japan 5: Department of Knowledge Systems, ISIR, Osaka University, Japan

Motivation of this study Organism A Organism B Organism C Organism D Phenotypes represent a broad range of variations in measured qualities To contribute to development of the informatics infrastructure for the description, exchange and mining of phenotypic data. Integrated phenotypic information whole Sophisticated informatics infrastructure (ontology) Sophisticated informatics infrastructure (ontology) Biological knowledge Mining…

Phenotypic Quality (PATO): PATO provides a practical basis for vocabulary and semantics for the description of phenotype information across species. Single hierarchy model of “quality” suite for BFO Less confusions than “EAV” annotation for non-ontology-familiar people. HP: ! Amyotrophy PATO: ! atrophied FMA:30316 ! muscle MA: ! muscle PATO: ! atrophied MP: ! muscular atrophy Basis of inferences of cross-species phenotype equivalence with EQ. (e.g. mouse phenotype and disease) Standard of phenotype annotation across species. (“EQ” annotation) E Q E Q

Expansion of PATO We attempted to expand the PATO ontology to ensure a more advanced, explicit and consistent knowledge framework. 1.To provide fundamental classification of quality values on the basis of measurement scales. 2.To provide strict data model to operate context- dependencies of ordinal values. 3.To provide model of datum (or description) as a informational entity with the structure of common formalisms. Objectives:

Refrain from 2-hiearchy model (and EAV formalism) Refrain from 2-hiearchy model (and EAV formalism) There were a lot of discussions for PATO to take 1-hiearchy and EQ… Fundamental classification of quality-value (1) lengthtemperaturecolor 20cm 37 ℃ K Long, shorthigh, low red, blue.. This classification takes as starting point the mathematical operation! 1.Number of studies claims that the fundamental classification of values: “scales of measurement” (Stevens S.S, 1946) is beneficial for data integration in the field of experimental science.

2.Foundation of explicit description of change of quality is needed Growing boy and his height quality OntologySystem of quality Formalism BFO, PATO 1-hiearchyEQ DOLCE2-hiearchyEAV Qualitative and quantitative descriptions are integrated in a single knowledge framework in DOLCE. For the coordination of ongoing efforts, equivalence mapping of these systems is beneficial. Fundamental classification of quality-value (2) t1 t Explicit description of color change is needed. Color 1: green to orange Color 2: orange to green

“Small” class “Large” class I’m big!! I’m small.. Problem of “large ant and small elephant” value A value B value C value D How to classify value instances? Context A: simple comparison Threshold X (some value) larger smaller Model of context-dependency of ordinal value (1)

“abnormally large” class “abnormally small” class “normal size” class I’m big!! I’m small.. value A value C value B value D Context B: deviation based comparison (context of inference of cross-species equivalence of phenotypes) larger smaller larger smaller Threshold Y1 and Y2 (deviation-based value) deviation Model of context-dependency of ordinal value (2) Problem of “large ant and small elephant” How to classify value instances? Knowledge model of context dependencies of ordinal scale values is needed!

1.Distinction of a “true value” and an “empirical measurement” as an approximation is needed. 2.Modeling of informational entities with common formalisms (eg. EQ, EAV and so on) and their relationships would be useful! weigh t EQ EAV weigh t Model of datum as an informational entity Reality Information Current version of DOLCE, BFO and PATO deal only with the primary reality and do not deal with quality description. (Unknown…) (Unknown...) Current version of

A reference ontology “PATO2YAMATO” Expansion of PATO with YAMATO framework Features: Framework of interoperability of quality-related concepts between top-level ontologies. Support of classification of scales of measurements. Model of context dependency with “role” Detailed model of “representation” (an informational object) that involves quality representation. Yet Another More Advanced Top-Level Ontology (YAMATO: Mizoguchi, 2009) BFOYAMATO DOLCE PATO quality quale quality-space Practical use based proposals… Equivalence mapping between 1- and 2-hiearcy models Model of context dependency Model of datum with common formalisms OBI Mapping Interoperability

region quale quality space quality_ quantity quality value quality property quality BFO YAMATO DOLCE (Upper level) generic quality (convertible) (Upper level) Equivalence mapping of 1- and 2-hearcy model identical Classification of quality value (scales of measurements : Stevens S.S, 1946) About 1,000 PATO terms were manually mapped to YAMATO framework.

Modeling of context dependency with “role” An entity often plays different “roles” with different characteristics under different contexts (at school) (at home) Abnormally heavy large-role weight quality value Distribution for weight role-holder ( Entity playing a role) role potential player context heavier than normal value qualitative value for weight depend on playable In the distribution for weight, some weight quality values playing large-roles thereby becomes role holders, abnormally heavy Concept model of role and role-holder I’m a teacher.I’m a husband

Modeling of context dependency with “role” (at school) (at home) In the distribution for weight, some weight quality values playing large-roles thereby becomes role holders, abnormally heavy I’m a teacher.I’m a husband An entity often plays different “roles” with different characteristics under different contexts Implementation and representation in Hozo ontology editor context Role-holder player Potential

Inter-relationships among contexts Classification of organisms Inherit Inference of the Classification of “abnormally heavy” ”Abnormally light in elephant is lighter than abnormally heavy in ant” in the simple comparison context. Inference of classification:

Context of distribution of weight in elephant Context of distribution of weight in ant Simple comparison context Coordination of ordinary values under different contexts Abnormally heavy in elephant Normal weight in elephant Abnormally light in elephant Abnormally heavy in ant Normal weight in ant Abnormally light in ant larger

Quality representation in YAMATO Quality Weight Reality (Symbolization) Quality representation Informational entities YAMATO provides “quality representation” for the foundation of formalized informational entities such as EQ, EAV and so on. Basic structure for representation by symbol EP (=EQ) (BFO, PATO) EAV (DOLCE) Sentence of natural language Coding of genetic information TuppleTriplenatural languagenucleotide sequence *entity, #property*entity, #generic quality, value alphabetmolecular symbol quality measurement anything…Specification of gene product Quality representation is modeled in the consistent way for content bearing informational entity, “representation”. quality representation *: symbolization operation, #: Class => individual operation (equivalent with punning in OWL 2) (Mizoguchi, 2004)

Current status of the reference ontology: PATO2YAMATO 1,450 EQ annotation: ( OBO cross-product file for Mammalian Phenotype ontology ) EAV-quality representations in YAMATO framework reference: PATO2YAMATO Including about 1,000 PATO terms into YAMATO framework Basic form of context-dependent ordinal values are defined. They are workable under the classification of organisms. Basic form of quality representation (EAV and EQ) are already defined in YAMATO. Preliminary trial of simple conversion of EQ to EAV The ontology helps the automatic conversion from EQ to EAV! We are planning full conversion of EQ across multiple species with coordinated EAV- quality representation.

This study shows: YAMATO’s framework helps to coordinate different “qualities” for phenotype information in both of reality and description level. Role-model successfully coordinated ordinal values dependent on multiple contexts (deviation-based and simple comparison). Future views: Automatic conversion of EQ of multiple species to EAV. Modeling of contexts of experimental conditions. Integration of qualitative and quantitative phenotype data. Coordination of more complicated phenotype data sets from multiple species and experiments. Summary of this talk

RIKEN BioResource Center Nobuhiko Tanaka, Kazunori Waki, Terue Takatsuki University of Cambridge Georgios V. Gkoutos, Robert Hoehndorf NalaPro Technologies Inc Yoshihiro Okuda, Tatsuya Kushida Enegate corp Mamoru Ota RIKEN BASE Norio Kobayashi, Koji Doi, Tetsuro Toyoda Department of Knowledge Systems, ISIR, Osaka University Koji Kozaki, Riichiro mizoguchi Acknowledgements

貴為和以 “Harmony is to be valued.” In “Seventeen-article constitution” (A.D 603, YAMATO imperial court in ancient Japan) Authored by Prince Sh ō toku (A.D. 573–621) Thank you !