Binding Ontologies and Coding Systems to Electronic Health Records and Message Alan Rector 1, Rahil Qamar 1 & Tom Marley 2 1 University of Manchester 2.

Slides:



Advertisements
Similar presentations
Limitations of the relational model 1. 2 Overview application areas for which the relational model is inadequate - reasons drawbacks of relational DBMSs.
Advertisements

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Semantics Static semantics Dynamic semantics attribute grammars
Copyright © Cengage Learning. All rights reserved.
Copyright © Cengage Learning. All rights reserved.
Everything You Need to Know (since the midterm). Diagnosis Abductive diagnosis: a minimal set of (positive and negative) assumptions that entails the.
Background information Formal verification methods based on theorem proving techniques and model­checking –to prove the absence of errors (in the formal.
Solutions to Review Questions. 4.1 Define object, class and instance. The UML Glossary gives these definitions: Object: an instance of a class. Class:
Copyright © 2006 Addison-Wesley. All rights reserved.1-1 ICS 410: Programming Languages Chapter 3 : Describing Syntax and Semantics Axiomatic Semantics.
ISBN Chapter 3 Describing Syntax and Semantics.
Lecturer: Sebastian Coope Ashton Building, Room G.18 COMP 201 web-page: Lecture.
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
Chapter 6 Methodology Logical Database Design for the Relational Model Transparencies © Pearson Education Limited 1995, 2005.
7M701 1 Software Engineering Object-oriented Design Sommerville, Ian (2001) Software Engineering, 6 th edition: Chapter 12 )
Chapter 8: Web Ontology Language (OWL) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Methodology Logical Database Design for the Relational Model
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
Overview of Software Requirements
CS 330 Programming Languages 09 / 16 / 2008 Instructor: Michael Eckmann.
Describing Syntax and Semantics
CET203 SOFTWARE DEVELOPMENT Session 1B Modelling and the Theory of Inheritance.
3/18/19990© 1999, Health Level Seven, Inc. Introduction: Vocabulary domains Marital Status –single (never married) –married –divorced –separated “Vocabulary”
Copyright © Cengage Learning. All rights reserved.
C OURSE : D ISCRETE STRUCTURE CODE : ICS 252 Lecturer: Shamiel Hashim 1 lecturer:Shamiel Hashim second semester Prepared by: amani Omer.
Chapter 4 The Relational Model.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
NHS CFH Approach to HL7 CDA Rik Smithies Chair HL7 UK NProgram Ltd.
1 CS 456 Software Engineering. 2 Contents 3 Chapter 1: Introduction.
Knowledge representation
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
© Bertrand Meyer and Yishai Feldman Notice Some of the material is taken from Object-Oriented Software Construction, 2nd edition, by Bertrand Meyer (Prentice.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
R McFadyen Chapter 7 Conceptual Data Modeling.
1 Relational Expressions Relational expressions: –Expressions that compare operands –Sometimes called conditions –Evaluated to yield a result –Typically.
1 The Relational Database Model. 2 Learning Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical.
Formal Models in AGI Research Pei Wang Temple University Philadelphia, USA.
9/7/2012ISC329 Isabelle Bichindaritz1 The Relational Database Model.
1 Towards a Unified Representation: Representing HL7 and SNOMED in OWL Alan Rector 1 & Tom Marley 2 1 Information Management Group / Bio Health Informatics.
ISBN Chapter 3 Describing Semantics -Attribute Grammars -Dynamic Semantics.
(Spring 2015) Instructor: Craig Duckett Lecture 10: Tuesday, May 12, 2015 Mere Mortals Chap. 7 Summary, Team Work Time 1.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 14 Slide 1 Object-oriented Design.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
1 Towards a Unified Representation: Representing HL7 and SNOMED in OWL Alan Rector 1 & Tom Marley 2 1 Information Management Group / Bio Health Informatics.
More on Description Logic(s) Frederick Maier. Note Added 10/27/03 So, there are a few errors that will be obvious to some: So, there are a few errors.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
1 Chapter 17 Methodology - Local Logical Database Design.
Based on “A Practical Introduction to Ontologies & OWL” © 2005, The University of Manchester A Practical Introduction to Ontologies & OWL Session 2: Defined.
Naïve Set Theory. Basic Definitions Naïve set theory is the non-axiomatic treatment of set theory. In the axiomatic treatment, which we will only allude.
® A Proposed UML Profile For EXPRESS David Price Seattle ISO STEP Meeting October 2004.
ONTOLOGY ENGINEERING Lab #3 – September 15,
Approach to building ontologies A high-level view Chris Wroe.
Object-Oriented Parsing and Transformation Kenneth Baclawski Northeastern University Scott A. DeLoach Air Force Institute of Technology Mieczyslaw Kokar.
Formal Verification. Background Information Formal verification methods based on theorem proving techniques and model­checking –To prove the absence of.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
CSC3315 (Spring 2009)1 CSC 3315 Languages & Compilers Hamid Harroud School of Science and Engineering, Akhawayn University
Methodology - Logical Database Design. 2 Step 2 Build and Validate Local Logical Data Model To build a local logical data model from a local conceptual.
Verification vs. Validation Verification: "Are we building the product right?" The software should conform to its specification.The software should conform.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Models of use and Model of Meaning towards a Model Driven Architecture for Data Entry & decision support Alan Rector, Tom Marley, and Rahil Qamar University.
Proof And Strategies Chapter 2. Lecturer: Amani Mahajoub Omer Department of Computer Science and Software Engineering Discrete Structures Definition Discrete.
Logic.
COMP6215 Semantic Web Technologies
Methodology Logical Database Design for the Relational Model
Ontology From Wikipedia, the free encyclopedia
Business Process Measures
Logical architecture refinement
An Introduction to Software Architecture
Presentation transcript:

Binding Ontologies and Coding Systems to Electronic Health Records and Message Alan Rector 1, Rahil Qamar 1 & Tom Marley 2 1 University of Manchester 2 University of Salford with acknowledgements to the Semantic Mining Network of Excellence, the UK Connecting for Health programme, and the Terminfo Group

2 Plan of the paper ►The use case and requirements ►Theoretical background ►Engineering background ►Issues in using OWL ►Summary

3 Medical IT’s odd organisational structure ►Separate / independent development ►Medical Ontologies / Terminologies ►SNOMED, GALEN, NCI thesaurus, potentiual OBO Disease Ontology, etc. ►Medical information models ►HL7 messages ►OpenEHR Archetypes ►A Common Manifestation of the Oddity ► The “value set” problem ►Even after the ontology is designed, the EHR/Message still has to define the valid value sets in terms of it ►Related problems ►Relation of Simple Knowledge Organisation System (SKOS) and other thesauri with ontologies ►“Classes as Values” ►Semantic Web Best Practice Working Group (SWBP) (Noy et al)

4 What this paper is not about ►Whether any specific coding system, ontology, or information model is appropriate, true, good, bad, or otherwise ►The methods presented here are pertinent even with a “perfect” ontology and a fully adequate and well aligned information model ►Although they are more obvious with ill formed ontologies and information systems ►...and do allow us to cope with imperfect ontologies and suboptimal record structures ►There is another paper to be written on how best to construct ontologies and information models to fit together ►This is not it ►We have tested with HL7 and SNOMED becaue that gave us independently developed test cases & NHS paid ►This implies neither endorsement nor criticism of either

5 Requirements summary ►Explicit “Code Binding Interface” (“CBI”) interface ►What codes can be used when and where ►Expressivity ►Meets requirements on next slide ►Compositional coding systems (SNOMED-CT) ►Deal with compositional coding systems ►Part of general methodology for ►progressively constraining information models ►Factoring information models into re-usable submodels ►Use of standard languages with well defined semantics ►Formally specified independently testable constraints

6 Expressivity for the “Code Binding Interface” ►Any enumerated list of codes (without their subcodes ►A code and all its subcodes ►All the subcodes of a code but not the parent code ►Any boolean combination of the above

7 There is no first order solution to these requirements ►You cannot use a first order expression about the members of classes to select a class and exclude its subclasses ►Unless you really mean to exclude the individual in the subclasses ►which in general you do not

8 Pointed out the obvious Data structures and what they carry information about are different ►Information models and “ontologies” are at different levels ►Information structures are “meta” to ontologies ►The purpose of modelling an ontology is to represent our conceptualisation of the world ►The question is does allow us to make correct predictions about our observations of the world ►The purpose of modelling an information structure is to specify valid data structures structures to carry information about that world ►To constrain the data structures to just those which a given software system can process

9 Data structures and what they carry information about have different characteristics ►Example: ►All persons have a sex ►However not all data structures about people have a field for sex ►Information structures are intrinsically closed ►Valid structures can be exhaustively and completely described (up to recursion) ► Ontologies are intrinsically open ►We can never describe the world completely

10 Fundamental task for EHRs and Messages ►Begin with logical statements about the world ►Our best efforts to represent what we believe is true ►Encode those statements in data structures for transmission ►Provably valid for the software in hand ►Decode data structures faithfully back to logical statements about the world ►With only well defined loss of information

From logical statements about the world to information structures and back again (possibly with loss) Valid Specifications for data structures “Valid diabetic data structures have: a topic of code for diabetes, a diagnosis code that is diabetes or one of its subcodes, and a brittleness code that is one of the subcodes for diabetic brittlenes and nothing else” Logical statements about the world: “ All diabetes are metabolic diseases” “John has diabetes & it is brittle and long standing” “John has diabetes and it is brittle”

12 Caveat: Do not confuse higher order knowledge about the world and meta knowledge about the representation ►Higher order knowledge about the classes themselves ►“Endangered species” ►“Darwin described Galapagos Finches” ►Information about the software artifact ►Editorial ►The representation of “species” in this ontology was authored by Alan Rector on the basis of defintions in the OED, Wordnet, and UMLS ►Structural ►The identifier for the class “Diabetes”in this representation is “12345” ►In this representation system ► the immediate primitive superclass of “Diabetes” is “metabolic disorder” ► the inferred immediate superclasses are “Chronic disease”, “Cardiovascular risk factor”, ►what we are concerned with in this paper

13 Representing Information Models and Codes: Basic approach ►An information model can the thought of of as A logical theory of classes of information structures ►The instances of the classes are concrete data structures - EHRs, messages, etc - carrying data about specific patients, tests, organisations, cases of disease,...

14 Simple example of a class of information structures for ‘Diabetes’ ►Three fields - one example of each case Field Valid Code Set TopicExact code for diabetes DiagnosisThe code for diabetes or any kind of diabetes BrittlenessAny of the subcodes of brittleness but not brittleness itself

15 “ontology” representations and “codes” ►The instances of “ontology” represent (conceptualisations of) things in the world ►Cases of diabetes ►Patients ►Insulin metabolism ►islet cells ►The instances in data structures are data items in human artefacts ►Information structures of associations and attributes, elements, etc. ►Individual codes represent representations of classes in given representation of an “ontology” ►What is true of a given logical / mathematical / software artefact

Model of Meaning & Model of Data structure Type 1 Diabetes Observa- tion Observa- tion Diabetes Observa- tion Participa- tion Model of data structures in Information Model topic diagnosis Diabetes Type 1 Diabetes Type 2 Diabetes Metabolic disorder Model of Meaning/ “ontology” code_for_ metabolic_disorder code_for_diabetes code_for_ diabetes_type_1 code_for_ diabetes_type_2 is_subcode_of Model of codes in Information Model Diabetes Type 1 Diabetes Type 2 Diabetes Metabolic disorder Model of Meaning/ “ontology” ONLY

17 Software Engineering Issues Clean testable, maintainable “The Code Binding Interfaced” ►Placholder codes ►Classses of codes used in the the information model to express constraints ►Only a member of this class of placeholder codes can be used here ►But not defined in the information model ►Defer binding ►A notion partly taken from Beale’s Archetype Definition Language’s “Ontology “ section, ►which would better be called a “Code Binding Section” ►And whose semantics are weak - we propose here a methodology for a stronger semantics

Binding using placeholders Code Binding Interface

Ontologies Outline of Process Information Models Coding System Transform Binding Module Import Reasoner Validate Applications Data Structures

20 Doing it in OWL ►Why use OWL? ►Treat OWL as a logic constraint language with tools and community ►Can be used to represent ontologies ►Can also be used to represent constraints data structures ►Generic with well defined semantics: ►It means what it means ►Forces distinctions often left implicit to be made explicit ►The good news: ►Can be explicit about which expressions are open or closed ►Can be explicit about layering and metamodels ►The bad news: ►Must be explicit about which expressions are open or closed ►Must be explicit about layering and meta models ►NB using some constructs from OWL 1.1 ►but included in all current OWL reasoners

21 Key issues in OWL ►Open World Reasoning ►Negation means provably false ►Expressions not present are merely “under-specified” ►OWL, Descriptin Logics, FoL ►False means cannot be proved true ►What is missing is false ►Databases, logic programming, most rules systems, SQL, etc. ►Absence of the Unique Name Assumption ►Two entities are different only if we say they are different ►Means that it is easy to say that two things are logically equivalent ►In most database, logic programming, systems, etc. if two things have different names they are different ►Therefore: Owl often requires additional explicit axioms (since you have the freedom to do either, you must specify which) ►Closure axioms when closed world reasoning is required ►Distinguishing axioms when things are known to be different

22 Example definition of a class of data structures CLASS Diabetic_data_structure ➔ has_attr EXACTLY 1 Topic, has_attr EXACTLY 1 Diagnosis, has_attr EXACTLY 1 Brittleness,

23 Example definition of a class of data structures with placeholders CLASS Diabetic_data_structure ➔ has_attr EXACTLY 1 Topic, has_attr SOME (Topic & has_code SOME Placeholder_diabetes_only_code), has_attr EXACTLY 1 Diagnosis, has_attr SOME (Diagnosis & has_code SOME Placeholder_diabetes_or_sub), has_attr EXACTLY 1 Brittleness, has_attr SOME (Brittleness & has_code ONLY Placeholder_diab_brittleness_sub).

24 Example definition of a class of data structures with placeholders and closure axiom CLASS Diabetic_data_structure ➔ has_attr EXACTLY 1 Topic, has_attr SOME (Topic & has_code SOME Placeholder_diabetes_only_code), has_attr EXACTLY 1 Diagnosis, has_attr SOME (Diagnosis & has_code SOME Placeholder_diabetes_or_sub), has_attr EXACTLY 1 Brittleness, has_attr SOME (Brittleness & has_code ONLY Placeholder_diab_brittleness_sub), has_attr ONLY (Topic OR Diagnosis OR Brittleness).

25 The Code Binding Interface A set of equivalence axioms ► The code for diabetes (only) Placeholder_diabetes_only_code ↔ {code_for_diabetes} ► The code for diabetes or any of its subcodes Placeholder_diabetes_or_sub ↔ {code_for_diabetes} OR is_sub_of VALUE code_for_diabetes ► The code for any subcode of diabetic_brittleness Placeholder_diab_brittleness_sub ↔ is_sub_of VALUE code_for_diabetic_brittleness

26 A fragment of the ontology and the derived coding system ►CLASS Diabetes ➔ Metabolic_disorder, has_quality EXACTLY 1 Brittleness. CLASS Diabetes_type_1 ➔ Diabetes, is_caused_by_some (Damage AND has_locus SOME Pancreatic_islet_cells).... ►INDIVIDUAL code_for_diabetes ∈ CODE, has_sub VALUE code_for_diabetes_type_1, has_sub VALUE code_for_diabetes_type_2, has_sub ONLY {code_for_diabetes_type_1, code_for_diabetes_type_2}.

27 A fragment of the ontologym and the derived coding system ►Note: ►Not everything in the ontology is necessarily represented in the coding system ►In this coding system causation by pancreatic islet cell defect is ignored in ►We could introduce additional features in the coding system not from the ontology ►Whether or not this would be a good idea ►We could deliberately create a more complex relationship for has sub ►A reasonable way to account systematicaly for “broader than/narrower than”

28 Summary ►“Ontologies” represent (our conceptualisation of) the world ►The criteria for correctness is our predictions of observations of the world ►Data Structures are used to convey information about (our conceptualisation of) the world ►They criteria for correctness is validity for use in multiple information systems ►Codes are data structures which represent the representations in ontological artefacts ►They are “meta” to the ontology ►Ontologies & coding systems will continue to be developed independently ►Formal interfaces and deferred binding are therefore required between them EVEN WERE THEY PERFECT ►The mechanism of placeholder codes is presented (in part) here is sufficient to form such a “Code Binding Interface”