1 An ontological analysis of diagnostic assertions in electronic healthcare records International Conference on Biomedical Ontology July 27-30, 2015 –

Slides:



Advertisements
Similar presentations
Nursing Diagnosis: Definition
Advertisements

Constructing a scientific argument The Argument Matrix Katharina Alpers Based on a presentation developed by FETP India (Acknowledgments to Yvan Hutin)
K-6 Science and Technology Consistent teaching – Assessing K-6 Science and Technology © 2006 Curriculum K-12 Directorate, NSW Department of Education and.
Upper Ontology Summit Wednesday March 15 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National.
1 SESSION 3 FORMAL ASSESSMENT TASKS CAT and IT ASSESSMENT TOOLS.
Linking to the ICF One of the aims of this work is to link this representation scheme to the WHO ICF.
Progress on Registration in Political Science Jeremy M. Weinstein (on behalf of Experiments in Governance and Politics – EGAP)
Catalina Martínez-Costa, Stefan Schulz: Ontology-based reinterpretation of the SNOMED CT context model Ontology-based reinterpretation of the SNOMED CT.
United Nations Statistics Division Principles and concepts of classifications.
Introduction to Research Methodology
Division of Biomedical Informatics Beyond Interoperability: What Ontology Can Do for the EHR William R. Hogan, MD, MS July 30 th, 2011 International Conference.
OASIS Reference Model for Service Oriented Architecture 1.0
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
Good Research Questions. A paradigm consists of – a set of fundamental theoretical assumptions that the members of the scientific community accept as.
IS Audit Function Knowledge
Business research methods: data sources
PPA 503 – The Public Policy Making Process
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Chapter One: The Science of Psychology
Chapter 17 Nursing Diagnosis
Uniform Coding and Simplified Pricing HEALTH AUTHORITY – ABU DHABI Health Systems Finance May, 2007.
IT 244 Database Management System Data Modeling 1 Ref: A First Course in Database System Jeffrey D Ullman & Jennifer Widom.
The Standards for Mathematical Practice
The Use of Student Work as a Context for Promoting Student Understanding and Reasoning Yvonne Grant Portland MI Public Schools Michigan State University.
How to Write a Literature Review
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
The Foundational Model of Anatomy and its Ontological Commitment(s) Stefan Schulz University Medical Center, Freiburg, Germany FMA in OWL meeting November.
Chapter One: The Science of Psychology. Ways to Acquire Knowledge Tenacity Tenacity Refers to the continued presentation of a particular bit of information.
TEA Science Workshop #3 October 1, 2012 Kim Lott Utah State University.
Concept Model for observables, investigations, and observation results For the IHTSDO Observables Project Group and LOINC Coordination Project Revision.
T. Dawson, TASC 9/11/13 Use of a Technical Reference in NASA IV&V.
Ontology for General Medical Science Overview and OBO Foundry Criteria Albert Goldfain Blue Highway / University at Buffalo ICBO.
A GENERIC PROCESS FOR REQUIREMENTS ENGINEERING Chapter 2 1 These slides are prepared by Enas Naffar to be used in Software requirements course - Philadelphia.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
ToK ESSAY The instructions tell you to: Remember to centre your essay on knowledge issues and,where appropriate, refer to other parts of your IB programme.
6. Conceptualization & Measurement
Cambridge Pre-U Getting Started In-service Training Liberating learning Developing successful students.
Division of Biomedical Informatics Representing the Reality Underlying Demographic Data William R. Hogan, MD, MS July 30, 2011 International Conference.
Standards for Mathematical Practice
New York State Center of Excellence in Bioinformatics & Life Sciences R T U Referent Tracking Unit R T U Guest Lecture for Ontological Engineering PHI.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
On the futility of attempts to formalize clustering within conventional formal frameworks Lev Goldfarb ETS group Faculty of Computer Science UNB Fredericton,
Conducting and Reading Research in Health and Human Performance.
The ICPS: A taxonomy, a classification, an ontology or an information model? Stefan SCHULZ IMBI, University Medical Center, Freiburg, Germany.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
L To identify the services that the customer requires from a system and the constraints under which it operates and is developed.
EBM --- Journal Reading Presenter :呂宥達 Date : 2005/10/27.
This was developed as part of the Scottish Government’s Better Community Engagement Programme.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U Referent Tracking Unit R T U Guest Lecture for Ontological Engineering PHI.
Unit 11: Evaluating Epidemiologic Literature. Unit 11 Learning Objectives: 1. Recognize uniform guidelines used in preparing manuscripts for publication.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 1 Research: An Overview.
1 Ontological investigations into medical diagnoses Grand Round of the Department of Biomedical Informatics August 26, 2015 – Buffalo, NY, USA Werner CEUSTERS,
New York State Center of Excellence in Bioinformatics & Life Sciences R T U Institute for Healthcare Informatics Ontology and Imaging Informatics Third.
1 Diagnoses in Electronic Healthcare Records: What do they mean? School of Informatics and Computing Colloquia Series, Indiana University. Indianapolis,
1 Biomarkers in the Ontology for General Medical Science Medical Informatics Europe (MIE) 2015 May 28, 2015 – Madrid, Spain Werner CEUSTERS 2, MD and Barry.
EBM --- Journal Reading Presenter :黃美琴 Date : 2005/10/27.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
The Toulmin Method. Why Toulmin…  Based on the work of philosopher Stephen Toulmin.  A way to analyze the effectiveness of an argument.  A way to respond.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
Semantic Interoperability in Healthcare State of the Art in the US (position paper) March 15, 2010 CCIB Convention Centre, Barcelona, Spain.
W. Ceusters1, M. Capolupo2, B. Smith1, G. De Moor3
Department of Psychiatry, University at Buffalo, NY, USA
The scope and focus of the Research
Towards the Information Artifact Ontology 2
Biomedical Ontology PHI 548 / BMI 508
Advanced Topics in Biomedical Ontology PHI 637 SEM / BMI 708 SEM
Stefan SCHULZ IMBI, University Medical Center, Freiburg, Germany
Statistical Data Analysis
Presentation transcript:

1 An ontological analysis of diagnostic assertions in electronic healthcare records International Conference on Biomedical Ontology July 27-30, 2015 – Lisbon, Portugal Werner CEUSTERS 1, MD  William R. HOGAN 2, MD 1 Department of Biomedical Informatics, University at Buffalo 2 Department of Health Outcomes and Policy, University of Florida

2 Observation / Claim Patient data, as currently gathered through EHRs, communicated over RHIOs, and collected and aggregated in data warehouses, have minimal, but usually missing, background information, insufficiently precise to allow the construction of a completely accurate view on what is (and has been) the case in reality.

3 Is this patient really so sick ?

4 What are the referents ? Are there really ‘chronic diagnoses’, or is it diseases that are chronic ?

5 What are the referents ? Whose problems and favorites are intended here: the patient’s or the clinician’s?

6 Open questions on a seemingly simple representation Are the two diagnoses about the very same disorder the patient suffers from (thereby highlighting different aspects of that disorder which cannot be expressed using a single ICD-code) or about two distinct disorders the patient suffers from simultaneously? Did the persons that entered the diagnoses also make the diagnoses? How do the dates when the diagnoses were entered relate to the dates when the diagnoses were actually made, or the dates the patient started to suffer from the disease?

7 A solution proposed 10 years ago Referent Tracking explicit reference to the individual entities relevant to the accurate description of some portion of reality,... Ceusters W, Smith B. Strategies for Referent Tracking in Electronic Health Records. J Biomed Inform Jun;39(3):

8 A solution proposed 10 years ago Referent Tracking explicit reference to the individual entities relevant to the accurate description of some portion of reality, … by means of an Instance Unique Identifier (IUI) for each such particular (individual) entity. Ceusters W, Smith B. Strategies for Referent Tracking in Electronic Health Records. J Biomed Inform Jun;39(3):

9 Referent Tracking Tuples (RTTs) From: “ this patient has ICD-9 diagnosis ‘279.4 – Gout, unspecified’ ” To (very roughly, simplified and in abstract syntax) : ‘this-1 is about this-2 which is realized in this-3’, where this-1 instanceOf diagnosis… this-1 isAbout this-2 … this-2 instanceOf disease … this-2 realizedIn this-3… this-3 instanceOf human being … …

10 Referent Tracking Tuples (RTTs) From: “ this patient has ICD-9 diagnosis ‘279.4 – Gout, unspecified’ ” To (very roughly, simplified and in abstract syntax) : ‘this-1 is about this-2 which is realized in this-3’, where this-1 instanceOf diagnosis… this-1 isAbout this-2 … this-2 instanceOf disease … this-2 realizedIn this-3… this-3 instanceOf human being … … denotators for particulars

11 Referent Tracking Tuples (RTTs) From: “ this patient has ICD-9 diagnosis ‘279.4 – Gout, unspecified’ ” To (very roughly, simplified and in abstract syntax) : ‘this-1 is about this-2 which is realized in this-3’, where this-1 instanceOf diagnosis… this-1 isAbout this-2 … this-2 instanceOf disease … this-2 realizedIn this-3… this-3 instanceOf human being … … denotators for appropriate relations

12 Referent Tracking Tuples (RTTs) From: “ this patient has ICD-9 diagnosis ‘279.4 – Gout, unspecified’ ” To (very roughly, simplified and in abstract syntax) : ‘this-1 is about this-2 which is realized in this-3’, where this-1 instanceOf diagnosis… this-1 isAbout this-2 … this-2 instanceOf disease … this-2 realizedIn this-3… this-3 instanceOf human being … … denotators for universals or particulars

13 Referent Tracking Tuples (RTTs) From: “ this patient has ICD-9 diagnosis ‘279.4 – Gout, unspecified’ ” To (very roughly, simplified and in abstract syntax) : ‘this-1 is about this-2 which is realized in this-3’, where this-1 instanceOf diagnosis… this-1 isAbout this-2 … this-2 instanceOf disease … this-2 realizedIn this-3… this-3 instanceOf human being … … time stamp at least one continuant is referenced

14 Representation of relation with time intervals

15 Research question To what extent is it possible for 2 ontologists to develop independently from one another a collection of RTTs that describe the same portion of reality (POR) in a semantically-interoperable way. The analysis presented is the first step in this endeavor and focuses on: (1)the choice of particulars deemed necessary and sufficient for an accurate description of the selected POR, (2)what these particulars are instances of, (3)how they relate to each other, and (4)the motivations of each author for the choices made.

16 Relevance for this work Differences in the choice of ontologies constitute a risk: distinct ontologies may represent reality from distinct perspectives and despite being veridical might not be derivable from each other because the axioms required to do so would be missing, for the simple reason that such axioms would fall outside the purpose of the specific ontologies. This would lead the representations by each of the authors not to be semantically interoperable unless additional ontology bridging axioms would be crafted.

17 Back to our problem: What must be the case and can be the case for the following table to make sense, and How can Referent Tracking and Ontology make that clear? What follows is an incomplete analysis, examples being taken to make the case for this particular presentation.

18 Intellectual experiment Context: An EHR with a problem list shows in a spreadsheet for a specific patient two diagnostic entries entered at the same date, but by distinct providers: It is assumed that the patient with ID ORT58578 has only one disorder. Task : List the different kinds of Referent Tracking statements that would represent this situation. Consider what must and can be the case for the table to make sense. Players : Ceusters and Hogan, two experts in Referent Tracking (hereafter referenced as X and Y, not to be assumed in a specific order)

19 Methodology No instructions on ontologies to use, or format of RTTs. Results exchanged after each author finished work. Analysis: identification of the particulars that both authors referred to in their assertions. re-assign IUIs to particulars referred to by both authors as if the collection of RTTs was merged into one single RT system, thereby still keeping track of which RTT was asserted by which author. analyze and discuss differences in representations, however without paying attention to the temporal indexing required for RTTs describing a POR in which a continuant is involved.

20 Quantitative Results XYX+Y # particulars referenced (TR excluded) # instantiations232849/47/41 # instantiated classes # realism-based ontologies drawn from547 # classes without ontological home347 # particular-to-particular (PtoP) relations # RTTs judged not at all appropriate400 of which PtoP / P-inst4 / 00 / 0 # RTTs judged arguably appropriate of which PtoP / P-inst7 / 1511 / 97 / 9 # RTTs judged for sure appropriate of which PtoP / P-inst48 / 3548 / 4148 / 35

21 Appropriateness measured in terms of what an optimal collection of RTTs for the POR under scrutiny would be; POR under scrutiny: Assertional part: what is in the EHR Non-assertional part: what is on the side of the patient Optimal collection: satisfies the following criteria: (1) it consists of RTTs which describe the non-assertional part of the POR only to the extent to which there is enough evidence for what those RTTs themselves assert to be true (e.g. there is sufficient evidence that the patients are human beings, there is not sufficient evidence that the diagnoses are correct), and (2) it consists of other RTTs which describe the assertional part in relation to the RTTs referenced under (1).

22 Very high inter-rater agreement ObsY 012 X agreement by chance Cohen's kappa

23 Main reasons for disagreement Absence of uniform conventions on which ontologies and relations to use, Problems in the ontological theories, Issues with implementation of ontologies, both authors resorted to OGMS for a large part of their RTTs, Yet, differences in representation were observed because of the source material consulted: X used the OGMS OWL artifact as basis, whereas Y used the definitions and descriptions in the paper that led to the development of OGMS (Scheuermann et al., 2009). Lack of appropriate documentation.

24 Ontologies and orphan classes referenced OntologiesXY Ontology of General Medical Science (OGMS)xx Ontology of Medically Related Social Entities (OMRSE) x the Foundational Model of Anatomy (FMA) xx the Disease Ontology (DO)x Ontology of Biomedical Investigations (OBI)x Basic Formal Ontology (BFO)x Information Artifact Ontology (IAO)x Orphan classes ‘denotator’x ‘EHR’x ‘dataset record’x ‘patient identifier’x ‘ICD-9-CM code and label’x

25 Material entity / human being IndIUIDescriptionOntology Class YX T1P1the patientOBIHomo sapiens 12 T2P2the doctor who made diagnosis #1 OBIHomo sapiens 12 T3P3the doctor who made diagnosis #2 OBIHomo sapiens 12 T15P13the patient's patient role OMR SE Patient role 22 IndIUIDescriptionOntology Class YX T27P1the material entity whose ID is ‘1234’ in the spreadsheet BFOMaterial entity 11 T2P2the person whose name is ‘J. Doe’ in the spreadsheet FMAHuman being 11 T3P3the person whose name is ‘S. Thump’ in the spreadsheet FMAHuman being 11 X represented P1 as a human with a patient role. Y represented P1 as a material entity ( P1 has been a material entity all the time through its existence, but not a human (e.g., it was a zygote at a time prior to being human) ) without assigning a patient role. This difference in representation is related to the temporal indexing that RT requires for continuants. Given the two authors’ temporal indexing, both agree that each other’s views re material entity/human being (cave next slide) were correct.

26 Human being / Homo sapiens IndIUIDescriptionOntology Class YX T1P1the patientOBIHomo sapiens 12 T2P2the doctor who made diagnosis #1 OBIHomo sapiens 12 T3P3the doctor who made diagnosis #2 OBIHomo sapiens 12 T15P13the patient's patient role OMR SE Patient role 22 IndIUIDescriptionOntology Class YX T27P1the material entity whose ID is ‘1234’ in the spreadsheet BFOMaterial entity 11 T2P2the person whose name is ‘J. Doe’ in the spreadsheet FMAHuman being 11 T3P3the person whose name is ‘S. Thump’ in the spreadsheet FMAHuman being 11 Disagreement about how to interpret the representational units for the universal Human being from the selected ontologies: Is ‘human being’ synonym for the FMA’s ‘human body’ class ? Does OBI’s ‘Homo sapiens’ because of its linking to other ontologies in Ontobee confuse ‘Homo sapiens’ as an instance of ‘species’ with those instances of organism that belong to – but are not instances of – the species ‘Homo sapiens’? ‘Homo sapiens’ and similar classes in OBI all descend from a class called ‘organism’. The ‘Homo sapiens’ class in OBI has synonyms ‘Human being’ and ‘human’. The problem here is the lack of face value of terms selected as class names in the respective ontologies.

27 Disorders, diseases, diagnoses and DO Agreement on the existence of a disorder, a disease, two diagnoses and two distinct processes that generated each. Agreement that none of these entities should be confused or conflated: nothing at the same time can be an instance of two or more of the following: disease, disorder, diagnosis, and diagnostic process. Disagreement on the appropriateness of DO. Agreement that DO confuses not only disorders and disease, but also disease courses. E.g.: ‘physical disorder’ is a subtype of ‘disease’, in direct contradiction to OGMS. Goodwill argument: DO at least purports to strive for compliance with realist principles. If perfection were a requirement to use an ontology, we could make no progress. Nevertheless, the persistent, glaring flaws of DO from the perspective of OGMS give serious pause on using it accurately and precisely.

28 What part of the EHR constitutes a diagnosis? Agreement that such part is built out of continuants that are concretizations of instances of ICE reflecting a diagnosis. Disagreement on the extent of the part that denotes the diagnosis: the mere concretization of the ICD-code and label? the above, plus the concretization of the patient identifier? Root cause of disagreement: distinct interpretations of the literature on the nitty-gritty of how to deal with ICE and concretizations thereof, how instances of ICE relate to other instances of ICE, what exactly the relata are of relationships such as aboutness and denotation. Examples: Can ICE be parts of other ICE or does parthood only apply to the independent continuants in which inhere the qualities that concretize the corresponding ICE? Is it the qualities concretizing the ICE that are about something or the ICE itself? See Smith & Ceusters, ICBO 2015

29 Are diagnoses to be assumed correct? (1) X interpreted both (1) the RT tuple that instantiated the disease as gout (by Doe) and (2) the RT tuple that instantiated it as osteoarthritis (by Thump) as being faithful representations of what Thump and Doe believed at the time they formulated their diagnoses. X did not believe himself to be recognizing both diagnoses as straightforwardly accurate and therefore resorted in his representation to a mechanism offered in RT to craft RTTs about RTTs that are later found to have been based on a misunderstanding of the reality at the time they were crafted.

30 Are diagnoses to be assumed correct? (2) Y crafted a representation that does not commit to what specific disease type(s) the patient’s disease actually is an instance of. This was achieved by representing the diagnoses to be simultaneously about the patient on the one hand (in contrast to X who represents the diagnoses to be about the disorder/ disease itself), and about the disease universals – gout and osteoarthrosis resp. – denoted by the respective ICD-codes and labels on the other hand. This aboutness-relation between an instance of ICE and a universal can be represented in RT but of course cannot be represented in OWL without recourse to workarounds such as those discussed by Schulz et al (2014).

31 Conclusions (1) The two authors agreed on the existence of key entities for the diagnoses to make sense. They agreed in general about the types instantiated by the particulars in the scenario, and how the particulars are related to each other. They chose different representational units and relations from different ontologies due to various issues such as potential lack of orthogonality in the OBO Foundry and in some cases disagreement on what types the classes in the ontologies represent. These distinctions exist, not because the authors entertained distinct competing conceptualizations, but because they expressed matters differently. Disagreements primarily due to different interpretations of the literature on ICEs.

32 Conclusions (2) Although this study is limited by the participation of only 2 subjects and the analysis of one report, it highlights the fact that the RT method and the clarity and precision it requires in representing reality is a powerful tool in identifying areas of needed improvement in existing, realism-based ontologies.

33 Acknowledgement This work was supported by award UL1 TR from the National Center for Advancing Translational Sciences. This paper is solely the responsibility of the authors and does not necessarily represent the official views of the NIH..