Terminology Quality Evaluation S60 Rashmie Abeysinghe Joint work with

Presentation transcript:

Quality Assurance of NCI Thesaurus by Mining Structural-Lexical Patterns
Terminology Quality Evaluation, S60
Rashmie Abeysinghe
Joint work with Michael A. Brooks, Jeffery Talbert, Licong Cui
University of Kentucky

Disclosure
Licong Cui is part of the startup called Synamtics Inc.
AMIA 2017 | amia.org

Outline
NCI Thesaurus
Terminology Quality Assurance
Non-lattice Subgraphs
Structural-Lexical Features: Containment, Union, Intersection, Union-Intersection, Inference-Union, Inference-Contradiction
Results
Evaluation
Conclusion and Future Directions

NCI Thesaurus (NCIt)
National Cancer Institute (NCI) Thesaurus
First published in 2000
Contains over 118,000 concepts
Hierarchically organized in 19 domains, including: Abnormal Cell; Anatomic Structure, System, or Substance; Biological Process; Disease, Disorder or Finding; Molecular Abnormality
Maintained by a multidisciplinary team of editors
900 concepts added each month
Covers terminology for clinical care, translational and basic research, public information, and administrative activities

Terminology Quality Assurance (TQA)
Essential part of the terminology management lifecycle
Manual review: labor-intensive and time-consuming
Automating TQA is an active area of research
[Figure: example hierarchy annotated "Missing Relation!"]

Non-lattice Subgraphs
Lattice: a desirable property for a well-formed terminology*
Lattice: a DAG such that any two nodes have a unique maximal common descendant as well as a unique minimal common ancestor
[Figure: a non-lattice subgraph, with its upper bounds (U) and lower bounds (L) labeled]
*Zhang GQ, Bodenreider O. Large-scale, exhaustive lattice-based structural auditing of SNOMED CT. AMIA Annual Symposium Proc. 2010;922-26.
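The lattice condition above can be illustrated with a small sketch. It is not the auditing method from the cited work: it checks only the ancestor half of the definition (a unique minimal common ancestor), by brute force over all pairs, on a made-up toy hierarchy. The names `toy_parents`, `ancestors`, `minimal_common_ancestors`, and `non_lattice_pairs` are hypothetical.

```python
from itertools import combinations

# Toy is-a hierarchy (hypothetical): child -> set of direct parents.
# l1 and l2 share two upper bounds, u1 and u2, so the pair violates the lattice property.
toy_parents = {
    "u1": {"root"},
    "u2": {"root"},
    "l1": {"u1", "u2"},
    "l2": {"u1", "u2"},
}

def ancestors(parents, node):
    """Return all strict ancestors of node in a DAG given as child -> parents."""
    seen = set()
    stack = list(parents.get(node, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen

def minimal_common_ancestors(parents, a, b):
    """Common ancestors of a and b (each node counts as its own ancestor)
    that have no other common ancestor below them."""
    common = (ancestors(parents, a) | {a}) & (ancestors(parents, b) | {b})
    return {
        c for c in common
        if not any(c in ancestors(parents, d) for d in common if d != c)
    }

def non_lattice_pairs(parents):
    """Pairs of nodes with more than one minimal common ancestor."""
    nodes = set(parents) | {p for ps in parents.values() for p in ps}
    return [
        (a, b) for a, b in combinations(sorted(nodes), 2)
        if len(minimal_common_ancestors(parents, a, b)) > 1
    ]

pairs = non_lattice_pairs(toy_parents)
```

On the toy hierarchy, the only offending pair is `("l1", "l2")`, whose minimal common ancestors are `u1` and `u2`. A terminology with over 118,000 concepts would need something far more efficient than this pairwise loop.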

Structural-Lexical Features
Considering the label of a concept as a set of words in lower case:
Containment*: $U_i \subset U_j$ or $L_i \subset L_j$
Union*: $U_i \cup U_j = L_k$
Intersection*: $L_i \cap L_j = U_k$
Union-Intersection*: $U_i \cup U_j = L_s \cap L_t$
Inference-Union: $U_s \cup (L_i \cap L_j) = L_k$
Inference-Contradiction
*Cui L, Zhu W, Tao S, Case JT, Bodenreider O, Zhang GQ. Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT. JAMIA. 2017 Jul 1;24(4):788-798.
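The set-based patterns above translate fairly directly into set operations. The following is a minimal sketch, assuming labels are tokenized by splitting on whitespace after lower-casing; the function names and every concept label in it are made up for illustration and are not taken from NCIt or from the authors' implementation. Inference-Contradiction is omitted because no formula is given for it.

```python
def words(label):
    """Treat a concept label as a set of lower-case words (whitespace tokenization assumed)."""
    return frozenset(label.lower().split())

def containment(a, b):
    """a is a proper subset of b; applies to two upper bounds or two lower bounds."""
    return a < b

def union(u_i, u_j, l_k):
    """U_i ∪ U_j = L_k"""
    return (u_i | u_j) == l_k

def intersection(l_i, l_j, u_k):
    """L_i ∩ L_j = U_k"""
    return (l_i & l_j) == u_k

def union_intersection(u_i, u_j, l_s, l_t):
    """U_i ∪ U_j = L_s ∩ L_t"""
    return (u_i | u_j) == (l_s & l_t)

def inference_union(u_s, l_i, l_j, l_k):
    """U_s ∪ (L_i ∩ L_j) = L_k"""
    return (u_s | (l_i & l_j)) == l_k

# Hypothetical labels, chosen only so that each pattern fires.
checks = {
    "containment": containment(words("Lymphoma"), words("Splenic Lymphoma")),
    "union": union(
        words("Malignant Germ Cell Tumor"),
        words("Testicular Non-Seminomatous Germ Cell Tumor"),
        words("Malignant Testicular Non-Seminomatous Germ Cell Tumor"),
    ),
    "intersection": intersection(
        words("Splenic B Lymphoblastic Lymphoma"),
        words("Splenic T Lymphoblastic Lymphoma"),
        words("Splenic Lymphoblastic Lymphoma"),
    ),
    "union_intersection": union_intersection(
        words("Localized Liver Carcinoma"),
        words("Adult Liver Carcinoma"),
        words("Localized Resectable Adult Liver Carcinoma"),
        words("Localized Unresectable Adult Liver Carcinoma"),
    ),
    "inference_union": inference_union(
        words("Gallbladder Neoplasm"),
        words("Gallbladder Papillary Neoplasm"),
        words("Gallbladder Papillary Carcinoma"),
        words("Gallbladder Papillary Neoplasm"),
    ),
}
```

With these made-up labels every entry in `checks` is `True`. In practice the sets would come from the upper and lower bounds of each non-lattice subgraph, and a real tokenizer would likely need to handle punctuation.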

Containment: $U_i \subset U_j$ or $L_i \subset L_j$
[Figure: non-lattice subgraph with lower bounds $L_i$ and $L_j$, where $L_j \subset L_i$]

Containment: $U_i \subset U_j$ or $L_i \subset L_j$
[Figure: suggested fix]

Union: $U_i \cup U_j = L_k$
[Figure: non-lattice subgraph with upper bounds $U_i$, $U_j$ and lower bound $L_k$, where $U_i \cup U_j$ = {malignant, testicular, non-seminomatous, germ, cell, tumor} = $L_k$]

Union: $U_i \cup U_j = L_k$
[Figure: suggested fix]

Intersection: $L_i \cap L_j = U_k$
[Figure: non-lattice subgraph with upper bound $U_k$ and lower bounds $L_i$, $L_j$, where $L_i \cap L_j$ = {splenic, lymphoblastic, lymphoma} = $U_k$]

Intersection: $L_i \cap L_j = U_k$
[Figure: suggested fix]

Union-Intersection: $U_i \cup U_j = L_s \cap L_t$
[Figure: non-lattice subgraph with upper bounds $U_i$, $U_j$ and lower bounds $L_s$, $L_t$, where $U_i \cup U_j$ = $L_s \cap L_t$ = {localized, adult liver, carcinoma}]

Union-Intersection: $U_i \cup U_j = L_s \cap L_t$
[Figure: suggested fix]

Inference-Union: $U_s \cup (L_i \cap L_j) = L_k$
[Figure: non-lattice subgraph with upper bound $U_s$ and lower bounds $L_i$, $L_j$, where $L_i \cap L_j$ = {gallbladder, papillary} and $U_s \cup (L_i \cap L_j)$ = {gallbladder, papillary, neoplasm} = $L_i$]

Inference-Union: $U_s \cup (L_i \cap L_j) = L_k$
[Figure: suggested fix]

Inference-Contradiction
[Figure: non-lattice subgraph, annotated "anaplastic : neoplastic" and "large"]

Inference-Contradiction
[Figure: suggested fix]

Five Patterns! Union, Union-Intersection, Inference-Union, Inference-Contradiction, Containment

Results
In total, 8,143 non-lattice subgraphs were identified
809 of those exhibited lexical patterns
678 single patterns
131 multiple patterns
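As a quick consistency check on the counts above, the sketch below confirms that the single-pattern and multiple-pattern subgraphs add up to the reported 809 and works out what share of all non-lattice subgraphs that is. The variable names are illustrative, and the share is derived here rather than reported on the slide.

```python
total_subgraphs = 8143   # non-lattice subgraphs identified
with_patterns = 809      # subgraphs exhibiting lexical patterns
single_pattern = 678
multiple_patterns = 131

# The two categories should partition the pattern-bearing subgraphs.
counts_agree = single_pattern + multiple_patterns == with_patterns

# Share of non-lattice subgraphs that exhibit at least one pattern (derived).
pattern_share = with_patterns / total_subgraphs
```

The counts agree, and roughly one in ten non-lattice subgraphs (about 9.9%) exhibits at least one pattern.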

Evaluation

Evaluation
Single-pattern non-lattice subgraphs: 44%
Multiple-pattern non-lattice subgraphs: 88%
Overall: 66%

Conclusion
We investigated a hybrid approach to identifying potential errors in NCIt
Remediations were automatically suggested
An effective way for error detection and correction
Applicable to other biomedical terminologies

Future Work
Investigate larger non-lattice subgraphs for evaluation
Use concept synonyms to complement concept labels
Find new patterns to uncover more errors

Acknowledgement
This work was supported by:
National Institutes of Health, National Center for Advancing Translational Sciences, through grant UL1TR001998
National Science Foundation, through grant IIS-1657306
I would like to thank Dr. Licong Cui for the guidance.

Thank you!
Email me at: rashmie.abeysinghe@uky.edu