Download presentation
Presentation is loading. Please wait.
Published byHubert Cunningham Modified over 8 years ago
1
Consistency between Metathesaurus and Semantic Network Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA
2
2 Overview u Defining consistency u What does inconsistency mean? u Testing consistency l Comparing Metathesaurus relations to SN relations l Aligning Metathesaurus concepts and semantic types l Semantic type distribution of sets of descendants of Metathesaurus concepts u Suggestions l Enforcement mechanism l Ontology of relationships l CVF
3
Two levels in the UMLS
4
4 The UMLS: a two-level structure Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Semantic Type c Concept 2
5
Heart Concepts Metathesaurus 22 225 97 4 12 931 Esophagus Left Phrenic Nerve Heart Valves Fetal Heart Medias- tinum Saccular Viscus Angina Pectoris Cardiotonic Agents Tissue Donors Anatomical Structure Fully Formed Anatomical Structure Embryonic Structure Body Part, Organ or Organ Component Pharmacologic Substance Disease or Syndrome Population Group Semantic Types Semantic Network
6
6 Relationships can inherit semantics Semantic Network Metathesaurus Adrenal Cortex Adrenal Cortical hypofunction Disease or Syndrome Body Part, Organ, or Organ Component Pathologic Function isa Biologic Function isa Fully Formed Anatomical Structure isa location of
7
Defining consistency
8
8 The consistency “square” Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2
9
9 The categorization link Semantic Network Professional Society Metathesaurus Salmonella American Medical Association Organism Bacterium isa is an instance of
10
10 Semantic Network relations u 54 types of relationships u 558 asserted relations (SRSTR) u 6703 fully expanded relations (SRSTRE*) Semantic Network Disease or Syndrome Body Part, Organ, or Organ Component Pathologic Function isa Biologic Function isa Fully Formed Anatomical Structure isa location of
11
11 Metathesaurus relations u REL vs. RELA u Not always labeled u 106 additional types of relationships u ~7 M symbolic relations Heart Concepts Metathesaurus 22 225 97 4 12 931 Esophagus Left Phrenic Nerve Heart Valves Fetal Heart Medias- tinum Saccular Viscus Angina Pectoris Cardiotonic Agents Tissue Donors
12
12 Metathesaurus relations u Recorded l at the term level: from source vocabularies l at the concept level: from Metathesaurus editors u Aggregated at the concept level Oat cell carcinoma of lung Carcinoma, Small Cell SCLC Lung structure Lung Pulmonary has_finding_site Oat cell carcinoma of lung Carcinoma, Small Cell SCLC Lung structure Lung Pulmonary has_finding_site
13
13 Not all relationships in hierarchies are isa (1) Autoimmune Diseases Addison’s disease due to autoimmunity Tuberculous Addison’s disease is generally a
14
14 Not all relationships in hierarchies are isa (2) Environment and Public Health [G03] Public Health [G03.850] Accidents [G03.850.110] Accident Prevention [G03.850.110.060] + Accidental Falls [G03.850.110.085] Accidents, Aviation [G03.850.110.185] […] Drowning [G03.850.110.500] +
15
15 Defining consistency u SN rel. and Meta rel. must have the same direction u SN rel. and Meta rel. must be of the same type (both hierarchical or associative) u Meta rel. must be the same as SN rel. or one of its descendants Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2
16
16 Examples of consistent relations Lung Body Part, Organ, or Organ Component Disease or Syndrome Pneumonia has_location
17
17 Examples of consistent relations Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2 affects treats
18
What does inconsistency mean?
19
19 The consistency “square” revisited Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2 ? ? ? ?
20
20 What does inconsistency mean? u Inaccurate/missing Semantic Network relation u Inaccurate (/missing?) categorization u Inaccurate Metathesaurus relation
21
Testing consistency
22
22 (A) Consistency of associative relations [McCray & Bodenreider, 2002]
23
23 Results u 6894 pairs of related concepts l 4496 (65%): a SN relation can be inferred unambiguously n Validity confirmed in 1981 cases n 2515 not labeled in the Metathesaurus l 1491 (22%): multiple possible SN relationships n multiple possible Metathesaurus relationships l 907 (13%): inconsistency SN/Meta relationships n 372: no SN relationship between the STs n 415: inconsistent SN/Meta relationship type (REL) n 120: inconsistent SN/Meta relationship attribute (RELA)
24
24 (B) Consistency of hierarchical relations u Relations used l SN: isa l Categorization: isa l Metathesaurus: PAR/CHD + RB/RN u Hypothesis l For a pair of (ST, C), the concepts categorized by ST (and its descendants) correspond to the descendants of the concept C l In the set of descendants of C, expected STs are the ST of C (and its descendants) ➊ ➋
25
25 ST-based classes vs. descendants u Semantic type l List of all concepts having this semantic type u Concept l List of all descendants u Comparing the 2 sets l Intersection of the 2 sets ➊ [Bodenreider & Burgun, 2004]
26
26 Analyzing inconsistencies AmphibianAmphibia 1126 descendants 1135 concepts 1124 in common Tadpole Invertebrate Toad licking Pharmacologic Substance Miscategor- ization (?) Wrong hierarchical relation Missing hierarchical relation Miscategor- ization Rana unclassified Class Reptilia Amphibians and Reptiles
27
27 Semantic types of descendants u Concept l Set of all descendants u Distribution of semantic types in the set l Allowable STs: ST of C and its descendants (strict) or ST from the same semantic group (loose) [Mougin & Bodenreider, 2005] ➋
28
28 Analyzing inconsistencies u 26,584 concepts studied u 59% of their descendants have a semantic type incompatible with that of the original concept Reaction belligerent Finding Hostility Mental Process
29
29 # ------------------------------------------------------------ # C0597249 Neoplasm of placenta (disorder) (neop) # * B: 190 C0597249|ST|acab| 5.50|incp C0597249|ST|anab| 1.50|incp C0597249|ST|cgab| 76.50|incp C0597249|ST|dsyn| 27.50|incp C0597249|ST|inpo| 1.00|incp C0597249|ST|neop| 76.50|comp C0597249|ST|patf| 1.50|incp C0597249|SG|DISO| 190.00|comp # ------------------------------------------------------------ Analyzing inconsistencies
30
Suggestions
31
31 Aligning SN and Meta relationships u 54 types of SN relationships u 106 additional types of Metathesaurus relationships u Some are simply synonymous (caused_by / due_to; follows / temporally_follows) u Some are specialized relationships (manifestation_of / definitional_manifestation_of) u Many types of mapping relationships, not in SN
32
32 Add classification information to SN u Explicit classificatory principles (in addition to textual definition and examples) u Abandon economy principle and return to JEPD (jointly exhaustive/pairwise disjoint) approach
33
33 Metathesaurus editing environment u Use SN/Meta relation consistency as a constraint for assigning semantic types u Use SN relations to suggest labels for unspecified Meta relations u Use SN/Meta relation consistency to guide the review by the Metathesaurus editors l Inaccurate categorization? l Inaccurate Metathesaurus relation?
34
Conclusions
35
35 Conclusions u Simultaneously l Improve SN l Improve categorization u ST assignment can be automated in part
36
36 Some references u McCray AT, Bodenreider O. A conceptual framework for the biomedical domain. In: Green R, Bean CA, Myaeng SH, editors. The semantics of relationships: an interdisciplinary perspective. Boston: Kluwer Academic Publishers; 2002. p. 181-198. u Bodenreider O, Burgun A. Aligning knowledge sources in the UMLS: Methods, quantitative results, and applications. Medinfo 2004:327-331.
37
37 Some references u Burgun A, Bodenreider O. Aspects of the taxonomic relation in the biomedical domain. In: Welty C, Smith B, editors. Collected papers from the Second International Conference "Formal Ontology in Information Systems": ACM Press; 2001. p. 222-233. u Mougin F, Bodenreider O. Approaches to eliminating cycles in the UMLS Metathesaurus: Naive vs. formal. Proceedings of AMIA Annual Symposium 2005:(submitted).
38
Medical Ontology Research Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Contact:Web:olivier@nlm.nih.govmor.nlm.nih.gov
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.