Download presentation
Presentation is loading. Please wait.
Published byMaximillian Baldwin Modified over 8 years ago
1
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U MHI501 – Introduction to Health Informatics SNOMED and the UMLS: an introduction to biomedical terminology SUNY at Buffalo - November 9, 2011 Werner CEUSTERS Center of Excellence in Bioinformatics and Life Sciences Ontology Research Group University at Buffalo, NY, USA http://www.org.buffalo.edu/RTU
2
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U What is going on here ? ah sh uqwu ja ? ywye has uw has!
3
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U L1 L2 L3 What is going on here ? ah sh uqwu ja ? ywye has uw has!
4
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 4 There is an external reality which is ‘objectively’ the way it is, and That reality is accessible to us; – L1 We build in our brains cognitive representations of reality; – L2 We communicate with others about what is there, and what we believe there is there. – L3 Basic axioms of Ontological Realism Smith B, Kusnierczyk W, Schober D, Ceusters W. Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. Proceedings of KR-MED 2006, Biomedical Ontology in Action, November 8, 2006, Baltimore MD, USA
5
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Language is ambiguous ‘I know that you believe that you understood what you think I said, but I am not sure you realize that what you heard is not what I meant.’ –Robert McCloskey, State Department spokesman (attributed). http://www.quotationspage.com/quotes/Robert_McCloskey/
6
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Language is ambiguous Often we can figure it out … in Miami hotel lobby warning on plastic bag
7
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Language is ambiguous in Amsterdam hotel elevator Sometimes, we can not …
8
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U A double mystery (It is argued that) On September 9th, 1935, Carl Austin Weiss shot Senator Huey Long in the Louisiana State Capitol with a.35 calibre pistol. Long died from this wound thirty hours later on September 10th. Weiss, on the other hand, received between thirty-two and sixty.44 and.45 calibre hollow point bullets from Long's agitated bodyguards and died immediately. Sorensen, R., 1985, "Self-Deception and Scattered Events", Mind, 94: 64-69. Questions: –Did Weiss kill Senator Long ? –If so, when did he kill him ?
9
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 9 Relevance to Healthcare? ‘Nuances in the English language can be both challenging and amusing, however, when variants in language impact treatment, safety and billing, it is all challenge and no humor. Although English contains a reasonable degree of conformity, divergence in phrasing and meaning can compound comprehension problems and impact patient safety. These language "woes" can be minimized through the use of sophisticated healthcare IT systems...’ Schwend GT. The language of healthcare. Variance in the English language is harming patients and hospitals' bottom lines. Is healthcare IT the solution? Health Manag Technol. 2008 Feb;29(2):14, 16, 18
10
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Indeed, these days, not just people communicate …
11
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Understanding content (1) “John Doe has a pyogenic granuloma of the left thumb” We see: The machine sees:
12
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Understanding content (2) John Doe pyogenic granuloma of the left thumb The XML misunderstanding We see: The machine sees:
13
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U The clever (?) business man and his XML card John Nitwit 524 Moon base avenue Utopia … Is this the name of the business card or of the business card owner?
14
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U ({valgising}5(osteotomy)1{[of]3(humerus)2}4)22 ((cutting)21 {[TO_ACHIEVE]6((Deed:valgising)7 {[ACTS_ON]17(Pathology:pathologicalposture)18}19)20}5 {[ACTS_ON]3(Anatomy:humerus)2}4)22 A better example
15
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Intermediate conclusion We need for sure methods and techniques that allow: –people to express exactly what they mean, –people to understand exactly what is communicated to them, –machines to communicate information without any distortion. If information overload is a problem, we also need methods and techniques that allow machines to understand exactly what is communicated to them.
16
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Relevance to Healthcare? ‘Nuances in the English language can be both challenging and amusing, however, when variants in language impact treatment, safety and billing, it is all challenge and no humor. Although English contains a reasonable degree of conformity, divergence in phrasing and meaning can compound comprehension problems and impact patient safety. These language "woes" can be minimized through the use of sophisticated healthcare IT systems with terminology management services.’ Schwend GT. The language of healthcare. Variance in the English language is harming patients and hospitals' bottom lines. Is healthcare IT the solution? Health Manag Technol. 2008 Feb;29(2):14, 16, 18
17
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U ah sh uqwu ja ? ywye has uw has! OBO Foundry
18
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 18 The Terminology / Ontology divide Terminology: –solves certain issues related to language use, i.e. with respect to how we talk about entities in reality (if any); Relations between terms / concepts –does not provide an adequate means to represent independent of use what we talk about, i.e. how reality is structured; Women, Fire and Dangerous Things (Lakoff). Ontology (of the right sort) : –Language and perception neutral view on reality. Relations between entities in first-order reality This is the ‘terminology / ontology divide’
19
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Terminology (is) for dummies
20
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U But what the word ‘concept’ denotes, is usually not clarified and users of it often refer to different entities in a haphazard way: Most terminologies are ‘concept’-based Smith B, Kusnierczyk W, Schober D, Ceusters W. Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. Proceedings of KR-MED 2006, Biomedical Ontology in Action, November 8, 2006, Baltimore MD, USA
21
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U SNOMED about diseases and concepts (until 2010) ‘Disorders are concepts in which there is an explicit or implicit pathological process causing a state of disease which tends to exist for a significant length of time under ordinary circumstances.’ And also: “Concepts are unique units of thought”. Thus: Disorders are unique units of thoughts in which there is a pathological process …??? And thus: to eradicate all diseases in the world at once we simply should stop thinking ?
22
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U SNOMED now (since version 201007) Concept: An ambiguous term. Depending on the context, it may refer to: 1.A clinical idea to which a unique ConceptId has been assigned. 2.The ConceptId itself, which is the key of the Concepts Table (in this case it is less ambiguous to use the term ‘concept code’). 3.The real-world referent(s) of the ConceptId, that is, the class of entities in reality which the ConceptId represents.
23
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U But what the word ‘concept’ denotes, is usually not clarified and users of it often refer to different entities in a haphazard way: meaning shared in common by synonymous terms idea shared in common in the minds of those who use these terms unit of describing meanings knowledge universal that what is shared by all and only all entities in reality of a similar sort Most terminologies are ‘concept’-based Smith B, Kusnierczyk W, Schober D, Ceusters W. Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. Proceedings of KR-MED 2006, Biomedical Ontology in Action, November 8, 2006, Baltimore MD, USA
24
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Structure of remaining part of this talk Discuss several well-known coding systems, classifications, terminologies, etc, … Students should try to find the gazillion ways in which the principles of a coherent language-reality model are violated. Since there are a gazillion violations, it should not be too difficult to find many. Therefore, I make it more challenging by not listing the principles first.
25
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U An easy starter: Border’s classification of ‘medicine’
26
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 26 Border’s classification of medicine: what’s wrong ? Medicine –Mental health –Internal medicine Endocrinology –Oversized endocrinology Gastro-enterology... –Pediatrics –... –Oversized medicine Refer to the size of the books that do not fit on a normal Border’s Bookshop shelf
27
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U MeSH: Medical Subject Headings
28
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 28 MeSH
29
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 29 Use of MeSH in PUBMED Wolfram syndrome
30
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 30 MeSH: Medical Subject Headings Designed for bibliographic indexing, eg Index Medicus Basis for MedLINE Pubmed focuses on biomedicine and other basic healthcare sciences clinically very impoverished Consistency amongst indexers: –60% for headings –30% for sub-headings
31
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 31 MeSH Tree Structures - 2004 1. Anatomy [A] 2. Organisms [B] 3. Diseases [C] 4. Chemicals and Drugs [D] 5. Analytical, Diagnostic and Therapeutic Techniques and Equipment [E] 6. Psychiatry and Psychology [F] 7. Biological Sciences [G] 8. Physical Sciences [H] 9. Anthropology, Education, Sociology and Social Phenomena [I] 10. Technology and Food and Beverages [J] 11. Humanities [K] 12. Information Science [L] 13. Persons [M] 14. Health Care [N] 15. Geographic Locations [Z]
32
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 32 Is this a good idea ? Cover subject matter of papers Cover the form of papers
33
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 33 Principle A representation should not mix object language and meta language –object language describes the referents in the subject domain –meta language describes the object language
34
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 34 Geographic Locations: a good hierarchy ? Africa [Z01.058] + Americas [Z01.107] + Antarctic Regions [Z01.158] Arctic Regions [Z01.208] Asia [Z01.252] + Atlantic Islands [Z01.295] + Australia [Z01.338] + Cities [Z01.433] + Europe [Z01.542] + Historical Geographic Locations [Z01.586] + Indian Ocean Islands [Z01.600] + Oceania [Z01.678] + Oceans and Seas [Z01.756] + Pacific Islands [Z01.782] + mereological mess mixture of geographic entities with socio- political entities mixture of space and time
35
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 35 Geographic Locations [Z01] Africa [Z01.058] + Americas [Z01.107] + Antarctic Regions [Z01.158] Arctic Regions [Z01.208] Asia [Z01.252] + Atlantic Islands [Z01.295] + Australia [Z01.338] + Cities [Z01.433] + Europe [Z01.542] + Historical Geographic Locations [Z01.586] + Indian Ocean Islands [Z01.600] + Oceania [Z01.678] + Oceans and Seas [Z01.756] + Pacific Islands [Z01.782] + Ancient Lands [Z01.586.035] + Austria-Hungary [Z01.586.117] Commonwealth of Independent States [Z01.586.200] + Czechoslovakia [Z01.586.250] + European Union [Z01.586.300] Germany [Z01.586.315] + Korea [Z01.586.407] Middle East [Z01.586.500] + New Guinea [Z01.586.650] Ottoman Empire [Z01.586.687] Prussia [Z01.586.725] Russia (Pre-1917) [Z01.586.800] USSR [Z01.586.950] + Yugoslavia [Z01.586.980] +
36
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 36 Geographic Locations [Z01] Africa [Z01.058] + Americas [Z01.107] + Antarctic Regions [Z01.158] Arctic Regions [Z01.208] Asia [Z01.252] + Atlantic Islands [Z01.295] + Australia [Z01.338] + Cities [Z01.433] + Europe [Z01.542] + Historical Geographic Locations [Z01.586] + Indian Ocean Islands [Z01.600] + Oceania [Z01.678] + Oceans and Seas [Z01.756] + Pacific Islands [Z01.782] + Ancient Lands [Z01.586.035] + Austria-Hungary [Z01.586.117] Commonwealth of Independent States [Z01.586.200] + Czechoslovakia [Z01.586.250] + European Union [Z01.586.300] Germany [Z01.586.315] + Korea [Z01.586.407] Middle East [Z01.586.500] + New Guinea [Z01.586.650] Ottoman Empire [Z01.586.687] Prussia [Z01.586.725] Russia (Pre-1917) [Z01.586.800] USSR [Z01.586.950] + Yugoslavia [Z01.586.980] +
37
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 37 Principle A hierarchical structure should not represent distinct hierarchical relations unless they are formally characterized
38
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 38 Cardiovascular Diseases [C14] – Heart Diseases [C14.280] Arrhythmia [C14.280.067] + Carcinoid Heart Disease [C14.280.129] Cardiomegaly [C14.280.195] + Endocarditis [C14.280.282] + Heart Aneurysm [C14.280.358] Heart Arrest [C14.280.383] + Heart Defects, Congenital [C14.280.400] – Aortic Coarctation [C14.280.400.090] – Arrhythmogenic Right Ventricular Dysplasia [C14.280.400.145] – Cor Triatriatum [C14.280.400.200] – Coronary Vessel Anomalies [C14.280.400.210] – Crisscross Heart [C14.280.400.220] – Dextrocardia [C14.280.400.280] + MeSH Tree Structures – 2004
39
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 39 Diabetes Mellitus in MeSH 2008 ? Different set of more specific terms when different path from the top is taken. 2
40
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 40 MeSH: some paths from top to Wolfram Syndrome Wolfram Syndrome All MeSH Categories Diseases Category Nervous System Diseases Cranial Nerve Diseases Optic Nerve Diseases Optic Atrophy Optic Atrophies, Hereditary Neurodegenerative Diseases Heredodegenerative Disorders, Nervous System Eye Diseases Eye Diseases, Hereditary Optic Nerve Diseases Male Urogenital Diseases Urologic Diseases Kidney Diseases Diabetes Insipidus Female Urogenital Diseases and Pregnancy Complications Female Urogenital Diseases
41
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 41 What would it mean if used in the context of a patient ? Wolfram Syndrome All MeSH Categories Diseases Category Nervous System Diseases Cranial Nerve Diseases Optic Nerve Diseases Optic Atrophy Optic Atrophies, Hereditary has Neurodegenerative Diseases Heredodegenerative Disorders, Nervous System Eye Diseases Eye Diseases, Hereditary Optic Nerve Diseases Female Urogenital Diseases and Pregnancy Complications Female Urogenital Diseases Male Urogenital Diseases Urologic Diseases Kidney Diseases Diabetes Insipidus ??? … has
42
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 42 Principle If a particular (individual) is related in a specific way to a ‘class’, it should also be related in the same way to all the ‘superclasses’ of that class –Technically: “… to all the classes that subsume that class”
43
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 43 What would it mean for a particular disease? Wolfram Syndrome All MeSH Categories Diseases Category Nervous System Diseases Cranial Nerve Diseases Optic Nerve Diseases Optic Atrophy Optic Atrophies, Hereditary has Neurodegenerative Diseases Heredodegenerative Disorders, Nervous System Eye Diseases Eye Diseases, Hereditary Optic Nerve Diseases Female Urogenital Diseases and Pregnancy Complications Female Urogenital Diseases Male Urogenital Diseases Urologic Diseases Kidney Diseases Diabetes Insipidus #x isa … Referent tracking ??? How can something which is an eye disease be a urologic disease ?
44
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 44 Principle If a particular is an instance of a ‘class’, it should also be an instance of all the ‘superclasses’ of that class
45
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 45 Body Regions [A01] –Extremities [A01.378] Lower Extremity [A01.378.610] –Buttocks [A01.378.610.100] –Foot [A01.378.610.250] »Ankle [A01.378.610.250.149] »Forefoot, Human [A01.378.610.250.300] + »Heel [A01.378.610.250.510] –Hip [A01.378.610.400] –Knee [A01.378.610.450] –Leg [A01.378.610.500] –Thigh [A01.378.610.750] What’s wrong ? MeSH Tree Structures – 2007
46
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 46 Body Regions [A01] – Abdomen [A01.047] + – Back [A01.176] + – Breast [A01.236] + – Extremities [A01.378] Amputation Stumps [A01.378.100] Lower Extremity [A01.378.610] + Upper Extremity [A01.378.800] + – Head [A01.456] + – Neck [A01.598] – Pelvis [A01.673] + – Perineum [A01.719] – Thorax [A01.911] + – Viscera [A01.960] MeSH Tree Structures – 2007 What’s wrong ?
47
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U SNOMED: Systematized Nomenclature of Medicine
48
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 48 SNOMED’s origin: SNOP from CAP (1965)
49
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 49 Since mid 2007
50
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 50 Snomed International (1995) Number of records (V3.1) TTopography12,385 MMorphology 4,991 FFunction16,352 LLiving Organisms24,265 CDrugs &Biological Products14,075 APhysical Agents, Forces and Activities 1,355 DDisease/ Diagnosis28,623 PProcedures27,033 SSocial Context 433 JOccupations 1,886 GGeneral Modifiers 1,176 TOTAL RECORDS 132,641
51
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U SNOMED CT concepts’ status (July 2010) 100%391,170TOTAL 0.29%1,142inactive because found to contain a mistake 5 4.05%15,858inactive because inherently ambiguous. 4 0.37%1,439inactive because no longer recognized as a valid clinical concept (outdated) 3 9.65%37,752inactive: withdrawn because duplication 2 3.69%14,451inactive because moved elsewhere10 1.92%7,525inactive: ‘retired’ without a specified reason 1 5.35%20,930active with limited clinical value (classification concept or an administrative definition) 6 74.677%292,073active in current use0 %NConcept StatusST
52
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 52 Number of changes over time (2007) 100%1,042,871100%373,731Total 0.00%0 18 2 07 3 36 430.02%745 0.17%1,7280.54%2,0304 1.23%12,8715.34%19,9723 15.20%158,48832.35%120,9132 83.40%869,73661.74%230,7381 %N%NN DescriptionsClassesMods
53
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Changes in SNOMED CT Oldest-FSN - CONCEPTID 12345678910111213141516CRDToTToC Retrograde catheter ureteropyelography (procedure) - 176086008 192119118933323632274028012582261242 Pereyra procedure including anterior colporrhaphy (procedure) - 40587007 2014222322 302120 2620410012523256236 Epidural injection of neurolytic solution, caudal (procedure) - 58418003 1722111216182610 371012347022473252235 Infusion of intra-arterial thrombolytic agent with percutaneous transluminal coronary angioplasty, multiple vessels (procedure) - 80762004 2610145219402629282907520012572260234 Infusion of intra-arterial thrombolytic agent with percutaneous transluminal coronary angioplasty, single vessel (procedure) - 91338001 2610145219402629282907520012572260234 Reduction of closed carpometacarpal fracture dislocation of thumb with manipulation and skeletal fixation (procedure) - 46989008 17131110 152520 28331933142386248231 Percutaneous transluminal injection of therapeutic substance into coronary artery NEC (procedure) - 175072001 1712600738364036280758012372240223 Open reduction of fracture of femur with internal fixation (procedure) - 57470004 20178548373430 3316000022382242222 Revision to open reduction of fracture dislocation and external fixation (procedure) - 179692000 30237204131412 313422 20022422246216 Ocular-mucous membrane syndrome (disorder) - 95331002 273315201415262021 300010012375243216 Top 10 of concepts according to direct changes between Jan 2002 and July 2009
54
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Indirect changes to Adenoma of small intestine
55
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Indirect changes to Cell phenotyping performed
56
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 56 Snomed International (1995): knowledge in the codes. posterior anatomic leaflet mitral cardiac valve cardiovascular T-23532 Why was this not a good idea ?
57
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 57 Snomed International : multiple ways to express the same thing D5-46210Acute appendicitis, NOS D5-46100Appendicitis, NOS G-A231Acute M-41000Acute inflammation, NOS G-C006In T-59200Appendix, NOS G-A231Acute M-40000Inflammation, NOS G-C006In T-59200Appendix, NOS
58
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 58 Using a formal language SNOMED RT syntax: D5-30150: D5-30100 & (assoc-topography T-56000) & (assoc-morphology M-40000) & (assoc-etiology F-06030) (T-56000)(M-40000)(F-06030) Parent term in the SNOMED III hierarchy: D5-30100 Esophagitis, NOS D5-30150 Postoperative esophagitis T-56000 Esophagus M-40000 Inflammation F-06030 Post-operative state SNOMED III:
59
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 59 Find the problem SNOMED-RT (2000) SNOMED-CT (2003) DL don’t guarantee you to get parthood right !
60
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 60 Find the problem
61
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 61 Find the problem new-1 new-2
62
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 62 Inconsistent response to queries Term search: heart tumor Concept search: heart structure AND tumor
63
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 63 Find the problem What is the problem ? Missing: ISA neoplasm of heart
64
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 64 Find the problem terms
65
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 65 Find the problem No ! Although suggested, that is not what is expressed. Can there be something that is an excision and an implantation ? Does “testis implantation” mean that a testis is implanted ?
66
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 66 Find the problem
67
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 67 Find the problem
68
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 68 Find the problem
69
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 69 Find the problem
70
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 70 Uncorrected lexical mapping
71
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 71 Find the problem
72
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 72 Find the problem
73
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 73 Find the problem
74
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 74 Find the problem
75
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 75 Snomed CT (July 2007): “fractured nasal bones”
76
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 76 SNOMED-CT: abundance of false synonymy nose bones fracture
77
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 77 Coding / Classification confusion A patient with a fractured nasal bone A patient with a broken nose A patient with a fracture of the nose = =
78
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 78 A patient with a fractured nasal bone A patient with a broken nose A patient with a fracture of the nose = = Coding / Classification confusion A patient with a fractured nasal bone A patient with a broken nose A patient with a fracture of the nose = =
79
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 79 Snomed CT (July 2007): “fractured nasal bones”
80
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 80 Snomed CT (July 2007): “fractured nasal bones” Problems of multiple inheritance: – (1) “… ISA fracture of skull and facial bones” Which facial bones are not part of the skull ? If there would be non-skull facial bones, how many fractures are then required ? –(2) “… ISA fracture of mid-facial bones” Which mid-facials bones or not facial bones ? –If all, then (1) is redundant –(3) “… ISA injury of nasal bones” Are not all fractures “injuries’ and if not, why would then all nasal fractures be injuries ?
81
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 81 Find the problem
82
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 82 Evolution in SNOMED-CT
83
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 83
84
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U UMLS: Unified Medical Language System
85
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 85 UMLS: Unified Medical Language System Tool for information retrieval of 4 components: –Metathesaurus contains information about biomedical concepts and how they are represented in diverse terminological systems. –Semantic Network contains information about concept categories and the permissible relationships among them –Information Sources Map contains both human-readable and machine-processable information about all kinds of biomedical terminological systems –Specialist lexicon: english words with POS
86
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U
87
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Linking of concept-based resources Declaring equivalence (synonymy) between concepts from distinct resources comes down to: –stating that these concepts mean the same thing, –however not what they mean. There is no common underlying ontology: –Trying to decipher what is meant by concept structures from separate resources is hampered by inconsistencies between these resources. Use of an ontology language such as OWL might reveal the inconsistencies, but not resolve them.
88
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 88 UMLS Semantic Network
89
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 89 Semantic Network Relationships Is_a physically related to spatially related to temporally related to functionally related to conceptually related to
90
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 90 Semantic Network “Biologic Function” Hierarchy
91
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 91 Semantic Network "affects" Hierarchy
92
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 92 Metathesaurus: merging terminologies cycles in hierarchical relationships
93
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U Conclusion
94
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 94 Summary of current deficiencies in traditional and formal terminologies (1) Terms often require “reading in context” Agrammatical constructions (paper-based indexing) Semantic drift as one moves between hierarchies Not (yet) useful for natural language understanding by software (but were not designed for that purpose)
95
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 95 Summary of current deficiencies in traditional and formal terminologies (2) labels for terms do not correspond with formal meaning underspecification (leading to erroneous classification in DL-based systems) overspecification (leading to wrong assumptions with respect to instances)
96
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 96 Axiom Concept-based terminology (and standardisation thereof) is there as a mechanism to improve understanding of messages by humans. It is NOT the right device –to explain why reality is what it is, how it is organised, etc., (although it is needed to allow communication), –to reason about reality, –to make machines understand what is real, –to integrate across different views, languages, conceptualisations,...
97
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 97 Why not ? Does not take care of universals and particulars appropriately Concepts not necessarily correspond to something that (will) exist(ed) –Sorcerer, unicorn, leprechaun,... Definitions set the conditions under which terms may be used, and may not be abused as conditions an entity must satisfy to be what it is Language can make strings of words look as if it were terms –“Middle lobe of left lung”...
98
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U 98 Link to ontologies Building high quality ontologies is hard. Not everybody has the right skills –Experts in driving cars are not necessarily experts in car mechanics (and the other way round). Ontologies should represent the state of the art in a domain, i.e. the science. –Science is not a matter of consensus or democracy (cfr HL7 RIM problem). Natural language relates more to how humans talk about reality or perceive it, than to how reality is structured. No high quality ontology without the involvement of ontologists.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.