Browsing SNOMED CT : Finding Terms

Slides:



Advertisements
Similar presentations
A. a Alpha Pronounced a as in hat b B b e b beta.
Advertisements

Organization of the Human Body – Body Cavities and Membranes
Digestive system.
Suzanne D'Anna1 Body Cavities. Suzanne D'Anna2 Body is divided into two portions: n axial - head, neck, and trunk n appendicular - limbs.
K N a r o z e n i n á m T i p ř e j i, a b y T v ů j ž i v o t b y l k r á s n ý, n e k o n e č n ý s e n, a b y l á s k a, š t ě s t í a z d r a v í.
In Ancient Greece, there were two main languages. The first being Latin, and the second being basic Greek. Like English, those two languages went threw.
P ř i š l i j s m e d n e s s l a v i t s T e b o u, t a k n á s t u v j e d n é ř a d ě m á š, v š e c h n a p ř á n í m á m e s e b o u, T y u ž j e.
Unit 1: Introduction to Anatomy
Body Planes and Cavities
Chapter 22 The Chest and Abdomen.
Rat Dissection Review.
Chapter 2: The Language of Anatomy
Blood vessels & circulation
Medical Terminology List 3 Chapter 2.
The Language of Anatomy
Organization of the Human Body
In World War Two, Italy first attacked Greece, but Greece defended themselves. Then Germany attacked Greece and then Greece lost the war. Greece’s.
Major arteries of the body.
 5 th Place  Delta Gamma / Alpha Epsilon Pi  Alpha Phi / Sigma Pi / Tau Kappa Epsilon.
Terms Pertaining to the Body as a Whole
Human Body head, neck, trunk, extremities.
a b g d e z h q i k l m n x o p r s j t u f c y w.
Α Α. a Alpha “short a” as in father β Β b beta.
© 2010 Delmar, Cengage Learning 1 © 2011 Delmar, Cengage Learning PowerPoint Presentation to Accompany.
Ultrasound abdomen atlas A&E medical meeting 28/07/2011 Dr. David Tran.
Medical Terminology Organization of the Body. Organization of the Body Objectives:  To name the body systems and their functions  To identify body cavities.
Radiographic Physiology Cardiovascular System Arteries and Veins Cardiovascular System.
Body Cavities. Dorsal body cavity Cranial cavity Space inside bony skull Contains brain.
Language of Anatomy. Why do we have an “Anatomical position”? Anatomical reference points Common to all health care professionals – Physiotherapy – Nurses.
Body Cavities. Dorsal body cavity Cranial cavity Space inside bony skull Contains brain Spinal cavity Extends from cranial cavity to bottom of spine Spinal.
Large intestine.
ANATOMY LECTURE #4 Body Cavities and Regions. BODY REGIONS Appendicular=upper and lower limbs. Axial=head, neck, and trunk.
Injury Coding Ensuring coding uniformity
Cat Dissection Digestive Labs.
Mink Dissection Review. Menu Neck & Thoracic Cavity Abdominal Cavity Heart Blood Vessels Urinary System Reproductive Systems.
“Beginning New Testament Greek” “Beginning New Testament Greek”
CHAPTER 1 INTRO. TO A&P. Intro to A&P Anatomy – Physiology – deals with functions & how body parts operate.
Body Cavities & Membranes : Organization of the Human Body Body cavities Thoracic cavity Abdominopelvic cavity Abdominal cavity Diaphragm Pelvic.
ANATOMICAL LANGUAGE BIO 137 Anatomy & Physiology I Lab.
Mink Dissection Review
Major Body Arteries.
Circulatory System STRUCTURES Blood Heart Arteries Capillaries Veins
Cat Dissection Lymphatic System.
Anatomic References.
THE HUMAN BODY To administer first aid you do not need to be an expert, but basic knowledge of the body’s structures and how they work will help you recognize.
THE COMPANIES (GENERAL) REGULATIONS, 2015
Circulation + Blood Pressure
CT of the abdomen.
Anatomic References.
Do Now Get with your partner from the case studies you worked on yesterday. Take a few moments to review your information, and get comfortable with it.
Contain the visceral organs
Common cancers and NICE
Human Organ Systems.
Unit 16: Abdomen & Thoracic Injuries
Fetal Pig Dissection Review
Abdominal Cavity Small Intestine Umbilical Vein Rectum Bladder Liver
DIGESTIVE SYSTEM STRUCTURE.
Chapter 2: Body Organization Review
CHAPTER 1 INTRO. TO A&P.
Vascular Anatomy Vascular Anatomy.
Frog Body Parts and Functions (Know the terms in green)
Injuries to the Chest and Abdomen
Excretory Respiratory System: Function(s): System: Function(s):
The Greek Alphabet The Greek Alphabet.
Fetal Pig Dissection Review
Body Systems Text Lectures 6 lecture course: saeed alhussani.
Anatomic References.
ORGAN SYSTEMS.
Presentation transcript:

Browsing SNOMED CT : Finding Terms Dr Jeremy Rogers jeremy.rogers@nhs.net Principal Terminology Specialist, UK Terminology Centre (NHS Connecting for Health) Webinar to IHTSDO I&I SIG, November 1st 2011

Beginning at the beginning…

The Problem: 3 SCT browser implementations + 1st sixteen search phrases I thought of pain lower back low back pain traveller diarrhoea acute asthma breast lump breast mass medication review vaccine given enlarged node morning headache staging scan urge incontinence excision biopsy post operative analgesia transoesophageal ultrasound emergency caesarean

‘low back pain’

‘pain lower back’

‘traveller diarrhoea’

‘acute asthma’

‘breast lump’

‘breast mass’

‘medication review’

‘vaccine given’

The Problem: 3 Browser implementations + 1st sixteen search phrases I thought of pain lower back low back pain traveller diarrhoea acute asthma breast lump breast mass medication review vaccine given enlarged node 3+ 5+ 0 morning headache 1 1 1 staging scan 2 0 0 urge incontinence 5 5 3 excision biopsy 52+ 29+ 1 post operative analgesia 1 0 0 transoesophageal ultrasound 6 0 0 emergency caesarean 6 6 0

The Problem: CodeForString(str) operation Many different strategies can be employed All produce different false +ve and –ve results Most strategies in at least one existing implementation Very few implementations with all strategies Naïve approaches predominate But clever strategies can be slow and need extra lexical resources Different strategies  different results Different results  interrater variability Interrater variability  reduced interoperability

From term to concept… TOKENISATION Non WS token separators Character stripping QUERY EXPANSION Word equivalenceTIG 6.5.4 Diacritic/Char substn Metaphone / SoundEx Levenstein Distance Token concatenation QUERY CONTRACTION Stopword lists Word stemming QUERY EXECUTION Wildcard characters in corpus Exact matching Case sensitivity Force include/exclude tokens Query token proximity Query token (re-)orderingTIG Default wildcarding Multitoken semanticsTIG Token occurrence inheritance Match scoring and ranking Match grouping & ordering

IHTSDO Lexical Resources Stopwords zres_ExcludedWords_en-US_INT_20070731.txt (n=53) Virtually unchanged since 2002 IHTSDO Stopwords 2010    (2002 version) ABOUT CANNOT MORE SPECIFIC ALONGSIDE CHRONICALLY MUST THAN AN CONSISTS NO THAT AND COVERED NOT THE ANYTHING DOES OF THINGS AROUND DURING ON THIS AS EVERY ONLY THROUGHOUT AT FINDS OR TO BECAUSE FOR PROPERLY UP BEFORE FROM SIDE USING BEING IN SIDED USUALLY BOTH INSTEAD SOME WHEN BY INTO SOMETHING WHILE WITHOUT DOWN? WITH?

mySQL 5.5 Stopword List 1st, 2nd, 3rd Trimester ? 1st, 2nd 3rd degree burn ? ALONGSIDE PROPERLY 6 CHRONICALLY 7 SIDE 238 CONSISTS SIDED 73 COVERED 28 SPECIFIC 1356 FINDS THINGS 26 10 tokens in IHTSDO list, but not in mySQL list…

Word Equivalence: 64 ‘tranquilisers’ Tranquilliser(s) n=34 (26) Tranquilizer(s) N=9 (4) 1 20 6 24 3 10 Tranquillizer(s) n=16 (17) Tranquiliser(s) n=11 (0)

IHTSDO Lexical Resources Word Equivalence zres_WordEquivalents_en-US_INT_20020731.txt (n=4666) res_WordEquivalents_en-GB_GB1000000_20060401.txt (n=71) cicatrix   rupture perineal laryngeal gastric scrotal scar tear perineum larynx stomach scrotum erythrocyte bleeding scapula bronchial pancreas penile red blood cell haemorrhage shoulder blade bronchus pancreatic penis gangrene bruise leucocyte mediastinal ileal prostate gangrenous contusion white blood cell mediastinum ileum prostatic avulsed telangiectasia platelet mouth duodenal ovarian avulsion telangiectasis thrombocyte oral cavity duodenum ovaries neurapraxia cancer thymic palatal rectal ovary neuropraxia malignant neoplasm thymus palate rectum uterine injury malignant tumour cardiac roof of mouth caecal uterus trauma spinal heart teeth caecum adenohypophysis agenesis spine spleen tooth anal anterior pituitary aplasia vertebra splenic lingual anus neurohypophysis neoplasm arm aorta tongue kidney posterior pituitary tumour upper limb aortic pharyngeal renal adrenal verruca carpal arterial pharynx hepatic suprarenal wart carpus artery peritoneal liver ear drum boil wrist vein peritoneum gall bladder eardrum furuncle leg venous oesophageal gallbladder tympanic membrane inflammation lower limb cranial oesophagus testes bone inflammatory thoracic cranium bowel testicle bony lump thorax skull intestinal testicular osseous mass abdomen nasal intestine testis cartilage calculus abdominal nose lung cartilaginous stone pelvic pulmonary pelvis

IHTSDO Lexical Resources Character Substitutions (1) DeveloperToolkit\Indexes\tls1_Index_Generator_INT_20100131.zip \conf\greek_character_conversion_table.txt (2004) \conf\char_conversion_table.txt (2004) ALPHA char=Α 0391 char=α 03B1 BETA char=Β 0392 char=β 03B2 GAMMA char=Γ 0393 char=γ 03B3 DELTA char=Δ 0394 char=δ 03B4 EPSILON char=Ε 0395 char=ε 03B5 ZETA char=Ζ 0396 char=ζ 03B6 ETA char=Η 0397 char=η 03B7 THETA char=Θ 0398 char=θ 03B8 IOTA char=Ι 0399 char=ι 03B9 KAPPA char=Κ 039A char=κ 03BA LAMBDA char=Λ 039B char=λ 03BB MU char=Μ 039C char=μ 03BC NU char=Ν 039D char=ν 03BD XI char=Ξ 039E char=ξ 03BE OMICRON char=Ο 039F char=ο 03BF PI char=Π 03A0 char=π 03C0 RHO char=Ρ 03A1 char=ρ 03C1 SIGMA char=Σ 03A3 char=σ 03C3 TAU char=Τ 03A4 char=τ 03C4 UPSILON char=Υ 03A5 char=υ 03C5 PHI char=Φ 03A6 char=φ 03C6 CHI char=Χ 03A7 char=χ 03C7 PSI char=Ψ 03A8 char=ψ 03C8 OMEGA char=Ω 03A9 char=ω 03C9

IHTSDO Lexical Resources Character Substitutions (2) IHTSDO Diacritic Character Substitutions: A    á â ã ä å ā ă ą ǎ ǻ À Ä AE    æ ǽ C    ç ć ĉ ċ č D    ď đ ð E    è é ê ë ē ĕ ė ę ě F    ƒ G    ĝ ğ ġ ģ H    ĥ ħ I    í ì î ï ĩ ī ĭ į ı IJ    ij J    ĵ K    ķ ĸ L    ĺ ļ ľ ŀ ł N    ñ ń ņ ň ʼn ŋ O    ò ó ô õ ö ō ŏ ő ơ ǒ ǿ OE    œ R    ŕ ŗ ř S    ś ŝ ş š T    ţ ť ŧ U    ù ú û ü ũ ū ū ŭ ů ű ų ư ǔ ǖ ǘ ǚ ǜ W    ŵ Y    ý ÿ ŷ Z    ź ż ž A    Á  à CC04; Å à Ā Ă Ą Ǎ ǻ AE    Æ C    Ç Ć Ĉ Ċ Č D    Ď Đ Ð E    È É Ê Ë Ē Ĕ Ė Ę Ě G    Ĝ Ğ Ġ Ģ H    Ĥ Ħ I    Í Ì Î Ï Ĩ Ī Ĭ Į I IJ    IJ J    Ĵ K    Ķ L    Ĺ Ļ Ľ Ŀ Ł N    Ñ Ń Ņ Ň ʼn Ŋ O    Ò Ó Ô Õ Ö Ō Ŏ Ő Ơ Ǒ OE    Œ R    Ŕ Ŗ Ř S    Ś Ŝ Ş Š T    Ţ Ť Ŧ U    Ù Ú Û Ü Ũ Ū Ŭ Ů Ű Ų Ư Ǔ Ǖ Ǘ Ǚ Ǜ W    Ŵ Y    Ÿ Ŷ Z    Ź Ż Ž Diacritic unicode characters in SCT content with no declared unicode decomposition: øØ ₂ (µ α)

Other issues: AIDS - Acquired immunodeficiency syndrome antibody test AIDS (HTLV-III) screening Aids for severely handicapped Aids for severely handicapped (procedure) External memory aids training External memory aids training (procedure) External memory aids training (regime/therapy) Handicapped aids Handicapped aids (procedure) Instruction in the use of aids Notification of AIDS Notification of AIDS (procedure) Provision of aids to daily living Provision of aids to daily living (procedure) Use of aids for daily life Use of aids for daily life (procedure) Use of aids for daily life (regime/therapy) Use of bathroom aids Use of bathroom aids (procedure) Use of bathroom aids (regime/therapy) Use of indoor mobility aids Use of indoor mobility aids (procedure) Use of indoor mobility aids (regime/therapy) Use of kitchen aids Use of kitchen aids (procedure) Use of kitchen aids (regime/therapy) Use of work aids Use of work aids (procedure) Use of work aids (regime/therapy) Case sensitivity ‘AIDS procedure’  Minimum token length 3 char: HIV, WCC etc 2 char: MI, Hb, Fe etc 1 char: Haemoglobin A vs Haemoglobin S

The Other Elephant Generating candidate matches is just the start of the browsing experience… Resultset filtering No non-human, inactive, limited status concepts No non-sensical semantic types (anatomy, record artefacts, attributes etc) Localisation: specialty subsets, local language etc Resultset display All matching descriptions, or unique concepts only? Ranking, grouping and sorting perils of ‘velocity coding’

Questions… IHTSDO publishes lexical resources… Does anybody use them? Does it matter if some do and some don’t? Are they sufficient? How are they / should they actually be maintained? Currently Anglophone; can they be generalised? Guidance on how to actually use them Hard to implement -> no implementation Optionality -> different search results Success criteria: correct search result in top 20 listed?