The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD.

Slides:



Advertisements
Similar presentations
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
Advertisements

Semantic Content Infrastructure for Knowledge Applications Tools of Change 2011 Thane Kerner, CEO Silverchair.
Thane Kerner Silverchair. What is… The Semantic Web? A Semantic Data Layer? Semantic Tagging? Why add semantics to my content? How can I get semantic.
Open Health Tools Distributed Terminology System Presentation Jack Bowie SVP Sales and Marketing Apelon, Inc. 1.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
A New Learning Tools. Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information.
The Role of Standard Terminologies in Facilitating Integration James J. Cimino, M.D. Departments of Biomedical Informatics and Medicine Columbia University.
Brian A. Carlsen Apelon, Inc. Tools For Classification Integration Networked Knowledge Organization Systems/Services Workshop June 28, 2001.
Thesaurus Design and Development
Lecture Fourteen Methodology - Conceptual Database Design
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Clinical Vocabularies James J. Cimino, M.D. Columbia University.
Methodology Conceptual Database Design
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CSC271 Database Systems Lecture # 21. Summary: Previous Lecture  Phases of database SDLC  Prototyping (optional)  Implementation  Data conversion.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2007 National Library of Medicine National Institutes of Health U.S. Dept. of Health.
1 Betsy L. Humphreys, MLS Betsy L. Humphreys, MLS National Library of Medicine National Library of Medicine National Institutes of Health National Institutes.
Indexing 1/2 BDK12-3 Information Retrieval William Hersh, MD Department of Medical Informatics & Clinical Epidemiology Oregon Health & Science University.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2005 May 16 & 17, 2005 Rachel Kleinsorge.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Session II: Scientific Publishing and Semantic Web W3C Semantic Web for Life Sciences Workshop October 27, 2004 Moderator: Alan R. Aronson.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Methodology - Conceptual Database Design Transparencies
Methodology Conceptual Databases Design
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
Betsy L. Humphreys Betsy L. Humphreys Associate Director for Library Operations NLM, NIH, HHS NLM, NIH, HHS National Library.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Betsy L. Humphreys Betsy L. Humphreys ~ National Library of Medicine National Institutes of.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
1 st June 2006 St. George’s University of LondonSlide 1 Using UMLS to map from a Library to a Clinical Classification: Improving the Functionality of a.
More and Better Data for Research: U.S. Health Data Content Standards Betsy L. Humphreys Assistant Director for Health Services Research Information National.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Recent advances in the field of Family Medicine classifications ICPC into WHO-FIC J K Soler Wonca International Classification Committee.
Shelly Warwick, MLS, Ph.D – Permission is granted to reproduce and edit this work for non-commercial educational use as long as attribution is provided.
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
Methodology - Conceptual Database Design
Unit 5 Ch 6: Nomenclatures and Classification Systems Tuesday, April 5 th at 8PM EST HS Adrienne Palmer, BSPH, MHA, FACHE.
Asp/IEETA Health-Grid Workshop Brussels 20 th September 2002 A. Sousa Pereira Univ. Aveiro - IEETA.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
Digital Libraries, Archives, and Large Data Sets Alexa T. McCray National Library of Medicine Bethesda, Maryland USA WHOI, June 3, 2004.
Chapter 19 Manager of Information Systems. Defining Informatics Process of using cognitive skills and computers to manage information.
NLM Value Set Authority Center Curation and delivery of value sets for eMeasures eMeasures Issues Group (eMIG) May 24, 2012 NLM.
The UMLS Semantic Network Alexa T. McCray Center for Clinical Computing Beth Israel Deaconess Medical Center Harvard Medical School
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
SNOMED CT Vendor Introduction 27 th October :30 (CET) Implementation Special Interest Group Tom Seabury IHTSDO.
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
METADATA MANAGEMENT AT ISTAT: CONCEPTUAL FOUNDATIONS AND TOOLS Istituto Nazionale di Statistica ITALY.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
Methodology Conceptual Databases Design
Methodology Conceptual Database Design
Achieving Semantic Interoperability of Cancer Registries
The Needs for Coding and Classification Systems
Controlled Vocabularies for Capturing Clinical Encounters
2. An overview of SDMX (What is SDMX? Part I)
Ontology-Based Approaches to Data Integration
Methodology Conceptual Databases Design
PubMed.
Presentation transcript:

The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD

Observations Words are not enough Word based synonymy is not enough Single phrases are not enough Need “web-scale” synonymy

Synonymy that is “Web-Scale” Concepts (classes of terms) finely granular “Fully expressive” names –Acronyms –Gene Names –“special meanings” Scalable methodologies, large scale vocabularies

UMLS Purpose Make it easy for health professionals and researchers to retrieve and integrate relevant information from disparate automated sources, e.g. –computer-based patient records –factual databanks –bibliographic databases and full-text –expert systems Antedated and anticipated the Web

UMLS Focus Conceptual Connections Build knowledge sources that can be used by intelligent programs to overcome: –disparities in language used by different users and in different information sources; –difficulties in identifying which of many information sources is relevant

UMLS Knowledge Sources Multi-purpose tools or “intellectual middleware” for System Developers Metathesaurus SPECIALIST lexicon and lexical programs Semantic Network

UMLS Knowledge Sources Distribution Annual updates, Free under license agreement with NLM –Need separate license agreements with vocabulary producers for some uses of some vocabularies in the Metathesaurus Available to licensed users (~900) via Internet server and on CDs

1999 UMLS Metathesaurus 626,313 concepts (Oculus, eye =1) 1,134,413 “terms” (Eye, Eyes, eye = 1) 1,358,891 “strings”/concept names –(Eye, Eyes, eye = 3) ~50 source vocabularies

UMLS Metathesaurus Finely Granular Concepts Concepts, terms, and attributes from many controlled vocabularies New inter-source relationships, definitional information, use information Scope determined by combined scope of source vocabularies Strict definition of synonymy Semantic neighborhood

UMLS Source Vocabularies Widely varying purposes, structures, properties –Thesauri, e.g., MeSH –Statistical Classifications, e.g., ICD –Billing Codes, e.g., CPT –Clinical coding systems, e.g., SNOMED –Lists of controlled terms, e.g., COSTAR, HL7 value sets

Metathesaurus Construction The Scalable Methodology Convert machine-readable vocabulary sources to UMLS “normal” form, making source semantics explicit Merge, using source semantics and lexical processing techniques Edit results, adding additional relationships and semantic information

Metathesaurus Characteristics (1) Concept organization Many sources in a common database format Representation of the meaning in each source vocabulary Explicit tagging of each source vocabulary’s information

Current MeSH -- Organized by Preferred Term D Esophageal Motility Disorders (MH) Esophageal Dysmotility (EP - SYN) Nutcracker Esophagus (EP - NRW)

UMLS Metathesaurus -- Organized by Concept C Esophageal Motility Disorders (MeSH, Read) Esophageal Dysmotility (MeSH, Read) Oesphageal Dysmotility (Read) C Nutcracker Esophagus (MeSH, Read) Symptomatic esophageal peristalsis (Read)

Websites Using UMLS Medical World Search - CliniWeb - OHSU MetaZoomA -Lexical/Apelon

Cautions in Searching Mention is not Aboutness, but... Aboutness is not Relevance Relevance is in the eye of the beholder

UMLS Summary Concept-based Extensive content in biomedicine Scalable methodology Supporting both retrieval whether indexed or searching full-text