Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.

Presentation transcript:

Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway 20 May 2016

Overview
Ontologies versus wordnets
Inter-Lingual Index
Future work to map the National Cancer Institute Thesaurus to the Collaborative Inter-Lingual Index
In collaboration with:
– Francis Bond, Nanyang Technological University, Singapore
– Selja Seppälä, University of Florida, USA

ONTOLOGIES VERSUS WORDNETS

Semantic Networks
Words, concepts, or classes that are arranged in a network
Provide a framework for machine-readable meaning and context for what would otherwise be uninterpreted syntax
Simultaneously logical objects and mathematical objects, subject to inference and graph-theoretical analysis

Wordnets versus Ontologies
Wordnets are semantic networks that represent how we use language.
– Meanings are stated in natural language definitions and relatively sparse semantic relations.
– The word ‘cat’ in context
Ontologies are semantic networks that represent properties of things in the world.
– Meanings are encoded in logical form.
– What it is to be a cat

Comparative Strengths
Wordnets
– NLP applications that require word sense discrimination
– Cross-lingual comparison of lexical categories
– Distance or related measures of concepts
Ontologies
– Provide a coherent, stable and unified frame of reference for the interpretation of concepts and specification of classes
– May support interoperability of data sets
– Support deductive reasoning over structured data

INTER-LINGUAL INDEX

Current State of Mapping Wordnets
Wordnets exist for many languages.
– 33 open wordnets in the Global Wordnet Grid
Mapping often occurs through English WordNet.
– English-centric
– English does not have a word for every concept.
Some wordnets are mapped to each other directly.

Mapping wordnets to each other directly gets messy. Vossen, GlobalWordNet Conference,
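
A rough way to see why direct mapping gets messy is to count the mappings involved. The sketch below assumes that each pair of wordnets needs one symmetric mapping and that a shared index needs one mapping per wordnet. The function names are illustrative only; the figure of 33 comes from the Global Wordnet Grid count on the previous slide.

```python
from math import comb


def direct_mappings(n_wordnets: int) -> int:
    """Mappings needed if every wordnet is linked to every other one."""
    return comb(n_wordnets, 2)


def hub_mappings(n_wordnets: int) -> int:
    """Mappings needed if every wordnet is linked once to a shared index."""
    return n_wordnets


n = 33  # open wordnets in the Global Wordnet Grid, per the slide
print(direct_mappings(n), hub_mappings(n))  # prints: 528 33
```

Under these assumptions the pairwise count grows quadratically with the number of wordnets, while the count for a shared index grows linearly.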

The Collaborative Inter-Lingual Index (CILI)
[Figure: diagram with nodes labelled en, es, no and pt]

CILI
Flat list of concepts, each with a persistent, Semantic Web-compliant IRI
Synsets from wordnets are mapped directly to the ILI IRI
English WordNet 3.0 and 3.1 and the Dutch Open Wordnet are currently mapped
A unique English definition is associated with each ILI to support mapping (but no English words or labels)
Not imposed on linked wordnets
Open: anyone can contribute
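
The pivot role of the index can be sketched as a small data structure. This is only an illustration under stated assumptions: the ILI identifier, the synset identifiers and the `align` helper are all made up here and do not correspond to real CILI or wordnet entries. Each ILI entry carries a definition but no words or labels, and each wordnet keeps its own synset-to-ILI table.

```python
from collections import defaultdict

# Hypothetical identifiers, invented for illustration only.
ILI_DEFINITIONS = {
    "i0001": "a small flat triangular bone in front of the knee "
             "that protects the knee joint",
}

# Each wordnet maps its own synsets onto ILI concepts.
SYNSET_TO_ILI = {
    "en": {"en-synset-1": "i0001"},
    "nl": {"nl-synset-7": "i0001"},
}


def align(source: str, target: str) -> dict[str, list[str]]:
    """Link synsets of two wordnets that point at the same ILI concept."""
    by_ili = defaultdict(list)
    for synset, ili in SYNSET_TO_ILI[target].items():
        by_ili[ili].append(synset)
    return {
        synset: by_ili[ili]
        for synset, ili in SYNSET_TO_ILI[source].items()
        if ili in by_ili
    }


print(align("en", "nl"))  # prints: {'en-synset-1': ['nl-synset-7']}
```

Neither wordnet refers to the other: the cross-lingual link is derived entirely from the two independent mappings onto the shared index.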

NCI THESAURUS AND CILI

National Cancer Institute Thesaurus (NCIt)
An English medical reference terminology
Definitions crafted by teams of medical experts and terminologists
Covers vocabulary for clinical care, translational and basic research, and public information and administrative activities
Widely used in biomedical and health informatics in the USA

Why map NCIt to CILI?
Specialized terminology in CILI should be defined by subject matter experts, not linguists.
There is currently no prototype for mapping specialized vocabulary to CILI.
More resources may lead to improved formal semantics to be integrated with CILI.
To support integration of health knowledge extracted from linguistically heterogeneous sources
– Multi-lingual
– Layperson/specialized vocabulary

“Patella” in WN and NCIt
WordNet
– Lemmas: patella, kneecap, kneepan
– Definition: A small flat triangular bone in front of the knee that protects the knee joint
– Part_holonym: knee
– Hypernym: Sesamoid bone
NCIt
– Name: BONE, PATELLA
– Definition: A small flat triangular bone in front of the knee that articulates with the femur and protects the knee joint.
– subClassOf: Bone of the Lower Extremity; Short Bone
– Semantic Type: Anatomical Structure
NCIt adds:
– Additional synonyms
– Additional definitional knowledge: potentially useful for formalizing semantics
– Additional formal semantic information

Semantic Modeling
One of the goals of CILI is to have ontologies that provide formal semantics for the indexed concepts.
Different semantic resources encode different semantic information.
NCIt can be used to enrich the common semantic model.

Our Planned Approach to the Project
Map NCI Thesaurus to CILI
– Convert NCI Thesaurus to Lexical Markup Framework
– Partially automate mapping NCIt to CILI using string matching on WordNet synsets and NCIt names, and similarity measures on definitions
– There will still be false negatives that will need to be identified by hand.
Formalize the semantics of NCIt-related CILIs with other ontologies.
– KYOTO
– BFO
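
The partially automated step could look roughly like the sketch below. It is not the authors' implementation: the dict layout, the `candidate_match` helper, the 0.6 threshold and the use of Python's `difflib.SequenceMatcher` as the similarity measure are all assumptions made for illustration. The terms and definitions are the ones shown on the “Patella” slide.

```python
from difflib import SequenceMatcher

# Data copied from the "Patella" slide; the dict layout is a hypothetical choice.
wordnet_patella = {
    "lemmas": ["patella", "kneecap", "kneepan"],
    "definition": "A small flat triangular bone in front of the knee "
                  "that protects the knee joint",
}
ncit_patella = {
    "names": ["BONE, PATELLA"],
    "definition": "A small flat triangular bone in front of the knee that "
                  "articulates with the femur and protects the knee joint.",
}


def normalize(term: str) -> str:
    """Lowercase a term and trim surrounding whitespace."""
    return term.lower().strip()


def definition_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for the real measure."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidate_match(synset: dict, concept: dict, threshold: float = 0.6):
    """Return (is_candidate, name_hit, definition_score) for one pair."""
    lemmas = {normalize(lemma) for lemma in synset["lemmas"]}
    names = {normalize(name) for name in concept["names"]}
    name_hit = bool(lemmas & names)  # exact string match on any name
    score = definition_similarity(synset["definition"], concept["definition"])
    return name_hit or score >= threshold, name_hit, score


matched, name_hit, score = candidate_match(wordnet_patella, ncit_patella)
print(matched, name_hit)
```

With only the data shown on the slide, the exact string match fails ("bone, patella" is not one of the WordNet lemmas) and the pair is recovered through the definitions alone. This shows why string matching by itself would leave false negatives to be found by other means.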

Use for Knowledge Integration
[Figure: example text “… smokes hubbly-bubbly on weekends …”, an ILI concept, and “Smoking status”]

THANK YOU