A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.


A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo Clinic Architecture/VCDE Face-to-Face Meeting Atlanta, GA, October 22, 2009

Challenges for Metadata Community The community faces a harmonization scaling problem, and tooling to navigate the model space is urgently needed. To foster community adoption and governance, a more open, scalable and collaborative platform is desired.

Wiki/Semantic Wiki/LexWiki Wiki as a collaborative system: community-generated content. Semantic wiki as a platform: supports different levels of the formality continuum (free text -> OWL). LexWiki: a collaborative authoring platform for large-scale biomedical terminologies. BiomedGT: Biomedical Grid Terminology; CTCAE: Common Terminology Criteria for Adverse Events; WHO ICD11: the International Classification of Diseases; NeuroLex: the Neuroscience Lexicon; XMDR: eXtended MetaData Registry project; CSHARE: CDISC Shared Health and Research Electronic Library.

Objectives We propose a LexWiki-based representation and harmonization framework for the caDSR CDEs. We intend to provide enhanced capabilities for semantic storage and retrieval, and for community involvement and collaboration.

Representation and Harmonization Framework in UML Model (Compatible with ISO 11179) This is the representation and harmonization framework expressed as a UML model. We believe the model is compatible with the ISO 11179 standard. The light blue boxes indicate what is loaded from individual contributors: the data element, value domain and permissible value classes. The pink boxes represent the data element semantics, in other words the formal definition of the data elements. The light green boxes represent the terminology elements, that is, what comes from terminology resources such as SNOMED CT, MedDRA, etc. The yellow boxes indicate where the harmonization assertions happen.

Representation and Harmonization Framework in UML Model (Compatible with ISO 11179) This diagram shows the relationship between a data element and its data element concept. The yellow box between them indicates that the data element meaning should be asserted. This diagram shows the relationship between the permissible value class and the value meaning. The yellow box between them indicates that the permissible value meaning link should be asserted.
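As a rough illustration of the classes and the two harmonization links described on these slides, the model could be sketched in Python as below. This is only a sketch under assumptions: the class and field names are hypothetical and are not taken from the actual UML model or from ISO 11179 itself.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ValueMeaning:
    """Terminology-side meaning of a permissible value (names are hypothetical)."""
    coding_scheme: str
    code: str


@dataclass
class PermissibleValue:
    value: str
    # Harmonization assertion: permissible value -> value meaning.
    meaning: Optional[ValueMeaning] = None


@dataclass
class DataElementConcept:
    object_class_concept: str
    property_concept: str


@dataclass
class DataElement:
    name: str
    source: str
    datatype: str
    permissible_values: List[PermissibleValue] = field(default_factory=list)
    # Harmonization assertion: data element -> data element concept.
    concept: Optional[DataElementConcept] = None


def unasserted(element: DataElement) -> List[str]:
    """List the harmonization links that have not been asserted yet."""
    missing = []
    if element.concept is None:
        missing.append("data element meaning")
    for pv in element.permissible_values:
        if pv.meaning is None:
            missing.append(f"permissible value meaning: {pv.value}")
    return missing
```

The optional fields play the role of the yellow boxes: a contributed data element can be loaded first, and the links to its concept and value meanings can be asserted later by the community.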

Workflow process This diagram shows the workflow process derived from the CSHARE project. Data elements are contributed by individual organizations; the community then works together to link the data elements to terminologies and standards (SNOMED, NCI Meta, ICD 10, BRIDG, CDISC, etc.), merge and harmonize them, and generate new standardized definitions.

Individual Data Element Representation - Description This screenshot shows an individual data element represented in the wiki platform. The lexical description section gives the lexical descriptions of the data element, and shows that it belongs to the domain lesion measurement and that its original source is caBIG.

Individual Data Element Representation - Valueset (or codelist) Under the values tab, you can see that the data element is linked to a code list (value set) and the permissible values contained in that value set. Notably, only one permissible value has a concept reference asserted.
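A simple check of this kind, finding which permissible values still lack a concept reference, might look like the sketch below. The dictionary layout, the values and the concept identifier are all hypothetical placeholders, not data from the wiki.

```python
def concept_coverage(permissible_values):
    """Split permissible values by whether a concept reference is asserted."""
    asserted = [pv["value"] for pv in permissible_values if pv.get("concept")]
    pending = [pv["value"] for pv in permissible_values if not pv.get("concept")]
    return asserted, pending


# Hypothetical value set: only the first value has a concept reference.
value_set = [
    {"value": "A", "concept": "NCIM:C0000000"},  # placeholder identifier
    {"value": "B"},
    {"value": "C", "concept": None},
]
asserted, pending = concept_coverage(value_set)
print(f"{len(asserted)} asserted, {len(pending)} pending: {pending}")
```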

Individual Data Element Representation - Concept reference Under the concept reference tab, you can see the concepts attached to this data element, classified as object class concept and property concept following the ISO model.

Underlying Semantic Annotations Each data element has underlying semantic annotations that can be used for formal rendering. We use a prefix in the property names to indicate that this is an ISO-compatible model. Note that a list of terminologies is also loaded in the wiki to facilitate the harmonization process, as the following slides show.

RDF/OWL Rendering This is the RDF/OWL rendering of the semantic annotations for that data element. It is a built-in feature of Semantic MediaWiki and could potentially be extended to interface with other existing Semantic Web applications.
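To give a feel for how page annotations can become RDF, here is a minimal sketch that pulls Semantic MediaWiki inline annotations of the form [[property::value]] out of wikitext and writes them as N-Triples-style statements. It is a simplified illustration, not Semantic MediaWiki's own export: the base URI, the page name and the property names (including the "ISO" prefix) are assumptions made for the example, and every value is treated as a plain literal.

```python
import re
from urllib.parse import quote

# Matches [[property::value]] and [[property::value|display text]].
ANNOTATION = re.compile(r"\[\[([^\[\]:|]+)::([^\[\]|]+)(?:\|[^\[\]]*)?\]\]")


def extract_annotations(wikitext):
    """Return (property, value) pairs found in the wikitext."""
    return [(p.strip(), v.strip()) for p, v in ANNOTATION.findall(wikitext)]


def to_ntriples(page, annotations, base="http://example.org/wiki/"):
    """Render annotations as N-Triples-style lines (base URI is hypothetical)."""
    def iri(name):
        return "<" + base + quote(name.replace(" ", "_"), safe=":/") + ">"

    lines = []
    for prop, value in annotations:
        literal = value.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'{iri(page)} {iri("Property:" + prop)} "{literal}" .')
    return lines


# Hypothetical page content; property names are invented for illustration.
text = "Datatype: [[ISO datatype::Character]], about [[ISO object class::Lesion|lesion]]."
for line in to_ntriples("Example Data Element", extract_annotations(text)):
    print(line)
```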

Harmonization Assertion – Lexical definition and ISO datatypes This slide shows form-based authoring support for lexical definitions and ISO datatypes, with enhanced functionality such as autocompletion and definition display.

Harmonization Assertion - Concept reference and BRIDG linkage This slide shows form-based concept reference authoring with autocompletion and definition display support. The prefix NCIM indicates that the concepts come from the NCI Metathesaurus coding scheme. The platform also supports mapping to the BRIDG model.

Slicer and Dicer – Domain-based view This slide shows a domain-based view through an Exhibit browser, developed by the MIT SIMILE group. We call this browser a slicer and dicer; it provides a flexible way to browse data elements from different sources.

Slicing and dicing We can slice and dice by datatype, object class concept and property concept. In this slide, 6 data elements are grouped together by datatype Character, object class concept Lesion and property concept Increase.
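The grouping behind this kind of faceted view can be sketched in a few lines of Python. This is not how the Exhibit browser is implemented; it only illustrates the idea, and the element names, sources and dictionary keys are hypothetical.

```python
from collections import defaultdict


def slice_by(elements, *facets):
    """Group data element names by the values of the chosen facets."""
    groups = defaultdict(list)
    for element in elements:
        key = tuple(element.get(facet) for facet in facets)
        groups[key].append(element["name"])
    return dict(groups)


# Hypothetical data elements from different sources.
elements = [
    {"name": "DE 1", "source": "Source A", "datatype": "Character",
     "object_class": "Lesion", "property": "Increase"},
    {"name": "DE 2", "source": "Source B", "datatype": "Character",
     "object_class": "Lesion", "property": "Increase"},
    {"name": "DE 3", "source": "Source A", "datatype": "Number",
     "object_class": "Lesion", "property": "Increase"},
]

groups = slice_by(elements, "datatype", "object_class", "property")
print(groups[("Character", "Lesion", "Increase")])
```

Elements that fall into the same group share a datatype, object class concept and property concept, which makes them natural candidates for a merged CDE proposal.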

CDE Proposal – New Definition A CDE proposal can then be generated by merging those 6 data elements. The community can work together on harmonization and create a new definition. The wiki platform also has built-in collaboration features for discussion and version management.

Summary In summary, our LexWiki-based framework provides a flexible, extensible and collaborative way to represent and harmonize data elements contributed from different sources against terminologies and standards such as SNOMED, NCI Meta, ICD 10, BRIDG, CDISC, etc.

Use Cases for a Platform to Support Collaborative Authoring of Data Elements Data Element Harmonization: identification of related CDEs; creation of CDE standards from existing CDEs; development of DAMs (or DAM subdomains); linkage among CDEs, DAMs and standards/terminologies; connections to MDR (caDSR/cgMDR/openMDR). Dynamic Extensions: advertisement of proposed or implemented extensions; mirror extensions in other application instances; use of extensions in new applications; connections with Semantic Web applications/communities; … The Vocabulary Knowledge Center will host a teleconference to demonstrate CSHARE and discuss other potential use cases.

Questions?