Challenges and issues with information sharing: The four pillars of semantic interoperability Douglas B. Fridsma, MD, PhD, FACP University of Pittsburgh.

Slides:



Advertisements
Similar presentations
Introduction The cancerGrid metadata registry (cgMDR) has proved effective as a lightweight, desktop solution, interoperable with caDSR, targeted at the.
Advertisements

Joint TC Meeting: EHR – RCRIM RCRIM Overview HL7 Working Group Meeting January, 2007 Presented by: Ed Tripp Program Director, eSubmissions Abbott (RCRIM.
27 June 2005caBIG an initiative of the National Cancer Institute, NIH, DHHS caBIG the cancer Biomedical Informatics Grid Arumani Manisundaram caBIG - Project.
BRIDG Overview Clinical Observation Interoperability March 18, 2008.
Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
Overview of Biomedical Informatics Rakesh Nagarajan.
EleMAP: An Online Tool for Harmonizing Data Elements using Standardized Metadata Registries and Biomedical Vocabularies Jyotishman Pathak, PhD 1 Janey.
Frank Hartel, PhD Enterprise Vocabulary Services National Cancer Institute NCI Enterprise Vocabulary Services (EVS) and Semantic Integration at NCI - An.
Interoperability Framework Overview March 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting Director, Office of Interoperability & Standards ONC HIT.
1 ECCF Training 2.0 Introduction ECCF Training Working Group January 2011.
Measurable Interoperability for Archival Data Lewis J. Frey, PhD
Best Practices for Including Enumerated Value Domains in UML Models What are the mechanics of creating CDEs associated with enumerated value domains in.
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
CaBIG: the cancer Biomedical Informatics Grid Ken Buetow NCICB/NCI/NIH/DHHS.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
SCIENCE-DRIVEN INFORMATICS FOR PCORI PPRN Kristen Anton UNC Chapel Hill/ White River Computing Dan Crichton White River Computing February 3, 2014.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
1 Federal Health IT Ontology Project (HITOP) Group The Vision Toward Testing Ontology Tools in High Priority Health IT Applications October 5, 2005.
LexEVS 6.0 Overview Scott Bauer Mayo Clinic Rochester, Minnesota February 2011.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
 BRIDG R3.0.2 was released in August 2010  The BRIDG Model passed the initial ISO Joint Initiative Council ballot as a Draft International Standard (DIS)
Introduction to MDA (Model Driven Architecture) CYT.
CaBIG Semantic Infrastructure 2.0: Supporting TBPT Needs Dave Hau, M.D., M.S. Acting Director, Semantic Infrastructure NCI Center for Biomedical Informatics.
LexEVS Overview Mayo Clinic Rochester, Minnesota June 2009.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Page 1 Informatics Pilot Project EDRN Knowledge System Working Group San Antonio, Texas January 21, 2001 Steve Hughes Thuy Tran Dan Crichton Jet Propulsion.
Interfacing Registry Systems December 2000.
Client Registry An enterprise master patient index (EMPI), or Client Registry manages the unique identity of citizens receiving health services with the.
Interoperability Framework Overview Health Information Technology (HIT) Standards Committee June 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting.
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
CRIX: toward a secure, standards-based, clinical research information exchange.
Dave Iberson-Hurst CDISC VP Technical Strategy
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Briefing: HL7 Working Group Meeting Update for the VCDE Community Dianne M. Reeves Associate Director, Biomedical Data Standards NCI CBIIT VCDE Meeting.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
CaBIG ® VCDE Workspace Tactics thru June 14, 2010: How working groups fit together, and other activities Brian Davis April 1, 2010 VCDE WS Teleconference.
Clinical Data Interchange Standards Consortium (CDISC) uses NCIt for its Study Data Tabulation Model (SDTM) and other global data standards for medical.
Modeling Component-based Software Systems with UML 2.0 George T. Edwards Jaiganesh Balasubramanian Arvind S. Krishna Vanderbilt University Nashville, TN.
Cancer MetaData Standards Peter A. Covitz, Ph.D. HL7 RCRIM October 1, 2002.
Networking and Health Information Exchange Unit 5b Health Data Interchange Standards.
CaCORE Software Development Kit George Komatsoulis 25-Feb-2005.
Health IT Workforce Curriculum Version 1.0 Fall Networking and Health Information Exchange Unit 3b National and International Standards Developing.
1 ECCF Training 2.0 Introduction ECCF Training Working Group January 2011.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Interchange vs Interoperability Main Entry: in·ter·op·er·a·bil·i·ty : ability of a system... to use the parts or equipment of another system Source: Merriam-Webster.
SDMX IT Tools Introduction
Patterns in caBIG Baris E. Suzek 12/21/2009. What is a Pattern? Design pattern “A general reusable solution to a commonly occurring problem in software.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Ontologies Reasoning Components Agents Simulations An Overview of Model-Driven Engineering and Architecture Jacques Robin.
Structured Protocol Representation for the Cancer Biomedical Informatics Grid: caSPR and caPRI.
CaCORE In Action: An Introduction to caDSR and EVS Browsers for End Users A Tool Demonstration from caBIG™ caCORE (Common Ontologic Representation Environment)
National Cancer Institute caDSR Briefing for Small Scale Harmonication Project Denise Warzel Associate Director, Core Infrastructure caCORE Product Line.
1 caBIG®-aligned Enterprise Metadata Infrastructure to Support Commercial Clinical Trials Management Software: A Pilot Implementation September 11, 2009.
0 caBIG and caGrid: Interoperable Computing Infrastructure for the Nation’s [and World’s] Cancer Research Enterprise Peter A. Covitz, Ph.D. Chief Operating.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Semantic Interoperability: caCORE and the Cancer Data Standards Repository (caDSR)  Jennifer Brush.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Margaret Haber, RN, OCN Frank Hartel, PhD Enterprise Vocabulary Services National Cancer Institute Overview of NCI Enterprise Vocabulary Services (EVS)
Submission Standards: The Big Picture Gary G. Walker Associate Director, Programming Standards, Global Data Solutions, Global Data Management.
NCI Center for Biomedical Informatics and Information Technology (CBIIT) The CBIIT is the NCI’s strategic and tactical arm for research information management.
Dave Iberson-Hurst CDISC VP Technical Strategy
Networking and Health Information Exchange
Electronic Health Information Systems
Clinical Observation Interoperability March 18, 2008
W3C HCLS Task Force on Drug Safety and Efficacy
Presentation transcript:

Challenges and issues with information sharing: The four pillars of semantic interoperability Douglas B. Fridsma, MD, PhD, FACP University of Pittsburgh School of Medicine Center for Biomedical Informatics

R&D Expenditures ($B) Investigational New Drug Applications New Chemical Entities Approved BioPharma R&D Conundrum Courtesy of Charles Jaffe

“the nice thing about standards is that everyone has one”

Clinic Research back office

Cancer Research: Two Different Worlds Patient Care World Multiple data sources and types HL7 is a pervasive standard Data are organized around the patient Clinical Research World Protocol defines define elements Linear data flow CDISC is the emerging standard Data are organized around a trial Patient Data in Proprietary Format Acknowledgements: Landen Bain, CDISC

Challenges to Interoperability Multitude of information systems No common data model No common data formats Few common vocabularies No infrastructure for data sharing Informatics Tower of Babel

So what do we need?

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

“Protocol” and the Semiotic Triangle Symbol “ Protocol ” “We need to sign off on the protocol by Friday” Concept 1 Thing 1 Document Study “Protocol XYZ has enrolled 73 patients” Thing 2 Concept 2 “Per the protocol, you must be at least 18 to be enrolled” Concept 3 Thing 3 Plan Source: John Speakman/Charlie Mead

Interchange vs Interoperability Main Entry: in·ter·op·er·a·bil·i·ty : ability of a system... to use the parts or equipment of another system Source: Merriam-Webster web site interoperability : ability of two or more systems or components to exchange information and to predictably use the information that has been exchanged. Source: IEEE Standard Computer Dictionary: A Compilation of IEEE Standard Computer Glossaries, IEEE, 1990] Syntax  Structure Semantics  Meaning Source: Charlie Mead Semantic interoperability Syntactic interoperability (interchange)

caBIG™ Solution to Problem Common, widely distributed infrastructure Shared, harmonized set of terminology, data elements, and data models that facilitate information exchange Leverage existing international standards wherever possible Work towards convergence of existing international standards and data models

This isn’t rocket science A lot of it isn’t even computer science –Most industries did this years ago But it is hard caBIG™’s goal (oversimplified): facilitate the exchange of data useful for cancer research –Between research domains, systems, investigators, organizations

SYNTACTIC SEMANTIC caBIG Compatibility Guidelines

The glue that binds parts together is metadata infrastructure Shape of boundary is defined in APIs Getting to Interoperability Focus on boundaries, interfaces, and how things fit together Not focused on the internal details of how systems are built: assume that will be diverse & changing

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

caCORE Common Ontologic Representation Environment Bioinformatics Objects Enterprise Vocabulary Common Data Elements SECURITYSECURITYApplications

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

The BRIDG Model What is it? A collaborative effort between CDISC, HL7, caBIG TM, NCI, FDA, and Industry to produce one common, shared data exchange standard –The official domain analysis model of RCRIM –The semantic foundation for harmonization of CDISC message specifications –Plans underway for the FDA Janus data base to be harmonized around BRIDG –An open community of stakeholders interested in developing standards for exchanging information about clinical trials

The BRIDG Model Vision: Create a domain analysis model for the clinical research domain to harmonize clinical research standards among each other and to harmonize standards between clinical research and healthcare –A bridge between different models of clinical trials information –A formal model of the shared semantics of regulated clinical trials research

The BRIDG Model Key Goal: Define a structured computable protocol representation that supports the entire life-cycle of clinical trials protocol to achieve syntactic and semantic interoperability The semantic foundation for application and message development in HL7, caBIG, and CDISC

Protocol Authoring and Documentation Clinical Trial Design Structured Statistical Analysis Clinical Trial Registration Eligibility Determination Protocol activities and Safety monitoring (AE) BRIDG

Rich semantics captured in BRIDG CDISC glossary of terms Semantics and examples at the class and attribute level Datatypes help to clarify the semantics around the classes and attributes

caBIO: Objects for Data Access Set of common, cross-domain objects used to obtain information Provides interfaces to access information without worrying about backend data sources Development follows open, vendor-neutral OMG Model Driven Architecture (MDA) standards Supports “syntactic interoperability”

Bioinformatics Objects Domain Information models expressed as Unified Modeling Language (UML) Class Diagrams

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

caDSR: Metadata Repository Provides support for “semantic interoperability” ISO Registry for Common Data Elements (CDEs) Addresses the need for consistency in data descriptions CDEs precisely define the questions and answers –What question are you asking, exactly? –What are the possible answers, and what do they mean? Provides run-time access to metadata and semantics

Common Data Elements (CDEs) Data descriptors or “metadata” for cancer research Precisely defining the questions and answers –What question are you asking, exactly? –What are the possible answers, and what do they mean? –Using controlled vocabularies from EVS Describing data objects and attributes used for presentation to scientific applications Ongoing projects covering various domains –Clinical Trials –Imaging –Biomarkers –Genomics

Cancer Data Standards Repository Registry for Common Data Elements Follows international standard format (ISO/IEC extensions) Public ID + Version = Unique Identifier Interactive Tools: –CDE Curation – Integrated with EVS to draw terms from controlled vocabulary –CDE Browser – search and export –Form Compliance with CDEs

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

Enterprise Vocabulary Services Provides underlying semantic content and vocabulary control for data objects and CDEs Vocabulary Products and Services –NCI Thesaurus: Cancer ontology 46,000 concepts, 92 named inter-concept relations –NCI Metathesaurus: UMLS enhanced with cancer-oriented concepts 930,000+ concepts, 2,200,000 terms and phrases Mappings among over 50 vocabularies Rich synonymy: Over 40,000 terms for “cancer” mapped to 7,000 concepts –External Vocabularies – over 70 including SNOMED, MedDRA, GO, LOINC, the MGED Ontology and many others Formal collaborations with FDA, NHLBI, NIAID, VA, CDC, caBIG TM

The Pillars of (Semantic) Interoperability Necessary but not Sufficient Common object and information models across all domains-of-interest –BRIDG –caBIO Model grounded on robust data type specification –Common data elements –Complex data types Methodology for binding terms from concept-based terminologies –Vocabulary services –Thesaurus A formally defined process for defining specific structures to be exchanged between machines, i.e. a “messaging standard” –HL7 and implementation specifications –Unified process for software development

The Importance of Process Processes to develop interoperable applications –Model-driven architecture, Unified Process Processes to develop interchange standards –HL7 and CDISC Processes to maintain the models –Developing methods of collaboration and versioning control, based on open-source architectures and methods –Developing methods of model harmonization based on HL7 frameworks Goal is to create a community of stakeholders who value the quality of the shared model of clinical research

Tools to Support the People Process –Open source –Used for projects like firefox and mozilla –Bug and feature tracking –Discussion forums –Source control with CVS check-in/check-out –Project administration – and listservs with notification of code, discussion group, or document changes –Ability to link and synchronize CVS repositories across sites

Existing Standards Harmonization Process A –Harmonize existing HL7 DMIM for regulated clinical research Process B –“extract” domain knowledge from existing messages and systems into formal UML models Process C –Harmonize and merge UML models into BRIDG Existing CDISC message BRIDG model UML submodel Existing object model (caBIO) UML submodel Existing HL7 message CC B\B\ B A

New Message and Application Development Process D –Generate a “release” of BRIDG for formal requirements traceability with vetting within stakeholder organizations Process E –HL7 message generation using HL7 mechanisms Process F –caBIG application development with annotation, semantic connector, UML loader DMIM BRIDG model BRIDG model release version X E D HL7 message generation F Annotated UML model caBIG App Development

Take Home Messages Interoperability is hard – it is more than shared data, it requires shared meaning –Four pillars of interoperability Shared information models Common data elements Rich vocabularies Repeatable processes Interoperability requires support tools- –Collaborative websites for modeling ( –caDSR for Common data elements –EVS for vocabularies Interoperability is collaborative--The collaboration and processes are perhaps the most important part of the effort in caBIG –Collaborate until it hurts

Douglas B. Fridsma