Ontology Approaches for Federated Query Systems Lori Phillips MS March 2015.

Slides:



Advertisements
Similar presentations
Use Watch folders to automatically add PDFs to Mendeley Desktop.
Advertisements

Little Used, but Powerful Features with GP Cathy Fregelette, CPA, PMP Practice Manager BroadPoint Technologies September 20, 2012.
CC SQL Utilities.
Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
I2b2 Temporal Queries. i2b2 Proposal Aims 1.Implement sets of time points that can be queried by iterative use of simple time-based and Boolean operations.
IS605/606: Information Systems Instructor: Dr. Boris Jukic
Table of Contents Part B Managing Documents & References File organizer Citing references Creating bibliographies/Using MS Word Plugin Sharing documents.
A Tool to Support Ontology Creation Based on Incremental Mini- Ontology Merging Zonghui Lian Data Extraction Research Group Supported by Spring Conference.
Automatic Data Ramon Lawrence University of Manitoba
Some Introductory Programming 1. Structured Query Language - used for queries. - a standard database product. 2. Visual Basic for Applications - use of.
Tutorial 11: Connecting to External Data
I2b2 grid integration with Ontology Mapper
An Introduction to Database Management Systems R. Nakatsu.
Microsoft Access 2007 Microsoft Access 2007 Introduction to Database Programs.
Ontology Mapping Support Lori Phillips June 2014.
Integrating Word, Excel, Access, and PowerPoint
Use Watch folders to automatically add PDFs to Mendeley Desktop. When you place a document in a watched folder, it will be automatically added to Mendeley.
Chapter 5 Application Software.
Source Code Revision Control Software CVS and Subversion (svn)
MS Access Advanced Instructor: Vicki Weidler Assistant:
Query Health Concept-to-Codes (C2C) SWG Meeting #2 December 13,
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Developing i2b2 Ontologies for the Long Haul Lori Phillips, MS Partners HealthCare Systems, Inc April 25, 2012.
Classroom User Training June 29, 2005 Presented by:
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
PowerPoint 2003 – Level 1 Computer Concepts Cathy Horwitz April 25, 2011.
Class Instructor Name Date. Classroom Tips Class Roster – Please Sign In Class Roster – Please Sign In Internet Usage Internet Usage –Breaks and Lunch.
Data Access Patterns Some of the problems with data access from OO programs: 1.Data source and OO program use different data modelling concepts 2.Decoupling.
1 The following presentation is from the Oracle Webcast “What’s New in P6 EPPM Release 8.1.” As a partner, you may not use the Oracle Power Point template,
PCORI/GPC Standard Data Model Jim Campbell, Dan Connolly, GPC Data standardization team.
I2b2 Version 1.6 Shawn Murphy MD, Ph.D. Griffin Weber MD, Ph.D. Michael Mendis Vivian Gainer MS Nich Wattanasin Lori Phillips MS Rajesh Kuttan Wensong.
OFFICE OF THE CHIEF FINANCIAL OFFICER CFO PeopleSoft nVision - Overview Finance Network Chuck Axthelm March 16, 2006.
PCORI/GPC Standard Data Model Jim Campbell, Dan Connolly, GPC Data standardization team.
PCORI/GPC Standard Data Model Jim Campbell, Dan Connolly, GPC Data standardization team.
WEEK 11 Database Design. TABLE INSTANCE CHARTS Create Tables.
I2b2/NCBO Ontology Tools Lori Phillips, MS AUG meeting July 24-25, 2012.
CSCI 6962: Server-side Design and Programming Database Manipulation in ASP.
© Paradigm Publishing Inc. 5-1 Chapter 5 Application Software.
This eCPIC Quick Guide has been developed to assist System Administrators with creating Hierarchy Grids in eCPIC. The Hierarchy Grid functionality allows.
Introduction to Enterprise Guide Jennifer Schmidt Rhonda Ellis Cassandra Hall.
Reports and Learning Resources Module 5 1. SLMS Primary Administrator Training Module 5: Reports and Learning Resources 2.
Research Network Query Interoperation James R. Campbell University of Nebraska.
Microsoft Excel 2003 Illustrated Complete Data with Other Programs Exchanging.
PCORI/GPC Standard Data Model Jim Campbell, Dan Connolly, GPC Data standardization team.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.
Information Management System “Institutions Module" Information Management System “Institutions Module" The System management module is an integrated part.
EVS Data Curation The processing and publication of data for web browsing and programmatic access.
Chapter 5 Introduction To Form Builder. Lesson C Objectives  Use sequences to automatically generate primary key values in a form  Create lists of values.
SSMS SQL Server Management System. SQL Server Microsoft SQL Server is a Relational Database Management System (RDBMS) Relational Database Management System.
VOCAB REVIEW. A field that can be computed from other fields Calculated field Click for the answer Next Question.
Copyright 2007, Paradigm Publishing Inc. EXCEL 2007 Chapter 8 BACKNEXTEND 8-1 LINKS TO OBJECTIVES Import data from Access, a Web site, or a CSV text file.
Take Your Data Analysis and Reporting to the Next Level by Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio David Bailey Tim Beese.
Deploying ONC terminology standards in SNOW SHRINE i2b2 data warehouse
Enhancing interoperation: an i2b2 ETL schema for Epic EHRs
Ontology Mapping Support Lori Phillips June 2012
Shawn Murphy, MD PhD Diane Keogh, i2b2 Foundation
Enhancing interoperation: an i2b2 ETL schema for Epic EHRs
SQL MODELER - OPEN There are Three Ways to open the SQL Modeler
Mastering ArcGIS Attribute Data (Continued)
State Report Processing
Mail Merge a letter for Integration Office 2016
Microsoft Excel 2007 – Level 2
Data Model.
Database Systems Instructor Name: Lecture-3.
Mendeley Part B Table of Contents
Creating and Managing Database Tables
COURSE OBJECTIVES Review Case Comparison
Integrating Office 2013 Programs
Interoperability with SNOW SHRINE
Presentation transcript:

Ontology Approaches for Federated Query Systems Lori Phillips MS March 2015

Use Case Local codes need to be integrated into the central hierarchy Tools and strategies to assist with this effort A requirement in some federated query systems such as PCORI is that all i2b2 instances need to map their ontologies to a central ontology.

Constructing queries from metadata select patient_num from observation_fact where [c_facttablecolumnname] IN (select [c_facttablecolumnname] from [c_tablename] where [c_columnname] [c_operator] [c_dimcode])

i2b2 Star Schema

Visit_dimension types of queries:Encounter types select patient_num from visit_dimension where inout_cd IN (‘ED’,‘E’) Edit c_dimcode to match your inout_cds.

Patient_dimension types of queries:Age calculations select patient_num from patient_dimension where birth_date BETWEEN sysdate – (365.25*18) AND sysdate – (365.25*10) No modifications necessary.

Patient_dimension types of queries:Sex select patient_num from patient_dimension where sex_cd IN (‘F’) Edit c_dimcode to match your sex_cds.

Patient_dimension types of queries:Race select patient_num from patient_dimension where race_cd IN (‘01’,‘amer. Indian’,‘i’, ‘na’,‘nat. am’) Edit c_dimcode to match your race_cds.

Patient_dimension types of queries:Ethnicity select patient_num from patient_dimension where ethnicity_cd IN (‘HISPANIC’) Create an ethnicity_cd column in patient_dimension Edit c_dimcode to match your ethnicity_cds.

Concept_dimension types of queries:Biobank select concept_cd from concept_dimension where concept_path like '\PCORI\DEMOGRAPHIC \BIOBANK_FLAG\Y\%' Edit c_basecode to match your code for biobank specimens.

patient_dim v concept_dim approaches HISTORY: early versions of i2b2 did not support the patient_dimension approach. Demographic data was inserted as entry in observation_fact table and as a result appeared as a tick mark on the timeline. Patient_dimension approach does not require an entry in the observation_fact table for that concept. Patient_dimension approach will not result in a tick mark in the timeline for that concept. Patient_dimension approach is easier to maintain.

Modifiers select modifier_cd from modifier_dimension where modifier_path like '\PCORI_MOD\DX_SOURCE \AD%' Edit c_basecode to match your codes for DX Type

Broader concept_dimension based queries Diagnoses, Procedures typically require a merge of local terms within the tree.

ICD-9 Example If your institution uses ICD-9 … Change the c_basecode to match your code format. PCORI_BASECODEC_BASECODE ICD9:250.1MY_ICD9:250.1 update pcornet set c_basecode = replace(pcori_basecode, ‘ICD9:’, ‘MY_ICD9:’) where c_basecode is not null and pcori_basecode like 'ICD9:%' and c_fullname like ‘\PCORI\DIAGNOSIS\09\%‘;

ICD-9 plain_code example If your institution uses ICD-9 but in a plain_code format… Change the c_basecode to match your code format. PCORI_BASECODEC_BASECODE ICD9: update pcornet set c_basecode = substring(pcori_basecode, 6, 25) where pcori_basecode like 'ICD9:%' and c_fullname like ‘\PCORI\DIAGNOSIS\09\%‘; update pcornet set c_basecode = replace(c_basecode, '.', '') where pcori_basecode like 'ICD9:%' and c_fullname like ‘\PCORI\DIAGNOSIS\09\%‘;

Local diagnoses codes (non-ICD9) Local codes need to be mapped/merged. Mapper cell / mapping tool plugins exist for workbench.

Source fields are local terms. Destination fields are PCORI terms. PROJECT_ONT_MAPPING SOURCE_CODING_SYSTEMVARCHAR(50) SOURCE_BASECODEVARCHAR(50) SOURCE_NAMEVARCHAR(2000) SOURCE_FULLNAMEVARCHAR(700) SOURCE_TOOLTIPVARCHAR(700) SOURCE_TABLE_CDVARCHAR(25) SOURCE_KEYVARCHAR(725) DESTINATION_CODING_SYSTEMVARCHAR(50) DESTINATION_BASECODEVARCHAR(50) DESTINATION_NAMEVARCHAR(2000) DESTINATION_FULLNAMEVARCHAR(700) DESTINATION_TABLE_CDVARCHAR(25) DESTINATION_KEYVARCHAR(725) MAPPING_SOURCEVARCHAR(50) VARFLAGVARCHAR(25) FLAGINT STATUS_CDVARCHAR(25) UPDATE_DATEDATETIME C_TOTALNUMINT

Reuse mappings from other projects If your local codes already exist within an ICD9 hierarchy.. Find the c_fullname, c_basecode and c_name of parent of the term and use that as your mapping’s destination.

Reuse mappings, cont. WITH joined as( select r2.c_basecode as sourceCode, r1.c_fullname as destination, r1.c_basecode as destCode, r1.c_name as destName from local_ont r1 inner join local_ont r2 on r1.c_fullname=r2.c_path where r2.c_basecode in (select source_basecode from project_ont_mapping where destination_fullname is null)) update project_ont_mapping set destination_fullname = joined.destination, destination_basecode = joined.destCode, destination_name = joined.destName from joined where joined.sourceCode = source_basecode

Automated mappings with UMLS MetaMap Interactive tool that uses UMLS MetaMap to find mappings for your local unmapped terms.

Manual mappings Tool for assigning/verifying mappings

Integration tool Select path symbol format (S,M,L) \i2b2\Diagnoses\Metabolic and immunity disorders ( )\ (277) Other and unspecified disor~\(277-0) Cystic fibrosis\ Start integration Refresh process status until integration is complete. Tool to build ontology table after mappings are completed

Export tool Exports result of integration process as a single, delimited i2b2 metadata file.

Alternate map/merge method If you have small number of local codes to merge, use the Edit Terms view. User must be set up with EDITOR role

Edit Terms View

New term added No need to export file, term is added to table automatically.