Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use of EHR Data Project 3: High-Throughput Phenotyping Project Lead: Jyotishman.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

INTERNATIONAL HEALTHCARE STANDARDS LANDSCAPE
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
Data Governance and Normalization with the Mayo Clinic Enterprise
HL7 Decision Support Service (DSS) and Virtual Medical Record (vMR) Standards, and OpenCDS Open-Source Implementation August 14, 2012 HL7 Ambassador.
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD HCLS TMO October 27, 2010.
Data Normalization Milestones. Data Normalization  Goals –To conduct the science for realizing semantic interoperability and integration of diverse data.
Drools – Diabetes Phenotype Identification
MOHH – Models Submission Dr Linda Bird 9 th August 2012.
VSAC Users’ Forum Thursday, January 15 2:00 – 3:00 PM, EST.
Session II: Standardization of phenotyping algorithms and data Jyotishman Pathak, PhD Associate Professor of Biomedical Informatics Department of Health.
Amy Sheide Clinical Informaticist 3M Health Information Systems USA Achieving Data Standardization in Health Information Exchange and Quality Measurement.
©2013 MFMER | slide-1 Building A Knowledge Base of Severe Adverse Drug Events Based On AERS Reporting Data Using Semantic Web Technologies Guoqian Jiang,
Area 4 SHARP Face-to-Face Conference Phenotyping Team – Centerphase Project Assessing the Value of Phenotyping Algorithms June 30, 2011.
Preceptor: Louise A. Mawn, M.D. May 30, Medical Documentation Medical record serves many functions For health care providers it facilitates: Communication.
Guoqian Jiang, MD, PhD Mayo Clinic
The work proposed in this study is an attempt to use Semantic Web technologies for integrating patient clinical data derived from Electronic Health Records.
Confidential 1 TM 2010 Strategic Health IT Advanced Research Projects SHARP Face-to-Face Meeting Project 3: High-Throughput Phenotyping Introduction to.
Initial Implementation of an Intelligent Patient Data Review Assistant (IPDRA) LTC Trinka Coster, MD Matthew Medlock, MD LTC Joseph Parker, MD Jim Ong,
Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use of EHR Data Project 3: High-Throughput Phenotyping June 30, 2011 Jyoti Pathak,
EleMAP: An Online Tool for Harmonizing Data Elements using Standardized Metadata Registries and Biomedical Vocabularies Jyotishman Pathak, PhD 1 Janey.
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
The Role of Standard Terminologies in Facilitating Integration James J. Cimino, M.D. Departments of Biomedical Informatics and Medicine Columbia University.
Data Heterogeneity Study (Not Data Quality) (OR) “Type 2 Diabetes: A modern day St. Valentine’s Day Massacre” Feb.14, 2011.
LRI Validation Suite Meeting November 1st, Agenda Review of LIS Test Plan Template CLIA Testing EHR testing (Juror Document)—Inspection Testing.
Clinical Quality Language (CQL)
Harmonization of SHARPn Clinical Element Models with CDISC SHARE Clinical Study Data Standards Guoqian Jiang, MD, PhD Mayo Clinic On behalf of CDISC CEMs.
The Final Standards Rule John D. Halamka MD. Categories of Standards Content Vocabulary Privacy/Security.
SHARPn Data Normalization November 18, Data-driven Healthcare Big Data Knowledge Research Practice Analytics Domain Pragmatics Experts.
Terminology in Health Care and Public Health Settings
SHARPn Milestones: Natural Language Processing Guergana Savova, PhD Boston Childrens Hospital and Harvard Medical School.
Initial Prototype for Clinical Data Normalization and High Throughput Phenotyping SHARPn F2F June 30,2011.
Data Quality (a.k.a. “Data Heterogeneity”) Kent Bailey, Susan Rea Welch, Lacey Hart, Kevin Bruce, Susan Fenton.
Silver Medallion CRI Poster Review W. Ed Hammond, Ph.D. Director, Duke Center for Health Informatics And assorted other things.
Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use Dr. Friedman on-site visit, Mayo Clinic 3 September 2010.
Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use of EHR Data Project 3: High-Throughput Phenotyping Jyotishman Pathak, PhD.
© 2015 Mayo Foundation for Medical Education and Research Guoqian Jiang, MD, PhD 1, Richard Kiefer 1, Luke V. Rasmussen 2 ; Huan Mo, MD, MS 3, Jennifer.
Computers in Healthcare Jinbo Bi Department of Computer Science and Engineering Connecticut Institute for Clinical and Translational Research University.
A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,
Query Health Operations Workgroup HQMF & QRDA Query Format - Results Format February 9, :00am – 12:00am ET.
Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use of EHR Data SHARPfest June 2-3, 2010 PI: Christopher G Chute, MD DrPH.
SHARPn High-Throughput Phenotyping (HTP) November 18, 2013.
Open Health Natural Language Processing Consortium (OHNLP)
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
OpenHIE Improving health for the underserved. The Open Health Information Exchange (OpenHIE) Community: A diverse community enabling interoperable health.
CTAKES The clinical Text Analysis and Knowledge Extraction System.
Understanding eMeasures – And Their Impact on the EHR June 3, 2014 Linda Hyde, RHIA.
Component 11/Unit 2a Meaningful Use of the Electronic Health Record (EHR)
Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use if EHR Data Project Initiation Thursday April 29th PI: Christopher G Chute,
Clinical Data Normalization Dr. Chute Aims: Build generalizable data normalization pipeline Semantic normalization annotators involving LexEVS Establish.
Query Health Concept-to-Codes (C2C) SWG Meeting #11 February 28,
A Semantic-Web Representation of Clinical Element Models
Part I: Introduction to SHARPn Normalization Hongfang Liu, PhD, Mayo Clinic Tom Oniki, PhD, Intermountain Healthcare.
Primary Goal: To support case detection and investigation for the reportable infectious diseases (conditions) using electronic information exchanges between.
©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD Rick Kiefer SemTIG November 4, 2011.
SAGE Nick Beard Vice President, IDX Systems Corp..
Linking Electronic Health Records Across Institutions to Understand Why Women Seek Care at Multiple Sites for Breast Cancer Caroline A. Thompson, PhD,
Software Engineering Algorithms, Compilers, & Lifecycle.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Pharmacogenomics Data Standardization using Clinical Element Models
Trifolia: Tooling to Support Standards Development
EDC at Dana-Farber/Harvard Cancer Center: Implementing InForm to Support a Bone Marrow Transplant Program Douglas J. Buell Project Specialist Dana-Farber.
Achieving Semantic Interoperability of Cancer Registries
SHARP: Secondary Use Project 1: Data Normalization
Strategic Health IT Advanced Research Projects (SHARP)
Automated Diabetes Case Identification Using Electronic Health Record Data at a Tertiary Care Facility  Sudhi G. Upadhyaya, MS, Dennis H. Murphree, PhD,
Automated Diabetes Case Identification Using Electronic Health Record Data at a Tertiary Care Facility  Sudhi G. Upadhyaya, MS, Dennis H. Murphree, PhD,
2018, Fall Pusan National University Ki-Joune Li
Clinical Element Models
Presentation transcript:

Strategic Health IT Advanced Research Projects (SHARP) Area 4: Secondary Use of EHR Data Project 3: High-Throughput Phenotyping Project Lead: Jyotishman Pathak, PhD PI: Christopher G. Chute, MD, DrPH June 12, 2012

SHARPn High-Throughput Phenotyping Electronic health records (EHRs) driven phenotyping Overarching goal To develop high-throughput automated techniques and algorithms that operate on normalized EHR data to identify cohorts of potentially eligible subjects on the basis of disease, symptoms, or related findings ©2012 MFMER | slide-2

SHARPn High-Throughput Phenotyping Current HTP project themes Standardization of phenotype definitions Library of phenotyping algorithms Phenotyping workbench Machine learning techniques for phenotyping Just-in-time phenotyping ©2012 MFMER | slide-3

SHARPn High-Throughput Phenotyping Data Transform Algorithm Development Process - Modified ©2012 MFMER | slide-4 Phenotype Algorithm Visualization Evaluation NLP, SQL Rules Mappings Semi-Automatic Execution Standardized representation of clinical data Create new and re-use existing clinical element models (CEMs) Standardized representation of clinical data Create new and re-use existing clinical element models (CEMs) Standardized and structured representation of phenotype definition criteria Use the NQF Quality Data Model (QDM) Standardized and structured representation of phenotype definition criteria Use the NQF Quality Data Model (QDM) Conversion of structured phenotype criteria into executable queries Use JBoss® Drools (DRLs) Conversion of structured phenotype criteria into executable queries Use JBoss® Drools (DRLs) [Welch et al. 2012] [Thompson et al., submitted 2012] [Li et al., submitted 2012]

SHARPn High-Throughput Phenotyping NQF Quality Data Model (QDM) Standard of the National Quality Forum (NQF) A structure and grammar to represent quality measures in a standardized format Groups of codes in a code set (ICD-9, etc.) "Diagnosis, Active: steroid induced diabetes" using "steroid induced diabetes Value Set GROUPING ( )” Supports temporality & sequences AND: "Procedure, Performed: eye exam" > 1 year(s) starts before or during "Measurement end date" Implemented as set of XML schemas Links to standardized terminologies (ICD-9, ICD-10, SNOMED-CT, CPT-4, LOINC, RxNorm etc.) ©2012 MFMER | slide-5

SHARPn High-Throughput Phenotyping ©2012 MFMER | slide Meaningful Use Phase I Quality Measures

SHARPn High-Throughput Phenotyping Example: Diabetes & Lipid Mgmt. - I ©2012 MFMER | slide-7 Human readable HTML

SHARPn High-Throughput Phenotyping Example: Diabetes & Lipid Mgmt. - II ©2012 MFMER | slide-8 Computable XML

SHARPn High-Throughput Phenotyping Data Transform Algorithm Development Process - Modified ©2012 MFMER | slide-9 Phenotype Algorithm Visualization Evaluation NLP, SQL Rules Mappings Semi-Automatic Execution Standardized representation of clinical data Create new and re-use existing clinical element models (CEMs) Standardized representation of clinical data Create new and re-use existing clinical element models (CEMs) Standardized and structured representation of phenotype definition criteria Use the NQF Quality Data Model (QDM) Standardized and structured representation of phenotype definition criteria Use the NQF Quality Data Model (QDM) Conversion of structured phenotype criteria into executable queries Use JBoss® Drools (DRLs) Conversion of structured phenotype criteria into executable queries Use JBoss® Drools (DRLs) [Welch et al. 2012] [Thompson et al., submitted 2012] [Li et al., submitted 2012]

SHARPn High-Throughput Phenotyping Drools-based Phenotyping Architecture ©2012 MFMER | slide-10 Business Logic Clinical Element Database List of Diabetic Patients Data Access Layer Transformation Layer Inference Engine (Drools) Service for Creating Output (File, Database, etc) Transform physical representation  Normalized logical representation (Fact Model)

SHARPn High-Throughput Phenotyping Automatic translation from NQF QDM criteria to Drools ©2012 MFMER | slide-11 [Li et al., submitted 2012]

The “executable” Drools flow ©2012 MFMER | slide-12

©2012 MFMER | slide-13 Phenotype library and workbench - I 1.Converts QDM to Drools 2.Rule execution by querying the CEM database 3.Generate summary reports

©2012 MFMER | slide-14 Phenotype library and workbench - II

SHARPn High-Throughput Phenotyping ©2012 MFMER | slide-15 Phenotype library and workbench - III

SHARPn High-Throughput Phenotyping Machine learning and HTP - I Machine learning and association rule mining Manual creation of algorithms take time Let computers do the “hard work” Validate against expert developed ones ©2012 MFMER | slide-16 [Caroll et al. 2011]

SHARPn High-Throughput Phenotyping Machine learning and HTP - II Origins from sales data Items (columns): co-morbid conditions Transactions (rows): patients Itemsets: sets of co-morbid conditions Goal: find all itemsets (sets of conditions) that frequently co-occur in patients. One of those conditions should be DM. Support: # of transactions the itemset I appeared in Support({TB, DLM, ND})=3 Frequent: an itemset I is frequent, if support(I)>minsup Patien t TBDL M ND…IEC 001YYYY 002YYYY 003YY 004Y 005YYY X: infrequent [Simon et al. 2012]

Electronic Health Records and Phenomics Just-in-Time phenotyping - I Transfusion-related Acute Lung Injury (TRALI) Transfusion-associated Circulatory Overload (TACO)

SHARPn High-Throughput Phenotyping Just-in-Time phenotyping - II ©2012 MFMER | slide-19 TRALI/TACO “sniffer”

Electronic Health Records and Phenomics

SHARPn High-Throughput Phenotyping Active Surveillance for TRALI and TACO Of the 88 TRALI cases correctly identified by the CART algorithm, only 11 (12.5%) of these were reported to the blood bank by the clinical service. Of the 45 TACO cases correctly identified by the CART algorithm, only 5 (11.1%) were reported to the blood bank by the clinical service.

SHARPn High-Throughput Phenotyping Publications till date (conservative) ©2012 MFMER | slide-22

SHARPn High-Throughput Phenotyping 2011 Milestones  Standardized definitions for phenotype criteria  Rules-based environment for phenotype algorithm execution  National library for standardized phenotype definitions (collaboration with eMERGE)  Machine learning techniques for algorithm definitions  Online, real-time phenotype execution  Phenotyping algorithm authoring environment ©2012 MFMER | slide-23

SHARPn High-Throughput Phenotyping 2012 Milestones Machine learning techniques for algorithm definitions Online, real-time phenotype execution Collaboration with NQF, Query Health and i2b2 infrastructures Use cases and demonstrations MU quality metrics (w/ NQF, Query Health) Cohort identification (w/ eMERGE, PGRN) Value analysis (w/ Mayo CSHCD, REP) Clinical trial alerting (w/ Mayo Cancer Ctr./CTSA) ©2012 MFMER | slide-24

SHARPn High-Throughput Phenotyping Project 3: Collaborators & Acknowledgments CDISC (Clinical Data Interchange Standards Consortium) Rebecca Kush, Landen Bain Centerphase Solutions Gary Lubin, Jeff Tarlowe Group Health Seattle David Carrell Harvard University/MIT Guergana Savova, Peter Szolovits Intermountain Healthcare/University of Utah Susan Welch, Herman Post, Darin Wilcox, Peter Haug Mayo Clinic Cory Endle, Rick Kiefer, Sahana Murthy, Gopu Shrestha, Dingcheng Li, Gyorgy Simon, Matt Durski, Craig Stancl, Kevin Peterson, Cui Tao, Lacey Hart, Erin Martin, Kent Bailey, Scott Tabor, Chris Chute ©2012 MFMER | slide-25